15
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      SynthEye: Investigating the Impact of Synthetic Data on Artificial Intelligence-assisted Gene Diagnosis of Inherited Retinal Disease

      research-article
      , MSc 1 , 2 , , PhD 1 , 2 , , PhD 3 , , PhD 2 , , PhD, MBBS 1 , 2 , , BMBCh 1 , 2 , , PhD, MD 1 , 2 , , PhD, MD 1 , 2 , , PhD 2 , , MBBChir MD(Res) 2 , , PhD 3 , , FRCOphth 1 , 2 , , PhD, MBBChir 1 , 2 , , MD, MB BCH BAO 1 , 2 , , MD 1 , 2 , , MD 1 , 2 , , PhD 1 , 2 ,
      Ophthalmology Science
      Elsevier
      Synthetic data, Deep Learning, Generative Adversarial Networks, Inherited Retinal Diseases, Class imbalance, Clinical Decision-Support Model, AUROC, area under the receiver operating characteristic curve, BRISQUE, Blind/Referenceless Image Spatial Quality Evaluator, DL, deep learning, FAF, fundas autofluorescence, FRR, Fake Recognition Rate, GAN, generative adversarial network, IRD, inherited retinal disease, MEH, Moorfields Eye Hospital, R, baseline model, RB, rebalanced model, S, synthetic data trained model, TRR, True Recognition Rate, UMAP, Universal Manifold Approximation and Projection

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Purpose

          Rare disease diagnosis is challenging in medical image-based artificial intelligence due to a natural class imbalance in datasets, leading to biased prediction models. Inherited retinal diseases (IRDs) are a research domain that particularly faces this issue. This study investigates the applicability of synthetic data in improving artificial intelligence-enabled diagnosis of IRDs using generative adversarial networks (GANs).

          Design

          Diagnostic study of gene-labeled fundus autofluorescence (FAF) IRD images using deep learning.

          Participants

          Moorfields Eye Hospital (MEH) dataset of 15 692 FAF images obtained from 1800 patients with confirmed genetic diagnosis of 1 of 36 IRD genes.

          Methods

          A StyleGAN2 model is trained on the IRD dataset to generate 512 × 512 resolution images. Convolutional neural networks are trained for classification using different synthetically augmented datasets, including real IRD images plus 1800 and 3600 synthetic images, and a fully rebalanced dataset. We also perform an experiment with only synthetic data. All models are compared against a baseline convolutional neural network trained only on real data.

          Main Outcome Measures

          We evaluated synthetic data quality using a Visual Turing Test conducted with 4 ophthalmologists from MEH. Synthetic and real images were compared using feature space visualization, similarity analysis to detect memorized images, and Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE) score for no-reference-based quality evaluation. Convolutional neural network diagnostic performance was determined on a held-out test set using the area under the receiver operating characteristic curve (AUROC) and Cohen’s Kappa (κ).

          Results

          An average true recognition rate of 63% and fake recognition rate of 47% was obtained from the Visual Turing Test. Thus, a considerable proportion of the synthetic images were classified as real by clinical experts. Similarity analysis showed that the synthetic images were not copies of the real images, indicating that copied real images, meaning the GAN was able to generalize. However, BRISQUE score analysis indicated that synthetic images were of significantly lower quality overall than real images ( P < 0.05). Comparing the rebalanced model (RB) with the baseline (R), no significant change in the average AUROC and κ was found (R-AUROC = 0.86[0.85-88], RB-AUROC = 0.88[0.86-0.89], R-k = 0.51[0.49-0.53], and RB-k = 0.52[0.50-0.54]). The synthetic data trained model (S) achieved similar performance as the baseline (S-AUROC = 0.86[0.85-87], S-k = 0.48[0.46-0.50]).

          Conclusions

          Synthetic generation of realistic IRD FAF images is feasible. Synthetic data augmentation does not deliver improvements in classification performance. However, synthetic data alone deliver a similar performance as real data, and hence may be useful as a proxy to real data. Financial Disclosure(s): Proprietary or commercial disclosure may be found after the references.

          Related collections

          Most cited references30

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Image-to-Image Translation with Conditional Adversarial Networks

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            No-reference image quality assessment in the spatial domain.

            We propose a natural scene statistic-based distortion-generic blind/no-reference (NR) image quality assessment (IQA) model that operates in the spatial domain. The new model, dubbed blind/referenceless image spatial quality evaluator (BRISQUE) does not compute distortion-specific features, such as ringing, blur, or blocking, but instead uses scene statistics of locally normalized luminance coefficients to quantify possible losses of "naturalness" in the image due to the presence of distortions, thereby leading to a holistic measure of quality. The underlying features used derive from the empirical distribution of locally normalized luminances and products of locally normalized luminances under a spatial natural scene statistic model. No transformation to another coordinate frame (DCT, wavelet, etc.) is required, distinguishing it from prior NR IQA approaches. Despite its simplicity, we are able to show that BRISQUE is statistically better than the full-reference peak signal-to-noise ratio and the structural similarity index, and is highly competitive with respect to all present-day distortion-generic NR IQA algorithms. BRISQUE has very low computational complexity, making it well suited for real time applications. BRISQUE features may be used for distortion-identification as well. To illustrate a new practical application of BRISQUE, we describe how a nonblind image denoising algorithm can be augmented with BRISQUE in order to perform blind image denoising. Results show that BRISQUE augmentation leads to performance improvements over state-of-the-art methods. A software release of BRISQUE is available online: http://live.ece.utexas.edu/research/quality/BRISQUE_release.zip for public use and evaluation.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Survey on deep learning with class imbalance

                Bookmark

                Author and article information

                Journal
                Ophthalmol Sci
                Ophthalmol Sci
                Ophthalmology Science
                Elsevier
                2666-9145
                22 November 2022
                June 2023
                22 November 2022
                : 3
                : 2
                : 100258
                Affiliations
                [1 ]University College London Institute of Ophthalmology, University College London, London, UK
                [2 ]Moorfields Eye Hospital, London, UK
                [3 ]University College London Cancer Institute, University College London, London, UK
                Author notes
                []Correspondence: Nikolas Pontikos, PhD, University College London Institute of Ophthalmology, 11-43 Bath Street, London EC1V 9EL, UK. E-mail: n.pontikos@ucl.ac.uk
                Article
                S2666-9145(22)00147-6 100258
                10.1016/j.xops.2022.100258
                9852957
                36685715
                ed8b8ab5-0721-4a0a-8b9a-c27a941007b7
                © 2022 by the American Academy of Ophthalmology. Published by Elsevier Inc.

                This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

                History
                : 30 June 2022
                : 8 November 2022
                : 9 November 2022
                Categories
                Artificial Intelligence and Big Data

                synthetic data,deep learning,generative adversarial networks,inherited retinal diseases,class imbalance,clinical decision-support model,auroc, area under the receiver operating characteristic curve,brisque, blind/referenceless image spatial quality evaluator,dl, deep learning,faf, fundas autofluorescence,frr, fake recognition rate,gan, generative adversarial network,ird, inherited retinal disease,meh, moorfields eye hospital,r, baseline model,rb, rebalanced model,s, synthetic data trained model,trr, true recognition rate,umap, universal manifold approximation and projection

                Comments

                Comment on this article