Applications of Artificial Intelligence in Ophthalmology: General Overview.
As population aging has become a major demographic trend around the world, patients suffering from eye diseases are expected to increase steeply. Early detection and appropriate treatment of eye diseases are of great significance to prevent vision loss and promote living quality. Conventional diagnose methods are tremendously depend on physicians' professional experience and knowledge, which lead to high misdiagnosis rate and huge waste of medical data. Deep integration of ophthalmology and artificial intelligence (AI) has the potential to revolutionize current disease diagnose pattern and generate a significant clinical impact.
Proposed in 1956 by Dartmouth scholar John McCarthy, AI is a general term that "refers to hardware or software that exhibits behavior which appears intelligent" . Though occurred sixty years ago, it is until recently that the effectiveness of AI has been highlighted because of the development of new algorithms, specialized hardware, cloud-based services, and big data. Machine learning (ML), occurred in 1980s, is a subset of AI, and is defined as a set of methods that automatically detect patterns in data and then incorporate this information to predict future data under uncertain conditions. Deep learning (DL), occurred in 2000s, is a burgeoning technology of ML and has revolutionized the world of AI. These technologies power many aspects of modern society, such as objects' recognition in images, real-time languages' translation, device manipulation via speech (such as Apple's Siri, Amazon Alexa, and Microsoft Cortana), and so on.
The field of healthcare has been the forefront of the AI application in recent years. Multiple studies have shown that DL algorithms performed at a high level when applied to breast histopathology analysis , skin cancer classification , cardiovascular diseases' risk prediction , and lung cancer detection . These impressive research studies inspire numerous studies to apply AI in ophthalmology. Advanced AI algorithms together with multiple accessible data sets, such as EyePACS , Messidor , and Kaggle's data set , can make breakthroughs on various ophthalmological issues.
The rapid rise in AI technology requires physicians and computer scientists to have a good mutual understanding of not only the technology but also the medical practice to enhance medical care in the near future. Miguel Caixinha and Sandrina Nunes introduced conventional machine learning (CML) techniques and reviewed applications of CML for diagnosis and monitoring of multimodal ocular disease, without the mention about DL . Litjens et al.  detailly introduced various DL methods for different tasks and provided an overview of studies per application area, whereas the "retina" section majorly focused on the fundus images only. Lee et al.  introduced the AI development in ophthalmology generally. Rahimy  focused on DL applications in the ophthalmology field, without the mention about CML. Louis J. Catania and Ernst Nicolitz systemically reviewed AI and robotic applications in multiple categories of vision and eye care but mentioned little about AI diagnosis of retinal diseases .
In this review, we systematically reviewed the application of AI (both CML and DL) in diagnosing ocular diseases, including the four leading cause of adult blindness diabetic retinopathy (DR), glaucoma, age-related macular degeneration (AMD), and cataract. We also introduced the existing AI methods, the ophthalmic imaging modalities, detailed steps for building AI models, and evaluation metrics in AI diagnosis. We hope we can provide both ophthalmologists and computer scientists a meaningful and comprehensive summary on AI applications in ophthalmology and facilitate promising AI projects in the ophthalmology field.
2. AI Algorithms
As we mentioned above, ML is one subset of AI and includes DL and CML (Figure 1(a)). The defining characteristic of ML algorithms is the quality of predictions improved with experience . The more data we provide (usually up to a platform), the better the prediction model we can achieve.
Supervised learning and unsupervised learning are two forms of ML. Supervised learning is to train a model from already labeled training data, tunes the weightings of the inputs to improve the accuracy of its predictions until they are optimized, and then map test data sets as corresponding outputs. It may expedite classification process and would be useful for discriminating clinical outcomes. Unsupervised learning is to train a model with unlabeled data (without a human-labeled process), infers a function to describe hidden structures that usually invisible to humans, and could bring new discoveries, such as new encephalic region relevant to Alzheimer's disease  and new impact factors of cardiovascular diseases beyond human's recognition . So far, methods adopted in most research studies are in supervised form because the accuracy and efficacy are better under supervised condition .
CML can get satisfactory outcome with small data sets, but a cumbersome step to select specific visual features manually prior to classification is indispensable . This selection can result in a set of suboptimal features and overfitting (the trained model is not generalized to other data except for the training set), which limits CML algorithms' application.
Existing CML algorithms used in AI diagnosis include decision trees , random forests (RF) , support vector machines (SVM) , Bayesian classifiers , k-nearest neighbors , k-means , linear discriminant analysis , and neural networks (NN)  (Table 1). Among them, RF and SVM are the most commonly used CML technologies in the ophthalmology field  (Figures 1(b) and 1(c)).
DL, a burgeoning technology of ML, has the ability to discover intricate structures in data sets without the need to specify rules explicitly. A DL network is an NN with multiple layers between the input and output layers (Figure 1(d)). It has dramatically improved the state-of-the-art in image recognition . When applied to image classification, a key difference between DL and CML algorithms is how they select and process image features. Features of input data are automatically learned in an unsupervised way by DL algorithms, avoiding manual segmenting and depicting lesions' areas [15, 26]. However, large data set is needed to train a DL algorithm. Transfer learning is to retrain an algorithm, which has already been pretrained on millions of general images before, on a specific data set. This method allows the training of a highly accurate model with a relatively small training data set .
DL algorithms are known as "black boxes." The networks generate comprehensive and discriminative features that are much too high dimensional to be accessible for human interpretation. Little is known about how they analyze pattern and make a decision at the image level . Heatmaps can show which pixels play a role in the image-level predictions. In the medical field, the visualization highlighted highly possible abnormal regions in the input image for future review and analysis, potentially aiding real-time clinical validation of automated diagnoses at the point of care. Existing methods of DL include long-term and short-term memory , deep Boltzmann machines , deep kernel machines , deep recurrent neural networks , and convolutional neural networks (CNN) . Among them, the most used DL method in the medical image recognition field is CNN. The CNN consists of multiple convolutional layers that extract features and transform input images into hierarchical feature maps: from simple features, such as edges and lines, to complicated features, such as shapes and colors. It also includes layers that can merge semantically similar features into one to reduce the dimensionality of the extracted features, and layers that can combine these features and output a final probability value for the class. Existing CNN architectures used in the medical image recognition field include AlexNet , VGG , ResNet , and GoogleNet [34-37](Table 2).
3. Building AI Models
Various imaging modalities have been used in AI diagnosis, such as radiology images (X-ray, CT, and MRI) , electrophysiological signal records (electrocardiograph  and electroencephalogram ), visible wavelength images (dermoscopy images and biopsy images ), ultrasound images , angiography images , and so on. We introduce the ophthalmic imaging modalities in AI diagnosis in Table 3.
The steps for building an AI model include preprocessing image data, train, validate and test the model, and evaluate the trained model's performance.
3.1. Data Preprocessing. In order to increase AI prediction efficiency, raw data need to be preprocessed. The preprocessed work includes the following [8, 43]: (1) noise reduction: noise reduction needs to be performed in almost all relevant research. Denoising can promote the quality of data set and optimize learning process. (2) Data integration and normalization: data collected from different sources should be integrated and adjusted to a common scale. (3) Feature selection and extraction: the most relevant features are usually selected and extracted to improve the learning process performance.
3.2. Training, Validation, and Test. To achieve a good performance, the data set is randomly partitioned into two independent subsets, one is for modeling and the other is for testing. The data in the former sets will be partitioned again into training set and validation set in most cases. The training set is used to fit the parameters of a model. The validation set is used to estimate how well the model had been trained and tune the parameters or to compare the performances of the prediction algorithms achieved based on the training set. The test set is used to evaluate the final performance of the trained model (Figure 2(a)).
Cross-validation methods have been widely used to estimate and optimize algorithms . The most adopted cross-validation is "K-fold cross-validation." It is an effective method to avoid overfitting and underfitting. All data are equally divided into K subsets, 1 for validation and K - 1 for training. This process will repeat K times, and average metrics are used to evaluate the trained model (Figure 2(b)). Fivefold cross-validation and 10-fold cross-validation are most commonly used .
3.3. Evaluation. Receiver operating characteristic curve (ROC) is a useful tool to depict algorithms' performance. It is created by plotting the detection probability for each algorithm across a continuum of threshold. For each threshold, the sensitivity and the false positive rate (1 - specificity) are plotted against each other. The area under receiver operating characteristic curves (AUC) is the most used evaluation metrics for quantitative assessment of a model in AI diagnosis. The AUCs of effective models range from 0.5 to 1; the higher the value of AUC, the better the performance of the model . Table 4 provides introduction of other metrics to evaluate the performance of a model.
4. AI Application in Ophthalmology
Two hundred forty-three articles of AI application in diagnosing ophthalmological diseases have been published (search by PubMed, Sep 20, 2018). Among them, the most intensively studied are DR, glaucoma, AMD, and cataract (Figure 3(a)). Figure 3(b) shows the breakdown of the papers of these four diseases in year of publication.
4.1. Diabetic Retinopathy. Diabetes affects more than 415 million people worldwide, meaning 1 in every 11 adults is affected . DR, a chronic diabetic complication, is a vasculopathy that affects one-third of diabetic patients and can lead to irreversible blindness . Automated techniques for DR diagnosis have been explored to improve the management of patients with DR and alleviate social burden. AI was used to predict DR risk and DR progression among diabetic patients to combat with this worldwide disease [48, 49].
The specific abnormalities such as macular edema [50-53], exudates , cotton-wool , microaneurysms [55, 56], and neovascularization on optic disk  can be detected by CML. Based on these hallmarks, the early diagnose of DR in an automated fashion has been explored . Additionally, a system focused on timely and effectively proliferative DR (PDR) detection has been developed to ensure immediate attention and intervention [59, 60].
Gulshan et al. were the first to report the application of DL for DR identification . They used large fundus image data sets to train a deep CNN (DCNN) in a supervised manner. They showed that the method based on DL techniques had very high sensitivity and specificity, and the AUC came up to 0.99 for detecting referable DR . In the past two years, a number of DL models with impressive performance have been developed for the automated detection of DR [46, 62, 63]. Additionally, some studies applied DL to automatically stage DR through fundus images [62-65], making up the deficiency of Gulshan's study that they only detected referable DR but did not provide comparable data on sight-threatening DR or other DR stages.
The majority of aforementioned studies focused mainly on the analysis of fundus photographs. There were some other imaging modalities used to build models for DR. ElTanboly et al. developed a DL-based computer-aided system to detect DR through 52 optical coherence tomography (OCT) images, achieving an AUC of 0.98 . Despite the good outcomes in the cross-validation process, the system needs to be further validated in larger patient cohorts. A computer-aided diagnostic (CAD) system based on CML algorithms using optical coherence tomography angiography (OCTA) images to automatically diagnose nonproliferative DR (NPDR) also achieved high accuracy and AUC .
The visualization of which pixels play an important role in the image-level predictions has been applied into DR diagnostic models [7, 46]. It represents intuitively the learning procedure of the DL network and highlights important abnormal regions, assisting physicians' better understanding of the DR predictions. The visualization method can enhance the applicability of intelligent diagnostic models in real clinical practice.
4.2. Glaucoma. Glaucoma is the third largest sight-threatening eye disease around the world and has critical impact on global blindness . Glaucoma patients suffered from high intraocular pressure, damage of the optic nerve head (ONH), retina nerve fiber layer (RNFL) defect, and gradual vision loss. Automatically detecting features related to glaucoma has great significance on its timely diagnosis.
The optic cup-to-disc ratio (CDR) can be used to detect glaucoma patients . Based on automatically localization of ONH and extraction of optic disc and optic cup from fundus images , CDR can be calculated to assist glaucoma diagnose at an early stage by AI models [71-74]. Spectrum domain OCT (SD-OCT) is another imaging modality to evaluate CDR. After approximately locating the coarse disc margin by a spatial correlation smoothness constraint, a SVM model is trained to find the most likely patch on OCT images to determine a reference plane that can calculate the CDR. The proposed algorithm can achieve high segmentation accuracy and a low CDR evaluation error .
RNFL defect can serve as the earliest sign of glaucoma . Several researchers have explored diagnostic accuracy of different methods using RNFL thickness parameters to diagnose glaucoma [77-79]. However, high myopia patients can also suffer from RNFL thickness reduction [80-83]. Recently, reports on how to distinguish the normal retina from glaucoma in high myopia via OCT parameters and optic disc morphology have been published. This indicates us to take account into the existence of other eye diseases in future's research about glaucoma's intelligent diagnosis to improve the accuracy of algorithms.
Visual field (VF) defect is a main alteration of visual function during glaucoma progress. Recent studies showed that changes in the central visual field may already occurred in the early stage of the disease, which is consistent with the results of imaging studies . Thus, the early detection of glaucomatous VF changes is significant to glaucoma's successful detection and management . Applying ML methods can improve the detection of preperimetric glaucoma VFs from healthy VFs significantly . Although a standard automated VF test plays a key role in diagnosing glaucoma, it consumes too much time and resources. What is more, such a manual process performed by patients is subjective and has shown strong variability in epidemiologic studies . The combination of all features mentioned above is required for the accurate intelligent diagnosis, for any of the single symptom is not the guarantee sign of glaucoma [83,88]. This kind of research shows great performance in classifying glaucoma and healthy eyes. Clinicians may reference these prediction results and make better decisions.
Studies using DL methods to diagnose glaucoma are few. So far, fundus images [73, 89, 90], VFs , and wide-field OCT scans  have all been used to construct DL-based glaucomatous diagnostic models. Preperimetric open-angle glaucoma (OAG) eyes can be detected through DL with better performance than those got from CML techniques . Holistic and local features of optic disc on fundus images have been used together to mitigate the influence of misalignment when located optic disc for glaucoma diagnosis . The AUC was 0.8384, which is quite close to the manual detection results. Li et al. demonstrated that DL can be applied to identify referable glaucomatous optic neuropathy with high sensitivity and specificity .
4.3. Age-Related Macular Degeneration. AMD is the leading cause of irreversible blindness among old people in the developed world . The goal of using ML algorithms is to automatically identify AMD-related lesions to improve AMD diagnosis and treatment. Detection of drusen [93, 94], fluid [94, 95], reticular pseudodrusen , and geographic atrophy  from fundus images and SD-OCT using ML  has been studied. The accuracy is usually over 80% [93, 96-98], and the agreement between the models and retina specialists can reach 90%.
Drusen regression, an anatomic endpoint of intermediate AMD and the onset of advanced AMD, can be predicted through the specifically designed, fully automated, ML-based classifier. Bogunovic et al. develop a data-driven interpretable predictive model to predict the progression risk in intermediate AMD . Automated image analysis steps were applied to identify and characterize individual drusen at baseline, and their development was monitored at every follow-up visit. Using such characterization and analysis, they developed an ML method based on survival analysis to estimate a risk score and predict the incoming regression of individual drusen. Above all, these automated detections of the retinal lesions combined with interpretation of disease activity are feasible and have the potential to become a powerful tool in clinical practice .
Using ML to predict anti-vascular endothelial growth factor (anti-VEGF) injection requirements in eye diseases such as neovascular AMD and PDR can alleviate patients' economic burden and facilitate resource management. Bogunovic et al. fed corresponding OCT images of patients with low or high anti-VEGF injection requirements into RF to obtain a predictive model. A solid AUC of 70% to 80% was achieved for treatment requirement prediction . Prahs et al. trained a DCNN neural network by OCT images to facilitate decision-making regarding anti-VEGF injection , and the outcomes were better than that of using CML . These studies are an important step toward image-guided prediction of treatment intervals in the management of neovascular AMD or PDR.
Multiple CML techniques have been applied for automated diagnosis and grading of AMD [101, 102]. But the most impressive work was based on DL techniques over the past 2 years [103-105]. Treder et al. establish a model to automatically detect exudative AMD from SD-OCT . In research studies based on fundus images, images with AMD were assigned into 4 classes of classification (no evidence of AMD, early-stage AMD, intermediate-stage AMD, and advanced AMD) , or 2-class classification (no or early-stage AMD and intermediate or advanced stage AMD) . The diagnostic accuracy is better in the 2-class classification in current studies. The DCNN appears to perform a screening function in these experiments, and the performance is comparable with physicians. DL algorithms have also been used to automatically detect abnormalities such as exudates , macular edema [51, 52], drusen, and choroidal neovascularization .
4.4. Cataract. Cataract is a disease with cloudy lens and has bothered millions of old people. Early detection and treatment can bring the light to cataract patients and improve their living quality. ML algorithms such as RF and SVM have been applied to diagnose and grading cataract from fundus images, ultrasounds images, and visible wavelength eye images [107-109]. The risk prediction model for posterior capsule opacification after phacoemulsification has also been built .
Researchers can now use DL models to diagnose senile cataract , but a more impressive work is about the pediatric cataract. It is one of the primary causes of childhood blindness . Long et al. constructed a CNN-based computer-aided diagnosis (CAD) framework to classify and grade pediatric cataract. What is more, a cloud-based platform integrated the AI agent for multihospital collaboration has been established. They even developed a software to realize clinical application for ophthalmologists and patients and have applied it in Zhong Shan Ophthalmic Center [113, 114]. These proposed methods are serviceable for improving clinical workflow of cataract's diagnosis in the background of large-population screening and mainly shed a light on other ocular images.
In addition to DR, glaucoma, AMD, and cataract, AI has also been used to diagnose other eye diseases. AI algorithms can be used to detect keratoconus or identify eyes with preclinical signs of keratoconus using data from a Scheimpflug camera [115,116], to evaluate corneal power after myopic corneal refractive surgery , to make surgical plans for horizontal strabismus , and to detect pigment epithelial detachment in polypoidal choroidal vasculopathy .
Previous studies have summarized articles about the application of CML techniques in eye diseases . In this review, we summarized studies on glaucoma, DR, AMD, and cataract using DL techniques in Table 5.
5. Future of AI Application in Clinic
In recent years, AI techniques have shown to be an effective diagnostic tool to identify various diseases in healthcare. Applications of AI can make great contributions to provide support to patients in remote areas by sharing expert knowledge and limited resources. While the accuracy of the models is incredible promising, we need to remain prudent and sober when considering how to deploy these models to the real world.
Most studies regarding intelligent diagnosis of eye diseases focused on binary classification problems, whereas in clinical setting, visiting patients suffer from multicategorical retinal diseases. For instance, a model trained to detect AMD will fail to consider a patient with glaucoma as diseased because the model only has the ability to discriminate AMD from non-AMD. Choi and his colleagues carried out a work applying DL to automatically detect multiple different retinal diseases with fundus photographs. When only normal and DR fundus images were involved in the proposed DL model, the classification accuracy was 87.4%. However, the accuracy fell to 30.5% when all 10 categories were included . It indicated that the model's accuracy declined while the number of diseases increased. To further enhance the applicability of AI in clinic practice, we should make more efforts to build intelligent systems that can detect different retinal diseases with high accuracy.
Additionally, a single abnormality detected from one imaging technique cannot always guarantee the correct diagnosis of a specific retinal disease (e.g., DR or glaucoma) in clinical practice. Multimodal clinical images, such as optical coherence tomography angiography, visual field, and fundus images, should be integrated together to build a generalized AI system for more reliable AI diagnosis.
However, the need of huge amount of data remains the most fundamental problem. Although various data sets have been available, they only incorporate a small part of diseases human suffered from. Images with severe diseases or rare diseases are particularly insufficient. The population characteristics, the existence of various systematic diseases, and the diverse disease' phenotypes should be considered when select input data. Larger data sets from larger patient cohorts under different settings and conditions, such as diverse ethnics and environments, are also needed in some automated diagnosis systems with impressive outcomes for further validation.
The high dependency on the data quality should be considered. Different imaging devices, various imaging protocols, and intrinsic noise of data can affect the data's quality, which may have huge influences on models' performance . In addition to data preprocessing, universal useful methods to analyze images with different qualities need to be developed urgently.
Although the DL-based methods show excellent results most of the time, their "black box" nature makes it difficult to interpret how algorithms make decisions. In this era of "evidence-based medicine," it is difficult for clinicians and patients to trust a mysterious machine that cannot provide explanations of why the patient is diagnosed with a certain disease. What is more, the techniques that make the AI models more transparent can also detect potential bias in the training data and ensure that the algorithms perform well . Heatmaps and the occlusion test are two of these kinds of techniques that can highlight highly possible abnormal regions for predictions and make models interpretable to some extent [7, 27]. More methods to interpret AI models should be developed and applied in AI diagnosis. Moreover, the standards to systematically assess these methods should also be considered and developed.
Above all, by building interpretable systematic AI platforms using sufficient high-quality and multimodal data and advanced techniques, we can enhance the applicability of AI in clinical circumstances. In some day, we might make it possible to adopt intelligent systems in certain process of clinical work. Though ethical, regulatory, and legal issues arise, AI will contribute remarkably to revolutionize current disease diagnostic pattern and generate a significant clinical impact in the near future.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
This work was supported by the National Natural Science Foundation of China (81470628) and International Science & Technology Cooperation Program of China (2017YFE0103400).
 N. Graham, Artificial Intelligence, Vol. 1076, Blue Ridge Summit: Tab Books, Philadelphia, PA, USA, 1979.
 B. E. Bejnordi, G. Zuidhof, M. Balkenhol et al., "Context-aware stacked convolutional neural networks for classification of breast carcinomas in whole-slide histopathology images," Journal of Medical Imaging, vol. 4, no. 4, article 44504, 2017.
 A. Esteva, B. Kuprel, R. A. Novoa et al., "Dermatologist-level classification of skin cancer with deep neural networks," Nature, vol. 542, no. 7639, pp. 115-118, 2017.
 S. F. Weng, J. Reps, J. Kai, J. M. Garibaldi, and N. Qureshi, "Can machine-learning improve cardiovascular risk prediction using routine clinical data?," PLoS One, vol. 12, no. 4, Article ID e174944, 2017.
 B. van Ginneken, "Fifty years of computer analysis in chest imaging: rule-based, machine learning, deep learning," Radiological Physics and Technology, vol. 10, no. 1, pp. 23-32, 2017.
 V. Gulshan, L. Peng, M. Coram et al., "Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs," JAMA, vol. 316, no. 22, p. 2402, 2016.
 G. Quellec, K. Charriere, Y. Boudi, B. Cochener, and M. Lamard, "Deep image mining for diabetic retinopathy screening," Medical Image Analysis, vol. 39, pp. 178-193,2017.
 M. Caixinha and S. Nunes, "Machine learning techniques in clinical vision sciences," Current Eye Research, vol. 42, no. 1, pp. 1-15, 2017.
 G. Litjens, T. Kooi, B. E. Bejnordi et al., "A survey on deep learning in medical image analysis," Medical Image Analysis, vol. 42, pp. 60-88, 2017.
 A. Lee, P. Taylor, J. Kalpathy-Cramer, and A. Tufail, "Machine learning has arrived!," Ophthalmology, vol. 124, no. 12, pp. 1726-1728, 2017.
 E. Rahimy, "Deep learning applications in ophthalmology," Current Opinion in Ophthalmology, vol. 29, no. 3, pp. 254-260, 2018.
 L. J. Catania and E. Nicolitz, "Artificial intelligence and its applications in vision and eye care," Advances in Ophthalmology and Optometry, vol. 3, no. 1, pp. 21-38, 2018.
 Z. Ghahramani, "Probabilistic machine learning and artificial intelligence," Nature, vol. 521, no. 7553, pp. 452-459, 2015.
 A. K. Ambastha and T. Y. Leong, "A deep learning approach to neuroanatomical characterisation of Alzheimer's disease," Studies in Health Technology and Informatics, vol. 245, p. 1249, 2017.
 Y. LeCun, Y. Bengio, and G. Hinton, "Deep learning," Nature, vol. 521, no. 7553, pp. 436-444, 2015.
 D. E. Freund, N. Bressler, and P. Burlina, "Automated detection of drusen in the macula," in Proceedings of the IEEE International Symposium on Biomedical Imaging: From Nano to Macro, pp. 61-64, Boston, MA, USA, June 2009.
 L. Rokach and O. Maimon, "Top-down induction of decision trees classifiers - a survey," IEEE Transactions on Systems Man and Cybernetics Part C, vol. 35, no. 4, pp. 476-487, 2005.
 L. Breiman, "Random forests," Machine Learning, vol. 45, no. 1, pp. 5-32, 2001.
 C. C. Chang and C. J. Lin, "LIBSVM: a library for support vector machines," ACM Transactions on Intelligent Systems and Technology, vol. 2, no. 3, pp. 1-27, 2011.
 P. Langley and K. Thompson, "An analysis of Bayesian classifiers," in Proceedings of AAAI, pp. 223-228, San Jose, CA, USA, March 1992.
 J. M. Keller, M. R. Gray, and J. A. Givens, "A fuzzy K-nearest neighbor algorithm," IEEE Transactions on Systems Man and Cybernetics, vol. 15, no. 4, pp. 580-585, 2012.
 T. Kanungo, D. M. Mount, N. S. Netanyahu, C. D. Piatko, R. Silverman, and A. Y. Wu, "An efficient k-means clustering algorithm: analysis and implementation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 881-892, 2002.
 J. Ye, "Two-dimensional linear discriminant analysis," Advances in Neural Information Processing Systems, vol. 17, no. 6, pp. 1431-1441, 2005.
 M. T. Hagan, H. B. Demuth, and M. Beale, Neural Network Design, PWS Publishing Co., Boston, MA, USA, 1997.
 A. Statnikov, L. Wang, and C. F. Aliferis, "A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification," BMC BIOINFORMATICS, vol. 9, no. 1, pp. 1-10, 2008.
 P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. Lecun, "OverFeat: integrated recognition, localization and detection using convolutional networks," Arxiv, 2013.
 D. S. Kermany, M. Goldbaum, W. Cai et al., "Identifying medical diagnoses and treatable diseases by image-based deep learning," Cell, vol. 172, no. 5, pp. 1122-1131, 2018.
 R. Salakhutdinov and G. Hinton, "An efficient learning procedure for deep Boltzmann machines," Neural Computation, vol. 24, no. 8, pp. 1967-2006, 2012.
 Y. Cho and L. K. Saul, "Kernel methods for deep learning," in Advances in Neural Information Processing Systems 22: Conference on Neural Information Processing Systems, pp. 342-350, Vancouver, British Columbia, Canada, 2009.
 N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, "SMOTE: synthetic minority oversampling technique," Journal of Artificial Intelligence Research, vol. 16, no. 1, pp. 321-357, 2002.
 A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in
Proceedings of International Conference on Neural Information Processing Systems, pp. 1097-1105, Lake Tahoe, Nevada, December 2012.
 K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," Computer Science, http://arxiv.org/abs/arXiv:1409.1556, 2014.
 K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770-778, Las Vegas, NV, June 2016.
 C. Szegedy, W. Liu, Y. Jia et al., "Going deeper with convolutions," http://arxiv.org/abs/arXiv:1409.4842, 2014.
 S. Ioffe and C. Szegedy, "Batch normalization: accelerating deep network training by reducing internal covariate shift," http://arxiv.org/abs/arXiv:1502.03167, 2015.
 C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, "Rethinking the inception architecture for computer vision," in Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2818-2826, Las Vegas, NV, June 2016.
 C. Szegedy, S. Ioffe, V. Vanhoucke, and A. Alemi, "Inceptionv4, inception-ResNet and the impact of residual connections on learning," http://arxiv.org/abs/arXiv:1602.07261, 2016.
 J. Lee, S. Jun, Y. Cho et al., "Deep learning in medical imaging: general overview," Korean Journal of Radiology, vol. 18, no. 4, p. 570, 2017.
 S. Savalia and V. Emamian, "Cardiac arrhythmia classification by multi-layer perceptron and convolution neural networks," Bioengineering, vol. 5, no. 2, p. 35, 2018.
 P. Fergus, D. Hignett, A. Hussain, D. Al-Jumeily, and K. Abdel-Aziz, "Automatic epileptic seizure detection using scalp EEG and advanced artificial intelligence techniques," BioMed Research International, vol. 2015, Article ID 986736, 17 pages, 2015.
 S. Yu, K. K. Tan, L. S. Ban, S. Li, and A. T. H. Sia, "Feature extraction and classification for ultrasound images of lumbar spine with support vector machine," in Proceedings of 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 4659-4662, Chicago, IL, USA, August 2014.
 G. Xiong, D. Kola, R. Heo, K. Elmore, I. Cho, and J. K. Min, "Myocardial perfusion analysis in cardiac computed tomography angiographic images at rest," Medical Image Analysis, vol. 24, no. 1, pp. 77-89, 2015.
 X. Jiang and Y. Huang, "Research on data pre-process and feature extraction based on wavelet packet analysis," in Proceedings of 6th World Congress on Intelligent Control and Automation, pp. 5850-5853, Dalian, China, June 2006.
 V. Cherkassky and F. Mulier, "Statistical learning theory," Encyclopedia of the Sciences of Learning, vol. 41, no. 4, p. 3185, 1998.
 K. Hajian-Tilaki, "Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation," Caspian Journal of Internal Medicine, vol. 4, no. 2, pp. 627-635, 2013.
 R. Gargeya and T. Leng, "Automated identification of diabetic retinopathy using deep learning," Ophthalmology, vol. 124, no. 7, pp. 962-969, 2017.
 N. G. Congdon, D. S. Friedman, and T. Lietman, "Important causes of visual impairment in the world today," JAMA, vol. 290, no. 15, pp. 2057-2060, 2003.
 E. Oh, T. K. Yoo, and E. C. Park, "Diabetic retinopathy risk prediction for fundus examination using sparse learning: a cross-sectional study," BMC Medical Informatics and Decision Making, vol. 13, no. 1, p. 106, 2013.
 M. L. Ribeiro, S. G. Nunes, and J. G. Cunha-Vaz, "Microaneurysm turnover at the macula predicts risk of development of clinically significant macular edema in persons with mild nonproliferative diabetic retinopathy," Investigative Ophthalmology and Visual Science, vol. 54, no. 7, pp. 4595-4604, 2013.
 T. Hassan, M. U. Akram, B. Hassan, A. M. Syed, and S. A. Bazaz, "Automated segmentation of subretinal layers for the detection of macular edema," Applied Optics, vol. 55, no. 3, pp. 454-461, 2016.
 C. S. Lee, A. J. Tyring, N. P. Deruyter, Y. Wu, A. Rokem, and A. Y. Lee, "Deep-learning based, automated segmentation of macular edema in optical coherence tomography," Biomedical Optics Express, vol. 8, no. 7, p. 3440, 2017.
 A. G. Roy, S. Conjeti, S. P. K. Karri et al., "ReLayNet: retinal layer and fluid segmentation of macular optical coherence tomography using fully convolutional networks," Biomedical Optics Express, vol. 8, no. 8, p. 3627, 2017.
 M. U. Akram, A. Tariq, S. A. Khan, and M. Y. Javed, "Automated detection of exudates and macula for grading of diabetic macular edema," Computer Methods and Programs in Biomedicine, vol. 114, no. 2, pp. 141-152, 2014.
 M. Niemeijer, B. van Ginneken, S. R. Russell, M. S. A. Suttorp-Schulten, and M. D. Abra Moff, "Automated detection and differentiation of drusen, exudates, and cotton-wool spots in digital color fundus photographs for diabetic retinopathy diagnosis," Investigative Opthalmology and Visual Science, vol. 48, no. 5, p. 2260, 2007.
 S. Wang, H. L. Tang, L. I. Al Turk et al., "Localizing microaneurysms in fundus images through singular spectrum analysis," IEEE Transactions on Biomedical Engineering, vol. 64, no. 5, pp. 990-1002, 2017.
 J. Wu, J. Xin, L. Hong, and J. You, "New hierarchical approach for microaneurysms detection with matched filter and machine learning," in Proceedings of 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), p. 4322, Milano, Italy, August 2015.
 S. Yu, D. Xiao, and Y. Kanagasingam, "Automatic detection of neovascularization on optic disk region with feature extraction and support vector machine," in Proceedings of 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), p. 1324, Orlando, FL, USA, August 2016.
 S. Roychowdhury, D. D. Koozekanani, and K. K. Parhi, "DREAM: diabetic retinopathy analysis using machine learning," IEEE Journal of Biomedical and Health Informatics, vol. 18, no. 5, pp. 1717-1728, 2014.
 R. A. Welikala, J. Dehmeshki, A. Hoppe et al., "Automated detection of proliferative diabetic retinopathy using a modified line operator and dual classification," Computer Methods and Programs in Biomedicine, vol. 114, no. 3, pp. 247-261, 2014.
 J. I. Orlando, K. K. Van, J. B. Barbosa, H. L. Manterola, M. B. Blaschko, and A. Clausse, "Proliferative diabetic retinopathy characterization based on fractal features: evaluation on a publicly available data set," Medical Physics, vol. 44, no. 12, pp. 6425-6434, 2017.
 T. Y. Wong and N. M. Bressler, "Artificial intelligence with deep learning technology looks into diabetic retinopathy screening," JAMA, vol. 316, no. 22, pp. 2366-2367, 2016.
 M. D. Abremoff, Y. Lou, A. Erginay et al., "Improved automated detection of diabetic retinopathy on a publicly available dataset through integration of deep learning," Investigative Opthalmology & Visual Science, vol. 57, no. 13, p. 5200, 2016.
 I. Ardiyanto, H. A. Nugroho, and R. Buana, "Deep learning-based diabetic retinopathy assessment on embedded system," in Proceedings of International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 1760-1763, Jeju Island, Republic of Korea, July 2017.
 Q. Abbas, I. Fondon, A. Sarmiento, S. Jimenez, and P. Alemany, "Automatic recognition of severity level for diagnosis of diabetic retinopathy using deep visual features," Medical & Biological Engineering & Computing, vol. 55, no. 11, pp. 1959-1974, 2017.
 H. Takahashi, H. Tampo, Y. Arai, Y. Inoue, and H. Kawashima, "Applying artificial intelligence to disease staging: deep learning for improved staging of diabetic retinopathy," PLoS One, vol. 12, no. 6, Article ID e179790, 2017.
 A. ElTanboly, M. Ismail, A. Shalaby et al., "A computer-aided diagnostic system for detecting diabetic retinopathy in optical coherence tomography images," Medical Physics, vol. 44, no. 3, pp. 914-923, 2017.
 H. S. Sandhu, N. Eladawi, M. Elmogy et al., "Automated diabetic retinopathy detection using optical coherence tomography angiography: a pilot study," British Journal of Ophthalmology, vol. 102, no. 11, pp. 1564-1569, 2018.
 J. M. Roodhooft, "Leading causes of blindness worldwide," Bulletin De La Societe Belge Dophtalmologie, vol. 283, no. 283, p. 19, 2002.
 G. J. Tangelder, N. J. Reus, and H. G. Lemij, "Estimating the clinical usefulness of optic disc biometry for detecting glaucomatous change over time," Eye, vol. 20, no. 7, pp. 755-763, 2006.
 H. Muhammad Salman, H. Liangxiu, V. H. Jano, and L. Baihua, "Automatic extraction of retinal features from colour retinal images for glaucoma diagnosis: a review," Computerized Medical Imaging and Graphics, vol. 37, no. 7-8, pp. 581-596, 2013.
 C. Raja and N. Gangatharan, "A hybrid swarm algorithm for optimizing glaucoma diagnosis," Computers in Biology and Medicine, vol. 63, pp. 196-207, 2015.
 A. Singh, M. K. Dutta, M. ParthaSarathi, V. Uher, and R. Burget, "Image processing based automatic diagnosis of glaucoma using wavelet features of segmented optic disc from fundus image," Computer Methods and Programs in Biomedicine, vol. 124, pp. 108-120, 2016.
 X. Chen, Y. Xu, D. W. K. Wong, T. Y. Wong, and J. Liu, "Glaucoma detection based on deep convolutional neural network," in Proceedings of 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC),, Milano, Italy, August 2015.
 M. S. Haleem, L. Han, J. V. Hemert et al., "Regional image features model for automatic classification between normal and glaucoma in fundus and scanning laser ophthalmoscopy (SLO) images," Journal of Medical Systems, vol. 40, no. 6, pp. 1-19, 2016.
 M. Wu, T. Leng, S. L. De, D. L. Rubin, and Q. Chen, "Automated segmentation of optic disc in SD-OCT images and cup-to-disc ratios quantification by patch searching-based neural canal opening detection," Optics Express, vol. 23, no. 24, article 31216, 2015.
 J. B. Jonas, W. M. Budde, and S. Pandajonas, "Ophthalmoscopic evaluation of the optic nerve head," Survey of Ophthalmology, vol. 43, no. 4, p. 293, 1999.
 D. Bizios, A. Heijl, J. L. Hougaard, and B. Bengtsson, "Machine learning classifiers for glaucoma diagnosis based on classification of retinal nerve fibre layer thickness parameters measured by Stratus OCT," Acta Ophthalmologica, vol. 88, no. 1, pp. 44-52, 2010.
 K. A. Barella, V. P. Costa, V. V. Goncalves, F. R. Silva, M. Dias, and E. S. Gomi, "Glaucoma diagnostic accuracy of machine learning classifiers using retinal nerve fiber layer and optic nerve data from SD-OCT," Journal of Ophthalmology, vol. 2013, no. 10, Article ID 789129, 2013.
 S. Yousefi, M. H. Goldbaum, M. Balasubramanian et al., "Glaucoma progression detection using structural retinal nerve fiber layer measurements and functional visual field points," IEEE Transactions on Biomedical Engineering, vol. 61, no. 4, pp. 1143-1154, 2014.
 F. M. Rauscher, N. Sekhon, W. J. Feuer, and D. L. Budenz, "Myopia affects retinal nerve fiber layer measurements as determined by optical coherence tomography," Journal of Glaucoma, vol. 18, no. 7, pp. 501-505, 2009.
 S. H. Kang, S. W. Hong, S. K. Im, S. H. Lee, and M. D. Ahn, "Effect of myopia on the thickness of the retinal nerve fiber layer measured by Cirrus HD optical coherence tomography," Investigative Opthalmology & Visual Science, vol. 51, no. 8, p. 4075, 2010.
 C. K. Leung, S. Mohamed, K. S. Leung et al., "Retinal nerve fiber layer measurements in myopia: an optical coherence tomography study," Investigative Opthalmology & Visual Science, vol. 47, no. 12, pp. 5171-5176, 2006.
 S. J. Kim, K. J. Cho, and S. Oh, "Development of machine learning models for diagnosis of glaucoma," PLoS One, vol. 12, no. 5, Article ID e177726, 2017.
 D. C. Hood, A. S. Raza, C. G. de Moraes, C. A. Johnson, J. M. Liebmann, and R. Ritch, "The nature of macular damage in glaucoma as revealed by averaging optical coherence tomography data," Translational Vision Science & Technology, vol. 1, no. 1, p. 3, 2012.
 G. Roberti, G. Manni, I. Riva et al., "Detection of central visual field defects in early glaucomatous eyes: comparison of Humphrey and Octopus perimetry," PLoS One, vol. 12, no. 10, Article ID e186793, 2017.
 R. Asaoka, A. Iwase, K. Hirasawa, H. Murata, and M. Araie, "Identifying "preperimetric" glaucoma in standard automated perimetry visual fields," Investigative Ophthalmology & Visual Science, vol. 55, no. 12, pp. 7814-7820, 2014.
 E. Oh, T. K. Yoo, and S. Hong, "Artificial neural network approach for differentiating open-angle glaucoma from glaucoma suspect without a visual field test," Investigative Opthalmology & Visual Science, vol. 56, no. 6, pp. 3957-3966, 2015.
 F. R. Silva, V. G. Vidotti, F. Cremasco, M. Dias, E. S. Gomi, and V. P. Costa, "Sensitivity and specificity of machine learning classifiers for glaucoma diagnosis using spectral domain OCT and standard automated perimetry," Arquivos Brasileiros de Oftalmologia, vol. 76, no. 3, pp. 170-174, 2013.
 A. Li, J. Cheng, D. W. Wong et al., "Integrating holistic and local deep features for glaucoma classification," in Proceedings of 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), p. 13-28, Orlando, FL, USA, August 2016.
 Z. Li, Y. He, S. Keel, W. Meng, R. T. Chang, and M. He, "Efficacy of a deep learning system for detecting glaucomatous optic neuropathy based on color fundus photographs," Ophthalmology, vol. 125, no. 8, pp. 1199-1206, 2018.
 R. Asaoka, H. Murata, A. Iwase, and M. Araie, "Detecting preperimetric glaucoma with standard automated perimetry using a deep learning classifier," Ophthalmology, vol. 123, no. 9, pp. 1974-1980, 2016.
 H. Muhammad, T. J. Fuchs, C. N. De et al., "Hybrid deep learning on single wide-field optical coherence tomography scans accurately classifies glaucoma suspects," Journal of Glaucoma, vol. 26, no. 12, pp. 1086-1094, 2017.
 Q. Chen, T. Leng, L. Zheng et al., "Automated drusen segmentation and quantification in SD-OCT images," Medical Image Analysis, vol. 17, no. 8, pp. 1058-1072, 2013.
 H. Bogunovic, A. Montuoro, M. Baratsits et al., "Machine learning of the progression of intermediate age-related macular degeneration based on OCT imaging," Investigative Opthalmology & Visual Science, vol. 58, no. 6, pp. O141-O150, 2017.
 U. Chakravarthy, D. Goldenberg, G. Young et al., "Automated identification of lesion activity in neovascular age-related macular degeneration," Ophthalmology, vol. 123, no. 8, pp. 1731-1736, 2016.
 M. J. van Grinsven, G. H. Buitendijk, C. Brussee et al., "Automatic identification of reticular pseudodrusen using multimodal retinal image analysis," Investigative Ophthalmology & Visual Science, vol. 56, no. 1, pp. 633-639, 2015.
 A. K. Feeny, M. Tadarati, D. E. Freund, N. M. Bressler, and P. Burlina, "Automated segmentation of geographic atrophy of the retinal epithelium via random forests in AREDS color fundus images," Computers in Biology and Medicine, vol. 65, pp. 124-136, 2015.
 S. Lahmiri and M. Boukadoum, "Automated detection of circinate exudates in retina digital images using empirical mode decomposition and the entropy and uniformity of the intrinsic mode functions," Biomedical Engineering, vol. 59, no. 4, pp. 5527-5535, 2014.
 H. Bogunovic, S. M. Waldstein, T. Schlegl et al., "Prediction of anti-VEGF treatment requirements in neovascular AMD using a machine learning approach," Investigative Opthalmology & Visual Science, vol. 58, no. 7, p. 3240, 2017.
 P. Prahs, V. Radeck, C. Mayer et al., "OCT-based deep learning algorithm for the evaluation of treatment indication with anti-vascular endothelial growth factor medications," Graefe's Archive for Clinical and Experimental Ophthalmology, vol. 256, no. 1, pp. 91-98, 2017.
 M. R. K. Mookiah, U. R. Acharya, H. Fujita et al., "Local configuration pattern features for age-related macular degeneration characterization and classification," Computers in Biology and Medicine, vol. 63, pp. 208-218, 2015.
 P. Fraccaro, M. Nicolo, M. Bonetto et al., "Combining macula clinical signs and patient characteristics for age-related macular degeneration diagnosis: a machine learning approach," BMC Ophthalmology, vol. 15, no. 1, p. 10, 2015.
 P. M. Burlina, N. Joshi, M. Pekala, K. D. Pacheco, D. E. Freund, and N. M. Bressler, "Automated grading of age-related macular degeneration from color fundus images using deep convolutional neural networks," JAMA Ophthalmology, vol. 135, no. 11, p. 1170, 2017.
 P. Burlina, K. D. Pacheco, N. Joshi, D. E. Freund, and N. M. Bressler, "Comparing humans and deep learning performance for grading AMD: a study in using universal deep features and transfer learning for automated AMD analysis," Computers in Biology and Medicine, vol. 82, pp. 80-86, 2017.
 M. Treder, J. L. Lauermann, and N. Eter, "Automated detection of exudative age-related macular degeneration in spectral domain optical coherence tomography using deep learning," Graefe's Archive for Clinical and Experimental Ophthalmology, vol. 256, no. 2, pp. 259-265, 2017.
 P. Prentasic and S. Loncaric, "Detection of exudates in fundus photographs using deep neural networks and anatomical landmark detection fusion," Computer Methods and Programs in Biomedicine, vol. 137, pp. 281-292, 2016.
 J. J. Yang, J. Li, R. Shen et al., "Exploiting ensemble learning for automatic cataract detection and grading," Computer Methods and Programs in Biomedicine, vol. 124, pp. 45-57, 2016.
 M. Caixinha, J. Amaro, M. Santos, F. Perdigao, M. Gomes, and J. Santos, "In-vivo automatic nuclear cataract detection and classification in an animal model by ultrasounds," IEEE Transactions on Biomedical Engineering, vol. 63, no. 11, pp. 2326-2335, 2016.
 S. V. MK and R. Gunasundari, "Computer-aided diagnosis of anterior segment eye abnormalities using visible wavelength image analysis based machine learning," Journal of Medical Systems, vol. 42, no. 7, 2018.
 S. Mohammadi, M. Sabbaghi, H. Z-Mehrjardi et al., "Using artificial intelligence to predict the risk for posterior capsule opacification after phacoemulsification," Journal of Cataract and Refractive Surgery, vol. 38, no. 3, pp. 403-408, 2012.
 X. Gao, S. Lin, and T. Y. Wong, "Automatic feature learning to grade nuclear cataracts based on deep learning," IEEE Transactions on Biomedical Engineering, vol. 62, no. 11, pp. 2693-2701, 2015.
 D. Lin, J. Chen, Z. Lin et al., "10-Year overview of the hospital-based prevalence and treatment of congenital cataracts: the CCPMOH experience," PLoS One, vol. 10, no. 11, Article ID e142298, 2015.
 X. Liu, J. Jiang, K. Zhang et al., "Localization and diagnosis framework for pediatric cataracts based on slit-lamp images using deep features of a convolutional neural network," PLoS One, vol. 12, no. 3, Article ID e168606, 2017.
 E. Long, H. Lin, Z. Liu et al., "An artificial intelligence platform for the multihospital collaborative management of congenital cataracts," Nature Biomedical Engineering, vol. 1, no. 2, p. 24, 2017.
 I. Kovacs, K. Mihaltz, K. Kranitz et al., "Accuracy of machine learning classifiers using bilateral data from a Scheimpflug camera for identifying eyes with preclinical signs of keratoconus," Journal of Cataract and Refractive Surgery, vol. 42, no. 2, pp. 275-283, 2016.
 I. Ruiz Hidalgo, P. Rodriguez, J. J. Rozema et al., "Evaluation of a machine-learning classifier for keratoconus detection based on Scheimpflug," Tomography, vol. 35, no. 6, pp. 827-832, 2016.
 R. Koprowski, M. Lanza, and C. Irregolare, "Corneal power evaluation after myopic corneal refractive surgery using artificial neural networks," BioMedical Engineering OnLine, vol. 15, no. 1, 2016.
 J. D. S. D. Almeida, A. C. Silva, J. A. M. Teixeira, A. C. Paiva, and M. Gattass, "Surgical planning for horizontal strabismus using support vector regression," Computers in Biology and Medicine, vol. 63, pp. 178-186, 2015.
 Y. Xu, K. Yan, J. Kim et al., "Dual-stage deep learning framework for pigment epithelium detachment segmentation in polypoidal choroidal vasculopathy," Biomedical Optics Express, vol. 8, no. 9, p. 4061, 2017.
 J. Y. Choi, T. K. Yoo, J. G. Seo, J. Kwak, T. T. Um, and T. H. Rim, "Multi-categorical deep learning neural network to classify retinal images: a pilot study employing small database," PLoS One, vol. 12, no. 11, Article ID e187336, 2017.
 L. H. Gilpin, D. Bau, B. Z. Yuan, A. Bajwa, M. Specter, and L. Kagal, "Explaining explanations: an approach to evaluating interpretability of machine learning," http://arxiv.org/ abs/1806.00069, 2018.
Wei Lu [ID], (1) Yan Tong [ID], (1) Yue Yu [ID] (2) Yiqiao Xing [ID], (1) Changzheng Chen [ID], (1) and Yin Shen [ID] (1)
(1) Eye Center, Renmin Hospital of Wuhan University, Eye Institute of Wuhan University, Wuhan, Hubei, China
(2) Hisee Medical Artificial Intelligent Lab, Wuhan University, Wuhan, Hubei, China
Correspondence should be addressed to Yin Shen; firstname.lastname@example.org
Received 7 July 2018; Revised 6 October 2018; Accepted 17 October 2018; Published 19 November 2018
Academic Editor: Hiroshi Kunikata
Caption: Figure 1: Introduction of AI algorithms. (a) The relationship among AI, ML, and DL. (b) The workflow of a RF. (c) The principle of an SVM. (d) The schematic diagram of a typical deep neural network.
Caption: Figure 2: Data partitioning method during data processing. (a) A brief introduction of data partition. (b) An illustration of a specific process of 5-fold cross-validation.
Caption: Figure 3: Publication of AI application in diagnosing ophthalmological diseases. (a) Publication statistics per ophthalmological diseases. (b) Publication statistics per year (Jan 1, 2007 to Sep 20, 2018).
Table 1: Introduction of existing CML techniques in the AI medical field. Classifiers Principles Decision (i) Tree-like structure trees (ii) Solve classification and regression problems based on rules to binary split data Random (i) Ensemble a multitude of decision trees for forests classification (ii) The ultimate prediction is made by majority voting Support vector Build a hyperplane that separates the positive and machines negative examples as wide as possible to minimize the separation error (i) Based on the probability approach Bayesian (ii) Assign a new sample to the category with classifiers maximum posterior probability, depending on the given prior probability, cost function, and category conditional density k-nearest Search for k-nearest training instances and classify neighbors a new instance into the most frequent class of these k instances k-means Partition n samples into k clusters in which each sample belongs to the cluster with the nearest mean Linear (i) Create predictive functions that maximize the discriminant discrimination between previously established categories analysis (i) Consists of a collection of connected units, which can process signals Neural (ii) Connections between them can transmit a signal networks to another (iii) Units are organized in layers (iv) Signals travel from the input layer to the output layer Table 2: Concise introduction of CNN algorithms used in AI diagnosis. Models Layers Top-5 error * (%) ILSVRC# AlexNet (2012) 8 layers 15.3 2012 VGG (2014) 19 layers 7.3 2014 ResNet-152 (2015) 152 layers 3.57 2015 ResNet-101 101 layers 4.6 -- ResNet-50 50 layers 5.25 -- ResNet-34 34 layers 5.6 -- GoogleNet/inception 22 layers 6.7 2014 v1 (2014)  Inception v2 33 layers 4.8 -- (2015)  Inception v3 47 layers 3.5 -- (2015)  Inception 77 layers 3.08 -- v4 (2016)  * The fraction of test images for which the correct label is not among the five labels considered most probable by the algorithm. The lower the top-5 error, the better the classifier perform. #ImageNet large-scale visual recognition challenge. Table 3: The ophthalmic imaging modalities in AI diagnosis. Imaging modalities Image features Applications Fundus image Show a magnified and Retinal diseases subtle view of the diagnose surface of the retina Optical coherence Show micrometer- Retinal diseases tomography resolution, cross- diagnose sectional images of the retina Show a rough cross- Evaluate the Ocular sectional view of the condition of lens, ultrasound B-scan eye and the orbit vitreous, retina, and tumor Slit-lamp image Provides a stereoscopic Anterior segment magnified view of the diseases diagnose anterior segment in detail Visual field Show the size and shape To find disorders of of field-of-view the visual signal processing system that includes the retina, optic nerve, and brain Table 4: Introduction of metrics to evaluate the performance of a model. Metrics Definitions Accuracy Measure the proportion of samples that are correctly identified by a classifier among all samples Sensitivity/ The number of actual positives divided by the recall rate samples number of all samples that have been identified as positive by a gold standard Specificity The number of actual negatives divided by the number of all samples that have been identified as negative by a gold standard The number of actual positives divided by the Precision/ number of all positives identified by a positive classifier predictive value Kappa value To examine the agreement between a model with the ground truth on the assignment of categories Dice coefficient/ Harmonic average of the precision and recall, where a F1 score reaches its best value at 1 (perfect precision and recall) and worst at 0 Table 5: Studies on eye diseases using DL techniques. Groups Aim Data sets Gulshan et al.  DR detection Public: EyePACS, Messidor (Google Inc.) 128175 fundus images Gargeya and Leng DR detection Public: EyePACS, Messidor,  (Byers Eye E-ophtha 77348 fundus Institute at images Stanford) Quellec et al.  DR detection Public: Kaggle, DiaretDB, (Brest University) heatmaps E-ophtha 196590 fundus creation images Ardiyanto et al. DR grading Public: FINDeRS 315 fundus  (Universitas images Gadjah Mada) ElTanboly et al. DR detection Local: 52 SD-OCT scans  (Mansoura University) Give an Prahs et al.  indication of Local: 183,402 OCT B-scans (Department of the treatment Ophthalmology, of anti-VEGF University injection Regensburg) Abramoff et al.  DR detection Public: Messidor 1748 (University of Iowa fundus images Hospitals and Clinics) University Takahashi et al. DR grading Local: 9939 fundus images  (Department of Ophthalmology, Jichi Medical University) Abbas et al.  DR grading Public: Messidor, (Surgery Department DiaretDB, FAZ 500 fundus and Glaucoma Unity, images Local: 250 fundus University Hospital images Puerta del Mar, Cadiz) Chen et al.  Glaucoma Public: Origa, Sees 2326 (Institute for detection fundus images Infocomm Research, Agency for Science, Technology and Research; Singapore National Eye Centre) Li et al.  Glaucoma Public: Origa 650 fundus (Institute for detection images Infocomm Research, Agency for Science, Technology and Research) Asaoka et al.  Preperimetric Local: 279 VFs (Department of OAG detection Ophthalmology, The University of Tokyo) Muhammad et al.  Glaucoma Local: 612 single wide- Department of detection field OCT images Physiology, Weill Cornell Medicine) Li et al.  Glaucoma Local: 8000 fundus images (Zhongshan detection Ophthalmic Center, Sun Yatsen University) Burlina et al.  AMD Grading Public: AREDS 5664 fundus (Retina Division, images Wilmer Eye Institute, Johns Hopkins University School of Medicine) Burlina et al.  AMD detection Public: AREDS 130000 (Retina Division, fundus images Wilmer Eye Institute, Johns Hopkins University School of Medicine) Treder et al.  Department of AMD detection Local: 1112 SD-OCT images Ophthalmology, University of Muunster Medical Center) Gao et al.  Nuclear Public: ACHIKO-NC 5378 (Microsoft Research cataracts slit-lamp images Asia and Singapore grading Eye Research Institute) Long et al.  Pediatric Local: CCPMOH 886 slit- (Zhongshan cataracts lamp images Ophthalmic Centre, detection Sun Yatsen University) Choi et al.  Multiple Public: STARE 397 fundus (Department retinal images of Ophthalmology, diseases Yonsei University detection College of Medicine) Groups Deep learning Performance techniques Gulshan et al.  DCNN AUC (Google Inc.) 0.991 for EyePACS 0.990 for Messidor Gargeya and Leng DCNN AUC  (Byers Eye 0.94 for Messidor data set Institute at 0.95 for E-ophtha data set Stanford) Quellec et al.  CNN AUC = 0.954 in Kaggle's (Brest University) data set AUC = 0.949 in E- ophtha data set Ardiyanto et al. CNN Detection  (Universitas Accuracy: 95.71% Gadjah Mada) Sensitivity: 76.92% Specificity: 100% Grading Accuracy: 60.28% Sensitivity: 65.40% Specificity: 73.37% ElTanboly et al. DFCN AUC: 0.98  (Mansoura Accuracy: 92% Univerdity) Sensitivity: 83% Specificity: 100% Prahs et al.  DCNN AUC: 0.968 Accuracy: 95.5% (Department of (GoogLeNet) Sensitivity: 90.1% Ophthalmology, Specificity: 96.2% University Regensburg) Abramoff et al.  CNN Referable DR: (University of Iowa AUC: 0.980 Hospitals and Sensitivity: 96.8% Clinics) University Specificity: 87% Vision threatening DR: AUC: 0.989 Sensitivity: 100% Specificity: 90.8% Takahashi et al. DCNN (GoogLeNet) Accuracy: 0.64-0.82  (Department of Ophthalmology, Jichi Medical University) Abbas et al.  DNN AUC: 0.924 (Surgery Department Sensitivity: 92.18% and Glaucoma Unity, Specificity: 94.50% University Hospital Puerta del Mar, Cadiz) Chen et al.  DCNN AUC: 0.831 for Origa (Institute for 0.887 for Sees Infocomm Research, Agency for Science, Technology and Research; Singapore National Eye Centre) Li et al.  DCNN (AlexNet, Best AUC: 0.8384 (Institute for VGG-19, VGG- AlexNet > VGG-19 = VG-16 Infocomm Research, 16, GoogLeNet) > GoogLeNet Agency for Science, Technology and Research) Asaoka et al.  DFNN AUC: 92.6% (Department of Ophthalmology, The University of Tokyo) Muhammad et al.  DCNN (AlexNet) Accuracy: 65.7%~92.4% Department of Physiology, Weill Cornell Medicine) Li et al.  DCNN (GoogleNet) AUC: 0.986 (Zhongshan Sensitivity: 95.6% Ophthalmic Center, Specificity: 92% Sun Yatsen University) Burlina et al.  DCNN Accuracy (Retina Division, 79.4% (4-class) Wilmer Eye 81.5% (3-class) Institute, Johns 93.4% (2-class) Hopkins University School of Medicine) Burlina et al.  DCNN (AlexNet) AUC: 0.94-0.96 (Retina Division, Accuracy: 88.4%-91.6% Wilmer Eye Institute, Johns Hopkins University School of Medicine) Treder et al.  Department of DCNN Sensitivity: 100% Ophthalmology, Specificity: 92% University of Accuracy: 96% Muunster Medical Center) Gao et al.  CNN and SVM Accuracy: 70.7% (Microsoft Research Asia and Singapore Eye Research Institute) Long et al.  DCNN Accuracy (Zhongshan 98.87% (detection) Ophthalmic Centre, 97.56% (treatment suggestion) Sun Yatsen University) Choi et al.  DCNN (VGG-19) Accuracy 30.5% (all (Department categories were included) of Ophthalmology, 36.7% (using ensemble Yonsei University classifiers) 72.8% College of Medicine) (considering only normal, DR and AMD) Groups Conclusions Gulshan et al.  The DCNN had high (Google Inc.) sensitivity and specificity for detecting referable DR (moderate and worse DR, referable diabetic macular edema, or both) Gargeya and Leng The DCNN can be used to  (Byers Eye screen fundus images to Institute at identify DR with high Stanford) reliability Quellec et al.  The proposed method is a (Brest University) promising image mining tool to discover new biomarkers in images. The model trained to detect referable DR can detect some lesions and outperforms recent algorithms trained to detect those lesions specifically Ardiyanto et al. The network needs more data  (Universitas to train. And, this work Gadjah Mada) opens up the future possibility to establish an integrated DR system to grade the severity at a low cost ElTanboly et al. The proposed CAD system for  (Mansoura early DR detection using Univerdity) the OCT retinal images has good outcome and outperforms than other 4 conventional classifiers Prahs et al.  The DCNN neural networks are (Department of effective in assessing OCT scans Ophthalmology, with regard to treatment University indication with anti-VEGF Regensburg) medications Abramoff et al.  The DL enhanced algorithms (University of Iowa have the potential to Hospitals and improve the efficiency of Clinics) University DR screening Takahashi et al. The proposed novel AI  (Department of disease-staging system have Ophthalmology, Jichi the ability to grade DR Medical University) involving retinal areas not typically visualized on fundoscopy Abbas et al.  The system is appropriate (Surgery Department for early detection of DR and Glaucoma Unity, and provides an effective University Hospital treatment for prediction Puerta del Mar, type of diabetes Cadiz) Chen et al.  Present a DL framework for (Institute for glaucoma detection based on Infocomm Research, DCNN Agency for Science, Technology and Research; Singapore National Eye Centre) Li et al.  The proposed method that (Institute for integrates both local and Infocomm Research, holistic features of optic Agency for Science, disc to detect glaucoma is Technology and Conclusions reliable Research) Asaoka et al.  Using a deep FNN can (Department of distinguish preperimetric Ophthalmology, The glaucoma VFs from healthy University of Tokyo) VFs with very high accuracy, which is better than the outcome obtained from ML techniques Muhammad et al.  The proposed protocol Department of outperforms standard OCT Physiology, Weill and VF in distinguishing Cornell Medicine) healthy suspect eyes from eyes with early glaucoma Li et al.  DL can be applied to detect (Zhongshan referable glaucomatous Ophthalmic Center, optic neuropathy with high Sun Yatsen sensitivity and specificity University) Burlina et al.  Demonstrates comparable (Retina Division, performance between Wilmer Eye computer and physician Institute, Johns grading Hopkins University School of Medicine) Burlina et al.  Applying a DL-based (Retina Division, automated assessment of AMD Wilmer Eye from fundus images can Institute, Johns produce results that are Hopkins University similar to human School of Medicine) performance levels Treder et al.  Department of With the DL-based approach, Ophthalmology, it is possible to detect University of AMDin SD-OCT with good Muunster Medical outcome. With more image Center) data, the model can get more practical value in clinical decisions Gao et al.  The proposed method is (Microsoft Research useful for assisting And Asia and Singapore improving diagnosis of the Eye Research disease in the background Institute) of large-population screening and has the potential to be applied to other eye diseases Long et al.  The AI agent using DL have (Zhongshan the ability to accurately Ophthalmic Centre, diagnose and provide Sun Yatsen treatment decisions for University) congenital cataracts. And the AI agent and individual ophthalmologists perform equally well. A cloud- based platform integrated with the AI agent for multihospital collaboration was built to improve disease management Choi et al.  As the number of categories (Department increased, the performance of Ophthalmology, of the DL model has Yonsei University declined. Several ensemble College of Medicine) classifiers enhanced the multicategorical classification performance. Large data sets should be applied to confirm the effectiveness of the proposed model
|Printer friendly Cite/link Email Feedback|
|Author:||Lu, Wei; Tong, Yan; Yu, Yue; Xing, Yiqiao; Chen, Changzheng; Shen, Yin|
|Publication:||Journal of Ophthalmology|
|Date:||Jan 1, 2018|
|Previous Article:||Refraction and Ocular Biometry of Preschool Children in Shanghai, China.|
|Next Article:||In Vivo Confocal Microscopic Observations of Vortex Keratopathy in Patients with Amiodarone-Induced Keratopathy and Fabry Disease.|