Skip to main content

Preliminary analysis of predicting the first recurrence in patients with neovascular age-related macular degeneration using deep learning

Abstract

Background

To predict, using deep learning, the first recurrence in patients with neovascular age-related macular degeneration (nAMD) after three monthly loading injections of intravitreal anti-vascular endothelial growth factor (anti-VEGF).

Methods

Optical coherence tomography (OCT) images were obtained at baseline and after the loading phase. The first recurrence was defined as the initial appearance of a new retinal hemorrhage or intra/subretinal fluid accumulation after the initial resolution of exudative changes after three loading injections. Standard U-Net architecture was used to identify the three retinal fluid compartments, which include pigment epithelial detachment, subretinal fluid, and intraretinal fluid. To predict the first recurrence of nAMD, classification learning was conducted to determine whether the first recurrence occurred within three months after the loading phase. The recurrence classification architecture was built using ResNet50. The model with retinal regions of interest of the entire region and fluid region on OCT at baseline and after the loading phase is presented.

Results

A total of 1,444 eyes of 1,302 patients were included. The mean duration until the first recurrence after the loading phase was 8.20 ± 15.56 months. The recurrence classification system revealed that the model with the fluid region of OCT after the loading phase provided the highest classification performance, with an area under the receiver operating characteristic curve (AUC) of 0.725 ± 0.012. Heatmap analysis revealed that three pathological fluids, subsided choroidal neovascularization lesions, and hyperreflective foci were important areas for the first recurrence.

Conclusions

The deep learning algorithm allowed for the prediction of the first recurrence for three months after the loading phase with adequate feasibility. An automated prediction system may assist in establishing patient-specific treatment plans and the provision of individualized medical care for patients with nAMD.

Peer Review reports

Background

Neovascular age-related macular degeneration (nAMD) is a leading cause of blindness in elderly people [1, 2], and the advent of anti-vascular endothelial growth factor (anti-VEGF) therapy has revolutionized the treatment of nAMD [3,4,5]. Treatment regimens with anti-VEGF agents have relied on retinal fluid in optical coherence tomography (OCT) imaging of the central retinal region to monitor the disease activity and treatment efficacy. The as-needed (pro re nata [PRN]) regimen and the treat-and-extend (TAE) regimen are the two most common strategies used to optimize the management of individual patients. Prediction of disease progression or recurrence using these treatment regimens is especially important in patients with nAMD.

Given the very heterogeneous treatment demand and treatment need of each individual patient, individualized treatment strategies and early detection of recurrence based on changes in pathological fluid seen on OCT are warranted. Kuroda et al. [6] reported that recurrence of retinal exudative change was detected in 65.7% of patients within one year and in 74.8% of patients within two years after the resolution of retinal exudation with initial treatment. Moreover, in cases of severe disease reactivation, if massive subretinal hemorrhage were not treated, irreversible vision loss may occur [7]. Previous studies have reported that retinal thickness and retinal fluid, which include pigment epithelial detachment (PED), subretinal fluid (SRF), and intraretinal fluid (IRF), are common anatomical measures of disease activity in nAMD, and most patients respond well to anti-VEGF agents [8, 9]. Using OCT, clinicians can observe the detailed morphological characterization of macular fluid accumulation. Different morphological changes and the occurrence of retinal fluid are considered important parameters in the prognosis of nAMD [10]. In this study, we will contribute to providing individualized treatment strategies to nAMD patients by predicting the first recurrence after the loading phase.

Recent advances in artificial intelligence, especially deep learning-based convolutional neural networks (CNN), could provide novel promising strategies for the diagnosis of patients with age-related macular degeneration (AMD) [11, 12], and decision-making regarding their treatment [13, 14]. Furthermore, OCT-based response prediction of anti-VEGF treatment, as well as treatment demand in nAMD [15, 16], and visual acuity prediction after initiating treatment [17,18,19] have shown encouraging results. Nevertheless, to our knowledge, predicting the first recurrence of nAMD after the initiation phase using OCT-based deep learning in nAMD has not yet been investigated, and it is thought that recurrence may be related to quantitative evaluation of retinal thickness and qualitative observation of retinal fluid.

The first three consecutive monthly anti-VEGF injections are generally accepted in clinical practice. However, a consensus has not yet been achieved on when to start the fourth injection after initiating treatment. Depending on reimbursement policies by health insurance systems and physicians' preferences, some physicians may prefer initiating the PRN or TAE regimen after confirming the first recurrence. In contrast, others may prefer the early TAE regimen, which initiates the TAE regimen immediately after three initial doses. Since the timing of the first recurrence is very heterogeneous for individual nAMD patients and the treatment burden caused by overtreatment is high, predicting the first recurrence after initiating treatment is very important. In the current study, we aimed to observe the practical feasibility of a prediction tool using OCT-based deep learning algorithms in patients with nAMD. Specifically, we investigated the feasibility of preliminary analysis of predicting the first recurrence within three months after the initiation of anti-VEGF treatment in a routine clinical setting.

Methods

Data sources and participants

We retrospectively reviewed the medical records of 2,266 consecutive patients with treatment-naïve nAMD who visited the Seoul National University Hospital (SNUH) between February 2008 and July 2021. All patients were treated with three consecutive loading intravitreal injections of either ranibizumab (Lucentis; Novartis, Basel Switzerland), aflibercept (Eylea; Bayer Pharma, Germany), or bevacizumab (Avastin; F. Hoffmann-La Roche Ltd, Basel, Switzerland). The study was approved by the Institutional Review Board of Seoul National University Hospital (IRB approval number: 2107–223-1239) and adhered to the tenets of the Declaration of Helsinki. Institutional Review Board of the Seoul National University Hospital waived the need for written informed consent from the participants, because of the study’s retrospective design.

The inclusion criteria were (1) symptomatic nAMD; (2) age ≥ 50 years; (3) three consecutive anti-VEGF injections (i.e., received the first three injections with intervals between each injection shorter than 60 days) followed by PRN dosing; (4) dry macula after the loading phase; (5) availability of both baseline and after the loading OCT images; and (6) follow-up by the time of the first recurrence. The exclusion criteria were (1) other concomitant ocular pathologies that could interfere with visual function; (2) other macular abnormalities (i.e., myopic CNV, angioid streaks or other secondary CNV); (3) persistent exudation despite three consecutive anti-VEGF injections; (4) optical media opacity that substantially disturbed OCT image acquisition; and (5) follow-up loss before the time of the first recurrence.

After initial exclusion, 1,444 eyes from 1,302 patients who met the inclusion criteria were included in this study. Both eyes of the same patient were assessed independently. Before the loading treatment, all patients underwent a comprehensive ophthalmologic examination, including measurement of BCVA, intraocular pressure, slit-lamp biomicroscopy, indirect fundus examination, fundus photography, fluorescein and indocyanine green angiography, and spectral-domain OCT (SD-OCT). Macular 6 × 6 mm OCT scans were obtained using either a Cirrus high-definition OCT (HD OCT, Carl Zeiss Meditec, Dublin, CA, USA) or Spectralis SD-OCT imaging system (Heidelberg Engineering, Heidelberg, Germany). We reviewed the medical records of patients, including demographics, subtypes of nAMD, BCVA, anti-VEGF agents administered, refractive errors, and AL (only in patients with available data). BCVA measurements were made using a Snellen chart and converted to logarithm of the minimum angle of resolution (logMAR) units for statistical analyses. In this study, the first recurrence was defined as the initial appearance of a new retinal hemorrhage or intra/subretinal fluid accumulation after the initial resolution of exudative changes after three loading injections. Although the persistence of PED was not considered a recurrence, the increase in PED size was considered as recurrence. For the first year after the loading phase, monitoring was done every 1 − 2 months, and then every 2 − 3 months until recurrence, depending on the clinician's judgement. Recurrence was evaluated by two independent observers (S.Y.L. and U.C.P.) and a consensus was reached in each case.

Image preprocessing

Two OCT imaging devices were used to obtain OCT scans. Cirrus OCT scans were acquired at a resolution of 500 × 750 pixels per B-scan, and Spectralis SD-OCT scans were acquired at a resolution of 496 × 768 pixels per B-scan. To obtain a more uniform database, all images were normalized with respect to their horizontal orientation relative to the nose, meaning that images from left eyes were flipped to have the same orientation as the right eyes. Baseline OCT scans were obtained when the patient was first diagnosed with nAMD, whereas OCT scans after the loading phase were taken one month after three consecutive loading injections. Regarding different retinal regions of interest (ROIs), OCT scans of the entire region and fluid region were prepared independently. A scan used for the entire region was resized to 512 × 750 pixels, and a scan cropped by a fluid segmentation model to include retinal fluid was used for the fluid region. The output from the fluid segmentation model was used to determine the center point of the fluid mask, and 400 × 400 patches were cropped using the center point. Then, as an augmentation, these were randomly assigned a value between 0 and 50 to x and y, the coordinate values of the center point, to move the center point and crop the patches. This was proceeded in real time during the learning process.

Dataset split

All data were randomly divided into training (70%), validation (20%), and test (10%) sets, and both eyes of the same patient were assigned to the same dataset. Considering the potential imbalance of data between the training and test datasets, the target defined in this study was divided into balanced proportions. Five-fold cross-validation (CV) was performed after merging the training and validation sets due to the limited size of the dataset and to prevent overfitting [20]. Five-fold CV was performed by randomly partitioning the data into five subsets of equal size at the patient level. For each CV group, five instances of the recurrence classification model with different random initializations were trained on four subsets and evaluated on one subset. For the final ensemble, the average of the model instances trained in each CV group was used, and the test set was used to evaluate the final performance of each group.

CNN-based fluid segmentation

A fluid segmentation model was proposed to automatically predict the regions of different fluid compartments, including the PED, SRF, and IRF. A total of 1,105 OCT scans with a fluid mask from the annotated retinal OCT image (AROI) database were used for fluid segmentation. For the AROI database, macular SD-OCT volumes were recorded with the Zeiss Cirrus HD OCT 4000 device [21]. A total of 684 OCT scans collected from SNUH were added as internal data. Fluid segmentation was performed using manual delineation by retinal specialists as the gold standard. Public and internal OCT images was shuffled together and randomly split into 1,220 scans for training, 322 for validation, and 247 for evaluation.

The network architecture was built using U-Net [22]. Standard U-Net architecture was used to identify the three retinal fluid compartments. All the OCT images were resized to 1024 × 512 pixels. The Dice similarity coefficient loss was used to compute the loss between the true mask and the predicted mask in model training [23]. The batch size was set to 8, and an adaptive momentum estimation (Adam) optimizer was applied. The deep learning network was implemented in Python and PyTorch.

CNN-based recurrence classification

To predict the first recurrence of nAMD, classification learning was conducted to determine whether the first recurrence occurred within three months after the loading phase. The target was defined as a recurrence time interval, and if the first recurrence occurred within three months from one month after three consecutive loading injections, it was defined as a positive target. The recurrence classification architecture was built using ResNet50 [24]. To save training time and directly use diverse underlying features that are difficult to be trained well by a small or specified dataset, transfer learning was used for classification using ImageNet pre-trained weights [25]. Training was performed with a batch size of 16 and 8 for the entire region and fluid region, respectively, and the learning rate was set to 0.0001. Optimization was performed using Adam, and the training loss function for each task was given by the softmax cross-entropy loss between the ground truth label \(y\) and the model prediction \(\widehat{y}\) given by an input scan \(x\). Model training was performed for 100 epochs.

Evaluation of the predicted model and statistical analysis

The area under the receiver operating characteristic curve (AUC) score was used as a primary metric for predicting the first recurrence of nAMD [26]. In addition, accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F1 score were used. Five-fold CV yielded five different results, so the performance measure was reported using the mean and standard deviation (SD) of the values computed in each fold. The patient-specific predictors of each model were then used as data for a test comparing the two AUCs using the popular area test proposed by DeLong et al. [27], which can determine whether two classifiers have the same AUC score. Kruskal–Wallis and Chi-squared tests were used for comparisons between datasets divided into training and test sets, and a p-value of < 0.05 was considered statistically significant. The clinical information was analyzed by dividing into subgroups for each range or subtype, and the comparison of true positive and negative rates between each subgroup was conducted using an analysis of variance (ANOVA).

Results

Characteristics of the participants

In total, 1,444 eyes of 1,302 patients were included in the current study (Table 1). Of the 1,302 patients, 716 (54.99%) were male, and 586 (45.01%) were female. The mean age of the patients at baseline was 71.95 ± 8.27 years. The mean logMAR best-corrected visual acuity (BCVA) was 0.71 ± 0.49 at baseline and 0.55 ± 0.48 1 month after the three consecutive loading injections. The axial length (AL) measurement was available for 560 eyes, and the mean AL was 23.54 ± 0.91 mm. Of the 1,444 eyes, 723 (50.07%) were oculus dexters. The nAMD subtypes were as follows: type 1 or 2 choroidal neovascularization (CNV) (1,084 eyes; 75.07%), polypoidal choroidal vasculopathy (PCV, 225 eyes; 15.58%), and retinal angiomatous proliferation (RAP, type 3 CNV, 135 eyes; 9.35%). The anti-VEGF agents administered for three consecutive loading injections were as follows: ranibizumab (659 eyes, 45.64%), aflibercept (638 eyes, 44.18%), and bevacizumab (147 eyes, 10.18%). The mean duration until the first recurrence after the loading phase was 8.20 ± 15.56 months. A total of 888 images from Cirrus and 556 images from the Spectralis OCT system were used. No statistical differences were observed in the demographics and baseline clinical characteristics between the training and test sets.

Table 1 Demographics and baseline clinical characteristics of the study participants

CNN-based fluid segmentation

Two deep learning algorithms were applied to retinal OCT scans: fluid segmentation and recurrence classification (Fig. 1). In total, 247 OCT scans with fluid masks were evaluated for the fluid segmentation model. Using the fluid segmentation algorithm, PED, IRF and SRF segmentations were computed on all OCT volumes for feasibility of quantitative score and retinal region of interest (ROI) for fluid region was extracted for qualitative observations. Examples of segmentations are shown in Fig. 2. Without considering the different types of fluids, all three fluids were considered as one retinal fluid to determine the center point to crop out the fluid region.

Fig. 1
figure 1

A schematic of deep learning model training, validation, and performance assessment for predicting the first recurrence of neovascular age-related macular degeneration patients using optical coherence tomography (OCT) image sets. A OCT scans were obtained at baseline and after the loading phase, which was taken one month after three consecutive anti-vascular endothelial growth factor (anti-VEGF) loading injections. B Retinal regions of interest (ROIs) were found using a fluid segmentation network. Recurrence classification network (ResNet50) (C) and gradient-weighted class activation mapping visualization (D) is presented

Fig. 2
figure 2

Convolutional neural network-based fluid segmentation results and segmentation color key. A Representative original images of optical coherence tomography (OCT) scans. B OCT scans with retinal fluid regions. All retinal fluid was coded in yellow. C OCT scans with three different retinal fluids. Pigment epithelial detachment (PED) was coded in pink, subretinal fluid (SRF) in green, and intraretinal fluid (IRF) in blue

CNN-based recurrence classification

Of a total of 1,444 eyes, after the loading phase, 743 eyes showed the first recurrence within three months and 701 eyes showed the first recurrence after three months. More specifically, the distribution according to the timing of recurrence was 743 (51.45%) within 3 months, 1,034 (71.61%) within 6 months, 1,214 (84.07%) within 1 year, 1,326 (91.83%) within 2 years, and 118 eyes (8.17%) had a first recurrence after 2 years. The distribution of timing of the first recurrence was as follows in each set; Training set (median 2.99 months; interquartile range 1.38 − 6.90; min − max 0.46 − 134.21), Test set (median 2.89 months; interquartile range 1.35 − 7.26; min − max 0.62 − 160.46), Total set (median 2.99 months; interquartile range 1.38 − 7.05; min − max 0.46 − 160.46). A total of 149 eyes from 130 patients in the test set were evaluated at baseline and after the loading phase. Of these eyes, 77 (51.68%) showed the first recurrence within three months, and 72 (48.32%) showed the first recurrence after three months. The results of the recurrence classification task under the four experimental conditions are shown in Table 2 and the results of the receiver operating characteristic (ROC) curve with the area under the receiver operating characteristic curve (AUC) score are shown in Fig. 3. The cutoff point was set to 0.5, which means that any observation with a predicted probability of 0.5 or greater was classified as positive, and any observation with a predicted probability less than 0.5 was classified as negative. The model with the fluid region of OCT scans after the loading phase provided the highest classification performance, with an AUC of 72.5%, followed by the model with the entire region of OCT scans after the loading phase, with an AUC of 71.6%.

Table 2 Classification performance of the convolutional neural network models
Fig. 3
figure 3

Receiver operating characteristic (ROC) curves for convolutional neural network-based recurrence classification. ROC curves for the entire region (A) and the fluid region (B) of optical coherence tomography (OCT) scans at baseline. ROC curves for the entire region (C) and the fluid region (D) of OCT scans after the loading phase

To observe the differences between these four classifiers, the AUC performance between the four classifiers was compared using DeLong’s test. The resulting p-values were 0.135 and 0.718, indicating that the estimates do not differ between the entire and fluid regions based on OCT at baseline and after the loading phase, respectively. However, the model comparison results with p-values of < 0.001 and 0.009 indicate that the model estimates differ significantly between the baseline and after the loading phase based on the same entire and fluid regions (Table 3). The OCT scan model after the loading phase outperformed the other models in predicting the first recurrence of nAMD.

Table 3 Comparison of the AUC performance between four different experimental conditions

Interpretation of the model decisions

The gradient-weighted class activation mapping (Grad-CAM) uses gradients of target to represent a localization map highlighting the main regions in the image for predicting the target [28]. Grad-CAM uses the gradients of any target concept flowing into the final convolutional layer (Conv5_3) of the model trained using OCT scans to extract the feature maps and compute the weights of the feature maps to identify the areas displaying the greatest effect of the first recurrence of nAMD.

As shown in Fig. 4, the heatmap represents the most important region in each image of the trained CNN when classified as a recurrence within three months after the loading phase. In true positive cases (Fig. 4A), heatmap analysis with Grad-CAM highlighted areas of IRF and SRF as important areas on the OCT scans. In addition, areas of hyperreflective foci are often highlighted in the OCT images. In false positive cases (Fig. 4B), recurrence occurred after three months, and it was incorrectly predicted that recurrence would occur within three months after the loading phase. Areas of persistent PED are often highlighted on the OCT scans, even though the persistence of PED was not considered to indicate recurrence in the current study. In false negative cases (Fig. 4C), recurrence occurred within three months, and it was incorrectly predicted that recurrence would occur after three months after the loading phase; heatmap analysis concentrated on areas other than the main lesion when there was hemorrhage or large lesion and obtained an incorrect prediction result. There were cases of misprediction when the recurrence occurred very close to the three months’ time point. In true negative cases (Fig. 4D), Grad-CAM mainly highlighted the main lesions of subsided CNV, with one coarse and large-scale attention. Figure 5 shows representative cases of correct predictions of the first recurrence within three months and later.

Fig. 4
figure 4

Representative cases of gradient-weighted class activation mapping (Grad-CAM) visualization. Grad-CAM extracts the feature map of the last convolutional layer and shows a heatmap within the image describing the calculated weight of the feature map. A True positive; B False positive; C False negative; D True negative cases

Fig. 5
figure 5

Representative cases of neovascular age-related macular degeneration. Optical coherence tomography (OCT) scans at baseline, after the loading phase, and at the time of the first recurrence are presented. Convolutional neural network-based fluid segmentation results and gradient-weighted class activation mapping (Grad-CAM) are also noted. The red bounding box indicates the area shown by the Grad-CAM. A OCT demonstrates the first recurrence at 1.1499 months (within three months) after the loading phase, predicting a value higher than 0.5 with a prediction of 0.6391, which is true positive. B OCT demonstrates the first recurrence at 7.3595 months (after three months) after the loading phase, predicting a value lower than 0.5 with a prediction of 0.1362, which is true negative

When classifying the recurrence based on three months, the prediction values were closer to 1 or 0 if the recurrence periods were less than two months or more than four months, respectively, whereas the prediction values were closer to 0.5 if the recurrence periods were close to the three months’ time point (Supplementary Fig. 1).

Discussion

In the present study, we evaluated the practical feasibility of a prediction tool using OCT-based deep learning algorithms in patients with nAMD. The time to first recurrence of exudation after acquiring a dry macula following three consecutive anti-VEGF loading phase in a routine clinical setting was analyzed to evaluate whether the algorithms could reliably predict the recurrence within three months. Our results demonstrate that model with the fluid region of OCT scans after the loading phase provided the highest classification performance, with an AUC of 72.5%. By proposing a deep learning algorithm to predict the first recurrence using OCT image, we believe that this study has important clinical significance for attempting to individualize decision-making for nAMD patients, which is a heterogeneous disease.

There has been an advance in recent research in AMD utilizing machine learning algorithms, regarding not only diagnosing or classifying diseases but also predicting future events. Schmidt-Erfurth et al. investigated individual disease conversion in early AMD using artificial intelligence [29]. They demonstrated that the model differentiated converting versus non-converting eyes with a performance of 68% and 80% for CNV and geographic atrophy and the most critical features for progression were outer retinal thickness, hyperreflective foci, and drusen area. Ajana et al. evaluated a prediction model for advanced AMD allowing selection of the most predictive risk factors automatically [30]. They revealed that the prediction model achieved an 92% AUC in differentiating the high-risk groups. While it is challenging to make a direct comparison between predicting the progression of early AMD to advanced AMD and predicting the reactivation of nAMD after loading treatment in the current study, the fact that the performance of Schmidt-Erfurth et al.'s model [29] did not exceed 0.8 and the best performance in this study was an AUC of 72.5% shows that it is still challenging to develop a CNN model to predict the future using only OCT images.

Previous studies predicting anti-VEGF treatment demand or frequency in nAMD suggest that machine learning may assist in establishing patient-specific treatment plans in the near future. Gallardo et al. demonstrated mean AUCs of 0.79 and 0.79 for low and high demand in the nAMD-trained models [16], and Pfau et al. revealed mean AUCs from 0.61 to 0.7 for low and high treatment requirement in nAMD patients [31]. Chandra et al. also showed AUCs of 0.79 − 0.82 and 0.79 − 0.81 for predicting few and many injections [32], and Romo-Bucheli et al. revealed AUC of 0.85 in detecting the patients with low and high treatment requirement in nAMD [33]. Meanwhile, machine learning algorithms that predict visual acuity after anti-VEGF therapy may also encourage patients to adhere to intravitreal therapy and contribute to personalized medicine. Rohm et al. evaluated visual acuity at 3 and 12 months in patients with nAMD after initial three anti-VEGF injections and revealed that machine learning allowed visual acuity to be predicted for three months with a comparable result to real visual acuity measurements [17]. Fu et al. investigated the predictive usefulness of quantitative imaging biomarkers from OCT scans in future visual outcomes of nAMD patients starting anti-VEGF therapy [18]. They revealed that visual outcomes under antiangiogenic therapy can be predicted using retinal tissue volumes that have been quantified automatically from OCT images.

In this study, the prediction of recurrence after initial anti-VEGF injections could be used in 3 main ways on an individual patient basis in the form of personalized medicine: (1) It may give caution to high-risk patients who will exhibit early recurrence within three months and encourage them to adhere to regular monitoring. (2) For low-risk patients who are expected to have a late recurrence, it could alleviate the anxieties and follow-up can be more flexible. (3) It could help clinicians in determining the follow-up duration of patients and decision-making around anti-VEGF injections, including recommending or withholding injections.

Although this study did not allow us to determine which biomarkers on OCT were more important features, we did find that the retinal morphological characteristics after the initial three anti-VEGF injections were more important in predicting the recurrence than the retinal appearance with the various pathological fluids at initial presentation. We also found that the clinical significance of pathological fluids on nAMD recurrence still remains crucial, as the model performance was better with fluid ROIs than with entire ROIs of OCT scans. As shown in Figs. 4 and 5, the heatmap results of recurrence classification and retinal fluid segmentation mainly highlighted areas of pathologic fluid, such as PED, SRF, and IRF, as important areas on OCT scans. Grad-CAM highlighted the main CNV lesions or hyperreflective foci to predict early recurrence. The areas of PED were often emphasized in false positive cases, and although persistent PED was not defined as a recurrence in this study, it is well known that PED is also related to CNV activity [34, 35]. Since other pathological fluids, such as SRF and IRF, frequently recur after PED growth, our study may identify that PED is still a meaningful biomarker associated with nAMD recurrence.

Our study has several limitations. First, the number of patients was small, and the recurrence interval was arbitrarily set to three months and classified. Among the patients who experienced recurrence after three months, there was a group of patients who experienced recurrence after 120 months; therefore, further studies are needed to further refine the timing of recurrence. Second, the study population included heterogenous group with different treatment of anti-VEGF agents and treatment interval between each injection. These variations could be confounding factors for prediction of disease recurrence. Nevertheless, we believe that this study is significant in that it evaluates the feasibility of a model to predict recurrence in a real clinical setting where treatment demand and need are highly heterogeneous. Third, it was possible to determine that the retinal fluid had a significant impact; however, it was difficult to determine the relationship between fluid volume and recurrence prediction. Although the performance of the fluid segmentation model was sufficient to detect the fluid region, it may not be accurate to compute the fluid volume on OCT images. Lastly, a series of sequential inputs were not used, but a single OCT image input was utilized in the current study. In the CNN model for time series forecast utilization, OCT images at baseline and after the loading phase could be sequentially utilized. A better model that specifically predicts whether recurrence will occur in an individual patient by combining serial OCT images and clinical information together and the actual time to recurrence is planned in future research.

Conclusion

In conclusion, we examined the feasibility of predicting the first recurrence within three months after the loading phase of anti-VEGF treatment using OCT-based deep learning algorithms in patients with nAMD in a routine clinical setting. The model with the fluid region of the OCT scans and that after the loading phase provided the highest classification performance. Heatmaps revealed that pathological fluids, such as PED, SRF, and IRF, subsided CNV lesions, and hyperreflective foci were important areas for the first recurrence on OCT scans. This automated prediction system will aid in the provision of individualized medical care for patients with nAMD.

Availability of data and materials

All data generated or analyzed during this study are included in this published article.

Abbreviations

nAMD:

Neovascular age-related macular degeneration

anti-VEGF:

Anti-vascular endothelial growth factor

OCT:

Optical coherence tomography

ROIs:

Region-of-interests

AUC:

Area under the receiver operating characteristic curve

PED:

Pigment epithelial detachment

SRF:

Subretinal fluid

IRF:

Intraretinal fluid

CNV:

Choroidal neovascularization

PRN:

Pro re nata

TAE:

Treat-and-extend

CNN:

Convolutional neural network

SNUH:

Seoul National University Hospital

BCVA:

Best-corrected visual acuity

SD-OCT:

Spectral-domain optical coherence tomography

AL:

Axial length

logMAR:

Logarithm of the minimum angle of resolution

CV:

Cross-validation

AROI:

Annotated Retinal OCT Images

Adam:

Adaptive momentum estimation

PPV:

Positive predictive values

NPV:

Negative predictive values

ROC:

Receiver operating characteristic

SD:

Standard deviation

PCV:

Polypoidal choroidal vasculopathy

RAP:

Retinal angiomatous proliferation

Grad-CAM:

Gradient-weighted class activation mapping

References

  1. Friedman DS, O’Colmain BJ, Muñoz B, Tomany SC, McCarty C, de Jong PT, Nemesure B, Mitchell P, Kempen J. Prevalence of age-related macular degeneration in the United States. Arch Ophthalmol. 2004;122(4):564–72. https://doi.org/10.1001/archopht.122.4.564.

    Article  PubMed  Google Scholar 

  2. Quartilho A, Simkiss P, Zekite A, Xing W, Wormald R, Bunce C. Leading causes of certifiable visual loss in England and Wales during the year ending 31 March 2013. Eye (Lond). 2016;30(4):602–7. https://doi.org/10.1038/eye.2015.288.

    Article  CAS  PubMed  Google Scholar 

  3. Rosenfeld PJ, Brown DM, Heier JS, Boyer DS, Kaiser PK, Chung CY, Kim RY. Ranibizumab for neovascular age-related macular degeneration. N Engl J Med. 2006;355(14):1419–31.

    Article  CAS  PubMed  Google Scholar 

  4. Regillo CD, Brown DM, Abraham P, Yue H, Ianchulev T, Schneider S, Shams N. Randomized, double-masked, sham-controlled trial of ranibizumab for neovascular age-related macular degeneration: PIER Study year 1. Am J Ophthalmol. 2008;145(2):239–48. https://doi.org/10.1016/j.ajo.2007.10.004.

    Article  CAS  PubMed  Google Scholar 

  5. Schmidt-Erfurth U, Eldem B, Guymer R, Korobelnik JF, Schlingemann RO, Axer-Siegel R, Wiedemann P, Simader C, Gekkieva M, Weichselberger A. Efficacy and safety of monthly versus quarterly ranibizumab treatment in neovascular age-related macular degeneration: the EXCITE study. Ophthalmology. 2011;118(5):831–9. https://doi.org/10.1016/j.ophtha.2010.09.004.

    Article  PubMed  Google Scholar 

  6. Kuroda Y, Yamashiro K, Miyake M, Yoshikawa M, Nakanishi H, Oishi A, Tamura H, Ooto S, Tsujikawa A, Yoshimura N. Factors associated with recurrence of age-related macular degeneration after anti-vascular endothelial growth factor treatment: a retrospective cohort study. Ophthalmology. 2015;122(11):2303–10. https://doi.org/10.1016/j.ophtha.2015.06.053.

    Article  PubMed  Google Scholar 

  7. Cho JH, Ryoo NK, Cho KH, Park SJ, Park KH, Woo SJ. Incidence rate of massive submacular hemorrhage and its risk factors in polypoidal choroidal vasculopathy. Am J Ophthalmol. 2016;169:79–88. https://doi.org/10.1016/j.ajo.2016.06.014.

    Article  PubMed  Google Scholar 

  8. Jaffe GJ, Ying GS, Toth CA, Daniel E, Grunwald JE, Martin DF, Maguire MG. Macular morphology and visual acuity in year five of the comparison of age-related macular degeneration treatments trials. Ophthalmology. 2019;126(2):252–60. https://doi.org/10.1016/j.ophtha.2018.08.035.

    Article  PubMed  Google Scholar 

  9. Kaiser PK, Wykoff CC, Singh RP, Khanani AM, Do DV, Patel H, Patel N. Retinal fluid and thickness as measures of disease activity in neovascular age-related macular degeneration. Retina. 2021;41(8):1579–86. https://doi.org/10.1097/iae.0000000000003194.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Simader C, Ritter M, Bolz M, Deák GG, Mayr-Sponer U, Golbaz I, Kundi M, Schmidt-Erfurth UM. Morphologic parameters relevant for visual outcome during anti-angiogenic therapy of neovascular age-related macular degeneration. Ophthalmology. 2014;121(6):1237–45. https://doi.org/10.1016/j.ophtha.2013.12.029.

    Article  PubMed  Google Scholar 

  11. Burlina PM, Joshi N, Pekala M, Pacheco KD, Freund DE, Bressler NM. Automated grading of age-related macular degeneration from color fundus images using deep convolutional neural networks. JAMA Ophthalmol. 2017;135(11):1170–6. https://doi.org/10.1001/jamaophthalmol.2017.3782.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Treder M, Lauermann JL, Eter N. Automated detection of exudative age-related macular degeneration in spectral domain optical coherence tomography using deep learning. Graefes Arch Clin Exp Ophthalmol. 2018;256(2):259–65. https://doi.org/10.1007/s00417-017-3850-3.

    Article  CAS  PubMed  Google Scholar 

  13. Hwang DK, Hsu CC, Chang KJ, Chao D, Sun CH, Jheng YC, Yarmishyn AA, Wu JC, Tsai CY, Wang ML, et al. Artificial intelligence-based decision-making for age-related macular degeneration. Theranostics. 2019;9(1):232–45. https://doi.org/10.7150/thno.28447.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Gutfleisch M, Ester O, Aydin S, Quassowski M, Spital G, Lommatzsch A, Rothaus K, Dubis AM, Pauleikhoff D. Clinically applicable deep learning-based decision aids for treatment of neovascular AMD. Graefes Arch Clin Exp Ophthalmol. 2022;260(7):2217–30. https://doi.org/10.1007/s00417-022-05565-1.

    Article  CAS  PubMed  Google Scholar 

  15. Bogunovic H, Waldstein SM, Schlegl T, Langs G, Sadeghipour A, Liu X, Gerendas BS, Osborne A, Schmidt-Erfurth U. Prediction of anti-VEGF treatment requirements in neovascular AMD using a machine learning approach. Invest Ophthalmol Vis Sci. 2017;58(7):3240–8. https://doi.org/10.1167/iovs.16-21053.

    Article  CAS  PubMed  Google Scholar 

  16. Gallardo M, Munk MR, Kurmann T, De Zanet S, Mosinska A, Karagoz IK, Zinkernagel MS, Wolf S, Sznitman R. Machine learning can predict anti-VEGF treatment demand in a treat-and-extend regimen for patients with neovascular AMD, DME, and RVO associated macular edema. Ophthalmol Retina. 2021;5(7):604–24. https://doi.org/10.1016/j.oret.2021.05.002.

    Article  PubMed  Google Scholar 

  17. Rohm M, Tresp V, Müller M, Kern C, Manakov I, Weiss M, Sim DA, Priglinger S, Keane PA, Kortuem K. Predicting visual acuity by using machine learning in patients treated for neovascular age-related macular degeneration. Ophthalmology. 2018;125(7):1028–36. https://doi.org/10.1016/j.ophtha.2017.12.034.

    Article  PubMed  Google Scholar 

  18. Fu DJ, Faes L, Wagner SK, Moraes G, Chopra R, Patel PJ, Balaskas K, Keenan TDL, Bachmann LM, Keane PA. Predicting incremental and future visual change in neovascular age-related macular degeneration using deep learning. Ophthalmol Retina. 2021;5(11):1074–84. https://doi.org/10.1016/j.oret.2021.01.009.

    Article  PubMed  Google Scholar 

  19. Abbas A, O’Byrne C, Fu DJ, Moraes G, Balaskas K, Struyven R, Beqiri S, Wagner SK, Korot E, Keane PA. Evaluating an automated machine learning model that predicts visual acuity outcomes in patients with neovascular age-related macular degeneration. Graefes Arch Clin Exp Ophthalmol. 2022;260(8):2461–73. https://doi.org/10.1007/s00417-021-05544-y.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Refaeilzadeh P, Tang L, Liu H. Cross-validation. Encyclopedia Database Syst. 2009;5:532–8.

    Article  Google Scholar 

  21. Melinščak M, Radmilović M, Vatavuk Z, Lončarić S. Annotated retinal optical coherence tomography images (AROI) database for joint retinal layer and fluid segmentation. Automatika. 2021;62(3–4):375–85.

    Article  Google Scholar 

  22. Ronneberger O, Fischer P, Brox T. U-net: Convolutional networks for biomedical image segmentation. In: Navab N, Hornegger J, Wells W, Frangi A, editors. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science, vol 9351. Cham: Springer; 2015. https://doi.org/10.1007/978-3-319-24574-4_28.

  23. Jadon S. A survey of loss functions for semantic segmentation. In: 2020 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB): 2020: IEEE; 2020. p. 1–7. https://arxiv.org/abs/2006.14822.

  24. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Proc IEEE Conf Comp Vis Pattern Recognit. 2016;2016:770–8.

    Google Scholar 

  25. Tercan H, Guajardo A, Heinisch J, Thiele T, Hopmann C, Meisen T. Transfer-learning: Bridging the gap between real and simulation data for machine learning in injection molding. Procedia Cirp. 2018;72:185–90.

    Article  Google Scholar 

  26. Mandrekar JN. Receiver operating characteristic curve in diagnostic test assessment. J Thorac Oncol. 2010;5(9):1315–6. https://doi.org/10.1097/JTO.0b013e3181ec173d.

    Article  PubMed  Google Scholar 

  27. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–45.

    Article  CAS  PubMed  Google Scholar 

  28. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-cam: Visual explanations from deep networks via gradient-based localization. Proc IEEE Int Conf Comput Vis. 2017;2017:618–26.

    Google Scholar 

  29. Schmidt-Erfurth U, Waldstein SM, Klimscha S, Sadeghipour A, Hu X, Gerendas BS, Osborne A, Bogunovic H. Prediction of individual disease conversion in early AMD using artificial intelligence. Invest Ophthalmol Vis Sci. 2018;59(8):3199–208. https://doi.org/10.1167/iovs.18-24106.

    Article  CAS  PubMed  Google Scholar 

  30. Ajana S, Cougnard-Grégoire A, Colijn JM, Merle BMJ, Verzijden T, de Jong P, Hofman A, Vingerling JR, Hejblum BP, Korobelnik JF, et al. Predicting progression to advanced age-related macular degeneration from clinical, genetic, and lifestyle factors using machine learning. Ophthalmology. 2021;128(4):587–97. https://doi.org/10.1016/j.ophtha.2020.08.031.

    Article  PubMed  Google Scholar 

  31. Pfau M, Sahu S, Rupnow RA, Romond K, Millet D, Holz FG, Schmitz-Valckenberg S, Fleckenstein M, Lim JI, de Sisternes L, et al. Probabilistic forecasting of anti-VEGF treatment frequency in neovascular age-related macular degeneration. Transl Vis Sci Technol. 2021;10(7):30. https://doi.org/10.1167/tvst.10.7.30.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Chandra RS, Ying GS. Evaluation of multiple machine learning models for predicting number of anti-VEGF injections in the Comparison of AMD Treatment Trials (CATT). Transl Vis Sci Technol. 2023;12(1):18. https://doi.org/10.1167/tvst.12.1.18.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Romo-Bucheli D, Erfurth US, Bogunovic H. End-to-End Deep learning model for predicting treatment requirements in neovascular AMD from longitudinal retinal OCT imaging. IEEE J Biomed Health Inform. 2020;24(12):3456–65. https://doi.org/10.1109/jbhi.2020.3000136.

    Article  PubMed  Google Scholar 

  34. Freund KB, Zweifel SA, Engelbert M. Do we need a new classification for choroidal neovascularization in age-related macular degeneration? Retina. 2010;30(9):1333–49. https://doi.org/10.1097/IAE.0b013e3181e7976b.

    Article  PubMed  Google Scholar 

  35. Nagiel A, Sarraf D, Sadda SR, Spaide RF, Jung JJ, Bhavsar KV, Ameri H, Querques G, Freund KB. Type 3 neovascularization: evolution, association with pigment epithelial detachment, and treatment response as revealed by spectral domain optical coherence tomography. Retina. 2015;35(4):638–47. https://doi.org/10.1097/iae.0000000000000488.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

This study was supported by a research grant of the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science and ICT (Information and Communication Technology, NRF-2021R1F1A1045417), and was also supported by a research grant from the Seoul National University Hospital Research Fund (0320222220). This study was also supported by Electronics and Telecommunications Research Institute (ETRI) grant funded by the Korean government (23ZR1100, A Study of Hyper-Connected Thinking Internet Technology by autonomous connecting, controlling and evolving ways). The sponsor or funding organization had no role in the design or conduct of this research.

Author information

Authors and Affiliations

Authors

Contributions

All authors have made substantive intellectual contributions to this manuscript. Design and conduct of the study (Y.G.K. and E.K.L.), collection and management of the data (B.J., S.Y.L., C.K., and U.C.P.), analysis and interpretation of the data (B.J., S.Y.L., Y.G.K., and E.K.L.), manuscript preparation (B.J., S.Y.L., Y.G.K., and E.K.L.), review and approval of the manuscript (Y.G.K. and E.K.L.). All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Young-Gon Kim or Eun Kyoung Lee.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Institutional Review Board of Seoul National University Hospital (IRB approval number: 2107–223-1239) and adhered to the tenets of the Declaration of Helsinki. Institutional Review Board of the Seoul National University Hospital waived the need for written informed consent from the participants, because of the study’s retrospective design.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Supplementary Figure 1.

Box plot of model prediction between the different groups of recurrence periods. Comparison of the model predicted confidence values for the different recurrence periods, which can be divided into less than two months, within two to four months and more than four months, was performed using an analysis of variance (ANOVA), and the p-value is presented above the box plots.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jang, B., Lee, SY., Kim, C. et al. Preliminary analysis of predicting the first recurrence in patients with neovascular age-related macular degeneration using deep learning. BMC Ophthalmol 23, 499 (2023). https://doi.org/10.1186/s12886-023-03229-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12886-023-03229-0

Keywords