Database Credentialed Access
mBRSET, a Mobile Brazilian Retinal Dataset
Luis Filipe Nakayama , Lucas Zago Ribeiro , David Restrepo , Nathan Santos Barboza , Raul Dias Fiterman , Maria luiza Vieira Sousa , Alexandre Durao Alves Pereira , Caio Regatieri , Fernando Korn Malerbi , Rafael Andrade
Published: June 26, 2024. Version: 1.0
When using this resource, please cite:
(show more options)
Nakayama, L. F., Zago Ribeiro, L., Restrepo, D., Santos Barboza, N., Dias Fiterman, R., Vieira Sousa, M. l., Pereira, A. D. A., Regatieri, C., Malerbi, F. K., & Andrade, R. (2024). mBRSET, a Mobile Brazilian Retinal Dataset (version 1.0). PhysioNet. https://doi.org/10.13026/qxpd-1y65.
Please include the standard citation for PhysioNet:
(show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.
Abstract
The Mobile Brazilian Retinal Dataset (mBRSET) presents a pioneering collection of retinal images captured via portable cameras, encompassing a diverse range of ethnic backgrounds from Itabuna, Bahia, Brazil. Comprising 5164 images from 1291 patients diagnosed with diabetes, this dataset is augmented with clinical and demographic metadata. Its significance lies in its provision as a resource for the development and validation of algorithms tailored for portable retinal cameras, which are increasingly being deployed, particularly in low and middle-income countries. By providing a dataset of retinal fundus photos captured via portable cameras, mBRSET offers an opportunity to develop and validate computer vision models for diabetic retinopathy. This resource has the potential to contribute to advancements in medical imaging and diagnostic technologies.
Background
In ophthalmological practice, imaging serves as a pivotal tool for diagnosing and monitoring ocular conditions [1]. Retinal fundus photos specifically capture the ocular posterior segment, comprising the retina, optic disc, macula, and vessels, offering crucial information into ocular health during eye examinations conducted with specialized cameras.
The global prevalence of diabetes paints a concerning picture, with an estimated 540 million people affected worldwide, projected to escalate to 543 million by 2030 [2]. This epidemic afflicts approximately 10.5% of the adult population, with half unaware of their condition, and 75% residing in low- and middle-income countries [2]. Among the myriad complications associated with diabetes, diabetic retinopathy emerges as a primary concern, impacting about one-third of individuals diagnosed with diabetes. Left untreated, this condition can progress to sight-threatening levels in roughly 10% of affected patients [3].
The utilization of retinal fundus photos and telemedicine presents a promising avenue for remote diabetic retinopathy screening. Innovative approaches such as smartphone-based retinal imaging and teleophthalmology [4,5] hold potential to enhance coverage rates of diabetic retinopathy screening programs. Coupled with timely treatment, such strategies play a pivotal role in preventing visual impairment [4,6]. Traditionally, imaging has relied on standard tabletop fundus cameras, following the guidelines set forth in the Early Treatment Diabetic Retinopathy Study [7,8]. However, this method poses barriers to patient access due to its cumbersome nature, spatial requirements, complex acquisition process, and substantial capital investment, thereby limiting its widespread adoption [9,10].
The advent of smaller, portable devices heralds a more accessible and cost-effective screening process, particularly benefiting low-resource settings and underserved populations [6,9]. Additionally, artificial intelligence (AI) algorithms hold promise in improving medical care by facilitating screening, diagnosis, and monitoring, especially in resource-limited contexts. However, concerns persist regarding the fairness and accuracy of AI algorithms, stemming from non-representative data and biased models.
In low- and middle-income countries (LMICs), the shortage and irregular distribution of ophthalmologists relative to the population underscores the importance of affordable screening programs, which are increasingly being deployed using portable retinal cameras [5,11–13]. However, the existing datasets fail to adequately represent this new imaging modality [14].
mBRSET is the first publicly available diabetic retinopathy dataset comprised of images captured with handheld retina cameras in real-life, high-burden scenarios. This dataset encompasses individuals from diverse ethnic backgrounds, offering a comprehensive resource for addressing the challenges of diabetic retinopathy screening and management.
Methods
This study was approved by the institutional review board and included retinal fundus photos, clinical, and demographic data. Identifiable protected health information was removed from images and tabular data in accordance with the Health Insurance Portability and Accountability Act (HIPAA) and the Lei Geral de Proteção de Dados (LGPD) guidelines.
Data sources
We included data from patients who attended the Itabuna Diabetes Campaign, which took place in November 2022 in Itabuna, Bahia State, Northeast Brazil. Bahia State is a region with miscegenation. Nearly half of its population has European ancestry, around 40% trace their African ancestry, and 10% carry Native American ancestry [19]. This blend of ancestries creates a dynamic and multifaceted society, reflecting the unique fusion of cultures that defines Bahia. This annual event mobilizes many of the city’s inhabitants and involves diabetes awareness, counseling, screening, and treatment of diabetes complications [20]. After signing informed consent, participants answered a questionnaire for demographic and self-reported clinical characteristics and underwent ocular imaging.
Data collection
Images present in this dataset were collected using the Phelcom Eyer (Phelcom Technologies, Sao Carlos, Brazil), a portable, handheld, smartphone-based retinal fundus camera. Integrated into a Samsung Galaxy S10 running Android 11, the Eyer camera captures retinal images at a 45-degree angle, utilizing a high-resolution 12-megapixel sensor to produce images with dimensions of 1600x1600 pixels. The camera features an autofocus control with a range of -20 to +20 diopters. Trained non-medical personnel proficient in operating the Eyer camera captured the retinal photos under pharmacological mydriasis. Additionally, demographic information and relevant medical features were gathered through patient interviews.
Dataset preparation
The file identification was removed from all fundus photos, as well as sensitive data (e.g., patient name). Every image was reviewed to ensure the absence of protected health information in images. The images were exported from the Eyer cloud system in JPEG format, and no preprocessing techniques were performed. The image viewpoint can be macula-centered or optic disc-centered. The dataset does not include fluorescein angiogram photos and non-retinal images.
Labeling
Two ophthalmologists, experts in retina and vitreous, meticulously labeled all the images according to predefined criteria set by the research group [21]. In cases of discordance in diabetic retinopathy grading, a third senior specialist provided adjudication. Kappa and weighted kappa statistics were employed to assess the agreement between graders, ensuring a robust evaluation of inter-rater reliability.
The following characteristics were labeled:
- Quality control parameters: The evaluation of parameters encompasses the detection of artifacts, which encompass various issues such as image focus, illumination discrepancies, and the presence of dust. Additionally, each image is categorized based on its quality, thus determining whether it is deemed satisfactory or unsatisfactory for clinical assessment.
- Diabetic retinopathy classification: Diabetic retinopathy and diabetic macular edema were classified using the International Clinic Diabetic Retinopathy (ICDR) grading [22].
- 0 - Normal: No abnormalities
- 1 - Mild non-proliferative DR: Microaneurysms only
- 2 - Moderate non-proliferative DR: More than just microaneurysms but less than severe non-proliferative diabetic retinopathy
- 3 - Severe non-proliferative DR: Any of the following: > 20 intra-retinal hemorrhages in each of 4 quadrants, definite venous beading in ≥2 quadrants, prominent intraretinal microvascular abnormalities in ≥1 quadrant, or no signs of proliferative retinopathy
- 4 - Proliferative DR: One or more of the following: neovascularization and/or vitreous or preretinal hemorrhages and/or panphotocoagulation scars
- Macular edema: Exudates or apparent thickening within one disc diameter from the fovea
Data Description
The dataset comprises of a comma-delimited metadata file containing labels (labels_mbrset.csv
), along with an associated collection of images.
Metadata file
The metadata file (labels_mbrset.csv
) includes the following columns:
patient
: patient identifier.age
: patient age in years.sex
: 0 for female and 1 for male.dm_time
: diabetes diagnosis time in years.insulin
: self-referred use of insulin - 0 for no and 1 for yes.insulin_time
: insulin use time in years.oraltreatment_dm
: self-referred use of oral drugs for diabetes - 0 for no and 1 for yessystemic_hypertension
: self-referred diagnosis of systemic arterial hypertension 0 for no and 1 for yesinsurance
: 0 for no and 1 for yeseducational_level
: higher educational level1
: Illiterate2
: Incomplete Primary3
: Complete Primary4
: Incomplete Secondary5
: Complete Secondary6
: Incomplete Tertiary7
: Complete Tertiary
alcohol_consumption
: self-referred regular alcohol consumption 0 for no and 1 for yessmoking
: self-referred active smoking - 0 for no and 1 for yesobesity
: self-referred obesity diagnosis - 0 for no and 1 for yesvascular_disease
: self-referred diabetes-related vascular disease - 0 for no and 1 for yesacute_myocardial_infarction
: self-referred previous acute myocardial infarction - 0 for no and 1 for yesnephropathy
: self-referred diabetes-related nephropathy - 0 for no and 1 for yesneuropathy
: self-referred diabetes-related neuropathy - 0 for no and 1 for yesdiabetic_foot
: 0 for no and 1 for yesfile
: image identifierlaterality
: retinal fundus photo lateralityfinal_artifacts
: presence of artifacts - yes or nofinal_quality
: final quality assessment - yes or nofinal_icdr
: ICDR score- 0 No retinopathy.
- 1 Mild non-proliferative diabetic retinopathy.
- 2 Moderate non-proliferative diabetic retinopathy.
- 3 Severe non-proliferative diabetic retinopathy.
- 4 Proliferative diabetic retinopathy and post-laser status.
final_edema
: macular edema - yes or no
Patient level analysis
The dataset contains 5164 images from 1291 patients. The sex distribution is 451 (34.93%) male and 840 (65.06%) female patients. The average age is 61.44 years (SD 11.63). (Table 1)
Missing |
Overall |
|||||
n |
|
1291 |
||||
sex, n (%) |
Female |
0 |
840 (65.1) |
|||
Male |
451 (34.9) |
|||||
insurance, n (%) |
No |
12 |
1180 (92.3) |
|||
Yes |
99 (7.7) |
|||||
educational_level, n (%) |
Illiterate (1) |
13 |
176 (13.8) |
|||
Incomplete Primary (2) |
539 (42.2) |
|||||
Complete Primary (3) |
116 (9.1) |
|||||
Incomplete Secondary (4) |
120 (9.4) |
|||||
Complete Secondary (5) |
247 (19.3) |
|||||
Incomplete Tertiary (6) |
21 (1.6) |
|||||
Complete Tertiary (7) |
59 (4.6) |
|||||
age, mean (SD) |
|
0 |
61.4 (11.6) |
The average duration of diabetes is 9.53 years (SD 8.64), with 1083 patients (84.61%) undergoing oral treatment and 266 patients (20.79%) receiving insulin therapy. (Table 2)
Missing |
Overall |
||
n |
|
1291 |
|
systemic_hypertension, n (%) |
No |
11 |
366 (28.6) |
Yes |
914 (71.4) |
||
alcohol_consumption, n (%) |
No |
19 |
1094 (86.0) |
Yes |
178 (14.0) |
||
smoking, n (%) |
No |
22 |
1188 (93.6) |
Yes |
81 (6.4) |
||
obesity, n (%) |
No |
19 |
1170 (92.0) |
Yes |
102 (8.0) |
||
vascular_disease, n (%) |
No |
19 |
1055 (82.9) |
Yes |
217 (17.1) |
||
acute_myocardial_infarction, n (%) |
No |
21 |
1172 (92.3) |
Yes |
98 (7.7) |
||
nephropathy, n (%) |
No |
20 |
1225 (96.4) |
Yes |
46 (3.6) |
||
neuropathy, n (%) |
No |
19 |
1217 (95.7) |
Yes |
55 (4.3) |
||
diabetic_foot, n (%) |
No |
27 |
1091 (86.3) |
Yes |
173 (13.7) |
Image level analysis
In terms of the image findings, among the total dataset, 3759 images (76.79%) revealed no indications of diabetic retinopathy (DR). Mild non-proliferative DR was observed in 272 images (5.56%), while moderate non-proliferative DR was present in 570 images (11.64%). Additionally, 82 images (1.67%) of severe non-proliferative DR, and 212 images (4.33%) of proliferative DR. Furthermore, macular edema was identified in 427 images (8.69%). (Table 3)
Regarding inter-grader agreement, the Weighted Kappa for the ICDR score was 0.863, indicating near-perfect agreement, while for macular edema was 0.618, representing substantial agreement.
Missing |
Overall |
||
n |
|
5164 |
|
laterality, n (%) |
Left |
0 |
2584 (50.0) |
Right |
2580 (50.0) |
||
final_icdr, n (%) |
0.0 |
281 |
3750 (76.8) |
1.0 |
272 (5.6) |
||
2.0 |
568 (11.6) |
||
3.0 |
82 (1.7) |
||
4.0 |
212 (4.3) |
||
final_edema, n (%) |
No |
266 |
4472 (91.3) |
Yes |
427 (8.7) |
||
referable_dr, n (%) |
No |
0 |
4167 (80.7) |
Yes |
997 (19.3) |
Regarding the quality distribution, 4283 (82.7%) of the images have some artifacts, and 4833 (94.3%) have good quality that enables image assessment.
Usage Notes
This is the first mobile camera retinal fungus dataset, and future releases may increase the dataset size and provide race information. Our objective is to reduce the under-represented countries in the ophthalmology dataset pool and enable validation studies using this modality.
All the codes used in this paper for the dataset setup, data analysis, and experiments are found in a GitHub repository at https://github.com/luisnakayama/mBRSET [23]. Best practice guidelines should be followed when analyzing the data, and we incentivize sharing codes and models to promote reproducibility.
Release Notes
v1.0: First public release. In future releases, we plan to expand the dataset with more images and patients from screening programs and add self-declared races.
Ethics
This study was approved by the INSTITUTO DE ENSINO SUPERIOR PRESIDENTE TANCREDO DE ALMEIDA NEVES (IPTAN) institutional review board (CAAE 64219922.3.0000.9667). All participants provided written informed consent, and the sharing of data was authorized by both patients and institutions.
Conflicts of Interest
The authors declare no conflicts of interest to declare.
References
- Nakayama LF, Matos J, Quion J, Novaes F, Mitchell WG, Mwavu R, et al. Unmasking biases and navigating pitfalls in the ophthalmic Artificial Intelligence lifecycle: A review. 2023. doi:10.48550/ARXIV.2310.04997
- Sun H, Saeedi P, Karuranga S, Pinkepank M, Ogurtsova K, Duncan BB, et al. IDF Diabetes Atlas: Global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res Clin Pract. 2022;183: 109119. doi:10.1016/j.diabres.2021.109119
- Sivaprasad S, Pearce E. The unmet need for better risk stratification of non-proliferative diabetic retinopathy. Diabet Med. 2019;36: 424–433. doi:10.1111/dme.13868
- Lim G, Bellemo V, Xie Y, Lee XQ, Yip MYT, Ting DSW. Different fundus imaging modalities and technical factors in AI screening for diabetic retinopathy: a review. Eye Vis (Lond). 2020;7: 21. doi:10.1186/s40662-020-00182-7
- Sengupta S, Sindal MD, Besirli CG, Upadhyaya S, Venkatesh R, Niziol LM, et al. Screening for vision-threatening diabetic retinopathy in South India: comparing portable non-mydriatic and standard fundus cameras and clinical exam. Eye . 2018;32: 375–383. doi:10.1038/eye.2017.199
- Salongcay RP, Aquino LAC, Salva CMG, Saunar AV, Alog GP, Sun JK, et al. Comparison of Handheld Retinal Imaging with ETDRS 7-Standard Field Photography for Diabetic Retinopathy and Diabetic Macular Edema. Ophthalmol Retina. 2022;6: 548–556. doi:10.1016/j.oret.2022.03.002
- Vujosevic S, Benetti E, Massignan F, Pilotto E, Varano M, Cavarzeran F, et al. Screening for diabetic retinopathy: 1 and 3 nonmydriatic 45-degree digital fundus photographs vs 7 standard early treatment diabetic retinopathy study fields. Am J Ophthalmol. 2009;148: 111–118. doi:10.1016/j.ajo.2009.02.031
- Vujosevic S, Aldington SJ, Silva P, Hernández C, Scanlon P, Peto T, et al. Screening for diabetic retinopathy: new perspectives and challenges. Lancet Diabetes Endocrinol. 2020;8: 337–347. doi:10.1016/S2213-8587(19)30411-5
- Piyasena MMPN, Yip JLY, MacLeod D, Kim M, Gudlavalleti VSM. Diagnostic test accuracy of diabetic retinopathy screening by physician graders using a hand-held non-mydriatic retinal camera at a tertiary level medical clinic. BMC Ophthalmol. 2019;19: 89. doi:10.1186/s12886-019-1092-3
- Midena E, Zennaro L, Lapo C, Torresin T, Midena G, Pilotto E, et al. Handheld Fundus Camera for Diabetic Retinopathy Screening: A Comparison Study with Table-Top Fundus Camera in Real-Life Setting. J Clin Med Res. 2022;11. doi:10.3390/jcm11092352
- Malerbi FK, Andrade RE. Real-World diabetic retinopathy screening with a handheld fundus camera in a high-burden setting. Acta Ophthalmol. 2022;100: e1771. doi:10.1111/aos.15170
- Jin K, Lu H, Su Z, Cheng C, Ye J, Qian D. Telemedicine screening of retinal diseases with a handheld portable non-mydriatic fundus camera. BMC Ophthalmol. 2017;17: 89. doi:10.1186/s12886-017-0484-5
- Caceres J, Zhang Y, Boe L, Zhou Y, Besirli C, Paulus YM, et al. Diabetic Retinopathy Screening Using a Portable Retinal Camera in Vanuatu. Clin Ophthalmol. 2023;17: 2919–2927. doi:10.2147/OPTH.S410425
- Khan SM, Liu X, Nath S, Korot E, Faes L, Wagner SK, et al. A global review of publicly available datasets for ophthalmological imaging: barriers to access, usability, and generalisability. Lancet Digit Health. 2021;3: e51–e66. doi:10.1016/s2589-7500(20)30240-5
- Nakayama LF, Goncalves M, Zago Ribeiro L, Santos H, Ferraz D, Malerbi F, et al. A Brazilian Multilabel Ophthalmological Dataset (BRSET). PhysioNet; 2023. doi:10.13026/XCXW-8198
- Cuadros J, Bresnick G. EyePACS: an adaptable telemedicine system for diabetic retinopathy screening. J Diabetes Sci Technol. 2009;3: 509–516. doi:10.1177/193229680900300315
- Decencièreb E, Zhang X, Cazuguel G, Laÿ B, Cochener B, Trone C, et al. FEEDBACK ON A PUBLICLY DISTRIBUTED IMAGE DATABASE: THE MESSIDOR DATABASE. doi:10.5566/ias.1155
- Pachade S, Porwal P, Thulkar D, Kokare M, Deshmukh G, Sahasrabuddhe V, et al. Retinal Fundus Multi-Disease Image Dataset (RFMiD): A Dataset for Multi-Disease Detection Research. Brown Univ Dig Addict Theory Appl. 2021;6: 14. doi:10.3390/data6020014
- Souza AM de, Resende SS, Sousa TN de, Brito CFA de. A systematic scoping review of the genetic ancestry of the Brazilian population. Genet Mol Biol. 2019;42: 495–508. doi:10.1590/1678-4685-GMB-2018-0076
- Malerbi FK, Andrade RE, Morales PH, Stuchi JA, Lencione D, de Paulo JV, et al. Diabetic Retinopathy Screening Using Artificial Intelligence and Handheld Smartphone-Based Retinal Camera. J Diabetes Sci Technol. 2021; 1932296820985567. doi:10.1177/1932296820985567
- Nakayama LF, Gonçalves MB, Ribeiro LZ, Malerbi FK, Regatieri CVS. Diabetic Retinopathy Labeling Protocol for the Brazilian Multilabel Ophthalmological Dataset. 2023. doi:10.31219/osf.io/puznm
- Wilkinson CP, Ferris FL 3rd, Klein RE, Lee PP, Agardh CD, Davis M, et al. Proposed international clinical diabetic retinopathy and diabetic macular edema disease severity scales. Ophthalmology. 2003;110: 1677–1682. doi:10.1016/S0161-6420(03)00475-5
- Nakayama L. mBRSET: Repository of the Mobile Brazilian Retinal Dataset (mBRSET). Github; Available: https://github.com/luisnakayama/mBRSET
Parent Projects
Access
Access Policy:
Only credentialed users who sign the DUA can access the files.
License (for files):
PhysioNet Credentialed Health Data License 1.5.0
Data Use Agreement:
PhysioNet Credentialed Health Data Use Agreement 1.5.0
Required training:
CITI Data or Specimens Only Research
Discovery
DOI (version 1.0):
https://doi.org/10.13026/qxpd-1y65
DOI (latest version):
https://doi.org/10.13026/m10j-dj56
Topics:
retina
ophthalmology
Corresponding Author
Files
- be a credentialed user
- complete required training:
- CITI Data or Specimens Only Research You may submit your training here.
- sign the data use agreement for the project