Resources


Database Credentialed Access

ENCoDE, mEasuring skiN Color to correct pulse Oximetry DisparitiEs: skin tone and clinical data from a prospective trial on acute care patients.

Sicheng Hao, Katelyn Dempsey, João Matos, Mahmoud Alwakeel, Jared Houghtaling, An Kwok Wong

A prospective collected EHR-linked skin tone measurements database in OMOP format with emphasis on pulse oximetry disparities.

Published: Aug. 22, 2024. Version: 1.0.0


Database Credentialed Access

EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records

Yeonsu Kwon, Jiho Kim, Gyubok Lee, Seongsu Bae, Daeun Kyung, Wonchul Cha, Tom Pollard, Alistair Johnson, Edward Choi

Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records

Published: Aug. 19, 2024. Version: 1.0.0


Database Credentialed Access

ReXPref-Prior: A MIMIC-CXR Preference Dataset for Reducing Hallucinated Prior Exams in Radiology Report Generation

Oishi Banerjee, Hong-Yu Zhou, Subathra Adithan, Stephen Kwak, Kay Wu, Pranav Rajpurkar

We propose ReXPref-Prior, an adapted version of MIMIC-CXR where GPT-4 has removed references to prior exams from both findings and impression sections of chest X-ray reports.

chest x-rays reinforcement learning hallucination

Published: Aug. 14, 2024. Version: 1.0.0


Database Credentialed Access

A Brazilian Multilabel Ophthalmological Dataset (BRSET)

Luis Filipe Nakayama, Mariana Goncalves, Lucas Zago Ribeiro, Helen Santos, Daniel Ferraz, Fernando Malerbi, Leo Anthony Celi, Caio Regatieri

This is the first Brazilian Multilabel Ophthalmological Dataset with demographic information and retinal photos labeled images according to anatomical parameters, quality control, and presumed diagnosis.

dataset retina ophthalmology

Published: Aug. 14, 2024. Version: 1.0.1


Database Open Access

SHDB-AF: a Japanese Holter ECG database of atrial fibrillation

Kenta Tsutsui, Shany Biton Brimer, Joachim Behar

Holter ECG database from Japan, containing data from 100 unique patients with paroxysmal AF including expert annotations of Supraventricular arrhythmias at the beat level.

atrial fibrillation ecg holters

Published: Aug. 12, 2024. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

INSPIRE, a publicly available research dataset for perioperative medicine

Leerang Lim, Hyung-Chul Lee

A public dataset that contains information related to surgery, anesthesia, laboratory results, medications, diagnosis, and outcomes from 50% of the patients who received surgery at Seoul National University Hospital between 2011 and 2020.

surgery open dataset multi-center perioperative medicine

Published: Aug. 12, 2024. Version: 1.3


Database Credentialed Access

RadGraph2: Tracking Findings Over Time in Radiology Reports

Adam Dejl, Sameer Khanna, Patricia Therese Pile, Kibo Yoon, Steven QH Truong, Hanh Duong, Agustina Saenz, Pranav Rajpurkar

RadGraph2 is a dataset of 800 chest radiology reports annotated using a fine-grained entity-relationship schema, which captures key findings as well as mentions of changes that occurred in comparison with the previous radiology studies.

chest x-rays relation extraction disease progression information extraction radiology reports named entity recognition

Published: Aug. 8, 2024. Version: 1.0.0


Database Restricted Access

OpenOximetry Repository

Nicholas Fong, Michael Lipnick, Philip Bickler, John Feiner, Tyler Law

A repository of matched arterial oxygen and pulse oximeter readings obtained under controlled conditions, with high-frequency physiologic waveforms and skin color measurements.

Published: July 30, 2024. Version: 1.0.1


Database Credentialed Access

EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images

Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi

We present EHRXQA, the first multi-modal EHR QA dataset combining structured patient records with aligned chest X-ray images. EHRXQA contains a comprehensive set of QA pairs covering image-related, table-related, and image+table-related questions.

question answering benchmark evaluation visual question answering electronic health records multi-modal question answering deep learning ehr question answering semantic parsing machine learning chest x-ray

Published: July 23, 2024. Version: 1.0.0


Database Credentialed Access

MIMIC-CXR Database

Alistair Johnson, Tom Pollard, Roger Mark, Seth Berkowitz, Steven Horng

Chest radiographs in DICOM format with associated free-text reports.

computer vision chest x-rays natural language processing radiology machine learning mimic

Published: July 23, 2024. Version: 2.1.0