Resources


Database Restricted Access

Community-Acquired Pneumonia, Endotypes and Phenotypes (NACef): Prospective, observational cohort study of Translational Medicine

Luis Felipe Reyes, Natalia Sanabria, Esteban Garcia Gallo

Community-Acquired Pneumonia (CAP) poses a significant health risk, linked to high in-hospital morbidity and mortality rates. The dataset includes clinical details of 768 CAP patients at Clinica Universidad de La Sabana, Colombia.

Published: Jan. 21, 2025. Version: 1.0.0


Database Credentialed Access

INSPIRE, a publicly available research dataset for perioperative medicine

Leerang Lim, Hyung-Chul Lee

A public dataset that contains information related to surgery, anesthesia, laboratory results, medications, diagnosis, and outcomes from 50% of the patients who received surgery at Seoul National University Hospital between 2011 and 2020.

surgery open dataset multi-center perioperative medicine

Published: Aug. 12, 2024. Version: 1.3


Challenge Credentialed Access

BioNLP Workshop 2023 Shared Task 1A: Problem List Summarization

Yanjun Gao, Dmitriy Dligach, Timothy Miller, Majid Afshar

This is the data storage for BioNLP Workshop Shared Task 1A: Problem List Summarization.

bionlp clinical natural language processing electronic health record summarization

Published: Nov. 12, 2023. Version: 2.0.0


Database Open Access

A large scale 12-lead electrocardiogram database for arrhythmia study

Jianwei Zheng, Hangyuan Guo, Huimin Chu

A 12-lead electrocardiogram database for arrhythmia research covering more than 10,000 patients

Published: Aug. 24, 2022. Version: 1.0.0

Visualize waveforms

Database Credentialed Access

GOSSIS-1-eICU, the eICU-CRD subset of the Global Open Source Severity of Illness Score (GOSSIS-1) dataset

Jesse Raffa, Alistair Johnson, Tom Pollard, Omar Badawi

GOSSIS-1 is an in-hospital mortality prediction algorithm for critical care patients. GOSSIS-1 was trained using data from three countries. This dataset corresponds with the USA subset of the GOSSIS-1 dataset for the 2022 publication below.

icu critical care severity of illness global gossis apache mortality prediction benchmarking

Published: July 20, 2022. Version: 1.0.0


Database Credentialed Access

MIMIC-IV-ECG-Ext-ICD: Diagnostic labels for MIMIC-IV-ECG

Nils Strodthoff, Juan Miguel Lopez Alcaraz, Wilhelm Haverkamp

Dataset that links ECG records from MIMIC-IV-ECG to ED discharge and hospital discharge diagnoses, which enables to train general ECG prediction models based on clinical labels and facilitates the retrieval of further clinical metadata from MIMIC-IV.

machine learning electrocardiography mimic

Published: Aug. 30, 2024. Version: 1.0.1


Database Contributor Review

CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools

Eulalia Farre Maduell, Salvador Lima-Lopez, Santiago Andres Frid, Artur Conesa, Elisa Asensio, Antonio Lopez-Rueda, Helena Arino, Elena Calvo, Maria Jesús Bertran, Maria Angeles Marcos, Montserrat Nofre Maiz, Laura Tañá Velasco, Antonia Marti, Ricardo Farreres, Xavier Pastor, Xavier Borrat Frigola, Martin Krallinger

CARMEN-I is a Spanish corpus of 2,000 clinical records from Hospital Clínic, Barcelona. It covers COVID-19 patients and comorbidities, serving as a resource for training clinical NLP models and researchers in NLP applied to clinical documents.

de-identification clinical ner anonymization

Published: April 20, 2024. Version: 1.0.1


Database Credentialed Access

Nosocomial Risk Datasets from MIMIC-III

Travis Goodwin

Text-based Longitudinal Data for Predicting Nosocomial Disease Risk as used by CANTRIP.

pressure injury risk prediction acute kidney injury anemia forecasting natural language processing deep learning

Published: Sept. 15, 2022. Version: 1.0


Database Contributor Review

BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language

Henrique Dias, Ana Helena Dias Pereira dos Ulbrich

Brazilian clinical dataset containing over 70,000 admissions from 10 hospitals in two Brazilian states.

prescriptions exams tertiary care natural language processing clinical notes

Published: July 14, 2022. Version: 1.1


Database Credentialed Access

Maternal fat ultrasound measurement and nutritional assessment during pregnancy: A dataset centered in gestational outcomes

Alexandre da Silva Rocha, Juliana Rombaldi Bernardi, Alice Schoffel, Daniela Kretzer, Salete Matos, José Antônio Magalhães, Marcelo Goldani

Dataset collected as part of a prospective study in which abdominal maternal fat tissue measurements were compared with outcomes during hospitalization for labor and delivery.

pregnancy ultrasound abdominal

Published: Dec. 4, 2020. Version: 1.0.0