Resources


Database Credentialed Access

AMR-UTI: Antimicrobial Resistance in Urinary Tract Infections

Michael Oberst, Soorajnath Boominathan, Helen Zhou, Sanjat Kanjilal, David Sontag

AMR-UTI is a freely accessible dataset derived from electronic health record (EHR) information on over 100,000 urinary tract infections (UTIs) treated at Massachusetts General Hospital and Brigham & Women's Hospital in Boston, MA, USA.

antibiotic resistance, causal inference, policy learning, antimicrobial resistance, urinary tract infection, clinical decision support, machine learning

Published: Nov. 4, 2020. Version: 1.0.0


Database Credentialed Access

MIMIC-III - SequenceExamples for TensorFlow modeling

Jonas Kemp, Kun Zhang, Andrew Dai

MIMIC-III data converted into TensorFlow SequenceExample format, for use in modeling pipelines.

tensorflow, sequence modeling, deep learning, machine learning

Published: Sept. 29, 2020. Version: 1.0.0


Database Credentialed Access

Phenotype Annotations for Patient Notes in the MIMIC-III Database

Edward Moseley, Leo Anthony Celi, Joy Wu, Franck Dernoncourt

Clinical notes, annotated by at least two expert annotators for over ten patient phenotypes, including advanced cancer, substance abuse, and treatment non-adherence.

patient classification, natural language processing

Published: March 5, 2020. Version: 1.20.03


Model Credentialed Access

What's in a Note? Unpacking Predictive Value in Clinical Note Representations

Tristan Naumann, William Boag

Word vectors corresponding to the AMIA 2018 Informatics Summit paper of the same name.

Published: Jan. 7, 2018. Version: 0.1


Database Open Access

Clinical data from the MIMIC-II database for a case study on indwelling arterial catheters

Jesse Raffa

Dataset extracted from MIMIC-II for a tutorial on the effect of indwelling arterial catheters on mortality in hemodynamically stable patients with respiratory failure.

Published: Oct. 28, 2016. Version: 1.0


Database Credentialed Access

MIMIC-Ext-MIMIC-CXR-VQA: A Complex, Diverse, And Large-Scale Visual Question Answering Dataset for Chest X-ray Images

Seongsu Bae, Daeun Kyung, Jaehee Ryu, Eunbyeol Cho, Gyubok Lee, Sunjun Kweon, Jungwoo Oh, Lei JI, Eric Chang, Tackeun Kim, Edward Choi

We introduce MIMIC-Ext-MIMIC-CXR-VQA, a complex, diverse, and large-scale dataset designed for Visual Question Answering (VQA) tasks within the medical domain, focusing primarily on chest radiographs.

question answering, benchmark, radiology, evaluation, visual question answering, deep learning, electronic health records, machine learning, multimodal, chest x-ray

Published: July 19, 2024. Version: 1.0.0


Database Credentialed Access

MIMIC-IV-ED

Alistair Johnson, Lucas Bulgarelli, Tom Pollard, Leo Anthony Celi, Roger Mark, Steven Horng

A large database of emergency department admissions.

emergency department, ed, emergency, mimic-iv, electronic health record, mimic

Published: Jan. 5, 2023. Version: 2.2


Database Credentialed Access

MIMIC-IV-Note: Deidentified free-text clinical notes

Alistair Johnson, Tom Pollard, Steven Horng, Leo Anthony Celi, Roger Mark

Deidentified free-text clinical notes for patients in the MIMIC-IV Clinical Database.

deidentification, critical care, natural language processing, electronic health record, clinical notes, mimic

Published: Jan. 6, 2023. Version: 2.2


Database Credentialed Access

MIMIC-IV on FHIR

Alex Bennett, Joshua Wiedekopf, Hannes Ulrich, Philip van Damme, Piotr Szul, John Grimes, Alistair Johnson

MIMIC-IV and MIMIC-IV-ED data mapped into FHIR resources.

mimic-iv, fhir, electronic health record, us core, fast healthcare interoperability resources, mimic

Published: Nov. 12, 2024. Version: 2.1


Database Credentialed Access

CORAL: expert-Curated medical Oncology Reports to Advance Language model inference

Madhumita Sushil, Vanessa Kennedy, Divneet Mandair, Brenda Miao, Travis Zack, Atul Butte

Medical oncology progress notes annotated with advanced, comprehensive oncology-relevant concepts and relationships.

information extraction, artificial intelligence, oncology, natural language processing, large language models, electronic health records

Published: Feb. 7, 2024. Version: 1.0