Database Open Access

MIMIC-III Waveform Database

Benjamin Moody George Moody Mauricio Villarroel Gari D. Clifford Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite: (show more options)
Moody, B., Moody, G., Villarroel, M., Clifford, G. D., & Silva, I. (2020). MIMIC-III Waveform Database (version 1.0). PhysioNet. https://doi.org/10.13026/c2607m.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database contains 67,830 record sets for approximately 30,000 ICU patients. Almost all record sets include a waveform record containing digitized signals (typically including ECG, ABP, respiration, and PPG, and frequently other signals) and a “numerics” record containing time series of periodic measurements, each presenting a quasi-continuous recording of vital signs of a single patient throughout an ICU stay (typically a few days, but many are several weeks in duration). A subset of this database contains waveform and numerics records that have been matched and time-aligned with MIMIC-III Clinical Database records.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

The MIMIC-III Waveform Database is a companion to the MIMIC-III Clinical Database, which contains detailed clinical information about most of the patients represented in the Waveform Database [1]. Since the contents of each database were collected independently, in partially deidentified form, matching the clinical data with the waveform data is a non-trivial task, and only a subset of Waveform Database records has been matched with Clinical Database records. See the MIMIC-III Waveform Database Matched Subset for more information.


Methods

Unlike the original MIMIC Database, waveforms were collected in a largely automated fashion, from all of the bedside monitors in certain adult and neonatal ICUs. Not all of the ICUs in the hospital were included, and the data archiving process did not run continuously, but while it was running, all waveforms from those ICUs were captured and archived. As a result, these records represent a random sample of patients in those specific ICUs.

Recorded waveforms and numerics vary depending on choices made by the ICU staff. Waveforms almost always include one or more ECG signals, and often include continuous arterial blood pressure (ABP) waveforms, fingertip photoplethysmogram (PPG) signals, and respiration, with additional waveforms (up to 8 simultaneously) as available. Numerics typically include heart and respiration rates, SpO2, and systolic, mean, and diastolic blood pressure, together with others as available. Recording lengths also vary; most are a few days in duration, but some are shorter and others are several weeks long.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

Each recording comprises two records (a waveform record and a matching numerics record) in a single record directory (“folder”) with the name of the record. To reduce access time, the record directories have been distributed among ten intermediate-level directories (listed below). The names of these intermediate directories (30, 31, ..., 39) match the first two digits of the record directories they contain.

In almost all cases, the waveform records comprise multiple segments, each of which can be read as a separate record. Each segment contains an uninterrupted recording of a set of simultaneously observed signals, and the signal gains do not change at any time during the segment. Whenever the ICU staff changed the signals being monitored or adjusted the amplitude of a signal being monitored, this event was recorded in the raw data dump, and a new segment begins at that time.

Each composite waveform record includes a list of the segments that comprise it in its master header file. The list begins on the second line of the master header with a layout header file that specifies all of the signals that are observed in any segment belonging to the record. Each segment has its own header file and (except for the layout header) a matching (binary) signal (.dat) file. Occasionally, the monitor may be disconnected entirely for a short time; these intervals are recorded as gaps in the master header file, but there are no header or signal files corresponding to gaps.

The numerics records (designated by the letter n appended to the record name) are not divided into segments, since the storage savings that would be achieved by doing so would be relatively little.

Physiologic waveform records in this database contain up to eight simultaneously recorded signals digitized at 125 Hz with 8-, 10-, or (occasionally) 12-bit resolution. Numerics records typically contain 10 or more time series of vital signs sampled once per second or once per minute.

Technical Limitations

Waveforms or numerics missing:
Occasionally, technical limitations of the data acquisition system make it possible to create a physiologic waveform record but not a numerics record, or vice versa.
A given signal may not be available throughout an entire record:
Records in the MIMIC-III Waveform Database vary in length; some are several weeks in duration. It is common for the physiologic signals to be interrupted or changed occasionally during recordings of such long duration. When using a viewer such as LightWAVE, all signals available at any time during a record are listed, although in most cases only a subset is visible at any given time.
Gaps and patient identification:
The waveform and numerics records have been extracted from raw data dumps collected from the bedside monitors using a facility provided by the monitor manufacturer. The raw data dumps contain files of data collected from a single patient monitor during a single monitoring session (which may last days or weeks). Usually the monitoring session ends when the patient is discharged, so that the data in a single file come from a single patient. Occasionally, however, the monitor is not reset when the patient is discharged, and the session continues after a new patient has been admitted; in this case the raw data file contains data from two (or more) patients, with a gap (an interval during which no waveforms or numerics are recorded) that is typically an hour or more in duration. Such gaps may also appear if the monitor is temporarily disconnected (for example, for a laboratory test) and then reconnected to the same patient. Since the raw data files do not usually contain patient identifiers, it is not trivial to determine with certainty if the data before and after a gap were collected from the same patient.
Ideally, each MIMIC-III Waveform Database record should contain data from only one patient. All raw data files containing gaps of an hour or more have been split into separate records in order to decrease the likelihood that any record contains data from multiple patients. An ongoing project is to examine the sets of records created this way, matching them with MIMIC-III Clinical Database records when possible, to determine if and how they should be reassembled.
Inter-waveform alignment problems:
The method used for MIMIC waveform data extraction was not designed for inter-waveform analysis. The waveform data contain unspecified/unknown filtering delays and/or unknown inter-channel delays, which may not be constant in a given record. Therefore, although the ECGs are time-aligned with each other, there may be a (changing) delay of up to 500ms between any of the other waveforms in the data. For example, the pulse transit time measured between different waveforms may be unreliable (either in absolute or relative terms).
ECG limitations:
The ECG signals in the waveform records were originally sampled with 12-bit precision at a high sampling rate, and were then scaled and decimated to 500 samples per second (per signal). The scaling reduced the effective amplitude resolution from 12 bits to 9 or 10 bits in typical cases, and as little as 7 bits in some cases. From each set of 4 consecutive decimated samples of the same ECG signal, one was recorded (chosen using a turning-point compressor, a technique sometimes called “peak-picking”). The result is an ECG signal sampled 125 times per second, but at intervals that vary between 2 and 14 ms (averaging 8 ms). Since the interval between any given pair of samples was not available to us, the reconstructions of the ECG signals assume uniform 8 ms intervals. These signals with reduced time and amplitude resolution, and sampling jitter introduced by the “peak-picking”, were the only ECG signals that were possible to capture from the ICU monitors. Although ECGs reconstructed in this way can be readily interpreted visually, they are unsuitable as input for certain algorithms for ECG analysis, particularly those that are sensitive to frequency-domain features of the signal. Note that these limitations apply only to the ECG signals, not to the other signals, which were originally sampled at uniform 8 ms intervals (125 samples per second) and were not scaled prior to capture.

Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory 31 contains all records with names that begin with 31.
  • Record directory 3141595 is contained within intermediate directory 31.
  • All files associated with physiologic waveform record 3141595 and its companion numerics record 3141595n are contained within record directory 31/3141595.
    • The first line of the master header file for waveform record 314595 (31/3141595/3141595.hea) indicates that the record is 242353557 sample intervals (about 22 days at 125 samples per second) in duration, and that it contains 427 segments and gaps. (See header(5) in the WFDB Applications Guide for details on the format of this text file.) The first segment is named 3141595_0001, and it is 2888500 sample intervals (6 hours, 15 minutes, and 8 seconds, at 125 samples per second) in duration. At the end of the master header file, a comment (# Location: nicu) specifies the ICU in which the recording was made (the neonatal ICU, in this case).
    • The layout header file for this record (31/3141595/3141595_layout.hea) indicates that five ECG signals (I, II, III, AVR, and “V”), a respiration signal, and a PPG signal are available during portions of the record. (The five ECG signals are not all available simultaneously.)
    • The header file for the first segment of this record (31/3141595/3141595_0001.hea) shows that a PPG signal (“PLETH”), a respiration signal, and ECG leads II and AVR are available throughout this initial segment.
  • The matching numerics record is named 3141595n, and its header file (31/3141595/3141595n.hea) shows that it is 1938730 sample intervals (about 22 days at 1 sample per second) in duration, and that it contains heart rate (“HR”, which is measured from the ECG, as well as “PULSE”, measured from one or more pulsatile signals), noninvasive blood pressure (raw as well as systolic, diastolic, and mean), respiration rate, and SpO2.

Any WFDB application can read any waveform record from this database directly from the PhysioNet web server (i.e., without downloading the record first) using a record name of the form mimic3wdb/3x/3xyyyyy/. Numerics records can be read using the longer form mimic3wdb/3x/3xyyyyy/3xyyyyyn (note that the final 3xyyyyy must be repeated and followed by n to specify the numerics record).

For example, if you have installed the WFDB Software Package, you can read the first 10 seconds of waveform record 3141595 using this rdsamp command:

rdsamp -r mimic3wdb/31/3141595/ -p -v -t 10

To read the first 10 seconds of the matching numerics record 3141595n, use this command instead:

rdsamp -r mimic3wdb/31/3141595/3141595n -p -v -t 10

Notice that the first command produces 1250 samples of each waveform (125 samples per second, for 10 seconds), but the second command produces only 10 samples of each vital sign (1 sample per second, for 10 seconds).


Release Notes

Version 1.0 of the MIMIC-III Waveform Database supersedes previously-released versions of the MIMIC-II Waveform Database. The numbered records (3000003 to 3999988) are identical to those in version 3.2 of the MIMIC-II Waveform Database. The Matched Subset, however, uses different subject IDs and surrogate dates, corresponding to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interests to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Parent Projects
MIMIC-III Waveform Database was derived from: Please cite them when using this project.
Share
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Discovery

DOI (version 1.0):
https://doi.org/10.13026/c2607m

DOI (latest version):
https://doi.org/10.13026/gs83-bd50

Corresponding Author
You must be logged in to view the contact information.

Files

Total uncompressed size: 6.7 TB.

Access the files

Visualize waveforms

Folder Navigation: <base>/matched/p05
Name Size Modified
Parent Directory
p050004
p050006
p050015
p050026
p050034
p050041
p050050
p050074
p050079
p050080
p050089
p050093
p050094
p050100
p050110
p050113
p050130
p050136
p050140
p050141
p050151
p050161
p050170
p050174
p050178
p050182
p050189
p050191
p050197
p050201
p050212
p050217
p050237
p050259
p050273
p050289
p050302
p050315
p050321
p050334
p050336
p050337
p050353
p050358
p050370
p050384
p050385
p050387
p050417
p050424
p050440
p050445
p050447
p050450
p050476
p050479
p050480
p050484
p050486
p050487
p050494
p050528
p050532
p050537
p050547
p050549
p050561
p050567
p050575
p050576
p050579
p050581
p050603
p050618
p050620
p050621
p050624
p050626
p050634
p050643
p050649
p050664
p050703
p050710
p050721
p050722
p050729
p050735
p050744
p050762
p050767
p050770
p050772
p050793
p050804
p050816
p050817
p050822
p050824
p050826
p050827
p050832
p050846
p050847
p050863
p050877
p050880
p050882
p050883
p050885
p050888
p050899
p050915
p050928
p050941
p050976
p050984
p050985
p050987
p050991
p051000
p051013
p051017
p051021
p051025
p051039
p051045
p051053
p051064
p051072
p051078
p051082
p051086
p051108
p051109
p051121
p051136
p051145
p051179
p051180
p051188
p051202
p051203
p051226
p051256
p051259
p051277
p051291
p051300
p051301
p051321
p051322
p051327
p051337
p051343
p051349
p051358
p051359
p051374
p051377
p051385
p051387
p051390
p051393
p051399
p051424
p051439
p051446
p051451
p051459
p051462
p051466
p051482
p051484
p051490
p051495
p051497
p051506
p051515
p051517
p051519
p051529
p051542
p051545
p051577
p051582
p051596
p051597
p051625
p051628
p051635
p051642
p051648
p051660
p051663
p051670
p051687
p051694
p051716
p051722
p051724
p051728
p051733
p051754
p051761
p051767
p051786
p051790
p051791
p051793
p051795
p051798
p051802
p051805
p051821
p051823
p051847
p051856
p051858
p051863
p051864
p051871
p051882
p051890
p051891
p051909
p051912
p051929
p051933
p051936
p051942
p051951
p051964
p051966
p051985
p051986
p051992
p052011
p052018
p052021
p052034
p052039
p052054
p052057
p052068
p052076
p052084
p052087
p052089
p052094
p052109
p052118
p052119
p052125
p052146
p052172
p052183
p052191
p052197
p052199
p052205
p052207
p052229
p052234
p052238
p052260
p052264
p052269
p052296
p052298
p052302
p052307
p052311
p052315
p052319
p052329
p052330
p052347
p052355
p052359
p052363
p052370
p052409
p052420
p052436
p052441
p052452
p052453
p052456
p052462
p052478
p052482
p052490
p052503
p052505
p052506
p052529
p052530
p052532
p052547
p052550
p052556
p052566
p052574
p052582
p052586
p052592
p052593
p052602
p052604
p052620
p052622
p052641
p052642
p052647
p052666
p052680
p052693
p052695
p052696
p052697
p052703
p052710
p052728
p052730
p052736
p052739
p052740
p052746
p052750
p052762
p052764
p052766
p052778
p052779
p052791
p052796
p052802
p052807
p052808
p052815
p052816
p052828
p052846
p052848
p052867
p052872
p052875
p052876
p052878
p052897
p052898
p052899
p052900
p052932
p052934
p052945
p052952
p052969
p052972
p052974
p052978
p052996
p053013
p053014
p053015
p053019
p053020
p053023
p053036
p053084
p053098
p053102
p053111
p053119
p053132
p053136
p053149
p053173
p053176
p053191
p053193
p053205
p053216
p053228
p053238
p053247
p053252
p053282
p053283
p053290
p053294
p053321
p053322
p053342
p053348
p053355
p053397
p053404
p053417
p053419
p053425
p053435
p053440
p053441
p053443
p053451
p053462
p053470
p053492
p053514
p053531
p053541
p053545
p053548
p053567
p053594
p053596
p053608
p053609
p053612
p053632
p053636
p053639
p053642
p053663
p053669
p053695
p053707
p053714
p053722
p053724
p053731
p053759
p053770
p053804
p053806
p053812
p053821
p053822
p053833
p053835
p053842
p053845
p053850
p053856
p053865
p053866
p053868
p053875
p053876
p053878
p053896
p053919
p053939
p053944
p053947
p053978
p054003
p054005
p054009
p054020
p054041
p054043
p054050
p054073
p054078
p054088
p054090
p054096
p054110
p054120
p054124
p054132
p054134
p054135
p054138
p054145
p054147
p054153
p054154
p054174
p054177
p054182
p054183
p054187
p054191
p054195
p054197
p054209
p054221
p054241
p054264
p054276
p054289
p054305
p054332
p054341
p054353
p054369
p054385
p054386
p054397
p054406
p054420
p054429
p054444
p054465
p054470
p054479
p054487
p054523
p054540
p054541
p054563
p054585
p054586
p054589
p054592
p054600
p054609
p054610
p054613
p054620
p054636
p054639
p054641
p054643
p054660
p054661
p054663
p054675
p054679
p054681
p054682
p054683
p054690
p054695
p054703
p054708
p054716
p054729
p054735
p054736
p054755
p054757
p054775
p054817
p054822
p054823
p054825
p054826
p054850
p054872
p054878
p054882
p054888
p054893
p054900
p054904
p054911
p054922
p054929
p054934
p054935
p054940
p054958
p054960
p054961
p054968
p054969
p054979
p054987
p054994
p055022
p055023
p055027
p055030
p055044
p055069
p055074
p055078
p055083
p055090
p055094
p055104
p055106
p055115
p055121
p055122
p055143
p055149
p055186
p055193
p055201
p055204
p055219
p055247
p055260
p055263
p055273
p055308
p055332
p055337
p055357
p055363
p055365
p055386
p055393
p055402
p055423
p055446
p055473
p055477
p055507
p055512
p055523
p055526
p055539
p055545
p055559
p055563
p055575
p055579
p055585
p055591
p055597
p055601
p055611
p055616
p055624
p055638
p055657
p055659
p055669
p055673
p055677
p055679
p055682
p055689
p055703
p055704
p055716
p055722
p055725
p055729
p055730
p055731
p055753
p055772
p055781
p055821
p055841
p055849
p055853
p055867
p055886
p055901
p055909
p055910
p055920
p055921
p055922
p055963
p055973
p055987
p055992
p056025
p056027
p056029
p056038
p056040
p056046
p056060
p056069
p056076
p056128
p056130
p056155
p056179
p056187
p056191
p056201
p056204
p056224
p056227
p056229
p056243
p056257
p056264
p056267
p056269
p056285
p056287
p056289
p056290
p056294
p056307
p056319
p056322
p056331
p056332
p056333
p056361
p056364
p056378
p056391
p056409
p056429
p056440
p056443
p056449
p056460
p056464
p056490
p056492
p056502
p056549
p056562
p056583
p056593
p056620
p056634
p056661
p056674
p056678
p056697
p056740
p056746
p056751
p056772
p056796
p056798
p056802
p056819
p056829
p056840
p056849
p056854
p056858
p056867
p056878
p056880
p056890
p056930
p056947
p056960
p056963
p056965
p056986
p056988
p056996
p057001
p057004
p057023
p057036
p057052
p057054
p057056
p057061
p057073
p057083
p057091
p057092
p057093
p057097
p057100
p057105
p057120
p057130
p057133
p057143
p057157
p057158
p057171
p057172
p057199
p057208
p057215
p057220
p057231
p057239
p057251
p057255
p057256
p057276
p057283
p057293
p057299
p057306
p057307
p057308
p057314
p057321
p057330
p057408
p057443
p057445
p057449
p057454
p057465
p057476
p057485
p057489
p057490
p057496
p057506
p057507
p057511
p057528
p057535
p057545
p057562
p057568
p057585
p057594
p057599
p057614
p057619
p057678
p057686
p057690
p057697
p057741
p057751
p057752
p057767
p057770
p057774
p057795
p057815
p057848
p057853
p057865
p057869
p057872
p057877
p057878
p057886
p057887
p057899
p057904
p057905
p057907
p057911
p057934
p057935
p057955
p057964
p057968
p057972
p057981
p057983
p057989
p057990
p057997
p058008
p058010
p058022
p058063
p058077
p058099
p058102
p058113
p058128
p058134
p058135
p058144
p058155
p058163
p058187
p058199
p058205
p058218
p058237
p058238
p058240
p058242
p058247
p058258
p058264
p058265
p058269
p058270
p058271
p058278
p058286
p058296
p058297
p058300
p058303
p058310
p058313
p058319
p058321
p058327
p058337
p058356
p058371
p058377
p058391
p058392
p058416
p058430
p058431
p058433
p058438
p058449
p058452
p058456
p058475
p058480
p058483
p058501
p058505
p058508
p058519
p058521
p058526
p058530
p058541
p058570
p058574
p058576
p058586
p058590
p058616
p058617
p058631
p058634
p058640
p058654
p058662
p058667
p058668
p058672
p058689
p058732
p058740
p058753
p058757
p058771
p058773
p058792
p058793
p058802
p058809
p058812
p058817
p058834
p058852
p058855
p058868
p058899
p058903
p058917
p058932
p058938
p058939
p058965
p058967
p058975
p058984
p058991
p058993
p059004
p059039
p059049
p059053
p059057
p059073
p059076
p059085
p059087
p059101
p059102
p059120
p059133
p059135
p059147
p059152
p059158
p059161
p059169
p059186
p059188
p059194
p059198
p059199
p059200
p059201
p059210
p059215
p059222
p059225
p059227
p059252
p059267
p059268
p059270
p059276
p059285
p059290
p059291
p059301
p059307
p059314
p059318
p059333
p059347
p059364
p059367
p059374
p059375
p059381
p059385
p059411
p059417
p059447
p059448
p059457
p059462
p059469
p059490
p059494
p059503
p059505
p059507
p059513
p059537
p059543
p059546
p059570
p059585
p059590
p059603
p059638
p059642
p059657
p059669
p059674
p059700
p059701
p059703
p059707
p059710
p059720
p059726
p059736
p059757
p059761
p059762
p059777
p059783
p059785
p059788
p059789
p059795
p059797
p059801
p059807
p059828
p059833
p059841
p059844
p059845
p059848
p059864
p059870
p059883
p059886
p059889
p059907
p059918
p059924
p059930
p059936
p059941
p059948
p059960
p059964
p059976
p059991
p059997