Database Open Access
Leipzig Heart Center ECG-Database: Arrhythmias in Children and Patients with Congenital Heart Disease
Sophia Klehs , Daniel Franke , Bayhas Alhamad , Roman Gebauer , Linus Teich , Tobias Teich , Christian Paech
Published: March 19, 2025. Version: 1.0.0
When using this resource, please cite:
(show more options)
Klehs, S., Franke, D., Alhamad, B., Gebauer, R., Teich, L., Teich, T., & Paech, C. (2025). Leipzig Heart Center ECG-Database: Arrhythmias in Children and Patients with Congenital Heart Disease (version 1.0.0). PhysioNet. https://doi.org/10.13026/7a4j-vn37.
Klehs, S., Franke, D., Alhamad, B., Gebauer, R., Teich, L., Dähnert, I., Teich, T., & Paech, C. Leipzig Heart Center ECG-Database: Arrhythmias in children and patients with congenital heart disease.
Please include the standard citation for PhysioNet:
(show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.
Abstract
Interpretation of Electrocardiograms (ECG) is increasingly complemented by algorithms. These algorithms are based on large datasets. This ECG database consists of children and adults with congenital heart defects (CHD) including many arrhythmia annotations. This dataset, notable for its manual annotations and inclusion of intracardiac electrograms alongside traditional 12-lead ECGs, offers 1075.85 minutes of recordings (in total 113924 annotated beats) that capture various cardiac rhythms and arrhythmias such as supraventricular tachycardia and ventricular tachycardia. The data were meticulously collected from patients undergoing electrophysiological studies, with subsequent annotations by expert reviewers using the LightWAVE® software. The significance of this database lies in its focus on pediatric arrhythmias and arrhythmias of patients with congenital heart defect areas currently underrepresented in existing datasets, which predominantly feature adult pathologies. This resource aims to enhance algorithmic development for ECG interpretation, leveraging machine learning to improve diagnosis and treatment outcomes in these sensitive groups. The dataset not only serves as a critical tool for developing precision medicine but also sets a precedent for future expansions to include a broader spectrum of congenital heart defects conditions, thereby supporting the evolution of cardiac care through advanced computational techniques.
Background
Electrocardiography (ECG) is the most important diagnostic tool, in diagnosing cardiac arrhythmias. Interpretation of ECG is increasingly complemented by algorithms based on large datasets. So far these ECG-datasets mainly contain ECGs of adults, and the focus is primarily on adult pathologies like myocardial infarction, atrial fibrillation and conduction diseases [1-4]. The main arrhythmias of children, however, are completely different from arrhythmias of adults. The most common arrhythmias in children are supraventricular tachycardias like atrioventricular reentrant tachycardias and atrioventricular nodal reentrant tachycardias. Databases containing these arrhythmias are very rare. The aim of this study was to create an ECG-database of children with arrhythmia annotations beat per beat. Additionally to the arrhythmias of children, this is the first database containing also ECG data of patients with congenital heart disease. These patients have a high percentage of conduction anomalies (like right bundle branch block) and a risk for ventricular tachycardias. This resource aims to enhance algorithmic development for ECG interpretation, leveraging machine learning to improve diagnosis and treatment outcomes in children and patients with congenital heart disease.
Methods
The raw signal data were extracted from electrophysiological studies of children and patients with congenital heart defects.
Signals were recorded using CardioLab® (GE Medical Systems, Milwaukee, WI). Data were exported as a .txt format. The dataset is provided in the WaveForm Database (WFDB) format. The WFDB Software package was used to convert the data into the WFDB format [5]. The WFDB Software package was also used for various processing steps and validation in the annotation process of the data. The recordings were digitized at 977 samples per second. Filter settings were 0.05 Hz-100 Hz. The signals from the individual leads are the raw data as exported by CardioLab®. No filters or other signal quality enhancements have been applied afterward.
To annotate the dataset, the Software LightWAVE® was used [6, 7]. The annotations were manually annotated by two ECG experts. The first expert annotated all ECGs. To support the process, the first expert could annotate long ranges of the same annotation by marking the start and end. By using the WFDB Software package and an R-peak detection algorithm, the ranges were filled [8]. The second expert checked all annotations and made corrections if necessary. For unclear parts of the record, further experts were consulted.
Data Description
The ECG dataset comprises detailed recordings from both children with supraventricular tachycardias and adultswith congenital heart defects, specifically categorized into separate groups. The files are systematically named to reflect the patient group: filenames x001 through x029 represent children's ECGs, while x100 through x109 denote adult patients' ECGs.
The files children-subject-info.csv and adults-subject-info.csv provide the following information:
-
patient ID
-
patient demographics (gender, age)
-
patient diagnosis
-
the location of the accessory pathway
-
the duration of the ECG that is provided for the patient
The dataset consists of 39 ECGs of 39 patients (length ranging from 00:01:17 hours to 02:30:31 hours). In total 113924 beats were annotated. Normal sinus rhythm beats are the majority (33%), but the database contains also sinus rhythm beats with Preexcitation (12%) or complete right bundle branch block (11%). Supraventricular tachycardias (23%) are subdivided into atrioventricular reentrant tachycardias (15%) and atrioventricular nodal reentrant tachycardias ( 8%). 12% of all beats are paced beats (7% atrial paced beats and 5% ventricular paced beats) and 7% are premature beats, as premature ventricular beats, premature atrial beats and preexcited premature atrial beats.
The distribution of beat types annotated in this database is as follows:
Explanation of Beats |
Symbol |
Aux String |
---|---|---|
Sinus rhythm |
||
Normal beats |
N or • |
|
Preexcitation* |
N |
N-Prex |
Complete right bundle branch block |
R |
|
Complete left bundle branch block |
L |
|
AV-Block 1°# |
b |
BI |
Junctional escape beats |
j |
|
Tachycardias |
||
Ventricular Tachycardia# |
X |
VT |
Accelerated Idioventricular Rhythm# |
X |
IVR |
Supraventricular Tachycardia |
||
Atrioventricular reentrant tachycardia# |
X |
AVRT |
Atrioventricular nodal reentrant tachycardia# |
X |
AVNRT |
Aberrated AVRT# |
X |
avrt |
Aberrated AVNRT# |
X |
avnrt |
AVNRT with AV-Block 2°# |
X |
AVNRT+BII |
Atrial Tachycardia |
||
Atrial Fibrillation# |
X |
AFIB |
Ectopic atrial tachycardia# |
X |
EAT |
Atrial Flutter# |
X |
AFL |
Premature Beats |
||
Premature atrial beats |
A |
|
Aberrated premature atrial beats |
a |
|
Preexcitated premature atrial beats* |
A |
A-Prex |
Premature ventricular beats |
V |
|
Fusion beats |
F |
|
Premature junctional beats |
J |
|
Paced beats |
||
Atrial Paced beats* |
/ |
/A |
Ventricular Paced beats* |
/ |
/V |
Fusion of Ventricular paced beat and Normal Beat |
f |
*custom Aux Strings for standard WFDB annotations to further specify these annotations.
#custom annotations used in addition to the standard WFDB annotations
This dataset includes the following types of rhythm annotations:
Explanation of Rhythm |
Symbol |
Aux String |
---|---|---|
Sinus rhythm |
+ |
(N |
Tachycardias |
||
Ventricular Tachycardia |
+ |
(VT |
Accelerated Idioventricular Rhythm |
+ |
(IVR |
Supraventricular Tachycardia |
||
Atrioventricular reentrant tachycardia |
+ |
(AVRT |
Atrioventricular nodal reentrant tachcardia |
+ |
(AVNRT |
Atrial Tachycardia |
||
Atrial Fibrillation |
+ |
(AFIB |
Ectopic atrial tachycardia |
+ |
(EAT |
Atrial Flutter |
+ |
(AFL |
Ectopic rhythm |
||
Ectopic atrial rhythm |
+ |
(A |
Ventricular bigeminy rhythm |
+ |
(B |
Junctional rhythm |
+ |
(J |
Paced beats |
||
Atrial Paced rhythm |
+ |
(/A |
Ventricular Paced rhythm |
+ |
(/V |
For information about the entire dataset and the abbreviations and annotations used in it, the file dataset_info.csv can be used.
Usage Notes
ECG datasets are crucial for developing AI-based arrhythmia detection algorithms. However, most existing datasets focus on adult pathologies, such as myocardial infarction, conduction diseases, or atrial fibrillation [1-4, 9]. Datasets including children are rare [2], and those featuring pediatric-specific conditions like supraventricular tachycardias are especially limited [1]. This manually annotated open-source database addresses this gap by providing normal and pathological ECGs of children, supporting advancements in AI-based rhythm analysis [10, 11].
This dataset provides ECG recordings with arrhythmia annotations, suitable for developing machine learning models to detect various cardiac conditions. The ECGs include:
-
Supraventricular tachycardias (e.g., AVRT, AVNRT)
-
Ventricular tachycardias
-
Preexcitation syndrome
-
Paced rhythms
-
Complete right bundle branch block (CRBBB)
To utilize the data, researchers may use the WFDB Software Package for reading ECG data.
Each patient's data is encapsulated in three associated files:
-
Header File (.hea): This file provides metadata about the ECG recordings, including the number of channels, sampling frequency, and signal gain, essential for understanding and processing the ECG data correctly.
-
Data File (.dat): This file contains the actual ECG signal data, allowing detailed analysis and application of processing algorithms.
-
Annotation File (.atr): This file includes annotations of the ECG signals, detailing various cardiac events and rhythms identified by expert analysis, which are crucial for training and validating diagnostic algorithms.
Further details about how to use and how to get started with the dataset can be found in the README.md file.
As these ECGs are extracted from electrophysiological studies, they include segments with non-physiological stimulation. Currently, the database of adults with congenital heart defects is limited to patients with Tetralogy of Fallot, but plans are underway to include other congenital heart defects.
Ethics
Patients or their parents/legal guardians provided written informed consent prior to electrophysiological study. The study was approved by the institutional ethics committee (ethics approval number 120/23-ek) and fully complies with the Declaration of Helsinki.
Conflicts of Interest
The authors have no conflicts of interest to declare.
References
- Zheng J, Zhang J, Danioko S, Yao H, Guo H, Rakovski C. A 12-lead electrocardiogram database for arrhythmia research covering more than 10,000 patients. Sci Data. 2020;7(1):48. https://doi.org/10.1038/s41597-020-0386-x
- Wagner P, Strodthoff N, Bousseljot RD, Kreiseler D, Lunze FI, Samek W, et al. PTB-XL: A large publicly available ECG dataset. Sci Data. 2020;7(1):154. https://doi.org/10.1038/s41597-020-0495-6
- Attia ZI, Noseworthy PA, Lopez-Jimenez F, Asirvatham SJ, Deshmukh AJ, Gersh BJ, et al. An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: a retrospective analysis of outcome prediction. Lancet. 2019;394(10201):861–867. https://doi.org/10.1016/S0140-6736(19)31721-0
- Yoo H, Yum Y, Park SW, Lee JM, Jang M, Kim Y, et al. Standardized database of 12-lead electrocardiograms with a common standard for the promotion of cardiovascular research: KURIAS-ECG. Healthc Inform Res. 2023;29(2):132–144. https://doi.org/10.4258/hir.2023.29.2.132
- Xie C, McCullum L, Johnson A, Pollard T, Gow B, Moody B. Waveform Database Software Package (WFDB) for Python (version 4.1.0). PhysioNet. 2023. https://doi.org/10.13026/9njx-6322
- Goldberger AL, Amaral LA, Glass L, Hausdorff JM, Ivanov PC, Mark RG, et al. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation. 2000;101(23):E215–220. https://doi.org/10.1161/01.cir.101.23.e215
- Moody GB, Mark RG. The impact of the MIT-BIH arrhythmia database. IEEE Eng Med Biol Mag. 2001;20(3):45–50. https://doi.org/10.1109/51.932724
- Kathirvel P, Sabarimalai Manikandan M, Prasanna SRM. An efficient R-peak detection based on new nonlinear transformation and first-order Gaussian differentiator. Cardiovasc Eng Technol. 2011;2(4):408–425. https://doi.org/10.1007/s13239-011-0065-3
- Boulif A, Ananou B, Ouladsine M, Delliaux S. A literature review: ECG-based models for arrhythmia diagnosis using artificial intelligence techniques. Bioinform Biol Insights. 2023;17:11779322221149600. https://doi.org/10.1177/11779322221149600
- Teich L, Franke D, Michaelis A, Dähnert I, Gebauer RA, Markel F, et al. Development of an AI-based automated analysis of pediatric Apple Watch iECGs. Front Pediatr. 2023;11:1185629. https://doi.org/10.3389/fped.2023.1185629
- Yildirim O, Talo M, Ciaccio EJ, Tan RS, Acharya UR. Accurate deep neural network model to detect cardiac arrhythmia on more than 10,000 individual subject ECG records. Comput Methods Programs Biomed. 2020;197:105740. https://doi.org/10.1016/j.cmpb.2020.105740
Access
Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.
License (for files):
Open Data Commons Attribution License v1.0
Discovery
DOI (version 1.0.0):
https://doi.org/10.13026/7a4j-vn37
DOI (latest version):
https://doi.org/10.13026/2ha4-km43
Topics:
artificial intelligence
12-lead
ecg
arrhythmias
chd
intracardiac recordings
annotated
congenital heart disease
Corresponding Author
Files
Total uncompressed size: 3.3 GB.
Access the files
- Download the ZIP file (1.2 GB)
-
Download the files using your terminal:
wget -r -N -c -np https://physionet.org/files/leipzig-heart-center-ecg/1.0.0/
Name | Size | Modified |
---|---|---|
ANNOTATORS (download) | 73 B | 2024-12-11 |
LICENSE.txt (download) | 19.9 KB | 2025-03-03 |
README.md (download) | 7.8 KB | 2024-12-18 |
RECORDS (download) | 215 B | 2024-08-01 |
SHA256SUMS.txt (download) | 9.2 KB | 2025-03-19 |
adults-subject-info.csv (download) | 478 B | 2024-12-11 |
children-subject-info.csv (download) | 1.3 KB | 2024-12-11 |
dataset_info.csv (download) | 2.1 KB | 2024-12-11 |
dataset_info_data_dictionary.csv (download) | 359 B | 2024-12-11 |
subject_data_dictionary.csv (download) | 671 B | 2024-12-11 |
x001.atr (download) | 62.6 KB | 2024-08-02 |
x001.dat (download) | 463.6 MB | 2024-08-02 |
x001.hea (download) | 899 B | 2024-08-02 |
x0010.atr (download) | 8.4 KB | 2024-08-02 |
x0010.dat (download) | 16.9 MB | 2024-08-02 |
x0010.hea (download) | 917 B | 2024-08-02 |
x0011.atr (download) | 19.5 KB | 2024-08-02 |
x0011.dat (download) | 38.1 MB | 2024-08-02 |
x0011.hea (download) | 927 B | 2024-08-02 |
x0012.atr (download) | 5.7 KB | 2024-08-02 |
x0012.dat (download) | 37.1 MB | 2024-08-02 |
x0012.hea (download) | 920 B | 2024-08-02 |
x0013.atr (download) | 1.9 KB | 2024-08-02 |
x0013.dat (download) | 12.4 MB | 2024-08-02 |
x0013.hea (download) | 904 B | 2024-08-02 |
x0014.atr (download) | 5.7 KB | 2024-08-02 |
x0014.dat (download) | 10.6 MB | 2024-08-02 |
x0014.hea (download) | 904 B | 2024-08-02 |
x0015.atr (download) | 3.1 KB | 2024-08-02 |
x0015.dat (download) | 9.8 MB | 2024-08-02 |
x0015.hea (download) | 856 B | 2024-08-02 |
x0016.atr (download) | 8.4 KB | 2024-08-02 |
x0016.dat (download) | 36.7 MB | 2024-08-02 |
x0016.hea (download) | 921 B | 2024-08-02 |
x0017.atr (download) | 4.9 KB | 2024-08-02 |
x0017.dat (download) | 11.0 MB | 2024-08-02 |
x0017.hea (download) | 920 B | 2024-08-02 |
x0018.atr (download) | 7.0 KB | 2024-08-02 |
x0018.dat (download) | 18.9 MB | 2024-08-02 |
x0018.hea (download) | 917 B | 2024-08-02 |
x0019.atr (download) | 3.6 KB | 2024-08-02 |
x0019.dat (download) | 12.1 MB | 2024-08-02 |
x0019.hea (download) | 910 B | 2024-08-02 |
x002.atr (download) | 48.7 KB | 2024-08-02 |
x002.dat (download) | 317.9 MB | 2024-08-02 |
x002.hea (download) | 856 B | 2024-08-02 |
x0020.atr (download) | 11.5 KB | 2024-08-02 |
x0020.dat (download) | 26.7 MB | 2024-08-02 |
x0020.hea (download) | 898 B | 2024-08-02 |
x0021.atr (download) | 11.9 KB | 2024-08-02 |
x0021.dat (download) | 29.4 MB | 2024-08-02 |
x0021.hea (download) | 903 B | 2024-08-02 |
x0022.atr (download) | 1.1 KB | 2024-08-02 |
x0022.dat (download) | 13.1 MB | 2024-08-02 |
x0022.hea (download) | 919 B | 2024-08-02 |
x0023.atr (download) | 6.5 KB | 2024-08-02 |
x0023.dat (download) | 22.1 MB | 2024-08-02 |
x0023.hea (download) | 974 B | 2024-08-02 |
x0024.atr (download) | 10.6 KB | 2024-08-02 |
x0024.dat (download) | 38.3 MB | 2024-08-02 |
x0024.hea (download) | 906 B | 2024-08-02 |
x0025.atr (download) | 11.8 KB | 2024-08-02 |
x0025.dat (download) | 35.4 MB | 2024-08-02 |
x0025.hea (download) | 921 B | 2024-08-02 |
x0026.atr (download) | 1.9 KB | 2024-08-02 |
x0026.dat (download) | 11.0 MB | 2024-08-02 |
x0026.hea (download) | 901 B | 2024-08-02 |
x0027.atr (download) | 1.4 KB | 2024-08-02 |
x0027.dat (download) | 4.1 MB | 2024-08-02 |
x0027.hea (download) | 925 B | 2024-08-02 |
x0028.atr (download) | 4.8 KB | 2024-08-02 |
x0028.dat (download) | 13.5 MB | 2024-08-02 |
x0028.hea (download) | 960 B | 2024-08-02 |
x0029.atr (download) | 8.9 KB | 2024-08-02 |
x0029.dat (download) | 37.7 MB | 2024-08-02 |
x0029.hea (download) | 887 B | 2024-08-02 |
x003.atr (download) | 105.6 KB | 2024-08-02 |
x003.dat (download) | 479.7 MB | 2024-08-02 |
x003.hea (download) | 903 B | 2024-08-02 |
x004.atr (download) | 120.1 KB | 2024-08-02 |
x004.dat (download) | 439.3 MB | 2024-08-02 |
x004.hea (download) | 902 B | 2024-08-02 |
x005.atr (download) | 4.6 KB | 2024-08-02 |
x005.dat (download) | 93.4 MB | 2024-08-02 |
x005.hea (download) | 864 B | 2024-08-02 |
x006.atr (download) | 51.8 KB | 2024-08-02 |
x006.dat (download) | 440.3 MB | 2024-08-02 |
x006.hea (download) | 906 B | 2024-08-02 |
x007.atr (download) | 1.8 KB | 2024-08-02 |
x007.dat (download) | 9.9 MB | 2024-08-02 |
x007.hea (download) | 897 B | 2024-08-02 |
x008.atr (download) | 6.7 KB | 2024-08-02 |
x008.dat (download) | 23.4 MB | 2024-08-02 |
x008.hea (download) | 905 B | 2024-08-02 |
x009.atr (download) | 23.1 KB | 2024-08-02 |
x009.dat (download) | 46.6 MB | 2024-08-02 |
x009.hea (download) | 908 B | 2024-08-02 |
x100.atr (download) | 5.4 KB | 2024-08-02 |
x100.dat (download) | 41.5 MB | 2024-08-02 |
x100.hea (download) | 888 B | 2024-08-02 |
x101.atr (download) | 3.8 KB | 2024-08-02 |
x101.dat (download) | 56.9 MB | 2024-08-02 |
x101.hea (download) | 894 B | 2024-08-02 |
x102.atr (download) | 4.4 KB | 2024-08-02 |
x102.dat (download) | 66.8 MB | 2024-08-02 |
x102.hea (download) | 655 B | 2024-08-02 |
x103.atr (download) | 7.9 KB | 2024-08-02 |
x103.dat (download) | 60.5 MB | 2024-08-02 |
x103.hea (download) | 896 B | 2024-08-02 |
x104.atr (download) | 13.7 KB | 2024-08-02 |
x104.dat (download) | 183.1 MB | 2024-08-02 |
x104.hea (download) | 894 B | 2024-08-02 |
x105.atr (download) | 11.4 KB | 2024-08-02 |
x105.dat (download) | 65.1 MB | 2024-08-02 |
x105.hea (download) | 671 B | 2024-08-02 |
x106.atr (download) | 5.0 KB | 2024-08-02 |
x106.dat (download) | 56.7 MB | 2024-08-02 |
x106.hea (download) | 878 B | 2024-08-02 |
x107.atr (download) | 4.2 KB | 2024-08-02 |
x107.dat (download) | 46.6 MB | 2024-08-02 |
x107.hea (download) | 666 B | 2024-08-02 |
x108.atr (download) | 9.5 KB | 2024-08-02 |
x108.dat (download) | 59.1 MB | 2024-08-02 |
x108.hea (download) | 658 B | 2024-08-02 |
x109.atr (download) | 4.2 KB | 2024-08-02 |
x109.dat (download) | 24.9 MB | 2024-08-02 |
x109.hea (download) | 666 B | 2024-08-02 |