Database Open Access

Leipzig Heart Center ECG-Database: Arrhythmias in Children and Patients with Congenital Heart Disease

Sophia Klehs Daniel Franke Bayhas Alhamad Roman Gebauer Linus Teich Tobias Teich Christian Paech

Published: March 19, 2025. Version: 1.0.0


When using this resource, please cite: (show more options)
Klehs, S., Franke, D., Alhamad, B., Gebauer, R., Teich, L., Teich, T., & Paech, C. (2025). Leipzig Heart Center ECG-Database: Arrhythmias in Children and Patients with Congenital Heart Disease (version 1.0.0). PhysioNet. https://doi.org/10.13026/7a4j-vn37.

Additionally, please cite the original publication:

Klehs, S., Franke, D., Alhamad, B., Gebauer, R., Teich, L., Dähnert, I., Teich, T., & Paech, C. Leipzig Heart Center ECG-Database: Arrhythmias in children and patients with congenital heart disease.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

Interpretation of Electrocardiograms (ECG) is increasingly complemented by algorithms. These algorithms are based on large datasets. This ECG database consists of children and adults with congenital heart defects (CHD) including many arrhythmia annotations. This dataset, notable for its manual annotations and inclusion of intracardiac electrograms alongside traditional 12-lead ECGs, offers 1075.85 minutes of recordings (in total 113924 annotated beats) that capture various cardiac rhythms and arrhythmias such as supraventricular tachycardia and ventricular tachycardia. The data were meticulously collected from patients undergoing electrophysiological studies, with subsequent annotations by expert reviewers using the LightWAVE® software. The significance of this database lies in its focus on pediatric arrhythmias and arrhythmias of patients with congenital heart defect areas currently underrepresented in existing datasets, which predominantly feature adult pathologies. This resource aims to enhance algorithmic development for ECG interpretation, leveraging machine learning to improve diagnosis and treatment outcomes in these sensitive groups. The dataset not only serves as a critical tool for developing precision medicine but also sets a precedent for future expansions to include a broader spectrum of congenital heart defects conditions, thereby supporting the evolution of cardiac care through advanced computational techniques.


Background

Electrocardiography (ECG) is the most important diagnostic tool, in diagnosing cardiac arrhythmias. Interpretation of ECG is increasingly complemented by algorithms based on large datasets. So far these ECG-datasets mainly contain ECGs of adults, and the focus is primarily on adult pathologies like myocardial infarction, atrial fibrillation and conduction diseases [1-4]. The main arrhythmias of children, however, are completely different from arrhythmias of adults. The most common arrhythmias in children are supraventricular tachycardias like atrioventricular reentrant tachycardias and atrioventricular nodal reentrant tachycardias. Databases containing these arrhythmias are very rare. The aim of this study was to create an ECG-database of children with arrhythmia annotations beat per beat. Additionally to the arrhythmias of children, this is the first database containing also ECG data of patients with congenital heart disease. These patients have a high percentage of conduction anomalies (like right bundle branch block) and a risk for ventricular tachycardias. This resource aims to enhance algorithmic development for ECG interpretation, leveraging machine learning to improve diagnosis and treatment outcomes in children and patients with congenital heart disease.


Methods

The raw signal data were extracted from electrophysiological studies of children and patients with congenital heart defects.

Signals were recorded using CardioLab® (GE Medical Systems, Milwaukee, WI). Data were exported as a .txt format. The dataset is provided in the WaveForm Database (WFDB) format. The WFDB Software package was used to convert the data into the WFDB format [5]. The WFDB Software package was also used for various processing steps and validation in the annotation process of the data. The recordings were digitized at 977 samples per second. Filter settings were 0.05 Hz-100 Hz. The signals from the individual leads are the raw data as exported by CardioLab®. No filters or other signal quality enhancements have been applied afterward.

To annotate the dataset, the Software LightWAVE® was used [6, 7]. The annotations were manually annotated by two ECG experts. The first expert annotated all ECGs. To support the process, the first expert could annotate long ranges of the same annotation by marking the start and end. By using the WFDB Software package and an R-peak detection algorithm, the ranges were filled [8]. The second expert checked all annotations and made corrections if necessary. For unclear parts of the record, further experts were consulted.


Data Description

The ECG dataset comprises detailed recordings from both children with supraventricular tachycardias and adultswith congenital heart defects, specifically categorized into separate groups. The files are systematically named to reflect the patient group: filenames x001 through x029 represent children's ECGs, while x100 through x109 denote adult patients' ECGs.

The files children-subject-info.csv and adults-subject-info.csv provide the following information:

  • patient ID

  • patient demographics (gender, age)

  • patient diagnosis

  • the location of the accessory pathway

  • the duration of the ECG that is provided for the patient

The dataset consists of 39 ECGs of 39 patients (length ranging from 00:01:17 hours to 02:30:31 hours). In total 113924 beats were annotated. Normal sinus rhythm beats are the majority (33%), but the database contains also sinus rhythm beats with Preexcitation (12%) or complete right bundle branch block (11%). Supraventricular tachycardias (23%) are subdivided into atrioventricular reentrant tachycardias (15%) and atrioventricular nodal reentrant tachycardias ( 8%). 12% of all beats are paced beats (7% atrial paced beats and 5% ventricular paced beats) and 7% are premature beats, as premature ventricular beats, premature atrial beats and preexcited premature atrial beats.

The distribution of beat types annotated in this database is as follows:

Explanation of Beats

Symbol

Aux String

Sinus rhythm

Normal beats

N or •

Preexcitation*

N

N-Prex

Complete right bundle branch block

R

Complete left bundle branch block

L

AV-Block 1°#

b

BI

Junctional escape beats

j

Tachycardias

Ventricular Tachycardia#

X

VT

Accelerated Idioventricular Rhythm#

X

IVR

Supraventricular Tachycardia

Atrioventricular reentrant tachycardia#

X

AVRT

Atrioventricular nodal reentrant tachycardia#

X

AVNRT

Aberrated AVRT#

X

avrt

Aberrated AVNRT#

X

avnrt

AVNRT with AV-Block 2°#

X

AVNRT+BII

Atrial Tachycardia

Atrial Fibrillation#

X

AFIB

Ectopic atrial tachycardia#

X

EAT

Atrial Flutter#

X

AFL

Premature Beats

Premature atrial beats

A

Aberrated premature atrial beats

a

Preexcitated premature atrial beats*

A

A-Prex

Premature ventricular beats

V

Fusion beats

F

Premature junctional beats

J

Paced beats

Atrial Paced beats*

/

/A

Ventricular Paced beats*

/

/V

Fusion of Ventricular paced beat and Normal Beat

f

*custom Aux Strings for standard WFDB annotations to further specify these annotations.

#custom annotations used in addition to the standard WFDB annotations


This dataset includes the following types of rhythm annotations:

Explanation of Rhythm

Symbol

Aux String

Sinus rhythm

+

(N

Tachycardias

Ventricular Tachycardia

+

(VT

Accelerated Idioventricular Rhythm

+

(IVR

Supraventricular Tachycardia

Atrioventricular reentrant tachycardia

+

(AVRT

Atrioventricular nodal reentrant tachcardia

+

(AVNRT

Atrial Tachycardia

Atrial Fibrillation

+

(AFIB

Ectopic atrial tachycardia

+

(EAT

Atrial Flutter

+

(AFL

Ectopic rhythm

Ectopic atrial rhythm

+

(A

Ventricular bigeminy rhythm

+

(B

Junctional rhythm

+

(J

Paced beats

Atrial Paced rhythm

+

(/A

Ventricular Paced rhythm

+

(/V

For information about the entire dataset and the abbreviations and annotations used in it, the file dataset_info.csv can be used.


Usage Notes

ECG datasets are crucial for developing AI-based arrhythmia detection algorithms. However, most existing datasets focus on adult pathologies, such as myocardial infarction, conduction diseases, or atrial fibrillation [1-4, 9]. Datasets including children are rare [2], and those featuring pediatric-specific conditions like supraventricular tachycardias are especially limited [1]. This manually annotated open-source database addresses this gap by providing normal and pathological ECGs of children, supporting advancements in AI-based rhythm analysis [10, 11].

This dataset provides ECG recordings with arrhythmia annotations, suitable for developing machine learning models to detect various cardiac conditions. The ECGs include:

  • Supraventricular tachycardias (e.g., AVRT, AVNRT)

  • Ventricular tachycardias

  • Preexcitation syndrome

  • Paced rhythms

  • Complete right bundle branch block (CRBBB)

To utilize the data, researchers may use the WFDB Software Package for reading ECG data.

Each patient's data is encapsulated in three associated files:

  1. Header File (.hea): This file provides metadata about the ECG recordings, including the number of channels, sampling frequency, and signal gain, essential for understanding and processing the ECG data correctly.

  2. Data File (.dat): This file contains the actual ECG signal data, allowing detailed analysis and application of processing algorithms.

  3. Annotation File (.atr): This file includes annotations of the ECG signals, detailing various cardiac events and rhythms identified by expert analysis, which are crucial for training and validating diagnostic algorithms.

Further details about how to use and how to get started with the dataset can be found in the README.md file.

As these ECGs are extracted from electrophysiological studies, they include segments with non-physiological stimulation. Currently, the database of adults with congenital heart defects is limited to patients with Tetralogy of Fallot, but plans are underway to include other congenital heart defects.


Ethics

Patients or their parents/legal guardians provided written informed consent prior to electrophysiological study. The study was approved by the institutional ethics committee (ethics approval number 120/23-ek) and fully complies with the Declaration of Helsinki.


Conflicts of Interest

The authors have no conflicts of interest to declare.


References

  1. Zheng J, Zhang J, Danioko S, Yao H, Guo H, Rakovski C. A 12-lead electrocardiogram database for arrhythmia research covering more than 10,000 patients. Sci Data. 2020;7(1):48. https://doi.org/10.1038/s41597-020-0386-x
  2. Wagner P, Strodthoff N, Bousseljot RD, Kreiseler D, Lunze FI, Samek W, et al. PTB-XL: A large publicly available ECG dataset. Sci Data. 2020;7(1):154. https://doi.org/10.1038/s41597-020-0495-6
  3. Attia ZI, Noseworthy PA, Lopez-Jimenez F, Asirvatham SJ, Deshmukh AJ, Gersh BJ, et al. An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: a retrospective analysis of outcome prediction. Lancet. 2019;394(10201):861–867. https://doi.org/10.1016/S0140-6736(19)31721-0
  4. Yoo H, Yum Y, Park SW, Lee JM, Jang M, Kim Y, et al. Standardized database of 12-lead electrocardiograms with a common standard for the promotion of cardiovascular research: KURIAS-ECG. Healthc Inform Res. 2023;29(2):132–144. https://doi.org/10.4258/hir.2023.29.2.132
  5. Xie C, McCullum L, Johnson A, Pollard T, Gow B, Moody B. Waveform Database Software Package (WFDB) for Python (version 4.1.0). PhysioNet. 2023. https://doi.org/10.13026/9njx-6322
  6. Goldberger AL, Amaral LA, Glass L, Hausdorff JM, Ivanov PC, Mark RG, et al. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation. 2000;101(23):E215–220. https://doi.org/10.1161/01.cir.101.23.e215
  7. Moody GB, Mark RG. The impact of the MIT-BIH arrhythmia database. IEEE Eng Med Biol Mag. 2001;20(3):45–50. https://doi.org/10.1109/51.932724
  8. Kathirvel P, Sabarimalai Manikandan M, Prasanna SRM. An efficient R-peak detection based on new nonlinear transformation and first-order Gaussian differentiator. Cardiovasc Eng Technol. 2011;2(4):408–425. https://doi.org/10.1007/s13239-011-0065-3
  9. Boulif A, Ananou B, Ouladsine M, Delliaux S. A literature review: ECG-based models for arrhythmia diagnosis using artificial intelligence techniques. Bioinform Biol Insights. 2023;17:11779322221149600. https://doi.org/10.1177/11779322221149600
  10. Teich L, Franke D, Michaelis A, Dähnert I, Gebauer RA, Markel F, et al. Development of an AI-based automated analysis of pediatric Apple Watch iECGs. Front Pediatr. 2023;11:1185629. https://doi.org/10.3389/fped.2023.1185629
  11. Yildirim O, Talo M, Ciaccio EJ, Tan RS, Acharya UR. Accurate deep neural network model to detect cardiac arrhythmia on more than 10,000 individual subject ECG records. Comput Methods Programs Biomed. 2020;197:105740. https://doi.org/10.1016/j.cmpb.2020.105740

Share
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Attribution License v1.0

Corresponding Author
You must be logged in to view the contact information.

Files

Total uncompressed size: 3.3 GB.

Access the files

Visualize waveforms

Folder Navigation: <base>
Name Size Modified
ANNOTATORS (download) 73 B 2024-12-11
LICENSE.txt (download) 19.9 KB 2025-03-03
README.md (download) 7.8 KB 2024-12-18
RECORDS (download) 215 B 2024-08-01
SHA256SUMS.txt (download) 9.2 KB 2025-03-19
adults-subject-info.csv (download) 478 B 2024-12-11
children-subject-info.csv (download) 1.3 KB 2024-12-11
dataset_info.csv (download) 2.1 KB 2024-12-11
dataset_info_data_dictionary.csv (download) 359 B 2024-12-11
subject_data_dictionary.csv (download) 671 B 2024-12-11
x001.atr (download) 62.6 KB 2024-08-02
x001.dat (download) 463.6 MB 2024-08-02
x001.hea (download) 899 B 2024-08-02
x0010.atr (download) 8.4 KB 2024-08-02
x0010.dat (download) 16.9 MB 2024-08-02
x0010.hea (download) 917 B 2024-08-02
x0011.atr (download) 19.5 KB 2024-08-02
x0011.dat (download) 38.1 MB 2024-08-02
x0011.hea (download) 927 B 2024-08-02
x0012.atr (download) 5.7 KB 2024-08-02
x0012.dat (download) 37.1 MB 2024-08-02
x0012.hea (download) 920 B 2024-08-02
x0013.atr (download) 1.9 KB 2024-08-02
x0013.dat (download) 12.4 MB 2024-08-02
x0013.hea (download) 904 B 2024-08-02
x0014.atr (download) 5.7 KB 2024-08-02
x0014.dat (download) 10.6 MB 2024-08-02
x0014.hea (download) 904 B 2024-08-02
x0015.atr (download) 3.1 KB 2024-08-02
x0015.dat (download) 9.8 MB 2024-08-02
x0015.hea (download) 856 B 2024-08-02
x0016.atr (download) 8.4 KB 2024-08-02
x0016.dat (download) 36.7 MB 2024-08-02
x0016.hea (download) 921 B 2024-08-02
x0017.atr (download) 4.9 KB 2024-08-02
x0017.dat (download) 11.0 MB 2024-08-02
x0017.hea (download) 920 B 2024-08-02
x0018.atr (download) 7.0 KB 2024-08-02
x0018.dat (download) 18.9 MB 2024-08-02
x0018.hea (download) 917 B 2024-08-02
x0019.atr (download) 3.6 KB 2024-08-02
x0019.dat (download) 12.1 MB 2024-08-02
x0019.hea (download) 910 B 2024-08-02
x002.atr (download) 48.7 KB 2024-08-02
x002.dat (download) 317.9 MB 2024-08-02
x002.hea (download) 856 B 2024-08-02
x0020.atr (download) 11.5 KB 2024-08-02
x0020.dat (download) 26.7 MB 2024-08-02
x0020.hea (download) 898 B 2024-08-02
x0021.atr (download) 11.9 KB 2024-08-02
x0021.dat (download) 29.4 MB 2024-08-02
x0021.hea (download) 903 B 2024-08-02
x0022.atr (download) 1.1 KB 2024-08-02
x0022.dat (download) 13.1 MB 2024-08-02
x0022.hea (download) 919 B 2024-08-02
x0023.atr (download) 6.5 KB 2024-08-02
x0023.dat (download) 22.1 MB 2024-08-02
x0023.hea (download) 974 B 2024-08-02
x0024.atr (download) 10.6 KB 2024-08-02
x0024.dat (download) 38.3 MB 2024-08-02
x0024.hea (download) 906 B 2024-08-02
x0025.atr (download) 11.8 KB 2024-08-02
x0025.dat (download) 35.4 MB 2024-08-02
x0025.hea (download) 921 B 2024-08-02
x0026.atr (download) 1.9 KB 2024-08-02
x0026.dat (download) 11.0 MB 2024-08-02
x0026.hea (download) 901 B 2024-08-02
x0027.atr (download) 1.4 KB 2024-08-02
x0027.dat (download) 4.1 MB 2024-08-02
x0027.hea (download) 925 B 2024-08-02
x0028.atr (download) 4.8 KB 2024-08-02
x0028.dat (download) 13.5 MB 2024-08-02
x0028.hea (download) 960 B 2024-08-02
x0029.atr (download) 8.9 KB 2024-08-02
x0029.dat (download) 37.7 MB 2024-08-02
x0029.hea (download) 887 B 2024-08-02
x003.atr (download) 105.6 KB 2024-08-02
x003.dat (download) 479.7 MB 2024-08-02
x003.hea (download) 903 B 2024-08-02
x004.atr (download) 120.1 KB 2024-08-02
x004.dat (download) 439.3 MB 2024-08-02
x004.hea (download) 902 B 2024-08-02
x005.atr (download) 4.6 KB 2024-08-02
x005.dat (download) 93.4 MB 2024-08-02
x005.hea (download) 864 B 2024-08-02
x006.atr (download) 51.8 KB 2024-08-02
x006.dat (download) 440.3 MB 2024-08-02
x006.hea (download) 906 B 2024-08-02
x007.atr (download) 1.8 KB 2024-08-02
x007.dat (download) 9.9 MB 2024-08-02
x007.hea (download) 897 B 2024-08-02
x008.atr (download) 6.7 KB 2024-08-02
x008.dat (download) 23.4 MB 2024-08-02
x008.hea (download) 905 B 2024-08-02
x009.atr (download) 23.1 KB 2024-08-02
x009.dat (download) 46.6 MB 2024-08-02
x009.hea (download) 908 B 2024-08-02
x100.atr (download) 5.4 KB 2024-08-02
x100.dat (download) 41.5 MB 2024-08-02
x100.hea (download) 888 B 2024-08-02
x101.atr (download) 3.8 KB 2024-08-02
x101.dat (download) 56.9 MB 2024-08-02
x101.hea (download) 894 B 2024-08-02
x102.atr (download) 4.4 KB 2024-08-02
x102.dat (download) 66.8 MB 2024-08-02
x102.hea (download) 655 B 2024-08-02
x103.atr (download) 7.9 KB 2024-08-02
x103.dat (download) 60.5 MB 2024-08-02
x103.hea (download) 896 B 2024-08-02
x104.atr (download) 13.7 KB 2024-08-02
x104.dat (download) 183.1 MB 2024-08-02
x104.hea (download) 894 B 2024-08-02
x105.atr (download) 11.4 KB 2024-08-02
x105.dat (download) 65.1 MB 2024-08-02
x105.hea (download) 671 B 2024-08-02
x106.atr (download) 5.0 KB 2024-08-02
x106.dat (download) 56.7 MB 2024-08-02
x106.hea (download) 878 B 2024-08-02
x107.atr (download) 4.2 KB 2024-08-02
x107.dat (download) 46.6 MB 2024-08-02
x107.hea (download) 666 B 2024-08-02
x108.atr (download) 9.5 KB 2024-08-02
x108.dat (download) 59.1 MB 2024-08-02
x108.hea (download) 658 B 2024-08-02
x109.atr (download) 4.2 KB 2024-08-02
x109.dat (download) 24.9 MB 2024-08-02
x109.hea (download) 666 B 2024-08-02