Zhao Sally, Ye Zhan, Adhin Bhavna, Vuori Matti, Laukkanen Jari, Fisch Sudeshna
Pfizer (United States), New York, United States.
Pfizer (United States), 1 Portland St, Boston, MA, United States, 1 6175513000.
JMIR Med Inform. 2025 Aug 12;13:e73353. doi: 10.2196/73353.
The heart and kidneys have vital functions in the human body that reciprocally influence each other physiologically. Pathological changes in 1 organ can damage the other. Epidemiologic studies show that greater than 50% of patients with heart failure (HF) have preserved ejection fraction (HFpEF). Additionally, 1 in 6 patients identified as having chronic kidney disease (CKD) also has HF. Thus, it is important to be able to predict and identify the cardiorenal relationship between HFpEF and CKD.
Creating an electrocardiogram (ECG)-enabled model that stratifies suspected patients with HFpEF would help identify CKD-enriched HFpEF clusters and phenogroups. Simultaneously, a minimal set of significant ECG features derived from the stratification model would aid precision medicine and practical diagnoses due to being more accessible and widely readable than a large set of clinical inputs. Furthermore, the validation of the existing cardiorenal relationship using this ECG-enabled model may lead to better biological understanding.
Using unsupervised clustering on all extractable ECG features from FinnGen, patients with an indication of HFpEF (filtered by left ventricular ejection fraction [LVEF] values ≥50% and N-terminal pro B-type natriuretic peptide [NT-proBNP] values >450 pg/mL) were categorized into different phenogroups and analyzed for CKD risk. After isolating significant predictive ECG features, unsupervised clustering and risk analysis were performed again to demonstrate the efficacy of using a minimal set of features for phenogrouping. These clusters were then compared to clusters formed using dynamic time warping (DTW) on raw ECG time series electrical signals. Afterward, these clusters were analyzed for CKD enrichment.
The PR interval and QRS duration stood out as significant features and were used as the minimal feature set. After generating and comparing clusters (k-means with all extracted ECG features, k-means with a minimal feature set, and DTW with full lead II ECG waveform), the DTW-generated clusters were most stable. ANOVA analysis also showed that several HFpEF clusters exhibited a deviation of CKD risk from baseline, allowing for further trajectory analysis. Specifically, the creatinine levels (a proxy for CKD) of several DTW-created clusters showed significant differences from the average. Based on the Jaccard score, the DTW clusters also showed the greatest alignment to baseline comparison clusters created by clustering on creatinine. In comparison, the other 2 sets of clusters (created by all extracted ECG features and the minimal set) performed similarly.
This project validates both the known cardiorenal relationship between HFpEF and CKD and the importance of the PR interval and QRS duration. After exploring the use of ECG data for patient clustering and stratification, DTW clustering with lead II waveforms resulted in the most clinically meaningful clusters in the context of HFpEF and CKD. This methodology may prove useful in exploring ECG clustering applications outside of HFpEF as well.
心脏和肾脏在人体中具有重要功能,在生理上相互影响。一个器官的病理变化会损害另一个器官。流行病学研究表明,超过50%的射血分数保留型心力衰竭(HFpEF)患者存在这种情况。此外,每6名被确诊患有慢性肾脏病(CKD)的患者中就有1人患有HFpEF。因此,能够预测和识别HFpEF与CKD之间的心肾关系非常重要。
创建一个基于心电图(ECG)的模型,对疑似HFpEF患者进行分层,这将有助于识别富含CKD的HFpEF簇和表型组。同时,从分层模型中得出的一组最少的重要ECG特征,由于比大量临床输入更易于获取且更具广泛可读性,将有助于精准医学和实际诊断。此外,使用这个基于ECG的模型对现有的心肾关系进行验证,可能会带来更好的生物学理解。
对来自芬兰基因库(FinnGen)的所有可提取ECG特征进行无监督聚类,有HFpEF迹象的患者(通过左心室射血分数[LVEF]值≥50%和N端前脑钠肽[NT-proBNP]值>450 pg/mL筛选)被分类到不同的表型组,并分析其CKD风险。在分离出重要的预测性ECG特征后,再次进行无监督聚类和风险分析,以证明使用一组最少的特征进行表型分组的有效性。然后将这些簇与在原始ECG时间序列电信号上使用动态时间规整(DTW)形成的簇进行比较。之后,对这些簇进行CKD富集分析。
PR间期和QRS时限作为显著特征脱颖而出,并被用作最少特征集。在生成并比较簇(使用所有提取的ECG特征的k均值聚类、使用最少特征集的k均值聚类以及使用完整II导联ECG波形的DTW聚类)后,DTW生成的簇最稳定。方差分析还表明,几个HFpEF簇的CKD风险与基线存在偏差,从而可以进行进一步的轨迹分析。具体而言,几个由DTW创建的簇的肌酐水平(CKD的一个替代指标)与平均值存在显著差异。基于杰卡德评分(Jaccard score),DTW簇与通过对肌酐进行聚类创建的基线比较簇的一致性也最高。相比之下,另外两组簇(由所有提取特征和最少特征集创建)表现相似。
该项目验证了HFpEF与CKD之间已知的心肾关系以及PR间期和QRS时限的重要性。在探索将ECG数据用于患者聚类和分层后,使用II导联波形的DTW聚类在HFpEF和CKD背景下产生了最具临床意义的簇。这种方法可能在探索HFpEF之外的ECG聚类应用中也有用。