• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习在识别不同心脏细胞类型先天性心脏病的标记基因中的应用

Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types.

作者信息

Ma Qinglan, Zhang Yu-Hang, Guo Wei, Feng Kaiyan, Huang Tao, Cai Yu-Dong

机构信息

School of Life Sciences, Shanghai University, Shanghai 200444, China.

Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA.

出版信息

Life (Basel). 2024 Aug 19;14(8):1032. doi: 10.3390/life14081032.

DOI:10.3390/life14081032
PMID:39202774
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11355424/
Abstract

Congenital heart disease (CHD) represents a spectrum of inborn heart defects influenced by genetic and environmental factors. This study advances the field by analyzing gene expression profiles in 21,034 cardiac fibroblasts, 73,296 cardiomyocytes, and 35,673 endothelial cells, utilizing single-cell level analysis and machine learning techniques. Six CHD conditions: dilated cardiomyopathy (DCM), donor hearts (used as healthy controls), hypertrophic cardiomyopathy (HCM), heart failure with hypoplastic left heart syndrome (HF_HLHS), Neonatal Hypoplastic Left Heart Syndrome (Neo_HLHS), and Tetralogy of Fallot (TOF), were investigated for each cardiac cell type. Each cell sample was represented by 29,266 gene features. These features were first analyzed by six feature-ranking algorithms, resulting in several feature lists. Then, these lists were fed into incremental feature selection, containing two classification algorithms, to extract essential gene features and classification rules and build efficient classifiers. The identified essential genes can be potential CHD markers in different cardiac cell types. For instance, the LASSO identified key genes specific to various heart cell types in CHD subtypes. was found to be up-regulated in cardiac fibroblasts for both Dilated and hypertrophic cardiomyopathy. In cardiomyocytes, distinct genes such as , , , , and were linked to dilated cardiomyopathy, Neo-Hypoplastic Left Heart Syndrome, hypertrophic cardiomyopathy, HF-Hypoplastic Left Heart Syndrome, and Tetralogy of Fallot, respectively. Endothelial cell analysis further revealed , , and as significant genes for dilated cardiomyopathy, hypertrophic cardiomyopathy, and Tetralogy of Fallot. LightGBM, Catboost, MCFS, RF, and XGBoost further delineated key genes for specific CHD subtypes, demonstrating the efficacy of machine learning in identifying CHD-specific genes. Additionally, this study developed quantitative rules for representing the gene expression patterns related to CHDs. This research underscores the potential of machine learning in unraveling the molecular complexities of CHD and establishes a foundation for future mechanism-based studies.

摘要

先天性心脏病(CHD)是一系列受遗传和环境因素影响的先天性心脏缺陷。本研究通过利用单细胞水平分析和机器学习技术,对21034个心脏成纤维细胞、73296个心肌细胞和35673个内皮细胞的基因表达谱进行分析,推动了该领域的发展。研究针对六种CHD病症:扩张型心肌病(DCM)、供体心脏(用作健康对照)、肥厚型心肌病(HCM)、左心发育不全综合征伴心力衰竭(HF_HLHS)、新生儿左心发育不全综合征(Neo_HLHS)和法洛四联症(TOF),对每种心脏细胞类型进行了调查。每个细胞样本由29266个基因特征表示。这些特征首先通过六种特征排名算法进行分析,得到几个特征列表。然后,将这些列表输入包含两种分类算法的增量特征选择中,以提取关键基因特征和分类规则并构建高效分类器。所确定的关键基因可能是不同心脏细胞类型中潜在的CHD标志物。例如,套索(LASSO)算法确定了CHD亚型中各种心脏细胞类型特有的关键基因。发现该基因在扩张型和肥厚型心肌病的心脏成纤维细胞中上调。在心肌细胞中,不同的基因如[具体基因1]、[具体基因2]、[具体基因3]、[具体基因4]和[具体基因5]分别与扩张型心肌病、新生儿左心发育不全综合征、肥厚型心肌病、HF-左心发育不全综合征和法洛四联症相关联。内皮细胞分析进一步揭示[具体基因6]、[具体基因7]和[具体基因8]是扩张型心肌病、肥厚型心肌病和法洛四联症的重要基因。LightGBM、Catboost、MCFS、RF和XGBoost进一步描绘了特定CHD亚型的关键基因,证明了机器学习在识别CHD特异性基因方面的有效性。此外,本研究制定了用于表示与CHD相关的基因表达模式的定量规则。这项研究强调了机器学习在揭示CHD分子复杂性方面的潜力,并为未来基于机制的研究奠定了基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/4cbae040ca5e/life-14-01032-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/bacc9ee492a6/life-14-01032-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/aa9711b7e889/life-14-01032-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/b91444f2becc/life-14-01032-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/f884376a34fd/life-14-01032-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/e231a6e6c350/life-14-01032-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/d435f7bfc9bc/life-14-01032-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/49862b34b347/life-14-01032-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/3f61d8d037d3/life-14-01032-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/4cbae040ca5e/life-14-01032-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/bacc9ee492a6/life-14-01032-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/aa9711b7e889/life-14-01032-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/b91444f2becc/life-14-01032-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/f884376a34fd/life-14-01032-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/e231a6e6c350/life-14-01032-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/d435f7bfc9bc/life-14-01032-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/49862b34b347/life-14-01032-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/3f61d8d037d3/life-14-01032-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa8f/11355424/4cbae040ca5e/life-14-01032-g009.jpg

相似文献

1
Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types.机器学习在识别不同心脏细胞类型先天性心脏病的标记基因中的应用
Life (Basel). 2024 Aug 19;14(8):1032. doi: 10.3390/life14081032.
2
Integrated multi-omic characterization of congenital heart disease.先天性心脏病的综合多组学特征分析。
Nature. 2022 Aug;608(7921):181-191. doi: 10.1038/s41586-022-04989-3. Epub 2022 Jun 22.
3
Single-cell reconstruction and mutation enrichment analysis identifies dysregulated cardiomyocyte and endothelial cells in congenital heart disease.单细胞重构和突变富集分析鉴定先天性心脏病中失调的心肌细胞和内皮细胞。
Physiol Genomics. 2023 Dec 1;55(12):634-646. doi: 10.1152/physiolgenomics.00070.2023. Epub 2023 Oct 9.
4
Genetic Association Between Hypoplastic Left Heart Syndrome and Cardiomyopathies.左心发育不全综合征与心肌病的遗传关联。
Circ Genom Precis Med. 2021 Feb;14(1):e003126. doi: 10.1161/CIRCGEN.120.003126. Epub 2020 Dec 16.
5
Tetralogy of Fallot and Hypoplastic Left Heart Syndrome - Complex Clinical Phenotypes Meet Complex Genetic Networks.法洛四联症和左心发育不良综合征——复杂临床表型与复杂遗传网络相遇。
Curr Genomics. 2015 Jun;16(3):141-58. doi: 10.2174/1389202916666150303232520.
6
Predicting Heart Cell Types by Using Transcriptome Profiles and a Machine Learning Method.利用转录组图谱和机器学习方法预测心脏细胞类型
Life (Basel). 2022 Jan 31;12(2):228. doi: 10.3390/life12020228.
7
Cardiac regenerative capacity is age- and disease-dependent in childhood heart disease.儿童心脏病中心脏的再生能力取决于年龄和疾病。
PLoS One. 2018 Jul 25;13(7):e0200342. doi: 10.1371/journal.pone.0200342. eCollection 2018.
8
Palliated Hypoplastic Left Heart Syndrome Patients Experience Superior Waitlist and Comparable Post-Heart Transplant Survival to Non-Single Ventricle Congenital Heart Disease Patients.姑息性左心发育不全综合征患者在等待名单上的情况更佳,且心脏移植后的生存率与非单心室先天性心脏病患者相当。
Semin Thorac Cardiovasc Surg. 2024 Summer;36(2):230-241. doi: 10.1053/j.semtcvs.2022.08.019. Epub 2022 Nov 29.
9
Executive Function in Children and Adolescents with Critical Cyanotic Congenital Heart Disease.患有严重青紫型先天性心脏病的儿童和青少年的执行功能
J Int Neuropsychol Soc. 2015 Jan;21(1):34-49. doi: 10.1017/S1355617714001027. Epub 2014 Dec 9.
10
Cardiac structures in marine animals provide insight on potential directions for interventions for pediatric congenital heart defects.海洋动物的心脏结构为儿科先天性心脏缺陷的干预提供了潜在方向的见解。
Am J Physiol Heart Circ Physiol. 2022 Jan 1;322(1):H1-H7. doi: 10.1152/ajpheart.00451.2021. Epub 2021 Oct 15.

引用本文的文献

1
Transcriptomic analysis and machine learning modeling identifies novel biomarkers and genetic characteristics of hypertrophic cardiomyopathy.转录组分析和机器学习建模确定了肥厚型心肌病的新型生物标志物和遗传特征。
Front Genet. 2025 Jun 17;16:1596049. doi: 10.3389/fgene.2025.1596049. eCollection 2025.

本文引用的文献

1
A human cell atlas of the pressure-induced hypertrophic heart.压力诱导性肥厚心脏的人类细胞图谱。
Nat Cardiovasc Res. 2022 Feb;1(2):174-185. doi: 10.1038/s44161-022-00019-7. Epub 2022 Feb 14.
2
Precision medicine in cardiovascular therapeutics: Evaluating the role of pharmacogenetic analysis prior to drug treatment.心血管治疗中的精准医学:药物治疗前评估药物遗传学分析的作用。
J Intern Med. 2024 May;295(5):583-598. doi: 10.1111/joim.13772. Epub 2024 Feb 11.
3
Improved multi-label classifiers for predicting protein subcellular localization.
改进的多标签分类器用于预测蛋白质亚细胞定位。
Math Biosci Eng. 2024 Jan;21(1):214-236. doi: 10.3934/mbe.2024010. Epub 2022 Dec 11.
4
PredictEFC: a fast and efficient multi-label classifier for predicting enzyme family classes.PredictEFC:一种用于预测酶家族类别的快速高效的多标签分类器。
BMC Bioinformatics. 2024 Jan 30;25(1):50. doi: 10.1186/s12859-024-05665-1.
5
Homozygous variants in two unrelated patients with craniosynostosis and radioulnar synostosis.两名颅缝早闭和桡尺骨融合症患者的同源变异。
J Med Genet. 2024 Mar 21;61(4):363-368. doi: 10.1136/jmg-2023-109151.
6
Identification of key genes associated with persistent immune changes and secondary immune activation responses induced by influenza vaccination after COVID-19 recovery by machine learning methods.利用机器学习方法鉴定与 COVID-19 康复后流感疫苗接种引起的持续免疫变化和二次免疫激活反应相关的关键基因。
Comput Biol Med. 2024 Feb;169:107883. doi: 10.1016/j.compbiomed.2023.107883. Epub 2023 Dec 22.
7
PCDA-HNMP: Predicting circRNA-disease association using heterogeneous network and meta-path.PCDA-HNMP:基于异质网络和元路径预测 circRNA-疾病关联
Math Biosci Eng. 2023 Nov 14;20(12):20553-20575. doi: 10.3934/mbe.2023909.
8
Identification of key gene expression associated with quality of life after recovery from COVID-19.鉴定与 COVID-19 康复后生活质量相关的关键基因表达。
Med Biol Eng Comput. 2024 Apr;62(4):1031-1048. doi: 10.1007/s11517-023-02988-8. Epub 2023 Dec 21.
9
RMTLysPTM: recognizing multiple types of lysine PTM sites by deep analysis on sequences.RMTLysPTM:通过对序列进行深度分析来识别多种类型的赖氨酸翻译后修饰位点
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad450.
10
Patterns of Gene Expression Profiles Associated with Colorectal Cancer in Colorectal Mucosa by Using Machine Learning Methods.利用机器学习方法分析结直肠黏膜中与结直肠癌相关的基因表达谱模式。
Comb Chem High Throughput Screen. 2024;27(19):2921-2934. doi: 10.2174/0113862073266300231026103844.