文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

人工智能在表型-基因型映射中的应用。

The Use of AI for Phenotype-Genotype Mapping.

作者信息

Sharma Jyoti, Goel Prabudh

机构信息

Department of Paediatric Surgery, All India Institute of Medical Sciences, New Delhi, India.

出版信息

Methods Mol Biol. 2025;2952:369-410. doi: 10.1007/978-1-0716-4690-8_21.


DOI:10.1007/978-1-0716-4690-8_21
PMID:40553344
Abstract

The mapping of genotypes to phenotypes is a cornerstone of genetics, critical for understanding disease mechanisms and advancing precision medicine. The advent of next-generation sequencing (NGS) technologies has enabled the generation of extensive genomic datasets, yet the complexity and scale of these data demand innovative analytical approaches. Artificial intelligence (AI) has emerged as a transformative tool, integrating genotype and phenotype data, uncovering intricate patterns, and driving advancements in diagnosis, therapy, and research.AI applications in phenotype-genotype mapping span various machine learning and deep learning techniques. Supervised learning methods, such as Support Vector Machines (SVMs), Random Forests, and Gradient Boosting, predict variant pathogenicity and classify genetic risks by leveraging curated datasets. Unsupervised approaches, including k-Means clustering and hierarchical clustering, uncover hidden patterns in data, enabling the identification of disease subtypes and novel associations. Dimensionality reduction techniques like Principal Component Analysis (PCA) and t-Distributed Stochastic Neighbor Embedding (t-SNE) simplify high-dimensional genomic data for analysis and visualization. Neural networks, including Convolutional and Recurrent Neural Networks (CNNs and RNNs), excel at extracting insights from complex datasets like gene expression profiles and genomic sequences. These methodologies have found applications in rare disease diagnosis, drug discovery, and risk assessment for complex diseases. AI tools integrate genetic and phenotypic data to prioritize pathogenic variants, significantly improving diagnostic yields for unresolved cases. Multi-omic data integration, incorporating genomics, transcriptomics, and proteomics, offers a holistic perspective on genotype-phenotype relationships. In drug discovery, AI identifies therapeutic targets and predicts drug efficacy, accelerating the development of precision treatments.Despite its potential, challenges persist. Data heterogeneity, limited interpretability of AI models, privacy concerns, and insufficient datasets for rare diseases impede broader implementation. To address these issues, AI frameworks incorporate data standardization, explainability techniques like SHAP and LIME, federated learning for secure collaborative research, and data augmentation methods such as transfer learning and GANs. Future directions include the integration of multi-omic data, advanced explainable AI for clinical adoption, and the expansion of federated learning to facilitate cross-institutional collaborations. By bridging the gap between genotype and phenotype, AI-driven methodologies are transforming clinical genomics and personalized medicine. This chapter explores the methodologies, applications, challenges, and future prospects of AI in phenotype-genotype mapping, highlighting its pivotal role in advancing genetic research and improving healthcare outcomes.

摘要

基因型到表型的映射是遗传学的基石,对于理解疾病机制和推动精准医学发展至关重要。下一代测序(NGS)技术的出现使得大量基因组数据集得以生成,但这些数据的复杂性和规模需要创新的分析方法。人工智能(AI)已成为一种变革性工具,它整合基因型和表型数据,揭示复杂模式,并推动诊断、治疗和研究的进步。

AI在表型-基因型映射中的应用涵盖各种机器学习和深度学习技术。监督学习方法,如支持向量机(SVM)、随机森林和梯度提升,通过利用经过整理的数据集来预测变异致病性并对遗传风险进行分类。无监督方法,包括k均值聚类和层次聚类,揭示数据中的隐藏模式,从而能够识别疾病亚型和新的关联。主成分分析(PCA)和t分布随机邻域嵌入(t-SNE)等降维技术简化了高维基因组数据,便于分析和可视化。神经网络,包括卷积神经网络和循环神经网络(CNN和RNN),擅长从基因表达谱和基因组序列等复杂数据集中提取见解。这些方法已应用于罕见病诊断、药物发现和复杂疾病的风险评估。AI工具整合遗传和表型数据,对致病性变异进行优先级排序,显著提高未解决病例的诊断率。多组学数据整合,包括基因组学、转录组学和蛋白质组学,提供了对基因型-表型关系的整体视角。在药物发现中,AI识别治疗靶点并预测药物疗效,加速精准治疗的开发。

尽管具有潜力,但挑战依然存在。数据异质性、AI模型的有限可解释性、隐私问题以及罕见病数据集不足阻碍了更广泛的应用。为了解决这些问题,AI框架纳入了数据标准化、SHAP和LIME等可解释性技术、用于安全协作研究的联邦学习以及迁移学习和生成对抗网络(GAN)等数据增强方法。未来的方向包括多组学数据的整合、用于临床应用的先进可解释AI以及扩展联邦学习以促进跨机构合作。通过弥合基因型和表型之间的差距,AI驱动的方法正在改变临床基因组学和个性化医学。本章探讨了AI在表型-基因型映射中的方法、应用、挑战和未来前景,突出了其在推进遗传研究和改善医疗结果方面的关键作用。

相似文献

[1]
The Use of AI for Phenotype-Genotype Mapping.

Methods Mol Biol. 2025

[2]
Deep Genomics: Deep Learning-Based Analysis of Genome-Sequenced Data for Identification of Gene Alterations.

Methods Mol Biol. 2025

[3]
Advancements in AI for Computational Biology and Bioinformatics: A Comprehensive Review.

Methods Mol Biol. 2025

[4]
AI-Driven Antimicrobial Peptide Discovery: Mining and Generation.

Acc Chem Res. 2025-6-17

[5]
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.

Br J Dermatol. 2024-7-16

[6]
The dawn of a new era: can machine learning and large language models reshape QSP modeling?

J Pharmacokinet Pharmacodyn. 2025-6-16

[7]
AI in Medical Questionnaires: Innovations, Diagnosis, and Implications.

J Med Internet Res. 2025-6-23

[8]
Emerging research trends in artificial intelligence for cancer diagnostic systems: A comprehensive review.

Heliyon. 2024-8-23

[9]
Chemical imaging for biological systems: techniques, AI-driven processing, and applications.

J Mater Chem B. 2025-6-18

[10]
Machine learning in oral squamous cell carcinoma: Current status, clinical concerns and prospects for future-A systematic review.

Artif Intell Med. 2021-5

本文引用的文献

[1]
Deep learning-based approaches for multi-omics data integration and analysis.

BioData Min. 2024-10-2

[2]
Ensemble and optimization algorithm in support vector machines for classification of wheat genotypes.

Sci Rep. 2024-9-30

[3]
GNN4DM: a graph neural network-based method to identify overlapping functional disease modules.

Bioinformatics. 2024-10-1

[4]
Deep learning approaches for non-coding genetic variant effect prediction: current progress and future prospects.

Brief Bioinform. 2024-7-25

[5]
Exploring the Genotype-Phenotype Correlations in a Child with Inherited Seizure and Thrombocytopenia by Digenic Network Analysis.

Genes (Basel). 2024-7-31

[6]
Computational identification of disease models through cross-species phenotype comparison.

Dis Model Mech. 2024-6-1

[7]
Artificial intelligence for cardiovascular disease risk assessment in personalised framework: a scoping review.

EClinicalMedicine. 2024-5-27

[8]
Diagnostic yield of exome and genome sequencing after non-diagnostic multi-gene panels in patients with single-system diseases.

Orphanet J Rare Dis. 2024-5-24

[9]
Increased Diagnostic Yield by Reanalysis of Whole Exome Sequencing Data in Mitochondrial Disease.

J Neuromuscul Dis. 2024

[10]
Characterization of CD34 Cells from Patients with Acute Myeloid Leukemia (AML) and Myelodysplastic Syndromes (MDS) Using a t-Distributed Stochastic Neighbor Embedding (t-SNE) Protocol.

Cancers (Basel). 2024-3-28

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索