A systematic review on machine learning approaches in the diagnosis and prognosis of rare genetic diseases.

Author affiliations

Division of Otolaryngology, Department of Surgery, Instituto de Investigación Biosanitaria, ibs.GRANADA, Universidad de Granada, Granada, Spain; Otology and Neurotology Group CTS495, Department of Genomic Medicine, GENYO - Centre for Genomics and Oncological Research - Pfizer, University of Granada, Junta de Andalucía, PTS, Granada, Spain; Sensorineural Pathology Programme, Centro de Investigación Biomédica en Red en Enfermedades Raras, CIBERER, Madrid, Spain.

Meniere's Disease Neuroscience Research Program, Faculty of Medicine & Health, School of Medical Sciences, The Kolling Institute, University of Sydney, Sydney, New South Wales, Australia; Division of Otolaryngology, Department of Surgery, Instituto de Investigación Biosanitaria, ibs.GRANADA, Universidad de Granada, Granada, Spain; Otology and Neurotology Group CTS495, Department of Genomic Medicine, GENYO - Centre for Genomics and Oncological Research - Pfizer, University of Granada, Junta de Andalucía, PTS, Granada, Spain; Sensorineural Pathology Programme, Centro de Investigación Biomédica en Red en Enfermedades Raras, CIBERER, Madrid, Spain.

Publication details

J Biomed Inform. 2023 Jul;143:104429. doi: 10.1016/j.jbi.2023.104429. Epub 2023 Jun 22.

DOI: 10.1016/j.jbi.2023.104429
PMID: 37352901
Abstract

BACKGROUND

The diagnosis of rare genetic diseases is often challenging due to the complexity of the genetic underpinnings of these conditions and the limited availability of diagnostic tools. Machine learning (ML) algorithms have the potential to improve the accuracy and speed of diagnosis by analyzing large amounts of genomic data and identifying complex multiallelic patterns that may be associated with specific diseases. In this systematic review, we aimed to identify the methodological trends and the ML application areas in rare genetic diseases.

METHODS

We performed a systematic review of the literature following the PRISMA guidelines to search for studies that used ML approaches to enhance the diagnosis of rare genetic diseases. Studies that used DNA-based sequencing data and a variety of ML algorithms were included, summarized, and analyzed using bibliometric methods, visualization tools, and a feature co-occurrence analysis.

FINDINGS

Our search identified 22 studies that met the inclusion criteria. We found that exome sequencing was the most frequently used sequencing technology (59%), and rare neoplastic diseases were the most prevalent disease scenario (59%). In rare neoplasms, the most frequent applications of ML models were the differential diagnosis or stratification of patients (38.5%) and the identification of somatic mutations (30.8%). In other rare diseases, the most frequent goals were the prioritization of rare variants or genes (55.5%) and the identification of biallelic or digenic inheritance (33.3%). The most employed method was the random forest algorithm (54.5%). In addition, the features of the datasets needed for training these algorithms were distinctive depending on the goal pursued, including the mutational load in each gene for the differential diagnosis of patients, or the combination of genotype features and sequence-derived features (such as GC-content) for the identification of somatic mutations.
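
The two feature types named above (per-gene mutational load and a sequence-derived feature such as GC-content) can be illustrated with a minimal sketch. This is not taken from any of the reviewed studies: the function names, the gene labels, and the toy variants are all hypothetical, and real pipelines would derive these values from annotated sequencing data.

```python
from collections import Counter


def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence (0.0 if empty)."""
    seq = sequence.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def mutational_load(variants, genes):
    """Count variants per gene, returned in the order given by `genes`."""
    counts = Counter(gene for gene, _ in variants)
    return [counts.get(gene, 0) for gene in genes]


# Hypothetical toy input: (gene, variant) pairs observed in one patient.
patient_variants = [
    ("GENE_A", "c.10A>G"),
    ("GENE_A", "c.52C>T"),
    ("GENE_C", "c.7G>A"),
]
gene_panel = ["GENE_A", "GENE_B", "GENE_C"]

# One row of a feature matrix for differential diagnosis: variants per gene.
load_vector = mutational_load(patient_variants, gene_panel)

# A sequence-derived feature for a (hypothetical) window around a candidate variant.
window_gc = gc_content("ATGCGC")
```

Vectors like `load_vector`, one per patient, could serve as rows of a training matrix for a classifier such as a random forest, while values like `window_gc` would be attached to individual candidate variants alongside genotype features.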

CONCLUSIONS

ML algorithms based on sequencing data are mainly used for the diagnosis of rare neoplastic diseases, with random forest being the most common approach. We identified key features in the datasets used for training these ML models according to the objective pursued. These features can support the development of future ML models in the diagnosis of rare genetic diseases.
