• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用机器学习减少新生儿筛查中的假阳性结果

Reducing False-Positive Results in Newborn Screening Using Machine Learning.

作者信息

Peng Gang, Tang Yishuo, Cowan Tina M, Enns Gregory M, Zhao Hongyu, Scharfe Curt

机构信息

Department of Genetics, Yale University School of Medicine, New Haven, CT 06520, USA.

Department of Biostatistics, Yale University School of Public Health, New Haven, CT 06520, USA.

出版信息

Int J Neonatal Screen. 2020 Mar;6(1). doi: 10.3390/ijns6010016. Epub 2020 Mar 3.

DOI:10.3390/ijns6010016
PMID:32190768
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7080200/
Abstract

Newborn screening (NBS) for inborn metabolic disorders is a highly successful public health program that by design is accompanied by false-positive results. Here we trained a Random Forest machine learning classifier on screening data to improve prediction of true and false positives. Data included 39 metabolic analytes detected by tandem mass spectrometry and clinical variables such as gestational age and birth weight. Analytical performance was evaluated for a cohort of 2777 screen positives reported by the California NBS program, which consisted of 235 confirmed cases and 2542 false positives for one of four disorders: glutaric acidemia type 1 (GA-1), methylmalonic acidemia (MMA), ornithine transcarbamylase deficiency (OTCD), and very long-chain acyl-CoA dehydrogenase deficiency (VLCADD). Without changing the sensitivity to detect these disorders in screening, Random Forest-based analysis of all metabolites reduced the number of false positives for GA-1 by 89%, for MMA by 45%, for OTCD by 98%, and for VLCADD by 2%. All primary disease markers and previously reported analytes such as methionine for MMA and OTCD were among the top-ranked analytes. Random Forest's ability to classify GA-1 false positives was found similar to results obtained using Clinical Laboratory Integrated Reports (CLIR). We developed an online Random Forest tool for interpretive analysis of increasingly complex data from newborn screening.

摘要

针对先天性代谢紊乱的新生儿筛查(NBS)是一项非常成功的公共卫生项目,但从设计上来说,它会伴随出现假阳性结果。在此,我们基于筛查数据训练了一个随机森林机器学习分类器,以改进对真阳性和假阳性的预测。数据包括通过串联质谱法检测到的39种代谢分析物以及诸如胎龄和出生体重等临床变量。对加利福尼亚新生儿筛查项目报告的2777例筛查阳性病例队列的分析性能进行了评估,该队列包括235例确诊病例以及四种疾病之一的2542例假阳性病例,这四种疾病分别为:1型戊二酸血症(GA - 1)、甲基丙二酸血症(MMA)、鸟氨酸转氨甲酰酶缺乏症(OTCD)和极长链酰基辅酶A脱氢酶缺乏症(VLCADD)。在不改变筛查中检测这些疾病敏感性的情况下,基于随机森林对所有代谢物的分析将GA - 1的假阳性数量减少了89%,MMA减少了45%,OTCD减少了98%,VLCADD减少了2%。所有主要疾病标志物以及先前报道的分析物,如MMA和OTCD的蛋氨酸,都在排名靠前的分析物之中。发现随机森林对GA - 1假阳性的分类能力与使用临床实验室综合报告(CLIR)获得的结果相似。我们开发了一个在线随机森林工具,用于对来自新生儿筛查的日益复杂的数据进行解释性分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c2f/7422977/795eb3c6e8b0/IJNS-06-00016-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c2f/7422977/985cbfd0c95f/IJNS-06-00016-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c2f/7422977/be205591b4b0/IJNS-06-00016-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c2f/7422977/5969bc3ea550/IJNS-06-00016-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c2f/7422977/795eb3c6e8b0/IJNS-06-00016-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c2f/7422977/985cbfd0c95f/IJNS-06-00016-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c2f/7422977/be205591b4b0/IJNS-06-00016-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c2f/7422977/5969bc3ea550/IJNS-06-00016-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c2f/7422977/795eb3c6e8b0/IJNS-06-00016-g004.jpg

相似文献

1
Reducing False-Positive Results in Newborn Screening Using Machine Learning.使用机器学习减少新生儿筛查中的假阳性结果
Int J Neonatal Screen. 2020 Mar;6(1). doi: 10.3390/ijns6010016. Epub 2020 Mar 3.
2
Validation of a targeted metabolomics panel for improved second-tier newborn screening.靶向代谢组学 panel 用于提高二级新生儿筛查的验证。
J Inherit Metab Dis. 2023 Mar;46(2):194-205. doi: 10.1002/jimd.12591. Epub 2023 Feb 2.
3
Ethnic variability in newborn metabolic screening markers associated with false-positive outcomes.与假阳性结果相关的新生儿代谢筛查标志物的种族变异性。
J Inherit Metab Dis. 2020 Sep;43(5):934-943. doi: 10.1002/jimd.12236. Epub 2020 Apr 17.
4
Combining newborn metabolic and DNA analysis for second-tier testing of methylmalonic acidemia.将新生儿代谢和 DNA 分析相结合,进行甲基丙二酸血症的二级检测。
Genet Med. 2019 Apr;21(4):896-903. doi: 10.1038/s41436-018-0272-5. Epub 2018 Sep 13.
5
Infants suspected to have very-long chain acyl-CoA dehydrogenase deficiency from newborn screening.新生儿筛查疑诊极长链酰基辅酶 A 脱氢酶缺乏症的婴儿。
Mol Genet Metab. 2014 Apr;111(4):484-92. doi: 10.1016/j.ymgme.2014.01.009. Epub 2014 Jan 23.
6
Prevalence and mutation analysis of short/branched chain acyl-CoA dehydrogenase deficiency (SBCADD) detected on newborn screening in Wisconsin.威斯康星州新生儿筛查中发现的短链/支链酰基辅酶 A 脱氢酶缺乏症 (SBCADD) 的流行率和突变分析。
Mol Genet Metab. 2013 Sep-Oct;110(1-2):111-5. doi: 10.1016/j.ymgme.2013.03.021. Epub 2013 Apr 15.
7
Diversity in the incidence and spectrum of organic acidemias, fatty acid oxidation disorders, and amino acid disorders in Asian countries: Selective screening vs. expanded newborn screening.亚洲国家有机酸血症、脂肪酸氧化障碍和氨基酸障碍的发病率及谱系多样性:选择性筛查与扩大新生儿筛查
Mol Genet Metab Rep. 2018 May 21;16:5-10. doi: 10.1016/j.ymgmr.2018.05.003. eCollection 2018 Sep.
8
Improving the second-tier classification of methylmalonic acidemia patients using a machine learning ensemble method.利用机器学习集成方法改善甲基丙二酸血症患者的二级分类。
World J Pediatr. 2024 Oct;20(10):1090-1101. doi: 10.1007/s12519-023-00788-6. Epub 2024 Feb 24.
9
Analysis of 2-methylcitric acid, methylmalonic acid, and total homocysteine in dried blood spots by LC-MS/MS for application in the newborn screening laboratory: A dual derivatization approach.通过液相色谱-串联质谱法分析干血斑中的2-甲基柠檬酸、甲基丙二酸和总同型半胱氨酸在新生儿筛查实验室中的应用:一种双重衍生化方法。
J Mass Spectrom Adv Clin Lab. 2021 Mar 17;20:1-10. doi: 10.1016/j.jmsacl.2021.03.001. eCollection 2021 Apr.
10
Heptadecanoylcarnitine (C17) a novel candidate biomarker for newborn screening of propionic and methylmalonic acidemias.十七烷酰肉碱(C17):一种用于丙酸血症和甲基丙二酸血症新生儿筛查的新型候选生物标志物。
Clin Chim Acta. 2015 Oct 23;450:342-8. doi: 10.1016/j.cca.2015.09.012. Epub 2015 Sep 11.

引用本文的文献

1
Effective algorithm to differentiate NBS MCADD cases from carriers and non-carriers and an assessment of the utility of the second newborn screen for MCADD.区分新生儿筛查中甲基丙二酸血症(MCADD)病例与携带者及非携带者的有效算法,以及对MCADD第二次新生儿筛查效用的评估。
Mol Genet Metab. 2025 Aug;145(4):109183. doi: 10.1016/j.ymgme.2025.109183. Epub 2025 Jun 25.
2
Propionyl Carnitine Metabolic Profile: Optimizing the Newborn Screening Strategy Through Customized Cut-Offs.丙酰肉碱代谢谱:通过定制临界值优化新生儿筛查策略
Metabolites. 2025 May 6;15(5):308. doi: 10.3390/metabo15050308.
3
Integrating Machine Learning and Follow-Up Variables to Improve Early Detection of Hepatocellular Carcinoma in Tyrosinemia Type 1: A Multicenter Study.

本文引用的文献

1
Misclassification of VLCAD carriers due to variable confirmatory testing after a positive NBS result.由于新生儿筛查(NBS)结果呈阳性后确证检测的可变性,导致极长链酰基辅酶A脱氢酶(VLCAD)携带者的误诊。
J Community Genet. 2019 Oct;10(4):447-451. doi: 10.1007/s12687-019-00409-8. Epub 2019 Feb 5.
2
Urinary metabolomics reveals unique metabolic signatures in infants with cystic fibrosis.尿代谢组学揭示囊性纤维化婴儿的独特代谢特征。
J Cyst Fibros. 2019 Jul;18(4):507-515. doi: 10.1016/j.jcf.2018.10.016. Epub 2018 Nov 23.
3
Combining newborn metabolic and DNA analysis for second-tier testing of methylmalonic acidemia.
整合机器学习与随访变量以改善1型酪氨酸血症中肝细胞癌的早期检测:一项多中心研究
Int J Mol Sci. 2025 Apr 18;26(8):3839. doi: 10.3390/ijms26083839.
4
Beyond Targeted Newborn Screening: A Nontargeted Metabolomics Workflow to Investigate Birthweight-Metabolome Correlations.超越靶向新生儿筛查:一种用于研究出生体重与代谢组相关性的非靶向代谢组学工作流程。
Anal Chem. 2025 Apr 1;97(12):6563-6570. doi: 10.1021/acs.analchem.4c06061. Epub 2025 Mar 18.
5
Digital-Tier Strategy Improves Newborn Screening for Glutaric Aciduria Type 1.数字层级策略改善1型戊二酸血症的新生儿筛查。
Int J Neonatal Screen. 2024 Dec 21;10(4):83. doi: 10.3390/ijns10040083.
6
Improving methylmalonic acidemia (MMA) screening and MMA genotype prediction using random forest classifier in two Chinese populations.应用随机森林分类器在中国两个群体中提高甲基丙二酸血症(MMA)的筛查和 MMA 基因型预测。
Eur J Med Res. 2024 Nov 10;29(1):540. doi: 10.1186/s40001-024-02115-9.
7
Prediction of inherited metabolic disorders using tandem mass spectrometry data with the help of artificial neural networks.利用人工神经网络结合串联质谱数据预测遗传代谢障碍。
Turk J Med Sci. 2024 Jul 12;54(4):710-717. doi: 10.55730/1300-0144.5840. eCollection 2024.
8
The Multi-Omic Approach to Newborn Screening: Opportunities and Challenges.新生儿筛查的多组学方法:机遇与挑战。
Int J Neonatal Screen. 2024 Jun 21;10(3):42. doi: 10.3390/ijns10030042.
9
Current Status of Newborn Bloodspot Screening Worldwide 2024: A Comprehensive Review of Recent Activities (2020-2023).《2024年全球新生儿血斑筛查现状:2020 - 2023年近期活动综合回顾》
Int J Neonatal Screen. 2024 May 23;10(2):38. doi: 10.3390/ijns10020038.
10
Newborn Screening for Inborn Errors of Metabolism by Next-Generation Sequencing Combined with Tandem Mass Spectrometry.通过下一代测序结合串联质谱法进行先天性代谢缺陷的新生儿筛查。
Int J Neonatal Screen. 2024 Mar 29;10(2):28. doi: 10.3390/ijns10020028.
将新生儿代谢和 DNA 分析相结合,进行甲基丙二酸血症的二级检测。
Genet Med. 2019 Apr;21(4):896-903. doi: 10.1038/s41436-018-0272-5. Epub 2018 Sep 13.
4
A Machine Learning Application Based in Random Forest for Integrating Mass Spectrometry-Based Metabolomic Data: A Simple Screening Method for Patients With Zika Virus.一种基于随机森林的机器学习应用,用于整合基于质谱的代谢组学数据:一种针对寨卡病毒患者的简单筛查方法。
Front Bioeng Biotechnol. 2018 Apr 11;6:31. doi: 10.3389/fbioe.2018.00031. eCollection 2018.
5
Precision newborn screening for lysosomal disorders.精确新生儿溶酶体贮积症筛查。
Genet Med. 2018 Aug;20(8):847-854. doi: 10.1038/gim.2017.194. Epub 2017 Nov 9.
6
Moonlighting newborn screening markers: the incidental discovery of a second-tier test for Pompe disease.兼职新生儿筛查标志物:庞贝病二线检测的偶然发现。
Genet Med. 2018 Aug;20(8):840-846. doi: 10.1038/gim.2017.190. Epub 2017 Nov 2.
7
The Newborn Screening Paradox: Sensitivity vs. Overdiagnosis in VLCAD Deficiency.新生儿筛查悖论:极长链酰基辅酶A脱氢酶缺乏症中的敏感性与过度诊断
JIMD Rep. 2016;27:101-6. doi: 10.1007/8904_2015_476. Epub 2015 Oct 10.
8
Continuous age- and sex-adjusted reference intervals of urinary markers for cerebral creatine deficiency syndromes: a novel approach to the definition of reference intervals.连续的年龄和性别调整的脑肌酸缺乏综合征尿液标志物参考区间:定义参考区间的新方法。
Clin Chem. 2015 May;61(5):760-8. doi: 10.1373/clinchem.2014.235564. Epub 2015 Mar 10.
9
Postanalytical tools improve performance of newborn screening by tandem mass spectrometry.分析后工具可提高串联质谱法新生儿筛查的性能。
Genet Med. 2014 Dec;16(12):889-95. doi: 10.1038/gim.2014.62. Epub 2014 May 29.
10
Infants suspected to have very-long chain acyl-CoA dehydrogenase deficiency from newborn screening.新生儿筛查疑诊极长链酰基辅酶 A 脱氢酶缺乏症的婴儿。
Mol Genet Metab. 2014 Apr;111(4):484-92. doi: 10.1016/j.ymgme.2014.01.009. Epub 2014 Jan 23.