Suppr超能文献

基于网络的代谢性疾病新生儿筛查系统:机器学习与临床医生的比较

Web-based newborn screening system for metabolic diseases: machine learning versus clinicians.

作者信息

Chen Wei-Hsin, Hsieh Sheau-Ling, Hsu Kai-Ping, Chen Han-Ping, Su Xing-Yu, Tseng Yi-Ju, Chien Yin-Hsiu, Hwu Wuh-Liang, Lai Feipei

机构信息

National Taiwan University, Graduate Institute of Biomedical Electronics and Bioinformatics, Taipei, Taiwan.

出版信息

J Med Internet Res. 2013 May 23;15(5):e98. doi: 10.2196/jmir.2495.

Abstract

BACKGROUND

A hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification.

OBJECTIVE

The objective of this study is to describe a system that enhanced the neonatal screening system of the Newborn Screening Center at the National Taiwan University Hospital. The system was designed and deployed according to a service-oriented architecture (SOA) framework under the Web services .NET environment. The system consists of sample collection, testing, diagnosis, evaluation, treatment, and follow-up services among collaborating hospitals. To improve the accuracy of newborn screening, machine learning and optimal feature selection mechanisms were investigated for screening newborns for inborn errors of metabolism.

METHODS

The framework of the Newborn Screening Hospital Information System (NSHIS) used the embedded Health Level Seven (HL7) standards for data exchanges among heterogeneous platforms integrated by Web services in the C# language. In this study, machine learning classification was used to predict phenylketonuria (PKU), hypermethioninemia, and 3-methylcrotonyl-CoA-carboxylase (3-MCC) deficiency. The classification methods used 347,312 newborn dried blood samples collected at the Center between 2006 and 2011. Of these, 220 newborns had values over the diagnostic cutoffs (positive cases) and 1557 had values that were over the screening cutoffs but did not meet the diagnostic cutoffs (suspected cases). The original 35 analytes and the manifested features were ranked based on F score, then combinations of the top 20 ranked features were selected as input features to support vector machine (SVM) classifiers to obtain optimal feature sets. These feature sets were tested using 5-fold cross-validation and optimal models were generated. The datasets collected in year 2011 were used as predicting cases.

RESULTS

The feature selection strategies were implemented and the optimal markers for PKU, hypermethioninemia, and 3-MCC deficiency were obtained. The results of the machine learning approach were compared with the cutoff scheme. The number of the false positive cases were reduced from 21 to 2 for PKU, from 30 to 10 for hypermethioninemia, and 209 to 46 for 3-MCC deficiency.

CONCLUSIONS

This SOA Web service-based newborn screening system can accelerate screening procedures effectively and efficiently. An SVM learning methodology for PKU, hypermethioninemia, and 3-MCC deficiency metabolic diseases classification, including optimal feature selection strategies, is presented. By adopting the results of this study, the number of suspected cases could be reduced dramatically.

摘要

背景

医院和家长通常需要一个能够整合筛查数据及数据解读功能的医院信息系统(HIS)。然而,由于疾病特征和用于分类的分析物,疾病分类的准确性可能较低。

目的

本研究的目的是描述一种增强台湾大学医院新生儿筛查中心新生儿筛查系统的系统。该系统是在Web服务.NET环境下根据面向服务的架构(SOA)框架设计和部署的。该系统包括合作医院之间的样本采集、检测、诊断、评估、治疗和随访服务。为提高新生儿筛查的准确性,研究了机器学习和最优特征选择机制,用于筛查新生儿先天性代谢缺陷。

方法

新生儿筛查医院信息系统(NSHIS)的框架使用嵌入式卫生信息交换标准(HL7),以C#语言在由Web服务集成的异构平台之间进行数据交换。在本研究中,使用机器学习分类来预测苯丙酮尿症(PKU)、高甲硫氨酸血症和3-甲基巴豆酰辅酶A羧化酶(3-MCC)缺乏症。分类方法使用了2006年至2011年期间在该中心采集的347312份新生儿干血样本。其中,220名新生儿的值超过诊断临界值(阳性病例),1557名新生儿的值超过筛查临界值但未达到诊断临界值(疑似病例)。根据F分数对最初的35种分析物和表现出的特征进行排名,然后选择排名前20的特征组合作为支持向量机(SVM)分类器的输入特征,以获得最优特征集。使用5折交叉验证对这些特征集进行测试,并生成最优模型。将2011年收集的数据集用作预测病例。

结果

实施了特征选择策略,获得了PKU、高甲硫氨酸血症和3-MCC缺乏症的最优标志物。将机器学习方法的结果与临界值方案进行了比较。PKU的假阳性病例数从21例减少到2例,高甲硫氨酸血症从30例减少到10例,3-MCC缺乏症从209例减少到46例。

结论

这种基于SOA Web服务的新生儿筛查系统可以有效且高效地加速筛查程序。提出了一种用于PKU、高甲硫氨酸血症和3-MCC缺乏症代谢疾病分类的SVM学习方法,包括最优特征选择策略。通过采用本研究的结果,可以显著减少疑似病例的数量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e462/3668606/12847ff7ecb3/jmir_v15i5e98_fig1.jpg

相似文献

1
Web-based newborn screening system for metabolic diseases: machine learning versus clinicians.
J Med Internet Res. 2013 May 23;15(5):e98. doi: 10.2196/jmir.2495.
2
A newborn screening system based on service-oriented architecture embedded support vector machine.
J Med Syst. 2010 Oct;34(5):899-907. doi: 10.1007/s10916-009-9305-6. Epub 2009 May 5.
3
Newborn screening healthcare information system based on service-oriented architecture.
J Med Syst. 2010 Aug;34(4):519-30. doi: 10.1007/s10916-009-9265-x. Epub 2009 Mar 24.
4
Nationwide survey of extended newborn screening by tandem mass spectrometry in Taiwan.
J Inherit Metab Dis. 2010 Oct;33(Suppl 2):S295-305. doi: 10.1007/s10545-010-9129-z. Epub 2010 Jun 22.
6
Supervised machine learning techniques for the classification of metabolic disorders in newborns.
Bioinformatics. 2004 Nov 22;20(17):2985-96. doi: 10.1093/bioinformatics/bth343. Epub 2004 Jun 4.
7
Bloodspot acylcarnitine and amino acid analysis in cord blood samples: efficacy and reference data from a large cohort study.
J Inherit Metab Dis. 2009 Feb;32(1):95-101. doi: 10.1007/s10545-008-1047-y. Epub 2009 Jan 13.

引用本文的文献

4
Validation of a targeted metabolomics panel for improved second-tier newborn screening.
J Inherit Metab Dis. 2023 Mar;46(2):194-205. doi: 10.1002/jimd.12591. Epub 2023 Feb 2.
5
Random forest classifier improving phenylketonuria screening performance in two Chinese populations.
Front Mol Biosci. 2022 Oct 11;9:986556. doi: 10.3389/fmolb.2022.986556. eCollection 2022.
6
Opportunities and challenges in machine learning-based newborn screening-A systematic literature review.
JIMD Rep. 2022 Mar 23;63(3):250-261. doi: 10.1002/jmd2.12285. eCollection 2022 May.
7
Machine and cognitive intelligence for human health: systematic review.
Brain Inform. 2022 Feb 12;9(1):5. doi: 10.1186/s40708-022-00153-9.
8
Severity modeling of propionic acidemia using clinical and laboratory biomarkers.
Genet Med. 2021 Aug;23(8):1534-1542. doi: 10.1038/s41436-021-01173-2. Epub 2021 May 18.
9
Improving the Diagnosis of Phenylketonuria by Using a Machine Learning-Based Screening Model of Neonatal MRM Data.
Front Mol Biosci. 2020 Jul 7;7:115. doi: 10.3389/fmolb.2020.00115. eCollection 2020.
10
Applications of Machine Learning in miRNA Discovery and Target Prediction.
Curr Genomics. 2019 Dec;20(8):537-544. doi: 10.2174/1389202921666200106111813.

本文引用的文献

2
Enhanced interpretation of newborn screening results without analyte cutoff values.
Genet Med. 2012 Jul;14(7):648-55. doi: 10.1038/gim.2012.2. Epub 2012 Feb 16.
4
Adenosine kinase deficiency disrupts the methionine cycle and causes hypermethioninemia, encephalopathy, and abnormal liver function.
Am J Hum Genet. 2011 Oct 7;89(4):507-15. doi: 10.1016/j.ajhg.2011.09.004. Epub 2011 Sep 28.
7
Acylcarnitines: analysis in plasma and whole blood using tandem mass spectrometry.
Methods Mol Biol. 2011;708:55-72. doi: 10.1007/978-1-61737-985-7_3.
8
Determination of total homocysteine, methylmalonic acid, and 2-methylcitric acid in dried blood spots by tandem mass spectrometry.
Clin Chem. 2010 Nov;56(11):1686-95. doi: 10.1373/clinchem.2010.148957. Epub 2010 Aug 31.
9
Nationwide survey of extended newborn screening by tandem mass spectrometry in Taiwan.
J Inherit Metab Dis. 2010 Oct;33(Suppl 2):S295-305. doi: 10.1007/s10545-010-9129-z. Epub 2010 Jun 22.
10
Useful second-tier tests in expanded newborn screening of isovaleric acidemia and methylmalonic aciduria.
J Inherit Metab Dis. 2010 Oct;33(Suppl 2):S283-8. doi: 10.1007/s10545-010-9111-9. Epub 2010 May 4.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验