Suppr超能文献

使用基于机器学习的方法提高微阵列数据中CNV的检测性能。

Improving CNV Detection Performance in Microarray Data Using a Machine Learning-Based Approach.

作者信息

Goh Chul Jun, Kwon Hyuk-Jung, Kim Yoonhee, Jung Seunghee, Park Jiwoo, Lee Isaac Kise, Park Bo-Ram, Kim Myeong-Ji, Kim Min-Jeong, Lee Min-Seob

机构信息

Eone-Diagnomics Genome Center, Inc., 143, Gaetbeol-ro, Yeonsu-gu, Incheon 21999, Republic of Korea.

Department of Computer Science and Engineering, Incheon National University (INU), Incheon 22012, Republic of Korea.

出版信息

Diagnostics (Basel). 2023 Dec 29;14(1):84. doi: 10.3390/diagnostics14010084.

Abstract

Copy number variation (CNV) is a primary source of structural variation in the human genome, leading to several disorders. Therefore, analyzing neonatal CNVs is crucial for managing CNV-related chromosomal disabilities. However, genomic waves can hinder accurate CNV analysis. To mitigate the influences of the waves, we adopted a machine learning approach and developed a new method that uses a modified log R ratio instead of the commonly used log R ratio. Validation results using samples with known CNVs demonstrated the superior performance of our method. We analyzed a total of 16,046 Korean newborn samples using the new method and identified CNVs related to 39 genetic disorders were identified in 342 cases. The most frequently detected CNV-related disorder was Joubert syndrome 4. The accuracy of our method was further confirmed by analyzing a subset of the detected results using NGS and comparing them with our results. The utilization of a genome-wide single nucleotide polymorphism array with wave offset was shown to be a powerful method for identifying CNVs in neonatal cases. The accurate screening and the ability to identify various disease susceptibilities offered by our new method could facilitate the identification of CNV-associated chromosomal disease etiologies.

摘要

拷贝数变异(CNV)是人类基因组结构变异的主要来源,会导致多种疾病。因此,分析新生儿CNV对于管理与CNV相关的染色体残疾至关重要。然而,基因组波动会阻碍准确的CNV分析。为了减轻波动的影响,我们采用了机器学习方法并开发了一种新方法,该方法使用修正的对数R比率而非常用的对数R比率。使用已知CNV的样本进行的验证结果证明了我们方法的卓越性能。我们使用新方法分析了总共16046例韩国新生儿样本,在342例中鉴定出与39种遗传疾病相关的CNV。最常检测到的与CNV相关的疾病是4型乔伯特综合征。通过使用NGS分析检测结果的一个子集并将其与我们的结果进行比较,进一步证实了我们方法的准确性。利用具有波动偏移的全基因组单核苷酸多态性阵列被证明是一种在新生儿病例中识别CNV的强大方法。我们的新方法提供的准确筛查和识别各种疾病易感性的能力有助于识别与CNV相关的染色体疾病病因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03cb/10871075/40068b3ceb49/diagnostics-14-00084-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验