Rutgers Institute for Health, Health Care Policy and Aging Research, Rutgers University, 112 Paterson St, New Brunswick, NJ, USA.
Rutgers Cancer Institute of New Jersey, Rutgers University, 195 Little Albany St, New Brunswick, NJ, USA.
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac191.
Precision medicine uses genetic, environmental and lifestyle factors to more accurately diagnose and treat disease in specific groups of patients, and it is considered one of the most promising medical efforts of our time. The use of genetics is arguably the most data-rich and complex components of precision medicine. The grand challenge today is the successful assimilation of genetics into precision medicine that translates across different ancestries, diverse diseases and other distinct populations, which will require clever use of artificial intelligence (AI) and machine learning (ML) methods. Our goal here was to review and compare scientific objectives, methodologies, datasets, data sources, ethics and gaps of AI/ML approaches used in genomics and precision medicine. We selected high-quality literature published within the last 5 years that were indexed and available through PubMed Central. Our scope was narrowed to articles that reported application of AI/ML algorithms for statistical and predictive analyses using whole genome and/or whole exome sequencing for gene variants, and RNA-seq and microarrays for gene expression. We did not limit our search to specific diseases or data sources. Based on the scope of our review and comparative analysis criteria, we identified 32 different AI/ML approaches applied in variable genomics studies and report widely adapted AI/ML algorithms for predictive diagnostics across several diseases.
精准医学利用遗传、环境和生活方式因素,更准确地诊断和治疗特定患者群体的疾病,被认为是当代最有前途的医学努力之一。遗传的应用可以说是精准医学中数据最丰富、最复杂的部分。当今的巨大挑战是成功地将遗传学融入精准医学中,使其能够在不同的祖源、不同的疾病和其他不同的人群中转化,这将需要巧妙地利用人工智能 (AI) 和机器学习 (ML) 方法。我们的目标是回顾和比较基因组学和精准医学中使用的 AI/ML 方法的科学目标、方法、数据集、数据源、伦理和差距。我们选择了过去 5 年内发表的高质量文献,这些文献通过 PubMed Central 进行了索引和提供。我们的范围缩小到了报道使用 AI/ML 算法进行统计和预测分析的文章,这些分析使用全基因组和/或外显子组测序来检测基因变异,以及 RNA-seq 和微阵列来检测基因表达。我们没有将搜索范围限制在特定的疾病或数据源上。基于我们的综述和比较分析标准的范围,我们确定了 32 种不同的 AI/ML 方法应用于不同的基因组学研究,并报告了广泛适用于多种疾病的预测诊断的 AI/ML 算法。