Suppr超能文献

从基因组解读关键评估(CAGI)挑战中的外显子预测静脉血栓栓塞风险。

Predicting venous thromboembolism risk from exomes in the Critical Assessment of Genome Interpretation (CAGI) challenges.

机构信息

Biomedical Informatics Training Program, Stanford University, Stanford, California.

Department of Dermatology, Stanford School of Medicine, Stanford, California.

出版信息

Hum Mutat. 2019 Sep;40(9):1314-1320. doi: 10.1002/humu.23825. Epub 2019 Jun 24.

Abstract

Genetics play a key role in venous thromboembolism (VTE) risk, however established risk factors in European populations do not translate to individuals of African descent because of the differences in allele frequencies between populations. As part of the fifth iteration of the Critical Assessment of Genome Interpretation, participants were asked to predict VTE status in exome data from African American subjects. Participants were provided with 103 unlabeled exomes from patients treated with warfarin for non-VTE causes or VTE and asked to predict which disease each subject had been treated for. Given the lack of training data, many participants opted to use unsupervised machine learning methods, clustering the exomes by variation in genes known to be associated with VTE. The best performing method using only VTE related genes achieved an area under the ROC curve of 0.65. Here, we discuss the range of methods used in the prediction of VTE from sequence data and explore some of the difficulties of conducting a challenge with known confounders. In addition, we show that an existing genetic risk score for VTE that was developed in European subjects works well in African Americans.

摘要

遗传学在静脉血栓栓塞症(VTE)风险中起着关键作用,然而,由于人群之间等位基因频率的差异,欧洲人群中已确定的风险因素并不能转化为非洲裔个体。作为基因组解读关键评估第五轮的一部分,参与者被要求根据非 VTE 病因或 VTE 接受华法林治疗的非裔美国人受试者的外显子数据预测 VTE 状态。参与者提供了 103 个未经标记的外显子,这些外显子来自因非 VTE 病因或 VTE 而接受华法林治疗的患者,并被要求预测每位受试者接受治疗的疾病。由于缺乏训练数据,许多参与者选择使用无监督机器学习方法,根据与 VTE 相关的基因的变异对外显子进行聚类。仅使用与 VTE 相关的基因的表现最佳的方法,ROC 曲线下面积达到 0.65。在这里,我们讨论了从序列数据预测 VTE 时使用的各种方法,并探讨了在存在已知混杂因素的情况下进行挑战的一些困难。此外,我们还表明,在欧洲人群中开发的用于 VTE 的现有遗传风险评分在非洲裔美国人中效果良好。

相似文献

引用本文的文献

1
Artificial intelligence in clinical thrombosis and hemostasis: A review.临床血栓形成与止血中的人工智能:综述
Res Pract Thromb Haemost. 2025 Jul 24;9(5):102984. doi: 10.1016/j.rpth.2025.102984. eCollection 2025 Jul.
6
Genome interpretation using in silico predictors of variant impact.使用变异影响的计算机预测因子进行基因组解读。
Hum Genet. 2022 Oct;141(10):1549-1577. doi: 10.1007/s00439-022-02457-6. Epub 2022 Apr 30.

本文引用的文献

7
Racial differences in venous thromboembolism.种族差异与静脉血栓栓塞症。
J Thromb Haemost. 2011 Oct;9(10):1877-82. doi: 10.1111/j.1538-7836.2011.04443.x.
8
Genetics of venous thrombosis.静脉血栓形成的遗传学
J Thromb Haemost. 2009 Jul;7 Suppl 1:301-4. doi: 10.1111/j.1538-7836.2009.03394.x.
9
Venous thromboembolism and ethnicity.静脉血栓栓塞与种族
Br J Haematol. 2009 Aug;146(4):369-83. doi: 10.1111/j.1365-2141.2009.07786.x. Epub 2009 Jun 22.
10
SNAP predicts effect of mutations on protein function.SNAP预测突变对蛋白质功能的影响。
Bioinformatics. 2008 Oct 15;24(20):2397-8. doi: 10.1093/bioinformatics/btn435. Epub 2008 Aug 30.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验