• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

亚马逊森林:致病变异的计算机元预测

AmazonForest: In Silico Metaprediction of Pathogenic Variants.

作者信息

Palheta Helber Gonzales Almeida, Gonçalves Wanderson Gonçalves, Brito Leonardo Miranda, Dos Santos Arthur Ribeiro, Dos Reis Matsumoto Marlon, Ribeiro-Dos-Santos Ândrea, de Araújo Gilderlanio Santana

机构信息

Laboratory of Human and Medical Genetics, Graduate Program of Genetics and Molecular Biology, Institute of Biological Sciences, Federal University of Pará, Belém 66075-110, Brazil.

Research Center on Oncology, Graduate Program of Oncology and Medical Science, Federal University of Pará, Belém 66073-000, Brazil.

出版信息

Biology (Basel). 2022 Mar 31;11(4):538. doi: 10.3390/biology11040538.

DOI:10.3390/biology11040538
PMID:35453737
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9024711/
Abstract

ClinVar is a web platform that stores ∼789,000 genetic associations with complex diseases. A partial set of these cataloged genetic associations has challenged clinicians and geneticists, often leading to conflicting interpretations or uncertain clinical impact significance. In this study, we addressed the (re)classification of genetic variants by AmazonForest, which is a random-forest-based pathogenicity metaprediction model that works by combining functional impact data from eight prediction tools. We evaluated the performance of representation learning algorithms such as autoencoders to propose a better strategy. All metaprediction models were trained with ClinVar data, and genetic variants were annotated with eight functional impact predictors cataloged with SnpEff/SnpSift. AmazonForest implements the best random forest model with a one hot data-encoding strategy, which shows an Area Under ROC Curve of ≥0.93. AmazonForest was employed for pathogenicity prediction of a set of ∼101,000 genetic variants of uncertain significance or conflict of interpretation. Our findings revealed ∼24,000 variants with high pathogenic probability (RFprob≥0.9). In addition, we show results for Alzheimer's Disease as a demonstration of its application in clinical interpretation of genetic variants in complex diseases. Lastly, AmazonForest is available as a web tool and R object that can be loaded to perform pathogenicity predictions.

摘要

ClinVar是一个网络平台,存储了约78.9万个与复杂疾病相关的基因关联信息。这些编入目录的基因关联信息的一部分给临床医生和遗传学家带来了挑战,常常导致相互矛盾的解释或不确定的临床影响意义。在本研究中,我们探讨了利用AmazonForest对基因变异进行(重新)分类,AmazonForest是一种基于随机森林的致病性元预测模型,它通过结合来自八个预测工具的功能影响数据来工作。我们评估了诸如自动编码器等表示学习算法的性能,以提出更好的策略。所有元预测模型均使用ClinVar数据进行训练,基因变异用SnpEff/SnpSift编目的八个功能影响预测器进行注释。AmazonForest采用一种独热数据编码策略实现了最佳随机森林模型,其ROC曲线下面积≥0.93。AmazonForest被用于对一组约10.1万个意义不确定或解释存在冲突的基因变异进行致病性预测。我们的研究结果揭示了约2.4万个具有高致病概率(RFprob≥0.9)的变异。此外,我们展示了阿尔茨海默病的结果,以证明其在复杂疾病基因变异临床解释中的应用。最后,AmazonForest作为一个网络工具和R对象可用,可加载以进行致病性预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3dc/9024711/ddf2afebc518/biology-11-00538-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3dc/9024711/cc559f983f11/biology-11-00538-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3dc/9024711/765bfb5a7c3d/biology-11-00538-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3dc/9024711/ddf2afebc518/biology-11-00538-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3dc/9024711/cc559f983f11/biology-11-00538-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3dc/9024711/765bfb5a7c3d/biology-11-00538-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3dc/9024711/ddf2afebc518/biology-11-00538-g003.jpg

相似文献

1
AmazonForest: In Silico Metaprediction of Pathogenic Variants.亚马逊森林:致病变异的计算机元预测
Biology (Basel). 2022 Mar 31;11(4):538. doi: 10.3390/biology11040538.
2
Evaluation of performance of leading algorithms for variant pathogenicity predictions and designing a combinatory predictor method: application to Rett syndrome variants.用于变异致病性预测的领先算法性能评估及组合预测方法设计:应用于雷特综合征变异
PeerJ. 2019 Nov 27;7:e8106. doi: 10.7717/peerj.8106. eCollection 2019.
3
Evaluation of in silico pathogenicity prediction tools for the classification of small in-frame indels.小框移码突变的计算机预测工具的分类评价。
BMC Med Genomics. 2023 Feb 28;16(1):36. doi: 10.1186/s12920-023-01454-6.
4
Type IV Collagen Variants in CKD: Performance of Computational Predictions for Identifying Pathogenic Variants.慢性肾脏病中的IV型胶原蛋白变体:识别致病变体的计算预测性能
Kidney Med. 2021 Feb 10;3(2):257-266. doi: 10.1016/j.xkme.2020.12.007. eCollection 2021 Mar-Apr.
5
Machine learning-based reclassification of germline variants of unknown significance: The RENOVO algorithm.基于机器学习的未知意义种系变异的重新分类:RENOVO 算法。
Am J Hum Genet. 2021 Apr 1;108(4):682-695. doi: 10.1016/j.ajhg.2021.03.010. Epub 2021 Mar 23.
6
Predicting Ovarian/Breast Cancer Pathogenic Risks of Human BRCA1 Gene Variants of Unknown Significance.预测人类 BRCA1 基因未知意义变异的卵巢/乳腺癌发病风险。
Biomed Res Int. 2021 Apr 14;2021:6667201. doi: 10.1155/2021/6667201. eCollection 2021.
7
Machine learning models for accurate prioritization of variants of uncertain significance.用于准确确定意义未明变异优先级的机器学习模型。
Hum Mutat. 2022 Apr;43(4):449-460. doi: 10.1002/humu.24339. Epub 2022 Feb 19.
8
Machine learning random forest for predicting oncosomatic variant NGS analysis.机器学习随机森林预测肿瘤体细胞变异 NGS 分析。
Sci Rep. 2021 Nov 8;11(1):21820. doi: 10.1038/s41598-021-01253-y.
9
In Silico Predictions of KCNQ Variant Pathogenicity in Epilepsy.在癫痫中 KCNQ 变异致病性的计算机预测。
Pediatr Neurol. 2021 May;118:48-54. doi: 10.1016/j.pediatrneurol.2021.01.006. Epub 2021 Jan 27.
10
Pathogenicity Prediction of GABA Receptor Missense Variants.γ-氨基丁酸受体错义变体的致病性预测
bioRxiv. 2023 Nov 16:2023.11.14.567135. doi: 10.1101/2023.11.14.567135.

引用本文的文献

1
Variant Impact Predictor database (VIPdb), version 2: trends from three decades of genetic variant impact predictors.变异影响预测器数据库(VIPdb),版本 2:三十年来遗传变异影响预测器的趋势。
Hum Genomics. 2024 Aug 28;18(1):90. doi: 10.1186/s40246-024-00663-z.
2
Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors.变异影响预测数据库(VIPdb),版本2:25年基因变异影响预测的趋势
bioRxiv. 2024 Jun 28:2024.06.25.600283. doi: 10.1101/2024.06.25.600283.

本文引用的文献

1
Familial Alzheimer's Disease Mutations in PSEN1 Lead to Premature Human Stem Cell Neurogenesis.早发性家族性阿尔茨海默病突变导致 PSEN1 人类干细胞神经发生。
Cell Rep. 2021 Jan 12;34(2):108615. doi: 10.1016/j.celrep.2020.108615.
2
An Ensemble Approach to Predict the Pathogenicity of Synonymous Variants.一种预测同义变体致病性的集成方法。
Genes (Basel). 2020 Sep 21;11(9):1102. doi: 10.3390/genes11091102.
3
Array programming with NumPy.使用 NumPy 进行数组编程。
Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.
4
Annotation of Human Exome Gene Variants with Consensus Pathogenicity.人类外显子基因变异的共识致病性注释。
Genes (Basel). 2020 Sep 14;11(9):1076. doi: 10.3390/genes11091076.
5
Differential Expression and miRNA-Gene Interactions in Early and Late Mild Cognitive Impairment.早期和晚期轻度认知障碍中的差异表达及miRNA-基因相互作用
Biology (Basel). 2020 Aug 28;9(9):251. doi: 10.3390/biology9090251.
6
A decision tree to improve identification of pathogenic mutations in clinical practice.一种有助于在临床实践中提高致病性突变识别能力的决策树。
BMC Med Inform Decis Mak. 2020 Mar 10;20(1):52. doi: 10.1186/s12911-020-1060-0.
7
Genetic Variations and Precision Medicine.基因变异与精准医学
Perspect Health Inf Manag. 2019 Apr 1;16(Spring):1a. eCollection 2019 Spring.
8
ADAM10 in Alzheimer's disease: Pharmacological modulation by natural compounds and its role as a peripheral marker.ADAM10 在阿尔茨海默病中的作用:天然化合物的药理学调节及其作为外周标志物的作用。
Biomed Pharmacother. 2019 May;113:108661. doi: 10.1016/j.biopha.2019.108661. Epub 2019 Mar 2.
9
Neuroinflammatory Cytokines Induce Amyloid Beta Neurotoxicity through Modulating Amyloid Precursor Protein Levels/Metabolism.神经炎症细胞因子通过调节淀粉样前体蛋白水平/代谢诱导β淀粉样蛋白神经毒性。
Biomed Res Int. 2018 Oct 25;2018:3087475. doi: 10.1155/2018/3087475. eCollection 2018.
10
Evaluation of computational techniques for predicting non-synonymous single nucleotide variants pathogenicity.评价预测非同义单核苷酸变异致病性的计算技术。
Genomics. 2019 Jul;111(4):869-882. doi: 10.1016/j.ygeno.2018.05.013. Epub 2018 May 26.