• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类基因组中罕见等位基因突变严重程度谱可预测疾病类型。

Mutation severity spectrum of rare alleles in the human genome is predictive of disease type.

机构信息

Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America.

Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America.

出版信息

PLoS Comput Biol. 2020 May 15;16(5):e1007775. doi: 10.1371/journal.pcbi.1007775. eCollection 2020 May.

DOI:10.1371/journal.pcbi.1007775
PMID:32413045
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7255613/
Abstract

The human genome harbors a variety of genetic variations. Single-nucleotide changes that alter amino acids in protein-coding regions are one of the major causes of human phenotypic variation and diseases. These single-amino acid variations (SAVs) are routinely found in whole genome and exome sequencing. Evaluating the functional impact of such genomic alterations is crucial for diagnosis of genetic disorders. We developed DeepSAV, a deep-learning convolutional neural network to differentiate disease-causing and benign SAVs based on a variety of protein sequence, structural and functional properties. Our method outperforms most stand-alone programs, and the version incorporating population and gene-level information (DeepSAV+PG) has similar predictive power as some of the best available. We transformed DeepSAV scores of rare SAVs in the human population into a quantity termed "mutation severity measure" for each human protein-coding gene. It reflects a gene's tolerance to deleterious missense mutations and serves as a useful tool to study gene-disease associations. Genes implicated in cancer, autism, and viral interaction are found by this measure as intolerant to mutations, while genes associated with a number of other diseases are scored as tolerant. Among known disease-associated genes, those that are mutation-intolerant are likely to function in development and signal transduction pathways, while those that are mutation-tolerant tend to encode metabolic and mitochondrial proteins.

摘要

人类基因组中存在多种遗传变异。改变蛋白质编码区域中氨基酸的单核苷酸变化是人类表型变异和疾病的主要原因之一。这些单氨基酸变异(SAVs)在全基因组和外显子组测序中经常被发现。评估这些基因组改变的功能影响对于遗传疾病的诊断至关重要。我们开发了 DeepSAV,这是一种基于多种蛋白质序列、结构和功能特性的深度学习卷积神经网络,用于区分致病和良性 SAVs。我们的方法优于大多数独立程序,而整合了人群和基因水平信息的版本(DeepSAV+PG)与一些最可用的程序具有相似的预测能力。我们将人群中罕见 SAV 的 DeepSAV 分数转化为每个人类蛋白质编码基因的“突变严重程度度量”。它反映了一个基因对有害错义突变的容忍度,是研究基因-疾病关联的有用工具。通过这种衡量标准,发现了与癌症、自闭症和病毒相互作用相关的基因对突变不敏感,而与许多其他疾病相关的基因则被评为耐受。在已知的与疾病相关的基因中,那些突变不敏感的基因可能在发育和信号转导途径中起作用,而那些突变耐受的基因则倾向于编码代谢和线粒体蛋白。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/24aa90c6c895/pcbi.1007775.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/3759fadd242e/pcbi.1007775.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/6f9417be12f9/pcbi.1007775.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/ff4aa37d61ec/pcbi.1007775.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/2bf00b4eb18a/pcbi.1007775.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/29a73f43f236/pcbi.1007775.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/c2e7b9badef8/pcbi.1007775.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/24aa90c6c895/pcbi.1007775.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/3759fadd242e/pcbi.1007775.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/6f9417be12f9/pcbi.1007775.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/ff4aa37d61ec/pcbi.1007775.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/2bf00b4eb18a/pcbi.1007775.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/29a73f43f236/pcbi.1007775.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/c2e7b9badef8/pcbi.1007775.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e95/7255613/24aa90c6c895/pcbi.1007775.g007.jpg

相似文献

1
Mutation severity spectrum of rare alleles in the human genome is predictive of disease type.人类基因组中罕见等位基因突变严重程度谱可预测疾病类型。
PLoS Comput Biol. 2020 May 15;16(5):e1007775. doi: 10.1371/journal.pcbi.1007775. eCollection 2020 May.
2
The DBSAV Database: Predicting Deleteriousness of Single Amino Acid Variations in the Human Proteome.DBSAV 数据库:预测人类蛋白质组中单氨基酸变异的有害性。
J Mol Biol. 2021 May 28;433(11):166915. doi: 10.1016/j.jmb.2021.166915. Epub 2021 Mar 4.
3
A deep catalogue of protein-coding variation in 983,578 individuals.983,578名个体蛋白质编码变异的深度目录。
Nature. 2024 Jul;631(8021):583-592. doi: 10.1038/s41586-024-07556-0. Epub 2024 May 20.
4
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
5
Deep learning of genomic variation and regulatory network data.基因组变异和调控网络数据的深度学习。
Hum Mol Genet. 2018 May 1;27(R1):R63-R71. doi: 10.1093/hmg/ddy115.
6
Unified inference of missense variant effects and gene constraints in the human genome.统一推断人类基因组中错义变异的效应和基因约束。
PLoS Genet. 2020 Jul 15;16(7):e1008922. doi: 10.1371/journal.pgen.1008922. eCollection 2020 Jul.
7
Genetic diagnosis by whole exome capture and massively parallel DNA sequencing.全外显子捕获和大规模平行 DNA 测序的基因诊断。
Proc Natl Acad Sci U S A. 2009 Nov 10;106(45):19096-101. doi: 10.1073/pnas.0910672106. Epub 2009 Oct 27.
8
Genetic intolerance analysis as a tool for protein science.遗传不耐受分析作为蛋白质科学的工具。
Biochim Biophys Acta Biomembr. 2020 Jan 1;1862(1):183058. doi: 10.1016/j.bbamem.2019.183058. Epub 2019 Sep 5.
9
Identifying Mendelian disease genes with the variant effect scoring tool.使用变异效应评分工具鉴定孟德尔疾病基因。
BMC Genomics. 2013;14 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2164-14-S3-S3. Epub 2013 May 28.
10
Quantifying the Impact of Rare and Ultra-rare Coding Variation across the Phenotypic Spectrum.量化罕见和超罕见编码变异在表型谱中的影响。
Am J Hum Genet. 2018 Jun 7;102(6):1204-1211. doi: 10.1016/j.ajhg.2018.05.002. Epub 2018 May 31.

引用本文的文献

1
Prediction of pathogenic mutations in human transmembrane proteins and their associated diseases via utilizing pre-trained Bio-LLMs.利用预训练的生物语言模型预测人类跨膜蛋白中的致病突变及其相关疾病。
Commun Biol. 2025 Jul 15;8(1):1050. doi: 10.1038/s42003-025-08452-7.
2
ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations.蛔虫:单个氨基酸变异的位置特征注释和基于蛋白质结构的表征
Comput Struct Biotechnol J. 2023 Sep 17;21:4743-4758. doi: 10.1016/j.csbj.2023.09.017. eCollection 2023.
3
VariBench, new variation benchmark categories and data sets.

本文引用的文献

1
Rare-variant collapsing analyses for complex traits: guidelines and applications.复杂性状的罕见变异合并分析:指南与应用。
Nat Rev Genet. 2019 Dec;20(12):747-759. doi: 10.1038/s41576-019-0177-4. Epub 2019 Oct 11.
2
The expanding phenotypes of cohesinopathies: one ring to rule them all!黏连蛋白病表型的不断扩展:以一环统御之!
Cell Cycle. 2019 Nov;18(21):2828-2848. doi: 10.1080/15384101.2019.1658476. Epub 2019 Sep 13.
3
Extensive disruption of protein interactions by genetic variants across the allele frequency spectrum in human populations.
VariBench,新的变异基准类别和数据集。
Front Bioinform. 2023 Sep 19;3:1248732. doi: 10.3389/fbinf.2023.1248732. eCollection 2023.
4
Genome interpretation using in silico predictors of variant impact.使用变异影响的计算机预测因子进行基因组解读。
Hum Genet. 2022 Oct;141(10):1549-1577. doi: 10.1007/s00439-022-02457-6. Epub 2022 Apr 30.
5
Predicting deleterious missense genetic variants via integrative supervised nonnegative matrix tri-factorization.通过集成监督非负矩阵三因子分解预测有害错义遗传变异。
Sci Rep. 2021 Dec 9;11(1):23747. doi: 10.1038/s41598-021-03230-x.
6
Loss of Function of Mutant IDS Due to Endoplasmic Reticulum-Associated Degradation: New Therapeutic Opportunities for Mucopolysaccharidosis Type II.由于内质网相关降解导致的突变 IDS 功能丧失:黏多糖贮积症 II 型的新治疗机会。
Int J Mol Sci. 2021 Nov 12;22(22):12227. doi: 10.3390/ijms222212227.
7
The DBSAV Database: Predicting Deleteriousness of Single Amino Acid Variations in the Human Proteome.DBSAV 数据库:预测人类蛋白质组中单氨基酸变异的有害性。
J Mol Biol. 2021 May 28;433(11):166915. doi: 10.1016/j.jmb.2021.166915. Epub 2021 Mar 4.
8
A Fifth of the Protein World: Rossmann-like Proteins as an Evolutionarily Successful Structural unit.五分之一的蛋白质世界:Rossmann 样蛋白作为一种进化成功的结构单元。
J Mol Biol. 2021 Feb 19;433(4):166788. doi: 10.1016/j.jmb.2020.166788. Epub 2020 Dec 31.
在人类种群中,广泛的遗传变异破坏了整个等位基因频率范围内的蛋白质相互作用。
Nat Commun. 2019 Sep 12;10(1):4141. doi: 10.1038/s41467-019-11959-3.
4
New insights on human essential genes based on integrated analysis and the construction of the HEGIAP web-based platform.基于整合分析和 HEGIAP 网络平台构建的人类必需基因新见解。
Brief Bioinform. 2020 Jul 15;21(4):1397-1410. doi: 10.1093/bib/bbz072.
5
Rho GTPases in cancer: friend or foe?Rho GTPases 在癌症中的作用:是敌是友?
Oncogene. 2019 Dec;38(50):7447-7456. doi: 10.1038/s41388-019-0963-7. Epub 2019 Aug 19.
6
Assessing predictions on fitness effects of missense variants in calmodulin.评估钙调蛋白中错义变异体对适应度影响的预测。
Hum Mutat. 2019 Sep;40(9):1463-1473. doi: 10.1002/humu.23857. Epub 2019 Sep 3.
7
The PSIPRED Protein Analysis Workbench: 20 years on.PSIPRED 蛋白质分析工作平台:20 年的发展
Nucleic Acids Res. 2019 Jul 2;47(W1):W402-W407. doi: 10.1093/nar/gkz297.
8
OMA standalone: orthology inference among public and custom genomes and transcriptomes.OMA 独立版:公共和定制基因组和转录组之间的同源推断。
Genome Res. 2019 Jul;29(7):1152-1163. doi: 10.1101/gr.243212.118. Epub 2019 Jun 24.
9
Modular epistasis and the compensatory evolution of gene deletion mutants.模块性上位性与基因缺失突变体的补偿性进化。
PLoS Genet. 2019 Feb 15;15(2):e1007958. doi: 10.1371/journal.pgen.1007958. eCollection 2019 Feb.
10
Heterozygous mutations cause genetic instability in a yeast model of cancer evolution.杂合突变导致癌症进化酵母模型中的遗传不稳定性。
Nature. 2019 Feb;566(7743):275-278. doi: 10.1038/s41586-019-0887-y. Epub 2019 Jan 30.