• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对人类基因组编码碱基变化的疾病易感性进行详尽预测。

Exhaustive prediction of disease susceptibility to coding base changes in the human genome.

作者信息

Kulkarni Vinayak, Errami Mounir, Barber Robert, Garner Harold R

机构信息

Mc Dermott Center for Human Growth and Development, UT Southwestern Medical Center, Dallas, TX, USA.

出版信息

BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S3. doi: 10.1186/1471-2105-9-S9-S3.

DOI:10.1186/1471-2105-9-S9-S3
PMID:18793467
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2537574/
Abstract

BACKGROUND

Single Nucleotide Polymorphisms (SNPs) are the most abundant form of genomic variation and can cause phenotypic differences between individuals, including diseases. Bases are subject to various levels of selection pressure, reflected in their inter-species conservation.

RESULTS

We propose a method that is not dependant on transcription information to score each coding base in the human genome reflecting the disease probability associated with its mutation. Twelve factors likely to be associated with disease alleles were chosen as the input for a support vector machine prediction algorithm. The analysis yielded 83% sensitivity and 84% specificity in segregating disease like alleles as found in the Human Gene Mutation Database from non-disease like alleles as found in the Database of Single Nucleotide Polymorphisms. This algorithm was subsequently applied to each base within all known human genes, exhaustively confirming that interspecies conservation is the strongest factor for disease association. For each gene, the length normalized average disease potential score was calculated. Out of the 30 genes with the highest scores, 21 are directly associated with a disease. In contrast, out of the 30 genes with the lowest scores, only one is associated with a disease as found in published literature. The results strongly suggest that the highest scoring genes are enriched for those that might contribute to disease, if mutated.

CONCLUSION

This method provides valuable information to researchers to identify sensitive positions in genes that have a high disease probability, enabling them to optimize experimental designs and interpret data emerging from genetic and epidemiological studies.

摘要

背景

单核苷酸多态性(SNPs)是基因组变异最丰富的形式,可导致个体间的表型差异,包括疾病。碱基受到不同程度的选择压力,这在它们的种间保守性中得以体现。

结果

我们提出了一种不依赖转录信息的方法,对人类基因组中的每个编码碱基进行评分,以反映与其突变相关的疾病概率。选择了十二个可能与疾病等位基因相关的因素作为支持向量机预测算法的输入。在将人类基因突变数据库中发现的疾病样等位基因与单核苷酸多态性数据库中发现的非疾病样等位基因区分开来时,该分析的灵敏度为83%,特异性为84%。随后将该算法应用于所有已知人类基因中的每个碱基,详尽地证实了种间保守性是疾病关联的最强因素。对于每个基因,计算了长度标准化的平均疾病潜力得分。得分最高的30个基因中,有21个与疾病直接相关。相比之下,得分最低的30个基因中,只有一个在已发表的文献中与疾病相关。结果强烈表明,得分最高的基因富集了那些如果发生突变可能导致疾病的基因。

结论

该方法为研究人员提供了有价值的信息,以识别具有高疾病概率的基因中的敏感位置,使他们能够优化实验设计并解释遗传和流行病学研究中出现的数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/1d7d0a9406a7/1471-2105-9-S9-S3-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/eee7e151d7c0/1471-2105-9-S9-S3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/296fc5b35d40/1471-2105-9-S9-S3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/1f17b81ac095/1471-2105-9-S9-S3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/2949827aa6c1/1471-2105-9-S9-S3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/7bc07bca1961/1471-2105-9-S9-S3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/2a836396dad2/1471-2105-9-S9-S3-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/c925154b339a/1471-2105-9-S9-S3-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/1d7d0a9406a7/1471-2105-9-S9-S3-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/eee7e151d7c0/1471-2105-9-S9-S3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/296fc5b35d40/1471-2105-9-S9-S3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/1f17b81ac095/1471-2105-9-S9-S3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/2949827aa6c1/1471-2105-9-S9-S3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/7bc07bca1961/1471-2105-9-S9-S3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/2a836396dad2/1471-2105-9-S9-S3-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/c925154b339a/1471-2105-9-S9-S3-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec36/2537574/1d7d0a9406a7/1471-2105-9-S9-S3-8.jpg

相似文献

1
Exhaustive prediction of disease susceptibility to coding base changes in the human genome.对人类基因组编码碱基变化的疾病易感性进行详尽预测。
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S3. doi: 10.1186/1471-2105-9-S9-S3.
2
Common neurodegenerative diseases: dissection by genome-wide association.常见神经退行性疾病:通过全基因组关联研究进行剖析
Curr Neurol Neurosci Rep. 2007 Sep;7(5):425-7. doi: 10.1007/s11910-007-0065-8.
3
Accurate prediction of deleterious protein kinase polymorphisms.有害蛋白激酶多态性的准确预测。
Bioinformatics. 2007 Nov 1;23(21):2918-25. doi: 10.1093/bioinformatics/btm437. Epub 2007 Sep 12.
4
Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays.在一对寡核苷酸阵列上对超过10万个单核苷酸多态性进行基因分型。
Nat Methods. 2004 Nov;1(2):109-11. doi: 10.1038/nmeth718.
5
SNP@Promoter: a database of human SNPs (single nucleotide polymorphisms) within the putative promoter regions.SNP@启动子:一个关于假定启动子区域内人类单核苷酸多态性(SNP)的数据库。
BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2105-9-S1-S2.
6
SNPeffect v2.0: a new step in investigating the molecular phenotypic effects of human non-synonymous SNPs.SNPeffect v2.0:研究人类非同义单核苷酸多态性分子表型效应的新进展。
Bioinformatics. 2006 Sep 1;22(17):2183-5. doi: 10.1093/bioinformatics/btl348. Epub 2006 Jun 29.
7
Ranking single nucleotide polymorphisms by potential deleterious effects.根据潜在有害效应排列单核苷酸多态性
AMIA Annu Symp Proc. 2008 Nov 6;2008:667-71.
8
A similarity-based method for genome-wide prediction of disease-relevant human genes.一种基于相似性的全基因组预测疾病相关人类基因的方法。
Bioinformatics. 2002;18 Suppl 2:S110-5. doi: 10.1093/bioinformatics/18.suppl_2.s110.
9
SNP Function Portal: a web database for exploring the function implication of SNP alleles.单核苷酸多态性(SNP)功能门户:一个用于探索SNP等位基因功能含义的网络数据库。
Bioinformatics. 2006 Jul 15;22(14):e523-9. doi: 10.1093/bioinformatics/btl241.
10
Merging personalized and participatory medicine: interpretation of individual genomes.融合个性化与参与式医学:个体基因组解读
Stud Health Technol Inform. 2014;202:24-7.

引用本文的文献

1
Pathogenic nsSNPs that increase the risks of cancers among the Orang Asli and Malays.导致奥朗阿斯利人和马来人癌症风险增加的致病性 nsSNP。
Sci Rep. 2021 Aug 9;11(1):16158. doi: 10.1038/s41598-021-95618-y.
2
Losses of human disease-associated genes in placental mammals.胎盘哺乳动物中与人类疾病相关基因的缺失。
NAR Genom Bioinform. 2019 Oct 24;2(1):lqz012. doi: 10.1093/nargab/lqz012. eCollection 2020 Mar.
3
A combined functional annotation score for non-synonymous variants.一种针对非同义变异的综合功能注释评分

本文引用的文献

1
A "silent" polymorphism in the MDR1 gene changes substrate specificity.MDR1基因中的一种“沉默”多态性改变了底物特异性。
Science. 2007 Jan 26;315(5811):525-8. doi: 10.1126/science.1135308. Epub 2006 Dec 21.
2
The comparative toxicogenomics database: a cross-species resource for building chemical-gene interaction networks.比较毒理基因组学数据库:构建化学物质-基因相互作用网络的跨物种资源。
Toxicol Sci. 2006 Aug;92(2):587-95. doi: 10.1093/toxsci/kfl008. Epub 2006 May 4.
3
Selective pressures at a codon-level predict deleterious mutations in human disease genes.
Hum Hered. 2012;73(1):47-51. doi: 10.1159/000334984. Epub 2012 Jan 18.
4
Phylomedicine: an evolutionary telescope to explore and diagnose the universe of disease mutations.演化医学:探索和诊断疾病突变宇宙的进化之眼。
Trends Genet. 2011 Sep;27(9):377-86. doi: 10.1016/j.tig.2011.06.004. Epub 2011 Jul 20.
5
A knowledge-based weighting framework to boost the power of genome-wide association studies.基于知识的加权框架,提高全基因组关联研究的效力。
PLoS One. 2010 Dec 31;5(12):e14480. doi: 10.1371/journal.pone.0014480.
6
Proceedings of the 2009 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) conference. Introduction.2009年中南计算生物学与生物信息学学会(MCBIOS)会议论文集。引言。
BMC Bioinformatics. 2009 Oct 8;10 Suppl 11(Suppl 11):S1. doi: 10.1186/1471-2105-10-S11-S1.
7
Automated inference of molecular mechanisms of disease from amino acid substitutions.从氨基酸替换自动推断疾病的分子机制。
Bioinformatics. 2009 Nov 1;25(21):2744-50. doi: 10.1093/bioinformatics/btp528. Epub 2009 Sep 3.
8
Proceedings of the 2008 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) Conference.2008年中南计算生物学与生物信息学学会(MCBIOS)会议论文集
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S1. doi: 10.1186/1471-2105-9-S9-S1.
密码子水平的选择压力可预测人类疾病基因中的有害突变。
J Mol Biol. 2006 May 19;358(5):1390-404. doi: 10.1016/j.jmb.2006.02.067. Epub 2006 Mar 15.
4
Sequence variation in G-protein-coupled receptors: analysis of single nucleotide polymorphisms.G蛋白偶联受体中的序列变异:单核苷酸多态性分析
Nucleic Acids Res. 2005 Mar 22;33(5):1710-21. doi: 10.1093/nar/gki311. Print 2005.
5
Identification and characterization of multi-species conserved sequences.多物种保守序列的鉴定与表征
Genome Res. 2003 Dec;13(12):2507-18. doi: 10.1101/gr.1602203.
6
SIFT: Predicting amino acid changes that affect protein function.SIFT:预测影响蛋白质功能的氨基酸变化。
Nucleic Acids Res. 2003 Jul 1;31(13):3812-4. doi: 10.1093/nar/gkg509.
7
Multiple sequence alignment with the Clustal series of programs.使用Clustal系列程序进行多序列比对。
Nucleic Acids Res. 2003 Jul 1;31(13):3497-500. doi: 10.1093/nar/gkg500.
8
Human Gene Mutation Database (HGMD): 2003 update.人类基因突变数据库(HGMD):2003年更新版。
Hum Mutat. 2003 Jun;21(6):577-81. doi: 10.1002/humu.10212.
9
A functional analysis of disease-associated mutations in the androgen receptor gene.雄激素受体基因中疾病相关突变的功能分析。
Nucleic Acids Res. 2003 Apr 15;31(8):e42. doi: 10.1093/nar/gng042.
10
Understanding missense mutations in the BRCA1 gene: an evolutionary approach.理解BRCA1基因中的错义突变:一种进化方法。
Proc Natl Acad Sci U S A. 2003 Feb 4;100(3):1151-6. doi: 10.1073/pnas.0237285100. Epub 2003 Jan 16.