• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 SInBaD 探索非编码区域的功能变异发现。

Exploring functional variant discovery in non-coding regions with SInBaD.

机构信息

Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA.

出版信息

Nucleic Acids Res. 2013 Jan 7;41(1):e7. doi: 10.1093/nar/gks800. Epub 2012 Aug 31.

DOI:10.1093/nar/gks800
PMID:22941663
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3592431/
Abstract

The thousand genomes project and many similar ongoing large-scale sequencing efforts require new methods to predict functional variants in both coding and non-coding regions in order to understand phenotype and genotype relationships. We report the design of a new model SInBaD (Sequence-Information-Based-Decision-model) which relies on nucleotide conservation information to evaluate any annotated human variant in all known exons, introns, splice junctions and promoter regions. SInBaD builds separate mathematical models for promoters, exons and introns, using the human disease mutations annotated in human gene mutation database as the training dataset for functional variants. The ten-fold cross validation shows high prediction accuracy. Validations on test datasets, demonstrate that variants predicted as functional have a significantly higher occurrence in cancer patients. We also applied our model to variants found in four different individual human genomes to identify a set of functional variants, which might be of interest for further studies. Scores for any possible variants for all annotated genes are available under http://tingchenlab.cmb.usc.edu/sinbad/. SInBaD supports the current standard format of genotyping, the variant call files (VCF 4.0), making it easy to integrate it into any existing next-generation sequencing pipeline. The accuracy of SNP detection poses the only limitation to the use of SInBaD.

摘要

千基因组计划和许多类似的正在进行的大规模测序工作需要新的方法来预测编码和非编码区域中的功能变体,以便理解表型和基因型的关系。我们报告了一种新模型 SInBaD(基于序列信息的决策模型)的设计,该模型依赖于核苷酸保守性信息来评估所有已知外显子、内含子、剪接接头和启动子区域中注释的任何人类变体。SInBaD 为启动子、外显子和内含子分别构建了单独的数学模型,使用人类基因突变数据库中注释的人类疾病突变作为功能变体的训练数据集。十折交叉验证显示出较高的预测准确性。在测试数据集上的验证表明,预测为功能的变体在癌症患者中出现的频率显著更高。我们还将我们的模型应用于四个不同个体人类基因组中的变体,以确定一组可能对进一步研究感兴趣的功能变体。所有注释基因的任何可能变体的分数都可在 http://tingchenlab.cmb.usc.edu/sinbad/ 下获得。SInBaD 支持当前的基因分型标准格式,即变体调用文件(VCF 4.0),使其易于集成到任何现有的下一代测序管道中。SNP 检测的准确性是使用 SInBaD 的唯一限制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a450/3592431/1e901dcf499d/gks800f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a450/3592431/a8cde9bf9ab1/gks800f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a450/3592431/1b85342c4b4a/gks800f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a450/3592431/f56d8bdbac97/gks800f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a450/3592431/1e901dcf499d/gks800f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a450/3592431/a8cde9bf9ab1/gks800f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a450/3592431/1b85342c4b4a/gks800f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a450/3592431/f56d8bdbac97/gks800f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a450/3592431/1e901dcf499d/gks800f4p.jpg

相似文献

1
Exploring functional variant discovery in non-coding regions with SInBaD.利用 SInBaD 探索非编码区域的功能变异发现。
Nucleic Acids Res. 2013 Jan 7;41(1):e7. doi: 10.1093/nar/gks800. Epub 2012 Aug 31.
2
Computational discovery of human coding and non-coding transcripts with conserved splice sites.具有保守剪接位点的人类编码和非编码转录本的计算发现。
Bioinformatics. 2011 Jul 15;27(14):1894-900. doi: 10.1093/bioinformatics/btr314. Epub 2011 May 26.
3
LEMONS - A Tool for the Identification of Splice Junctions in Transcriptomes of Organisms Lacking Reference Genomes.LEMONS——一种用于识别缺乏参考基因组的生物体转录组中剪接位点的工具。
PLoS One. 2015 Nov 25;10(11):e0143329. doi: 10.1371/journal.pone.0143329. eCollection 2015.
4
Global inference of disease-causing single nucleotide variants from exome sequencing data.从外显子组测序数据中对致病单核苷酸变异进行全局推断。
BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):468. doi: 10.1186/s12859-016-1325-x.
5
Genome analysis and knowledge-driven variant interpretation with TGex.基因组分析和基于 TGex 的知识驱动的变异解释。
BMC Med Genomics. 2019 Dec 30;12(1):200. doi: 10.1186/s12920-019-0647-8.
6
SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome.人类基因组中转录起始位点、典型和非典型翻译起始位点的 SNP 和 indel 频率。
PLoS One. 2019 Apr 12;14(4):e0214816. doi: 10.1371/journal.pone.0214816. eCollection 2019.
7
Systematic Computational Identification of Variants That Activate Exonic and Intronic Cryptic Splice Sites.激活外显子和内含子隐蔽剪接位点的变异体的系统计算鉴定
Am J Hum Genet. 2017 May 4;100(5):751-765. doi: 10.1016/j.ajhg.2017.04.001.
8
In silico prediction of splice-altering single nucleotide variants in the human genome.人类基因组中剪接改变单核苷酸变异的计算机模拟预测
Nucleic Acids Res. 2014 Dec 16;42(22):13534-44. doi: 10.1093/nar/gku1206.
9
Misannotated Multi-Nucleotide Variants in Public Cancer Genomics Datasets Lead to Inaccurate Mutation Calls with Significant Implications.公共癌症基因组学数据集中标注错误的多核苷酸变体导致不准确的突变调用,具有重要影响。
Cancer Res. 2021 Jan 15;81(2):282-288. doi: 10.1158/0008-5472.CAN-20-2151. Epub 2020 Oct 28.
10
Deep sequencing of Danish Holstein dairy cattle for variant detection and insight into potential loss-of-function variants in protein coding genes.对丹麦荷斯坦奶牛进行深度测序,以检测变异并深入了解蛋白质编码基因中潜在的功能丧失变异。
BMC Genomics. 2015 Dec 9;16:1043. doi: 10.1186/s12864-015-2249-y.

引用本文的文献

1
Variant Impact Predictor database (VIPdb), version 2: trends from three decades of genetic variant impact predictors.变异影响预测器数据库(VIPdb),版本 2:三十年来遗传变异影响预测器的趋势。
Hum Genomics. 2024 Aug 28;18(1):90. doi: 10.1186/s40246-024-00663-z.
2
Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors.变异影响预测数据库(VIPdb),版本2:25年基因变异影响预测的趋势
bioRxiv. 2024 Jun 28:2024.06.25.600283. doi: 10.1101/2024.06.25.600283.
3
VIPdb, a genetic Variant Impact Predictor Database.

本文引用的文献

1
Prediction of functional regulatory SNPs in monogenic and complex disease.单基因病和复杂疾病中功能调控 SNP 的预测。
Hum Mutat. 2011 Oct;32(10):1183-90. doi: 10.1002/humu.21559. Epub 2011 Sep 9.
2
A probabilistic disease-gene finder for personal genomes.个人基因组的概率疾病基因查找器。
Genome Res. 2011 Sep;21(9):1529-42. doi: 10.1101/gr.123158.111. Epub 2011 Jun 23.
3
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
VIPdb,一个遗传变异影响预测数据库。
Hum Mutat. 2019 Sep;40(9):1202-1214. doi: 10.1002/humu.23858. Epub 2019 Aug 17.
4
Global inference of disease-causing single nucleotide variants from exome sequencing data.从外显子组测序数据中对致病单核苷酸变异进行全局推断。
BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):468. doi: 10.1186/s12859-016-1325-x.
5
dbWGFP: a database and web server of human whole-genome single nucleotide variants and their functional predictions.dbWGFP:人类全基因组单核苷酸变异及其功能预测的数据库和网络服务器。
Database (Oxford). 2016 Mar 17;2016. doi: 10.1093/database/baw024. Print 2016.
6
Role of non-coding sequence variants in cancer.非编码序列变异在癌症中的作用。
Nat Rev Genet. 2016 Feb;17(2):93-108. doi: 10.1038/nrg.2015.17. Epub 2016 Jan 19.
7
Prioritization Of Nonsynonymous Single Nucleotide Variants For Exome Sequencing Studies Via Integrative Learning On Multiple Genomic Data.通过对多个基因组数据进行整合学习来确定外显子组测序研究中非同义单核苷酸变异的优先级
Sci Rep. 2015 Oct 13;5:14955. doi: 10.1038/srep14955.
8
The genetics of human autoimmune disease: A perspective on progress in the field and future directions.人类自身免疫性疾病的遗传学:该领域进展及未来方向的展望
J Autoimmun. 2015 Nov;64:1-12. doi: 10.1016/j.jaut.2015.08.015. Epub 2015 Sep 4.
9
Genome-wide analysis of noncoding regulatory mutations in cancer.癌症中非编码调控突变的全基因组分析。
Nat Genet. 2014 Nov;46(11):1160-5. doi: 10.1038/ng.3101. Epub 2014 Sep 28.
10
Integrating multiple genomic data to predict disease-causing nonsynonymous single nucleotide variants in exome sequencing studies.在全外显子组测序研究中整合多个基因组数据以预测致病非同义单核苷酸变异
PLoS Genet. 2014 Mar 20;10(3):e1004237. doi: 10.1371/journal.pgen.1004237. eCollection 2014 Mar.
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
4
MutationTaster evaluates disease-causing potential of sequence alterations.MutationTaster评估序列改变的致病潜力。
Nat Methods. 2010 Aug;7(8):575-6. doi: 10.1038/nmeth0810-575.
5
Single-nucleotide evolutionary constraint scores highlight disease-causing mutations.单核苷酸进化约束分数突出显示致病突变。
Nat Methods. 2010 Apr;7(4):250-1. doi: 10.1038/nmeth0410-250.
6
Genome 10K: a proposal to obtain whole-genome sequence for 10,000 vertebrate species.基因组 10K:获取 10000 种脊椎动物全基因组序列的提案。
J Hered. 2009 Nov-Dec;100(6):659-74. doi: 10.1093/jhered/esp086. Epub 2009 Nov 5.
7
Identification of deleterious mutations within three human genomes.三个人类基因组中有害突变的鉴定。
Genome Res. 2009 Sep;19(9):1553-61. doi: 10.1101/gr.092619.109. Epub 2009 Jul 14.
8
The diploid genome sequence of an Asian individual.一名亚洲个体的二倍体基因组序列。
Nature. 2008 Nov 6;456(7218):60-5. doi: 10.1038/nature07484.
9
SNAP predicts effect of mutations on protein function.SNAP预测突变对蛋白质功能的影响。
Bioinformatics. 2008 Oct 15;24(20):2397-8. doi: 10.1093/bioinformatics/btn435. Epub 2008 Aug 30.
10
Identification of deleterious non-synonymous single nucleotide polymorphisms using sequence-derived information.利用序列衍生信息鉴定有害非同义单核苷酸多态性
BMC Bioinformatics. 2008 Jun 27;9:297. doi: 10.1186/1471-2105-9-297.