• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过利用偏向性替代模式识别新型受限元件。

Identifying novel constrained elements by exploiting biased substitution patterns.

作者信息

Garber Manuel, Guttman Mitchell, Clamp Michele, Zody Michael C, Friedman Nir, Xie Xiaohui

机构信息

Department of Biology, Broad Institute of MIT and Harvard, 7 Cambridge Center, MIT, Cambridge, MA 02142, USA.

出版信息

Bioinformatics. 2009 Jun 15;25(12):i54-62. doi: 10.1093/bioinformatics/btp190.

DOI:10.1093/bioinformatics/btp190
PMID:19478016
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2687944/
Abstract

MOTIVATION

Comparing the genomes from closely related species provides a powerful tool to identify functional elements in a reference genome. Many methods have been developed to identify conserved sequences across species; however, existing methods only model conservation as a decrease in the rate of mutation and have ignored selection acting on the pattern of mutations.

RESULTS

We present a new approach that takes advantage of deeply sequenced clades to identify evolutionary selection by uncovering not only signatures of rate-based conservation but also substitution patterns characteristic of sequence undergoing natural selection. We describe a new statistical method for modeling biased nucleotide substitutions, a learning algorithm for inferring site-specific substitution biases directly from sequence alignments and a hidden Markov model for detecting constrained elements characterized by biased substitutions. We show that the new approach can identify significantly more degenerate constrained sequences than rate-based methods. Applying it to the ENCODE regions, we identify as much as 10.2% of these regions are under selection.

AVAILABILITY

The algorithms are implemented in a Java software package, called SiPhy, freely available at http://www.broadinstitute.org/science/software/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

比较亲缘关系密切的物种的基因组为识别参考基因组中的功能元件提供了一个强大的工具。已经开发了许多方法来识别跨物种的保守序列;然而,现有方法仅将保守性建模为突变率的降低,而忽略了作用于突变模式的选择。

结果

我们提出了一种新方法,该方法利用深度测序的进化枝来识别进化选择,不仅揭示基于速率的保守性特征,还揭示经历自然选择的序列的替代模式特征。我们描述了一种用于对有偏核苷酸替代进行建模的新统计方法、一种用于直接从序列比对中推断位点特异性替代偏差的学习算法以及一种用于检测以有偏替代为特征的受限元件的隐马尔可夫模型。我们表明,新方法能够比基于速率的方法识别出更多的简并受限序列。将其应用于ENCODE区域,我们发现这些区域中多达10.2%处于选择之下。

可用性

这些算法在一个名为SiPhy的Java软件包中实现,可从http://www.broadinstitute.org/science/software/免费获取。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed87/2687944/378a5258bad7/btp190f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed87/2687944/60bc2cbf6285/btp190f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed87/2687944/493756a0f99c/btp190f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed87/2687944/85d36e32acbf/btp190f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed87/2687944/378a5258bad7/btp190f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed87/2687944/60bc2cbf6285/btp190f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed87/2687944/493756a0f99c/btp190f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed87/2687944/85d36e32acbf/btp190f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed87/2687944/378a5258bad7/btp190f4.jpg

相似文献

1
Identifying novel constrained elements by exploiting biased substitution patterns.通过利用偏向性替代模式识别新型受限元件。
Bioinformatics. 2009 Jun 15;25(12):i54-62. doi: 10.1093/bioinformatics/btp190.
2
Identifying a high fraction of the human genome to be under selective constraint using GERP++.使用 GERP++ 鉴定人类基因组中受到选择压力的部分。
PLoS Comput Biol. 2010 Dec 2;6(12):e1001025. doi: 10.1371/journal.pcbi.1001025.
3
Detection of nonneutral substitution rates on mammalian phylogenies.检测哺乳动物系统发育上的非中性替代率。
Genome Res. 2010 Jan;20(1):110-21. doi: 10.1101/gr.097857.109. Epub 2009 Oct 26.
4
ESPERR: learning strong and weak signals in genomic sequence alignments to identify functional elements.ESPERR:在基因组序列比对中学习强弱信号以识别功能元件。
Genome Res. 2006 Dec;16(12):1596-604. doi: 10.1101/gr.4537706. Epub 2006 Oct 19.
5
Mulan: multiple-sequence local alignment and visualization for studying function and evolution.木兰:用于研究功能和进化的多序列局部比对与可视化
Genome Res. 2005 Jan;15(1):184-94. doi: 10.1101/gr.3007205. Epub 2004 Dec 8.
6
Genome-wide functional element detection using pairwise statistical alignment outperforms multiple genome footprinting techniques.使用成对统计比对进行全基因组功能元件检测优于多种基因组足迹技术。
Bioinformatics. 2010 Sep 1;26(17):2116-20. doi: 10.1093/bioinformatics/btq360. Epub 2010 Jul 7.
7
Sufficient statistics and expectation maximization algorithms in phylogenetic tree models.系统发育树模型中的充分统计量和期望最大化算法。
Bioinformatics. 2011 Sep 1;27(17):2346-53. doi: 10.1093/bioinformatics/btr420. Epub 2011 Jul 14.
8
Fast and accurate phylogeny reconstruction using filtered spaced-word matches.使用过滤后的间隔词匹配进行快速准确的系统发育重建。
Bioinformatics. 2017 Apr 1;33(7):971-979. doi: 10.1093/bioinformatics/btw776.
9
Accurate anchoring alignment of divergent sequences.发散序列的精确锚定比对。
Bioinformatics. 2006 Jan 1;22(1):29-34. doi: 10.1093/bioinformatics/bti772. Epub 2005 Nov 13.
10
Specific alignment of structured RNA: stochastic grammars and sequence annealing.结构化RNA的特定比对:随机语法与序列退火
Bioinformatics. 2008 Dec 1;24(23):2677-83. doi: 10.1093/bioinformatics/btn495. Epub 2008 Sep 16.

引用本文的文献

1
Evaluating variant pathogenicity prediction tools to establish African inclusive guidelines for germline genetic testing.评估变异致病性预测工具,以制定非洲包容性的种系基因检测指南。
Commun Med (Lond). 2025 May 6;5(1):157. doi: 10.1038/s43856-025-00883-x.
2
A series of reviews in familial cancer: genetic cancer risk in context variants of uncertain significance in MMR genes: which procedures should be followed?一系列关于家族性癌症的综述:错配修复(MMR)基因中意义不确定的背景变异的遗传性癌症风险:应遵循哪些程序?
Fam Cancer. 2025 May 3;24(2):42. doi: 10.1007/s10689-025-00470-y.
3
A Novel Bradycardia-Associated Variant in as a Candidate Modifier in Type 3 Long QT Syndrome: Case Report and Deep In Silico Analysis.

本文引用的文献

1
Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals.染色质特征揭示了哺乳动物中一千多种高度保守的大型非编码RNA。
Nature. 2009 Mar 12;458(7235):223-7. doi: 10.1038/nature07672. Epub 2009 Feb 1.
2
Probabilistic phylogenetic inference with insertions and deletions.带插入和缺失的概率系统发育推断
PLoS Comput Biol. 2008 Sep 19;4(9):e1000172. doi: 10.1371/journal.pcbi.1000172.
3
Approaches to comparative sequence analysis: towards a functional view of vertebrate genomes.比较序列分析方法:迈向脊椎动物基因组的功能视角
一种新型的与心动过缓相关的变异体作为3型长QT综合征的候选修饰因子:病例报告及深度计算机模拟分析
Biomedicines. 2025 Apr 21;13(4):1008. doi: 10.3390/biomedicines13041008.
4
Isolation and Characterization of the Adamantinomatous Craniopharyngioma Primary Cells with Cancer-Associated Fibroblast Features.具有癌相关成纤维细胞特征的造釉细胞瘤型颅咽管瘤原代细胞的分离与鉴定
Biomedicines. 2025 Apr 9;13(4):912. doi: 10.3390/biomedicines13040912.
5
Common -regulatory variation modifies the penetrance of pathogenic variants in craniofacial microsomia.常见调控变异改变了颅面短小畸形中致病变异的外显率。
Genome Res. 2025 May 2;35(5):1065-1079. doi: 10.1101/gr.280047.124.
6
REVEL Is Better at Predicting Pathogenicity of Loss-of-Function than Gain-of-Function Variants.REVEL在预测功能丧失型变异的致病性方面比功能获得型变异更出色。
Hum Mutat. 2023 Dec 4;2023:8857940. doi: 10.1155/2023/8857940. eCollection 2023.
7
Severe Joubert syndrome in family with homozygous POC1B p.Arg106Pro variant is due to a co-inherited deep-intronic mutation in the neighboring CEP290 gene.患有纯合子POC1B p.Arg106Pro变异的家族中的严重乔伯特综合征是由于相邻CEP290基因中共同遗传的深度内含子突变所致。
HGG Adv. 2025 Mar 31;6(3):100429. doi: 10.1016/j.xhgg.2025.100429.
8
Critical assessment of missense variant effect predictors on disease-relevant variant data.对疾病相关变异数据的错义变异效应预测因子的批判性评估。
Hum Genet. 2025 Mar;144(2-3):281-293. doi: 10.1007/s00439-025-02732-2. Epub 2025 Mar 21.
9
GoFCards: an integrated database and analytic platform for gain of function variants in humans.GoFCards:一个用于人类功能获得性变异的综合数据库和分析平台。
Nucleic Acids Res. 2025 Jan 6;53(D1):D976-D988. doi: 10.1093/nar/gkae1079.
10
Enhancing Missense Variant Pathogenicity Prediction with MissenseNet: Integrating Structural Insights and ShuffleNet-Based Deep Learning Techniques.利用 MissenseNet 增强错义变异致病性预测:整合结构见解和基于 ShuffleNet 的深度学习技术。
Biomolecules. 2024 Sep 2;14(9):1105. doi: 10.3390/biom14091105.
Nat Rev Genet. 2008 Apr;9(4):303-13. doi: 10.1038/nrg2185.
4
Analysis of sequence conservation at nucleotide resolution.在核苷酸分辨率下分析序列保守性。
PLoS Comput Biol. 2007 Dec;3(12):e254. doi: 10.1371/journal.pcbi.0030254. Epub 2007 Nov 14.
5
Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.利用进化特征在12个果蝇基因组中发现功能元件。
Nature. 2007 Nov 8;450(7167):219-32. doi: 10.1038/nature06340.
6
28-way vertebrate alignment and conservation track in the UCSC Genome Browser.加州大学圣克鲁兹分校基因组浏览器中的28种脊椎动物序列比对与保守性追踪。
Genome Res. 2007 Dec;17(12):1797-808. doi: 10.1101/gr.6761107. Epub 2007 Nov 5.
7
2x genomes--does depth matter?2倍基因组——深度重要吗?
Genome Res. 2007 Nov;17(11):1547-9. doi: 10.1101/gr.7050807.
8
Exact and heuristic algorithms for the Indel Maximum Likelihood Problem.用于插入缺失最大似然问题的精确算法和启发式算法。
J Comput Biol. 2007 May;14(4):446-61. doi: 10.1089/cmb.2007.A006.
9
Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.ENCODE试点项目对人类基因组1%的功能元件进行鉴定与分析。
Nature. 2007 Jun 14;447(7146):799-816. doi: 10.1038/nature05874.
10
Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome.对1%人类基因组的深度哺乳动物序列比对和约束预测分析。
Genome Res. 2007 Jun;17(6):760-74. doi: 10.1101/gr.6034307.