Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome.

Author information

Margulies Elliott H, Cooper Gregory M, Asimenos George, Thomas Daryl J, Dewey Colin N, Siepel Adam, Birney Ewan, Keefe Damian, Schwartz Ariel S, Hou Minmei, Taylor James, Nikolaev Sergey, Montoya-Burgos Juan I, Löytynoja Ari, Whelan Simon, Pardi Fabio, Massingham Tim, Brown James B, Bickel Peter, Holmes Ian, Mullikin James C, Ureta-Vidal Abel, Paten Benedict, Stone Eric A, Rosenbloom Kate R, Kent W James, Bouffard Gerard G, Guan Xiaobin, Hansen Nancy F, Idol Jacquelyn R, Maduro Valerie V B, Maskeri Baishali, McDowell Jennifer C, Park Morgan, Thomas Pamela J, Young Alice C, Blakesley Robert W, Muzny Donna M, Sodergren Erica, Wheeler David A, Worley Kim C, Jiang Huaiyang, Weinstock George M, Gibbs Richard A, Graves Tina, Fulton Robert, Mardis Elaine R, Wilson Richard K, Clamp Michele, Cuff James, Gnerre Sante, Jaffe David B, Chang Jean L, Lindblad-Toh Kerstin, Lander Eric S, Hinrichs Angie, Trumbower Heather, Clawson Hiram, Zweig Ann, Kuhn Robert M, Barber Galt, Harte Rachel, Karolchik Donna, Field Matthew A, Moore Richard A, Matthewson Carrie A, Schein Jacqueline E, Marra Marco A, Antonarakis Stylianos E, Batzoglou Serafim, Goldman Nick, Hardison Ross, Haussler David, Miller Webb, Pachter Lior, Green Eric D, Sidow Arend

Affiliation

Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Publication information

Genome Res. 2007 Jun;17(6):760-74. doi: 10.1101/gr.6034307.

DOI:10.1101/gr.6034307
PMID:17567995
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1891336/
Abstract

A key component of the ongoing ENCODE project involves rigorous comparative sequence analyses for the initially targeted 1% of the human genome. Here, we present orthologous sequence generation, alignment, and evolutionary constraint analyses of 23 mammalian species for all ENCODE targets. Alignments were generated using four different methods; comparisons of these methods reveal large-scale consistency but substantial differences in terms of small genomic rearrangements, sensitivity (sequence coverage), and specificity (alignment accuracy). We describe the quantitative and qualitative trade-offs concomitant with alignment method choice and the levels of technical error that need to be accounted for in applications that require multisequence alignments. Using the generated alignments, we identified constrained regions using three different methods. While the different constraint-detecting methods are in general agreement, there are important discrepancies relating to both the underlying alignments and the specific algorithms. However, by integrating the results across the alignments and constraint-detecting methods, we produced constraint annotations that were found to be robust based on multiple independent measures. Analyses of these annotations illustrate that most classes of experimentally annotated functional elements are enriched for constrained sequences; however, large portions of each class (with the exception of protein-coding sequences) do not overlap constrained regions. The latter elements might not be under primary sequence constraint, might not be constrained across all mammals, or might have expendable molecular functions. Conversely, 40% of the constrained sequences do not overlap any of the functional elements that have been experimentally identified. Together, these findings demonstrate and quantify how many genomic functional elements await basic molecular characterization.
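The closing statistic (the share of constrained sequence that overlaps no experimentally identified functional element) is an interval-overlap calculation. The sketch below is a minimal illustration of that arithmetic, not the authors' pipeline: it assumes half-open `(start, end)` coordinates on a single sequence, and the function names and example intervals are hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open [start, end) on one sequence (assumed convention)


def merge(intervals: List[Interval]) -> List[Interval]:
    """Sort intervals and merge any that overlap or touch."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def overlap_bases(a: List[Interval], b: List[Interval]) -> int:
    """Count the bases covered by both interval sets."""
    a, b = merge(a), merge(b)
    i = j = total = 0
    while i < len(a) and j < len(b):
        lo = max(a[i][0], b[j][0])
        hi = min(a[i][1], b[j][1])
        if lo < hi:
            total += hi - lo
        # Advance whichever interval ends first.
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1
    return total


def fraction_unannotated(constrained: List[Interval],
                         annotated: List[Interval]) -> float:
    """Fraction of constrained bases that overlap no annotated element."""
    covered = sum(end - start for start, end in merge(constrained))
    if covered == 0:
        return 0.0
    return 1.0 - overlap_bases(constrained, annotated) / covered


# Hypothetical toy data: 200 constrained bases, 60 of which are annotated.
constrained = [(0, 100), (200, 300)]
annotated = [(50, 150), (250, 260)]
print(fraction_unannotated(constrained, annotated))
```

Merging each set first prevents overlapping annotations from being counted twice, which matters when several classes of functional elements are pooled into one annotation set.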

Similar articles

1. Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome.
   Genome Res. 2007 Jun;17(6):760-74. doi: 10.1101/gr.6034307.
2. Distribution and intensity of constraint in mammalian genomic sequence.
   Genome Res. 2005 Jul;15(7):901-13. doi: 10.1101/gr.3577405. Epub 2005 Jun 17.
3. A genome alignment of 120 mammals highlights ultraconserved element variability and placenta-associated enhancers.
   Gigascience. 2020 Jan 1;9(1). doi: 10.1093/gigascience/giz159.
4. RepeatFiller newly identifies megabases of aligning repetitive sequences and improves annotations of conserved non-exonic elements.
   Gigascience. 2019 Nov 1;8(11). doi: 10.1093/gigascience/giz132.
5. Identifying a high fraction of the human genome to be under selective constraint using GERP++.
   PLoS Comput Biol. 2010 Dec 2;6(12):e1001025. doi: 10.1371/journal.pcbi.1001025.
6. FRESCo: finding regions of excess synonymous constraint in diverse viruses.
   Genome Biol. 2015 Feb 17;16(1):38. doi: 10.1186/s13059-015-0603-7.
7. 8.2% of the Human genome is constrained: variation in rates of turnover across functional element classes in the human lineage.
   PLoS Genet. 2014 Jul 24;10(7):e1004525. doi: 10.1371/journal.pgen.1004525. eCollection 2014 Jul.
8. Use of long sequence alignments to study the evolution and regulation of mammalian globin gene clusters.
   Mol Biol Evol. 1993 Jan;10(1):73-102. doi: 10.1093/oxfordjournals.molbev.a039991.
9. Trade-offs in detecting evolutionarily constrained sequence by comparative genomics.
   Annu Rev Genomics Hum Genet. 2005;6:143-64. doi: 10.1146/annurev.genom.6.080604.162146.
10. A high-resolution map of human evolutionary constraint using 29 mammals.
    Nature. 2011 Oct 12;478(7370):476-82. doi: 10.1038/nature10530.

Cited by

1. Structural Analysis of Breast-Milk α-Casein: An α-Helical Conformation Is Required for TLR4-Stimulation.
   Int J Mol Sci. 2024 Feb 1;25(3):1743. doi: 10.3390/ijms25031743.
2. Identification of constrained sequence elements across 239 primate genomes.
   Nature. 2024 Jan;625(7996):735-742. doi: 10.1038/s41586-023-06798-8. Epub 2023 Nov 29.
3. Increased oligodendrogenesis and myelination in the subventricular zone of aged mice and gray mouse lemurs.
   Stem Cell Reports. 2023 Feb 14;18(2):534-554. doi: 10.1016/j.stemcr.2022.12.015. Epub 2023 Jan 19.
4. Identification and characterization of constrained non-exonic bases lacking predictive epigenomic and transcription factor binding annotations.
   Nat Commun. 2020 Dec 2;11(1):6168. doi: 10.1038/s41467-020-19962-9.
5. Prioritizing sequence variants in conserved non-coding elements in the chicken genome using chCADD.
   PLoS Genet. 2020 Sep 23;16(9):e1009027. doi: 10.1371/journal.pgen.1009027. eCollection 2020 Sep.
6. Rate variation in the evolution of non-coding DNA associated with social evolution in bees.
   Philos Trans R Soc Lond B Biol Sci. 2019 Jul 22;374(1777):20180247. doi: 10.1098/rstb.2018.0247. Epub 2019 Jun 3.
7. Specification and formation of the neural crest: Perspectives on lineage segregation.
   Genesis. 2019 Jan;57(1):e23276. doi: 10.1002/dvg.23276. Epub 2019 Jan 15.
8. Nonhuman primate models of human viral infections.
   Nat Rev Immunol. 2018 Jun;18(6):390-404. doi: 10.1038/s41577-018-0005-7.
9. Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes.
   Genome Biol Evol. 2018 Feb 1;10(2):473-488. doi: 10.1093/gbe/evy006.
10. On causal roles and selected effects: our genome is mostly junk.
    BMC Biol. 2017 Dec 5;15(1):116. doi: 10.1186/s12915-017-0460-9.