• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

狗和人类直系同源、旁系同源及保守共线性的系统发育重建。

Phylogenetic reconstruction of orthology, paralogy, and conserved synteny for dog and human.

作者信息

Goodstadt Leo, Ponting Chris P

机构信息

Medical Research Council Functional Genetics Unit, University of Oxford, Department of Physiology, Anatomy, and Genetics, Oxford, United Kingdom.

出版信息

PLoS Comput Biol. 2006 Sep 29;2(9):e133. doi: 10.1371/journal.pcbi.0020133.

DOI:10.1371/journal.pcbi.0020133
PMID:17009864
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1584324/
Abstract

Accurate predictions of orthology and paralogy relationships are necessary to infer human molecular function from experiments in model organisms. Previous genome-scale approaches to predicting these relationships have been limited by their use of protein similarity and their failure to take into account multiple splicing events and gene prediction errors. We have developed PhyOP, a new phylogenetic orthology prediction pipeline based on synonymous rate estimates, which accurately predicts orthology and paralogy relationships for transcripts, genes, exons, or genomic segments between closely related genomes. We were able to identify orthologue relationships to human genes for 93% of all dog genes from Ensembl. Among 1:1 orthologues, the alignments covered a median of 97.4% of protein sequences, and 92% of orthologues shared essentially identical gene structures. PhyOP accurately recapitulated genomic maps of conserved synteny. Benchmarking against predictions from Ensembl and Inparanoid showed that PhyOP is more accurate, especially in its predictions of paralogy. Nearly half (46%) of PhyOP paralogy predictions are unique. Using PhyOP to investigate orthologues and paralogues in the human and dog genomes, we found that the human assembly contains 3-fold more gene duplications than the dog. Species-specific duplicate genes, or "in-paralogues," are generally shorter and have fewer exons than 1:1 orthologues, which is consistent with selective constraints and mutation biases based on the sizes of duplicated genes. In-paralogues have experienced elevated amino acid and synonymous nucleotide substitution rates. Duplicates possess similar biological functions for either the dog or human lineages. Having accounted for 2,954 likely pseudogenes and gene fragments, and after separating 346 erroneously merged genes, we estimated that the human genome encodes a minimum of 19,700 protein-coding genes, similar to the gene count of nematode worms. PhyOP is a fast and robust approach to orthology prediction that will be applicable to whole genomes from multiple closely related species. PhyOP will be particularly useful in predicting orthology for mammalian genomes that have been incompletely sequenced, and for large families of rapidly duplicating genes.

摘要

准确预测直系同源和旁系同源关系对于从模式生物实验中推断人类分子功能至关重要。先前用于预测这些关系的全基因组方法受到蛋白质相似性的限制,且未考虑多种剪接事件和基因预测错误。我们开发了PhyOP,一种基于同义速率估计的新的系统发育直系同源预测流程,它能准确预测密切相关基因组之间转录本、基因、外显子或基因组片段的直系同源和旁系同源关系。我们能够为Ensembl中93%的犬类基因鉴定出与人基因的直系同源关系。在1:1直系同源物中,比对覆盖蛋白质序列的中位数为97.4%,92%的直系同源物具有基本相同的基因结构。PhyOP准确地重现了保守共线性的基因组图谱。与Ensembl和Inparanoid的预测结果进行基准测试表明,PhyOP更准确,尤其是在旁系同源预测方面。PhyOP近一半(46%)的旁系同源预测是独特的。使用PhyOP研究人类和犬类基因组中的直系同源物和旁系同源物,我们发现人类基因组中的基因重复比犬类多3倍。物种特异性重复基因,即“内部旁系同源物”,通常比1:1直系同源物更短且外显子更少,这与基于重复基因大小的选择限制和突变偏差一致。内部旁系同源物经历了更高的氨基酸和同义核苷酸替换率。重复基因在犬类或人类谱系中具有相似的生物学功能。在考虑了2954个可能的假基因和基因片段,并分离了346个错误合并的基因后,我们估计人类基因组至少编码19700个蛋白质编码基因,与线虫的基因数量相似。PhyOP是一种快速且稳健的直系同源预测方法,将适用于多个密切相关物种的全基因组。PhyOP在预测未完全测序的哺乳动物基因组以及快速复制的基因大家族的直系同源关系方面将特别有用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/714a17f8d953/pcbi.0020133.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/a0a7fc0f6100/pcbi.0020133.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/e72257374c32/pcbi.0020133.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/3ee2931ab543/pcbi.0020133.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/ec6d62f76560/pcbi.0020133.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/196bec1aed3c/pcbi.0020133.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/154b447cda63/pcbi.0020133.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/5bff3ad428ce/pcbi.0020133.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/ca3934597f1d/pcbi.0020133.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/b593dc8c8459/pcbi.0020133.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/714a17f8d953/pcbi.0020133.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/a0a7fc0f6100/pcbi.0020133.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/e72257374c32/pcbi.0020133.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/3ee2931ab543/pcbi.0020133.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/ec6d62f76560/pcbi.0020133.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/196bec1aed3c/pcbi.0020133.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/154b447cda63/pcbi.0020133.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/5bff3ad428ce/pcbi.0020133.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/ca3934597f1d/pcbi.0020133.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/b593dc8c8459/pcbi.0020133.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ae7/1584324/714a17f8d953/pcbi.0020133.g010.jpg

相似文献

1
Phylogenetic reconstruction of orthology, paralogy, and conserved synteny for dog and human.狗和人类直系同源、旁系同源及保守共线性的系统发育重建。
PLoS Comput Biol. 2006 Sep 29;2(9):e133. doi: 10.1371/journal.pcbi.0020133.
2
MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score.MetaPhOrs:使用基于一致性的置信分数,从多种系统发育证据预测直系同源和旁系同源。
Nucleic Acids Res. 2011 Mar;39(5):e32. doi: 10.1093/nar/gkq953. Epub 2010 Dec 11.
3
The evolution of protostome GATA factors: molecular phylogenetics, synteny, and intron/exon structure reveal orthologous relationships.原口动物GATA因子的进化:分子系统发育、共线性和内含子/外显子结构揭示直系同源关系。
BMC Evol Biol. 2008 Apr 15;8:112. doi: 10.1186/1471-2148-8-112.
4
A universal genomic coordinate translator for comparative genomics.用于比较基因组学的通用基因组坐标转换器。
BMC Bioinformatics. 2014 Jun 30;15:227. doi: 10.1186/1471-2105-15-227.
5
PhylomeDB v3.0: an expanding repository of genome-wide collections of trees, alignments and phylogeny-based orthology and paralogy predictions.系统发育基因组数据库v3.0:一个不断扩展的全基因组树集合、比对以及基于系统发育的直系同源和旁系同源预测的知识库。
Nucleic Acids Res. 2011 Jan;39(Database issue):D556-60. doi: 10.1093/nar/gkq1109. Epub 2010 Nov 12.
6
The dog genome: survey sequencing and comparative analysis.犬类基因组:测序与比较分析
Science. 2003 Sep 26;301(5641):1898-903. doi: 10.1126/science.1086432.
7
Inferring Orthology and Paralogy.推断直系同源和旁系同源关系。
Methods Mol Biol. 2019;1910:149-175. doi: 10.1007/978-1-4939-9074-0_5.
8
POInT: orthology prediction and synteny exploration for paleopolyploid genomes.POInT:古多倍体基因组的直系同源预测和同线性探索。
BMC Bioinformatics. 2023 Apr 27;24(1):174. doi: 10.1186/s12859-023-05298-w.
9
Orthology and paralogy constraints: satisfiability and consistency.直系同源和旁系同源约束:可满足性和一致性。
BMC Genomics. 2014;15 Suppl 6(Suppl 6):S12. doi: 10.1186/1471-2164-15-S6-S12. Epub 2014 Oct 17.
10
Identification of mammalian orthologs using local synteny.利用局部同线性鉴定哺乳动物的直系同源基因。
BMC Genomics. 2009 Dec 23;10:630. doi: 10.1186/1471-2164-10-630.

引用本文的文献

1
Gene fusion as an important mechanism to generate new genes in the genus Oryza.基因融合作为在稻属中产生新基因的重要机制。
Genome Biol. 2022 Jun 15;23(1):130. doi: 10.1186/s13059-022-02696-w.
2
Between the Devil and the Deep Blue Sea: Non-Coding RNAs Associated with Transmissible Cancers in Tasmanian Devil, Domestic Dog and Bivalves.进退两难:与袋獾、家犬和双壳贝类可传播癌症相关的非编码RNA
Noncoding RNA. 2021 Nov 10;7(4):72. doi: 10.3390/ncrna7040072.
3
SwiftOrtho: A fast, memory-efficient, multiple genome orthology classifier.

本文引用的文献

1
Bias of selection on human copy-number variants.人类拷贝数变异的选择偏差。
PLoS Genet. 2006 Feb;2(2):e20. doi: 10.1371/journal.pgen.0020020. Epub 2006 Feb 17.
2
Genome-wide identification of human functional DNA using a neutral indel model.使用中性插入缺失模型对人类功能性DNA进行全基因组鉴定。
PLoS Comput Biol. 2006 Jan;2(1):e5. doi: 10.1371/journal.pcbi.0020005. Epub 2006 Jan 13.
3
Tandem chimerism as a means to increase protein complexity in the human genome.串联嵌合体作为一种增加人类基因组中蛋白质复杂性的手段。
SwiftOrtho:一种快速、内存高效、多基因组同源分类器。
Gigascience. 2019 Oct 1;8(10). doi: 10.1093/gigascience/giz118.
4
Aequatus: an open-source homology browser.Aequatus:一个开源同源浏览器。
Gigascience. 2018 Nov 1;7(11):giy128. doi: 10.1093/gigascience/giy128.
5
Loose ends: almost one in five human genes still have unresolved coding status.未解之谜:近五分之一的人类基因仍存在未解决的编码状态。
Nucleic Acids Res. 2018 Aug 21;46(14):7070-7084. doi: 10.1093/nar/gky587.
6
Genome-Guided Phylo-Transcriptomic Methods and the Nuclear Phylogentic Tree of the Paniceae Grasses.基因组指导的系统发育转录组学方法及禾本科 Paniceae 族的核系统发育树。
Sci Rep. 2017 Oct 19;7(1):13528. doi: 10.1038/s41598-017-13236-z.
7
Genome-wide association study to identify potential genetic modifiers in a canine model for Duchenne muscular dystrophy.全基因组关联研究,以确定杜兴氏肌营养不良犬模型中的潜在基因修饰因子。
BMC Genomics. 2016 Aug 22;17(1):665. doi: 10.1186/s12864-016-2948-z.
8
A Genomic Analysis of Factors Driving lincRNA Diversification: Lessons from Plants.驱动长链非编码RNA多样化的因素的基因组分析:来自植物的经验教训。
G3 (Bethesda). 2016 Sep 8;6(9):2881-91. doi: 10.1534/g3.116.030338.
9
Inferring Orthologs: Open Questions and Perspectives.推断直系同源基因:未解决的问题与展望
Genomics Insights. 2016 Feb 25;9:17-28. doi: 10.4137/GEI.S37925. eCollection 2016.
10
Stressing out over long noncoding RNA.对长链非编码RNA感到焦虑。
Biochim Biophys Acta. 2016 Jan;1859(1):184-91. doi: 10.1016/j.bbagrm.2015.06.010. Epub 2015 Jul 2.
Genome Res. 2006 Jan;16(1):37-44. doi: 10.1101/gr.4145906. Epub 2005 Dec 12.
4
Transcription-mediated gene fusion in the human genome.人类基因组中的转录介导基因融合
Genome Res. 2006 Jan;16(1):30-6. doi: 10.1101/gr.4137606. Epub 2005 Dec 12.
5
Abundant novel transcriptional units and unconventional gene pairs on human chromosome 22.人类22号染色体上丰富的新型转录单元和非常规基因对。
Genome Res. 2006 Jan;16(1):45-54. doi: 10.1101/gr.3883606. Epub 2005 Dec 12.
6
Selection in favor of nucleotides G and C diversifies evolution rates and levels of polymorphism at mammalian synonymous sites.对核苷酸G和C的选择使哺乳动物同义位点的进化速率和多态性水平多样化。
J Theor Biol. 2006 Jun 21;240(4):616-26. doi: 10.1016/j.jtbi.2005.10.020. Epub 2005 Dec 15.
7
Genome sequence, comparative analysis and haplotype structure of the domestic dog.家犬的基因组序列、比较分析及单倍型结构
Nature. 2005 Dec 8;438(7069):803-19. doi: 10.1038/nature04338.
8
An initial strategy for the systematic identification of functional elements in the human genome by low-redundancy comparative sequencing.一种通过低冗余度比较测序对人类基因组中的功能元件进行系统鉴定的初步策略。
Proc Natl Acad Sci U S A. 2005 Mar 29;102(13):4795-800. doi: 10.1073/pnas.0409882102. Epub 2005 Mar 18.
9
Identification and analysis of alternative splicing events conserved in human and mouse.人类和小鼠中保守的可变剪接事件的鉴定与分析。
Proc Natl Acad Sci U S A. 2005 Feb 22;102(8):2850-5. doi: 10.1073/pnas.0409742102. Epub 2005 Feb 11.
10
Alternative splicing of conserved exons is frequently species-specific in human and mouse.保守外显子的可变剪接在人类和小鼠中通常具有物种特异性。
Trends Genet. 2005 Feb;21(2):73-7. doi: 10.1016/j.tig.2004.12.004.