• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多种果蝇物种的全基因组系统发育树

Whole genome phylogenies for multiple Drosophila species.

作者信息

Seetharam Arun, Stuart Gary W

机构信息

Department of Biology, Indiana State University, Terre Haute, Indiana 47809, USA.

出版信息

BMC Res Notes. 2012 Dec 4;5:670. doi: 10.1186/1756-0500-5-670.

DOI:10.1186/1756-0500-5-670
PMID:23210901
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3531268/
Abstract

BACKGROUND

Reconstructing the evolutionary history of organisms using traditional phylogenetic methods may suffer from inaccurate sequence alignment. An alternative approach, particularly effective when whole genome sequences are available, is to employ methods that don't use explicit sequence alignments. We extend a novel phylogenetic method based on Singular Value Decomposition (SVD) to reconstruct the phylogeny of 12 sequenced Drosophila species. SVD analysis provides accurate comparisons for a high fraction of sequences within whole genomes without the prior identification of orthologs or homologous sites. With this method all protein sequences are converted to peptide frequency vectors within a matrix that is decomposed to provide simplified vector representations for each protein of the genome in a reduced dimensional space. These vectors are summed together to provide a vector representation for each species, and the angle between these vectors provides distance measures that are used to construct species trees.

RESULTS

An unfiltered whole genome analysis (193,622 predicted proteins) strongly supports the currently accepted phylogeny for 12 Drosophila species at higher dimensions except for the generally accepted but difficult to discern sister relationship between D. erecta and D. yakuba. Also, in accordance with previous studies, many sequences appear to support alternative phylogenies. In this case, we observed grouping of D. erecta with D. sechellia when approximately 55% to 95% of the proteins were removed using a filter based on projection values or by reducing resolution by using fewer dimensions. Similar results were obtained when just the melanogaster subgroup was analyzed.

CONCLUSIONS

These results indicate that using our novel phylogenetic method, it is possible to consult and interpret all predicted protein sequences within multiple whole genomes to produce accurate phylogenetic estimations of relatedness between Drosophila species. Furthermore, protein filtering can be effectively applied to reduce incongruence in the dataset as well as to generate alternative phylogenies.

摘要

背景

使用传统系统发育方法重建生物体的进化历史可能会受到不准确序列比对的影响。另一种方法,尤其是在有全基因组序列时特别有效,是采用不使用显式序列比对的方法。我们扩展了一种基于奇异值分解(SVD)的新型系统发育方法,以重建12个已测序果蝇物种的系统发育。SVD分析可在无需事先鉴定直系同源物或同源位点的情况下,对全基因组内的大部分序列进行准确比较。使用这种方法,所有蛋白质序列都被转换为矩阵内的肽频率向量,该矩阵被分解以在降维空间中为基因组中的每个蛋白质提供简化的向量表示。这些向量被加在一起,为每个物种提供一个向量表示,这些向量之间的夹角提供用于构建物种树的距离度量。

结果

未经过滤的全基因组分析(193,622个预测蛋白质)在较高维度上有力地支持了目前被接受的12个果蝇物种的系统发育,除了公认但难以区分的直翅果蝇和雅库布果蝇之间的姐妹关系。此外,与先前的研究一致,许多序列似乎支持替代系统发育。在这种情况下,当使用基于投影值的过滤器或通过使用较少维度降低分辨率去除约55%至95%的蛋白质时,我们观察到直翅果蝇与塞舌尔果蝇聚在一起。仅分析黑腹果蝇亚组时也获得了类似结果。

结论

这些结果表明,使用我们的新型系统发育方法,可以查阅和解释多个全基因组内的所有预测蛋白质序列,以对果蝇物种之间的亲缘关系进行准确的系统发育估计。此外,蛋白质过滤可以有效地应用于减少数据集中的不一致性,并生成替代系统发育。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/5db699c7583a/1756-0500-5-670-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/221aa39a8c86/1756-0500-5-670-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/314824e4d3cb/1756-0500-5-670-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/2360a00bb323/1756-0500-5-670-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/a03a24747927/1756-0500-5-670-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/fdb4ba093946/1756-0500-5-670-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/ac6317d606be/1756-0500-5-670-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/4ee85ca41ad6/1756-0500-5-670-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/5db699c7583a/1756-0500-5-670-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/221aa39a8c86/1756-0500-5-670-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/314824e4d3cb/1756-0500-5-670-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/2360a00bb323/1756-0500-5-670-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/a03a24747927/1756-0500-5-670-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/fdb4ba093946/1756-0500-5-670-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/ac6317d606be/1756-0500-5-670-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/4ee85ca41ad6/1756-0500-5-670-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c20/3531268/5db699c7583a/1756-0500-5-670-8.jpg

相似文献

1
Whole genome phylogenies for multiple Drosophila species.多种果蝇物种的全基因组系统发育树
BMC Res Notes. 2012 Dec 4;5:670. doi: 10.1186/1756-0500-5-670.
2
Integrated gene and species phylogenies from unaligned whole genome protein sequences.基于未比对的全基因组蛋白质序列构建的整合基因和物种系统发育树。
Bioinformatics. 2002 Jan;18(1):100-8. doi: 10.1093/bioinformatics/18.1.100.
3
Widespread discordance of gene trees with species tree in Drosophila: evidence for incomplete lineage sorting.果蝇中基因树与物种树的广泛不一致:不完全谱系分选的证据。
PLoS Genet. 2006 Oct 27;2(10):e173. doi: 10.1371/journal.pgen.0020173. Epub 2006 Aug 28.
4
A comprehensive vertebrate phylogeny using vector representations of protein sequences from whole genomes.一种利用全基因组蛋白质序列的向量表示构建的全面脊椎动物系统发育树。
Mol Biol Evol. 2002 Apr;19(4):554-62. doi: 10.1093/oxfordjournals.molbev.a004111.
5
An SVD-based comparison of nine whole eukaryotic genomes supports a coelomate rather than ecdysozoan lineage.基于奇异值分解对九个真核生物全基因组的比较支持体腔动物而非蜕皮动物谱系。
BMC Bioinformatics. 2004 Dec 17;5:204. doi: 10.1186/1471-2105-5-204.
6
Ancestral polymorphism and recent invasion of transposable elements in Drosophila species.果蝇属中转座元件的祖先多态性和近期入侵。
BMC Evol Biol. 2012 Jul 23;12:119. doi: 10.1186/1471-2148-12-119.
7
Genome BLAST distance phylogenies inferred from whole plastid and whole mitochondrion genome sequences.基于整个质体和整个线粒体基因组序列推断的基因组BLAST距离系统发育树。
BMC Bioinformatics. 2006 Jul 19;7:350. doi: 10.1186/1471-2105-7-350.
8
Evolutionary pattern of the gtwin retrotransposon in the Drosophila melanogaster subgroup.黑腹果蝇亚组中gtwin反转录转座子的进化模式。
Genetica. 2007 Jun;130(2):161-8. doi: 10.1007/s10709-006-9003-y. Epub 2006 Aug 4.
9
Evolution of genes and genomes on the Drosophila phylogeny.果蝇系统发育中基因和基因组的进化。
Nature. 2007 Nov 8;450(7167):203-18. doi: 10.1038/nature06341.
10
Inferring phylogenies from RAD sequence data.从 RAD 序列数据推断系统发育关系。
PLoS One. 2012;7(4):e33394. doi: 10.1371/journal.pone.0033394. Epub 2012 Apr 6.

引用本文的文献

1
Evolutionary Loss of Activity in De-Ubiquitylating Enzymes of the OTU Family.OTU家族去泛素化酶活性的进化丧失
PLoS One. 2015 Nov 20;10(11):e0143227. doi: 10.1371/journal.pone.0143227. eCollection 2015.
2
The phylogenetic utility and functional constraint of microRNA flanking sequences.微小RNA侧翼序列的系统发育效用和功能限制
Proc Biol Sci. 2015 Mar 22;282(1803):20142983. doi: 10.1098/rspb.2014.2983.

本文引用的文献

1
Using whole genome presence/absence data to untangle function in 12 Drosophila genomes.利用全基因组存在/缺失数据解析12种果蝇基因组的功能。
Fly (Austin). 2008 Nov-Dec;2(6):291-9. doi: 10.4161/fly.7481. Epub 2008 Nov 24.
2
Signature genes as a phylogenomic tool.作为系统基因组学工具的标志性基因。
Mol Biol Evol. 2008 Aug;25(8):1659-67. doi: 10.1093/molbev/msn115. Epub 2008 May 19.
3
Drosophila biology in the genomic age.基因组时代的果蝇生物学
Genetics. 2007 Nov;177(3):1269-76. doi: 10.1534/genetics.107.074112.
4
Gene family evolution across 12 Drosophila genomes.跨越12个果蝇基因组的基因家族进化。
PLoS Genet. 2007 Nov;3(11):e197. doi: 10.1371/journal.pgen.0030197.
5
Inferring genome-scale rearrangement phylogeny and ancestral gene order: a Drosophila case study.推断基因组规模的重排系统发育和祖先基因顺序:以果蝇为例的研究
Genome Biol. 2007;8(11):R236. doi: 10.1186/gb-2007-8-11-r236.
6
Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.利用进化特征在12个果蝇基因组中发现功能元件。
Nature. 2007 Nov 8;450(7167):219-32. doi: 10.1038/nature06340.
7
Evolution of genes and genomes on the Drosophila phylogeny.果蝇系统发育中基因和基因组的进化。
Nature. 2007 Nov 8;450(7167):203-18. doi: 10.1038/nature06341.
8
Widespread discordance of gene trees with species tree in Drosophila: evidence for incomplete lineage sorting.果蝇中基因树与物种树的广泛不一致:不完全谱系分选的证据。
PLoS Genet. 2006 Oct 27;2(10):e173. doi: 10.1371/journal.pgen.0020173. Epub 2006 Aug 28.
9
Phylogenetic incongruence in the Drosophila melanogaster species group.黑腹果蝇物种组中的系统发育不一致性。
Mol Phylogenet Evol. 2007 Jun;43(3):1138-50. doi: 10.1016/j.ympev.2006.09.002. Epub 2006 Sep 9.
10
Toward automatic reconstruction of a highly resolved tree of life.迈向高分辨率生命树的自动重建。
Science. 2006 Mar 3;311(5765):1283-7. doi: 10.1126/science.1123061.