• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于蛋白质的稳健系统发育谱聚类

Robust phylogenetic profile clustering for proteins.

作者信息

Harrison Paul M

机构信息

Department of Biology, McGill University, Montreal, Quebec, Canada.

出版信息

PeerJ. 2025 Apr 28;13:e19370. doi: 10.7717/peerj.19370. eCollection 2025.

DOI:10.7717/peerj.19370
PMID:40313378
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12045281/
Abstract

BACKGROUND

Genes are continually formed and lost as a genome evolves. However, new genes may tend to appear during specific evolutionary epochs rather than others, or disappear together in a more recent organismal clade. Methods to identify gene origination might simply use the last common ancestor to contain an ortholog as the putative gene origination point, or use a heuristic threshold that allows for a certain amount of missing orthologs in the cohort of species examined. Here, to avoid such issues, an alternative approach based on the clustering of phylogenetic profiles is applied, and the results are examined for any evidence of epochal trends in gene origination, and associated trends in specific sequence traits or functional associations.

METHODS

A phylogenetic profile is simply an array indicating the presence or absence of a gene in a list of species. These profiles were compared and clustered to discern patterns in gene occurrences across >800 fungal species, centering the analysis on the budding yeast .

RESULTS

Clear epochs of gene origination were observed linked to the last common ancestors of and , and also to and earlier ancestors. These trends are independent of the proteome and genome-assembly quality of the underlying data. Clusters of phylogenetic profiles demonstrated some significant functional associations, such as to cellular spore formation and chromosome segregation in genes originating in . The phylogenetic profile clustering analysis enabled detection of parameter-independent trends in intrinsic disorder, prion-like composition and gene uniqueness as a function of epochal gene age. For example: new proteins with prion-like domains have arisen at a similar rate for most of fungal evolution centred on ; the most proteins with mild intrinsic disorder have appeared during the early epoch rather than more recently, and very recently formed genes are the least likely to be single-copy (., 'unique' yeast proteins).

CONCLUSIONS

For individual proteins, the profile cluster data generated here are useful for investigating experimental hypotheses, since they provide evidence for functional linkages that have yet to be discerned.

摘要

背景

随着基因组的进化,基因不断地形成和丢失。然而,新基因可能倾向于在特定的进化时期而非其他时期出现,或者在更近的生物类群中一起消失。识别基因起源的方法可能只是简单地将包含直系同源基因的最后一个共同祖先作为假定的基因起源点,或者使用一个启发式阈值,该阈值允许在所研究的物种群体中有一定数量的缺失直系同源基因。在这里,为了避免这些问题,应用了一种基于系统发育谱聚类的替代方法,并检查结果是否有基因起源的时代趋势以及特定序列特征或功能关联的相关趋势的证据。

方法

系统发育谱简单来说就是一个数组,表明一个基因在一系列物种中存在或不存在。比较并聚类这些谱,以识别超过800种真菌物种中基因出现的模式,分析以芽殖酵母为中心。

结果

观察到与酿酒酵母和粟酒裂殖酵母的最后共同祖先以及与白色念珠菌和更早祖先相关的明显的基因起源时期。这些趋势与基础数据的蛋白质组和基因组组装质量无关。系统发育谱聚类显示出一些显著的功能关联,例如与起源于酿酒酵母的基因中的细胞孢子形成和染色体分离有关。系统发育谱聚类分析能够检测到作为时代基因年龄函数的内在无序、类朊病毒组成和基因独特性的与参数无关的趋势。例如:在以酿酒酵母为中心的大多数真菌进化过程中,具有类朊病毒结构域的新蛋白质以相似的速率出现;大多数具有轻度内在无序的蛋白质出现在早期酿酒酵母时期而非更近时期,并且最近形成的基因最不可能是单拷贝的(即“独特的”酵母蛋白质)。

结论

对于单个蛋白质,这里生成的谱聚类数据可用于研究实验假设,因为它们为尚未识别的功能联系提供了证据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/2de180fdc8f6/peerj-13-19370-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/62f226b4fc5a/peerj-13-19370-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/ea8afad32dbe/peerj-13-19370-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/94ed11d8f7c9/peerj-13-19370-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/af2c73a915f8/peerj-13-19370-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/2de180fdc8f6/peerj-13-19370-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/62f226b4fc5a/peerj-13-19370-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/ea8afad32dbe/peerj-13-19370-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/94ed11d8f7c9/peerj-13-19370-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/af2c73a915f8/peerj-13-19370-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc16/12045281/2de180fdc8f6/peerj-13-19370-g005.jpg

相似文献

1
Robust phylogenetic profile clustering for proteins.用于蛋白质的稳健系统发育谱聚类
PeerJ. 2025 Apr 28;13:e19370. doi: 10.7717/peerj.19370. eCollection 2025.
2
Phylogenetic portrait of the Saccharomyces cerevisiae functional genome.酿酒酵母功能基因组的系统发育图谱。
G3 (Bethesda). 2013 Aug 7;3(8):1335-40. doi: 10.1534/g3.113.006585.
3
Multi-gene phylogenetic analysis reveals that shochu-fermenting Saccharomyces cerevisiae strains form a distinct sub-clade of the Japanese sake cluster.多基因系统发育分析表明,烧酒发酵型酿酒酵母菌株形成了日本清酒簇中的一个独特亚分支。
Yeast. 2017 Oct;34(10):407-415. doi: 10.1002/yea.3243. Epub 2017 Aug 17.
4
De novo origination of a new protein-coding gene in Saccharomyces cerevisiae.酿酒酵母中新蛋白质编码基因的从头起源。
Genetics. 2008 May;179(1):487-96. doi: 10.1534/genetics.107.084491.
5
A putative scenario of how de novo protein-coding genes originate in the Saccharomyces cerevisiae lineage.酵母属中从头产生蛋白质编码基因的推测情景。
BMC Genomics. 2024 Sep 5;25(Suppl 3):834. doi: 10.1186/s12864-024-10669-5.
6
Emergence and evolution of yeast prion and prion-like proteins.酵母朊病毒和类朊病毒蛋白的出现与进化。
BMC Evol Biol. 2016 Jan 25;16:24. doi: 10.1186/s12862-016-0594-3.
7
Clade- and species-specific features of genome evolution in the Saccharomycetaceae.酵母科基因组进化的分支和物种特异性特征。
FEMS Yeast Res. 2015 Aug;15(5):fov035. doi: 10.1093/femsyr/fov035. Epub 2015 Jun 10.
8
Variable absorption of mutational trends by prion-forming domains during evolution.在进化过程中,朊病毒形成结构域对突变趋势的可变吸收。
PeerJ. 2020 Aug 6;8:e9669. doi: 10.7717/peerj.9669. eCollection 2020.
9
Conservation of Prion-Like Composition and Sequence in Prion-Formers and Prion-Like Proteins of .朊病毒形成蛋白及朊病毒样蛋白中朊病毒样组成和序列的保守性 。 (你提供的原文似乎不完整,最后的“of.”后面缺少具体内容)
Front Mol Biosci. 2019 Jul 11;6:54. doi: 10.3389/fmolb.2019.00054. eCollection 2019.
10
Evolution of budding yeast prion-determinant sequences across diverse fungi.不同真菌中出芽酵母朊病毒决定簇序列的进化
J Mol Biol. 2007 Apr 20;368(1):273-82. doi: 10.1016/j.jmb.2007.01.070. Epub 2007 Feb 3.

本文引用的文献

1
De Novo Emerged Gene Search in Eukaryotes with DENSE.从头开始在真核生物中搜索新兴基因 DENSE。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae159.
2
Ancestral Sequence Reconstruction as a Tool to Detect and Study De Novo Gene Emergence.祖先序列重建作为一种检测和研究新基因出现的工具。
Genome Biol Evol. 2024 Aug 5;16(8). doi: 10.1093/gbe/evae151.
3
Pathogenic strategies of during torpor and arousal of hibernating bats.在冬眠蝙蝠休眠和觉醒期间的致病策略。
Science. 2024 Jul 12;385(6705):194-200. doi: 10.1126/science.adn5606. Epub 2024 Jul 11.
4
Evolutionary Trajectories of New Duplicated and Putative De Novo Genes.新复制和推定从头基因的进化轨迹。
Mol Biol Evol. 2023 May 2;40(5). doi: 10.1093/molbev/msad098.
5
Uncovering gene-family founder events during major evolutionary transitions in animals, plants and fungi using GenEra.利用 GenEra 揭示动物、植物和真菌主要进化转变过程中的基因家族起源事件。
Genome Biol. 2023 Mar 24;24(1):54. doi: 10.1186/s13059-023-02895-z.
6
The Gene Ontology knowledgebase in 2023.2023 版基因本体论知识库。
Genetics. 2023 May 4;224(1). doi: 10.1093/genetics/iyad031.
7
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
8
BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes.BUSCO 更新:用于真核生物、原核生物和病毒基因组评分的新颖且简化的工作流程以及更广泛和更深的系统发育覆盖范围。
Mol Biol Evol. 2021 Sep 27;38(10):4647-4654. doi: 10.1093/molbev/msab199.
9
IUPred3: prediction of protein disorder enhanced with unambiguous experimental annotation and visualization of evolutionary conservation.IUPred3:利用明确的实验注释和进化保守性可视化增强的蛋白质无序性预测。
Nucleic Acids Res. 2021 Jul 2;49(W1):W297-W303. doi: 10.1093/nar/gkab408.
10
Universal and taxon-specific trends in protein sequences as a function of age.蛋白质序列随年龄变化的普遍和分类群特异性趋势。
Elife. 2021 Jan 8;10:e57347. doi: 10.7554/eLife.57347.