• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CoExpPhylo——一种用于生物合成基因发现的新型流程。

CoExpPhylo - a novel pipeline for biosynthesis gene discovery.

作者信息

Grünig Nele, Pucker Boas

机构信息

Plant Biotechnology and Bioinformatics, Institute of Plant Biology & Braunschweig Integrated, Centre of Systems Biology (BRICS), TU Braunschweig, Braunschweig, Germany.

Institute for Cellular and Molecular Botany (IZMB), University of Bonn, Kirschallee 1, Bonn, 53115, Germany.

出版信息

BMC Genomics. 2025 Sep 22;26(1):807. doi: 10.1186/s12864-025-12061-3.

DOI:10.1186/s12864-025-12061-3
PMID:40983916
Abstract

BACKGROUND

The rapid advancement of sequencing technologies has drastically increased the availability of plant genomic and transcriptomic data, shifting the challenge from data generation to functional interpretation. Identifying genes involved in specialized metabolism remains difficult. While coexpression analysis is a widely used approach to identify genes acting in the same pathway or process, it has limitations, particularly in distinguishing genes coexpressed due to shared regulatory triggers from those directly involved in the same pathway. To enhance functional predictions, integrating phylogenetic analysis provides an additional layer of confidence by considering evolutionary conservation. Here, we introduce CoExpPhylo, a computational pipeline that systematically combines coexpression analysis and phylogenetics to identify candidate genes involved in specialized biosynthetic pathways across multiple species based on one to multiple bait gene candidates.

RESULTS

CoExpPhylo systematically integrates coexpression information and phylogenetic signals to identify candidate genes involved in specialized biosynthetic pathways. The pipeline consists of multiple computational steps: (1) species-specific coexpression analysis, (2) local sequence alignment to identify orthologs, (3) clustering of candidate genes into Orthologous Coexpressed Groups (OCGs), (4) functional annotation, (5) global sequence alignment, (6) phylogenetic tree generation, and optionally (7) visualization. The workflow is highly customizable, allowing users to adjust correlation thresholds, filtering parameters, and annotation sources. Benchmarking CoExpPhylo on multiple pathways, including the biosynthesis of anthocyanins, proanthocyanidins, and flavonols, as well as lutein and zeaxanthin, confirmed its ability to recover known genes while also suggesting novel candidates.

CONCLUSION

CoExpPhylo provides a systematic framework for identifying candidate genes involved in the specialized metabolism. By integrating coexpression data with phylogenetic clustering, it facilitates the discovery of both conserved and lineage-specific genes. The resulting OCGs offer a strong foundation for further experimental validation, bridging the gap between computational predictions and functional characterization. Future improvements, such as incorporating multi-species reference databases and refining clustering for large gene families, could further enhance its resolution. Overall, CoExpPhylo represents a valuable tool for accelerating pathway elucidation and advancing our understanding of specialized metabolism in plants.

摘要

背景

测序技术的迅速发展极大地增加了植物基因组和转录组数据的可得性,将挑战从数据生成转移到功能解读。鉴定参与特殊代谢的基因仍然具有难度。虽然共表达分析是一种广泛用于鉴定参与同一途径或过程的基因的方法,但它存在局限性,特别是在区分因共享调控触发因素而共表达的基因与直接参与同一途径的基因方面。为了增强功能预测,整合系统发育分析通过考虑进化保守性提供了额外的可信度。在此,我们介绍CoExpPhylo,这是一种计算流程,它系统地结合共表达分析和系统发育学,基于一到多个诱饵基因候选物来鉴定跨多个物种参与特殊生物合成途径的候选基因。

结果

CoExpPhylo系统地整合共表达信息和系统发育信号,以鉴定参与特殊生物合成途径的候选基因。该流程由多个计算步骤组成:(1)物种特异性共表达分析,(2)局部序列比对以鉴定直系同源物,(3)将候选基因聚类到直系同源共表达组(OCG)中,(4)功能注释,(5)全局序列比对,(6)系统发育树生成,以及可选的(7)可视化。该工作流程具有高度可定制性,允许用户调整相关性阈值、过滤参数和注释来源。在多个途径上对CoExpPhylo进行基准测试,包括花青素、原花青素和黄酮醇的生物合成,以及叶黄素和玉米黄质,证实了其能够找回已知基因,同时还能提出新的候选基因。

结论

CoExpPhylo为鉴定参与特殊代谢的候选基因提供了一个系统框架。通过将共表达数据与系统发育聚类相结合,它有助于发现保守基因和谱系特异性基因。所得的OCG为进一步的实验验证提供了坚实基础,弥合了计算预测与功能表征之间的差距。未来的改进,如纳入多物种参考数据库和完善对大型基因家族的聚类,可能会进一步提高其分辨率。总体而言,CoExpPhylo是加速途径阐明和推进我们对植物特殊代谢理解的有价值工具。

相似文献

1
CoExpPhylo - a novel pipeline for biosynthesis gene discovery.CoExpPhylo——一种用于生物合成基因发现的新型流程。
BMC Genomics. 2025 Sep 22;26(1):807. doi: 10.1186/s12864-025-12061-3.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
4
Post-pandemic planning for maternity care for local, regional, and national maternity systems across the four nations: a mixed-methods study.针对四个地区的地方、区域和国家孕产妇保健系统的疫情后规划:一项混合方法研究。
Health Soc Care Deliv Res. 2025 Sep;13(35):1-25. doi: 10.3310/HHTE6611.
5
Short-Term Memory Impairment短期记忆障碍
6
Exploring the impact of housing insecurity on the health and well-being of children and young people: a systematic review.探索住房不安全对儿童和年轻人健康与福祉的影响:一项系统综述。
Public Health Res (Southampt). 2023 Dec;11(13):1-71. doi: 10.3310/TWWL4501.
7
Genetic determinants of testicular sperm extraction outcomes: insights from a large multicentre study of men with non-obstructive azoospermia.睾丸精子提取结果的遗传决定因素:来自一项针对非梗阻性无精子症男性的大型多中心研究的见解
Hum Reprod Open. 2025 Aug 29;2025(3):hoaf049. doi: 10.1093/hropen/hoaf049. eCollection 2025.
8
MEANtools integrates multi-omics data to identify metabolites and predict biosynthetic pathways.MEANtools整合多组学数据以识别代谢物并预测生物合成途径。
PLoS Biol. 2025 Jul 28;23(7):e3003307. doi: 10.1371/journal.pbio.3003307. eCollection 2025 Jul.
9
Incentives for preventing smoking in children and adolescents.预防儿童和青少年吸烟的激励措施。
Cochrane Database Syst Rev. 2017 Jun 6;6(6):CD008645. doi: 10.1002/14651858.CD008645.pub3.
10
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

本文引用的文献

1
Phylogenomics and metabolic engineering reveal a conserved gene cluster in Solanaceae plants for withanolide biosynthesis.系统发育基因组学和代谢工程揭示了茄科植物中一个用于茄烷型内酯生物合成的保守基因簇。
Nat Commun. 2025 Jul 10;16(1):6367. doi: 10.1038/s41467-025-61686-1.
2
Metabolic fingerprinting reveals roles of Arabidopsis thaliana BGLU1, BGLU3, and BGLU4 in glycosylation of various flavonoids.代谢指纹图谱揭示了拟南芥BGLU1、BGLU3和BGLU4在各种黄酮类化合物糖基化中的作用。
Phytochemistry. 2025 Mar;231:114338. doi: 10.1016/j.phytochem.2024.114338. Epub 2024 Nov 26.
3
Conserved amino acid residues and gene expression patterns associated with the substrate preferences of the competing enzymes FLS and DFR.
与竞争酶 FLS 和 DFR 的底物偏好相关的保守氨基酸残基和基因表达模式。
PLoS One. 2024 Aug 28;19(8):e0305837. doi: 10.1371/journal.pone.0305837. eCollection 2024.
4
Cell type-specific control and post-translational regulation of specialized metabolism: opening new avenues for plant metabolic engineering.细胞类型特异性控制和特化代谢的翻译后调控:为植物代谢工程开辟新途径。
Curr Opin Plant Biol. 2024 Oct;81:102575. doi: 10.1016/j.pbi.2024.102575. Epub 2024 Jun 19.
5
Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool.交互式生命树 (iTOL) v6:系统发育树显示和注释工具的最新更新。
Nucleic Acids Res. 2024 Jul 5;52(W1):W78-W82. doi: 10.1093/nar/gkae268.
6
OsNAC120 balances plant growth and drought tolerance by integrating GA and ABA signaling in rice.OsNAC120通过整合水稻中的赤霉素(GA)和脱落酸(ABA)信号来平衡植物生长和耐旱性。
Plant Commun. 2024 Mar 11;5(3):100782. doi: 10.1016/j.xplc.2023.100782. Epub 2023 Dec 26.
7
KIPEs3: Automatic annotation of biosynthesis pathways.KIPEs3:生物合成途径的自动注释。
PLoS One. 2023 Nov 16;18(11):e0294342. doi: 10.1371/journal.pone.0294342. eCollection 2023.
8
Multiple mechanisms explain loss of anthocyanins from betalain-pigmented Caryophyllales, including repeated wholesale loss of a key anthocyanidin synthesis enzyme.多种机制解释了含甜菜红素的石竹目植物中花色苷的丢失,包括关键花色苷合成酶的大量重复丢失。
New Phytol. 2024 Jan;241(1):471-489. doi: 10.1111/nph.19341. Epub 2023 Oct 28.
9
The catalytic role of glutathione transferases in heterologous anthocyanin biosynthesis.谷胱甘肽转移酶在异源花青素生物合成中的催化作用。
Nat Catal. 2023;6(10):927-938. doi: 10.1038/s41929-023-01018-y. Epub 2023 Aug 31.
10
Flavonols affect the interrelated glucosinolate and camalexin biosynthetic pathways in Arabidopsis thaliana.类黄酮影响拟南芥中相互关联的硫代葡萄糖苷和卡玛烯生物合成途径。
J Exp Bot. 2024 Jan 1;75(1):219-240. doi: 10.1093/jxb/erad391.