• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

代谢途径中频繁发生的基因融合事件的系统鉴定与分析。

Systematic identification and analysis of frequent gene fusion events in metabolic pathways.

作者信息

Henry Christopher S, Lerma-Ortiz Claudia, Gerdes Svetlana Y, Mullen Jeffrey D, Colasanti Ric, Zhukov Aleksey, Frelin Océane, Thiaville Jennifer J, Zallot Rémi, Niehaus Thomas D, Hasnain Ghulam, Conrad Neal, Hanson Andrew D, de Crécy-Lagard Valérie

机构信息

Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL, 60439, USA.

Computation Institute, The University of Chicago, Chicago, IL, 60637, USA.

出版信息

BMC Genomics. 2016 Jun 24;17:473. doi: 10.1186/s12864-016-2782-3.

DOI:10.1186/s12864-016-2782-3
PMID:27342196
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4921024/
Abstract

BACKGROUND

Gene fusions are the most powerful type of in silico-derived functional associations. However, many fusion compilations were made when <100 genomes were available, and algorithms for identifying fusions need updating to handle the current avalanche of sequenced genomes. The availability of a large fusion dataset would help probe functional associations and enable systematic analysis of where and why fusion events occur.

RESULTS

Here we present a systematic analysis of fusions in prokaryotes. We manually generated two training sets: (i) 121 fusions in the model organism Escherichia coli; (ii) 131 fusions found in B vitamin metabolism. These sets were used to develop a fusion prediction algorithm that captured the training set fusions with only 7 % false negatives and 50 % false positives, a substantial improvement over existing approaches. This algorithm was then applied to identify 3.8 million potential fusions across 11,473 genomes. The results of the analysis are available in a searchable database at http://modelseed.org/projects/fusions/ . A functional analysis identified 3,000 reactions associated with frequent fusion events and revealed areas of metabolism where fusions are particularly prevalent.

CONCLUSIONS

Customary definitions of fusions were shown to be ambiguous, and a stricter one was proposed. Exploring the genes participating in fusion events showed that they most commonly encode transporters, regulators, and metabolic enzymes. The major rationales for fusions between metabolic genes appear to be overcoming pathway bottlenecks, avoiding toxicity, controlling competing pathways, and facilitating expression and assembly of protein complexes. Finally, our fusion dataset provides powerful clues to decipher the biological activities of domains of unknown function.

摘要

背景

基因融合是计算机推导的功能关联中最强大的一种类型。然而,许多融合汇编是在可用基因组不足100个时完成的,识别融合的算法需要更新以处理当前大量的测序基因组。一个大型融合数据集的可用性将有助于探究功能关联,并能够对融合事件发生的位置和原因进行系统分析。

结果

在此,我们对原核生物中的融合进行了系统分析。我们手动生成了两个训练集:(i)模式生物大肠杆菌中的121个融合;(ii)在B族维生素代谢中发现的131个融合。这些集合用于开发一种融合预测算法,该算法捕获训练集融合时假阴性率仅为7%,假阳性率为50%,比现有方法有了显著改进。然后应用该算法在11473个基因组中识别出380万个潜在融合。分析结果可在http://modelseed.org/projects/fusions/的可搜索数据库中获取。功能分析确定了与频繁融合事件相关的3000个反应,并揭示了融合特别普遍的代谢区域。

结论

融合的传统定义被证明是模糊的,因此提出了一个更严格的定义。对参与融合事件的基因进行探索表明,它们最常编码转运蛋白、调节因子和代谢酶。代谢基因之间融合的主要原理似乎是克服途径瓶颈、避免毒性、控制竞争途径以及促进蛋白质复合物的表达和组装。最后,我们的融合数据集为破译未知功能结构域的生物学活性提供了有力线索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/d5c217c25b38/12864_2016_2782_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/2471a7ef5abe/12864_2016_2782_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/9b880868f392/12864_2016_2782_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/383ab213fa4a/12864_2016_2782_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/82d8680bda18/12864_2016_2782_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/3dcfba24d81c/12864_2016_2782_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/503cc1f5283b/12864_2016_2782_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/d5c217c25b38/12864_2016_2782_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/2471a7ef5abe/12864_2016_2782_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/9b880868f392/12864_2016_2782_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/383ab213fa4a/12864_2016_2782_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/82d8680bda18/12864_2016_2782_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/3dcfba24d81c/12864_2016_2782_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/503cc1f5283b/12864_2016_2782_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a02/4921024/d5c217c25b38/12864_2016_2782_Fig7_HTML.jpg

相似文献

1
Systematic identification and analysis of frequent gene fusion events in metabolic pathways.代谢途径中频繁发生的基因融合事件的系统鉴定与分析。
BMC Genomics. 2016 Jun 24;17:473. doi: 10.1186/s12864-016-2782-3.
2
The role of gene fusions in the evolution of metabolic pathways: the histidine biosynthesis case.基因融合在代谢途径进化中的作用:组氨酸生物合成案例
BMC Evol Biol. 2007 Aug 16;7 Suppl 2(Suppl 2):S4. doi: 10.1186/1471-2148-7-S2-S4.
3
Fifty years fused to lac.五十年与漆水融为一体。
Annu Rev Microbiol. 2013;67:1-19. doi: 10.1146/annurev-micro-092412-155732.
4
Tying the knot: occurrence and possible significance of gene fusions in plant metabolism and beyond.系紧纽带:植物代谢及其他领域中基因融合的发生及其可能的意义。
J Exp Bot. 2017 Jul 10;68(15):4029-4043. doi: 10.1093/jxb/erx152.
5
Improved detection of gene fusions by applying statistical methods reveals oncogenic RNA cancer drivers.通过应用统计方法提高基因融合的检测率,揭示致癌 RNA 癌症驱动因素。
Proc Natl Acad Sci U S A. 2019 Jul 30;116(31):15524-15533. doi: 10.1073/pnas.1900391116. Epub 2019 Jul 15.
6
Membrane topology of the Rickettsia prowazekii ATP/ADP translocase revealed by novel dual pho-lac reporters.新型双荧光素酶报告基因揭示的普氏立克次氏体ATP/ADP转位酶的膜拓扑结构
J Mol Biol. 1999 Jan 29;285(4):1503-13. doi: 10.1006/jmbi.1998.2412.
7
A substrate ambiguous enzyme facilitates genome reduction in an intracellular symbiont.一种底物模糊的酶促进细胞内共生体中的基因组缩减。
BMC Biol. 2014 Dec 20;12:110. doi: 10.1186/s12915-014-0110-4.
8
InFusion: Advancing Discovery of Fusion Genes and Chimeric Transcripts from Deep RNA-Sequencing Data.InFusion:从深度RNA测序数据中推进融合基因和嵌合转录本的发现
PLoS One. 2016 Dec 1;11(12):e0167417. doi: 10.1371/journal.pone.0167417. eCollection 2016.
9
An algorithm for the reduction of genome-scale metabolic network models to meaningful core models.一种将基因组规模代谢网络模型简化为有意义的核心模型的算法。
BMC Syst Biol. 2015 Aug 19;9:48. doi: 10.1186/s12918-015-0191-x.
10
DESHARKY: automatic design of metabolic pathways for optimal cell growth.DESHARKY:用于优化细胞生长的代谢途径自动设计。
Bioinformatics. 2008 Nov 1;24(21):2554-6. doi: 10.1093/bioinformatics/btn471. Epub 2008 Sep 5.

引用本文的文献

1
C11orf54 catalyzes L-xylulose formation in human metabolism.C11orf54在人体新陈代谢中催化L-木酮糖的形成。
Proc Natl Acad Sci U S A. 2025 Aug 5;122(31):e2506597122. doi: 10.1073/pnas.2506597122. Epub 2025 Jul 30.
2
Predicting Protein Function in the AI and Big Data Era.人工智能与大数据时代的蛋白质功能预测
Biochemistry. 2025 Jun 3;64(11):2345-2352. doi: 10.1021/acs.biochem.5c00186. Epub 2025 May 17.
3
Deciphering the Diversity in Bacterial Transporters That Salvage Queuosine Precursors.解析挽救queuosine前体的细菌转运蛋白的多样性

本文引用的文献

1
A family of metal-dependent phosphatases implicated in metabolite damage-control.一个与代谢物损伤控制有关的金属依赖性磷酸酶家族。
Nat Chem Biol. 2016 Aug;12(8):621-7. doi: 10.1038/nchembio.2108. Epub 2016 Jun 20.
2
Members of a Novel Kinase Family (DUF1537) Can Recycle Toxic Intermediates into an Essential Metabolite.一个新型激酶家族(DUF1537)的成员可将有毒中间体循环转化为一种必需代谢物。
ACS Chem Biol. 2016 Aug 19;11(8):2304-11. doi: 10.1021/acschembio.6b00279. Epub 2016 Jun 24.
3
Bacterial and plant HAD enzymes catalyse a missing phosphatase step in thiamin diphosphate biosynthesis.
Epigenomes. 2024 Apr 25;8(2):16. doi: 10.3390/epigenomes8020016.
4
Fusion/fission protein family identification in Archaea.古菌中融合/裂变蛋白家族的鉴定。
mSystems. 2024 Jun 18;9(6):e0094823. doi: 10.1128/msystems.00948-23. Epub 2024 May 3.
5
Alternate routes to mnmsU synthesis in Gram-positive bacteria.革兰氏阳性菌中 mnmsU 合成的替代途径。
J Bacteriol. 2024 Apr 18;206(4):e0045223. doi: 10.1128/jb.00452-23. Epub 2024 Mar 29.
6
The exceptional form and function of the giant bacterium Epulopiscium viviparus revolves around its sodium motive force.巨型细菌 Epulopiscium viviparus 的非凡形态和功能围绕着其钠离子动力展开。
Proc Natl Acad Sci U S A. 2023 Dec 26;120(52):e2306160120. doi: 10.1073/pnas.2306160120. Epub 2023 Dec 18.
7
Identification of gene fusion events in that encode chimeric proteins.鉴定编码嵌合蛋白的基因融合事件。
NAR Genom Bioinform. 2020 May 18;2(2):lqaa033. doi: 10.1093/nargab/lqaa033. eCollection 2020 Jun.
8
The evolution of metabolism: How to test evolutionary hypotheses at the genomic level.新陈代谢的进化:如何在基因组水平上检验进化假说。
Comput Struct Biotechnol J. 2020 Feb 20;18:482-500. doi: 10.1016/j.csbj.2020.02.009. eCollection 2020.
9
Multilayered horizontal operon transfers from bacteria reconstruct a thiamine salvage pathway in yeasts.多层水平操纵子从细菌转移重建酵母中的硫胺素回收途径。
Proc Natl Acad Sci U S A. 2019 Oct 29;116(44):22219-22228. doi: 10.1073/pnas.1909844116. Epub 2019 Oct 14.
10
Modeling genome-wide enzyme evolution predicts strong epistasis underlying catalytic turnover rates.基于全基因组酶进化模型预测催化周转率的强上位性。
Nat Commun. 2018 Dec 10;9(1):5270. doi: 10.1038/s41467-018-07649-1.
细菌和植物的HAD酶催化硫胺素二磷酸生物合成中缺失的磷酸酶步骤。
Biochem J. 2016 Jan 15;473(2):157-66. doi: 10.1042/BJ20150805. Epub 2015 Nov 4.
4
Genomic and experimental evidence for multiple metabolic functions in the RidA/YjgF/YER057c/UK114 (Rid) protein family.RidA/YjgF/YER057c/UK114(Rid)蛋白家族多种代谢功能的基因组及实验证据。
BMC Genomics. 2015 May 15;16(1):382. doi: 10.1186/s12864-015-1584-3.
5
DomSign: a top-down annotation pipeline to enlarge enzyme space in the protein universe.DomSign:一种自上而下的注释流程,用于拓展蛋白质世界中的酶空间。
BMC Bioinformatics. 2015 Mar 21;16:96. doi: 10.1186/s12859-015-0499-y.
6
Noncanonical coproporphyrin-dependent bacterial heme biosynthesis pathway that does not use protoporphyrin.不使用原卟啉的非经典粪卟啉依赖性细菌血红素生物合成途径。
Proc Natl Acad Sci U S A. 2015 Feb 17;112(7):2210-5. doi: 10.1073/pnas.1416285112. Epub 2015 Feb 2.
7
A directed-overflow and damage-control N-glycosidase in riboflavin biosynthesis.核黄素生物合成中一种定向溢流和损伤控制的N-糖苷酶。
Biochem J. 2015 Feb 15;466(1):137-45. doi: 10.1042/BJ20141237.
8
CDD: NCBI's conserved domain database.CDD:美国国家生物技术信息中心的保守结构域数据库。
Nucleic Acids Res. 2015 Jan;43(Database issue):D222-6. doi: 10.1093/nar/gku1221. Epub 2014 Nov 20.
9
STRING v10: protein-protein interaction networks, integrated over the tree of life.STRING v10:整合了整个生命之树的蛋白质-蛋白质相互作用网络。
Nucleic Acids Res. 2015 Jan;43(Database issue):D447-52. doi: 10.1093/nar/gku1003. Epub 2014 Oct 28.
10
Enzyme clustering accelerates processing of intermediates through metabolic channeling.酶的聚集通过代谢通道加速中间体的加工。
Nat Biotechnol. 2014 Oct;32(10):1011-8. doi: 10.1038/nbt.3018. Epub 2014 Sep 28.