• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于从头开始的 GO 分析,在三个模式植物二倍体基因组中挖掘非串联重复的功能簇。

Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes.

机构信息

CREA Research Centre for Genomics and Bioinformatics, Fiorenzuola d'Arda, Italy.

出版信息

PLoS One. 2020 Jun 19;15(6):e0234782. doi: 10.1371/journal.pone.0234782. eCollection 2020.

DOI:10.1371/journal.pone.0234782
PMID:32559249
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7304597/
Abstract

A functional Non-Tandem Duplicated Cluster (FNTDC) is a group of non-tandem-duplicated genes that are located closer than expected by mere chance and have a role in the same biological function. The identification of secondary-compounds-related FNTDC has gained increased interest in recent years, but little ab-initio attempts aiming to the identification of FNTDCs covering all biological functions, including primary metabolism compounds, have been carried out. We report an extensive FNTDC dataset accompanied by a detailed assessment on parameters used for genome scanning and their impact on FNTDC detection. We propose 70% identity and 70% alignment coverage as intermediate settings to exclude tandem duplicated genes and a dynamic scanning window of 24 genes. These settings were applied to rice, arabidopsis and grapevine genomes to call for FNTDCs. Besides the best-known secondary metabolism clusters, we identified many FNTDCs associated to primary metabolism ranging from macromolecules synthesis/editing, TOR signalling, ubiquitination, proton and electron transfer complexes. Using the intermediate FNTDC setting parameters (at P-value 1e-6), 130, 70 and 140 candidate FNTDCs were called in rice, arabidopsis and grapevine, respectively, and 20 to 30% of GO tags associated to called FNTDC were common among the 3 genomes. The datasets developed along with this work provide a rich framework for pinpointing candidate FNTDCs reflecting all GO-BP tags covering both primary and secondary metabolism with large macromolecular complexes/metabolons as the most represented FNTDCs. Noteworthy, several FNTDCs are tagged with GOs referring to organelle-targeted multi-enzyme complex, a finding that suggest the migration of endosymbiont gene chunks towards nuclei could be at the basis of these class of candidate FNTDCs. Most FNTDC appear to have evolved prior of genome duplication events. More than one-third of genes interspersed/adjacent to called FNTDCs lacked any functional annotation; however, their co-localization may provide hints towards a candidate biological role.

摘要

一个功能上非串联重复簇(FNTDC)是一组位于比仅仅通过偶然位置更接近的非串联重复基因,它们具有相同的生物学功能。近年来,人们对次生代谢产物相关的 FNTDC 的鉴定产生了越来越多的兴趣,但很少有针对包括初级代谢产物化合物在内的所有生物学功能的 FNTDC 鉴定的初步尝试。我们报告了一个广泛的 FNTDC 数据集,并对用于基因组扫描的参数及其对 FNTDC 检测的影响进行了详细评估。我们提出了 70%的身份和 70%的比对覆盖率作为排除串联重复基因的中间设置,并采用了 24 个基因的动态扫描窗口。这些设置被应用于水稻、拟南芥和葡萄基因组中,以确定 FNTDC。除了最著名的次生代谢物簇外,我们还鉴定了许多与初级代谢物相关的 FNTDC,这些 FNTDC 涉及大分子的合成/编辑、TOR 信号、泛素化、质子和电子转移复合物。使用中间 FNTDC 设置参数(在 P 值为 1e-6 时),在水稻、拟南芥和葡萄中分别调用了 130、70 和 140 个候选 FNTDC,在 3 个基因组中,与所调用的 FNTDC 相关的 20%至 30%的 GO 标签是共同的。随着这项工作的开展,数据集提供了一个丰富的框架,用于精确定位候选 FNTDC,反映了涵盖初级和次级代谢物的所有 GO-BP 标签,其中大型大分子复合物/代谢物是最具代表性的 FNTDC。值得注意的是,一些 FNTDC 被标记为细胞器靶向多酶复合物的 GO,这一发现表明,内共生体基因片段向细胞核的迁移可能是这些候选 FNTDC 类的基础。大多数 FNTDC 似乎是在基因组复制事件之前进化而来的。在调用的 FNTDC 之间散布/相邻的基因中,超过三分之一缺乏任何功能注释;然而,它们的共定位可能为候选生物学功能提供线索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/66000a1fa974/pone.0234782.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/4c4a448cd50b/pone.0234782.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/105c544f7603/pone.0234782.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/985e286c6c8a/pone.0234782.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/076fe48196df/pone.0234782.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/66000a1fa974/pone.0234782.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/4c4a448cd50b/pone.0234782.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/105c544f7603/pone.0234782.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/985e286c6c8a/pone.0234782.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/076fe48196df/pone.0234782.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0509/7304597/66000a1fa974/pone.0234782.g005.jpg

相似文献

1
Ab initio GO-based mining for non-tandem-duplicated functional clusters in three model plant diploid genomes.基于从头开始的 GO 分析,在三个模式植物二倍体基因组中挖掘非串联重复的功能簇。
PLoS One. 2020 Jun 19;15(6):e0234782. doi: 10.1371/journal.pone.0234782. eCollection 2020.
2
Generation, annotation, analysis and database integration of 16,500 white spruce EST clusters.16500个白云杉EST聚类的生成、注释、分析及数据库整合
BMC Genomics. 2005 Oct 19;6:144. doi: 10.1186/1471-2164-6-144.
3
Genome-wide analysis of basic/helix-loop-helix transcription factor family in rice and Arabidopsis.水稻和拟南芥中碱性/螺旋-环-螺旋转录因子家族的全基因组分析。
Plant Physiol. 2006 Aug;141(4):1167-84. doi: 10.1104/pp.106.080580.
4
Comparative genomic analysis of the WRKY III gene family in populus, grape, arabidopsis and rice.杨树、葡萄、拟南芥和水稻中WRKY III基因家族的比较基因组分析。
Biol Direct. 2015 Sep 8;10:48. doi: 10.1186/s13062-015-0076-3.
5
Genome-wide identification and characterization of CONSTANS-like gene family in radish (Raphanus sativus).萝卜(Raphanus sativus)中 CONSTANS 样基因家族的全基因组鉴定和特征分析。
PLoS One. 2018 Sep 24;13(9):e0204137. doi: 10.1371/journal.pone.0204137. eCollection 2018.
6
Genome-wide evolutionary characterization and expression analyses of major latex protein (MLP) family genes in Vitis vinifera.葡萄属植物主要乳蛋白(MLP)家族基因的全基因组进化特征分析及表达分析。
Mol Genet Genomics. 2018 Oct;293(5):1061-1075. doi: 10.1007/s00438-018-1440-7. Epub 2018 Apr 27.
7
Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.基于邻近基因的共表达分析预测拟南芥基因组中的操纵子样基因簇。
Gene. 2012 Jul 15;503(1):56-64. doi: 10.1016/j.gene.2012.04.043. Epub 2012 Apr 24.
8
Neutral invertases in grapevine and comparative analysis with Arabidopsis, poplar and rice.葡萄中的中性转化酶及其与拟南芥、杨树和水稻的比较分析。
Planta. 2008 Dec;229(1):129-42. doi: 10.1007/s00425-008-0815-0. Epub 2008 Sep 18.
9
Transcription factors in rice: a genome-wide comparative analysis between monocots and eudicots.水稻中的转录因子:单子叶植物和双子叶植物之间的全基因组比较分析
Plant Mol Biol. 2005 Sep;59(1):191-203. doi: 10.1007/s11103-005-6503-6.
10
Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication.古老重复基因对关键功能的缓冲作用可能导致被子植物基因组加倍的周期性。
Proc Natl Acad Sci U S A. 2006 Feb 21;103(8):2730-5. doi: 10.1073/pnas.0507782103. Epub 2006 Feb 8.

本文引用的文献

1
Durum wheat genome highlights past domestication signatures and future improvement targets.硬质小麦基因组揭示了过去的驯化特征和未来的改良目标。
Nat Genet. 2019 May;51(5):885-895. doi: 10.1038/s41588-019-0381-3. Epub 2019 Apr 8.
2
STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.STRING v11:具有增强覆盖范围的蛋白质-蛋白质相互作用网络,支持在全基因组实验数据集的功能发现。
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613. doi: 10.1093/nar/gky1131.
3
Metabolic Gene Clusters in Eukaryotes.
真核生物中的代谢基因簇。
Annu Rev Genet. 2018 Nov 23;52:159-183. doi: 10.1146/annurev-genet-120417-031237. Epub 2018 Sep 5.
4
Unravelling Protein-Protein Interaction Networks Linked to Aliphatic and Indole Glucosinolate Biosynthetic Pathways in Arabidopsis.解析拟南芥中与脂肪族和吲哚硫代葡萄糖苷生物合成途径相关的蛋白质-蛋白质相互作用网络
Front Plant Sci. 2017 Nov 29;8:2028. doi: 10.3389/fpls.2017.02028. eCollection 2017.
5
The PhytoClust tool for metabolic gene clusters discovery in plant genomes.用于在植物基因组中发现代谢基因簇的PhytoClust工具。
Nucleic Acids Res. 2017 Jul 7;45(12):7049-7063. doi: 10.1093/nar/gkx404.
6
antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification.antiSMASH 4.0——化学预测和基因簇边界识别的改进。
Nucleic Acids Res. 2017 Jul 3;45(W1):W36-W41. doi: 10.1093/nar/gkx319.
7
plantiSMASH: automated identification, annotation and expression analysis of plant biosynthetic gene clusters.plantSMASH:植物生物合成基因簇的自动识别、注释和表达分析。
Nucleic Acids Res. 2017 Jul 3;45(W1):W55-W63. doi: 10.1093/nar/gkx305.
8
Genome-Wide Prediction of Metabolic Enzymes, Pathways, and Gene Clusters in Plants.植物中代谢酶、代谢途径和基因簇的全基因组预测
Plant Physiol. 2017 Apr;173(4):2041-2059. doi: 10.1104/pp.16.01942. Epub 2017 Feb 22.
9
Plant metabolic clusters - from genetics to genomics.植物代谢簇——从遗传学到基因组学
New Phytol. 2016 Aug;211(3):771-89. doi: 10.1111/nph.13981. Epub 2016 Apr 26.
10
Delineation of metabolic gene clusters in plant genomes by chromatin signatures.通过染色质特征描绘植物基因组中的代谢基因簇。
Nucleic Acids Res. 2016 Mar 18;44(5):2255-65. doi: 10.1093/nar/gkw100. Epub 2016 Feb 18.