• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

细菌调控网络比较基因组学转移方法的评估

Assessment of transfer methods for comparative genomics of regulatory networks in bacteria.

作者信息

Kılıç Sefa, Erill Ivan

机构信息

Department of Biological Sciences, University of Maryland Baltimore County (UMBC), Baltimore, MD, 21250, USA.

出版信息

BMC Bioinformatics. 2016 Aug 31;17 Suppl 8(Suppl 8):277. doi: 10.1186/s12859-016-1113-7.

DOI:10.1186/s12859-016-1113-7
PMID:27586594
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5009822/
Abstract

BACKGROUND

Comparative genomics can leverage the vast amount of available genomic sequences to reconstruct and analyze transcriptional regulatory networks in Bacteria, but the efficacy of this approach hinges on the ability to transfer regulatory network information from reference species to the genomes under analysis. Several methods have been proposed to transfer regulatory information between bacterial species, but the paucity and distributed nature of experimental information on bacterial transcriptional networks have prevented their systematic evaluation.

RESULTS

We report the compilation of a large catalog of transcription factor-binding sites across Bacteria and its use to systematically benchmark proposed transfer methods across pairs of bacterial species. We evaluate motif- and accuracy-based metrics to assess the results of regulatory network transfer and we identify the precision-recall area-under-the-curve as the best metric for this purpose due to the large class-imbalanced nature of the problem. Methods assuming conservation of the transcription factor-binding motif (motif-based) are shown to substantially outperform those assuming conservation of regulon composition (network-based), even though their efficiency can decrease sharply with increasing phylogenetic distance. Variations of the basic motif-based transfer method do not yield significant improvements in transfer accuracy. Our results indicate that detection of a large enough number of regulated orthologs is critical for network-based transfer methods, but that relaxing orthology requirements does not improve results. Using the transcriptional regulators LexA and Fur as case examples, we also show how DNA-binding domain sequence similarity can yield confounding results as an indicator of transfer efficiency for motif-based methods.

CONCLUSIONS

Counter to standard practice, our evaluation of metrics to assess the efficiency of methods for regulatory network information transfer reveals that the area under precision-recall (PR) curves is a more precise and informative metric than that of receiver-operating-characteristic (ROC) curves, confirming similar findings in other class-imbalanced settings. Our systematic assessment of transfer methods reveals that simple approaches to both motif- and network-based transfer of regulatory information provide equal or better results than more elaborate methods. We also show that there are not effective predictors of transfer efficacy, substantiating the long-standing practice of manual curation in comparative genomics analyses.

摘要

背景

比较基因组学可以利用大量现有的基因组序列来重建和分析细菌中的转录调控网络,但这种方法的有效性取决于将调控网络信息从参考物种转移到被分析基因组的能力。已经提出了几种在细菌物种之间转移调控信息的方法,但关于细菌转录网络的实验信息匮乏且分布分散,阻碍了对它们的系统评估。

结果

我们报告了一个涵盖细菌转录因子结合位点的大型目录的汇编,并利用它对成对细菌物种间提出的转移方法进行系统的基准测试。我们评估基于基序和准确性的指标来评估调控网络转移的结果,并且由于该问题存在大量类不平衡的性质,我们确定精确召回率曲线下面积是用于此目的的最佳指标。结果表明,尽管基于转录因子结合基序保守性的方法(基于基序的方法)的效率会随着系统发育距离的增加而急剧下降,但它们的表现明显优于基于操纵子组成保守性的方法(基于网络的方法)。基于基序的基本转移方法的变体在转移准确性方面没有显著提高。我们的结果表明,检测到足够数量的受调控直系同源物对于基于网络的转移方法至关重要,但放宽直系同源性要求并不会改善结果。以转录调节因子LexA和Fur为例,我们还展示了DNA结合结构域序列相似性作为基于基序方法转移效率的指标可能会产生混淆结果。

结论

与标准做法相反,我们对评估调控网络信息转移方法效率的指标进行的评估表明,精确召回率(PR)曲线下面积是比接受者操作特征(ROC)曲线更精确且更具信息量的指标,这证实了在其他类不平衡情况下的类似发现。我们对转移方法的系统评估表明,简单的基于基序和基于网络的调控信息转移方法比更复杂的方法能提供相同或更好的结果。我们还表明,不存在有效的转移效果预测指标,这证实了比较基因组学分析中长期以来的手动编辑做法合理性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11b7/5009822/0402e70a3d95/12859_2016_1113_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11b7/5009822/1a71bc7fc648/12859_2016_1113_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11b7/5009822/f1e37d98584e/12859_2016_1113_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11b7/5009822/8b4ccdcb0cc3/12859_2016_1113_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11b7/5009822/0402e70a3d95/12859_2016_1113_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11b7/5009822/1a71bc7fc648/12859_2016_1113_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11b7/5009822/f1e37d98584e/12859_2016_1113_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11b7/5009822/8b4ccdcb0cc3/12859_2016_1113_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11b7/5009822/0402e70a3d95/12859_2016_1113_Fig4_HTML.jpg

相似文献

1
Assessment of transfer methods for comparative genomics of regulatory networks in bacteria.细菌调控网络比较基因组学转移方法的评估
BMC Bioinformatics. 2016 Aug 31;17 Suppl 8(Suppl 8):277. doi: 10.1186/s12859-016-1113-7.
2
An SOS Regulon under Control of a Noncanonical LexA-Binding Motif in the Betaproteobacteria.β-变形菌纲中受非典型LexA结合基序调控的SOS调节子。
J Bacteriol. 2015 Aug;197(16):2622-30. doi: 10.1128/JB.00035-15. Epub 2015 May 18.
3
Comparative genomic reconstruction of transcriptional networks controlling central metabolism in the Shewanella genus.比较基因组重建控制希瓦氏菌属中心代谢的转录网络。
BMC Genomics. 2011 Jun 15;12 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2164-12-S1-S3.
4
RegPrecise 3.0--a resource for genome-scale exploration of transcriptional regulation in bacteria.RegPrecise 3.0——用于细菌转录调控全基因组规模探索的资源。
BMC Genomics. 2013 Nov 1;14:745. doi: 10.1186/1471-2164-14-745.
5
Inference of self-regulated transcriptional networks by comparative genomics.通过比较基因组学推断自我调节的转录网络。
Evol Bioinform Online. 2012;8:449-61. doi: 10.4137/EBO.S9205. Epub 2012 Aug 6.
6
The GlxR regulon of the amino acid producer Corynebacterium glutamicum: Detection of the corynebacterial core regulon and integration into the transcriptional regulatory network model.谷氨酸棒杆菌 GlxR 调控组:检测棒杆菌核心调控组并将其整合到转录调控网络模型中。
J Biotechnol. 2009 Sep 25;143(4):239-46. doi: 10.1016/j.jbiotec.2009.08.005. Epub 2009 Aug 7.
7
Dissimilatory metabolism of nitrogen oxides in bacteria: comparative reconstruction of transcriptional networks.细菌中氮氧化物的异化代谢:转录网络的比较重建
PLoS Comput Biol. 2005 Oct;1(5):e55. doi: 10.1371/journal.pcbi.0010055. Epub 2005 Oct 28.
8
Flexible comparative genomics of prokaryotic transcriptional regulatory networks.原核生物转录调控网络的灵活比较基因组学。
BMC Genomics. 2020 Dec 16;21(Suppl 5):466. doi: 10.1186/s12864-020-06838-x.
9
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.一种用于原核生物基因组中顺式调控基序识别的综合且适用的系统发育足迹分析框架。
BMC Genomics. 2016 Aug 9;17:578. doi: 10.1186/s12864-016-2982-x.
10
Comparative genomics of pyridoxal 5'-phosphate-dependent transcription factor regulons in .基于吡哆醛 5′-磷酸依赖型转录因子调控因子的比较基因组学研究。
Microb Genom. 2016 Jan 18;2(1):e000047. doi: 10.1099/mgen.0.000047. eCollection 2016 Jan.

引用本文的文献

1
Regulation beyond Transcription: Organizing Principles and Reconstruction of an Extended Regulatory Network Incorporating Regulations Mediated by Small RNA and Protein-Protein Interactions.转录之外的调控:整合由小RNA和蛋白质-蛋白质相互作用介导的调控的扩展调控网络的组织原则与重构
Microorganisms. 2021 Jun 28;9(7):1395. doi: 10.3390/microorganisms9071395.
2
Flexible comparative genomics of prokaryotic transcriptional regulatory networks.原核生物转录调控网络的灵活比较基因组学。
BMC Genomics. 2020 Dec 16;21(Suppl 5):466. doi: 10.1186/s12864-020-06838-x.
3
CoryneRegNet 7, the reference database and analysis platform for corynebacterial gene regulatory networks.

本文引用的文献

1
The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets.在不平衡数据集上评估二元分类器时,精确率-召回率曲线比ROC曲线更具信息性。
PLoS One. 2015 Mar 4;10(3):e0118432. doi: 10.1371/journal.pone.0118432. eCollection 2015.
2
Comparative genomics and evolution of regulons of the LacI-family transcription factors.LacI 家族转录因子调控子的比较基因组学和进化。
Front Microbiol. 2014 Jun 11;5:294. doi: 10.3389/fmicb.2014.00294. eCollection 2014.
3
SPIC: a novel similarity metric for comparing transcription factor binding site motifs based on information contents.
CoryneRegNet 7,棒状杆菌基因调控网络的参考数据库和分析平台。
Sci Data. 2020 May 11;7(1):142. doi: 10.1038/s41597-020-0484-9.
4
An analysis of acquired antimicrobial resistance genes in plasmids.质粒中获得性抗菌耐药基因的分析
AIMS Microbiol. 2020 Mar 16;6(1):75-91. doi: 10.3934/microbiol.2020005. eCollection 2020.
SPIC:一种基于信息内容比较转录因子结合位点基序的新型相似性度量。
BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S14. doi: 10.1186/1752-0509-7-S2-S14. Epub 2013 Dec 17.
4
CollecTF: a database of experimentally validated transcription factor-binding sites in Bacteria.CollecTF:细菌中实验验证的转录因子结合位点数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D156-60. doi: 10.1093/nar/gkt1123. Epub 2013 Nov 14.
5
Transcriptional regulation of the carbohydrate utilization network in Thermotoga maritima.海洋栖热菌碳水化合物利用网络的转录调控。
Front Microbiol. 2013 Aug 23;4:244. doi: 10.3389/fmicb.2013.00244. eCollection 2013.
6
Comprehensive computational analysis of bacterial CRP/FNR superfamily and its target motifs reveals stepwise evolution of transcriptional networks.综合计算分析细菌 CRP/FNR 超家族及其靶基序揭示了转录网络的逐步进化。
Genome Biol Evol. 2013;5(2):267-82. doi: 10.1093/gbe/evt004.
7
RegulonDB v8.0: omics data sets, evolutionary conservation, regulatory phrases, cross-validated gold standards and more.RegulonDB v8.0:组学数据集、进化保守性、调控短语、交叉验证的黄金标准等。
Nucleic Acids Res. 2013 Jan;41(Database issue):D203-13. doi: 10.1093/nar/gks1201. Epub 2012 Nov 29.
8
STRING v9.1: protein-protein interaction networks, with increased coverage and integration.STRING v9.1:蛋白质-蛋白质相互作用网络,具有更高的覆盖度和集成度。
Nucleic Acids Res. 2013 Jan;41(Database issue):D808-15. doi: 10.1093/nar/gks1094. Epub 2012 Nov 29.
9
A functional selection model explains evolutionary robustness despite plasticity in regulatory networks.功能选择模型解释了尽管调控网络具有可塑性,但仍能保持进化稳健性。
Mol Syst Biol. 2012;8:619. doi: 10.1038/msb.2012.50.
10
Inference of self-regulated transcriptional networks by comparative genomics.通过比较基因组学推断自我调节的转录网络。
Evol Bioinform Online. 2012;8:449-61. doi: 10.4137/EBO.S9205. Epub 2012 Aug 6.