• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计算增强子预测:评估与改进。

Computational enhancer prediction: evaluation and improvements.

机构信息

Program in Genetics, Genomics, and Bioinformatics, University at Buffalo-State University of New York, 701 Ellicott St, Buffalo, NY, 14203, USA.

Department of Biochemistry, University at Buffalo-State University of New York, 701 Ellicott St, Buffalo, NY, 14203, USA.

出版信息

BMC Bioinformatics. 2019 Apr 5;20(1):174. doi: 10.1186/s12859-019-2781-x.

DOI:10.1186/s12859-019-2781-x
PMID:30953451
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6451241/
Abstract

BACKGROUND

Identifying transcriptional enhancers and other cis-regulatory modules (CRMs) is an important goal of post-sequencing genome annotation. Computational approaches provide a useful complement to empirical methods for CRM discovery, but it is critical that we develop effective means to evaluate their performance in terms of estimating their sensitivity and specificity.

RESULTS

We introduce here pCRMeval, a pipeline for in silico evaluation of any enhancer prediction tools that are flexible enough to be applied to the Drosophila melanogaster genome. pCRMeval compares the result of predictions with the extensive existing knowledge of experimentally-validated Drosophila CRMs in order to estimate the precision and relative sensitivity of the prediction method. In the case of supervised prediction methods-when training data composed of validated CRMs are used-pCRMeval can also assess the sensitivity of specific training sets. We demonstrate the utility of pCRMeval through evaluation of our SCRMshaw CRM prediction method and training data. By measuring the impact of different parameters on SCRMshaw performance, as assessed by pCRMeval, we develop a more robust version of SCRMshaw, SCRMshaw_HD, that improves the number of predictions while maintaining sensitivity and specificity. Our analysis also demonstrates that SCRMshaw_HD, when applied to increasingly less well-assembled genomes, maintains its strong predictive power with only a minor drop-off in performance.

CONCLUSION

Our pCRMeval pipeline provides a general framework for evaluation that can be applied to any CRM prediction method, particularly a supervised method. While we make use of it here primarily to test and improve a particular method for CRM prediction, SCRMshaw, pCRMeval should provide a valuable platform to the research community not only for evaluating individual methods, but also for comparing between competing methods.

摘要

背景

鉴定转录增强子和其他顺式调控模块(CRMs)是测序后基因组注释的一个重要目标。计算方法为 CRM 发现提供了一种有用的补充方法,但关键是我们要开发出有效的方法来评估它们在估计敏感性和特异性方面的性能。

结果

我们在这里介绍 pCRMeval,这是一种用于评估任何增强子预测工具的虚拟评估管道,这些工具具有足够的灵活性,可以应用于黑腹果蝇基因组。pCRMeval 将预测的结果与广泛存在的实验验证的果蝇 CRM 现有知识进行比较,以估计预测方法的精度和相对敏感性。在有监督的预测方法中-当使用由验证 CRM 组成的训练数据时-pCRMeval 还可以评估特定训练集的敏感性。我们通过评估我们的 SCRMshaw CRM 预测方法和训练数据来展示 pCRMeval 的实用性。通过测量不同参数对 pCRMeval 评估的 SCRMshaw 性能的影响,我们开发了一个更稳健的 SCRMshaw 版本,即 SCRMshaw_HD,它提高了预测数量,同时保持了敏感性和特异性。我们的分析还表明,当应用于组装质量越来越差的基因组时,SCRMshaw_HD 保持其强大的预测能力,仅略有性能下降。

结论

我们的 pCRMeval 管道提供了一种通用的评估框架,可应用于任何 CRM 预测方法,特别是有监督的方法。虽然我们在这里主要利用它来测试和改进 CRM 预测的特定方法,但 pCRMeval 应该为研究社区提供一个有价值的平台,不仅可以评估单个方法,还可以比较竞争方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/6b4847d84c45/12859_2019_2781_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/056a2c9adf70/12859_2019_2781_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/61f9b29f081d/12859_2019_2781_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/fe3c3de58516/12859_2019_2781_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/312b9279f00b/12859_2019_2781_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/ebee9b5f1fe6/12859_2019_2781_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/6b4847d84c45/12859_2019_2781_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/056a2c9adf70/12859_2019_2781_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/61f9b29f081d/12859_2019_2781_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/fe3c3de58516/12859_2019_2781_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/312b9279f00b/12859_2019_2781_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/ebee9b5f1fe6/12859_2019_2781_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5751/6451241/6b4847d84c45/12859_2019_2781_Fig6_HTML.jpg

相似文献

1
Computational enhancer prediction: evaluation and improvements.计算增强子预测:评估与改进。
BMC Bioinformatics. 2019 Apr 5;20(1):174. doi: 10.1186/s12859-019-2781-x.
2
CRM Discovery Beyond Model Insects.超越模式昆虫的CRM发现。
Methods Mol Biol. 2019;1858:117-139. doi: 10.1007/978-1-4939-8775-7_10.
3
Improved accuracy of supervised CRM discovery with interpolated Markov models and cross-species comparison.利用插值马尔可夫模型和跨物种比较提高监督式 CRM 发现的准确性。
Nucleic Acids Res. 2011 Dec;39(22):9463-72. doi: 10.1093/nar/gkr621. Epub 2011 Aug 5.
4
A novel method for predicting activity of cis-regulatory modules, based on a diverse training set.一种基于多样化训练集预测顺式调控模块活性的新方法。
Bioinformatics. 2017 Jan 1;33(1):1-7. doi: 10.1093/bioinformatics/btw552. Epub 2016 Sep 7.
5
Regulatory genome annotation of 33 insect species.33 种昆虫的调控基因组注释。
Elife. 2024 Oct 11;13:RP96738. doi: 10.7554/eLife.96738.
6
De novo prediction of cis-regulatory elements and modules through integrative analysis of a large number of ChIP datasets.通过对大量染色质免疫沉淀数据集进行综合分析,从头预测顺式调控元件和模块。
BMC Genomics. 2014 Dec 2;15:1047. doi: 10.1186/1471-2164-15-1047.
7
Identification of cis-regulatory modules encoding temporal dynamics during development.鉴定在发育过程中编码时间动态的顺式调控模块。
BMC Genomics. 2014 Jun 27;15(1):534. doi: 10.1186/1471-2164-15-534.
8
Motif-blind, genome-wide discovery of cis-regulatory modules in Drosophila and mouse.在果蝇和小鼠中进行的全基因组范围内顺式调控模块的基序盲发现。
Dev Cell. 2009 Oct;17(4):568-79. doi: 10.1016/j.devcel.2009.09.002.
9
Predictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks.通过整合转录网络推断,预测果蝇中的调控模型。
Genome Res. 2012 Jul;22(7):1334-49. doi: 10.1101/gr.127191.111. Epub 2012 Mar 28.
10
Molecular dissection of cis-regulatory modules at the Drosophila bithorax complex reveals critical transcription factor signature motifs.在果蝇同源异型盒复合体中对顺式调控模块进行分子剖析,揭示了关键转录因子特征基序。
Dev Biol. 2011 Nov 15;359(2):290-302. doi: 10.1016/j.ydbio.2011.07.028. Epub 2011 Jul 28.

引用本文的文献

1
SCRMshaw: Supervised cis-regulatory module prediction for insect genomes.SCRMshaw:昆虫基因组的监督式顺式调控模块预测
PLoS One. 2024 Dec 5;19(12):e0311752. doi: 10.1371/journal.pone.0311752. eCollection 2024.
2
Regulatory genome annotation of 33 insect species.33 种昆虫的调控基因组注释。
Elife. 2024 Oct 11;13:RP96738. doi: 10.7554/eLife.96738.
3
Mechanisms of transcriptional regulation in revealed by allele-specific expression.通过等位基因特异性表达揭示的转录调控机制。

本文引用的文献

1
Challenges and recommendations to improve the installability and archival stability of omics computational tools.提高组学计算工具可安装性和档案稳定性的挑战和建议。
PLoS Biol. 2019 Jun 20;17(6):e3000333. doi: 10.1371/journal.pbio.3000333. eCollection 2019 Jun.
2
Exploiting regulatory heterogeneity to systematically identify enhancers with high accuracy.利用调控异质性系统地识别具有高精度的增强子。
Proc Natl Acad Sci U S A. 2019 Jan 15;116(3):900-908. doi: 10.1073/pnas.1808833115. Epub 2018 Dec 31.
3
CRM Discovery Beyond Model Insects.
Proc Biol Sci. 2024 Sep;291(2031):20241142. doi: 10.1098/rspb.2024.1142. Epub 2024 Sep 18.
4
Validated Negative Regions (VNRs) in the VISTA Database might be Truncated Forms of Bona Fide Enhancers.VISTA数据库中的验证阴性区域(VNRs)可能是真正增强子的截短形式。
Adv Genet (Hoboken). 2024 May 16;5(2):2300209. doi: 10.1002/ggn2.202300209. eCollection 2024 Jun.
5
Problems with Paralogs: The Promise and Challenges of Gene Duplicates in Evo-Devo Research.基因复制带来的问题:演化发育研究中基因复制品的前景与挑战
Integr Comp Biol. 2024 Sep 17;64(2):556-564. doi: 10.1093/icb/icae009.
6
Mechanisms of transcriptional regulation in revealed by allele specific expression.通过等位基因特异性表达揭示的转录调控机制。 (你提供的原文似乎不完整,“in”后面缺少具体内容)
bioRxiv. 2023 Dec 15:2023.11.22.568226. doi: 10.1101/2023.11.22.568226.
7
Prediction accuracy of regulatory elements from sequence varies by functional sequencing technique.从序列预测调控元件的准确性因功能测序技术而异。
Front Cell Infect Microbiol. 2023 Aug 2;13:1182567. doi: 10.3389/fcimb.2023.1182567. eCollection 2023.
8
A novel role for trithorax in the gene regulatory network for a rapidly evolving fruit fly pigmentation trait.三价X 染色体激活蛋白在快速进化的果蝇色素表型基因调控网络中的新作用。
PLoS Genet. 2023 Feb 16;19(2):e1010653. doi: 10.1371/journal.pgen.1010653. eCollection 2023 Feb.
9
REDfly: An Integrated Knowledgebase for Insect Regulatory Genomics.REDfly:昆虫调控基因组学的综合知识库。
Insects. 2022 Jul 11;13(7):618. doi: 10.3390/insects13070618.
10
Comprehensive Genomic Discovery of Non-Coding Transcriptional Enhancers in the African Malaria Vector .非洲疟疾媒介中非编码转录增强子的综合基因组发现
Front Genet. 2022 Jan 10;12:785934. doi: 10.3389/fgene.2021.785934. eCollection 2021.
超越模式昆虫的CRM发现。
Methods Mol Biol. 2019;1858:117-139. doi: 10.1007/978-1-4939-8775-7_10.
4
REDfly: the transcriptional regulatory element database for Drosophila.REDfly:果蝇转录调控元件数据库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D828-D834. doi: 10.1093/nar/gky957.
5
Enhancer identification and activity evaluation in the red flour beetle, .鉴定红麴虫中的增强子并评估其活性。
Development. 2018 Apr 5;145(7):dev160663. doi: 10.1242/dev.160663.
6
Redeployment of a conserved gene regulatory network during Aedes aegypti development.埃及伊蚊发育过程中保守基因调控网络的重新部署。
Dev Biol. 2016 Aug 15;416(2):402-13. doi: 10.1016/j.ydbio.2016.06.031. Epub 2016 Jun 21.
7
Progress and challenges in bioinformatics approaches for enhancer identification.增强子识别的生物信息学方法的进展与挑战
Brief Bioinform. 2016 Nov;17(6):967-979. doi: 10.1093/bib/bbv101. Epub 2015 Dec 3.
8
Identifying transcriptional cis-regulatory modules in animal genomes.识别动物基因组中的转录顺式调控模块。
Wiley Interdiscip Rev Dev Biol. 2015 Mar-Apr;4(2):59-84. doi: 10.1002/wdev.168. Epub 2014 Dec 29.
9
Evidence for deep regulatory similarities in early developmental programs across highly diverged insects.在高度分化的昆虫早期发育程序中存在深度调控相似性的证据。
Genome Biol Evol. 2014 Sep;6(9):2301-20. doi: 10.1093/gbe/evu184.
10
Improved accuracy of supervised CRM discovery with interpolated Markov models and cross-species comparison.利用插值马尔可夫模型和跨物种比较提高监督式 CRM 发现的准确性。
Nucleic Acids Res. 2011 Dec;39(22):9463-72. doi: 10.1093/nar/gkr621. Epub 2011 Aug 5.