• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SCRMshaw:昆虫基因组的监督式顺式调控模块预测

SCRMshaw: Supervised cis-regulatory module prediction for insect genomes.

作者信息

Asma Hasiba, Liu Luna, Halfon Marc S

机构信息

Departments of Biochemistry, University at Buffalo-State University of New York, Buffalo, NY, United States of America.

Biomedical Informatics, University at Buffalo-State University of New York, Buffalo, NY, United States of America.

出版信息

PLoS One. 2024 Dec 5;19(12):e0311752. doi: 10.1371/journal.pone.0311752. eCollection 2024.

DOI:10.1371/journal.pone.0311752
PMID:39637210
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11620701/
Abstract

As the number of sequenced insect genomes continues to grow, there is a pressing need for rapid and accurate annotation of their regulatory component. SCRMshaw is a computational tool designed to predict cis-regulatory modules ("enhancers") in the genomes of various insect species. A key advantage of SCRMshaw is its accessibility. It requires minimal resources-just a genome sequence and training data from known Drosophila regulatory sequences, which are readily available for download. Even users with modest computational skills can run SCRMshaw on a desktop computer for basic applications, although a high-performance computing cluster is recommended for optimal results. SCRMshaw can be tailored to specific needs: users can employ a single set of training data to predict enhancers associated with a particular gene expression pattern, or utilize multiple sets to provide a first-pass regulatory annotation for a newly-sequenced genome. This protocol provides an extensive update to the previously published SCRMshaw protocol and aligns with the methods used in a recent annotation of over 30 insect regulatory genomes. It includes the most recent modifications to the SCRMshaw protocol and details an end-to-end pipeline that begins with a sequenced genome and ends with a fully-annotated regulatory genome. Relevant scripts are available via GitHub, and a living protocol that will be updated as necessary is linked to this article at protocols.io.

摘要

随着已测序昆虫基因组数量的不断增加,迫切需要对其调控元件进行快速准确的注释。SCRMshaw是一种计算工具,旨在预测各种昆虫物种基因组中的顺式调控模块(“增强子”)。SCRMshaw的一个关键优势在于其易用性。它所需资源极少——只需要一个基因组序列和来自已知果蝇调控序列的训练数据,这些数据很容易下载获得。即使是计算技能一般的用户也可以在台式计算机上运行SCRMshaw进行基本应用,不过为了获得最佳结果,建议使用高性能计算集群。SCRMshaw可以根据特定需求进行定制:用户可以使用一组训练数据来预测与特定基因表达模式相关的增强子,或者使用多组数据为新测序的基因组提供初步的调控注释。本方案对先前发布的SCRMshaw方案进行了大量更新,并与最近对30多个昆虫调控基因组进行注释时所使用的方法保持一致。它包括对SCRMshaw方案的最新修改,并详细介绍了一个端到端的流程,该流程从测序的基因组开始,以完全注释的调控基因组结束。相关脚本可通过GitHub获取,并且一个会根据需要进行更新的实用方案在protocols.io上与本文链接。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b61c/11620701/bf35c8a35f61/pone.0311752.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b61c/11620701/980d2a6ef391/pone.0311752.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b61c/11620701/bf35c8a35f61/pone.0311752.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b61c/11620701/980d2a6ef391/pone.0311752.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b61c/11620701/bf35c8a35f61/pone.0311752.g002.jpg

相似文献

1
SCRMshaw: Supervised cis-regulatory module prediction for insect genomes.SCRMshaw:昆虫基因组的监督式顺式调控模块预测
PLoS One. 2024 Dec 5;19(12):e0311752. doi: 10.1371/journal.pone.0311752. eCollection 2024.
2
CRM Discovery Beyond Model Insects.超越模式昆虫的CRM发现。
Methods Mol Biol. 2019;1858:117-139. doi: 10.1007/978-1-4939-8775-7_10.
3
Computational enhancer prediction: evaluation and improvements.计算增强子预测:评估与改进。
BMC Bioinformatics. 2019 Apr 5;20(1):174. doi: 10.1186/s12859-019-2781-x.
4
Regulatory genome annotation of 33 insect species.33 种昆虫的调控基因组注释。
Elife. 2024 Oct 11;13:RP96738. doi: 10.7554/eLife.96738.
5
Annotating the Insect Regulatory Genome.注释昆虫调控基因组。
Insects. 2021 Jun 29;12(7):591. doi: 10.3390/insects12070591.
6
Identification of new Anopheles gambiae transcriptional enhancers using a cross-species prediction approach.利用跨物种预测方法鉴定新型冈比亚按蚊转录增强子。
Insect Mol Biol. 2021 Aug;30(4):410-419. doi: 10.1111/imb.12705. Epub 2021 Apr 27.
7
OGS2: genome re-annotation of the jewel wasp Nasonia vitripennis.OGS2:丽蝇蛹集金小蜂基因组的重新注释
BMC Genomics. 2016 Aug 25;17(1):678. doi: 10.1186/s12864-016-2886-9.
8
Beav: a bacterial genome and mobile element annotation pipeline.Beav:细菌基因组和移动元件注释流水线。
mSphere. 2024 Aug 28;9(8):e0020924. doi: 10.1128/msphere.00209-24. Epub 2024 Jul 22.
9
Quantitative analysis of the Drosophila segmentation regulatory network using pattern generating potentials.使用模式生成潜力对果蝇分割调控网络进行定量分析。
PLoS Biol. 2010 Aug 17;8(8):e1000456. doi: 10.1371/journal.pbio.1000456.
10
FastBill: An Improved Tool for Prediction of Cis-Regulatory Modules.FastBill:一种用于预测顺式调控模块的改进工具。
J Comput Biol. 2017 Mar;24(3):193-199. doi: 10.1089/cmb.2016.0108. Epub 2016 Oct 6.

本文引用的文献

1
Regulatory genome annotation of 33 insect species.33 种昆虫的调控基因组注释。
Elife. 2024 Oct 11;13:RP96738. doi: 10.7554/eLife.96738.
2
A comprehensive revisit of the machine-learning tools developed for the identification of enhancers in the human genome.全面回顾用于识别人类基因组增强子的机器学习工具。
Proteomics. 2023 Jul;23(13-14):e2200409. doi: 10.1002/pmic.202200409. Epub 2023 Jun 7.
3
A novel role for trithorax in the gene regulatory network for a rapidly evolving fruit fly pigmentation trait.三价X 染色体激活蛋白在快速进化的果蝇色素表型基因调控网络中的新作用。
PLoS Genet. 2023 Feb 16;19(2):e1010653. doi: 10.1371/journal.pgen.1010653. eCollection 2023 Feb.
4
Annotating the Insect Regulatory Genome.注释昆虫调控基因组。
Insects. 2021 Jun 29;12(7):591. doi: 10.3390/insects12070591.
5
Identification of new Anopheles gambiae transcriptional enhancers using a cross-species prediction approach.利用跨物种预测方法鉴定新型冈比亚按蚊转录增强子。
Insect Mol Biol. 2021 Aug;30(4):410-419. doi: 10.1111/imb.12705. Epub 2021 Apr 27.
6
Identification of genomic enhancers through spatial integration of single-cell transcriptomics and epigenomics.通过单细胞转录组学和表观基因组学的空间整合来鉴定基因组增强子。
Mol Syst Biol. 2020 May;16(5):e9438. doi: 10.15252/msb.20209438.
7
Towards a comprehensive catalogue of validated and target-linked human enhancers.迈向一个全面的已验证和与靶标相关的人类增强子目录。
Nat Rev Genet. 2020 May;21(5):292-310. doi: 10.1038/s41576-019-0209-0. Epub 2020 Jan 27.
8
Computational enhancer prediction: evaluation and improvements.计算增强子预测:评估与改进。
BMC Bioinformatics. 2019 Apr 5;20(1):174. doi: 10.1186/s12859-019-2781-x.
9
Studying Transcriptional Enhancers: The Founder Fallacy, Validation Creep, and Other Biases.研究转录增强子:奠基者谬误、验证蔓延和其他偏见。
Trends Genet. 2019 Feb;35(2):93-103. doi: 10.1016/j.tig.2018.11.004. Epub 2018 Dec 13.
10
CRM Discovery Beyond Model Insects.超越模式昆虫的CRM发现。
Methods Mol Biol. 2019;1858:117-139. doi: 10.1007/978-1-4939-8775-7_10.