• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

33 种昆虫的调控基因组注释。

Regulatory genome annotation of 33 insect species.

机构信息

Program in Genetics, Genomics, and Bioinformatics, University at Buffalo-State University of New York, Buffalo, United States.

Department of Biology, Miami University, Oxford, United States.

出版信息

Elife. 2024 Oct 11;13:RP96738. doi: 10.7554/eLife.96738.

DOI:10.7554/eLife.96738
PMID:39392676
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11469670/
Abstract

Annotation of newly sequenced genomes frequently includes genes, but rarely covers important non-coding genomic features such as the -regulatory modules-e.g., enhancers and silencers-that regulate gene expression. Here, we begin to remedy this situation by developing a workflow for rapid initial annotation of insect regulatory sequences, and provide a searchable database resource with enhancer predictions for 33 genomes. Using our previously developed SCRMshaw computational enhancer prediction method, we predict over 2.8 million regulatory sequences along with the tissues where they are expected to be active, in a set of insect species ranging over 360 million years of evolution. Extensive analysis and validation of the data provides several lines of evidence suggesting that we achieve a high true-positive rate for enhancer prediction. One, we show that our predictions target specific loci, rather than random genomic locations. Two, we predict enhancers in orthologous loci across a diverged set of species to a significantly higher degree than random expectation would allow. Three, we demonstrate that our predictions are highly enriched for regions of accessible chromatin. Four, we achieve a validation rate in excess of 70% using in vivo reporter gene assays. As we continue to annotate both new tissues and new species, our regulatory annotation resource will provide a rich source of data for the research community and will have utility for both small-scale (single gene, single species) and large-scale (many genes, many species) studies of gene regulation. In particular, the ability to search for functionally related regulatory elements in orthologous loci should greatly facilitate studies of enhancer evolution even among distantly related species.

摘要

注释新测序的基因组通常包括基因,但很少涵盖重要的非编码基因组特征,如调节基因表达的 -调控模块-例如,增强子和沉默子。在这里,我们通过开发一种快速初始注释昆虫调控序列的工作流程来开始弥补这种情况,并为 33 个基因组提供了一个可搜索的增强子预测数据库资源。使用我们之前开发的 SCRMshaw 计算增强子预测方法,我们预测了超过 280 万个调控序列,以及它们预计活跃的组织,这些序列涵盖了超过 3.6 亿年的进化历史的昆虫物种。对数据的广泛分析和验证提供了几条证据表明,我们实现了高的增强子预测真阳性率。首先,我们表明我们的预测针对特定的基因座,而不是随机的基因组位置。其次,我们在一组分化的物种中预测到了同源基因座中的增强子,其程度明显高于随机预期。第三,我们证明了我们的预测高度富集于可及染色质区域。第四,我们通过体内报告基因检测实现了超过 70%的验证率。随着我们继续注释新的组织和新的物种,我们的调控注释资源将为研究社区提供丰富的数据来源,并且对于小规模(单个基因,单个物种)和大规模(多个基因,多个物种)的基因调控研究都将具有实用性。特别是,在同源基因座中搜索功能相关的调控元件的能力应该极大地促进增强子进化的研究,即使在远缘物种之间也是如此。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/b9a950cb9124/elife-96738-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/fa7e75784f85/elife-96738-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/a5469ccd6e68/elife-96738-fig1-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/5f9bf093b7ac/elife-96738-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/c72299cc4862/elife-96738-fig2-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/16dc894921f2/elife-96738-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/085b6e01e800/elife-96738-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/78280a522e0d/elife-96738-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/12e00d9384bd/elife-96738-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/316310ad4524/elife-96738-fig6-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/263e12ae9fab/elife-96738-fig6-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/0da07ab6d411/elife-96738-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/675cdab2a4be/elife-96738-fig7-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/bdd538a60ed3/elife-96738-fig7-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/b9a950cb9124/elife-96738-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/fa7e75784f85/elife-96738-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/a5469ccd6e68/elife-96738-fig1-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/5f9bf093b7ac/elife-96738-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/c72299cc4862/elife-96738-fig2-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/16dc894921f2/elife-96738-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/085b6e01e800/elife-96738-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/78280a522e0d/elife-96738-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/12e00d9384bd/elife-96738-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/316310ad4524/elife-96738-fig6-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/263e12ae9fab/elife-96738-fig6-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/0da07ab6d411/elife-96738-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/675cdab2a4be/elife-96738-fig7-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/bdd538a60ed3/elife-96738-fig7-figsupp2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb41/11469670/b9a950cb9124/elife-96738-fig8.jpg

相似文献

1
Regulatory genome annotation of 33 insect species.33 种昆虫的调控基因组注释。
Elife. 2024 Oct 11;13:RP96738. doi: 10.7554/eLife.96738.
2
CRM Discovery Beyond Model Insects.超越模式昆虫的CRM发现。
Methods Mol Biol. 2019;1858:117-139. doi: 10.1007/978-1-4939-8775-7_10.
3
Evidence for deep regulatory similarities in early developmental programs across highly diverged insects.在高度分化的昆虫早期发育程序中存在深度调控相似性的证据。
Genome Biol Evol. 2014 Sep;6(9):2301-20. doi: 10.1093/gbe/evu184.
4
5
EnhancerAtlas 2.0: an updated resource with enhancer annotation in 586 tissue/cell types across nine species.EnhancerAtlas 2.0:一个更新的资源,包含了 9 个物种的 586 种组织/细胞类型中的增强子注释。
Nucleic Acids Res. 2020 Jan 8;48(D1):D58-D64. doi: 10.1093/nar/gkz980.
6
Computational schemes for the prediction and annotation of enhancers from epigenomic assays.基于表观基因组分析预测和注释增强子的计算方案。
Methods. 2015 Jan 15;72:86-94. doi: 10.1016/j.ymeth.2014.10.008. Epub 2014 Oct 15.
7
Insect Innate Immunity Database (IIID): an annotation tool for identifying immune genes in insect genomes.昆虫先天免疫数据库(IIID):一种用于鉴定昆虫基因组中免疫基因的注释工具。
PLoS One. 2012;7(9):e45125. doi: 10.1371/journal.pone.0045125. Epub 2012 Sep 12.
8
Opening up the blackbox: an interpretable deep neural network-based classifier for cell-type specific enhancer predictions.打开黑箱:一种基于可解释深度神经网络的细胞类型特异性增强子预测分类器。
BMC Syst Biol. 2016 Aug 1;10 Suppl 2(Suppl 2):54. doi: 10.1186/s12918-016-0302-3.
9
Computational enhancer prediction: evaluation and improvements.计算增强子预测:评估与改进。
BMC Bioinformatics. 2019 Apr 5;20(1):174. doi: 10.1186/s12859-019-2781-x.
10
Cis-regulatory complexity within a large non-coding region in the Drosophila genome.果蝇基因组内一个大的非编码区域中的顺式调控复杂性。
PLoS One. 2013 Apr 22;8(4):e60137. doi: 10.1371/journal.pone.0060137. Print 2013.

引用本文的文献

1
SCRMshaw: Supervised cis-regulatory module prediction for insect genomes.SCRMshaw:昆虫基因组的监督式顺式调控模块预测
PLoS One. 2024 Dec 5;19(12):e0311752. doi: 10.1371/journal.pone.0311752. eCollection 2024.
2
A new suite of reporter vectors and a novel landing site survey system to study cis-regulatory elements in diverse insect species.一套新的报告基因载体和一种新的着陆位点调查系统,用于研究不同昆虫物种中的顺式调控元件。
Sci Rep. 2024 May 2;14(1):10078. doi: 10.1038/s41598-024-60432-9.

本文引用的文献

1
A new suite of reporter vectors and a novel landing site survey system to study cis-regulatory elements in diverse insect species.一套新的报告基因载体和一种新的着陆位点调查系统,用于研究不同昆虫物种中的顺式调控元件。
Sci Rep. 2024 May 2;14(1):10078. doi: 10.1038/s41598-024-60432-9.
2
A novel role for trithorax in the gene regulatory network for a rapidly evolving fruit fly pigmentation trait.三价X 染色体激活蛋白在快速进化的果蝇色素表型基因调控网络中的新作用。
PLoS Genet. 2023 Feb 16;19(2):e1010653. doi: 10.1371/journal.pgen.1010653. eCollection 2023 Feb.
3
Reporter gene assays and chromatin-level assays define substantially non-overlapping sets of enhancer sequences.
报告基因检测和染色质水平检测定义了大量非重叠的增强子序列集。
BMC Genomics. 2023 Jan 13;24(1):17. doi: 10.1186/s12864-023-09123-9.
4
OrthoDB v11: annotation of orthologs in the widest sampling of organismal diversity.OrthoDB v11:在最广泛的生物多样性样本中注释直系同源物。
Nucleic Acids Res. 2023 Jan 6;51(D1):D445-D451. doi: 10.1093/nar/gkac998.
5
Deep cis-regulatory homology of the butterfly wing pattern ground plan.蝴蝶翅膀图案平面图的深顺式调控同源性。
Science. 2022 Oct 21;378(6617):304-308. doi: 10.1126/science.abi9407. Epub 2022 Oct 20.
6
REDfly: An Integrated Knowledgebase for Insect Regulatory Genomics.REDfly:昆虫调控基因组学的综合知识库。
Insects. 2022 Jul 11;13(7):618. doi: 10.3390/insects13070618.
7
Mixing genome annotation methods in a comparative analysis inflates the apparent number of lineage-specific genes.在比较分析中混合基因组注释方法会增加谱系特异性基因的表观数量。
Curr Biol. 2022 Jun 20;32(12):2632-2639.e2. doi: 10.1016/j.cub.2022.04.085. Epub 2022 May 18.
8
Pleiotropic Enhancers are Ubiquitous Regulatory Elements in the Human Genome.多效增强子是人类基因组中普遍存在的调控元件。
Genome Biol Evol. 2022 May 31;14(6). doi: 10.1093/gbe/evac071.
9
Comprehensive enhancer-target gene assignments improve gene set level interpretation of genome-wide regulatory data.全面的增强子-靶基因分配可改善全基因组调控数据的基因集水平解释。
Genome Biol. 2022 Apr 26;23(1):105. doi: 10.1186/s13059-022-02668-0.
10
The importance of considering regulatory domains in genome-wide analyses - the nearest gene is often wrong!考虑调控域在全基因组分析中的重要性——最接近的基因往往是错误的!
Biol Open. 2022 Apr 15;11(4). doi: 10.1242/bio.059091. Epub 2022 Apr 4.