
OverGeneDB: a database of 5' end protein coding overlapping genes in human and mouse genomes.

Affiliations

Department of Integrative Genomics, Institute of Anthropology, Faculty of Biology, Adam Mickiewicz University in Poznan, 61-712 Poznan, Poland.

Department of Computational Biology, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 272-8562, Japan.

Publication information

Nucleic Acids Res. 2018 Jan 4;46(D1):D186-D193. doi: 10.1093/nar/gkx948.

DOI: 10.1093/nar/gkx948
PMID: 29069459
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5753363/

Abstract

Gene overlap serves various regulatory functions at the transcriptional and post-transcriptional levels. Most current studies focus on protein-coding genes overlapping with non-protein-coding counterparts, the so-called natural antisense transcripts. Considerably less is known about the role of gene overlap in the case of two protein-coding genes. Here, we provide OverGeneDB, a database of human and mouse 5' end protein-coding overlapping genes. The database contains 582 human and 113 mouse gene pairs that are transcribed using overlapping promoters in at least one analyzed library. Gene pairs were identified based on the analysis of the transcription start site (TSS) coordinates in 73 human and 10 mouse organs, tissues and cell lines. Besides TSS data, resources for 26 human lung adenocarcinoma cell lines also contain RNA-Seq and ChIP-Seq data for seven histone modifications and RNA Polymerase II activity. The collected data revealed that the overlap region is rarely conserved between the studied species and tissues. In ~50% of the overlapping genes, transcription started explicitly in the overlap regions. In the remaining half of overlapping genes, transcription was initiated both from overlapping and non-overlapping TSSs. OverGeneDB is accessible at http://overgenedb.amu.edu.pl.
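
The abstract says gene pairs were identified from TSS coordinates but does not give the exact criteria, so the following is only a minimal sketch of how a head-to-head (5' end) overlap test could look. It assumes each gene is reduced to one strand, one TSS, and one transcript end, and that a pair overlaps at the 5' end when the genes lie on opposite strands and each TSS falls inside the other gene's span. The names `Gene`, `five_prime_overlap`, and `classify_tss_usage`, the labels they return, and all coordinates are hypothetical and are not taken from OverGeneDB.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Gene:
    """Hypothetical minimal gene record (not the OverGeneDB schema)."""
    name: str
    strand: str  # "+" or "-"
    tss: int     # transcription start site coordinate
    tes: int     # transcript end coordinate


def five_prime_overlap(a: Gene, b: Gene) -> Optional[Tuple[int, int]]:
    """Return the (start, end) of a head-to-head overlap, or None.

    Assumed criterion: opposite strands, and each gene's TSS lies
    within the span of the other gene.
    """
    if {a.strand, b.strand} != {"+", "-"}:
        return None  # same strand, so not a head-to-head pair
    plus, minus = (a, b) if a.strand == "+" else (b, a)
    # The plus gene spans [plus.tss, plus.tes];
    # the minus gene spans [minus.tes, minus.tss].
    if minus.tes <= plus.tss <= minus.tss <= plus.tes:
        return (plus.tss, minus.tss)
    return None


def classify_tss_usage(tss_positions: List[int],
                       overlap: Tuple[int, int]) -> str:
    """Label a gene by where its observed TSSs fall relative to the overlap."""
    if not tss_positions:
        raise ValueError("at least one TSS position is required")
    start, end = overlap
    inside = [start <= pos <= end for pos in tss_positions]
    if all(inside):
        return "overlap_only"
    if any(inside):
        return "mixed"
    return "non_overlap_only"


if __name__ == "__main__":
    gene_a = Gene("GENE_A", "+", tss=100, tes=500)  # made-up coordinates
    gene_b = Gene("GENE_B", "-", tss=150, tes=10)   # made-up coordinates
    region = five_prime_overlap(gene_a, gene_b)
    print(region)                                   # (100, 150)
    print(classify_tss_usage([110, 300], region))   # mixed
```

Under these assumptions, the "overlap_only" and "mixed" labels correspond loosely to the two groups described in the abstract: genes whose transcription starts in the overlap region, and genes that use both overlapping and non-overlapping TSSs.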

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2642/5753363/252f6f7a46f1/gkx948fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2642/5753363/b8291365655b/gkx948fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2642/5753363/375a78897a4f/gkx948fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2642/5753363/46011dd24b9d/gkx948fig4.jpg
