A novel human protein-coding locus identified using a targeted RNA enrichment technique.

Affiliations

School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.

State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, 361102, China.

Publication information

BMC Biol. 2024 Nov 26;22(1):273. doi: 10.1186/s12915-024-02069-8.

DOI: 10.1186/s12915-024-02069-8
PMID: 39593153
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11590353/
Abstract

BACKGROUND

Accurate and comprehensive genomic annotation, including the full list of protein-coding genes, is vital for understanding the molecular mechanisms of human biology. We have previously shown that the genome contains a multitude of yet hidden functional exons and transcripts, some of which might represent novel mRNAs. These results resonate with those from other groups and strongly argue that two decades after the completion of the first draft of the human genome sequence, the current annotation of human genes and transcripts remains far from being complete.

RESULTS

Using a targeted RNA enrichment technique, we showed that one of the novel functional exons previously discovered by us and currently annotated as part of a long non-coding RNA, is actually a part of a novel protein-coding gene, InSETG-4, which encodes a novel human protein with no known homologs or motifs. We found that InSETG-4 is induced by various DNA-damaging agents across multiple cell types and therefore might represent a novel component of DNA damage response. Despite its low abundance in bulk cell populations, InSETG-4 exhibited expression restricted to a small fraction of cells, as demonstrated by the amplification-based single-molecule fluorescence in situ hybridization (asmFISH) analysis.

CONCLUSIONS

This study argues that yet undiscovered human protein-coding genes exist and provides an example of how targeted RNA enrichment techniques can help to fill this major gap in our knowledge of the information encoded in the human genome.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6072/11590353/83f1a9197075/12915_2024_2069_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6072/11590353/b44ea41ee35d/12915_2024_2069_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6072/11590353/5a9d2eecf550/12915_2024_2069_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6072/11590353/32c67104f648/12915_2024_2069_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6072/11590353/1115bed74c2b/12915_2024_2069_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6072/11590353/6b6b0ac24f37/12915_2024_2069_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6072/11590353/0e2d174036e0/12915_2024_2069_Fig7_HTML.jpg

Similar articles

1
A novel human protein-coding locus identified using a targeted RNA enrichment technique.
BMC Biol. 2024 Nov 26;22(1):273. doi: 10.1186/s12915-024-02069-8.
2
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
3
De novo reconstruction of the Toxoplasma gondii transcriptome improves on the current genome annotation and reveals alternatively spliced transcripts and putative long non-coding RNAs.
BMC Genomics. 2012 Dec 12;13:696. doi: 10.1186/1471-2164-13-696.
4
Combining RT-PCR-seq and RNA-seq to catalog all genic elements encoded in the human genome.
Genome Res. 2012 Sep;22(9):1698-710. doi: 10.1101/gr.134478.111.
5
cDNA cloning, chromosome assignment, and genomic structure of a human gene encoding a novel member of the RBM family.
Cytogenet Cell Genet. 2001;92(3-4):225-30. doi: 10.1159/000056908.
6
Complexities of mammalian transcriptome revealed by targeted RNA enrichment techniques.
Trends Genet. 2023 Apr;39(4):320-333. doi: 10.1016/j.tig.2022.12.004. Epub 2023 Jan 19.
7
GENCODE: the reference human genome annotation for The ENCODE Project.
Genome Res. 2012 Sep;22(9):1760-74. doi: 10.1101/gr.135350.111.
8
Prominent use of distal 5' transcription start sites and discovery of a large number of additional exons in ENCODE regions.
Genome Res. 2007 Jun;17(6):746-59. doi: 10.1101/gr.5660607.
9
Discovery of high-confidence human protein-coding genes and exons by whole-genome PhyloCSF helps elucidate 118 GWAS loci.
Genome Res. 2019 Dec;29(12):2073-2087. doi: 10.1101/gr.246462.118. Epub 2019 Sep 19.
10
Transcribed dark matter: meaning or myth?
Hum Mol Genet. 2010 Oct 15;19(R2):R162-8. doi: 10.1093/hmg/ddq362. Epub 2010 Aug 25.

Cited by

1
RACE-Nano-Seq: Profiling Transcriptome Diversity of a Genomic Locus.
Bio Protoc. 2025 Jul 5;15(13):e5374. doi: 10.21769/BioProtoc.5374.