串联基因的表达：真核生物中另一种基因调控机制。

Expression of conjoined genes: another mechanism for gene regulation in eukaryotes.

机构信息

MetaSystems Research Team, Computational Systems Biology Research Group, Advanced Computational Sciences Department, RIKEN Advanced Science Institute, Yokohama, Japan.

出版信息

PLoS One. 2010 Oct 12;5(10):e13284. doi: 10.1371/journal.pone.0013284.

DOI:10.1371/journal.pone.0013284

PMID:20967262

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2953495/

Abstract

From the ENCODE project, it is realized that almost every base of the entire human genome is transcribed. One class of transcripts resulting from this arises from the conjoined gene, which is formed by combining the exons of two or more distinct (parent) genes lying on the same strand of a chromosome. Only a very limited number of such genes are known, and the definition and terminologies used for them are highly variable in the public databases. In this work, we have computationally identified and manually curated 751 conjoined genes (CGs) in the human genome that are supported by at least one mRNA or EST sequence available in the NCBI database. 353 representative CGs, of which 291 (82%) could be confirmed, were subjected to experimental validation using RT-PCR and sequencing methods. We speculate that these genes are arising out of novel functional requirements and are not merely artifacts of transcription, since more than 70% of them are conserved in other vertebrate genomes. The unique splicing patterns exhibited by CGs reveal their possible roles in protein evolution or gene regulation. Novel CGs, for which no transcript is available, could be identified in 80% of randomly selected potential CG forming regions, indicating that their formation is a routine process. Formation of CGs is not only limited to human, as we have also identified 270 CGs in mouse and 227 in drosophila using our approach. Additionally, we propose a novel mechanism for the formation of CGs. Finally, we developed a database, ConjoinG, which contains detailed information about all the CGs (800 in total) identified in the human genome. In summary, our findings reveal new insights about the functionality of CGs in terms of another possible mechanism for gene regulation and genomic evolution and the mechanism leading to their formation.

摘要

从 ENCODE 项目中可以发现，人类基因组的几乎每个碱基都能被转录。由这些转录本产生的一类转录本来自于拼接基因，它是由位于同一染色体链上的两个或多个不同（亲本）基因的外显子组合而成的。目前已知的此类基因数量非常有限，而且在公共数据库中用于它们的定义和术语也存在很大的差异。在这项工作中，我们通过计算方法在人类基因组中识别并手动整理了 751 个拼接基因（CGs），这些基因至少有一条来自 NCBI 数据库中 mRNA 或 EST 序列的支持。我们选择了 353 个有代表性的 CGs 进行实验验证，其中 291 个（82%）可以通过 RT-PCR 和测序方法进行验证。我们推测这些基因是由于新的功能需求而产生的，而不仅仅是转录的产物，因为它们中的 70%以上在其他脊椎动物基因组中是保守的。CGs 所展示的独特拼接模式揭示了它们在蛋白质进化或基因调控中的可能作用。在 80%的随机选择的潜在 CG 形成区域中可以识别出没有转录本的新型 CGs，这表明它们的形成是一个常规过程。CGs 的形成不仅限于人类，我们还使用我们的方法在小鼠中鉴定了 270 个 CGs，在果蝇中鉴定了 227 个 CGs。此外，我们提出了一种 CG 形成的新机制。最后，我们开发了一个数据库 ConjoinG，其中包含了在人类基因组中识别出的所有 CGs（总共 800 个）的详细信息。总之，我们的研究结果揭示了 CGs 在基因调控和基因组进化以及导致它们形成的机制方面的新功能的新见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0d0/2953495/29199c4023c2/pone.0013284.g001.jpg

相似文献

Expression of conjoined genes: another mechanism for gene regulation in eukaryotes.串联基因的表达：真核生物中另一种基因调控机制。

PLoS One. 2010 Oct 12;5(10):e13284. doi: 10.1371/journal.pone.0013284.

Novel mechanism of conjoined gene formation in the human genome.人类基因组中连接基因形成的新机制。

Funct Integr Genomics. 2012 Mar;12(1):45-61. doi: 10.1007/s10142-011-0260-1. Epub 2012 Jan 10.

[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]

Yi Chuan Xue Bao. 2004 May;31(5):431-43.

CACG: a database for comparative analysis of conjoined genes.CACG：一个用于连接基因比较分析的数据库。

Genomics. 2012 Jul;100(1):14-7. doi: 10.1016/j.ygeno.2012.05.005. Epub 2012 May 11.

Discovery of novel human transcript variants by analysis of intronic single-block EST with polyadenylation site.通过分析具有多聚腺苷酸化位点的内含子单块 EST 发现新型人类转录变体。

BMC Genomics. 2009 Nov 12;10:518. doi: 10.1186/1471-2164-10-518.

Promoter-sharing by different genes in human genome--CPNE1 and RBM12 gene pair as an example.人类基因组中不同基因的启动子共享——以CPNE1和RBM12基因对为例。

BMC Genomics. 2008 Oct 3;9:456. doi: 10.1186/1471-2164-9-456.

Computational discovery of sense-antisense transcription in the human and mouse genomes.人类和小鼠基因组中正义-反义转录的计算发现。

Genome Biol. 2002 Aug 22;3(9):RESEARCH0044. doi: 10.1186/gb-2002-3-9-research0044.

A candidate chimeric mammalian mRNA transcript is derived from distinct chromosomes and is associated with nonconsensus splice junction motifs.一种候选的嵌合哺乳动物mRNA转录本源自不同的染色体，并与非标准剪接连接基序相关。

DNA Cell Biol. 2003 May;22(5):303-15. doi: 10.1089/104454903322216653.

Expressed sequence tags for the chicken genome from a normalized, ten-day-old white leghorn whole embryo cDNA library. 2. Comparative DNA sequence analysis of guinea fowl, quail, and turkey genomes.来自标准化的10日龄白来航鸡全胚胎cDNA文库的鸡基因组表达序列标签。2. 珍珠鸡、鹌鹑和火鸡基因组的比较DNA序列分析。

Poult Sci. 2001 Sep;80(9):1263-72. doi: 10.1093/ps/80.9.1263.

Conserved introns reveal novel transcripts in Drosophila melanogaster.保守内含子揭示了黑腹果蝇中的新转录本。

Genome Res. 2009 Jul;19(7):1289-300. doi: 10.1101/gr.090050.108. Epub 2009 May 20.

引用本文的文献

Accurate fusion transcript identification from long- and short-read isoform sequencing at bulk or single-cell resolution.在批量或单细胞分辨率下，从长读长和短读长异构体测序中准确鉴定融合转录本。

Genome Res. 2025 Apr 14;35(4):967-986. doi: 10.1101/gr.279200.124.

Long-read RNA sequencing atlas of human microglia isoforms elucidates disease-associated genetic regulation of splicing.人类小胶质细胞异构体的长读长RNA测序图谱阐明了与疾病相关的剪接基因调控。

Nat Genet. 2025 Mar;57(3):604-615. doi: 10.1038/s41588-025-02099-0. Epub 2025 Mar 3.

Oncogenic fusion protein interacts with polypyrimidine tract binding protein 1 to facilitate bladder cancer proliferation and metastasis by regulating mRNA stability.致癌融合蛋白与多嘧啶序列结合蛋白1相互作用，通过调节mRNA稳定性促进膀胱癌的增殖和转移。

MedComm (2020). 2024 Aug 14;5(9):e685. doi: 10.1002/mco2.685. eCollection 2024 Sep.

Long-read RNA-seq atlas of novel microglia isoforms elucidates disease-associated genetic regulation of splicing.新型小胶质细胞异构体的长读长RNA测序图谱阐明了疾病相关的剪接基因调控。

medRxiv. 2023 Dec 1:2023.12.01.23299073. doi: 10.1101/2023.12.01.23299073.

Longitudinal APOE4- and amyloid-dependent changes in the blood transcriptome in cognitively intact older adults.在认知正常的老年人中，载脂蛋白 E4 和淀粉样蛋白依赖性的血液转录组的纵向变化。

Alzheimers Res Ther. 2023 Jul 12;15(1):121. doi: 10.1186/s13195-023-01242-5.

Targeting pre-mRNA splicing in cancers: roles, inhibitors, and therapeutic opportunities.靶向癌症中的前体mRNA剪接：作用、抑制剂及治疗机遇

Front Oncol. 2023 Jun 5;13:1152087. doi: 10.3389/fonc.2023.1152087. eCollection 2023.

Definition of the transcriptional units of inherited retinal disease genes by meta-analysis of human retinal transcriptome data.通过对人类视网膜转录组数据的荟萃分析来定义遗传性视网膜疾病基因的转录单位。

BMC Genomics. 2023 Apr 18;24(1):206. doi: 10.1186/s12864-023-09300-w.

Skin Phototype and Disease: A Comprehensive Genetic Approach to Pigmentary Traits Pleiotropy Using PRS in the GCAT Cohort.皮肤表型与疾病：利用 GCAT 队列中的 PRS 对色素性状的多效性进行全面的遗传研究。

Genes (Basel). 2023 Jan 5;14(1):149. doi: 10.3390/genes14010149.

Transcriptomic response of bioengineered human cartilage to parabolic flight microgravity is sex-dependent.生物工程化人类软骨对抛物线飞行微重力的转录组反应存在性别依赖性。

NPJ Microgravity. 2023 Jan 19;9(1):5. doi: 10.1038/s41526-023-00255-6.

Long-Read Transcriptome of Equine Bronchoalveolar Cells.马支气管肺泡细胞的长读转录组。

Genes (Basel). 2022 Sep 25;13(10):1722. doi: 10.3390/genes13101722.

本文引用的文献

Revealing frequent alternative polyadenylation and widespread low-level transcription read-through of novel plant transcription terminators.揭示新型植物转录终止子的频繁交替多聚腺苷酸化和广泛的低水平转录通读。

Plant Biotechnol J. 2010 Sep;8(7):772-82. doi: 10.1111/j.1467-7652.2010.00504.x. Epub 2010 Mar 16.

RNA processing and its regulation: global insights into biological networks.RNA 加工及其调控：对生物网络的全局洞察。

Nat Rev Genet. 2010 Jan;11(1):75-87. doi: 10.1038/nrg2673.

Induced chromosomal proximity and gene fusions in prostate cancer.前列腺癌中的诱导染色体接近和基因融合

Science. 2009 Nov 27;326(5957):1230. doi: 10.1126/science.1178124. Epub 2009 Oct 29.

Chromosome crosstalk in three dimensions.三维染色体串扰

Nature. 2009 Sep 10;461(7261):212-7. doi: 10.1038/nature08453.

Implications of chimaeric non-co-linear transcripts.嵌合非共线性转录本的影响

Nature. 2009 Sep 10;461(7261):206-11. doi: 10.1038/nature08452.

Regulation of non-coding RNA networks in the nervous system--what's the REST of the story?非编码 RNA 网络在神经系统中的调控——REST 还有什么故事？

Neurosci Lett. 2009 Dec 4;466(2):73-80. doi: 10.1016/j.neulet.2009.07.093. Epub 2009 Aug 11.

Mutation-associated fusion cancer genes in solid tumors.实体瘤中与突变相关的融合癌基因

Mol Cancer Ther. 2009 Jun;8(6):1399-408. doi: 10.1158/1535-7163.MCT-09-0135. Epub 2009 Jun 9.

Nonsense-mediated mRNA decay (NMD) mechanisms.无义介导的mRNA降解（NMD）机制。

Nat Struct Mol Biol. 2009 Feb;16(2):107-13. doi: 10.1038/nsmb.1550.

Short homologous sequences are strongly associated with the generation of chimeric RNAs in eukaryotes.短同源序列与真核生物中嵌合RNA的产生密切相关。

J Mol Evol. 2009 Jan;68(1):56-65. doi: 10.1007/s00239-008-9187-0. Epub 2008 Dec 17.

Alternative polyadenylation: a twist on mRNA 3' end formation.可变聚腺苷酸化：mRNA 3' 末端形成的一种变体

ACS Chem Biol. 2008 Oct 17;3(10):609-17. doi: 10.1021/cb800138w. Epub 2008 Sep 26.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

串联基因的表达：真核生物中另一种基因调控机制。

Expression of conjoined genes: another mechanism for gene regulation in eukaryotes.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献