作为长链非编码RNA前身的假设性蛋白质。

Hypothetical Proteins as Predecessors of Long Non-coding RNAs.

作者信息

Malik Girik, Agarwal Tanu, Raj Utkarsh, Sundararajan Vijayaraghava Seshadri, Bandapalli Obul Reddy, Suravajhala Prashanth

机构信息

1Khoury College of Computer Sciences, Northeastern University, 360 Huntington Ave., Boston, MA02115, USA; 2Bioclues.org, Kukatpally, Hyderabad, 500072, India; 3Labrynthe Pvt. Ltd., New Delhi, India; 4NIIT University, NH8, Delhi- Jaipur Highway, District Alwar, Neemrana, Rajasthan 301705, India; 5Hopp Children's Cancer Center [KiTZ], Heidelberg, Germany; 6Division of Pediatric Neuro Oncology, German Cancer Research Center [DKFZ], German Cancer Consortium [DKTK], Heidelberg, Germany; 7Heidelberg University, Medical Faculty, Heidelberg, Germany; 8Department of Biotechnology and Bioinformatics, Birla Institute of Scientific Research, Statue Circle, Jaipur302021, RJ, India.

出版信息

Curr Genomics. 2020 Nov;21(7):531-535. doi: 10.2174/1389202921999200611155418.

DOI:10.2174/1389202921999200611155418

PMID:33214769

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7604745/

Abstract

Hypothetical Proteins [HP] are the transcripts predicted to be expressed in an organism, but no evidence of it exists in gene banks. On the other hand, long non-coding RNAs [lncRNAs] are the transcripts that might be present in the 5' UTR or intergenic regions of the genes whose lengths are above 200 bases. With the known unknown [KU] regions in the genomes rapidly existing in gene banks, there is a need to understand the role of open reading frames in the context of annotation. In this commentary, we emphasize that HPs could indeed be the predecessors of lncRNAs.

摘要

假设蛋白[HP]是预测在生物体中表达但在基因库中尚无证据证明其存在的转录本。另一方面，长链非编码RNA[lncRNA]是可能存在于长度超过200个碱基的基因的5'非翻译区或基因间区域的转录本。随着基因库中基因组中已知的未知[KU]区域迅速增加，有必要了解开放阅读框在注释背景下的作用。在本评论中，我们强调假设蛋白确实可能是长链非编码RNA的前身。