Malik Girik, Agarwal Tanu, Raj Utkarsh, Sundararajan Vijayaraghava Seshadri, Bandapalli Obul Reddy, Suravajhala Prashanth
1Khoury College of Computer Sciences, Northeastern University, 360 Huntington Ave., Boston, MA02115, USA; 2Bioclues.org, Kukatpally, Hyderabad, 500072, India; 3Labrynthe Pvt. Ltd., New Delhi, India; 4NIIT University, NH8, Delhi- Jaipur Highway, District Alwar, Neemrana, Rajasthan 301705, India; 5Hopp Children's Cancer Center [KiTZ], Heidelberg, Germany; 6Division of Pediatric Neuro Oncology, German Cancer Research Center [DKFZ], German Cancer Consortium [DKTK], Heidelberg, Germany; 7Heidelberg University, Medical Faculty, Heidelberg, Germany; 8Department of Biotechnology and Bioinformatics, Birla Institute of Scientific Research, Statue Circle, Jaipur302021, RJ, India.
Curr Genomics. 2020 Nov;21(7):531-535. doi: 10.2174/1389202921999200611155418.
Hypothetical Proteins [HP] are the transcripts predicted to be expressed in an organism, but no evidence of it exists in gene banks. On the other hand, long non-coding RNAs [lncRNAs] are the transcripts that might be present in the 5' UTR or intergenic regions of the genes whose lengths are above 200 bases. With the known unknown [KU] regions in the genomes rapidly existing in gene banks, there is a need to understand the role of open reading frames in the context of annotation. In this commentary, we emphasize that HPs could indeed be the predecessors of lncRNAs.
假设蛋白[HP]是预测在生物体中表达但在基因库中尚无证据证明其存在的转录本。另一方面,长链非编码RNA[lncRNA]是可能存在于长度超过200个碱基的基因的5'非翻译区或基因间区域的转录本。随着基因库中基因组中已知的未知[KU]区域迅速增加,有必要了解开放阅读框在注释背景下的作用。在本评论中,我们强调假设蛋白确实可能是长链非编码RNA的前身。