Erady Chaitanya, Boxall Adam, Puntambekar Shraddha, Suhas Jagannathan N, Chauhan Ruchi, Chong David, Meena Narendra, Kulkarni Apurv, Kasabe Bhagyashri, Prathivadi Bhayankaram Kethaki, Umrania Yagnesh, Andreani Adam, Nel Jean, Wayland Matthew T, Pina Cristina, Lilley Kathryn S, Prabakaran Sudhakaran
Department of Genetics, University of Cambridge, Downing Site, Cambridge, CB2 3EH, UK.
Department of Biology, Indian Institute of Science Education and Research, Pune, Maharashtra, 411008, India.
NPJ Genom Med. 2021 Jan 25;6(1):4. doi: 10.1038/s41525-020-00167-4.
Uncharacterized and unannotated open-reading frames, which we refer to as novel open reading frames (nORFs), may sometimes encode peptides that remain unexplored for novel therapeutic opportunities. To our knowledge, no systematic identification and characterization of transcripts encoding nORFs or their translation products in cancer, or in any other physiological process has been performed. We use our curated nORFs database (nORFs.org), together with RNA-Seq data from The Cancer Genome Atlas (TCGA) and Genotype-Expression (GTEx) consortiums, to identify transcripts containing nORFs that are expressed frequently in cancer or matched normal tissue across 22 cancer types. We show nORFs are subject to extensive dysregulation at the transcript level in cancer tissue and that a small subset of nORFs are associated with overall patient survival, suggesting that nORFs may have prognostic value. We also show that nORF products can form protein-like structures with post-translational modifications. Finally, we perform in silico screening for inhibitors against nORF-encoded proteins that are disrupted in stomach and esophageal cancer, showing that they can potentially be targeted by inhibitors. We hope this work will guide and motivate future studies that perform in-depth characterization of nORF functions in cancer and other diseases.
未表征和未注释的开放阅读框,我们将其称为新型开放阅读框(nORFs),有时可能编码尚未探索新型治疗机会的肽段。据我们所知,尚未在癌症或任何其他生理过程中对编码nORFs的转录本或其翻译产物进行系统的鉴定和表征。我们使用我们精心策划的nORFs数据库(nORFs.org),以及来自癌症基因组图谱(TCGA)和基因型-表达(GTEx)联盟的RNA测序数据,来鉴定在22种癌症类型的癌组织或匹配的正常组织中频繁表达的包含nORFs的转录本。我们发现nORFs在癌组织的转录水平上受到广泛的失调调控,并且一小部分nORFs与患者的总体生存相关,这表明nORFs可能具有预后价值。我们还表明nORF产物可以形成具有翻译后修饰的蛋白质样结构。最后,我们对胃癌和食管癌中被破坏的nORF编码蛋白的抑制剂进行了计算机模拟筛选,结果表明它们可能是抑制剂的潜在靶点。我们希望这项工作将指导和推动未来对癌症和其他疾病中nORF功能进行深入表征的研究。