Thakur Anamika, Kumar Manoj
Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh, India.
Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India.
Methods Mol Biol. 2025;2883:299-323. doi: 10.1007/978-1-0716-4290-0_13.
Long non-coding RNAs (lncRNAs) are a type of non-coding RNA molecules exceeding 200 nucleotides in length and that do not encode proteins. The dysregulated expression of lncRNAs has been identified in various diseases, holding therapeutic significance. Over the past decade, numerous computational resources have been published in the field of lncRNA. In this chapter, we have provided a comprehensive review of the databases as well as predictive tools, that is, lncRNA databases, machine learning based algorithms, and tools predicting lncRNAs utilizing different techniques. The chapter will focus on the importance of lncRNA resources developed for different organisms specifically for humans, mouse, plants, and other model organisms. We have enlisted important databases, primarily focusing on comprehensive information related to lncRNA registries, associations with diseases, differential expression, lncRNA transcriptome, target regulations, and all-in-one resources. Further, we have also included the updated version of lncRNA resources. Additionally, computational identification of lncRNAs using algorithms like Deep learning, Support Vector Machine (SVM), and Random Forest (RF) was also discussed. In conclusion, this comprehensive overview concludes by summarizing vital in silico resources, empowering biologists to choose the most suitable tools for their lncRNA research endeavors. This chapter serves as a valuable guide, emphasizing the significance of computational approaches in understanding lncRNAs and their implications in various biological contexts.
长链非编码RNA(lncRNAs)是一类长度超过200个核苷酸且不编码蛋白质的非编码RNA分子。lncRNAs的表达失调已在多种疾病中被发现,具有治疗意义。在过去十年中,lncRNA领域已发表了大量的计算资源。在本章中,我们对数据库以及预测工具进行了全面综述,即lncRNA数据库、基于机器学习的算法以及利用不同技术预测lncRNAs的工具。本章将重点关注为不同生物体(特别是人类、小鼠、植物和其他模式生物)开发的lncRNA资源的重要性。我们列出了重要的数据库,主要侧重于与lncRNA登记、与疾病的关联、差异表达、lncRNA转录组、靶标调控以及一体化资源相关的全面信息。此外,我们还纳入了lncRNA资源的更新版本。此外,还讨论了使用深度学习、支持向量机(SVM)和随机森林(RF)等算法对lncRNAs进行计算识别。总之,本全面综述通过总结重要的计算机资源得出结论,使生物学家能够为其lncRNA研究工作选择最合适的工具。本章是一份有价值的指南,强调了计算方法在理解lncRNAs及其在各种生物学背景中的意义方面的重要性。