Suppr超能文献

RNA-NRD:用于基准测试和功能分析的非冗余RNA结构数据集。

RNA-NRD: a non-redundant RNA structural dataset for benchmarking and functional analysis.

作者信息

Khan Nabila Shahnaz, Rahaman Md Mahfuzur, Islam Shahidul, Zhang Shaojie

机构信息

Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA.

School of Computing and Design, California State University, Monterey Bay, Seaside, CA 93955, USA.

出版信息

NAR Genom Bioinform. 2023 Apr 26;5(2):lqad040. doi: 10.1093/nargab/lqad040. eCollection 2023 Jun.

Abstract

The significance of RNA functions and their role in evolution and disease control have remarkably increased the research scope in the field of RNA science. Though the availability of RNA structure data in PBD has been growing tremendously, maintaining their quality and integrity has become the greater challenge. Since the data available in PDB are results of different independent research, they might contain redundancy. As a result, there remains a possibility of data bias for both protein and RNA chains. Quite a few studies have been conducted to remove the redundancy of protein structures by introducing high-quality representatives. However, the amount of research done to remove the redundancy of RNA structures is still very low. To remove RNA chain redundancy in PDB, we have introduced RNA-NRD, a non-redundant dataset of RNA chains based on sequence and 3D structural similarity. We compared RNA-NRD with the existing non-redundant RNA structure dataset RS-RNA and showed that it has better-formed clusters of redundant RNA chains with lower average RMSD and higher average PSI, thus improving the overall quality of the dataset.

摘要

RNA功能的重要性及其在进化和疾病控制中的作用显著扩大了RNA科学领域的研究范围。尽管PBD中RNA结构数据的可用性一直在大幅增长,但维持其质量和完整性已成为更大的挑战。由于PDB中的数据是不同独立研究的结果,它们可能包含冗余。因此,蛋白质和RNA链都存在数据偏差的可能性。已经进行了不少研究,通过引入高质量的代表性结构来消除蛋白质结构的冗余。然而,为消除RNA结构冗余所做的研究数量仍然非常少。为了消除PDB中的RNA链冗余,我们引入了RNA-NRD,这是一个基于序列和三维结构相似性的RNA链非冗余数据集。我们将RNA-NRD与现有的非冗余RNA结构数据集RS-RNA进行了比较,结果表明它具有更好的冗余RNA链聚类,平均RMSD更低,平均PSI更高,从而提高了数据集的整体质量。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验