Chen Wei, Feng Pengmian, Ding Hui, Lin Hao
Department of Physics, School of Sciences, and Center for Genomics and Computational Biology, North China University of Science and Technology, Tangshan, 063000, China.
School of Public Health, North China University of Science and Technology, Tangshan, 063000, China.
Sci Rep. 2016 Oct 11;6:35123. doi: 10.1038/srep35123.
Adenosine-to-inosine (A-to-I) editing is the most prevalent type of RNA editing and is involved in many biological processes. Accurate identification of A-to-I editing sites is invaluable for better understanding its biological functions. Because experimental methods have limitations, the present study proposes a support vector machine-based model, called PAI, to identify A-to-I editing sites in D. melanogaster. In this model, RNA sequences are encoded using "pseudo dinucleotide composition", into which six RNA physicochemical properties are incorporated. PAI achieves promising performance in the jackknife test and in an independent dataset test, indicating that it has high potential to become a useful tool for identifying A-to-I editing sites. For the convenience of experimental scientists, a web server was constructed for PAI and is freely accessible at http://lin.uestc.edu.cn/server/PAI.
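To make the encoding step more concrete, the following is a minimal sketch of a pseudo dinucleotide composition in its commonly used general form: 16 normalized dinucleotide frequencies followed by a small number of sequence-order correlation factors computed from per-dinucleotide property values. The function name pse_dnc, the parameters lam and w and their defaults, and the toy property table are assumptions made for illustration. They are not the parameters or the six physicochemical property values used by PAI, which the abstract does not list.

```python
from itertools import product

# The 16 RNA dinucleotides in a fixed order: AA, AC, AG, AU, CA, ...
DINUCLEOTIDES = ["".join(p) for p in product("ACGU", repeat=2)]


def pse_dnc(seq, props, lam=2, w=0.5):
    """Encode an RNA sequence as a (16 + lam)-dimensional vector.

    props maps each dinucleotide to a sequence of property values
    (ideally standardized beforehand). lam is the number of correlation
    tiers and w is their weight; both defaults are illustrative only.
    """
    seq = seq.upper().replace("T", "U")
    dinucs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    n = len(dinucs)
    if lam >= n:
        raise ValueError("lam must be smaller than the number of dinucleotides")

    # Normalized occurrence frequency of each dinucleotide.
    freqs = [dinucs.count(d) / n for d in DINUCLEOTIDES]

    # Tier-j correlation factor: mean squared property difference between
    # dinucleotides that are j positions apart, averaged over properties.
    thetas = []
    for j in range(1, lam + 1):
        total = 0.0
        for i in range(n - j):
            a, b = props[dinucs[i]], props[dinucs[i + j]]
            total += sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)
        thetas.append(total / (n - j))

    denom = sum(freqs) + w * sum(thetas)
    return [f / denom for f in freqs] + [w * t / denom for t in thetas]


# Made-up property table (two toy values per dinucleotide), NOT real data.
toy_props = {
    d: [float(d.count("G") + d.count("C")), float(d.count("A"))]
    for d in DINUCLEOTIDES
}
vec = pse_dnc("ACGUACGUAC", toy_props, lam=2, w=0.5)
```

A vector of this kind could then be passed to a support vector machine classifier; the kernel and hyperparameters are not given in the abstract and would need to be tuned separately.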