Zeng Zhiqiang, Shi Hua, Wu Yun, Hong Zhiling
College of Computer and Information Engineering, Xiamen University of Technology, Xiamen 361024, China.
Software School, Xiamen University, Xiamen 361005, China.
Comput Math Methods Med. 2015;2015:674296. doi: 10.1155/2015/674296. Epub 2015 Oct 7.
Informatics methods, such as text mining and natural language processing, are always involved in bioinformatics research. In this study, we discuss text mining and natural language processing methods in bioinformatics from two perspectives. First, we aim to search for knowledge on biology, retrieve references using text mining methods, and reconstruct databases. For example, protein-protein interactions and gene-disease relationship can be mined from PubMed. Then, we analyze the applications of text mining and natural language processing techniques in bioinformatics, including predicting protein structure and function, detecting noncoding RNA. Finally, numerous methods and applications, as well as their contributions to bioinformatics, are discussed for future use by text mining and natural language processing researchers.
信息学方法,如文本挖掘和自然语言处理,一直都参与到生物信息学研究中。在本研究中,我们从两个角度讨论生物信息学中的文本挖掘和自然语言处理方法。首先,我们旨在搜索生物学知识,使用文本挖掘方法检索参考文献,并重建数据库。例如,可以从PubMed中挖掘蛋白质-蛋白质相互作用和基因-疾病关系。然后,我们分析文本挖掘和自然语言处理技术在生物信息学中的应用,包括预测蛋白质结构和功能、检测非编码RNA。最后,讨论了众多方法和应用及其对生物信息学的贡献,以供文本挖掘和自然语言处理研究人员未来使用。