Department of Computer and Information Science, University of Macau, Taipa, Macau, China.
Department of Media integration technology center, Zhejiang Radio & TV Group, Hangzhou, People's Republic of China.
J Med Syst. 2018 Jun 28;42(8):139. doi: 10.1007/s10916-018-1003-9.
The fields of medicine science and health informatics have made great progress recently and have led to in-depth analytics that is demanded by generation, collection and accumulation of massive data. Meanwhile, we are entering a new period where novel technologies are starting to analyze and explore knowledge from tremendous amount of data, bringing limitless potential for information growth. One fact that cannot be ignored is that the techniques of machine learning and deep learning applications play a more significant role in the success of bioinformatics exploration from biological data point of view, and a linkage is emphasized and established to bridge these two data analytics techniques and bioinformatics in both industry and academia. This survey concentrates on the review of recent researches using data mining and deep learning approaches for analyzing the specific domain knowledge of bioinformatics. The authors give a brief but pithy summarization of numerous data mining algorithms used for preprocessing, classification and clustering as well as various optimized neural network architectures in deep learning methods, and their advantages and disadvantages in the practical applications are also discussed and compared in terms of their industrial usage. It is believed that in this review paper, valuable insights are provided for those who are dedicated to start using data analytics methods in bioinformatics.
近年来,医学科学和健康信息学领域取得了长足的进步,并深入分析了大规模数据的生成、收集和积累所带来的需求。与此同时,我们正进入一个新时期,新技术开始从大量数据中分析和探索知识,为信息增长带来了无限的潜力。有一个不容忽视的事实是,从生物数据的角度来看,机器学习和深度学习应用技术在生物信息学探索的成功中发挥了更为重要的作用,强调并建立了将这两种数据分析技术与生物信息学联系起来的桥梁,无论是在工业界还是学术界。本调查集中于对最近使用数据挖掘和深度学习方法分析生物信息学特定领域知识的研究的回顾。作者简要但精辟地总结了用于预处理、分类和聚类的众多数据挖掘算法以及深度学习方法中各种优化的神经网络架构,并讨论和比较了它们在实际应用中的优缺点,特别是在工业应用方面。相信在这篇综述文章中,为那些致力于在生物信息学中开始使用数据分析方法的人提供了有价值的见解。