Center for Systems Biology, Soochow University, Suzhou, Jiangsu, China.
School of Chemistry, Biology and Material Engineering, Suzhou University of Science and Technology, China.
Brief Bioinform. 2019 May 21;20(3):952-975. doi: 10.1093/bib/bbx158.
Biomarkers are a class of measurable and evaluable indicators with the potential to predict disease initiation and progression. In contrast to disease-associated factors, biomarkers hold the promise to capture the changeable signatures of biological states. With methodological advances, computer-aided biomarker discovery has now become a burgeoning paradigm in the field of biomedical science. In recent years, the 'big data' term has accumulated for the systematical investigation of complex biological phenomena and promoted the flourishing of computational methods for systems-level biomarker screening. Compared with routine wet-lab experiments, bioinformatics approaches are more efficient to decode disease pathogenesis under a holistic framework, which is propitious to identify biomarkers ranging from single molecules to molecular networks for disease diagnosis, prognosis and therapy. In this review, the concept and characteristics of typical biomarker types, e.g. single molecular biomarkers, module/network biomarkers, cross-level biomarkers, etc., are explicated on the guidance of systems biology. Then, publicly available data resources together with some well-constructed biomarker databases and knowledge bases are introduced. Biomarker identification models using mathematical, network and machine learning theories are sequentially discussed. Based on network substructural and functional evidences, a novel bioinformatics model is particularly highlighted for microRNA biomarker discovery. This article aims to give deep insights into the advantages and challenges of current computational approaches for biomarker detection, and to light up the future wisdom toward precision medicine and nation-wide healthcare.
生物标志物是一类具有潜在疾病起始和进展预测能力的可测量和可评估指标。与疾病相关因素不同,生物标志物有望捕获生物状态的可变化特征。随着方法学的进步,计算机辅助生物标志物发现已成为生物医学科学领域的一个新兴范例。近年来,“大数据”一词积累了对复杂生物现象的系统研究,并促进了系统水平生物标志物筛选的计算方法的蓬勃发展。与常规湿实验室实验相比,生物信息学方法更有效地在整体框架下解码疾病发病机制,有利于识别从单个分子到分子网络的疾病诊断、预后和治疗标志物。在本综述中,我们在系统生物学的指导下阐述了典型生物标志物类型的概念和特征,例如单分子标志物、模块/网络标志物、跨层次标志物等。然后,介绍了公开可用的数据资源以及一些精心构建的生物标志物数据库和知识库。我们还依次讨论了使用数学、网络和机器学习理论的生物标志物识别模型。基于网络的亚结构和功能证据,我们特别强调了一种新的生物信息学模型,用于 miRNA 生物标志物发现。本文旨在深入了解当前计算方法在生物标志物检测方面的优势和挑战,并为精准医学和全国性医疗保健指明未来的智慧方向。