Health Protection Agency, Centre for Infections, 61 Colindale Avenue, London NW9 5EQ, UK.
BMC Bioinformatics. 2010 Aug 26;11:437. doi: 10.1186/1471-2105-11-437.
Robust biomarkers are needed to improve microbial identification and diagnostics. Proteomics methods based on mass spectrometry can be used for the discovery of novel biomarkers through their high sensitivity and specificity. However, there has been a lack of a coherent pipeline connecting biomarker discovery with established approaches for evaluation and validation. We propose such a pipeline that uses in silico methods for refined biomarker discovery and confirmation.
The pipeline has four main stages: Sample preparation, mass spectrometry analysis, database searching and biomarker validation. Using the pathogen Clostridium botulinum as a model, we show that the robustness of candidate biomarkers increases with each stage of the pipeline. This is enhanced by the concordance shown between various database search algorithms for peptide identification. Further validation was done by focusing on the peptides that are unique to C. botulinum strains and absent in phylogenetically related Clostridium species. From a list of 143 peptides, 8 candidate biomarkers were reliably identified as conserved across C. botulinum strains. To avoid discarding other unique peptides, a confidence scale has been implemented in the pipeline giving priority to unique peptides that are identified by a union of algorithms.
This study demonstrates that implementing a coherent pipeline which includes intensive bioinformatics validation steps is vital for discovery of robust biomarkers. It also emphasises the importance of proteomics based methods in biomarker discovery.
需要强大的生物标志物来改善微生物鉴定和诊断。基于质谱的蛋白质组学方法可以通过其高灵敏度和特异性来发现新的生物标志物。然而,在将生物标志物的发现与已建立的评估和验证方法联系起来方面,一直缺乏一个连贯的途径。我们提出了这样一个使用计算方法进行精细生物标志物发现和确认的途径。
该途径有四个主要阶段:样品制备、质谱分析、数据库搜索和生物标志物验证。使用病原体肉毒梭菌作为模型,我们表明候选生物标志物的稳健性随着途径的每个阶段而增加。这通过各种数据库搜索算法在肽识别方面的一致性得到增强。进一步的验证是通过关注那些独特存在于肉毒梭菌菌株中而不存在于系统发育上相关的梭菌物种中的肽来完成的。从 143 个肽列表中,可靠地鉴定出 8 个候选生物标志物在肉毒梭菌菌株中是保守的。为了避免丢弃其他独特的肽,该途径中实施了置信度评分,优先考虑由算法联合识别的独特肽。
本研究表明,实施包括强化生物信息学验证步骤的连贯途径对于发现稳健的生物标志物至关重要。它还强调了基于蛋白质组学的方法在生物标志物发现中的重要性。