Henneges Carsten, Bullinger Dino, Fux Richard, Friese Natascha, Seeger Harald, Neubauer Hans, Laufer Stefan, Gleiter Christoph H, Schwab Matthias, Zell Andreas, Kammerer Bernd
Center for Bioinformatics Tübingen (ZBIT), Sand 1, D-72076 Tübingen, Germany.
BMC Cancer. 2009 Apr 5;9:104. doi: 10.1186/1471-2407-9-104.
Breast cancer belongs to the most frequent and severe cancer types in human. Since excretion of modified nucleosides from increased RNA metabolism has been proposed as a potential target in pathogenesis of breast cancer, the aim of the present study was to elucidate the predictability of breast cancer by means of urinary excreted nucleosides.
We analyzed urine samples from 85 breast cancer women and respective healthy controls to assess the metabolic profiles of nucleosides by a comprehensive bioinformatic approach. All included nucleosides/ribosylated metabolites were isolated by cis-diol specific affinity chromatography and measured with liquid chromatography ion trap mass spectrometry (LC-ITMS). A valid set of urinary metabolites was selected by exclusion of all candidates with poor linearity and/or reproducibility in the analytical setting. The bioinformatic tool of Oscillating Search Algorithm for Feature Selection (OSAF) was applied to iteratively improve features for training of Support Vector Machines (SVM) to better predict breast cancer.
After identification of 51 nucleosides/ribosylated metabolites in the urine of breast cancer women and/or controls by LC- ITMS coupling, a valid set of 35 candidates was selected for subsequent computational analyses. OSAF resulted in 44 pairwise ratios of metabolite features by iterative optimization. Based on this approach ultimately estimates for sensitivity and specificity of 83.5% and 90.6% were obtained for best prediction of breast cancer. The classification performance was dominated by metabolite pairs with SAH which highlights its importance for RNA methylation in cancer pathogenesis.
Extensive RNA-pathway analysis based on mass spectrometric analysis of metabolites and subsequent bioinformatic feature selection allowed for the identification of significant metabolic features related to breast cancer pathogenesis. The combination of mass spectrometric analysis and subsequent SVM-based feature selection represents a promising tool for the development of a non-invasive prediction system.
乳腺癌是人类最常见且最严重的癌症类型之一。由于从增加的RNA代谢中排泄的修饰核苷已被提出作为乳腺癌发病机制中的一个潜在靶点,本研究的目的是通过尿中排泄的核苷来阐明乳腺癌的可预测性。
我们分析了85名乳腺癌女性患者和相应健康对照者的尿液样本,通过综合生物信息学方法评估核苷的代谢谱。所有纳入的核苷/核糖基化代谢物通过顺式二醇特异性亲和色谱法分离,并用液相色谱离子阱质谱法(LC-ITMS)进行测量。通过排除在分析环境中线性和/或重现性差的所有候选物,选择了一组有效的尿代谢物。应用振荡搜索算法进行特征选择(OSAF)的生物信息学工具来迭代改进特征,以训练支持向量机(SVM),从而更好地预测乳腺癌。
通过LC-ITMS联用在乳腺癌女性患者和/或对照者尿液中鉴定出51种核苷/核糖基化代谢物后,选择了一组有效的35种候选物进行后续的计算分析。OSAF通过迭代优化产生了44对代谢物特征比率。基于这种方法,最终获得了用于乳腺癌最佳预测的灵敏度和特异性估计值,分别为83.5%和90.6%。分类性能由与SAH相关的代谢物对主导,这突出了其在癌症发病机制中对RNA甲基化的重要性。
基于代谢物质谱分析和后续生物信息学特征选择的广泛RNA通路分析,能够识别与乳腺癌发病机制相关的重要代谢特征。质谱分析与后续基于SVM的特征选择相结合,是开发非侵入性预测系统的一种有前景的工具。