Basic Experimental of Natural Science, University of Science and Technology Beijing, Beijing 100083, China.
School of Mathematics and Physics, University of Science and Technology Beijing, Beijing 100083, China.
Genes (Basel). 2019 Feb 1;10(2):112. doi: 10.3390/genes10020112.
The imbalance of human gut microbiota has been associated with colorectal cancer. In recent years, metagenomics research has provided a large amount of scientific data enabling us to study the dedicated roles of gut microbes in the onset and progression of cancer. We removed unrelated and redundant features during feature selection by mutual information. We then trained a random forest classifier on a large metagenomics dataset of colorectal cancer patients and healthy people assembled from published reports and extracted and analysed the information from the learned decision trees. We identified key microbial species associated with colorectal cancers. These microbes included , , sp., and We obtained the optimal splitting abundance thresholds for these species to distinguish between healthy and colorectal cancer samples. This extracted consensus decision tree may be applied to the diagnosis of colorectal cancers.
人类肠道微生物群落失衡与结直肠癌有关。近年来,宏基因组学研究提供了大量的科学数据,使我们能够研究肠道微生物在癌症发生和发展中的特定作用。我们通过互信息去除特征选择中的不相关和冗余特征。然后,我们在一个由已发表报告组装的大型结直肠癌患者和健康人群的宏基因组学数据集上训练了一个随机森林分类器,并从学习到的决策树中提取和分析信息。我们确定了与结直肠癌相关的关键微生物物种。这些微生物包括 Faecalibacterium prausnitzii、Roseburia intestinalis、Eubacterium eligens、Coprococcus eutactus 和 Blautia hansenii。我们获得了这些物种区分健康和结直肠癌样本的最佳分裂丰度阈值。这个提取的共识决策树可以应用于结直肠癌的诊断。