Suppr超能文献

超越微生物群落的分类分析:一种重新审视结直肠癌中微生物群落变化的功能方法。

Beyond Taxonomic Analysis of Microbiomes: A Functional Approach for Revisiting Microbiome Changes in Colorectal Cancer.

作者信息

Norouzi-Beirami Mohammad Hossein, Marashi Sayed-Amir, Banaei-Moghaddam Ali Mohammad, Kavousi Kaveh

机构信息

Laboratory of Complex Biological Systems and Bioinformatics, Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.

Department of Biotechnology, College of Science, University of Tehran, Tehran, Iran.

出版信息

Front Microbiol. 2020 Jan 23;10:3117. doi: 10.3389/fmicb.2019.03117. eCollection 2019.

Abstract

Colorectal cancer (CRC) is one of the most prevalent cancers in the world, especially in developed countries. In different studies, the association between CRC and dysbiosis of gut microbiome has been reported. However, most of these works focus on the taxonomic variation of the microbiome, which presents little, if any, functional insight about the reason behind and/or consequences of microbiome dysbiosis. In this study, we used a previously reported metagenome dataset which is obtained by sequencing 156 microbiome samples of healthy individuals as the control group (Co), as well as microbiome samples of patients with advanced colorectal adenoma (Ad) and colorectal carcinoma (Ca). Features of the microbiome samples have been analyzed at the level of species, as well as four functional levels, i.e., gene, KEGG orthology (KO) group, Enzyme Commission (EC) number, and reaction. It was shown that, at each of these levels, certain features exist which show significant changing trends during cancer progression. In the next step, a list of these features were extracted, which were shown to be able to predict the category of Co, Ad, and Ca samples with an accuracy of >85%. When only one group of features (species, gene, KO group, EC number, reaction) was used, KO-related features were found to be the most successful features for classifying the three categories of samples. Notably, species-related features showed the least success in sample classification. Furthermore, by applying an independent test set, we showed that these performance trends are not limited to our original dataset. We determined the most important classification features at each of the four functional levels. We propose that these features can be considered as biomarkers of CRC progression. Finally, we show that the intra-diversity of each sample at the levels of bacterial species and genes is much more than those of the KO groups, EC numbers, and reactions of that sample. Therefore, we conclude that the microbiome diversity at the species level, or gene level, is not necessarily associated with the diversity at the functional level, which again indicates the importance of KO-, EC-, and reaction-based features in metagenome analysis. The source code of proposed method is freely available from https://www.bioinformatics.org/mamed.

摘要

结直肠癌(CRC)是世界上最常见的癌症之一,在发达国家尤为如此。在不同的研究中,已经报道了结直肠癌与肠道微生物群失调之间的关联。然而,这些研究大多集中在微生物群的分类学变化上,对于微生物群失调背后的原因和/或后果,几乎没有提供任何功能方面的见解。在本研究中,我们使用了一个先前报道的宏基因组数据集,该数据集通过对156名健康个体的微生物群样本进行测序获得,作为对照组(Co),同时还有晚期结直肠腺瘤(Ad)和结直肠癌(Ca)患者的微生物群样本。已在物种水平以及四个功能水平,即基因、京都基因与基因组百科全书(KEGG)直系同源组(KO)、酶委员会(EC)编号和反应水平上分析了微生物群样本的特征。结果表明,在这些水平中的每一个上,都存在某些特征,这些特征在癌症进展过程中呈现出显著的变化趋势。下一步,提取了这些特征列表,结果表明这些特征能够以>85%的准确率预测Co、Ad和Ca样本的类别。当仅使用一组特征(物种、基因、KO组、EC编号、反应)时,发现与KO相关的特征是对这三类样本进行分类最成功的特征。值得注意的是,与物种相关的特征在样本分类中表现出的成功率最低。此外,通过应用一个独立的测试集,我们表明这些性能趋势并不局限于我们的原始数据集。我们确定了四个功能水平中每一个水平上最重要的分类特征。我们提出这些特征可被视为结直肠癌进展的生物标志物。最后,我们表明每个样本在细菌物种和基因水平上的内部多样性远高于该样本在KO组、EC编号和反应水平上的多样性。因此,我们得出结论,物种水平或基因水平上的微生物群多样性不一定与功能水平上的多样性相关,这再次表明了基于KO、EC和反应的特征在宏基因组分析中的重要性。所提出方法的源代码可从https://www.bioinformatics.org/mamed免费获取。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验