• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EXPLANA:横断面和纵向微生物组研究中用于探索性分析和特征选择的用户友好型工作流程。

EXPLANA: A user-friendly workflow for EXPLoratory ANAlysis and feature selection in cross-sectional and longitudinal microbiome studies.

作者信息

Fouquier Jennifer, Stanislawski Maggie, O'Connor John, Scadden Ashley, Lozupone Catherine

机构信息

Department of Biomedical Informatics, School of Medicine, University of Colorado, Anschutz Medical Campus, Aurora, CO.

出版信息

bioRxiv. 2024 Aug 15:2024.03.20.585968. doi: 10.1101/2024.03.20.585968.

DOI:10.1101/2024.03.20.585968
PMID:39185201
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11343137/
Abstract

MOTIVATION

Longitudinal microbiome studies (LMS) are increasingly common but have analytic challenges including non-independent data requiring mixed-effects models and large amounts of data that motivate exploratory analysis to identify factors related to outcome variables. Although change analysis (i.e. calculating deltas between values at different timepoints) can be powerful, how to best conduct these analyses is not always clear. For example, observational LMS measurements show natural fluctuations, so baseline might not be a reference of primary interest; whereas, for interventional LMS, baseline is a key reference point, often indicating the start of treatment.

RESULTS

To address these challenges, we developed a feature selection workflow for cross-sectional and LMS that supports numerical and categorical data called EXPLANA (EXPLoratory ANAlysis). Machine-learning methods were combined with different types of change calculations and downstream interpretation methods to identify statistically meaningful variables and explain their relationship to outcomes. EXPLANA generates an interactive report that textually and graphically summarizes methods and results. EXPLANA had good performance on simulated data, with an average area under the curve (AUC) of 0.91 (range: 0.79-1.0, SD = 0.05), outperformed an existing tool (AUC: 0.95 vs. 0.56), and identified novel order-dependent categorical feature changes. EXPLANA is broadly applicable and simplifies analytics for identifying features related to outcomes of interest.

摘要

动机

纵向微生物组研究(LMS)越来越普遍,但存在分析挑战,包括需要混合效应模型处理的非独立数据以及大量促使进行探索性分析以识别与结果变量相关因素的数据。尽管变化分析(即计算不同时间点值之间的差值)可能很有效,但如何最好地进行这些分析并不总是明确的。例如,观察性LMS测量显示自然波动,因此基线可能不是主要关注的参考;而对于干预性LMS,基线是一个关键参考点,通常指示治疗开始。

结果

为应对这些挑战,我们开发了一种用于横断面和LMS的特征选择工作流程,该流程支持数值和分类数据,称为EXPLANA(探索性分析)。机器学习方法与不同类型的变化计算及下游解释方法相结合,以识别具有统计学意义的变量并解释它们与结果的关系。EXPLANA生成一份交互式报告,以文本和图形方式总结方法和结果。EXPLANA在模拟数据上表现良好,曲线下面积(AUC)平均为0.91(范围:0.79 - 1.0,标准差 = 0.05),优于现有工具(AUC:0.95对0.56),并识别出与顺序相关的新型分类特征变化。EXPLANA具有广泛的适用性,简化了用于识别与感兴趣结果相关特征的分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/84f85de761fd/nihpp-2024.03.20.585968v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/e99dbf680a12/nihpp-2024.03.20.585968v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/9a469c88ee8e/nihpp-2024.03.20.585968v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/27109cfb5269/nihpp-2024.03.20.585968v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/aef8a1471c6a/nihpp-2024.03.20.585968v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/84f85de761fd/nihpp-2024.03.20.585968v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/e99dbf680a12/nihpp-2024.03.20.585968v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/9a469c88ee8e/nihpp-2024.03.20.585968v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/27109cfb5269/nihpp-2024.03.20.585968v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/aef8a1471c6a/nihpp-2024.03.20.585968v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/53a2/11343137/84f85de761fd/nihpp-2024.03.20.585968v2-f0005.jpg

相似文献

1
EXPLANA: A user-friendly workflow for EXPLoratory ANAlysis and feature selection in cross-sectional and longitudinal microbiome studies.EXPLANA:横断面和纵向微生物组研究中用于探索性分析和特征选择的用户友好型工作流程。
bioRxiv. 2024 Aug 15:2024.03.20.585968. doi: 10.1101/2024.03.20.585968.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Application of information theoretic feature selection and machine learning methods for the development of genetic risk prediction models.信息论特征选择和机器学习方法在遗传风险预测模型开发中的应用。
Sci Rep. 2021 Dec 2;11(1):23335. doi: 10.1038/s41598-021-00854-x.
4
Recovery schools for improving behavioral and academic outcomes among students in recovery from substance use disorders: a systematic review.改善物质使用障碍康复期学生行为和学业成果的康复学校:一项系统综述
Campbell Syst Rev. 2018 Oct 4;14(1):1-86. doi: 10.4073/csr.2018.9. eCollection 2018.
5
Democratizing cheminformatics: interpretable chemical grouping using an automated KNIME workflow.化学信息学的民主化:使用自动化的KNIME工作流程进行可解释的化学分组
J Cheminform. 2024 Aug 16;16(1):101. doi: 10.1186/s13321-024-00894-1.
6
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
7
Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合,以预测放射性肺损伤。
Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.
8
Impact of summer programmes on the outcomes of disadvantaged or 'at risk' young people: A systematic review.暑期项目对处境不利或“有风险”的年轻人的影响:一项系统综述。
Campbell Syst Rev. 2024 Jun 13;20(2):e1406. doi: 10.1002/cl2.1406. eCollection 2024 Jun.
9
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].[氯化乙酰甲胆碱支气管激发试验标准技术规范(2023年)]
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
10
A random forest method with feature selection for developing medical prediction models with clustered and longitudinal data.基于聚类和纵向数据的医学预测模型的特征选择随机森林方法。
J Biomed Inform. 2021 May;117:103763. doi: 10.1016/j.jbi.2021.103763. Epub 2021 Mar 26.

本文引用的文献

1
Association between microbiome and the development of adverse posttraumatic neuropsychiatric sequelae after traumatic stress exposure.创伤后应激暴露后微生物组与不良创伤后神经精神后遗症发展的关系。
Transl Psychiatry. 2023 Nov 18;13(1):354. doi: 10.1038/s41398-023-02643-8.
2
An insight into the functional alterations in the gut microbiome of healthy adults in response to a multi-strain probiotic intake: a single arm open label trial.探究健康成年人在摄入多菌株益生菌后肠道微生物组功能变化的洞察:一项单臂开放标签试验。
Front Cell Infect Microbiol. 2023 Sep 29;13:1240267. doi: 10.3389/fcimb.2023.1240267. eCollection 2023.
3
coda4microbiome: compositional data analysis for microbiome cross-sectional and longitudinal studies.
coda4microbiome:微生物组横断面和纵向研究的组成数据分析。
BMC Bioinformatics. 2023 Mar 6;24(1):82. doi: 10.1186/s12859-023-05205-3.
4
Changes in Microbiome Dominance Are Associated With Declining Lung Function and Fluctuating Inflammation in People With Cystic Fibrosis.微生物群优势的变化与囊性纤维化患者肺功能下降和炎症波动有关。
Front Microbiol. 2022 May 13;13:885822. doi: 10.3389/fmicb.2022.885822. eCollection 2022.
5
Gut microbiome and its role in colorectal cancer.肠道微生物组及其在结直肠癌中的作用。
BMC Cancer. 2021 Dec 11;21(1):1325. doi: 10.1186/s12885-021-09054-2.
6
timeOmics: an R package for longitudinal multi-omics data integration.timeOmics:一个用于纵向多组学数据整合的 R 包。
Bioinformatics. 2022 Jan 3;38(2):577-579. doi: 10.1093/bioinformatics/btab664.
7
Microbial trend analysis for common dynamic trend, group comparison, and classification in longitudinal microbiome study.纵向微生物组研究中常见动态趋势、组间比较和分类的微生物趋势分析。
BMC Genomics. 2021 Sep 15;22(1):667. doi: 10.1186/s12864-021-07948-w.
8
Multi 'omic data integration: A review of concepts, considerations, and approaches.多组学数据整合:概念、考虑因素和方法综述。
Semin Perinatol. 2021 Oct;45(6):151456. doi: 10.1016/j.semperi.2021.151456. Epub 2021 Jun 17.
9
The Gut Microbiome in Autism: Study-Site Effects and Longitudinal Analysis of Behavior Change.自闭症中的肠道微生物群:研究地点效应及行为变化的纵向分析
mSystems. 2021 Apr 6;6(2):e00848-20. doi: 10.1128/mSystems.00848-20.
10
The high prevalence of among nursing home elders associates with a dysbiotic microbiome.养老院老年人中 的高发与微生物组的失调有关。
Gut Microbes. 2021 Jan-Dec;13(1):1-15. doi: 10.1080/19490976.2021.1897209.