• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用系统发育信息和元数据整合来估算时间序列肠道微生物组图谱的扩散模型。

Diffusion model for imputing time-series gut microbiome profiles using phylogenetic information and metadata integration.

作者信息

Seki Misato, Zhang Yao-Zhong, Imoto Seiya

机构信息

Division of Health Medical Intelligence, Human Genome Center, The Institute of Medical Science, The University of Tokyo, Tokyo 108-8639, Japan.

出版信息

Bioinform Adv. 2025 Jul 28;5(1):vbaf181. doi: 10.1093/bioadv/vbaf181. eCollection 2025.

DOI:10.1093/bioadv/vbaf181
PMID:40861397
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12371328/
Abstract

MOTIVATION

The gut microbiota interacts closely with the host, playing crucial roles in maintaining health. Analysing time-series genomic data enables the investigation of dynamic microbiota changes. However, missing values create significant analytical challenges.

RESULTS

We propose a microbiome imputation framework based on a conditional score-based diffusion model, tailored to microbiome data by incorporating phylogenetic convolutional layers. Our method effectively reduces mean absolute error across various missing data ratios for both 16S rRNA and whole-genome shotgun profiles. The imputed datasets enhance downstream predictive tasks, achieving area under the curve scores that exceed or are comparable with those of the existing methods. To further improve the performance, we embedded host metadata into the model using a tabular encoding approach, which yielded additional improvements particularly under higher missing ratios. Our findings underscore the potential of the diffusion model for processing time-series microbiome data with missing values.

AVAILABILITY AND IMPLEMENTATION

Related codes and dataset can be found at: https://github.com/misatoseki/metag_time_impute_phylo.git.

摘要

动机

肠道微生物群与宿主密切相互作用,在维持健康方面发挥着关键作用。分析时间序列基因组数据有助于研究微生物群的动态变化。然而,缺失值带来了重大的分析挑战。

结果

我们提出了一种基于条件分数的扩散模型的微生物组插补框架,通过纳入系统发育卷积层对微生物组数据进行了定制。我们的方法有效降低了16S rRNA和全基因组鸟枪法图谱在各种缺失数据比例下的平均绝对误差。插补后的数据集增强了下游预测任务,获得的曲线下面积分数超过或与现有方法相当。为了进一步提高性能,我们使用表格编码方法将宿主元数据嵌入模型,这在较高缺失比例下尤其带来了额外的改进。我们的研究结果强调了扩散模型在处理具有缺失值的时间序列微生物组数据方面的潜力。

可用性和实现

相关代码和数据集可在以下网址找到:https://github.com/misatoseki/metag_time_impute_phylo.git。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/6777552b3fbb/vbaf181f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/89d58bd10b6e/vbaf181f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/28130a5eb39f/vbaf181f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/9dd3c03c538c/vbaf181f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/86adfaa760fd/vbaf181f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/7fb9eb3d0b7a/vbaf181f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/6777552b3fbb/vbaf181f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/89d58bd10b6e/vbaf181f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/28130a5eb39f/vbaf181f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/9dd3c03c538c/vbaf181f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/86adfaa760fd/vbaf181f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/7fb9eb3d0b7a/vbaf181f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e50d/12371328/6777552b3fbb/vbaf181f6.jpg

相似文献

1
Diffusion model for imputing time-series gut microbiome profiles using phylogenetic information and metadata integration.利用系统发育信息和元数据整合来估算时间序列肠道微生物组图谱的扩散模型。
Bioinform Adv. 2025 Jul 28;5(1):vbaf181. doi: 10.1093/bioadv/vbaf181. eCollection 2025.
2
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
5
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
6
Improving the FAIRness and Sustainability of the NHGRI Resources Ecosystem.提高国家人类基因组研究所资源生态系统的公平性和可持续性。
ArXiv. 2025 Aug 19:arXiv:2508.13498v1.
7
Short-Term Memory Impairment短期记忆障碍
8
Multi-scale variational autoencoder for imputation of missing values in untargeted metabolomics using whole-genome sequencing data.基于全基因组测序数据的无靶向代谢组学缺失值插补的多尺度变分自动编码器。
Comput Biol Med. 2024 Sep;179:108813. doi: 10.1016/j.compbiomed.2024.108813. Epub 2024 Jul 1.
9
A novel deep learning method for predictive modeling of microbiome data.一种用于微生物组数据预测建模的新型深度学习方法。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa073.
10
A segment anything model-guided and match-based semi-supervised segmentation framework for medical imaging.一种用于医学成像的基于段式分割模型引导和匹配的半监督分割框架。
Med Phys. 2025 Mar 29. doi: 10.1002/mp.17785.

本文引用的文献

1
A self-knowledge distillation-driven CNN-LSTM model for predicting disease outcomes using longitudinal microbiome data.一种基于自我知识蒸馏驱动的CNN-LSTM模型,用于使用纵向微生物组数据预测疾病结果。
Bioinform Adv. 2023 May 18;3(1):vbad059. doi: 10.1093/bioadv/vbad059. eCollection 2023.
2
DeepMicroGen: a generative adversarial network-based method for longitudinal microbiome data imputation.DeepMicroGen:一种基于生成对抗网络的纵向微生物组数据插补方法。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad286.
3
Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4.
利用 MetaPhlAn 4 对未鉴定物种进行宏基因组分类分析的扩展和改进。
Nat Biotechnol. 2023 Nov;41(11):1633-1644. doi: 10.1038/s41587-023-01688-w. Epub 2023 Feb 23.
4
Statistical challenges in longitudinal microbiome data analysis.纵向微生物组数据分析中的统计挑战。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac273.
5
Human disease prediction from microbiome data by multiple feature fusion and deep learning.通过多特征融合和深度学习从微生物组数据预测人类疾病
iScience. 2022 Mar 16;25(4):104081. doi: 10.1016/j.isci.2022.104081. eCollection 2022 Apr 15.
6
phyLoSTM: a novel deep learning model on disease prediction from longitudinal microbiome data.phyLoSTM:一种基于纵向微生物组数据进行疾病预测的新型深度学习模型。
Bioinformatics. 2021 Nov 5;37(21):3707-3714. doi: 10.1093/bioinformatics/btab482.
7
The aging gut microbiome and its impact on host immunity.衰老的肠道微生物组及其对宿主免疫的影响。
Genes Immun. 2021 Oct;22(5-6):289-303. doi: 10.1038/s41435-021-00126-8. Epub 2021 Apr 19.
8
TaxoNN: ensemble of neural networks on stratified microbiome data for disease prediction.TaxoNN:基于分层微生物组数据的神经网络集成用于疾病预测。
Bioinformatics. 2020 Nov 1;36(17):4544-4550. doi: 10.1093/bioinformatics/btaa542.
9
Interaction between drugs and the gut microbiome.药物与肠道微生物组的相互作用。
Gut. 2020 Aug;69(8):1510-1519. doi: 10.1136/gutjnl-2019-320204. Epub 2020 May 14.
10
Microbiome Composition and Its Impact on the Development of Allergic Diseases.微生物组组成及其对过敏性疾病发展的影响。
Front Immunol. 2020 Apr 23;11:700. doi: 10.3389/fimmu.2020.00700. eCollection 2020.