• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepMicroGen:一种基于生成对抗网络的纵向微生物组数据插补方法。

DeepMicroGen: a generative adversarial network-based method for longitudinal microbiome data imputation.

机构信息

Department of Computer Science, Virginia Tech, Blacksburg, VA 24060, United States.

College of Nursing, University of South Florida, Tampa, FL 33620, United States.

出版信息

Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad286.

DOI:10.1093/bioinformatics/btad286
PMID:37099704
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10196688/
Abstract

MOTIVATION

The human microbiome, which is linked to various diseases by growing evidence, has a profound impact on human health. Since changes in the composition of the microbiome across time are associated with disease and clinical outcomes, microbiome analysis should be performed in a longitudinal study. However, due to limited sample sizes and differing numbers of timepoints for different subjects, a significant amount of data cannot be utilized, directly affecting the quality of analysis results. Deep generative models have been proposed to address this lack of data issue. Specifically, a generative adversarial network (GAN) has been successfully utilized for data augmentation to improve prediction tasks. Recent studies have also shown improved performance of GAN-based models for missing value imputation in a multivariate time series dataset compared with traditional imputation methods.

RESULTS

This work proposes DeepMicroGen, a bidirectional recurrent neural network-based GAN model, trained on the temporal relationship between the observations, to impute the missing microbiome samples in longitudinal studies. DeepMicroGen outperforms standard baseline imputation methods, showing the lowest mean absolute error for both simulated and real datasets. Finally, the proposed model improved the predicted clinical outcome for allergies, by providing imputation for an incomplete longitudinal dataset used to train the classifier.

AVAILABILITY AND IMPLEMENTATION

DeepMicroGen is publicly available at https://github.com/joungmin-choi/DeepMicroGen.

摘要

动机

越来越多的证据表明,人类微生物组与各种疾病有关,对人类健康有深远的影响。由于微生物组随时间的组成变化与疾病和临床结果有关,因此应该在纵向研究中进行微生物组分析。然而,由于样本量有限,不同受试者的时间点数量不同,大量数据无法被利用,直接影响了分析结果的质量。深度生成模型已被提出以解决这个数据不足的问题。具体来说,生成对抗网络(GAN)已成功用于数据扩充,以改善预测任务。最近的研究还表明,与传统的插补方法相比,基于 GAN 的模型在多元时间序列数据集的缺失值插补方面具有更好的性能。

结果

本研究提出了 DeepMicroGen,这是一种基于双向递归神经网络的 GAN 模型,针对观察值之间的时间关系进行训练,以对纵向研究中的缺失微生物样本进行插补。DeepMicroGen 优于标准基线插补方法,对于模拟和真实数据集,均显示出最低的平均绝对误差。最后,通过对用于训练分类器的不完整纵向数据集进行插补,该模型提高了过敏的预测临床结果。

可用性和实现

DeepMicroGen 可在 https://github.com/joungmin-choi/DeepMicroGen 上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/59572d64e6fa/btad286f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/a2a14e5e4693/btad286f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/85ad28fa6140/btad286f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/d570e8ab03ff/btad286f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/b15f7ae61b86/btad286f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/59572d64e6fa/btad286f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/a2a14e5e4693/btad286f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/85ad28fa6140/btad286f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/d570e8ab03ff/btad286f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/b15f7ae61b86/btad286f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c62b/10196688/59572d64e6fa/btad286f5.jpg

相似文献

1
DeepMicroGen: a generative adversarial network-based method for longitudinal microbiome data imputation.DeepMicroGen:一种基于生成对抗网络的纵向微生物组数据插补方法。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad286.
2
PC-GAIN: Pseudo-label conditional generative adversarial imputation networks for incomplete data.PC-GAIN:用于不完整数据的伪标签条件生成对抗插补网络
Neural Netw. 2021 Sep;141:395-403. doi: 10.1016/j.neunet.2021.05.033. Epub 2021 Jun 2.
3
phylaGAN: data augmentation through conditional GANs and autoencoders for improving disease prediction accuracy using microbiome data.phylaGAN:通过条件 GAN 和自动编码器进行数据增强,以改善使用微生物组数据进行疾病预测的准确性。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae161.
4
Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems.基于生成对抗网络的多重插补法解决高维分块缺失值问题
Proc Int Conf Mach Learn Appl. 2021 Dec;2021:791-798. doi: 10.1109/icmla52953.2021.00131.
5
MB-GAN: Microbiome Simulation via Generative Adversarial Network.MB-GAN:基于生成对抗网络的微生物组模拟。
Gigascience. 2021 Feb 5;10(2). doi: 10.1093/gigascience/giab005.
6
A joint learning method for incomplete and imbalanced data in electronic health record based on generative adversarial networks.基于生成对抗网络的电子健康记录中不完全和不平衡数据的联合学习方法。
Comput Biol Med. 2024 Jan;168:107687. doi: 10.1016/j.compbiomed.2023.107687. Epub 2023 Nov 14.
7
Medical multivariate time series imputation and forecasting based on a recurrent conditional Wasserstein GAN and attention.基于循环条件瓦瑟斯坦生成对抗网络和注意力机制的医学多变量时间序列插补与预测
J Biomed Inform. 2023 Mar;139:104320. doi: 10.1016/j.jbi.2023.104320. Epub 2023 Feb 13.
8
Adversarial Joint-Learning Recurrent Neural Network for Incomplete Time Series Classification.对抗式联合学习循环神经网络在不完全时间序列分类中的应用。
IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):1765-1776. doi: 10.1109/TPAMI.2020.3027975. Epub 2022 Mar 4.
9
Longitudinal Prediction of Infant MR Images With Multi-Contrast Perceptual Adversarial Learning.基于多对比度感知对抗学习的婴儿磁共振图像纵向预测
Front Neurosci. 2021 Sep 9;15:653213. doi: 10.3389/fnins.2021.653213. eCollection 2021.
10
Electronic medical records imputation by temporal Generative Adversarial Network.基于时间生成对抗网络的电子病历插补
BioData Min. 2024 Jun 26;17(1):19. doi: 10.1186/s13040-024-00372-2.

引用本文的文献

1
Diffusion model for imputing time-series gut microbiome profiles using phylogenetic information and metadata integration.利用系统发育信息和元数据整合来估算时间序列肠道微生物组图谱的扩散模型。
Bioinform Adv. 2025 Jul 28;5(1):vbaf181. doi: 10.1093/bioadv/vbaf181. eCollection 2025.
2
Semisynthetic simulation for microbiome data analysis.用于微生物组数据分析的半合成模拟
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf051.
3
Deep learning in microbiome analysis: a comprehensive review of neural network models.微生物组分析中的深度学习:神经网络模型综述

本文引用的文献

1
Generalizing predictions to unseen sequencing profiles via deep generative models.通过深度生成模型将预测推广到未见的测序谱。
Sci Rep. 2022 May 3;12(1):7151. doi: 10.1038/s41598-022-11363-w.
2
phyLoSTM: a novel deep learning model on disease prediction from longitudinal microbiome data.phyLoSTM:一种基于纵向微生物组数据进行疾病预测的新型深度学习模型。
Bioinformatics. 2021 Nov 5;37(21):3707-3714. doi: 10.1093/bioinformatics/btab482.
3
MB-GAN: Microbiome Simulation via Generative Adversarial Network.MB-GAN:基于生成对抗网络的微生物组模拟。
Front Microbiol. 2025 Jan 22;15:1516667. doi: 10.3389/fmicb.2024.1516667. eCollection 2024.
4
Designing host-associated microbiomes using the consumer/resource model.使用消费者/资源模型设计宿主相关微生物群。
mSystems. 2025 Jan 21;10(1):e0106824. doi: 10.1128/msystems.01068-24. Epub 2024 Dec 9.
5
Wise Roles and Future Visionary Endeavors of Current Emperor: Advancing Dynamic Methods for Longitudinal Microbiome Meta-Omics Data in Personalized and Precision Medicine.当代帝王的明智角色与未来前瞻性努力:推进个性化与精准医学中纵向微生物组元组学数据的动态方法
Adv Sci (Weinh). 2024 Dec;11(47):e2400458. doi: 10.1002/advs.202400458. Epub 2024 Nov 13.
6
MIDASim: a fast and simple simulator for realistic microbiome data.MIDASim:一个快速而简单的用于真实微生物组数据模拟的工具。
Microbiome. 2024 Jul 22;12(1):135. doi: 10.1186/s40168-024-01822-z.
7
In silico generation and augmentation of regulatory variants from massively parallel reporter assay using conditional variational autoencoder.利用条件变分自编码器通过大规模平行报告基因检测在计算机上生成和增强调控变异体。
bioRxiv. 2024 Jun 29:2024.06.25.600715. doi: 10.1101/2024.06.25.600715.
8
Detracking Autoencoding Conditional Generative Adversarial Network: Improved Generative Adversarial Network Method for Tabular Missing Value Imputation.解耦自动编码条件生成对抗网络:用于表格缺失值插补的改进生成对抗网络方法
Entropy (Basel). 2024 May 4;26(5):402. doi: 10.3390/e26050402.
9
phylaGAN: data augmentation through conditional GANs and autoencoders for improving disease prediction accuracy using microbiome data.phylaGAN:通过条件 GAN 和自动编码器进行数据增强,以改善使用微生物组数据进行疾病预测的准确性。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae161.
10
Data pre-processing for analyzing microbiome data - A mini review.用于分析微生物组数据的数据预处理——一篇综述短文
Comput Struct Biotechnol J. 2023 Oct 4;21:4804-4815. doi: 10.1016/j.csbj.2023.10.001. eCollection 2023.
Gigascience. 2021 Feb 5;10(2). doi: 10.1093/gigascience/giab005.
4
Health and disease markers correlate with gut microbiome composition across thousands of people.健康和疾病标志物与数千人的肠道微生物组组成相关。
Nat Commun. 2020 Oct 15;11(1):5206. doi: 10.1038/s41467-020-18871-1.
5
A deep learning-based, unsupervised method to impute missing values in electronic health records for improved patient management.一种基于深度学习的、无监督的方法,用于填补电子健康记录中的缺失值,以改善患者管理。
J Biomed Inform. 2020 Nov;111:103576. doi: 10.1016/j.jbi.2020.103576. Epub 2020 Oct 1.
6
Microbiome definition re-visited: old concepts and new challenges.微生物组定义再探讨:旧概念和新挑战。
Microbiome. 2020 Jun 30;8(1):103. doi: 10.1186/s40168-020-00875-0.
7
TaxoNN: ensemble of neural networks on stratified microbiome data for disease prediction.TaxoNN:基于分层微生物组数据的神经网络集成用于疾病预测。
Bioinformatics. 2020 Nov 1;36(17):4544-4550. doi: 10.1093/bioinformatics/btaa542.
8
A novel deep learning method for predictive modeling of microbiome data.一种用于微生物组数据预测建模的新型深度学习方法。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa073.
9
Fecal dysbiosis in infants with cystic fibrosis is associated with early linear growth failure.囊性纤维化患儿粪便菌群失调与早期线性生长发育不良有关。
Nat Med. 2020 Feb;26(2):215-221. doi: 10.1038/s41591-019-0714-x. Epub 2020 Jan 20.
10
Microbiome Datasets Are Compositional: And This Is Not Optional.微生物组数据集具有构成性:这并非可有可无。
Front Microbiol. 2017 Nov 15;8:2224. doi: 10.3389/fmicb.2017.02224. eCollection 2017.