• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从基因组规模数据到传染病模型:一种基于贝叶斯网络推动模型开发的策略。

From genome-scale data to models of infectious disease: A Bayesian network-based strategy to drive model development.

作者信息

Yin Weiwei, Kissinger Jessica C, Moreno Alberto, Galinski Mary R, Styczynski Mark P

机构信息

School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0100, USA.

Department of Genetics, Institute of Bioinformatics, Center for Tropical and Emerging Global Diseases, University of Georgia, Athens, GA, USA.

出版信息

Math Biosci. 2015 Dec;270(Pt B):156-68. doi: 10.1016/j.mbs.2015.06.006. Epub 2015 Jun 17.

DOI:10.1016/j.mbs.2015.06.006
PMID:26093035
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4679518/
Abstract

High-throughput, genome-scale data present a unique opportunity to link host to pathogen on a molecular level. Forging such connections will help drive the development of mathematical models to better understand and predict both pathogen behavior and the epidemiology of infectious diseases, including malaria. However, the datasets that can aid in identifying these links and models are vast and not amenable to simple, reductionist, and univariate analyses. These datasets require data mining in order to identify the truly important measurements that best describe clinical and molecular observations. Moreover, these datasets typically have relatively few samples due to experimental limitations (particularly for human studies or in vivo animal experiments), making data mining extremely difficult. Here, after first providing a brief overview of common strategies for data reduction and identification of relationships between variables for inclusion in mathematical models, we present a new generalized strategy for performing these data reduction and relationship inference tasks. Our approach emphasizes the importance of robustness when using data to drive model development, particularly when using genome-scale, small-sample in vivo data. We identify the use of appropriate feature reduction combined with data permutations and subsampling strategies as being critical to enable increasingly robust results from network inference using high-dimensional, low-observation data.

摘要

高通量、全基因组规模的数据提供了一个在分子水平上连接宿主与病原体的独特机会。建立这样的联系将有助于推动数学模型的发展,以便更好地理解和预测病原体行为以及包括疟疾在内的传染病流行病学。然而,有助于识别这些联系和模型的数据集非常庞大,不适合进行简单、还原论和单变量分析。这些数据集需要进行数据挖掘,以识别最能描述临床和分子观察结果的真正重要的测量值。此外,由于实验限制(特别是对于人体研究或体内动物实验),这些数据集通常样本相对较少,这使得数据挖掘极其困难。在此,我们首先简要概述了数据简化以及识别纳入数学模型的变量之间关系的常见策略,然后提出一种执行这些数据简化和关系推断任务的新通用策略。我们的方法强调在使用数据推动模型开发时稳健性的重要性,特别是在使用全基因组规模、小样本体内数据时。我们确定,结合数据排列和子采样策略使用适当的特征约简对于利用高维、低观测数据进行网络推断从而获得日益稳健的结果至关重要。

相似文献

1
From genome-scale data to models of infectious disease: A Bayesian network-based strategy to drive model development.从基因组规模数据到传染病模型:一种基于贝叶斯网络推动模型开发的策略。
Math Biosci. 2015 Dec;270(Pt B):156-68. doi: 10.1016/j.mbs.2015.06.006. Epub 2015 Jun 17.
2
A tree-like Bayesian structure learning algorithm for small-sample datasets from complex biological model systems.一种用于来自复杂生物模型系统的小样本数据集的树状贝叶斯结构学习算法。
BMC Syst Biol. 2015 Aug 28;9:49. doi: 10.1186/s12918-015-0194-7.
3
[Development of antituberculous drugs: current status and future prospects].[抗结核药物的研发:现状与未来前景]
Kekkaku. 2006 Dec;81(12):753-74.
4
Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.美国东部地区遥感气溶胶光学厚度与PM2.5之间关系的评估及统计建模
Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.
5
Network inference from multimodal data: A review of approaches from infectious disease transmission.基于多模态数据的网络推断:传染病传播方法综述
J Biomed Inform. 2016 Dec;64:44-54. doi: 10.1016/j.jbi.2016.09.004. Epub 2016 Sep 6.
6
Social network analysis of wild chimpanzees provides insights for predicting infectious disease risk.野生黑猩猩的社会网络分析为预测传染病风险提供了新视角。
J Anim Ecol. 2013 Sep;82(5):976-86. doi: 10.1111/1365-2656.12088. Epub 2013 Jun 4.
7
H-CORE: enabling genome-scale Bayesian analysis of biological systems without prior knowledge.H-CORE:无需先验知识即可实现生物系统的全基因组规模贝叶斯分析。
Biosystems. 2007 Jul-Aug;90(1):197-210. doi: 10.1016/j.biosystems.2006.08.004. Epub 2006 Aug 22.
8
Bayesian model choice for epidemic models with two levels of mixing.两层混合的传染病模型的贝叶斯模型选择。
Biostatistics. 2014 Jan;15(1):46-59. doi: 10.1093/biostatistics/kxt023. Epub 2013 Jul 24.
9
Supervised learning and prediction of spatial epidemics.空间流行病的监督学习与预测
Spat Spatiotemporal Epidemiol. 2014 Oct;11:59-77. doi: 10.1016/j.sste.2014.08.003. Epub 2014 Sep 16.
10
[Meta-analysis of the Italian studies on short-term effects of air pollution].[意大利关于空气污染短期影响研究的荟萃分析]
Epidemiol Prev. 2001 Mar-Apr;25(2 Suppl):1-71.

引用本文的文献

1
CoVar: A generalizable machine learning approach to identify the coordinated regulators driving variational gene expression.CoVar:一种可推广的机器学习方法,用于识别驱动变异基因表达的协调调控因子。
PLoS Comput Biol. 2024 Apr 17;20(4):e1012016. doi: 10.1371/journal.pcbi.1012016. eCollection 2024 Apr.
2
New Algorithm and Software (BNOmics) for Inferring and Visualizing Bayesian Networks from Heterogeneous Big Biological and Genetic Data.用于从异构生物大数据和遗传数据推断和可视化贝叶斯网络的新算法与软件(BNOmics)
J Comput Biol. 2017 Apr;24(4):340-356. doi: 10.1089/cmb.2016.0100. Epub 2016 Sep 28.
3
From within host dynamics to the epidemiology of infectious disease: Scientific overview and challenges.从宿主内部动态到传染病流行病学:科学概述与挑战
Math Biosci. 2015 Dec;270(Pt B):143-55. doi: 10.1016/j.mbs.2015.10.002. Epub 2015 Oct 16.
4
A tree-like Bayesian structure learning algorithm for small-sample datasets from complex biological model systems.一种用于来自复杂生物模型系统的小样本数据集的树状贝叶斯结构学习算法。
BMC Syst Biol. 2015 Aug 28;9:49. doi: 10.1186/s12918-015-0194-7.

本文引用的文献

1
Comparative transcriptomics and metabolomics in a rhesus macaque drug administration study.在恒河猴给药研究中进行比较转录组学和代谢组学分析。
Front Cell Dev Biol. 2014 Oct 8;2:54. doi: 10.3389/fcell.2014.00054. eCollection 2014.
2
A molecular marker of artemisinin-resistant Plasmodium falciparum malaria.青蒿素耐药恶性疟原虫的一个分子标记。
Nature. 2014 Jan 2;505(7481):50-5. doi: 10.1038/nature12876. Epub 2013 Dec 18.
3
A network integration approach to predict conserved regulators related to pathogenicity of influenza and SARS-CoV respiratory viruses.一种网络整合方法,用于预测与流感和 SARS-CoV 呼吸道病毒致病性相关的保守调控因子。
PLoS One. 2013 Jul 25;8(7):e69374. doi: 10.1371/journal.pone.0069374. Print 2013.
4
Multiple populations of artemisinin-resistant Plasmodium falciparum in Cambodia.柬埔寨存在多种青蒿素耐药恶性疟原虫种群。
Nat Genet. 2013 Jun;45(6):648-55. doi: 10.1038/ng.2624. Epub 2013 Apr 28.
5
Human infections and detection of Plasmodium knowlesi.人类感染 knowlesi 疟原虫的情况及检测。
Clin Microbiol Rev. 2013 Apr;26(2):165-84. doi: 10.1128/CMR.00079-12.
6
A multi-omic systems approach to elucidating Yersinia virulence mechanisms.一种用于阐明耶尔森氏菌毒力机制的多组学系统方法。
Mol Biosyst. 2013 Jan 27;9(1):44-54. doi: 10.1039/c2mb25287b. Epub 2012 Nov 13.
7
Wisdom of crowds for robust gene network inference.群体智慧在稳健基因网络推断中的应用。
Nat Methods. 2012 Jul 15;9(8):796-804. doi: 10.1038/nmeth.2016.
8
A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information.基于人工对比变量和互信息的支持向量机递归特征消除特征选择方法。
J Chromatogr B Analyt Technol Biomed Life Sci. 2012 Dec 1;910:149-55. doi: 10.1016/j.jchromb.2012.05.020. Epub 2012 May 24.
9
Some strains of Plasmodium falciparum, a human malaria parasite, evade the complement-like system of Anopheles gambiae mosquitoes.某些恶性疟原虫(一种人类疟疾寄生虫)株逃避冈比亚按蚊的补体样系统。
Proc Natl Acad Sci U S A. 2012 Jul 10;109(28):E1957-62. doi: 10.1073/pnas.1121183109. Epub 2012 May 23.
10
Systems analysis of multiple regulator perturbations allows discovery of virulence factors in Salmonella.对多种调控因子扰动进行系统分析有助于发现沙门氏菌的毒力因子。
BMC Syst Biol. 2011 Jun 28;5:100. doi: 10.1186/1752-0509-5-100.