• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将大数据转化为适用于营养与肥胖研究的人工智能可用数据。

Transforming Big Data into AI-ready data for nutrition and obesity research.

作者信息

Thomas Diana M, Knight Rob, Gilbert Jack A, Cornelis Marilyn C, Gantz Marie G, Burdekin Kate, Cummiskey Kevin, Sumner Susan C J, Pathmasiri Wimal, Sazonov Edward, Gabriel Kelley Pettee, Dooley Erin E, Green Mark A, Pfluger Andrew, Kleinberg Samantha

机构信息

Department of Mathematical Sciences, United States Military Academy, West Point, New York, USA.

Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, California, USA.

出版信息

Obesity (Silver Spring). 2024 May;32(5):857-870. doi: 10.1002/oby.23989. Epub 2024 Mar 1.

DOI:10.1002/oby.23989
PMID:38426232
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11180473/
Abstract

OBJECTIVE

Big Data are increasingly used in obesity and nutrition research to gain new insights and derive personalized guidance; however, this data in raw form are often not usable. Substantial preprocessing, which requires machine learning (ML), human judgment, and specialized software, is required to transform Big Data into artificial intelligence (AI)- and ML-ready data. These preprocessing steps are the most complex part of the entire modeling pipeline. Understanding the complexity of these steps by the end user is critical for reducing misunderstanding, faulty interpretation, and erroneous downstream conclusions.

METHODS

We reviewed three popular obesity/nutrition Big Data sources: microbiome, metabolomics, and accelerometry. The preprocessing pipelines, specialized software, challenges, and how decisions impact final AI- and ML-ready products were detailed.

RESULTS

Opportunities for advances to improve quality control, speed of preprocessing, and intelligent end user consumption were presented.

CONCLUSIONS

Big Data have the exciting potential for identifying new modifiable factors that impact obesity research. However, to ensure accurate interpretation of conclusions arising from Big Data, the choices involved in preparing AI- and ML-ready data need to be transparent to investigators and clinicians relying on the conclusions.

摘要

目的

大数据在肥胖与营养研究中的应用日益广泛,以获取新的见解并得出个性化指导;然而,原始形式的数据往往无法直接使用。需要大量的预处理工作,包括机器学习(ML)、人工判断和专用软件,才能将大数据转化为适用于人工智能(AI)和机器学习的数据。这些预处理步骤是整个建模流程中最复杂的部分。终端用户了解这些步骤的复杂性对于减少误解、错误解读和错误的下游结论至关重要。

方法

我们回顾了三种流行的肥胖/营养大数据来源:微生物组学、代谢组学和加速度测量法。详细介绍了预处理流程、专用软件、挑战以及决策如何影响最终适用于人工智能和机器学习的产品。

结果

提出了改进质量控制、提高预处理速度和实现智能终端用户应用的进展机会。

结论

大数据在识别影响肥胖研究的新的可改变因素方面具有令人兴奋的潜力。然而,为确保对大数据得出的结论进行准确解读,准备适用于人工智能和机器学习的数据时所涉及的选择需要对依赖这些结论的研究人员和临床医生保持透明。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1124/11180473/9de3e0d8cc1d/nihms-1956812-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1124/11180473/678610978ccd/nihms-1956812-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1124/11180473/1d840b516c53/nihms-1956812-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1124/11180473/6fb84afafe45/nihms-1956812-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1124/11180473/9de3e0d8cc1d/nihms-1956812-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1124/11180473/678610978ccd/nihms-1956812-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1124/11180473/1d840b516c53/nihms-1956812-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1124/11180473/6fb84afafe45/nihms-1956812-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1124/11180473/9de3e0d8cc1d/nihms-1956812-f0004.jpg

相似文献

1
Transforming Big Data into AI-ready data for nutrition and obesity research.将大数据转化为适用于营养与肥胖研究的人工智能可用数据。
Obesity (Silver Spring). 2024 May;32(5):857-870. doi: 10.1002/oby.23989. Epub 2024 Mar 1.
2
The Use of Machine Learning for Analyzing Real-World Data in Disease Prediction and Management: Systematic Review.机器学习在疾病预测与管理中分析真实世界数据的应用:系统评价
JMIR Med Inform. 2025 Jun 19;13:e68898. doi: 10.2196/68898.
3
Gaps in Artificial Intelligence Research for Rural Health in the United States: A Scoping Review.美国农村卫生人工智能研究的差距:一项范围综述
medRxiv. 2025 Jun 27:2025.06.26.25330361. doi: 10.1101/2025.06.26.25330361.
4
Research status, hotspots and perspectives of artificial intelligence applied to pain management: a bibliometric and visual analysis.人工智能应用于疼痛管理的研究现状、热点与展望:一项文献计量学与可视化分析
Updates Surg. 2025 Jun 28. doi: 10.1007/s13304-025-02296-w.
5
Regulatory Insights From 27 Years of Artificial Intelligence/Machine Learning-Enabled Medical Device Recalls in the United States: Implications for Future Governance.美国27年人工智能/机器学习驱动的医疗器械召回监管洞察:对未来治理的启示
JMIR Med Inform. 2025 Jul 11;13:e67552. doi: 10.2196/67552.
6
Assessing the Emergence and Evolution of Artificial Intelligence and Machine Learning Research in Neuroradiology.评估神经放射学中人工智能和机器学习研究的出现和发展。
AJNR Am J Neuroradiol. 2024 Sep 9;45(9):1269-1275. doi: 10.3174/ajnr.A8252.
7
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
8
Effectiveness and cost-effectiveness of computer and other electronic aids for smoking cessation: a systematic review and network meta-analysis.计算机和其他电子戒烟辅助手段的有效性和成本效益:系统评价和网络荟萃分析。
Health Technol Assess. 2012;16(38):1-205, iii-v. doi: 10.3310/hta16380.
9
Exploring the Applications of Explainability in Wearable Data Analytics: Systematic Literature Review.探索可解释性在可穿戴数据分析中的应用:系统文献综述
J Med Internet Res. 2024 Dec 24;26:e53863. doi: 10.2196/53863.
10
Artificial intelligence for detecting keratoconus.人工智能在圆锥角膜检测中的应用。
Cochrane Database Syst Rev. 2023 Nov 15;11(11):CD014911. doi: 10.1002/14651858.CD014911.pub2.

引用本文的文献

1
Bridging the gap in obesity research: A consensus statement from the European Society for Clinical Investigation.弥合肥胖研究差距:欧洲临床研究学会的共识声明
Eur J Clin Invest. 2025 Aug;55(8):e70059. doi: 10.1111/eci.70059. Epub 2025 May 15.
2
Succinic Acid Improves the Metabolism of High-Fat Diet-Induced Mice and Promotes White Adipose Browning.琥珀酸改善高脂饮食诱导小鼠的代谢并促进白色脂肪棕色化。
Nutrients. 2024 Nov 8;16(22):3828. doi: 10.3390/nu16223828.
3
AI-readiness for Biomedical Data: Bridge2AI Recommendations.生物医学数据的人工智能准备情况:Bridge2AI 建议

本文引用的文献

1
Host-diet-gut microbiome interactions influence human energy balance: a randomized clinical trial.宿主-饮食-肠道微生物组相互作用影响人体能量平衡:一项随机临床试验。
Nat Commun. 2023 May 31;14(1):3161. doi: 10.1038/s41467-023-38778-x.
2
Large language models and the perils of their hallucinations.大语言模型及其幻觉的风险。
Crit Care. 2023 Mar 21;27(1):120. doi: 10.1186/s13054-023-04393-x.
3
Multi-omics microsampling for the profiling of lifestyle-associated changes in health.多组学生物标志物微采样分析与生活方式相关的健康变化特征。
bioRxiv. 2024 Nov 24:2024.10.23.619844. doi: 10.1101/2024.10.23.619844.
Nat Biomed Eng. 2024 Jan;8(1):11-29. doi: 10.1038/s41551-022-00999-8. Epub 2023 Jan 19.
4
The gut microbiota in obesity and weight management: microbes as friends or foe?肥胖与体重管理中的肠道微生物群:微生物是朋友还是敌人?
Nat Rev Endocrinol. 2023 May;19(5):258-271. doi: 10.1038/s41574-022-00794-0. Epub 2023 Jan 17.
5
Machine learning modeling practices to support the principles of AI and ethics in nutrition research.支持营养研究中人工智能和伦理原则的机器学习建模实践。
Nutr Diabetes. 2022 Dec 2;12(1):48. doi: 10.1038/s41387-022-00226-y.
6
NMR and Metabolomics-A Roadmap for the Future.核磁共振与代谢组学——未来路线图
Metabolites. 2022 Jul 23;12(8):678. doi: 10.3390/metabo12080678.
7
Memory-Efficient Searching of Gas-Chromatography Mass Spectra Accelerated by Prescreening.通过预筛选加速气相色谱质谱的内存高效搜索
Metabolites. 2022 May 29;12(6):491. doi: 10.3390/metabo12060491.
8
Phylogeny-Aware Analysis of Metagenome Community Ecology Based on Matched Reference Genomes while Bypassing Taxonomy.基于匹配参考基因组绕过分类学的宏基因组群落生态学的系统发育分析。
mSystems. 2022 Apr 26;7(2):e0016722. doi: 10.1128/msystems.00167-22. Epub 2022 Apr 4.
9
Swapping Metagenomics Preprocessing Pipeline Components Offers Speed and Sensitivity Increases.更换宏基因组预处理流水线组件可提高速度和灵敏度。
mSystems. 2022 Apr 26;7(2):e0137821. doi: 10.1128/msystems.01378-21. Epub 2022 Mar 16.
10
Nanopore sequencing technology, bioinformatics and applications.纳米孔测序技术、生物信息学及其应用。
Nat Biotechnol. 2021 Nov;39(11):1348-1365. doi: 10.1038/s41587-021-01108-x. Epub 2021 Nov 8.