• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
AI-driven synthetic data generation for accelerating hepatology research: A study of the United Network for Organ Sharing (UNOS) database.人工智能驱动的合成数据生成以加速肝病学研究:器官共享联合网络(UNOS)数据库研究
Hepatology. 2025 Mar 11. doi: 10.1097/HEP.0000000000001299.
2
Synthetic neurosurgical data generation with generative adversarial networks and large language models:an investigation on fidelity, utility, and privacy.使用生成对抗网络和大语言模型生成合成神经外科数据:关于保真度、实用性和隐私性的调查
Neurosurg Focus. 2025 Jul 1;59(1):E17. doi: 10.3171/2025.4.FOCUS25225.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
ASAS-NANP symposium: mathematical modeling in animal nutrition: synthetic database generation for non-normal multivariate distributions: a rank-based method with application to ruminant methane emissions.美国动物科学学会-北美猪营养大会研讨会:动物营养中的数学建模:非正态多元分布的综合数据库生成:一种基于秩的方法及其在反刍动物甲烷排放中的应用
J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf136.
5
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
6
Generation of Fundus Fluorescein Angiography Videos for Health Care Data Sharing.用于医疗保健数据共享的眼底荧光血管造影视频生成
JAMA Ophthalmol. 2025 Jun 26. doi: 10.1001/jamaophthalmol.2025.1419.
7
Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.
8
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
9
The Costs of Anonymization: Case Study Using Clinical Data.匿名化的成本:使用临床数据的案例研究
J Med Internet Res. 2024 Apr 24;26:e49445. doi: 10.2196/49445.
10
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

引用本文的文献

1
Future directions in prognostic modeling for dengue-induced severe hepatitis.登革热所致严重肝炎预后模型的未来发展方向。
World J Hepatol. 2025 Jun 27;17(6):107299. doi: 10.4254/wjh.v17.i6.107299.

本文引用的文献

1
Synthetic data for privacy-preserving clinical risk prediction.用于保护隐私的临床风险预测的合成数据。
Sci Rep. 2024 Oct 27;14(1):25676. doi: 10.1038/s41598-024-72894-y.
2
FDA Perspective on the Regulation of Artificial Intelligence in Health Care and Biomedicine.美国食品药品监督管理局对医疗保健和生物医学领域人工智能监管的看法。
JAMA. 2025 Jan 21;333(3):241-247. doi: 10.1001/jama.2024.21451.
3
Reliable generation of privacy-preserving synthetic electronic health record time series via diffusion models.通过扩散模型可靠地生成隐私保护的合成电子健康记录时间序列。
J Am Med Inform Assoc. 2024 Nov 1;31(11):2529-2539. doi: 10.1093/jamia/ocae229.
4
Generative models improve fairness of medical classifiers under distribution shifts.生成式模型可提高分布偏移下医学分类器的公平性。
Nat Med. 2024 Apr;30(4):1166-1173. doi: 10.1038/s41591-024-02838-6. Epub 2024 Apr 10.
5
Digital twins for health: a scoping review.用于健康的数字孪生:一项范围综述。
NPJ Digit Med. 2024 Mar 22;7(1):77. doi: 10.1038/s41746-024-01073-0.
6
Mimicking clinical trials with synthetic acute myeloid leukemia patients using generative artificial intelligence.利用生成式人工智能模拟急性髓系白血病患者的临床试验。
NPJ Digit Med. 2024 Mar 20;7(1):76. doi: 10.1038/s41746-024-01076-x.
7
Development and evaluation of objective trial performance metrics for multisite clinical studies: Experience from the AlcHep Network.多中心临床研究中客观试验性能指标的制定和评估:来自 AlcHep 网络的经验。
Contemp Clin Trials. 2024 Mar;138:107437. doi: 10.1016/j.cct.2024.107437. Epub 2024 Jan 11.
8
Harnessing the power of synthetic data in healthcare: innovation, application, and privacy.利用合成数据在医疗保健领域的力量:创新、应用与隐私。
NPJ Digit Med. 2023 Oct 9;6(1):186. doi: 10.1038/s41746-023-00927-3.
9
Synthetic Data Generation by Artificial Intelligence to Accelerate Research and Precision Medicine in Hematology.人工智能生成合成数据以加速血液学研究和精准医学
JCO Clin Cancer Inform. 2023 Jun;7:e2300021. doi: 10.1200/CCI.23.00021.
10
Actionable Solutions to Achieve Health Equity in Chronic Liver Disease.实现慢性肝脏疾病健康公平的可行解决方案。
Clin Gastroenterol Hepatol. 2023 Jul;21(8):1992-2000. doi: 10.1016/j.cgh.2023.03.043. Epub 2023 Apr 13.

人工智能驱动的合成数据生成以加速肝病学研究:器官共享联合网络(UNOS)数据库研究

AI-driven synthetic data generation for accelerating hepatology research: A study of the United Network for Organ Sharing (UNOS) database.

作者信息

Ahn Joseph C, Noh Yung-Kyun, Hu Mingzhao, Shen Xiaotong, Simonetto Douglas A, Kamath Patrick S, Loomba Rohit, Shah Vijay H

机构信息

Division of Gastroenterology and Hepatology, Department of Internal Medicine, Mayo Clinic, Rochester, Minnesota, USA.

Department of Computer Science, Hanyang University, Seoul, South Korea.

出版信息

Hepatology. 2025 Mar 11. doi: 10.1097/HEP.0000000000001299.

DOI:10.1097/HEP.0000000000001299
PMID:40067682
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12353439/
Abstract

BACKGROUND AND AIMS

Clinical hepatology research often faces limited data availability, underrepresentation of minority groups, and complex data-sharing regulations. Synthetic data-artificially generated patient records designed to mirror real-world distributions-offers a potential solution. We hypothesized that diffusion models, a state-of-the-art generative technique, could produce synthetic liver transplant waitlist data from the United Network for Organ Sharing database that maintains statistical fidelity, replicates clinical correlations and survival patterns, and ensures robust privacy protection.

APPROACH AND RESULTS

Diffusion models were used to generate synthetic patient cohorts mirroring the United Network for Organ Sharing liver transplant waitlist database between the years 2019 and 2023. Statistical fidelity was assessed using maximum mean discrepancy (MMD) and Wasserstein distance, correlation analysis, and variable-level metrics. Clinical utility was evaluated by comparing transplant-free survival via Kaplan-Meier curves and the MELD score performance. Privacy was quantified using the Distance to Closest Record (DCR) and attribute disclosure risk assessments.The synthetic dataset was nearly indistinguishable from the original dataset (MMD=0.002, standardized Wasserstein distance <1.0), preserving clinically relevant correlations and survival patterns as evidenced by similar median survival times (110 vs. 101 days) and 5-year survival rates (22.2% vs. 22.8%). MELD-based 90-day mortality prediction was maintained (original AUC=0.839 vs. synthetic AUC=0.844). Privacy metrics indicated no identifiable patient matches, and mean DCR values ensured that synthetic individuals were not direct replicas of real patients.

CONCLUSION

Artificial intelligence-generated synthetic data derived from diffusion models can faithfully replicate complex hepatology datasets, maintain key clinical signals, and ensure strong privacy safeguards. This approach can help address data scarcity, enhance model generalizability, foster multi-institutional collaboration, and accelerate progress in hepatology research.

摘要

背景与目的

临床肝脏病学研究常常面临数据可得性有限、少数群体代表性不足以及复杂的数据共享规定等问题。合成数据——旨在反映真实世界分布的人工生成的患者记录——提供了一种潜在的解决方案。我们假设,扩散模型作为一种先进的生成技术,可以从器官共享联合网络数据库中生成合成肝移植等待名单数据,该数据保持统计保真度、复制临床相关性和生存模式,并确保强大的隐私保护。

方法与结果

使用扩散模型生成了反映2019年至2023年器官共享联合网络肝移植等待名单数据库的合成患者队列。使用最大均值差异(MMD)和瓦瑟斯坦距离、相关性分析以及变量水平指标评估统计保真度。通过卡普兰-迈耶曲线比较无移植生存率和终末期肝病模型(MELD)评分表现来评估临床效用。使用到最近记录的距离(DCR)和属性披露风险评估来量化隐私。合成数据集与原始数据集几乎无法区分(MMD = 0.002,标准化瓦瑟斯坦距离<1.0),保留了临床相关的相关性和生存模式,相似的中位生存时间(110天对101天)和5年生存率(22.2%对22.8%)证明了这一点。基于MELD的90天死亡率预测得以维持(原始曲线下面积[AUC]=0.839对合成AUC = 0.844)。隐私指标表明没有可识别的患者匹配,平均DCR值确保合成个体不是真实患者的直接复制品。

结论

源自扩散模型的人工智能生成的合成数据可以忠实地复制复杂的肝脏病学数据集,保持关键临床信号,并确保强大的隐私保护。这种方法有助于解决数据稀缺问题,提高模型的通用性,促进多机构合作,并加速肝脏病学研究的进展。