• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

合成数据能否在慢性荨麻疹研究中减少样本量?

Can Synthetic Data Allow for Smaller Sample Sizes in Chronic Urticaria Research?

作者信息

Gutsche Annika, Salameh Pascale, Jahandideh Samad S, Roodsaz Mehran, Kutan Serkan, Salehzadeh-Yazdi Ali, Kocatürk Emek, Gregoriou Stamatios, Thomsen Simon F, Kulthanan Kanokvalai, Tuchinda Papapit, Dissemond Joachim, Kasperska-Zajac Alicja, Zajac Magdalena, Zamłyński Mateusz, van Doorn Martijn, Parisi Claudio A S, Peter Jonny G, Day Cascia, McDougall Cathryn, Makris Michael, Fomina Daria, Kovalkova Elena, Streliaev Nikolai, Andrenova Gerelma, Lebedkina Marina, Khoskhkui Maryam, Aliabadi Mehraneh M, Bauer Andrea, Kiefer Lea, Muñoz Melba, Weller Karsten, Kolkhir Pavel, Metz Martin

机构信息

Institute of Allergology, Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany.

Fraunhofer Institute for Translational Medicine and Pharmacology ITMP, Immunology and Allergology, Berlin, Germany.

出版信息

Clin Transl Allergy. 2025 Aug;15(8):e70087. doi: 10.1002/clt2.70087.

DOI:10.1002/clt2.70087
PMID:40771049
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12329239/
Abstract

BACKGROUND

Robust data are essential for clinical and epidemiological research, yet in chronic spontaneous urticaria (CSU), certain patient groups, such as the elderly or comorbid patients, are often underrepresented. In clinical trials, strict inclusion and exclusion criteria frequently limit recruitment, making it difficult to achieve sufficient statistical power. Similarly, real-world observational studies may lack sufficient sample sizes for robust analysis. To address these limitations, we generated synthetic patient data that reflect these groups' clinical characteristics and variability. This approach enables more comprehensive analyses, facilitates hypothesis testing in otherwise inaccessible populations, and supports the generation of evidence where traditional data sources are insufficient.

METHODS

A tree-based decision model was applied to generate synthetic data based on an existing set of real-world data (RWD) from the Chronic Urticaria Registry (CURE). Descriptive characteristics and association strength between relevant RWD variables and their synthetic counterparts were analyzed as indicators of replication accuracy, providing insight into how closely the synthetic data aligns with the RWD. Finally, we determined the minimum sample size required to generate high-quality synthetic data.

RESULTS

The algorithm produced extensive synthetic data records, closely mirroring patient demographics and disease clinical characteristics. Smaller subgroups of the data were equally replicated and followed the same distribution as RWD. Known associations and correlations between disease-specific factors (disease control) and risk factors (age) yielded similar results, with no significant difference (p > 0.05). The lowest threshold at which synthetic data could be generated while maintaining high accuracy in RWD was identified to be 25%, enabling a fourfold increase in the synthetic population.

CONCLUSION

Synthetic data could replicate RWD with reasonable accuracy for patients with CSU down to 25% of the original population size. This method has the potential to extend small patient subgroups in clinical and epidemiological research.

摘要

背景

可靠的数据对于临床和流行病学研究至关重要,但在慢性自发性荨麻疹(CSU)中,某些患者群体,如老年人或合并症患者,在研究中往往代表性不足。在临床试验中,严格的纳入和排除标准常常限制了招募,难以获得足够的统计效力。同样,真实世界的观察性研究可能缺乏足够的样本量进行有力分析。为解决这些局限性,我们生成了反映这些群体临床特征和变异性的合成患者数据。这种方法能够进行更全面的分析,便于在其他难以触及的人群中进行假设检验,并在传统数据来源不足时支持证据的生成。

方法

应用基于树的决策模型,根据慢性荨麻疹登记处(CURE)现有的一组真实世界数据(RWD)生成合成数据。分析相关RWD变量与其合成对应变量之间的描述性特征和关联强度,作为复制准确性的指标,以深入了解合成数据与RWD的匹配程度。最后,我们确定了生成高质量合成数据所需的最小样本量。

结果

该算法生成了大量的合成数据记录,紧密反映了患者人口统计学和疾病临床特征。数据的较小亚组也得到了同等复制,并遵循与RWD相同的分布。疾病特异性因素(疾病控制)和危险因素(年龄)之间已知的关联和相关性产生了相似的结果,无显著差异(p>0.05)。在保持RWD高精度的同时能够生成合成数据的最低阈值被确定为25%,这使得合成人群增加了四倍。

结论

对于CSU患者,合成数据能够以合理的准确性复制RWD,最低可至原始人群规模的25%。这种方法有潜力在临床和流行病学研究中扩展小型患者亚组。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7604/12329239/1de4e5173fd6/CLT2-15-e70087-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7604/12329239/f2526ad29505/CLT2-15-e70087-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7604/12329239/d440c1c3f6f7/CLT2-15-e70087-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7604/12329239/1de4e5173fd6/CLT2-15-e70087-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7604/12329239/f2526ad29505/CLT2-15-e70087-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7604/12329239/d440c1c3f6f7/CLT2-15-e70087-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7604/12329239/1de4e5173fd6/CLT2-15-e70087-g002.jpg

相似文献

1
Can Synthetic Data Allow for Smaller Sample Sizes in Chronic Urticaria Research?合成数据能否在慢性荨麻疹研究中减少样本量?
Clin Transl Allergy. 2025 Aug;15(8):e70087. doi: 10.1002/clt2.70087.
2
Comparison of cellulose, modified cellulose and synthetic membranes in the haemodialysis of patients with end-stage renal disease.纤维素、改性纤维素和合成膜在终末期肾病患者血液透析中的比较。
Cochrane Database Syst Rev. 2001(3):CD003234. doi: 10.1002/14651858.CD003234.
3
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
6
Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.
7
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
8
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
9
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
10
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.

本文引用的文献

1
Chronic Spontaneous Urticaria: A Review.慢性自发性荨麻疹:综述
JAMA. 2024 Nov 5;332(17):1464-1477. doi: 10.1001/jama.2024.15568.
2
Synthetic data generation for a longitudinal cohort study - evaluation, method extension and reproduction of published data analysis results.纵向队列研究的合成数据生成 - 评估、方法扩展和已发表数据分析结果的再现。
Sci Rep. 2024 Jun 22;14(1):14412. doi: 10.1038/s41598-024-62102-2.
3
How AI is being used to accelerate clinical trials.人工智能如何被用于加速临床试验。
Nature. 2024 Mar;627(8003):S2-S5. doi: 10.1038/d41586-024-00753-x.
4
Harnessing the power of synthetic data in healthcare: innovation, application, and privacy.利用合成数据在医疗保健领域的力量:创新、应用与隐私。
NPJ Digit Med. 2023 Oct 9;6(1):186. doi: 10.1038/s41746-023-00927-3.
5
A method for generating synthetic longitudinal health data.一种生成合成纵向健康数据的方法。
BMC Med Res Methodol. 2023 Mar 23;23(1):67. doi: 10.1186/s12874-023-01869-w.
6
Synthetic data in medical research.医学研究中的合成数据。
BMJ Med. 2022 Sep 26;1(1):e000167. doi: 10.1136/bmjmed-2022-000167. eCollection 2022.
7
Synthetic data in health care: A narrative review.医疗保健中的合成数据:一篇叙述性综述。
PLOS Digit Health. 2023 Jan 6;2(1):e0000082. doi: 10.1371/journal.pdig.0000082. eCollection 2023 Jan.
8
Chronic Urticaria in Older Adults: Treatment Considerations.老年慢性荨麻疹:治疗考量
Drugs Aging. 2023 Mar;40(3):165-177. doi: 10.1007/s40266-023-01010-y. Epub 2023 Feb 18.
9
Comorbidities of Chronic Urticaria: A glimpse into a complex relationship.慢性荨麻疹的合并症:洞察复杂关系
Front Allergy. 2022 Nov 17;3:1008145. doi: 10.3389/falgy.2022.1008145. eCollection 2022.
10
Real-world data: a brief review of the methods, applications, challenges and opportunities.真实世界数据:方法、应用、挑战和机遇的简要回顾。
BMC Med Res Methodol. 2022 Nov 5;22(1):287. doi: 10.1186/s12874-022-01768-6.