Williams Michelle C, MacArthur Jacqueline A L, Forsyth Ross, Petersen Steffen E
British Heart Foundation Data Science Centre, Health Data Research UK, Gibbs Building, 215 Euston Road, London NW12BE, UK.
British Heart Foundation Centre for Research Excellence Centre for Cardiovascular Science, University of Edinburgh, Chancellors Building, 49 Little France Crescent, Edinburgh EH164TJ, UK.
Eur Heart J Imaging Methods Pract. 2025 Jan 24;3(1):qyaf012. doi: 10.1093/ehjimp/qyaf012. eCollection 2025 Jan.
Federated learning and the creation of synthetic data are emerging tools, which may enhance the use of imaging data in cardiovascular research. This study sought to understand the perspectives of cardiovascular imaging researchers on the potential benefits and challenges associated with these technologies.
The British Heart Foundation Data Science Centre conducted a series of online surveys and a virtual workshop to gather insights from stakeholders involved in cardiovascular imaging research about federated learning and synthetic data generation. The federated learning survey included 67 respondents: 18% ( = 12) were currently using federated learning, 4% ( = 3) had previously used it, 31% ( = 21) were planning to use it, and 46% ( = 31) were neither using nor planning to use it. Highlighted benefits included data privacy and enhanced collaboration, while challenges included data heterogeneity and technical complexity. The synthetic data survey had 22 respondents: 50% ( = 11) were currently using synthetic imaging data, 36% ( = 8) expressed interest in using it, and 14% ( = 3) thought it should not be used. Amongst the respondents, 50% had created synthetic imaging data and 45% had used it in cardiovascular research. Advantages cited included privacy preservation, increased dataset size and diversity, improved data access, and reduced administrative burden. Concerns included potential biases, trust issues, privacy concerns, and the fact that the images were not real and may have limited diversity or quality.
Federated learning and synthetic data offer opportunities for advancing cardiovascular imaging research by addressing data privacy concerns and expanding data availability. However, challenges must be addressed to realize their full potential.
联邦学习和合成数据的创建是新兴工具,可能会增强心血管研究中成像数据的使用。本研究旨在了解心血管成像研究人员对这些技术潜在益处和挑战的看法。
英国心脏基金会数据科学中心开展了一系列在线调查和一次虚拟研讨会,以收集参与心血管成像研究的利益相关者对联邦学习和合成数据生成的见解。联邦学习调查有67名受访者:18%(=12)目前正在使用联邦学习,4%(=3)以前使用过,31%(=21)计划使用,46%(=31)既未使用也不打算使用。突出的益处包括数据隐私和加强协作,而挑战包括数据异质性和技术复杂性。合成数据调查有22名受访者:50%(=11)目前正在使用合成成像数据,36%(=8)表示有兴趣使用,14%(=3)认为不应使用。在受访者中,50%创建过合成成像数据,45%在心血管研究中使用过。提到的优点包括隐私保护、数据集规模和多样性增加、数据获取改善以及行政负担减轻。担忧包括潜在偏差、信任问题、隐私问题,以及图像不是真实的且可能多样性或质量有限这一事实。
联邦学习和合成数据通过解决数据隐私问题和扩大数据可用性,为推进心血管成像研究提供了机会。然而,必须应对挑战才能充分发挥其潜力。