• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用合成文本在精神卫生保健领域助力人工智能的潜力。

Leveraging the potential of synthetic text for AI in mental healthcare.

作者信息

Ive Julia

机构信息

School of Electronic Engineering and Computer Science, Queen Mary University of London, London, UK.

出版信息

Front Digit Health. 2022 Oct 24;4:1010202. doi: 10.3389/fdgth.2022.1010202. eCollection 2022.

DOI:10.3389/fdgth.2022.1010202
PMID:36352890
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9637610/
Abstract

In today's world it seems fair to say that extensive digital data sharing is the price we pay for the technological advances we have seen achieved as a result of AI systems analysing large quantities of data in a relatively short time. Where such AI is used in the realm of mental health, this data sharing poses additional challenges not just due to the sensitive nature of the data itself but also the potential vulnerability of the data donors themselves should there be a cybersecurity data breach. To address the problem, the AI community proposes to use synthetic text preserving only the salient properties of the original. Such text has potential to fill gaps in the textual data availability (e.g., rare conditions or under-represented groups) while reducing exposure. Our perspective piece is aimed to demystify the process of generating synthetic text, explain its algorithmic and ethical challenges, especially for the mental health domain, as well as most promising ways of overcoming them. We aim to promote better understanding and as a result acceptability of synthetic text outside the research community.

摘要

在当今世界,可以说广泛的数字数据共享是我们为人工智能系统在相对较短的时间内分析大量数据所取得的技术进步而付出的代价。在心理健康领域使用此类人工智能时,这种数据共享不仅会带来额外的挑战,这不仅是因为数据本身的敏感性质,还因为如果发生网络安全数据泄露,数据提供者自身可能会面临风险。为了解决这个问题,人工智能社区提议使用仅保留原始文本显著特征的合成文本。这样的文本有可能填补文本数据可用性方面的空白(例如,罕见病症或代表性不足的群体),同时减少暴露风险。我们的观点文章旨在揭开合成文本生成过程的神秘面纱,解释其算法和伦理挑战,特别是针对心理健康领域的挑战,以及克服这些挑战的最有前景的方法。我们旨在促进更好的理解,并因此提高研究界之外对合成文本的接受度。

相似文献

1
Leveraging the potential of synthetic text for AI in mental healthcare.利用合成文本在精神卫生保健领域助力人工智能的潜力。
Front Digit Health. 2022 Oct 24;4:1010202. doi: 10.3389/fdgth.2022.1010202. eCollection 2022.
2
DataSifterText: Partially Synthetic Text Generation for Sensitive Clinical Notes.DataSifterText:用于敏感临床记录的部分合成文本生成。
J Med Syst. 2022 Nov 16;46(12):96. doi: 10.1007/s10916-022-01880-6.
3
ARTIFICIAL INTELLIGENCE IN MEDICAL PRACTICE: REGULATIVE ISSUES AND PERSPECTIVES.人工智能在医学实践中的应用:监管问题与展望。
Wiad Lek. 2020;73(12 cz 2):2722-2727.
4
Generation and evaluation of artificial mental health records for Natural Language Processing.用于自然语言处理的人工心理健康记录的生成与评估
NPJ Digit Med. 2020 May 14;3:69. doi: 10.1038/s41746-020-0267-x. eCollection 2020.
5
Man vs the machine in the struggle for effective text anonymisation in the age of large language models.在大语言模型时代,为实现有效的文本匿名化,人与机器展开激烈竞争。
Sci Rep. 2023 Sep 25;13(1):16026. doi: 10.1038/s41598-023-42977-3.
6
Value attributed to text-based archives generated by artificial intelligence.人工智能生成的基于文本的档案的价值归属。
R Soc Open Sci. 2023 Feb 8;10(2):220915. doi: 10.1098/rsos.220915. eCollection 2023 Feb.
7
Efficacy of an mHealth Intervention (BRAVE) to Promote Mental Wellness for American Indian and Alaska Native Teenagers and Young Adults: Randomized Controlled Trial.一项移动健康干预措施(BRAVE)对促进美国印第安人和阿拉斯加原住民青少年及年轻人心理健康的效果:随机对照试验。
JMIR Ment Health. 2021 Sep 15;8(9):e26158. doi: 10.2196/26158.
8
Patient Data-Sharing for AI: Ethical Challenges, Catholic Solutions.用于人工智能的患者数据共享:伦理挑战与天主教解决方案
Linacre Q. 2020 Nov;87(4):471-481. doi: 10.1177/0024363920922690. Epub 2020 May 15.
9
Ethical Development of Digital Phenotyping Tools for Mental Health Applications: Delphi Study.数字表型工具在精神健康应用中的伦理发展:德尔菲研究。
JMIR Mhealth Uhealth. 2021 Jul 28;9(7):e27343. doi: 10.2196/27343.
10
Application Scenarios for Artificial Intelligence in Nursing Care: Rapid Review.人工智能在护理中的应用场景:快速综述。
J Med Internet Res. 2021 Nov 29;23(11):e26522. doi: 10.2196/26522.

引用本文的文献

1
A review on generative AI models for synthetic medical text, time series, and longitudinal data.关于用于合成医学文本、时间序列和纵向数据的生成式人工智能模型的综述。
NPJ Digit Med. 2025 May 15;8(1):281. doi: 10.1038/s41746-024-01409-w.
2
Open Science at the generative AI turn: An exploratory analysis of challenges and opportunities.生成式人工智能时代的开放科学:挑战与机遇的探索性分析。
Quant Sci Stud. 2025;6:22-45. doi: 10.1162/qss_a_00337. Epub 2025 Jan 27.
3
Synthetic data as an investigative tool in hypertension and renal diseases research.合成数据作为高血压和肾脏疾病研究中的一种调查工具。
World J Methodol. 2025 Mar 20;15(1):98626. doi: 10.5662/wjm.v15.i1.98626.
4
Embracing the uncertainty in human-machine collaboration to support clinical decision-making for mental health conditions.接纳人机协作中的不确定性以支持针对心理健康状况的临床决策。
Front Digit Health. 2023 Sep 5;5:1188338. doi: 10.3389/fdgth.2023.1188338. eCollection 2023.

本文引用的文献

1
Shifting machine learning for healthcare from development to deployment and from models to data.将医疗保健领域的机器学习从开发转移到部署,从模型转移到数据。
Nat Biomed Eng. 2022 Dec;6(12):1330-1345. doi: 10.1038/s41551-022-00898-y. Epub 2022 Jul 4.
2
A systematic review of automatic text summarization for biomedical literature and EHRs.生物医学文献和电子健康记录的自动文本摘要的系统评价。
J Am Med Inform Assoc. 2021 Sep 18;28(10):2287-2297. doi: 10.1093/jamia/ocab143.
3
Are synthetic clinical notes useful for real natural language processing tasks: A case study on clinical entity recognition.用于真实自然语言处理任务的合成临床笔记是否有用:以临床实体识别为例的研究
J Am Med Inform Assoc. 2021 Sep 18;28(10):2193-2201. doi: 10.1093/jamia/ocab112.
4
Synthetic data in machine learning for medicine and healthcare.机器学习在医学和医疗保健领域中的合成数据。
Nat Biomed Eng. 2021 Jun;5(6):493-497. doi: 10.1038/s41551-021-00751-8.
5
Robust suicide risk assessment on social media via deep adversarial learning.基于深度对抗学习的社交媒体上稳健的自杀风险评估。
J Am Med Inform Assoc. 2021 Jul 14;28(7):1497-1506. doi: 10.1093/jamia/ocab031.
6
Generation and evaluation of artificial mental health records for Natural Language Processing.用于自然语言处理的人工心理健康记录的生成与评估
NPJ Digit Med. 2020 May 14;3:69. doi: 10.1038/s41746-020-0267-x. eCollection 2020.
7
Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus.用于去识别化的纵向临床记录标注:2014年i2b2/德克萨斯大学健康科学中心语料库
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S20-S29. doi: 10.1016/j.jbi.2015.07.020. Epub 2015 Aug 28.