• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 GPT-3 构建社交媒体药物警戒中滥用药物同义词词典。

Using GPT-3 to Build a Lexicon of Drugs of Abuse Synonyms for Social Media Pharmacovigilance.

机构信息

Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA.

Departments of Bioengineering, Genetics, and Medicine, Stanford University, Stanford, CA 94305, USA.

出版信息

Biomolecules. 2023 Feb 18;13(2):387. doi: 10.3390/biom13020387.

DOI:10.3390/biom13020387
PMID:36830756
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9953178/
Abstract

Drug abuse is a serious problem in the United States, with over 90,000 drug overdose deaths nationally in 2020. A key step in combating drug abuse is detecting, monitoring, and characterizing its trends over time and location, also known as pharmacovigilance. While federal reporting systems accomplish this to a degree, they often have high latency and incomplete coverage. Social-media-based pharmacovigilance has zero latency, is easily accessible and unfiltered, and benefits from drug users being willing to share their experiences online pseudo-anonymously. However, unlike highly structured official data sources, social media text is rife with misspellings and slang, making automated analysis difficult. Generative Pretrained Transformer 3 (GPT-3) is a large autoregressive language model specialized for few-shot learning that was trained on text from the entire internet. We demonstrate that GPT-3 can be used to generate slang and common misspellings of terms for drugs of abuse. We repeatedly queried GPT-3 for synonyms of drugs of abuse and filtered the generated terms using automated Google searches and cross-references to known drug names. When generated terms for alprazolam were manually labeled, we found that our method produced 269 synonyms for alprazolam, 221 of which were new discoveries not included in an existing drug lexicon for social media. We repeated this process for 98 drugs of abuse, of which 22 are widely-discussed drugs of abuse, building a lexicon of colloquial drug synonyms that can be used for pharmacovigilance on social media.

摘要

药物滥用是美国的一个严重问题,2020 年全国有超过 90,000 例药物过量死亡。打击药物滥用的一个关键步骤是检测、监测和描述其随时间和地点的趋势,也称为药物警戒。虽然联邦报告系统在一定程度上实现了这一目标,但它们通常存在高延迟和不完整的覆盖范围。基于社交媒体的药物警戒具有零延迟、易于访问且不受过滤、并且受益于药物使用者愿意在网上匿名分享他们的经验。然而,与高度结构化的官方数据源不同,社交媒体文本充满了拼写错误和俚语,使得自动化分析变得困难。生成式预训练转换器 3(GPT-3)是一种专门用于少量学习的大型自回归语言模型,它是在整个互联网的文本上进行训练的。我们证明 GPT-3 可用于生成滥用药物的俚语和常见拼写错误。我们反复向 GPT-3 查询滥用药物的同义词,并使用自动谷歌搜索和交叉引用已知药物名称来过滤生成的术语。当对阿普唑仑生成的术语进行手动标记时,我们发现我们的方法生成了 269 个阿普唑仑的同义词,其中 221 个是新发现的,未包含在现有的社交媒体药物词汇中。我们对 98 种滥用药物重复了这个过程,其中 22 种是广泛讨论的滥用药物,构建了一个口语药物同义词词汇表,可用于社交媒体上的药物警戒。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/9a0c3d08320d/biomolecules-13-00387-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/e738c64da09a/biomolecules-13-00387-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/0239f25eac5d/biomolecules-13-00387-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/94567257bcba/biomolecules-13-00387-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/8fe0c9fe6ec3/biomolecules-13-00387-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/b080559ef84d/biomolecules-13-00387-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/d08d411a0fb5/biomolecules-13-00387-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/0b9f3a402d77/biomolecules-13-00387-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/14a791c8c22c/biomolecules-13-00387-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/9a0c3d08320d/biomolecules-13-00387-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/e738c64da09a/biomolecules-13-00387-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/0239f25eac5d/biomolecules-13-00387-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/94567257bcba/biomolecules-13-00387-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/8fe0c9fe6ec3/biomolecules-13-00387-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/b080559ef84d/biomolecules-13-00387-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/d08d411a0fb5/biomolecules-13-00387-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/0b9f3a402d77/biomolecules-13-00387-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/14a791c8c22c/biomolecules-13-00387-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e26/9953178/9a0c3d08320d/biomolecules-13-00387-g008.jpg

相似文献

1
Using GPT-3 to Build a Lexicon of Drugs of Abuse Synonyms for Social Media Pharmacovigilance.利用 GPT-3 构建社交媒体药物警戒中滥用药物同义词词典。
Biomolecules. 2023 Feb 18;13(2):387. doi: 10.3390/biom13020387.
2
RedMed: Extending drug lexicons for social media applications.红医:扩展社交媒体应用中的药物词汇。
J Biomed Inform. 2019 Nov;99:103307. doi: 10.1016/j.jbi.2019.103307. Epub 2019 Oct 15.
3
Leveraging social networks for toxicovigilance.利用社交网络进行毒理学监测。
J Med Toxicol. 2013 Jun;9(2):184-91. doi: 10.1007/s13181-013-0299-6.
4
BERT-based language model for accurate drug adverse event extraction from social media: implementation, evaluation, and contributions to pharmacovigilance practices.基于 BERT 的社交媒体中药物不良反应精准提取语言模型:实现、评估及对药物警戒实践的贡献。
Front Public Health. 2024 Apr 23;12:1392180. doi: 10.3389/fpubh.2024.1392180. eCollection 2024.
5
Transferability Based on Drug Structure Similarity in the Automatic Classification of Noncompliant Drug Use on Social Media: Natural Language Processing Approach.基于药物结构相似性的社交媒体中不规范用药自动分类的可转移性:自然语言处理方法。
J Med Internet Res. 2023 May 3;25:e44870. doi: 10.2196/44870.
6
Machine Learning and Natural Language Processing for Geolocation-Centric Monitoring and Characterization of Opioid-Related Social Media Chatter.基于机器学习和自然语言处理的地理定位中心监测和特征描述阿片类药物相关社交媒体聊天。
JAMA Netw Open. 2019 Nov 1;2(11):e1914672. doi: 10.1001/jamanetworkopen.2019.14672.
7
Filtering big data from social media--Building an early warning system for adverse drug reactions.从社交媒体中筛选大数据——构建药物不良反应预警系统。
J Biomed Inform. 2015 Apr;54:230-40. doi: 10.1016/j.jbi.2015.01.011. Epub 2015 Feb 14.
8
Artificial Intelligence-Enabled Social Media Analysis for Pharmacovigilance of COVID-19 Vaccinations in the United Kingdom: Observational Study.英国利用人工智能进行社交媒体分析以监测新冠疫苗接种的药物警戒:观察性研究
JMIR Public Health Surveill. 2022 May 27;8(5):e32543. doi: 10.2196/32543.
9
Artificial Intelligent Context-Aware Machine-Learning Tool to Detect Adverse Drug Events from Social Media Platforms.人工智能上下文感知机器学习工具,用于从社交媒体平台检测药物不良事件。
J Med Toxicol. 2022 Oct;18(4):311-320. doi: 10.1007/s13181-022-00906-2. Epub 2022 Sep 12.
10
Utilizing social media data for pharmacovigilance: A review.利用社交媒体数据进行药物警戒:综述
J Biomed Inform. 2015 Apr;54:202-12. doi: 10.1016/j.jbi.2015.02.004. Epub 2015 Feb 23.

引用本文的文献

1
RadGPT: A System Based on a Large Language Model That Generates Sets of Patient-Centered Materials to Explain Radiology Report Information.RadGPT:一种基于大语言模型的系统,可生成以患者为中心的材料集来解释放射学报告信息。
J Am Coll Radiol. 2025 Jun 10. doi: 10.1016/j.jacr.2025.06.013.
2
Pharmacovigilance in the digital age: gaining insight from social media data.数字时代的药物警戒:从社交媒体数据中获取见解
Exp Biol Med (Maywood). 2025 May 27;250:10555. doi: 10.3389/ebm.2025.10555. eCollection 2025.
3
Which social media platforms facilitate monitoring the opioid crisis?

本文引用的文献

1
The opioid epidemic: A worldwide exploratory study using the WHO pharmacovigilance database.阿片类药物流行:使用世界卫生组织药物警戒数据库进行的全球探索性研究。
Addiction. 2023 Apr;118(4):771-775. doi: 10.1111/add.16081. Epub 2022 Nov 22.
2
Deep Neural Networks for Fine-Grained Surveillance of Overdose Mortality.深度神经网络在过量死亡率的精细监测中的应用。
Am J Epidemiol. 2023 Feb 1;192(2):257-266. doi: 10.1093/aje/kwac180.
3
Pharmacovigilance Signals of the Opioid Epidemic over 10 Years: Data Mining Methods in the Analysis of Pharmacovigilance Datasets Collecting Adverse Drug Reactions (ADRs) Reported to EudraVigilance (EV) and the FDA Adverse Event Reporting System (FAERS).
哪些社交媒体平台有助于监测阿片类药物危机?
PLOS Digit Health. 2025 Apr 28;4(4):e0000842. doi: 10.1371/journal.pdig.0000842. eCollection 2025 Apr.
4
Evolutionary Trend of Dental Health Care Information on Chinese Social Media Platforms During 2018-2022: Retrospective Observational Study.2018 - 2022年中国社交媒体平台上牙齿保健信息的演变趋势:回顾性观察研究
JMIR Infodemiology. 2025 Apr 10;5:e55065. doi: 10.2196/55065.
5
Performance of ChatGPT-3.5 and ChatGPT-4 in the Taiwan National Pharmacist Licensing Examination: Comparative Evaluation Study.ChatGPT-3.5和ChatGPT-4在台湾国家药剂师执照考试中的表现:比较评估研究。
JMIR Med Educ. 2025 Jan 17;11:e56850. doi: 10.2196/56850.
6
Revolutionizing Health Care: The Transformative Impact of Large Language Models in Medicine.变革医疗保健:大语言模型在医学领域的变革性影响。
J Med Internet Res. 2025 Jan 7;27:e59069. doi: 10.2196/59069.
7
Discovering opioid slang on social media: A Word2Vec approach with reddit data.在社交媒体上发现阿片类药物俚语:一种基于Reddit数据的词向量方法。
Drug Alcohol Depend Rep. 2024 Nov 19;13:100302. doi: 10.1016/j.dadr.2024.100302. eCollection 2024 Dec.
8
Applications and Concerns of ChatGPT and Other Conversational Large Language Models in Health Care: Systematic Review.ChatGPT 及其他会话型大型语言模型在医疗保健中的应用及关注:系统评价。
J Med Internet Res. 2024 Nov 7;26:e22769. doi: 10.2196/22769.
9
A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare.ChatGPT及其他对话式大语言模型在医疗保健领域的系统评价
medRxiv. 2024 Apr 27:2024.04.26.24306390. doi: 10.1101/2024.04.26.24306390.
10
AutoCriteria: a generalizable clinical trial eligibility criteria extraction system powered by large language models.AutoCriteria:一个由大型语言模型驱动的可推广的临床试验纳入标准提取系统。
J Am Med Inform Assoc. 2024 Jan 18;31(2):375-385. doi: 10.1093/jamia/ocad218.
十年阿片类药物流行的药物警戒信号:用于分析向欧洲药品不良反应数据库(EudraVigilance,EV)和美国食品药品监督管理局不良事件报告系统(FAERS)报告的药物不良反应(ADR)的药物警戒数据集的数据挖掘方法
Pharmaceuticals (Basel). 2022 May 27;15(6):675. doi: 10.3390/ph15060675.
4
New meaning for NLP: the trials and tribulations of natural language processing with GPT-3 in ophthalmology.NLP 有了新的含义:眼科中使用 GPT-3 进行自然语言处理的尝试和磨难。
Br J Ophthalmol. 2022 Jul;106(7):889-892. doi: 10.1136/bjophthalmol-2022-321141. Epub 2022 May 6.
5
A descriptive analysis of concomitant opioid and benzodiazepine medication use and associated adverse drug events in United States adults between 2009 and 2018.2009年至2018年间美国成年人同时使用阿片类药物和苯二氮卓类药物及其相关药物不良事件的描述性分析。
Explor Res Clin Soc Pharm. 2022 Mar 20;5:100130. doi: 10.1016/j.rcsop.2022.100130. eCollection 2022 Mar.
6
Causal Effect of Chronic Pain on Mortality Through Opioid Prescriptions: Application of the Front-Door Formula.慢性疼痛通过阿片类药物处方对死亡率的因果影响:前门公式的应用。
Epidemiology. 2022 Jul 1;33(4):572-580. doi: 10.1097/EDE.0000000000001490. Epub 2022 Apr 5.
7
Using Named Entity Recognition to Identify Substances Used in the Self-medication of Opioid Withdrawal: Natural Language Processing Study of Reddit Data.使用命名实体识别来识别阿片类药物戒断自我治疗中使用的物质:Reddit数据的自然语言处理研究
JMIR Form Res. 2022 Mar 30;6(3):e33919. doi: 10.2196/33919.
8
Artificial Intelligence-Enabled Social Media Analysis for Pharmacovigilance of COVID-19 Vaccinations in the United Kingdom: Observational Study.英国利用人工智能进行社交媒体分析以监测新冠疫苗接种的药物警戒:观察性研究
JMIR Public Health Surveill. 2022 May 27;8(5):e32543. doi: 10.2196/32543.
9
Operationalizing and Implementing Pretrained, Large Artificial Intelligence Linguistic Models in the US Health Care System: Outlook of Generative Pretrained Transformer 3 (GPT-3) as a Service Model.在美国医疗保健系统中实施和应用预训练的大型人工智能语言模型:生成式预训练变换器3(GPT-3)作为服务模型的前景
JMIR Med Inform. 2022 Feb 10;10(2):e32875. doi: 10.2196/32875.
10
Quantifying the Severity of Adverse Drug Reactions Using Social Media: Network Analysis.利用社交媒体量化药物不良反应的严重程度:网络分析
J Med Internet Res. 2021 Oct 21;23(10):e27714. doi: 10.2196/27714.