• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

文本挖掘方法揭示住院患者长新冠的临床状况。

Text mining method to unravel long COVID's clinical condition in hospitalized patients.

机构信息

Laboratório de Medicina e Saúde Pública de Precisão (MeSP2), Instituto Gonçalo Moniz, Fundação Oswaldo Cruz, Salvador, Brazil.

Centro de Integração de Dados e Conhecimentos para a Saúde (CIDACS), Instituto Gonçalo Moniz, Fundação Oswaldo Cruz, Salvador, Brazil.

出版信息

Cell Death Dis. 2024 Sep 13;15(9):671. doi: 10.1038/s41419-024-07043-4.

DOI:10.1038/s41419-024-07043-4
PMID:39271699
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11399332/
Abstract

Long COVID is characterized by persistent that extends symptoms beyond established timeframes. Its varied presentation across different populations and healthcare systems poses significant challenges in understanding its clinical manifestations and implications. In this study, we present a novel application of text mining technique to automatically extract unstructured data from a long COVID survey conducted at a prominent university hospital in São Paulo, Brazil. Our phonetic text clustering (PTC) method enables the exploration of unstructured Electronic Healthcare Records (EHR) data to unify different written forms of similar terms into a single phonemic representation. We used n-gram text analysis to detect compound words and negated terms in Portuguese-BR, focusing on medical conditions and symptoms related to long COVID. By leveraging text mining, we aim to contribute to a deeper understanding of this chronic condition and its implications for healthcare systems globally. The model developed in this study has the potential for scalability and applicability in other healthcare settings, thereby supporting broader research efforts and informing clinical decision-making for long COVID patients.

摘要

长新冠的特点是持续存在的症状超出了既定的时间框架。它在不同人群和医疗保健系统中的不同表现形式给理解其临床表现和影响带来了重大挑战。在这项研究中,我们提出了一种文本挖掘技术的新应用,用于自动从巴西圣保罗一家著名大学医院进行的长新冠调查中提取非结构化数据。我们的语音文本聚类 (PTC) 方法能够探索非结构化的电子健康记录 (EHR) 数据,将相似术语的不同书写形式统一为单个语音表示。我们使用 n 元组文本分析来检测葡萄牙语-BR 中的复合词和否定词,重点是与长新冠相关的医疗状况和症状。通过利用文本挖掘,我们旨在深入了解这种慢性疾病及其对全球医疗保健系统的影响。本研究中开发的模型具有可扩展性和在其他医疗保健环境中的适用性,从而支持更广泛的研究工作,并为长新冠患者的临床决策提供信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/11399332/50cbc477507e/41419_2024_7043_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/11399332/7644c112a8bd/41419_2024_7043_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/11399332/35e21e5cee6a/41419_2024_7043_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/11399332/913d13ff8f2c/41419_2024_7043_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/11399332/50cbc477507e/41419_2024_7043_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/11399332/7644c112a8bd/41419_2024_7043_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/11399332/35e21e5cee6a/41419_2024_7043_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/11399332/913d13ff8f2c/41419_2024_7043_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/11399332/50cbc477507e/41419_2024_7043_Fig4_HTML.jpg

相似文献

1
Text mining method to unravel long COVID's clinical condition in hospitalized patients.文本挖掘方法揭示住院患者长新冠的临床状况。
Cell Death Dis. 2024 Sep 13;15(9):671. doi: 10.1038/s41419-024-07043-4.
2
High prevalence of SARS-CoV-2 infection among symptomatic healthcare workers in a large university tertiary hospital in São Paulo, Brazil.巴西圣保罗一家大型大学附属医院出现症状的医护人员中 SARS-CoV-2 感染的高流行率。
BMC Infect Dis. 2020 Dec 2;20(1):917. doi: 10.1186/s12879-020-05662-8.
3
Text-mining in electronic healthcare records can be used as efficient tool for screening and data collection in cardiovascular trials: a multicenter validation study.电子医疗记录中的文本挖掘可以作为心血管试验中筛选和数据收集的有效工具:一项多中心验证研究。
J Clin Epidemiol. 2021 Apr;132:97-105. doi: 10.1016/j.jclinepi.2020.11.014. Epub 2020 Nov 25.
4
Approaches to text mining for analyzing treatment plan of quit smoking with free-text medical records: A PRISMA-compliant meta-analysis.利用自由文本医疗记录分析戒烟治疗方案的文本挖掘方法:一项遵循PRISMA标准的荟萃分析。
Medicine (Baltimore). 2020 Jul 17;99(29):e20999. doi: 10.1097/MD.0000000000020999.
5
Text mining approaches for dealing with the rapidly expanding literature on COVID-19.文本挖掘方法在处理 COVID-19 相关文献快速膨胀方面的应用。
Brief Bioinform. 2021 Mar 22;22(2):781-799. doi: 10.1093/bib/bbaa296.
6
Long COVID-19: A Systematic Review.长新冠:系统综述。
J Assoc Physicians India. 2023 Sep;71(9):82-94. doi: 10.59556/japi.71.0337.
7
The hidden crisis: Long COVID's association with housing stability and home accessibility among people with disabilities.隐藏的危机:长期新冠与残疾人士住房稳定性和家居可达性的关联。
Disabil Health J. 2024 Oct;17(4):101650. doi: 10.1016/j.dhjo.2024.101650. Epub 2024 Jun 7.
8
Could tiny blood clots cause long COVID's puzzling symptoms?微小的血凝块会导致长期新冠的令人困惑的症状吗?
Nature. 2022 Aug;608(7924):662-664. doi: 10.1038/d41586-022-02286-7.
9
Redundancy in electronic health record corpora: analysis, impact on text mining performance and mitigation strategies.电子健康记录语料库中的冗余:分析、对文本挖掘性能的影响和缓解策略。
BMC Bioinformatics. 2013 Jan 16;14:10. doi: 10.1186/1471-2105-14-10.
10
Coronary artery disease risk assessment from unstructured electronic health records using text mining.利用文本挖掘技术从非结构化电子健康记录中进行冠状动脉疾病风险评估。
J Biomed Inform. 2015 Dec;58 Suppl(Suppl):S203-S210. doi: 10.1016/j.jbi.2015.08.003. Epub 2015 Aug 28.

引用本文的文献

1
Analysis of the causes of improper medical decision-making in medical damage liability disputes in China: a text mining approach.中国医疗损害责任纠纷中医方不当诊疗决策成因分析:一种文本挖掘方法
BMC Health Serv Res. 2025 Aug 20;25(1):1112. doi: 10.1186/s12913-025-13177-8.

本文引用的文献

1
COVID-19 outbreaks surveillance through text mining applied to electronic health records.通过文本挖掘应用于电子健康记录进行 COVID-19 暴发监测。
BMC Infect Dis. 2024 Mar 28;24(1):359. doi: 10.1186/s12879-024-09250-y.
2
Postacute sequelae of COVID-19 at 2 years.COVID-19 后 2 年的后遗症。
Nat Med. 2023 Sep;29(9):2347-2357. doi: 10.1038/s41591-023-02521-2. Epub 2023 Aug 21.
3
Altered tissue oxygenation in patients with post COVID-19 syndrome.新冠后遗症患者的组织氧合改变。
Microvasc Res. 2023 Jul;148:104551. doi: 10.1016/j.mvr.2023.104551. Epub 2023 May 16.
4
Complications Post-COVID-19 and Risk Factors among Patients after Six Months of a SARS-CoV-2 Infection: A Population-Based Prospective Cohort Study.新冠病毒感染六个月后患者的新冠后并发症及危险因素:一项基于人群的前瞻性队列研究
Epidemiologia (Basel). 2022 Feb 10;3(1):49-67. doi: 10.3390/epidemiologia3010006.
5
Acute and postacute sequelae associated with SARS-CoV-2 reinfection.与 SARS-CoV-2 再感染相关的急性和后期后遗症。
Nat Med. 2022 Nov;28(11):2398-2405. doi: 10.1038/s41591-022-02051-3. Epub 2022 Nov 10.
6
Effectiveness of BNT162b2 booster after CoronaVac primary regimen in pregnant people during omicron period in Brazil.在巴西奥密克戎时期,BNT162b2加强针在接种科兴疫苗初免方案后的孕妇中的有效性。
Lancet Infect Dis. 2022 Dec;22(12):1669-1670. doi: 10.1016/S1473-3099(22)00728-9. Epub 2022 Nov 7.
7
Estimated Global Proportions of Individuals With Persistent Fatigue, Cognitive, and Respiratory Symptom Clusters Following Symptomatic COVID-19 in 2020 and 2021.估计 2020 年和 2021 年有症状 COVID-19 后持续性疲劳、认知和呼吸症状群个体在全球的比例。
JAMA. 2022 Oct 25;328(16):1604-1615. doi: 10.1001/jama.2022.18931.
8
Use of the Postacute Sequelae of COVID-19 Diagnosis Code in Routine Clinical Practice in the US.美国常规临床实践中 COVID-19 诊断后后遗症编码的使用。
JAMA Netw Open. 2022 Oct 3;5(10):e2235089. doi: 10.1001/jamanetworkopen.2022.35089.
9
Mental health-related communication in a virtual community: text mining analysis of a digital exchange platform during the Covid-19 pandemic.心理健康相关的虚拟社区交流:Covid-19 大流行期间数字交流平台的文本挖掘分析。
BMC Psychiatry. 2022 Jun 25;22(1):430. doi: 10.1186/s12888-022-04080-1.
10
Effectiveness of CoronaVac, ChAdOx1 nCoV-19, BNT162b2, and Ad26.COV2.S among individuals with previous SARS-CoV-2 infection in Brazil: a test-negative, case-control study.在巴西,既往感染过 SARS-CoV-2 的个体中 CoronaVac、ChAdOx1 nCoV-19、BNT162b2 和 Ad26.COV2.S 的有效性:一项病例对照研究。
Lancet Infect Dis. 2022 Jun;22(6):791-801. doi: 10.1016/S1473-3099(22)00140-2. Epub 2022 Apr 1.