• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探索 CTEPH 药物治疗:联合自然语言处理、嵌入学习、体外模型和真实世界证据进行药物再利用。

An exploration into CTEPH medications: Combining natural language processing, embedding learning, in vitro models, and real-world evidence for drug repurposing.

机构信息

Laboratory of in vitro modeling systems of pulmonary and thrombotic diseases, Institute of Physiology, Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany.

Structural Bioinformatics Group, Institute of Physiology, Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany.

出版信息

PLoS Comput Biol. 2024 Sep 12;20(9):e1012417. doi: 10.1371/journal.pcbi.1012417. eCollection 2024 Sep.

DOI:10.1371/journal.pcbi.1012417
PMID:39264975
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11478854/
Abstract

BACKGROUND

In the modern era, the growth of scientific literature presents a daunting challenge for researchers to keep informed of advancements across multiple disciplines.

OBJECTIVE

We apply natural language processing (NLP) and embedding learning concepts to design PubDigest, a tool that combs PubMed literature, aiming to pinpoint potential drugs that could be repurposed.

METHODS

Using NLP, especially term associations through word embeddings, we explored unrecognized relationships between drugs and diseases. To illustrate the utility of PubDigest, we focused on chronic thromboembolic pulmonary hypertension (CTEPH), a rare disease with an overall limited number of scientific publications.

RESULTS

Our literature analysis identified key clinical features linked to CTEPH by applying term frequency-inverse document frequency (TF-IDF) scoring, a technique measuring a term's significance in a text corpus. This allowed us to map related diseases. One standout was venous thrombosis (VT), which showed strong semantic links with CTEPH. Looking deeper, we discovered potential repurposing candidates for CTEPH through large-scale neural network-based contextualization of literature and predictive modeling on both the CTEPH and the VT literature corpora to find novel, yet unrecognized associations between the two diseases. Alongside the anti-thrombotic agent caplacizumab, benzofuran derivatives were an intriguing find. In particular, the benzofuran derivative amiodarone displayed potential anti-thrombotic properties in the literature. Our in vitro tests confirmed amiodarone's ability to reduce platelet aggregation significantly by 68% (p = 0.02). However, real-world clinical data indicated that CTEPH patients receiving amiodarone treatment faced a significant 15.9% higher mortality risk (p<0.001).

CONCLUSIONS

While NLP offers an innovative approach to interpreting scientific literature, especially for drug repurposing, it is crucial to combine it with complementary methods like in vitro testing and real-world evidence. Our exploration with benzofuran derivatives and CTEPH underscores this point. Thus, blending NLP with hands-on experiments and real-world clinical data can pave the way for faster and safer drug repurposing approaches, especially for rare diseases like CTEPH.

摘要

背景

在现代,科学文献的增长给研究人员带来了一项艰巨的挑战,他们需要及时了解多个学科的最新进展。

目的

我们应用自然语言处理(NLP)和嵌入学习概念来设计 PubDigest,这是一种梳理 PubMed 文献的工具,旨在发现可能被重新利用的潜在药物。

方法

我们使用 NLP,特别是通过词嵌入来探索术语关联,研究了药物和疾病之间未被识别的关系。为了说明 PubDigest 的实用性,我们专注于慢性血栓栓塞性肺动脉高压(CTEPH)这一罕见疾病,其科学文献总体数量有限。

结果

通过应用术语频率-逆文档频率(TF-IDF)评分的方法,我们对文献进行了分析,识别出与 CTEPH 相关的关键临床特征,这是一种衡量术语在文本语料库中重要性的技术。这使我们能够映射相关疾病。其中一个突出的疾病是静脉血栓形成(VT),它与 CTEPH 具有很强的语义联系。进一步深入研究,我们通过对文献进行大规模神经网络上下文化处理,并对 CTEPH 和 VT 文献语料库进行预测建模,发现了这两种疾病之间新的、尚未被认识到的关联,找到了 CTEPH 的潜在重新利用候选药物。除了抗血栓药物 caplacizumab 外,苯并呋喃衍生物也是一个有趣的发现。特别是,苯并呋喃衍生物胺碘酮在文献中显示出潜在的抗血栓特性。我们的体外试验证实,胺碘酮可使血小板聚集显著减少 68%(p = 0.02)。然而,真实世界的临床数据表明,接受胺碘酮治疗的 CTEPH 患者的死亡率风险显著增加了 15.9%(p<0.001)。

结论

虽然 NLP 为解释科学文献提供了一种创新方法,特别是在药物重新利用方面,但将其与体外测试和真实世界证据等补充方法结合使用至关重要。我们对苯并呋喃衍生物和 CTEPH 的探索突出了这一点。因此,将 NLP 与实际实验和真实世界的临床数据相结合,可以为更快、更安全的药物重新利用方法铺平道路,特别是对于 CTEPH 等罕见疾病。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/30a3bd97cd8e/pcbi.1012417.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/a6c1c2b9e9b8/pcbi.1012417.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/b055f702c78b/pcbi.1012417.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/195bcc2662a9/pcbi.1012417.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/609d73b5d002/pcbi.1012417.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/4d7923dc2076/pcbi.1012417.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/30a3bd97cd8e/pcbi.1012417.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/a6c1c2b9e9b8/pcbi.1012417.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/b055f702c78b/pcbi.1012417.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/195bcc2662a9/pcbi.1012417.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/609d73b5d002/pcbi.1012417.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/4d7923dc2076/pcbi.1012417.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3460/11478854/30a3bd97cd8e/pcbi.1012417.g006.jpg

相似文献

1
An exploration into CTEPH medications: Combining natural language processing, embedding learning, in vitro models, and real-world evidence for drug repurposing.探索 CTEPH 药物治疗:联合自然语言处理、嵌入学习、体外模型和真实世界证据进行药物再利用。
PLoS Comput Biol. 2024 Sep 12;20(9):e1012417. doi: 10.1371/journal.pcbi.1012417. eCollection 2024 Sep.
2
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
3
Non-contrasted computed tomography (NCCT) based chronic thromboembolic pulmonary hypertension (CTEPH) automatic diagnosis using cascaded network with multiple instance learning.基于非对比增强计算机断层扫描(NCCT)的慢性血栓栓塞性肺动脉高压(CTEPH)自动诊断的级联网络与多实例学习方法。
Phys Med Biol. 2024 Sep 13;69(18). doi: 10.1088/1361-6560/ad7455.
4
PISTON: Predicting drug indications and side effects using topic modeling and natural language processing.PISTON:使用主题建模和自然语言处理预测药物适应证和副作用。
J Biomed Inform. 2018 Nov;87:96-107. doi: 10.1016/j.jbi.2018.09.015. Epub 2018 Sep 27.
5
Text mining-based word representations for biomedical data analysis and protein-protein interaction networks in machine learning tasks.基于文本挖掘的词表示在生物医学数据分析和机器学习任务中的蛋白质-蛋白质相互作用网络。
PLoS One. 2021 Oct 15;16(10):e0258623. doi: 10.1371/journal.pone.0258623. eCollection 2021.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Mining fall-related information in clinical notes: Comparison of rule-based and novel word embedding-based machine learning approaches.挖掘临床记录中与跌倒相关的信息:基于规则和基于新颖词嵌入的机器学习方法的比较。
J Biomed Inform. 2019 Feb;90:103103. doi: 10.1016/j.jbi.2019.103103. Epub 2019 Jan 9.
8
Exercise physiological responses to drug treatments in chronic thromboembolic pulmonary hypertension.慢性血栓栓塞性肺动脉高压中运动对药物治疗的生理反应
J Appl Physiol (1985). 2016 Sep 1;121(3):623-8. doi: 10.1152/japplphysiol.00087.2016. Epub 2016 Jul 14.
9
Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.基于机器学习的自然语言处理方法对临床笔记进行医学子域分类。
BMC Med Inform Decis Mak. 2017 Dec 1;17(1):155. doi: 10.1186/s12911-017-0556-8.
10
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

引用本文的文献

1
In Silico Validation of AI-Assisted Drugs in Healthcare.医疗保健中人工智能辅助药物的计算机模拟验证
Methods Mol Biol. 2025;2952:445-458. doi: 10.1007/978-1-0716-4690-8_24.

本文引用的文献

1
Biomedical knowledge graph learning for drug repurposing by extending guilt-by-association to multiple layers.通过将关联推断扩展到多个层次来进行药物再利用的生物医学知识图学习。
Nat Commun. 2023 Jun 15;14(1):3570. doi: 10.1038/s41467-023-39301-y.
2
Building a knowledge graph to enable precision medicine.构建知识图谱以实现精准医学。
Sci Data. 2023 Feb 2;10(1):67. doi: 10.1038/s41597-023-01960-3.
3
Pulmonary Fibrosis Related to Amiodarone-Is It a Standard Pathophysiological Pattern? A Case-Based Literature Review.胺碘酮相关的肺纤维化——它是一种标准的病理生理模式吗?基于病例的文献综述。
Diagnostics (Basel). 2022 Dec 19;12(12):3217. doi: 10.3390/diagnostics12123217.
4
2022 ESC/ERS Guidelines for the diagnosis and treatment of pulmonary hypertension.2022年欧洲心脏病学会/欧洲呼吸学会肺动脉高压诊断和治疗指南。
Eur Heart J. 2022 Oct 11;43(38):3618-3731. doi: 10.1093/eurheartj/ehac237.
5
Natural source, bioactivity and synthesis of benzofuran derivatives.苯并呋喃衍生物的天然来源、生物活性及合成
RSC Adv. 2019 Sep 2;9(47):27510-27540. doi: 10.1039/c9ra04917g. eCollection 2019 Aug 29.
6
Epigenetic Modification of the von Willebrand Factor Promoter Drives Platelet Aggregation on the Pulmonary Endothelium in Chronic Thromboembolic Pulmonary Hypertension.慢性血栓栓塞性肺动脉高压中血管性血友病因子启动子的表观遗传修饰驱动血小板在肺内皮聚集。
Am J Respir Crit Care Med. 2022 Apr 1;205(7):806-818. doi: 10.1164/rccm.202109-2075OC.
7
In Vitro Microfluidic Disease Model to Study Whole Blood-Endothelial Interactions and Blood Clot Dynamics in Real-Time.用于实时研究全血与内皮相互作用及血凝块动力学的体外微流控疾病模型
J Vis Exp. 2020 May 24(159). doi: 10.3791/61068.
8
Wikidata as a knowledge graph for the life sciences.Wikidata 作为生命科学的知识图谱。
Elife. 2020 Mar 17;9:e52614. doi: 10.7554/eLife.52614.
9
Unsupervised word embeddings capture latent knowledge from materials science literature.无监督词嵌入方法可以从材料科学文献中提取潜在知识。
Nature. 2019 Jul;571(7763):95-98. doi: 10.1038/s41586-019-1335-8. Epub 2019 Jul 3.
10
The ADAMTS13-VWF axis is dysregulated in chronic thromboembolic pulmonary hypertension.ADAMTS13-VWF 轴在慢性血栓栓塞性肺动脉高压中失调。
Eur Respir J. 2019 Mar 28;53(3). doi: 10.1183/13993003.01805-2018. Print 2019 Mar.