• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于识别和分类子痫前期及类似病症中母体蛋白质和基因的文本短语挖掘

Text phrase-mining in identifying and classifying maternal proteins and genes across preeclampsia and similar pathologies.

作者信息

Urdang Jacqueline G, Masters Stephanie, Edokobi Nneoma, Mukherjee Chitra, Quazi Arnib, Liem David A, Ahrens Monica, Wang Xuan, Whitham Megan

机构信息

Virginia Tech Carilion School of Medicine, Roanoke, Virginia, USA.

Carilion Clinic, Roanoke, Virginia, USA.

出版信息

Physiol Rep. 2025 Mar;13(6):e70262. doi: 10.14814/phy2.70262.

DOI:10.14814/phy2.70262
PMID:40102640
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11919630/
Abstract

This study aims to demonstrate that text phrase-mining and natural language processing (NLP) can annotate huge quantities of obstetrics textual data for the discovery and evaluation of maternal protein/gene (MPG)-disease interactions involved in the preeclampsia pathway. We employ a phrase-mining/NLP pipeline to evaluate unique MPGs involved in six cardiovascular derangements with overlapping presentations during pregnancy. The diseases were matched with Medical Subject Headings. A textual corpus was developed from abstracts matched to these terms through PubMed. Fourty-four MPGs were identified with respect to the diseases. Processing was performed, with unique scores for each MPG-disease pair. Components of the score were calculated and weighted for distinctness, integrity, and popularity. Statistical analyses were conducted for the examination of protein-disease relationships. Fourty-four MPGs with known associations to cardiovascular disease and preeclampsia pathways were identified among the 6 diseases. MPGs shared across the greatest number of disease states were implicated in: (1) angiogenesis and vasoconstriction, (2) hemodynamic regulation, (3) hormonal regulation of metabolism, and (4) inflammation. NLP and text phrase-mining are successfully applied to Obstetrics abstracts with accuracy and speed. This approach holds promise in synthesizing large volumes of data for presenting trends in the Obstetric literature and for the identification of promising biomarkers.

摘要

本研究旨在证明文本短语挖掘和自然语言处理(NLP)能够注释大量产科文本数据,以发现和评估子痫前期途径中涉及的母体蛋白/基因(MPG)-疾病相互作用。我们采用短语挖掘/NLP流程来评估妊娠期间具有重叠表现的六种心血管紊乱所涉及的独特MPG。这些疾病与医学主题词进行匹配。通过PubMed从与这些术语匹配的摘要中构建了一个文本语料库。针对这些疾病确定了44个MPG。进行了处理,为每个MPG-疾病对赋予了独特的分数。计算了分数的组成部分,并根据独特性、完整性和流行度进行加权。进行了统计分析以检查蛋白质-疾病关系。在这6种疾病中确定了44个与心血管疾病和子痫前期途径有已知关联的MPG。在最多疾病状态中共享的MPG涉及:(1)血管生成和血管收缩,(2)血流动力学调节,(3)代谢的激素调节,以及(4)炎症。NLP和文本短语挖掘已成功且准确快速地应用于产科摘要。这种方法在综合大量数据以呈现产科文献趋势和识别有前景的生物标志物方面具有前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/26e74e0dcecd/PHY2-13-e70262-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/01b2a2d527eb/PHY2-13-e70262-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/760526d0e7b4/PHY2-13-e70262-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/884063ba9df1/PHY2-13-e70262-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/d6be5a7d532c/PHY2-13-e70262-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/26e74e0dcecd/PHY2-13-e70262-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/01b2a2d527eb/PHY2-13-e70262-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/760526d0e7b4/PHY2-13-e70262-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/884063ba9df1/PHY2-13-e70262-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/d6be5a7d532c/PHY2-13-e70262-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a24/11919630/26e74e0dcecd/PHY2-13-e70262-g003.jpg

相似文献

1
Text phrase-mining in identifying and classifying maternal proteins and genes across preeclampsia and similar pathologies.用于识别和分类子痫前期及类似病症中母体蛋白质和基因的文本短语挖掘
Physiol Rep. 2025 Mar;13(6):e70262. doi: 10.14814/phy2.70262.
2
Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications.基于云的生物医学出版物中用户定义短语类别关联的短语挖掘与分析
J Vis Exp. 2019 Feb 23(144). doi: 10.3791/59108.
3
Phrase mining of textual data to analyze extracellular matrix protein patterns across cardiovascular disease.基于文本数据的短语挖掘,分析心血管疾病中外细胞基质蛋白模式。
Am J Physiol Heart Circ Physiol. 2018 Oct 1;315(4):H910-H924. doi: 10.1152/ajpheart.00175.2018. Epub 2018 May 18.
4
Text Mining and Machine Learning Protocol for Extracting Human-Related Protein Phosphorylation Information from PubMed.从 PubMed 中提取与人相关的蛋白质磷酸化信息的文本挖掘和机器学习协议。
Methods Mol Biol. 2022;2496:159-177. doi: 10.1007/978-1-0716-2305-3_9.
5
Natural language processing (NLP) to facilitate abstract review in medical research: the application of BioBERT to exploring the 20-year use of NLP in medical research.自然语言处理(NLP)在医学研究中的应用:BioBERT 在探索 20 年来 NLP 在医学研究中的应用。
Syst Rev. 2024 Apr 15;13(1):107. doi: 10.1186/s13643-024-02470-y.
6
Bioinformatic approach to the genetics of preeclampsia.生物信息学方法在子痫前期遗传学中的应用。
Obstet Gynecol. 2014 Jun;123(6):1155-1161. doi: 10.1097/AOG.0000000000000293.
7
Automating Clinical Chart Review: An Open-Source Natural Language Processing Pipeline Developed on Free-Text Radiology Reports From Patients With Glioblastoma.自动化临床图表审查:基于胶质母细胞瘤患者的自由文本放射学报告开发的开源自然语言处理管道。
JCO Clin Cancer Inform. 2020 Jan;4:25-34. doi: 10.1200/CCI.19.00060.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.评估医疗保健中的实体识别:实体模型定量研究。
JMIR Med Inform. 2024 Oct 17;12:e59782. doi: 10.2196/59782.
10
dbPEC: a comprehensive literature-based database for preeclampsia related genes and phenotypes.dbPEC:一个基于文献的子痫前期相关基因和表型综合数据库。
Database (Oxford). 2016 Mar 5;2016. doi: 10.1093/database/baw006. Print 2016.

本文引用的文献

1
Thrombotic Microangiopathy in Pregnancy: Current Understanding and Management Strategies.妊娠期血栓性微血管病:当前的认识与管理策略
Kidney Int Rep. 2024 May 22;9(8):2353-2371. doi: 10.1016/j.ekir.2024.05.016. eCollection 2024 Aug.
2
Obstetric antiphospholipid syndrome carries an increased lifetime risk for obstetric and thrombotic complications-a population-based study.产科抗磷脂综合征患者发生产科和血栓并发症的终生风险增加——一项基于人群的研究。
Res Pract Thromb Haemost. 2024 Apr 29;8(4):102430. doi: 10.1016/j.rpth.2024.102430. eCollection 2024 May.
3
Circulating Angiogenic Factor Levels in Hypertensive Disorders of Pregnancy.
妊娠期高血压疾病相关循环血管生成因子水平。
NEJM Evid. 2022 Dec;1(12):EVIDoa2200161. doi: 10.1056/EVIDoa2200161. Epub 2022 Nov 9.
4
Genetic Associations of Circulating Cardiovascular Proteins With Gestational Hypertension and Preeclampsia.循环心血管蛋白与妊娠期高血压和子痫前期的遗传关联。
JAMA Cardiol. 2024 Mar 1;9(3):209-220. doi: 10.1001/jamacardio.2023.4994.
5
Use of electronic health record data mining for heart failure subtyping.利用电子健康记录数据挖掘进行心力衰竭亚型分类。
BMC Res Notes. 2023 Sep 11;16(1):208. doi: 10.1186/s13104-023-06469-x.
6
Two distinct molecular faces of preeclampsia revealed by single-cell transcriptomics.单细胞转录组学揭示的子痫前期的两种不同分子面貌。
Med. 2023 Oct 13;4(10):687-709.e7. doi: 10.1016/j.medj.2023.07.005. Epub 2023 Aug 11.
7
Hypertensive disorders of pregnancy.妊娠高血压疾病。
BMJ. 2023 Jun 30;381:e071653. doi: 10.1136/bmj-2022-071653.
8
Identifying miRNA biomarkers for breast cancer and ovarian cancer: a text mining perspective.从文本挖掘角度识别乳腺癌和卵巢癌的 miRNA 生物标志物。
Breast Cancer Res Treat. 2023 Aug;201(1):5-14. doi: 10.1007/s10549-023-06996-y. Epub 2023 Jun 17.
9
Integrating Text Mining into the Curation of Disease Maps.将文本挖掘技术整合到疾病图谱的编纂中。
Biomolecules. 2022 Sep 10;12(9):1278. doi: 10.3390/biom12091278.
10
Text Mining Protocol to Retrieve Significant Drug-Gene Interactions from PubMed Abstracts.从 PubMed 摘要中检索重要药物-基因相互作用的文本挖掘方案。
Methods Mol Biol. 2022;2496:17-39. doi: 10.1007/978-1-0716-2305-3_2.