• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Automatic lymphoma classification with sentence subgraph mining from pathology reports.基于病理报告中的句子子图挖掘进行自动淋巴瘤分类。
J Am Med Inform Assoc. 2014 Sep-Oct;21(5):824-32. doi: 10.1136/amiajnl-2013-002443. Epub 2014 Jan 15.
2
Subgraph augmented non-negative tensor factorization (SANTF) for modeling clinical narrative text.用于临床叙事文本建模的子图增强非负张量分解(SANTF)
J Am Med Inform Assoc. 2015 Sep;22(5):1009-19. doi: 10.1093/jamia/ocv016. Epub 2015 Apr 9.
3
BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network.BioEGRE:一种基于 BioELECTRA 和图指针神经网络的生物医学关系抽取的语言拓扑增强方法。
BMC Bioinformatics. 2023 Dec 19;24(1):486. doi: 10.1186/s12859-023-05601-9.
4
Context-driven automatic subgraph creation for literature-based discovery.用于基于文献的发现的上下文驱动自动子图创建
J Biomed Inform. 2015 Apr;54:141-57. doi: 10.1016/j.jbi.2015.01.014. Epub 2015 Feb 7.
5
Use of "off-the-shelf" information extraction algorithms in clinical informatics: A feasibility study of MetaMap annotation of Italian medical notes.临床信息学中“现成可用”信息提取算法的应用:意大利医学记录的MetaMap注释可行性研究。
J Biomed Inform. 2016 Oct;63:22-32. doi: 10.1016/j.jbi.2016.07.017. Epub 2016 Jul 18.
6
Exploiting graph kernels for high performance biomedical relation extraction.利用图核进行高性能生物医学关系提取。
J Biomed Semantics. 2018 Jan 30;9(1):7. doi: 10.1186/s13326-017-0168-3.
7
Application of Adaptive Neural Network Algorithm Model in English Text Analysis.自适应神经网络算法模型在英文文本分析中的应用。
Comput Intell Neurosci. 2022 May 26;2022:4866531. doi: 10.1155/2022/4866531. eCollection 2022.
8
Automated Classification of Selected Data Elements from Free-text Diagnostic Reports for Clinical Research.用于临床研究的自由文本诊断报告中选定数据元素的自动分类
Methods Inf Med. 2016 Aug 5;55(4):373-80. doi: 10.3414/ME15-02-0019. Epub 2016 Jul 13.
9
Portable automatic text classification for adverse drug reaction detection via multi-corpus training.通过多语料库训练实现用于药物不良反应检测的便携式自动文本分类
J Biomed Inform. 2015 Feb;53:196-207. doi: 10.1016/j.jbi.2014.11.002. Epub 2014 Nov 8.
10
A systematic approach for developing a corpus of patient reported adverse drug events: A case study for SSRI and SNRI medications.构建患者报告的药物不良事件语料库的系统方法:以SSRI和SNRI药物为例的案例研究。
J Biomed Inform. 2019 Feb;90:103091. doi: 10.1016/j.jbi.2018.12.005. Epub 2019 Jan 4.

引用本文的文献

1
A survey of NLP methods for oncology in the past decade with a focus on cancer registry applications.对过去十年肿瘤学领域自然语言处理方法的一项调查,重点关注癌症登记应用。
Artif Intell Rev. 2025;58(10):314. doi: 10.1007/s10462-025-11316-5. Epub 2025 Jul 16.
2
Leveraging social media data to study disease and treatment characteristics of Hodgkin's lymphoma Using Natural Language Processing methods.利用社交媒体数据,采用自然语言处理方法研究霍奇金淋巴瘤的疾病及治疗特征。
PLOS Digit Health. 2025 Mar 19;4(3):e0000765. doi: 10.1371/journal.pdig.0000765. eCollection 2025 Mar.
3
Leveraging natural language processing for efficient information extraction from breast cancer pathology reports: Single-institution study.利用自然语言处理技术从乳腺癌病理报告中高效提取信息:单机构研究。
PLoS One. 2025 Feb 18;20(2):e0318726. doi: 10.1371/journal.pone.0318726. eCollection 2025.
4
Northwestern University resource and education development initiatives to advance collaborative artificial intelligence across the learning health system.西北大学推进跨学习健康系统的协作人工智能的资源与教育发展举措。
Learn Health Syst. 2024 Apr 15;8(3):e10417. doi: 10.1002/lrh2.10417. eCollection 2024 Jul.
5
An accessible, efficient, and accurate natural language processing method for extracting diagnostic data from pathology reports.一种用于从病理报告中提取诊断数据的便捷、高效且准确的自然语言处理方法。
J Pathol Inform. 2022 Nov 8;13:100154. doi: 10.1016/j.jpi.2022.100154. eCollection 2022.
6
Machine Learning for Lung Cancer Diagnosis, Treatment, and Prognosis.机器学习在肺癌诊断、治疗和预后中的应用。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):850-866. doi: 10.1016/j.gpb.2022.11.003. Epub 2022 Dec 1.
7
A novel differential diagnosis algorithm for chronic lymphocytic leukemia using immunophenotyping with flow cytometry.一种使用流式细胞术免疫表型分析的慢性淋巴细胞白血病新型鉴别诊断算法。
Hematol Transfus Cell Ther. 2023 Apr-Jun;45(2):176-181. doi: 10.1016/j.htct.2021.08.012. Epub 2021 Nov 29.
8
Integration of NLP2FHIR Representation with Deep Learning Models for EHR Phenotyping: A Pilot Study on Obesity Datasets.自然语言处理 (NLP) 到 FHIR 表示的集成与深度学习模型在电子健康记录表型中的应用:肥胖数据集的初步研究。
AMIA Jt Summits Transl Sci Proc. 2021 May 17;2021:410-419. eCollection 2021.
9
Implementing a Portable Clinical NLP System with a Common Data Model - a Lisp Perspective.使用通用数据模型实现便携式临床自然语言处理系统——从Lisp的角度来看
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2018 Dec;2018:461-466. doi: 10.1109/bibm.2018.8621521. Epub 2019 Jan 24.
10
Identifying Breast Cancer Distant Recurrences from Electronic Health Records Using Machine Learning.使用机器学习从电子健康记录中识别乳腺癌远处复发
J Healthc Inform Res. 2019;3(3):283-299. doi: 10.1007/s41666-019-00046-3. Epub 2019 Apr 8.

本文引用的文献

1
Indexed Pain Journals.索引疼痛期刊。
J Pain Palliat Care Pharmacother. 2008;22(1):45-46. doi: 10.1080/15360280801989377.
2
Towards comprehensive syntactic and semantic annotations of the clinical narrative.朝着临床叙述的全面句法和语义标注努力。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):922-30. doi: 10.1136/amiajnl-2012-001317. Epub 2013 Jan 25.
3
Fibrin-associated large B-cell lymphoma: part of the spectrum of cardiac lymphomas.纤维相关的大 B 细胞淋巴瘤:心脏淋巴瘤谱的一部分。
Am J Surg Pathol. 2012 Oct;36(10):1527-37. doi: 10.1097/PAS.0b013e31825d53b5.
4
Nodular lymphocyte-predominant hodgkin lymphoma with atypical T cells: a morphologic variant mimicking peripheral T-cell lymphoma.结节性淋巴细胞为主型霍奇金淋巴瘤伴非典型 T 细胞:一种模仿外周 T 细胞淋巴瘤的形态学变异型。
Am J Surg Pathol. 2011 Nov;35(11):1666-78. doi: 10.1097/PAS.0b013e31822832de.
5
Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.梅奥临床文本分析和知识提取系统(cTAKES):架构、组件评估和应用。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.
6
Semantic relations for problem-oriented medical records.面向问题的病历的语义关系。
Artif Intell Med. 2010 Oct;50(2):63-73. doi: 10.1016/j.artmed.2010.05.006. Epub 2010 Jun 19.
7
Electronic medical records for discovery research in rheumatoid arthritis.电子病历在类风湿关节炎研究中的应用。
Arthritis Care Res (Hoboken). 2010 Aug;62(8):1120-7. doi: 10.1002/acr.20184.
8
B-cell lymphomas with concurrent IGH-BCL2 and MYC rearrangements are aggressive neoplasms with clinical and pathologic features distinct from Burkitt lymphoma and diffuse large B-cell lymphoma.IGH-BCL2 和 MYC 同时重排的 B 细胞淋巴瘤是具有侵袭性的肿瘤,其临床和病理特征与 Burkitt 淋巴瘤和弥漫性大 B 细胞淋巴瘤不同。
Am J Surg Pathol. 2010 Mar;34(3):327-40. doi: 10.1097/PAS.0b013e3181cd3aeb.
9
MedEx: a medication information extraction system for clinical narratives.MedEx:一个用于临床叙述的药物信息提取系统。
J Am Med Inform Assoc. 2010 Jan-Feb;17(1):19-24. doi: 10.1197/jamia.M3378.
10
Description of a rule-based system for the i2b2 challenge in natural language processing for clinical data.用于临床数据自然语言处理中i2b2挑战的基于规则系统的描述。
J Am Med Inform Assoc. 2009 Jul-Aug;16(4):571-5. doi: 10.1197/jamia.M3083. Epub 2009 Apr 23.

基于病理报告中的句子子图挖掘进行自动淋巴瘤分类。

Automatic lymphoma classification with sentence subgraph mining from pathology reports.

机构信息

Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.

Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Cambridge, Massachusetts, USA.

出版信息

J Am Med Inform Assoc. 2014 Sep-Oct;21(5):824-32. doi: 10.1136/amiajnl-2013-002443. Epub 2014 Jan 15.

DOI:10.1136/amiajnl-2013-002443
PMID:24431333
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4147603/
Abstract

OBJECTIVE

Pathology reports are rich in narrative statements that encode a complex web of relations among medical concepts. These relations are routinely used by doctors to reason on diagnoses, but often require hand-crafted rules or supervised learning to extract into prespecified forms for computational disease modeling. We aim to automatically capture relations from narrative text without supervision.

METHODS

We design a novel framework that translates sentences into graph representations, automatically mines sentence subgraphs, reduces redundancy in mined subgraphs, and automatically generates subgraph features for subsequent classification tasks. To ensure meaningful interpretations over the sentence graphs, we use the Unified Medical Language System Metathesaurus to map token subsequences to concepts, and in turn sentence graph nodes. We test our system with multiple lymphoma classification tasks that together mimic the differential diagnosis by a pathologist. To this end, we prevent our classifiers from looking at explicit mentions or synonyms of lymphomas in the text.

RESULTS AND CONCLUSIONS

We compare our system with three baseline classifiers using standard n-grams, full MetaMap concepts, and filtered MetaMap concepts. Our system achieves high F-measures on multiple binary classifications of lymphoma (Burkitt lymphoma, 0.8; diffuse large B-cell lymphoma, 0.909; follicular lymphoma, 0.84; Hodgkin lymphoma, 0.912). Significance tests show that our system outperforms all three baselines. Moreover, feature analysis identifies subgraph features that contribute to improved performance; these features agree with the state-of-the-art knowledge about lymphoma classification. We also highlight how these unsupervised relation features may provide meaningful insights into lymphoma classification.

摘要

目的

病理报告中富含叙述性陈述,这些陈述编码了医学概念之间复杂的关系网络。这些关系通常被医生用于诊断推理,但通常需要手工规则或监督学习才能提取为预定义形式,以便进行计算疾病建模。我们旨在自动从叙述性文本中捕获关系,而无需监督。

方法

我们设计了一个新颖的框架,该框架将句子转换为图表示形式,自动挖掘句子子图,减少挖掘出的子图中的冗余,并自动生成子图特征,以用于后续的分类任务。为了确保对句子图进行有意义的解释,我们使用统一医学语言系统元词表将标记子序列映射到概念,进而映射到句子图节点。我们使用多个淋巴瘤分类任务来测试我们的系统,这些任务共同模拟病理学家的鉴别诊断。为此,我们防止分类器在文本中查看淋巴瘤的显式提及或同义词。

结果与结论

我们使用标准 n-gram、完整的 MetaMap 概念和过滤后的 MetaMap 概念,将我们的系统与三个基线分类器进行比较。我们的系统在多个淋巴瘤的二元分类(伯基特淋巴瘤,0.8;弥漫性大 B 细胞淋巴瘤,0.909;滤泡性淋巴瘤,0.84;霍奇金淋巴瘤,0.912)中取得了较高的 F 度量值。显著性检验表明,我们的系统优于所有三个基线。此外,特征分析确定了对子图特征的贡献,这些特征提高了性能;这些特征与关于淋巴瘤分类的最新知识一致。我们还强调了这些无监督关系特征如何为淋巴瘤分类提供有意义的见解。