• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过数据挖掘病理报告分析原发性和复发性乳腺癌中的激素受体状态

Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports.

作者信息

Chang Kai-Po, Chu Yen-Wei, Wang John

机构信息

Department of Pathology, China Medical University Hospital, Taichung 404, Taiwan.

Ph.D. Program in Medical Biotechnology, National Chung Hsing University, Taichung 402, Taiwan.

出版信息

Open Med (Wars). 2019 Feb 20;14:91-98. doi: 10.1515/med-2019-0013. eCollection 2019.

DOI:10.1515/med-2019-0013
PMID:30847396
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6401490/
Abstract

BACKGROUND

Hormone receptors of breast cancer, such as estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (Her-2), are important prognostic factors for breast cancer.

OBJECTIVE

The current study aimed to develop a method to retrieve the statistics of hormone receptor expression status, documented in pathology reports, given their importance in research for primary and recurrent breast cancer, and quality management of pathology laboratories.

METHOD

A two-stage text mining approach via regular expression-based word/phrase matching, was developed to retrieve the data.

RESULTS

The method achieved a sensitivity of 98.8%, 98.7% and 98.4% for extraction of ER, PR, and Her-2 results. The hormone expression status from 3679 primary and 44 recurrent breast cancer cases was successfully retrieved with the method. Statistical analysis of these data showed that the recurrent disease had a significantly lower positivity rate for ER (54.5% vs 76.5%, p=0.001278) than primary breast cancer and a higher positivity rate for Her-2 (48.8% vs 16.2%, p=9.79e-8). These results corroborated the previous literature.

CONCLUSION

Text mining on pathology reports using the developed method may benefit research of primary and recurrent breast cancer.

摘要

背景

乳腺癌的激素受体,如雌激素受体(ER)、孕激素受体(PR)和人表皮生长因子受体2(Her-2),是乳腺癌重要的预后因素。

目的

鉴于激素受体表达状态在原发性和复发性乳腺癌研究以及病理实验室质量管理中的重要性,本研究旨在开发一种方法,以获取病理报告中记录的激素受体表达状态统计数据。

方法

通过基于正则表达式的词/短语匹配开发了一种两阶段文本挖掘方法来检索数据。

结果

该方法提取ER、PR和Her-2结果的灵敏度分别为98.8%、98.7%和98.4%。使用该方法成功检索到3679例原发性和44例复发性乳腺癌病例的激素表达状态。对这些数据的统计分析表明,复发性疾病的ER阳性率(54.5%对76.5%,p = 0.001278)显著低于原发性乳腺癌,而Her-2阳性率(48.8%对16.2%,p = 9.79e-8)高于原发性乳腺癌。这些结果证实了先前的文献报道。

结论

使用所开发的方法对病理报告进行文本挖掘可能有助于原发性和复发性乳腺癌的研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/7907ef40d723/med-14-091-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/0106e78a922f/med-14-091-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/5a5a39e9067c/med-14-091-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/f35491b74a79/med-14-091-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/878dba669d9f/med-14-091-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/43740d8f947a/med-14-091-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/7907ef40d723/med-14-091-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/0106e78a922f/med-14-091-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/5a5a39e9067c/med-14-091-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/f35491b74a79/med-14-091-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/878dba669d9f/med-14-091-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/43740d8f947a/med-14-091-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17b3/6401490/7907ef40d723/med-14-091-g006.jpg

相似文献

1
Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports.通过数据挖掘病理报告分析原发性和复发性乳腺癌中的激素受体状态
Open Med (Wars). 2019 Feb 20;14:91-98. doi: 10.1515/med-2019-0013. eCollection 2019.
2
Development of a Novel Tool for the Retrieval and Analysis of Hormone Receptor Expression Characteristics in Metastatic Breast Cancer via Data Mining on Pathology Reports.通过对病理报告进行数据挖掘,开发一种用于检索和分析转移性乳腺癌激素受体表达特征的新型工具。
Biomed Res Int. 2020 May 23;2020:2654815. doi: 10.1155/2020/2654815. eCollection 2020.
3
The variation and clinical significance of hormone receptors and Her-2 status from primary to metastatic lesions in breast cancer patients.乳腺癌患者从原发性病灶到转移病灶的激素受体及人表皮生长因子受体2(Her-2)状态的变化及其临床意义
Tumour Biol. 2016 Jun;37(6):7675-84. doi: 10.1007/s13277-015-4649-7. Epub 2015 Dec 21.
4
Quantitative association between HER-2/neu and steroid hormone receptors in hormone receptor-positive primary breast cancer.激素受体阳性原发性乳腺癌中HER-2/neu与类固醇激素受体之间的定量关联
J Natl Cancer Inst. 2003 Jan 15;95(2):142-53. doi: 10.1093/jnci/95.2.142.
5
Gene expression profiling for guiding adjuvant chemotherapy decisions in women with early breast cancer: an evidence-based and economic analysis.用于指导早期乳腺癌女性辅助化疗决策的基因表达谱分析:基于证据的经济分析
Ont Health Technol Assess Ser. 2010;10(23):1-57. Epub 2010 Dec 1.
6
Hormone- and HER2-receptor assessment in 33,046 breast cancer patients: a nationwide comparison of positivity rates between pathology laboratories in the Netherlands.33046 例乳腺癌患者的激素和 HER2 受体检测:荷兰各病理实验室间阳性率的全国性比较。
Breast Cancer Res Treat. 2019 Jun;175(2):487-497. doi: 10.1007/s10549-019-05180-5. Epub 2019 Mar 1.
7
Estrogen receptor, progesterone receptor, and Her 2 Neu positivity and its association with tumour characteristics and menopausal status in a breast cancer cohort from northern Pakistan.巴基斯坦北部乳腺癌队列中雌激素受体、孕激素受体和人表皮生长因子受体2(Her 2 Neu)的阳性表达及其与肿瘤特征和绝经状态的关联
Ecancermedicalscience. 2012;6:283. doi: 10.3332/ecancer.2012.283. Epub 2012 Dec 11.
8
[Prognostic value of estrogen receptor, progesterone receptor and human epidermal growth factor receptor-2 in node positive breast cancer patients treated by mastectomy].[雌激素受体、孕激素受体及人表皮生长因子受体-2在接受乳房切除术的淋巴结阳性乳腺癌患者中的预后价值]
Zhonghua Zhong Liu Za Zhi. 2010 Jul;32(7):520-5.
9
Estrogen and progesterone receptor concordance between primary and recurrent breast cancer.原发性与复发性乳腺癌之间雌激素和孕激素受体的一致性
J Surg Oncol. 1994 Oct;57(2):71-7. doi: 10.1002/jso.2930570202.
10
Alterations in three biomarkers (estrogen receptor, progesterone receptor and human epidermal growth factor 2) and the Ki67 index between primary and metastatic breast cancer lesions.原发性和转移性乳腺癌病灶之间三种生物标志物(雌激素受体、孕激素受体和人表皮生长因子2)及Ki67指数的改变。
Biomed Rep. 2017 Dec;7(6):535-542. doi: 10.3892/br.2017.1003. Epub 2017 Oct 19.

引用本文的文献

1
Leveraging natural language processing for efficient information extraction from breast cancer pathology reports: Single-institution study.利用自然语言处理技术从乳腺癌病理报告中高效提取信息:单机构研究。
PLoS One. 2025 Feb 18;20(2):e0318726. doi: 10.1371/journal.pone.0318726. eCollection 2025.
2
Rule-Based Information Extraction from Free-Text Pathology Reports Reveals Trends in South African Female Breast Cancer Molecular Subtypes and Ki67 Expression.基于规则的自由文本病理学报告信息提取揭示了南非女性乳腺癌分子亚型和 Ki67 表达的趋势。
Biomed Res Int. 2022 Jan 20;2022:6157861. doi: 10.1155/2022/6157861. eCollection 2022.
3

本文引用的文献

1
Breast Cancer-Major changes in the American Joint Committee on Cancer eighth edition cancer staging manual.乳腺癌——美国癌症联合委员会第八版癌症分期手册的重大变化。
CA Cancer J Clin. 2017 Jul 8;67(4):290-303. doi: 10.3322/caac.21393. Epub 2017 Mar 14.
2
Corpus domain effects on distributional semantic modeling of medical terms.语料库领域对医学术语分布语义建模的影响。
Bioinformatics. 2016 Dec 1;32(23):3635-3644. doi: 10.1093/bioinformatics/btw529. Epub 2016 Aug 16.
3
Implementation and use of electronic synoptic cancer reporting: an explorative case study of six Norwegian pathology laboratories.
Weighing and modelling factors influencing serum cortisol and melatonin concentration among workers that are exposed to various sound pressure levels using neural network algorithm: An empirical study.
使用神经网络算法对暴露于不同声压水平的工人血清皮质醇和褪黑素浓度的影响因素进行加权和建模:一项实证研究。
Heliyon. 2020 Sep 28;6(9):e05044. doi: 10.1016/j.heliyon.2020.e05044. eCollection 2020 Sep.
4
Development of a Novel Tool for the Retrieval and Analysis of Hormone Receptor Expression Characteristics in Metastatic Breast Cancer via Data Mining on Pathology Reports.通过对病理报告进行数据挖掘,开发一种用于检索和分析转移性乳腺癌激素受体表达特征的新型工具。
Biomed Res Int. 2020 May 23;2020:2654815. doi: 10.1155/2020/2654815. eCollection 2020.
5
MicroRNAs and Epigenetics Strategies to Reverse Breast Cancer.微小 RNA 与表观遗传学策略逆转乳腺癌。
Cells. 2019 Oct 8;8(10):1214. doi: 10.3390/cells8101214.
电子概要癌症报告的实施与应用:对六个挪威病理实验室的探索性案例研究
Implement Sci. 2014 Aug 20;9:111. doi: 10.1186/s13012-014-0111-2.
4
Automatic lymphoma classification with sentence subgraph mining from pathology reports.基于病理报告中的句子子图挖掘进行自动淋巴瘤分类。
J Am Med Inform Assoc. 2014 Sep-Oct;21(5):824-32. doi: 10.1136/amiajnl-2013-002443. Epub 2014 Jan 15.
5
Recommendations for human epidermal growth factor receptor 2 testing in breast cancer: American Society of Clinical Oncology/College of American Pathologists clinical practice guideline update.人表皮生长因子受体 2 检测在乳腺癌中的应用:美国临床肿瘤学会/美国病理学家学会临床实践指南更新。
J Clin Oncol. 2013 Nov 1;31(31):3997-4013. doi: 10.1200/JCO.2013.50.9984. Epub 2013 Oct 7.
6
Information Extraction for Clinical Data Mining: A Mammography Case Study.用于临床数据挖掘的信息提取:一个乳腺钼靶摄影的案例研究
Proc IEEE Int Conf Data Min. 2009:37-42. doi: 10.1109/icdmw.2009.63.
7
American society of clinical oncology/college of american pathologists guideline recommendations for immunohistochemical testing of estrogen and progesterone receptors in breast cancer.美国临床肿瘤学会/美国病理学家学会关于乳腺癌雌激素和孕激素受体免疫组织化学检测的指南建议。
J Oncol Pract. 2010 Jul;6(4):195-7. doi: 10.1200/JOP.777003. Epub 2010 Jun 23.
8
Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.梅奥临床文本分析和知识提取系统(cTAKES):架构、组件评估和应用。
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):507-13. doi: 10.1136/jamia.2009.001560.
9
The "meaningful use" regulation for electronic health records.电子健康记录的“有意义使用”规定。
N Engl J Med. 2010 Aug 5;363(6):501-4. doi: 10.1056/NEJMp1006114. Epub 2010 Jul 13.
10
Recommendations for validating estrogen and progesterone receptor immunohistochemistry assays.雌激素和孕激素受体免疫组织化学检测验证的建议。
Arch Pathol Lab Med. 2010 Jun;134(6):930-5. doi: 10.5858/134.6.930.