Suppr超能文献

基于自然语言处理和机器学习的乳腺癌病历预后分期预测的信息提取。

Information extraction for prognostic stage prediction from breast cancer medical records using NLP and ML.

机构信息

School of Computer Engineering and Technology, MIT World Peace University, Pune, India, 411029.

Department of Computer Engineering and Information Technology, College of Engineering, Pune, 411005, India.

出版信息

Med Biol Eng Comput. 2021 Sep;59(9):1751-1772. doi: 10.1007/s11517-021-02399-7. Epub 2021 Jul 23.

Abstract

For cancer prediction, the prognostic stage is the main factor that helps medical experts to decide the optimal treatment for a patient. Specialists study prognostic stage information from medical reports, often in an unstructured form, and take a larger review time. The main objective of this study is to suggest a generic clinical decision-unifying staging method to extract the most reliable prognostic stage information of breast cancer from medical records of various health institutions. Additional prognostic elements should be extracted from medical reports to identify the cancer stage for getting an exact measure of cancer and improving care quality. This study has collected 465 pathological and clinical reports of breast cancer sufferers from India's reputed medical institutions. The unstructured records were found distinct from each institute. Anatomic and biologic factors are extracted from medical records using the natural language processing, machine learning and rule-based method for prognostic stage detection. This study has extracted anatomic stage, grade, estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) from medical reports with high accuracy and predicted prognostic stage for both regions. The prognostic stage prediction's average accuracy is found 92% and 82% in rural and urban areas, respectively. It was essential to combine biological and anatomical elements under a single prognostic staging method. A generic clinical decision-unifying staging method for prognostic stage detection with great accuracy in various institutions of different regional areas suggests that the proposed research improves the prognosis of breast cancer.

摘要

对于癌症预测,预后分期是帮助医学专家为患者确定最佳治疗方案的主要因素。专家从医疗报告中研究预后分期信息,这些信息通常以非结构化的形式呈现,需要花费更多的时间进行审查。本研究的主要目的是提出一种通用的临床决策统一分期方法,从不同医疗机构的病历中提取乳腺癌最可靠的预后分期信息。还应从医疗报告中提取额外的预后因素,以确定癌症分期,从而更准确地衡量癌症并提高护理质量。本研究从印度知名医疗机构收集了 465 份乳腺癌患者的病理和临床报告。未结构化的记录在每个机构之间存在明显差异。使用自然语言处理、机器学习和基于规则的方法从医疗记录中提取解剖和生物学因素,以检测预后分期。本研究从医疗报告中以高精度提取了解剖分期、分级、雌激素受体 (ER)、孕激素受体 (PR) 和人表皮生长因子受体 2 (HER2),并对两个地区的预后分期进行了预测。在农村和城市地区,预后分期预测的平均准确率分别为 92%和 82%。将生物学和解剖学因素结合在单一的预后分期方法中是至关重要的。本研究提出了一种通用的临床决策统一分期方法,用于在不同地区的不同医疗机构中进行预后分期检测,具有很高的准确性,表明所提出的研究提高了乳腺癌的预后。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验