Ansoborlo Marie, Gaborit Christophe, Grammatico-Guillon Leslie, Cuggia Marc, Bouzille Guillaume
Medicine, 56555Université de Tours Faculté de Médecine, Tours, France.
Medecine, 27079Université de Rennes, Rennes, France.
Health Informatics J. 2023 Jan-Mar;29(1):14604582221146709. doi: 10.1177/14604582221146709.
Defining profiles of patients that could benefit from relevant anti-cancer treatments is essential. An increasing number of specific criteria are necessary to be eligible to specific anti-cancer therapies. This study aimed to develop an automated algorithm able to detect patient and tumor characteristics to reduce the time-consuming prescreening for trial inclusions without delay. Hence, 640 anonymized multidisciplinary team meetings (MTM) reports concerning lung cancers from one French teaching hospital data warehouse between 2018 and 2020 were annotated. To automate the extraction of eight major eligibility criteria, corresponding to 52 classes, regular expressions were implemented. The RegEx's evaluation gave a F1-score of 93% in average, a positive predictive value (precision) of 98% and sensitivity (recall) of 92%. However, in MTM, fill rates variabilities among patient and tumor information remained important (from 31% to 100%). Genetic mutations and rearrangement test results were the least reported characteristics and also the hardest to automatically extract. To ease prescreening in clinical trials, the PreScIOUs study demonstrated the additional value of rule based and machine learning based methods applied on lung cancer MTM reports.
确定可能从相关抗癌治疗中获益的患者特征至关重要。越来越多的特定标准是符合特定抗癌疗法的必要条件。本研究旨在开发一种自动化算法,能够检测患者和肿瘤特征,以减少耗时的试验纳入预筛选而不延误。因此,对来自一家法国教学医院数据仓库2018年至2020年间的640份关于肺癌的匿名多学科团队会议(MTM)报告进行了注释。为了自动提取对应52个类别的八个主要入选标准,实施了正则表达式。正则表达式的评估平均F1分数为93%,阳性预测值(精确率)为98%,敏感度(召回率)为92%。然而,在多学科团队会议中,患者和肿瘤信息的填写率差异仍然很大(从31%到100%)。基因突变和重排检测结果是报告最少的特征,也是最难自动提取的特征。为了简化临床试验中的预筛选,“PreScIOUs”研究证明了基于规则和基于机器学习的方法应用于肺癌多学科团队会议报告的附加价值。