Cardiac Imaging Department, Barts Heart Centre, St Bartholomew's Hospital, London, UK; Institute of Cardiovascular Science, University College London, London, UK.
Cardiac Imaging Department, Barts Heart Centre, St Bartholomew's Hospital, London, UK; William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, London, UK.
Lancet Digit Health. 2021 Jan;3(1):e20-e28. doi: 10.1016/S2589-7500(20)30267-3. Epub 2020 Dec 3.
Left ventricular maximum wall thickness (MWT) is central to diagnosis and risk stratification of hypertrophic cardiomyopathy, but human measurement is prone to variability. We developed an automated machine learning algorithm for MWT measurement and compared precision (reproducibility) with that of 11 international experts, using a dataset of patients with hypertrophic cardiomyopathy.
60 adult patients with hypertrophic cardiomyopathy, including those carrying hypertrophic cardiomyopathy gene mutations, were recruited at three institutes in the UK from August, 2018, to September, 2019: Barts Heart Centre, University College London Hospital (The Heart Hospital), and Leeds Teaching Hospitals NHS Trust. Participants had two cardiovascular magnetic resonance scans (test and retest) on the same day, ensuring no biological variability, using four cardiac MRI scanner models represented across two manufacturers and two field strengths. End-diastolic short-axis MWT was measured in test and retest by 11 international experts (from nine centres in six countries) and an automated machine learning method, which was trained to segment endocardial and epicardial contours on an independent, multicentre, multidisease dataset of 1923 patients. Machine learning MWT measurement was done with a method based on solving Laplace's equation. To assess test-retest reproducibility, we estimated the absolute test-retest MWT difference (precision), the coefficient of variation (CoV) for duplicate measurements, and the number of patients reclassified between test and retest according to different thresholds (MWT >15 mm and >30 mm). We calculated the sample size required to detect a prespecified MWT change between pairs of scans for machine learning and each expert.
1440 MWT measurements were analysed, corresponding to two scans from 60 participants by 12 observers (11 experts and machine learning). Experts differed in the MWT they measured, ranging from 14·9 mm (SD 4·2) to 19·0 mm (4·7; p<0·0001 for trend). Machine learning-measured mean MWT was 16·8 mm (4·1). Machine learning precision was superior, with a test-retest difference of 0·7 mm (0·6) compared with experts, who ranged from 1·1 mm (0·9) to 3·7 mm (2·0; p values for machine learning vs expert comparison ranging from <0·0001 to 0·0073) and a significantly lower CoV than for all experts (4·3% [95% CI 3·3-5·1] vs 5·7-12·1% across experts). On average, 38 (64%) patients were designated as having MWT greater than 15 mm by machine learning compared with 27 (45%) to 50 (83%) patients by experts; five (8%) patients were reclassified in test-retest by machine learning compared with four (7%) to 12 (20%) by experts. With a cutoff point of more than 30 mm for implantable cardioverter-defibrillator, three experts would have changed recommendations between tests a total of four times, but machine learning was consistent. Using machine learning, a clinical trial to detect a 2 mm MWT change would need 2·3 times (range 1·6-4·6) fewer patients.
In this preliminary study, machine learning MWT measurement in hypertrophic cardiomyopathy is superior to human experts with potential implications for diagnosis, risk stratification, and clinical trials.
European Regional Development Fund and Barts Charity.
左心室最大壁厚度(MWT)是肥厚型心肌病诊断和风险分层的核心,但人类测量容易出现变异性。我们开发了一种用于 MWT 测量的自动化机器学习算法,并使用肥厚型心肌病患者的数据集,比较了该算法与 11 名国际专家的精确性(可重复性)。
2018 年 8 月至 2019 年 9 月,英国的三个机构共招募了 60 名成年肥厚型心肌病患者,包括携带肥厚型心肌病基因突变的患者:巴茨心脏中心、伦敦大学学院医院(心脏医院)和利兹教学医院 NHS 信托。参与者在同一天进行了两次心血管磁共振扫描(测试和复测),使用四种代表两种制造商和两种场强的心脏 MRI 扫描仪模型,以确保没有生物学变异性。11 名国际专家(来自六个国家的九个中心)和一种自动化机器学习方法在测试和复测中测量了舒张末期短轴 MWT,该方法经过训练,可以在一个独立的、多中心、多疾病的 1923 名患者数据集上分割心内膜和心外膜轮廓。机器学习 MWT 测量是基于解决拉普拉斯方程的方法进行的。为了评估测试-复测的可重复性,我们估计了绝对的测试-复测 MWT 差异(精度)、重复测量的变异系数(CoV),以及根据不同的阈值(MWT>15mm 和>30mm)重新分类的患者数量。我们计算了为机器学习和每个专家检测对扫描对之间规定的 MWT 变化所需的样本量。
分析了 1440 次 MWT 测量结果,对应于 60 名参与者的两次扫描,由 12 名观察者(11 名专家和机器学习)进行。专家们测量的 MWT 不同,范围从 14.9mm(SD 4.2)到 19.0mm(4.7;趋势的 p 值<0.0001)。机器学习测量的平均 MWT 为 16.8mm(4.1)。机器学习的精度更高,与专家相比,测试-复测的差异为 0.7mm(0.6),而专家的差异范围为 1.1mm(0.9)至 3.7mm(2.0;机器学习与专家比较的 p 值范围从<0.0001 到 0.0073),并且明显低于所有专家的 CoV(4.3%[95%CI 3.3-5.1]与专家的 5.7-12.1%)。平均而言,与专家相比,有 38 名(64%)患者被机器学习诊断为 MWT 大于 15mm,而有 27 名(45%)至 50 名(83%)患者被专家诊断为 MWT 大于 15mm;与专家相比,有 5 名(8%)患者在测试-复测时被重新分类,而有 4 名(7%)至 12 名(20%)患者被专家重新分类。对于植入式心脏复律除颤器的截定点超过 30mm,有 3 名专家总共会有 4 次改变建议,但机器学习是一致的。使用机器学习,检测 2mm MWT 变化的临床试验需要 2.3 倍(范围 1.6-4.6)的患者数量。
在这项初步研究中,肥厚型心肌病的机器学习 MWT 测量优于人类专家,这可能对诊断、风险分层和临床试验有影响。
欧洲区域发展基金和巴茨慈善基金会。