Department of Statistics, University of Warwick, Coventry, CV4 7AL, United Kingdom.
University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.
Br J Radiol. 2024 Jan 23;97(1153):98-112. doi: 10.1093/bjr/tqad023.
To build a data set capturing the whole breast cancer screening journey from individual breast cancer screening records to outcomes and assess data quality.
Routine screening records (invitation, attendance, test results) from all 79 English NHS breast screening centres between January 1, 1988 and March 31, 2018 were linked to cancer registry (cancer characteristics and treatment) and national mortality data. Data quality was assessed using comparability, validity, timeliness, and completeness.
Screening records were extracted from 76/79 English breast screening centres, 3/79 were not possible due to software issues. Data linkage was successful from 1997 after introduction of a universal identifier for women (NHS number). Prior to 1997 outcome data are incomplete due to linkage issues, reducing validity. Between January 1, 1997 and March 31, 2018, a total of 11 262 730 women were offered screening of whom 9 371 973 attended at least one appointment, with 139 million person-years of follow-up (a median of 12.4 person years for each woman included) with 73 810 breast cancer deaths and 1 111 139 any-cause deaths. Comparability to reference data sets and internal validity were demonstrated. Data completeness was high for core screening variables (>99%) and main cancer outcomes (>95%).
The ATHENA-M project has created a large high-quality and representative data set of individual women's screening trajectories and outcomes in England from 1997 to 2018, data before 1997 are lower quality.
This is the most complete data set of English breast screening records and outcomes constructed to date, which can be used to evaluate and optimize screening.
构建一个数据集,涵盖从个体乳腺癌筛查记录到结局的整个乳腺癌筛查过程,并评估数据质量。
将 1988 年 1 月 1 日至 2018 年 3 月 31 日期间所有 79 家英国国民保健署(NHS)乳腺筛查中心的常规筛查记录(邀请、出席、检查结果)与癌症登记处(癌症特征和治疗)以及国家死亡率数据进行链接。使用可比性、有效性、及时性和完整性来评估数据质量。
从 76/79 家英国乳腺筛查中心提取了筛查记录,由于软件问题,有 3 家无法进行。1997 年引入女性通用标识符(NHS 号码)后,数据链接成功。1997 年之前,由于链接问题,结局数据不完整,降低了有效性。在 1997 年 1 月 1 日至 2018 年 3 月 31 日期间,共有 1126.273 万名女性接受了筛查,其中 937.1973 万名女性至少参加了一次预约,随访了 1.39 亿人年(中位数为每位女性 12.4 人年),有 73810 名女性死于乳腺癌,1111139 名女性死于任何原因。与参考数据集和内部有效性进行了比较。核心筛查变量(>99%)和主要癌症结局(>95%)的数据完整性很高。
ATHENA-M 项目创建了一个高质量、有代表性的大型数据集,涵盖了 1997 年至 2018 年期间英格兰女性的个体筛查轨迹和结局,1997 年之前的数据质量较低。
这是迄今为止构建的最完整的英国乳腺筛查记录和结局数据集,可用于评估和优化筛查。