Allen Institute for Artificial Intelligence, Seattle, Washington.
University of South Alabama College of Medicine, Mobile.
JAMA Netw Open. 2019 Jul 3;2(7):e196700. doi: 10.1001/jamanetworkopen.2019.6700.
Analyses of female representation in clinical studies have been limited in scope and scale.
To perform a large-scale analysis of global enrollment sex bias in clinical studies.
DESIGN, SETTING, AND PARTICIPANTS: In this cross-sectional study, clinical studies from published articles from PubMed from 1966 to 2018 and records from Aggregate Analysis of ClinicalTrials.gov from 1999 to 2018 were identified. Global disease prevalence was determined for male and female patients in 11 disease categories from the Global Burden of Disease database: cardiovascular, diabetes, digestive, hepatitis (types A, B, C, and E), HIV/AIDS, kidney (chronic), mental, musculoskeletal, neoplasms, neurological, and respiratory (chronic). Machine reading algorithms were developed that extracted sex data from tables in articles and records on December 31, 2018, at an artificial intelligence research institute. Male and female participants in 43 135 articles (792 004 915 participants) and 13 165 records (12 977 103 participants) were included.
Sex bias was defined as the difference between the fraction of female participants in study participants minus prevalence fraction of female participants for each disease category. A total of 1000 bootstrap estimates of sex bias were computed by resampling individual studies with replacement. Sex bias was reported as mean and 95% bootstrap confidence intervals from articles and records in each disease category over time (before or during 1993 to 2018), with studies or participants as the measurement unit.
There were 792 004 915 participants, including 390 470 834 female participants (49%), in articles and 12 977 103 participants, including 6 351 619 female participants (49%), in records. With studies as measurement unit, substantial female underrepresentation (sex bias ≤ -0.05) was observed in 7 of 11 disease categories, especially HIV/AIDS (mean for articles, -0.17 [95% CI, -0.18 to -0.16]), chronic kidney diseases (mean, -0.17 [95% CI, -0.17 to -0.16]), and cardiovascular diseases (mean, -0.14 [95% CI, -0.14 to -0.13]). Sex bias in articles for all categories combined was unchanged over time with studies as measurement unit (range, -0.15 [95% CI, -0.16 to -0.13] to -0.10 [95% CI, -0.14 to -0.06]), but improved from before or during 1993 (mean, -0.11 [95% CI, -0.16 to -0.05]) to 2014 to 2018 (mean, -0.05 [95% CI, -0.09 to -0.02]) with participants as the measurement unit. Larger study size was associated with greater female representation.
Automated extraction of the number of participants in clinical reports provides an effective alternative to manual analysis of demographic bias. Despite legal and policy initiatives to increase female representation, sex bias against female participants in clinical studies persists. Studies with more participants have greater female representation. Differences between sex bias estimates with studies vs participants as measurement unit, and between articles vs records, suggest that sex bias with both measures and data sources should be reported.
对临床研究中女性代表性的分析在范围和规模上都受到限制。
对全球临床研究中性别招募偏差进行大规模分析。
设计、地点和参与者:在这项横断面研究中,从 1966 年至 2018 年的 PubMed 发表文章和 1999 年至 2018 年的 Aggregate Analysis of ClinicalTrials.gov 记录中确定了临床研究。使用全球疾病负担数据库中的 11 种疾病类别(心血管疾病、糖尿病、消化系统疾病、肝炎(A、B、C 和 E 型)、艾滋病毒/艾滋病、慢性肾脏疾病、精神、肌肉骨骼、肿瘤、神经和慢性呼吸道疾病)来确定男性和女性患者的全球患病率。2018 年 12 月 31 日,在人工智能研究所,开发了从文章和记录中的表格中提取性别数据的机器阅读算法。在 43135 篇文章(792004915 名参与者)和 13165 份记录(12977103 名参与者)中包括男性和女性参与者。
性别偏差定义为研究参与者中女性参与者的分数减去每个疾病类别的女性参与者的流行分数之间的差异。通过对个体研究进行有放回的重新采样,共计算了 1000 个性别偏差的bootstrap 估计值。性别偏差以文章和记录中每个疾病类别的平均值和 95%bootstrap 置信区间表示(1993 年之前或期间至 2018 年),以研究或参与者为测量单位。
文章中有 792004915 名参与者,包括 390470834 名女性参与者(49%),记录中有 12977103 名参与者,包括 6351619 名女性参与者(49%)。以研究为测量单位,在 11 种疾病类别中有 7 种存在明显的女性代表性不足(性别偏差≤-0.05),尤其是艾滋病毒/艾滋病(文章平均值为-0.17 [95%置信区间为-0.18 至-0.16])、慢性肾脏疾病(平均值为-0.17 [95%置信区间为-0.17 至-0.16])和心血管疾病(平均值为-0.14 [95%置信区间为-0.14 至-0.13])。以研究为测量单位,所有类别的文章性别偏差随时间变化不变(范围为-0.15 [95%置信区间为-0.16 至-0.13]至-0.10 [95%置信区间为-0.14 至-0.06]),但从 1993 年之前或期间(平均值为-0.11 [95%置信区间为-0.16 至-0.05])到 2014 年至 2018 年(平均值为-0.05 [95%置信区间为-0.09 至-0.02]),随着参与者的测量单位,性别偏差有所改善。研究规模越大,女性代表人数就越多。
对临床报告中参与者数量的自动提取为分析人口统计学偏差提供了一种有效的替代方法。尽管有法律和政策举措来增加女性代表性,但临床研究中对女性参与者的性别偏见仍然存在。参与者较多的研究具有更大的女性代表性。以研究和参与者为测量单位的性别偏差估计值之间以及文章和记录之间的差异表明,应报告这两种测量方法和数据源的性别偏差。