Department of Radiation Oncology, University of Michigan, Ann Arbor.
School of Public Health, City University of New York, Hunter College, New York.
JAMA Oncol. 2016 Aug 1;2(8):1070-4. doi: 10.1001/jamaoncol.2016.1854.
Although poorly understood, there is heterogeneity in the molecular biology of cancer across race and ethnicities. The representation of racial minorities in large genomic sequencing efforts is unclear, and could have an impact on health care disparities.
To determine the racial distribution among samples sequenced within The Cancer Genome Atlas (TCGA) and the deficit of samples needed to detect moderately common mutational frequencies in racial minorities.
DESIGN, SETTING, AND PARTICIPANTS: This was a retrospective review of individual patient data from TCGA data portal accessed in July 2015. TCGA comprises samples from a wide array of institutions primarily across the United States. Samples from 10 of the 31 currently available tumor types were analyzed, comprising 5729 samples from the approximately 11 000 available.
Using the estimated median somatic mutational frequency, the samples needed beyond TCGA to detect a 10% and 5% mutational frequency over the background somatic mutation frequency were calculated for each tumor type by racial ethnicity.
Of the 5729 samples, 77% (n = 4389) were white, 12% (n = 660) were black, 3% (n = 173) were Asian, 3% (n = 149) were Hispanic, and less than 0.5% combined were from patients of Native Hawaiian, Pacific Islander, Alaskan Native, or American Indian decent. This overrepresents white patients compared with the US population and underrepresents primarily Asian and Hispanic patients. With a somatic mutational frequency of 0.7 (prostate cancer) to 9.9 (lung squamous cell cancer), all tumor types from white patients contained enough samples to detect a 10% mutational frequency. This is in contrast to all other racial ethnicities, for which group-specific mutations with 10% frequency would be detectable only for black patients with breast cancer. Group-specific mutations with 5% frequency would be undetectable in any racial minority, but detectable in white patients for all cancer types except lung (adenocarcinoma and squamous cell carcinoma) and colon cancer.
It is probable, but poorly understood, that ethnic diversity is related to the pathogenesis of cancer, and may have an impact on the generalizability of findings from TCGA to racial minorities. Despite the important benefits that continue to be gained from genomic sequencing, dedicated efforts are needed to avoid widening the already pervasive gap in health care disparities.
尽管对其分子生物学机制的了解仍不够充分,但癌症在不同种族和民族之间存在异质性。在大型基因组测序研究中,少数族裔的代表性尚不明确,这可能会对医疗保健差异产生影响。
确定在癌症基因组图谱(TCGA)中测序的样本的种族分布情况,以及检测少数族裔中中等常见突变频率所需的样本数量。
设计、环境和参与者:这是对 2015 年 7 月通过 TCGA 数据门户获取的个体患者数据的回顾性分析。TCGA 样本来自美国各地的众多机构。对 31 种现有肿瘤类型中的 10 种的样本进行了分析,共分析了大约 11000 个可用样本中的 5729 个样本。
使用估计的中位体细胞突变频率,按种族和民族计算了每种肿瘤类型检测到背景体细胞突变频率之上 10%和 5%突变频率所需的 TCGA 以外的样本数量。
在 5729 个样本中,77%(n=4389)为白人,12%(n=660)为黑人,3%(n=173)为亚洲人,3%(n=149)为西班牙裔,不到 0.5%为夏威夷原住民、太平洋岛民、阿拉斯加原住民或美洲印第安人。这与美国人口相比,白人患者的比例过高,而亚洲和西班牙裔患者的比例过低。对于体细胞突变频率为 0.7(前列腺癌)至 9.9(肺鳞状细胞癌)的所有肿瘤类型,白人患者的样本量足以检测到 10%的突变频率。这与其他所有种族和民族形成对比,在其他种族和民族中,只有黑人乳腺癌患者的群体特异性突变频率为 10%时才可检测到。任何少数族裔的群体特异性突变频率为 5%时都无法检测到,但除了肺癌(腺癌和鳞状细胞癌)和结肠癌之外,所有癌症类型的白人患者都可检测到。
尽管种族多样性与癌症的发病机制有关,但这一点可能尚未被充分了解,并且可能会对 TCGA 研究结果在少数族裔中的普遍性产生影响。尽管从基因组测序中继续获得重要益处,但仍需要做出专门努力,以避免扩大医疗保健差异方面已经普遍存在的差距。