De Marchi Tommaso, Liu Ning Qing, Sting Christoph, Smid Marcel, Tjoa Mila, Braakman René B H, Luider Theo M, Foekens John A, Martens John W M, Umar Arzu
Department of Medical Oncology, Erasmus MC Cancer Institute, University Medical Center, Rotterdam, The Netherlands.
Department of Neurology, Erasmus MC, University Medical Center, Rotterdam, The Netherlands.
Data Brief. 2015 Oct 8;5:399-402. doi: 10.1016/j.dib.2015.09.034. eCollection 2015 Dec.
We here describe two proteomic datasets deposited in ProteomeXchange via PRIDE partner repository [1] with dataset identifiers PXD000484 (defined as "training") and PXD000485 (defined as "test") that have been used for the development of a tamoxifen outcome predictive signature [2]. Both datasets comprised 56 fresh frozen estrogen receptor (ER) positive primary breast tumor specimens derived from patients who received tamoxifen as first line therapy for recurrent disease. Patient groups were defined based on time to progression (TTP) after start of tamoxifen therapy (6 months cutoff): 32 good and 24 poor treatment outcome patients were comprised in the training set, respectively. The test set included 41 good and 15 poor treatment outcome patients. All specimens were subjected to laser capture microdissection (LCM) to enrich for epithelial tumor cells prior to high resolution mass spectrometric (MS) analysis. Protein identification and label-free quantification (LFQ) were performed with MaxQuant software package [3]. A total of 3109 and 4061 proteins were identified and quantified in the training and test set, respectively. We here present the first public proteomic dataset analyzing ER positive recurrent breast cancer by LCM coupled to high resolution MS.
我们在此描述了两个通过PRIDE合作伙伴库存入ProteomeXchange的蛋白质组学数据集[1],数据集标识符分别为PXD000484(定义为“训练集”)和PXD000485(定义为“测试集”),它们已被用于开发他莫昔芬疗效预测特征[2]。这两个数据集均包含56例新鲜冷冻的雌激素受体(ER)阳性原发性乳腺肿瘤标本,这些标本来自接受他莫昔芬作为复发性疾病一线治疗的患者。根据他莫昔芬治疗开始后的疾病进展时间(TTP)(以6个月为界)定义患者组:训练集中分别包含32例治疗效果良好和24例治疗效果不佳的患者。测试集包括41例治疗效果良好和15例治疗效果不佳的患者。所有标本在进行高分辨率质谱(MS)分析之前,均经过激光捕获显微切割(LCM)以富集上皮肿瘤细胞。使用MaxQuant软件包[3]进行蛋白质鉴定和无标记定量(LFQ)。训练集和测试集分别鉴定和定量了总共3109种和4061种蛋白质。我们在此展示了首个通过LCM结合高分辨率MS分析ER阳性复发性乳腺癌的公开蛋白质组学数据集。