National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas, USA.
Nat Biotechnol. 2010 Aug;28(8):827-38. doi: 10.1038/nbt.1665. Epub 2010 Jul 30.
Gene expression data from microarrays are being applied to predict preclinical and clinical endpoints, but the reliability of these predictions has not been established. In the MAQC-II project, 36 independent teams analyzed six microarray data sets to generate predictive models for classifying a sample with respect to one of 13 endpoints indicative of lung or liver toxicity in rodents, or of breast cancer, multiple myeloma or neuroblastoma in humans. In total, >30,000 models were built using many combinations of analytical methods. The teams generated predictive models without knowing the biological meaning of some of the endpoints and, to mimic clinical reality, tested the models on data that had not been used for training. We found that model performance depended largely on the endpoint and team proficiency and that different approaches generated models of similar performance. The conclusions and recommendations from MAQC-II should be useful for regulatory agencies, study committees and independent investigators that evaluate methods for global gene expression analysis.
基因表达数据的微阵列正在被应用于预测临床前和临床终点,但这些预测的可靠性尚未确定。在 MAQC-II 项目中,36 个独立的团队分析了六个微阵列数据集,以生成预测模型,用于将样本分类为 13 个终点之一,这些终点指示了啮齿动物的肺或肝毒性,或人类的乳腺癌、多发性骨髓瘤或神经母细胞瘤。总共使用了许多分析方法的组合来构建超过 30000 个模型。团队在不知道一些终点的生物学意义的情况下生成了预测模型,并且为了模拟临床现实,在没有用于训练的数据上测试了模型。我们发现,模型性能在很大程度上取决于终点和团队的熟练程度,并且不同的方法生成了性能相似的模型。MAQC-II 的结论和建议应该对评估全球基因表达分析方法的监管机构、研究委员会和独立研究人员有用。