Genome Science and Technology Graduate Program, University of British Columbia, Vancouver, BC, V6T1Z4, Canada.
Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T1Z4, Canada.
Sci Rep. 2021 Sep 2;11(1):17624. doi: 10.1038/s41598-021-97005-z.
The Connectivity Map (CMap) is a popular resource designed for data-driven drug repositioning using a large transcriptomic compendium. However, evaluations of its performance are limited. We used two iterations of CMap (CMap 1 and 2) to assess their comparability and reliability. We queried CMap 2 with CMap 1-derived signatures, expecting CMap 2 would highly prioritize the queried compounds; the success rate was 17%. Analysis of previously published prioritizations yielded similar results. Low recall is caused by low differential expression (DE) reproducibility both between CMaps and within each CMap. DE strength was predictive of reproducibility, and is influenced by compound concentration and cell-line responsiveness. Reproducibility of CMap 2 sample expression levels was also lower than expected. We attempted to identify the "better" CMap by comparison with a third dataset, but they were mutually discordant. Our findings have implications for CMap usage and we suggest steps for investigators to limit false positives.
连接图谱(CMap)是一个受欢迎的资源,旨在使用大型转录组汇进行数据驱动的药物重定位。然而,对其性能的评估是有限的。我们使用了两个版本的 CMap(CMap1 和 CMap2)来评估它们的可比性和可靠性。我们使用 CMap1 衍生的特征来查询 CMap2,预计 CMap2 将高度优先考虑查询的化合物;成功率为 17%。对先前发表的优先级排序的分析得出了类似的结果。召回率低是由于 CMaps 之间以及每个 CMap 内部的差异表达(DE)重现性低所致。DE 强度可预测重现性,并受化合物浓度和细胞系反应性的影响。CMap2 样本表达水平的重现性也低于预期。我们试图通过与第三个数据集进行比较来确定“更好”的 CMap,但它们相互不一致。我们的发现对 CMap 的使用有影响,我们建议研究人员采取措施限制假阳性。