Kitsos Christine M, Bhamidipati Phani, Melnikova Irena, Cash Ethan P, McNulty Chris, Furman Julia, Cima Michael J, Levinson Douglas
Transform Pharmaceuticals, Incorporated, A Unit of Johnson & Johnson, 29 Hartwell Ave., Lexington, Massachusetts 02421, USA.
Cytometry A. 2007 Jan;71(1):16-27. doi: 10.1002/cyto.a.20353.
This study examined whether hierarchical clustering could be used to detect cell states induced by treatment combinations that were generated through automation and high-throughput (HT) technology. Data-mining techniques were used to analyze the large experimental data sets to determine whether nonlinear, non-obvious responses could be extracted from the data.
Unary, binary, and ternary combinations of pharmacological factors (examples of stimuli) were used to induce differentiation of HL-60 cells using a HT automated approach. Cell profiles were analyzed by incorporating hierarchical clustering methods on data collected by flow cytometry. Data-mining techniques were used to explore the combinatorial space for nonlinear, unexpected events. Additional small-scale, follow-up experiments were performed on cellular profiles of interest.
Multiple, distinct cellular profiles were detected using hierarchical clustering of expressed cell-surface antigens. Data-mining of this large, complex data set retrieved cases of both factor dominance and cooperativity, as well as atypical cellular profiles. Follow-up experiments found that treatment combinations producing "atypical cell types" made those cells more susceptible to apoptosis. CONCLUSIONS Hierarchical clustering and other data-mining techniques were applied to analyze large data sets from HT flow cytometry. From each sample, the data set was filtered and used to define discrete, usable states that were then related back to their original formulations. Analysis of resultant cell populations induced by a multitude of treatments identified unexpected phenotypes and nonlinear response profiles.
本研究探讨了层次聚类是否可用于检测由通过自动化和高通量(HT)技术生成的治疗组合所诱导的细胞状态。使用数据挖掘技术分析大型实验数据集,以确定是否可以从数据中提取非线性、不明显的反应。
使用药理学因素(刺激示例)的一元、二元和三元组合,采用HT自动化方法诱导HL-60细胞分化。通过对流式细胞术收集的数据应用层次聚类方法来分析细胞图谱。使用数据挖掘技术探索非线性、意外事件的组合空间。对感兴趣的细胞图谱进行了额外的小规模后续实验。
使用表达的细胞表面抗原的层次聚类检测到多个不同的细胞图谱。对这个大型复杂数据集的数据挖掘检索到了因子优势和协同作用的案例,以及非典型细胞图谱。后续实验发现,产生“非典型细胞类型”的治疗组合使这些细胞更容易发生凋亡。结论:层次聚类和其他数据挖掘技术被应用于分析来自HT流式细胞术的大型数据集。从每个样本中,对数据集进行筛选并用于定义离散的、可用的状态,然后将这些状态与其原始配方相关联。对多种处理诱导的所得细胞群体的分析确定了意外的表型和非线性反应图谱。