Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, NH, USA.
Department of Microbiology and Immunology, Geisel School of Medicine at Dartmouth, Hanover, NH, USA.
Cell Syst. 2017 Jul 26;5(1):63-71.e6. doi: 10.1016/j.cels.2017.06.003. Epub 2017 Jul 12.
Cross-experiment comparisons in public data compendia are challenged by unmatched conditions and technical noise. The ADAGE method, which performs unsupervised integration with denoising autoencoder neural networks, can identify biological patterns, but because ADAGE models, like many neural networks, are over-parameterized, different ADAGE models perform equally well. To enhance model robustness and better build signatures consistent with biological pathways, we developed an ensemble ADAGE (eADAGE) that integrated stable signatures across models. We applied eADAGE to a compendium of Pseudomonas aeruginosa gene expression profiling experiments performed in 78 media. eADAGE revealed a phosphate starvation response controlled by PhoB in media with moderate phosphate and predicted that a second stimulus provided by the sensor kinase, KinB, is required for this PhoB activation. We validated this relationship using both targeted and unbiased genetic approaches. eADAGE, which captures stable biological patterns, enables cross-experiment comparisons that can highlight measured but undiscovered relationships.
在公共数据汇编中进行的交叉实验比较受到不匹配条件和技术噪声的挑战。ADAGE 方法通过与去噪自动编码器神经网络进行无监督集成,可以识别生物模式,但由于 ADAGE 模型(与许多神经网络一样)参数过多,因此不同的 ADAGE 模型表现同样出色。为了增强模型的稳健性并更好地构建与生物学途径一致的特征,我们开发了一种集成 ADAGE(eADAGE),该方法可以在模型之间集成稳定的特征。我们将 eADAGE 应用于在 78 种培养基中进行的铜绿假单胞菌基因表达谱实验的汇编中。eADAGE 揭示了在中等磷酸盐培养基中由 PhoB 控制的磷酸盐饥饿反应,并预测第二个由传感器激酶 KinB 提供的刺激对于这种 PhoB 激活是必需的。我们使用靶向和非靶向遗传方法验证了这种关系。eADAGE 可以捕获稳定的生物学模式,从而实现跨实验比较,突出已测量但未发现的关系。