Center for Bioimage Informatics and Department of Biomedical Engineering, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
Proc Natl Acad Sci U S A. 2010 Feb 16;107(7):2944-9. doi: 10.1073/pnas.0912090107. Epub 2010 Feb 1.
Many proteins or other biological macromolecules are localized to more than one subcellular structure. The fraction of a protein in different cellular compartments is often measured by colocalization with organelle-specific fluorescent markers, requiring availability of fluorescent probes for each compartment and acquisition of images for each in conjunction with the macromolecule of interest. Alternatively, tailored algorithms allow finding particular regions in images and quantifying the amount of fluorescence they contain. Unfortunately, this approach requires extensive hand-tuning of algorithms and is often cell type-dependent. Here we describe a machine-learning approach for estimating the amount of fluorescent signal in different subcellular compartments without hand tuning, requiring only the acquisition of separate training images of markers for each compartment. In testing on images of cells stained with mixtures of probes for different organelles, we achieved a 93% correlation between estimated and expected amounts of probes in each compartment. We also demonstrated that the method can be used to quantify drug-dependent protein translocations. The method enables automated and unbiased determination of the distributions of protein across cellular compartments, and will significantly improve imaging-based high-throughput assays and facilitate proteome-scale localization efforts.
许多蛋白质或其他生物大分子定位于不止一个亚细胞结构。通过与细胞器特异性荧光标记物共定位,可以测量蛋白质在不同细胞区室中的部分,这需要为每个区室提供荧光探针,并与感兴趣的大分子一起获取图像。或者,定制的算法可以找到图像中的特定区域,并量化它们包含的荧光量。不幸的是,这种方法需要对算法进行广泛的手动调整,并且通常依赖于细胞类型。在这里,我们描述了一种无需手动调整的机器学习方法,用于估计不同亚细胞区室中荧光信号的量,只需要为每个区室获取单独的标记物训练图像。在对用不同细胞器探针混合物染色的细胞图像进行测试时,我们在每个区室中观察到估计和预期探针量之间的相关性达到 93%。我们还证明该方法可用于定量药物依赖性蛋白质易位。该方法能够自动且无偏地确定蛋白质在细胞区室中的分布,这将显著改善基于成像的高通量测定,并促进蛋白质组范围的定位工作。