Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China.
Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China.
Biochim Biophys Acta Gen Subj. 2024 Jun;1868(6):130601. doi: 10.1016/j.bbagen.2024.130601. Epub 2024 Mar 24.
Aberrant protein localization is a prominent feature in many human diseases and can have detrimental effects on the function of specific tissues and organs. High-throughput technologies, which continue to advance with iterations of automated equipment and the development of bioinformatics, enable the acquisition of large-scale data that are more pattern-rich, allowing for the use of a wider range of methods to extract useful patterns and knowledge from them.
The proposed sc2promap (Spatial and Channel for SubCellular Protein Localization Mapping) model, designed to proficiently extract meaningful features from a vast repository of single-channel grayscale protein images for the purposes of protein localization analysis and clustering. Sc2promap incorporates a prediction head component enriched with supplementary protein annotations, along with the integration of a spatial-channel attention mechanism within the encoder to enables the generation of high-resolution protein localization maps that encapsulate the fundamental characteristics of cells, including elemental cellular localizations such as nuclear and non-nuclear domains.
Qualitative and quantitative comparisons were conducted across internal and external clustering evaluation metrics, as well as various facets of the clustering results. The study also explored different components of the model. The research outcomes conclusively indicate that, in comparison to previous methods, Sc2promap exhibits superior performance.
The amalgamation of the attention mechanism and prediction head components has led the model to excel in protein localization clustering and analysis tasks.
The model effectively enhances the capability to extract features and knowledge from protein fluorescence images.
异常蛋白质定位是许多人类疾病的一个显著特征,会对特定组织和器官的功能产生有害影响。高通量技术不断通过自动化设备的迭代和生物信息学的发展而进步,能够获取更具模式丰富的大规模数据,从而可以使用更广泛的方法从中提取有用的模式和知识。
提出的 sc2promap(亚细胞蛋白质定位映射的空间和通道)模型,旨在从大量单通道灰度蛋白质图像库中高效提取有意义的特征,用于蛋白质定位分析和聚类。Sc2promap 包含一个预测头组件,其中包含丰富的补充蛋白质注释,以及在编码器中集成空间通道注意力机制,以生成高分辨率蛋白质定位图,这些图包含细胞的基本特征,包括核和非核区域等基本细胞定位。
通过内部和外部聚类评估指标以及聚类结果的各个方面进行了定性和定量比较。该研究还探索了模型的不同组件。研究结果明确表明,与以前的方法相比,Sc2promap 表现出更好的性能。
注意力机制和预测头组件的融合使模型在蛋白质定位聚类和分析任务中表现出色。
该模型有效地增强了从蛋白质荧光图像中提取特征和知识的能力。