用于痕量荧光标记蛋白质结晶图像分类的特征分析

Feature analysis for classification of trace fluorescent labeled protein crystallization images.

作者信息

Sigdel Madhav, Dinc Imren, Sigdel Madhu S, Dinc Semih, Pusey Marc L, Aygun Ramazan S

机构信息

Computer Science Department, University of Alabama in Huntsville, Huntsville, 35899 Alabama USA.

Computer Science Department, Troy University, Troy, 36082 Alabama USA.

出版信息

BioData Min. 2017 Apr 27;10:14. doi: 10.1186/s13040-017-0133-9. eCollection 2017.

DOI:10.1186/s13040-017-0133-9

PMID:28465724

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5408444/

Abstract

BACKGROUND

Large number of features are extracted from protein crystallization trial images to improve the accuracy of classifiers for predicting the presence of crystals or phases of the crystallization process. The excessive number of features and computationally intensive image processing methods to extract these features make utilization of automated classification tools on stand-alone computing systems inconvenient due to the required time to complete the classification tasks. Combinations of image feature sets, feature reduction and classification techniques for crystallization images benefiting from trace fluorescence labeling are investigated.

RESULTS

Features are categorized into intensity, graph, histogram, texture, shape adaptive, and region features (using binarized images generated by Otsu's, green percentile, and morphological thresholding). The effects of normalization, feature reduction with principle components analysis (PCA), and feature selection using random forest classifier are also analyzed. The time required to extract feature categories is computed and an estimated time of extraction is provided for feature category combinations. We have conducted around 8624 experiments (different combinations of feature categories, binarization methods, feature reduction/selection, normalization, and crystal categories). The best experimental results are obtained using combinations of intensity features, region features using Otsu's thresholding, region features using green percentile thresholding, region features using green percentile thresholding, graph features, and histogram features. Using this feature set combination, 96% accuracy (without misclassifying crystals as non-crystals) was achieved for the first level of classification to determine presence of crystals. Since missing a crystal is not desired, our algorithm is adjusted to achieve a high sensitivity rate. In the second level classification, 74.2% accuracy for (5-class) crystal sub-category classification. Best classification rates were achieved using random forest classifier.

CONTRIBUTIONS

The feature extraction and classification could be completed in about 2 s per image on a stand-alone computing system, which is suitable for real time analysis. These results enable research groups to select features according to their hardware setups for real-time analysis.

摘要

背景

从蛋白质结晶试验图像中提取大量特征，以提高用于预测结晶过程中晶体存在或相的分类器的准确性。特征数量过多以及提取这些特征的计算密集型图像处理方法，使得在独立计算系统上使用自动分类工具不方便，因为完成分类任务需要时间。研究了受益于微量荧光标记的结晶图像的图像特征集、特征约简和分类技术的组合。

结果

特征分为强度、图形、直方图、纹理、形状自适应和区域特征（使用大津法、绿色百分位数和形态学阈值生成的二值化图像）。还分析了归一化、主成分分析（PCA）进行特征约简以及使用随机森林分类器进行特征选择的效果。计算了提取特征类所需的时间，并为特征类组合提供了估计的提取时间。我们进行了约8624次实验（特征类、二值化方法、特征约简/选择、归一化和晶体类别的不同组合）。使用强度特征、大津法阈值处理的区域特征、绿色百分位数阈值处理的区域特征、绿色百分位数阈值处理的区域特征、图形特征和直方图特征的组合获得了最佳实验结果。使用此特征集组合，在确定晶体存在的一级分类中实现了96%的准确率（无将晶体误分类为非晶体的情况）。由于不希望错过晶体，我们调整了算法以实现高灵敏度率。在二级分类中，（5类）晶体子类别分类的准确率为74.2%。使用随机森林分类器实现了最佳分类率。

贡献

在独立计算系统上，每幅图像的特征提取和分类大约可以在2秒内完成，适用于实时分析。这些结果使研究小组能够根据其硬件设置选择特征进行实时分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3150/5408444/b7412aa77e0b/13040_2017_133_Fig1_HTML.jpg

相似文献

Feature analysis for classification of trace fluorescent labeled protein crystallization images.用于痕量荧光标记蛋白质结晶图像分类的特征分析

BioData Min. 2017 Apr 27;10:14. doi: 10.1186/s13040-017-0133-9. eCollection 2017.

Real-Time Protein Crystallization Image Acquisition and Classification System.实时蛋白质结晶图像采集与分类系统

Cryst Growth Des. 2013 Jul 3;13(7):2728-2736. doi: 10.1021/cg3016029.

Differentiation of fat-poor angiomyolipoma from clear cell renal cell carcinoma in contrast-enhanced MDCT images using quantitative feature classification.基于定量特征分类的 MDCT 增强图像鉴别乏脂性血管平滑肌脂肪瘤与透明细胞肾细胞癌

Med Phys. 2017 Jul;44(7):3604-3614. doi: 10.1002/mp.12258. Epub 2017 Jun 9.

Protein Crystallization Segmentation and Classification Using Subordinate Color Channel in Fluorescence Microscopy Images.利用荧光显微镜图像中的从属颜色通道进行蛋白质结晶的分割和分类。

J Fluoresc. 2020 May;30(3):637-656. doi: 10.1007/s10895-020-02500-7. Epub 2020 Apr 20.

Full Intelligent Cancer Classification of Thermal Breast Images to Assist Physician in Clinical Diagnostic Applications.用于临床诊断应用中辅助医生的乳腺热图像全智能癌症分类

J Med Signals Sens. 2016 Jan-Mar;6(1):12-24.

Computer-assisted lip diagnosis on Traditional Chinese Medicine using multi-class support vector machines.基于多类支持向量机的中医唇诊计算机辅助诊断。

BMC Complement Altern Med. 2012 Aug 16;12:127. doi: 10.1186/1472-6882-12-127.

Phenotype recognition with combined features and random subspace classifier ensemble.基于组合特征和随机子空间分类器集成的表型识别。

BMC Bioinformatics. 2011 Apr 30;12:128. doi: 10.1186/1471-2105-12-128.

Classification of childhood medulloblastoma into WHO-defined multiple subtypes based on textural analysis.基于纹理分析的 WHO 定义的多种小儿髓母细胞瘤亚型分类。

J Microsc. 2020 Jul;279(1):26-38. doi: 10.1111/jmi.12893. Epub 2020 Apr 28.

Deep feature classification of angiomyolipoma without visible fat and renal cell carcinoma in abdominal contrast-enhanced CT images with texture image patches and hand-crafted feature concatenation.利用纹理图像补丁和手工特征串联对腹部增强 CT 图像中无可见脂肪的血管平滑肌脂肪瘤和肾细胞癌进行深度特征分类。

Med Phys. 2018 Apr;45(4):1550-1561. doi: 10.1002/mp.12828. Epub 2018 Mar 25.

Cell type classifiers for breast cancer microscopic images based on fractal dimension texture analysis of image color layers.基于图像颜色层分形维纹理分析的乳腺癌显微图像细胞类型分类器

Scanning. 2015 Mar-Apr;37(2):145-51. doi: 10.1002/sca.21191. Epub 2015 Feb 16.

引用本文的文献

Exploring the role of repetitive negative thinking in the transdiagnostic context of depression and anxiety in children.探索重复性消极思维在儿童抑郁和焦虑跨诊断背景中的作用。

BMC Psychol. 2025 Aug 12;13(1):902. doi: 10.1186/s40359-025-03169-y.

Inhibitors of Calcium Oxalate Crystallization for the Treatment of Oxalate Nephropathies.用于治疗草酸盐肾病的草酸钙结晶抑制剂

Adv Sci (Weinh). 2020 Feb 27;7(8):1903337. doi: 10.1002/advs.201903337. eCollection 2020 Apr.

J Fluoresc. 2020 May;30(3):637-656. doi: 10.1007/s10895-020-02500-7. Epub 2020 Apr 20.

本文引用的文献

Super-Thresholding: Supervised Thresholding of Protein Crystal Images.超阈值处理：蛋白质晶体图像的监督阈值处理

IEEE/ACM Trans Comput Biol Bioinform. 2017 Jul-Aug;14(4):986-998. doi: 10.1109/TCBB.2016.2542811. Epub 2016 Mar 16.

Optimizing Associative Experimental Design for Protein Crystallization Screening.优化用于蛋白质结晶筛选的关联实验设计

IEEE Trans Nanobioscience. 2016 Mar;15(2):101-12. doi: 10.1109/TNB.2016.2536030. Epub 2016 Feb 29.

Trace fluorescent labeling for protein crystallization.用于蛋白质结晶的微量荧光标记

Acta Crystallogr F Struct Biol Commun. 2015 Jul;71(Pt 7):806-14. doi: 10.1107/S2053230X15008626. Epub 2015 Jun 27.

Evaluation of Normalization and PCA on the Performance of Classifiers for Protein Crystallization Images.蛋白质结晶图像分类器性能的归一化和主成分分析评估

Proc IEEE Southeastcon. 2014 Mar;2014. doi: 10.1109/SECON.2014.6950744.

Evaluation of Semi-supervised Learning for Classification of Protein Crystallization Imagery.蛋白质结晶图像分类的半监督学习评估

Proc IEEE Southeastcon. 2014 Mar;2014. doi: 10.1109/SECON.2014.6950649.

Real-Time Protein Crystallization Image Acquisition and Classification System.实时蛋白质结晶图像采集与分类系统

Cryst Growth Des. 2013 Jul 3;13(7):2728-2736. doi: 10.1021/cg3016029.

Introduction to protein crystallization.蛋白质结晶简介。

Acta Crystallogr F Struct Biol Commun. 2014 Jan;70(Pt 1):2-20. doi: 10.1107/S2053230X13033141. Epub 2013 Dec 24.

Letter to the editor: Stability of Random Forest importance measures.致编辑的信：随机森林重要性度量的稳定性。

Brief Bioinform. 2011 Jan;12(1):86-9. doi: 10.1093/bib/bbq011. Epub 2010 Mar 31.

Protein crystallization analysis on the World Community Grid.世界计算网格上的蛋白质结晶分析。

J Struct Funct Genomics. 2010 Mar;11(1):61-9. doi: 10.1007/s10969-009-9076-9. Epub 2010 Jan 14.

Leveraging genetic algorithm and neural network in automated protein crystal recognition.在自动蛋白质晶体识别中利用遗传算法和神经网络。

Annu Int Conf IEEE Eng Med Biol Soc. 2008;2008:1926-9. doi: 10.1109/IEMBS.2008.4649564.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于痕量荧光标记蛋白质结晶图像分类的特征分析

Feature analysis for classification of trace fluorescent labeled protein crystallization images.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONTRIBUTIONS

背景

结果

贡献

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献