• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用重复码和弱检测器增强蛋白质亚细胞定位的多类学习。

Boosting multiclass learning with repeating codes and weak detectors for protein subcellular localization.

作者信息

Lin Chung-Chih, Tsai Yuh-Show, Lin Yu-Shi, Chiu Tai-Yu, Hsiung Chia-Cheng, Lee May-I, Simpson Jeremy C, Hsu Chun-Nan

机构信息

Faculty of Life Sciences and Institute of Genomes, National Yang-Ming University, Taipei, Taiwan.

出版信息

Bioinformatics. 2007 Dec 15;23(24):3374-81. doi: 10.1093/bioinformatics/btm497. Epub 2007 Oct 22.

DOI:10.1093/bioinformatics/btm497
PMID:17956879
Abstract

MOTIVATION

Determining locations of protein expression is essential to understand protein function. Advances in green fluorescence protein (GFP) fusion proteins and automated fluorescence microscopy allow for rapid acquisition of large collections of protein localization images. Recognition of these cell images requires an automated image analysis system. Approaches taken by previous work concentrated on designing a set of optimal features and then applying standard machine-learning algorithms. In fact, trends of recent advances in machine learning and computer vision can be applied to improve the performance. One trend is the advances in multiclass learning with error-correcting output codes (ECOC). Another trend is the use of a large number of weak detectors with boosting for detecting objects in images of real-world scenes.

RESULTS

We take advantage of these advances to propose a new learning algorithm, AdaBoost.ERC, coupled with weak and strong detectors, to improve the performance of automatic recognition of protein subcellular locations in cell images. We prepared two image data sets of CHO and Vero cells and downloaded a HeLa cell image data set in the public domain to evaluate our new method. We show that AdaBoost.ERC outperforms other AdaBoost extensions. We demonstrate the benefit of weak detectors by showing significant performance improvements over classifiers using only strong detectors. We also empirically test our method's capability of generalizing to heterogeneous image collections. Compared with previous work, our method performs reasonably well for the HeLa cell images.

AVAILABILITY

CHO and Vero cell images, their corresponding feature sets (SSLF and WSLF), our new learning algorithm, AdaBoost.ERC, and Supplementary Material are available at http://aiia.iis.sinica.edu.tw/

摘要

动机

确定蛋白质表达位置对于理解蛋白质功能至关重要。绿色荧光蛋白(GFP)融合蛋白和自动荧光显微镜技术的进步使得能够快速获取大量蛋白质定位图像。识别这些细胞图像需要一个自动图像分析系统。先前工作所采用的方法集中在设计一组最优特征,然后应用标准机器学习算法。事实上,机器学习和计算机视觉领域的最新进展趋势可用于提高性能。一个趋势是纠错输出码(ECOC)在多类学习方面的进展。另一个趋势是使用大量弱检测器并结合增强技术来检测真实场景图像中的物体。

结果

我们利用这些进展提出了一种新的学习算法AdaBoost.ERC,结合弱检测器和强检测器,以提高细胞图像中蛋白质亚细胞定位自动识别的性能。我们准备了CHO和Vero细胞的两个图像数据集,并下载了公共领域的HeLa细胞图像数据集来评估我们的新方法。我们表明AdaBoost.ERC优于其他AdaBoost扩展方法。通过展示相较于仅使用强检测器的分类器有显著的性能提升,我们证明了弱检测器的优势。我们还通过实验测试了我们的方法对异构图像集的泛化能力。与先前工作相比,我们的方法在HeLa细胞图像上表现良好。

可用性

CHO和Vero细胞图像、它们相应的特征集(SSLF和WSLF)、我们的新学习算法AdaBoost.ERC以及补充材料可在http://aiia.iis.sinica.edu.tw/获取。

相似文献

1
Boosting multiclass learning with repeating codes and weak detectors for protein subcellular localization.利用重复码和弱检测器增强蛋白质亚细胞定位的多类学习。
Bioinformatics. 2007 Dec 15;23(24):3374-81. doi: 10.1093/bioinformatics/btm497. Epub 2007 Oct 22.
2
Automated recognition system to classify subcellular protein localizations in images of different cell lines acquired by different imaging systems.用于对通过不同成像系统获取的不同细胞系图像中的亚细胞蛋白质定位进行分类的自动识别系统。
Microsc Res Tech. 2008 Apr;71(4):305-14. doi: 10.1002/jemt.20555.
3
Automated image analysis of protein localization in budding yeast.芽殖酵母中蛋白质定位的自动化图像分析
Bioinformatics. 2007 Jul 1;23(13):i66-71. doi: 10.1093/bioinformatics/btm206.
4
Experiments with AdaBoost.RT, an improved boosting scheme for regression.使用AdaBoost.RT进行的实验,一种改进的回归增强方案。
Neural Comput. 2006 Jul;18(7):1678-710. doi: 10.1162/neco.2006.18.7.1678.
5
A reliable method for cell phenotype image classification.一种用于细胞表型图像分类的可靠方法。
Artif Intell Med. 2008 Jun;43(2):87-97. doi: 10.1016/j.artmed.2008.03.005. Epub 2008 Apr 28.
6
Sharing visual features for multiclass and multiview object detection.用于多类和多视图目标检测的视觉特征共享。
IEEE Trans Pattern Anal Mach Intell. 2007 May;29(5):854-69. doi: 10.1109/TPAMI.2007.1055.
7
Fast asymmetric learning for cascade face detection.用于级联人脸检测的快速非对称学习
IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):369-82. doi: 10.1109/TPAMI.2007.1181.
8
AdaBoost-based algorithm for network intrusion detection.基于AdaBoost的网络入侵检测算法。
IEEE Trans Syst Man Cybern B Cybern. 2008 Apr;38(2):577-83. doi: 10.1109/TSMCB.2007.914695.
9
Toward practical smile detection.迈向实用的微笑检测。
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2106-11. doi: 10.1109/TPAMI.2009.42.
10
Prediction of subcellular protein localization based on functional domain composition.基于功能域组成预测亚细胞蛋白质定位
Biochem Biophys Res Commun. 2007 Jun 1;357(2):366-70. doi: 10.1016/j.bbrc.2007.03.139. Epub 2007 Apr 2.

引用本文的文献

1
Isolation and Characterization of an LBD Transcription Factor CsLBD39 from Tea Plant () and Its Roles in Modulating Nitrate Content by Regulating Nitrate-Metabolism-Related Genes.从茶树()中分离和鉴定一个 LBD 转录因子 CsLBD39,并研究其通过调控硝酸盐代谢相关基因来调节硝酸盐含量的作用。
Int J Mol Sci. 2022 Aug 18;23(16):9294. doi: 10.3390/ijms23169294.
2
MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy.MIC_Locator:一种新颖的基于图像的蛋白质亚细胞位置多标签预测模型,基于多尺度单基因信号表示和强度编码策略。
BMC Bioinformatics. 2019 Oct 26;20(1):522. doi: 10.1186/s12859-019-3136-3.
3
Determining the subcellular location of new proteins from microscope images using local features.
使用局部特征从显微镜图像中确定新蛋白质的亚细胞位置。
Bioinformatics. 2013 Sep 15;29(18):2343-9. doi: 10.1093/bioinformatics/btt392. Epub 2013 Jul 8.
4
Ranking of multidimensional drug profiling data by fractional-adjusted bi-partitional scores.多维药物剖析数据的分数调整二分法得分排序。
Bioinformatics. 2012 Jun 15;28(12):i106-14. doi: 10.1093/bioinformatics/bts232.
5
A spectral graph theoretic approach to quantification and calibration of collective morphological differences in cell images.基于谱图理论的细胞图像整体形态差异量化和校准方法。
Bioinformatics. 2010 Jun 15;26(12):i29-37. doi: 10.1093/bioinformatics/btq194.
6
Screening cellular feature measurements for image-based assay development.为基于图像的分析方法开发筛选细胞特征测量值。
J Biomol Screen. 2010 Aug;15(7):840-6. doi: 10.1177/1087057110370895. Epub 2010 Jun 1.
7
Introduction to the quantitative analysis of two-dimensional fluorescence microscopy images for cell-based screening.用于基于细胞筛选的二维荧光显微镜图像定量分析简介
PLoS Comput Biol. 2009 Dec;5(12):e1000603. doi: 10.1371/journal.pcbi.1000603. Epub 2009 Dec 24.