• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高通量蛋白质结晶试验中的自动分类与模式发现

Automatic classification and pattern discovery in high-throughput protein crystallization trials.

作者信息

Cumbaa Christian, Jurisica Igor

机构信息

Ontario Cancer Institute, Northeast Structural Genomics Consortium, 610 University Avenue, Toronto, Ontario M5G 2M9, Canada.

出版信息

J Struct Funct Genomics. 2005;6(2-3):195-202. doi: 10.1007/s10969-005-5243-9.

DOI:10.1007/s10969-005-5243-9
PMID:16211519
Abstract

Conceptually, protein crystallization can be divided into two phases search and optimization. Robotic protein crystallization screening can speed up the search phase, and has a potential to increase process quality. Automated image classification helps to increase throughput and consistently generate objective results. Although the classification accuracy can always be improved, our image analysis system can classify images from 1,536-well plates with high classification accuracy (85%) and ROC score (0.87), as evaluated on 127 human-classified protein screens containing 5,600 crystal images and 189,472 non-crystal images. Data mining can integrate results from high-throughput screens with information about crystallizing conditions, intrinsic protein properties, and results from crystallization optimization. We apply association mining, a data mining approach that identifies frequently occurring patterns among variables and their values. This approach segregates proteins into groups based on how they react in a broad range of conditions, and clusters cocktails to reflect their potential to achieve crystallization. These results may lead to crystallization screen optimization, and reveal associations between protein properties and crystallization conditions. We also postulate that past experience may lead us to the identification of initial conditions favorable to crystallization for novel proteins.

摘要

从概念上讲,蛋白质结晶可分为两个阶段:搜索阶段和优化阶段。机器人蛋白质结晶筛选可以加快搜索阶段的速度,并有可能提高过程质量。自动图像分类有助于提高通量并持续产生客观结果。尽管分类准确率总是可以提高,但我们的图像分析系统能够以较高的分类准确率(85%)和ROC评分(0.87)对1536孔板中的图像进行分类,这是在对127个人工分类的蛋白质筛选进行评估得出的结果,这些筛选包含5600张晶体图像和189472张非晶体图像。数据挖掘可以将高通量筛选的结果与有关结晶条件、蛋白质固有特性以及结晶优化结果的信息整合起来。我们应用关联挖掘,这是一种数据挖掘方法,可识别变量及其值之间频繁出现的模式。这种方法根据蛋白质在广泛条件下的反应方式将蛋白质分为不同组,并对结晶剂进行聚类以反映它们实现结晶的潜力。这些结果可能会导致结晶筛选的优化,并揭示蛋白质特性与结晶条件之间的关联。我们还推测,过去的经验可能会引导我们确定有利于新型蛋白质结晶的初始条件。

相似文献

1
Automatic classification and pattern discovery in high-throughput protein crystallization trials.高通量蛋白质结晶试验中的自动分类与模式发现
J Struct Funct Genomics. 2005;6(2-3):195-202. doi: 10.1007/s10969-005-5243-9.
2
Spectroscopic imaging of protein crystals in crystallization drops.结晶液滴中蛋白质晶体的光谱成像。
J Struct Funct Genomics. 2005;6(2-3):203-8. doi: 10.1007/s10969-005-1914-9.
3
Large-scale tracking and classification for automatic analysis of cell migration and proliferation, and experimental optimization of high-throughput screens of neuroblastoma cells.用于神经母细胞瘤细胞迁移和增殖自动分析的大规模跟踪与分类,以及高通量筛选的实验优化。
Cytometry A. 2015 Jun;87(6):524-40. doi: 10.1002/cyto.a.22632. Epub 2015 Jan 28.
4
Evaluation of crystalline objects in crystallizing protein droplets based on line-segment information in greyscale images.基于灰度图像中的线段信息对结晶蛋白质液滴中的晶体物体进行评估。
Acta Crystallogr D Biol Crystallogr. 2006 Mar;62(Pt 3):239-45. doi: 10.1107/S0907444905041077. Epub 2006 Feb 22.
5
Automated classification of protein crystallization images using support vector machines with scale-invariant texture and Gabor features.使用具有尺度不变纹理和Gabor特征的支持向量机对蛋白质结晶图像进行自动分类。
Acta Crystallogr D Biol Crystallogr. 2006 Mar;62(Pt 3):271-9. doi: 10.1107/S0907444905041648. Epub 2006 Feb 22.
6
Computational analysis of crystallization trials.结晶试验的计算分析。
Acta Crystallogr D Biol Crystallogr. 2002 Nov;58(Pt 11):1915-23. doi: 10.1107/s0907444902016840. Epub 2002 Oct 21.
7
Development of an automated large-scale protein-crystallization and monitoring system for high-throughput protein-structure analyses.用于高通量蛋白质结构分析的自动化大规模蛋白质结晶与监测系统的开发。
Acta Crystallogr D Biol Crystallogr. 2006 Sep;62(Pt 9):1058-65. doi: 10.1107/S0907444906023821. Epub 2006 Aug 19.
8
Managing and mining protein crystallization data.管理和挖掘蛋白质结晶数据。
Proteins. 2006 Jan 1;62(1):4-7. doi: 10.1002/prot.20776.
9
High-throughput crystallization screening.高通量结晶筛选
Methods Mol Biol. 2014;1140:159-68. doi: 10.1007/978-1-4939-0354-2_12.
10
Protein production and crystallization at SECSG -- an overview.SECSG的蛋白质生产与结晶——概述。
J Struct Funct Genomics. 2005;6(2-3):233-43. doi: 10.1007/s10969-005-2462-z.

引用本文的文献

1
Parallel and High Throughput Reaction Monitoring with Computer Vision.基于计算机视觉的并行和高通量反应监测
Angew Chem Int Ed Engl. 2025 Jan 2;64(1):e202413395. doi: 10.1002/anie.202413395. Epub 2024 Oct 31.
2
Moving beyond MARCO.超越 MARCO。
PLoS One. 2023 Mar 24;18(3):e0283124. doi: 10.1371/journal.pone.0283124. eCollection 2023.
3
Not getting in too deep: A practical deep learning approach to routine crystallisation image classification.不入虎穴:一种实用的深度学习方法用于常规结晶图像分类。

本文引用的文献

1
Robotic cloning and Protein Production Platform of the Northeast Structural Genomics Consortium.东北结构基因组学联盟的机器人克隆与蛋白质生产平台
Methods Enzymol. 2005;394:210-43. doi: 10.1016/S0076-6879(05)94008-1.
2
Machine-learning techniques for macromolecular crystallization data.
Acta Crystallogr D Biol Crystallogr. 2004 Oct;60(Pt 10):1705-16. doi: 10.1107/S090744490401683X. Epub 2004 Sep 23.
3
UniProt: the Universal Protein knowledgebase.通用蛋白质知识库(UniProt)。
PLoS One. 2023 Mar 9;18(3):e0282562. doi: 10.1371/journal.pone.0282562. eCollection 2023.
4
High-Throughput Crystallization Pipeline at the Crystallography Core Facility of the Institut Pasteur.巴斯德研究所晶体学核心设施的高通量结晶流水线。
Molecules. 2019 Dec 5;24(24):4451. doi: 10.3390/molecules24244451.
5
Cinder: keeping crystallographers app-y.辛德:让晶体学家们开心。
Acta Crystallogr F Struct Biol Commun. 2018 Jul 1;74(Pt 7):410-418. doi: 10.1107/S2053230X18008038. Epub 2018 Jun 26.
6
Classification of crystallization outcomes using deep convolutional neural networks.使用深度卷积神经网络对结晶结果进行分类。
PLoS One. 2018 Jun 20;13(6):e0198883. doi: 10.1371/journal.pone.0198883. eCollection 2018.
7
Feature analysis for classification of trace fluorescent labeled protein crystallization images.用于痕量荧光标记蛋白质结晶图像分类的特征分析
BioData Min. 2017 Apr 27;10:14. doi: 10.1186/s13040-017-0133-9. eCollection 2017.
8
CrystPro: Spatiotemporal Analysis of Protein Crystallization Images.CrystPro:蛋白质结晶图像的时空分析
Cryst Growth Des. 2015;15(11):5254-5262. doi: 10.1021/acs.cgd.5b00714. Epub 2015 Sep 16.
9
Approaches to automated protein crystal harvesting.蛋白质晶体自动收获方法。
Acta Crystallogr F Struct Biol Commun. 2014 Feb;70(Pt 2):133-55. doi: 10.1107/S2053230X14000387. Epub 2014 Jan 28.
10
Real-Time Protein Crystallization Image Acquisition and Classification System.实时蛋白质结晶图像采集与分类系统
Cryst Growth Des. 2013 Jul 3;13(7):2728-2736. doi: 10.1021/cg3016029.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D115-9. doi: 10.1093/nar/gkh131.
4
Automatic classification of sub-microlitre protein-crystallization trials in 1536-well plates.1536孔板中亚微升蛋白质结晶试验的自动分类
Acta Crystallogr D Biol Crystallogr. 2003 Sep;59(Pt 9):1619-27. doi: 10.1107/s0907444903015130. Epub 2003 Aug 19.
5
SPINE 2: a system for collaborative structural proteomics within a federated database framework.SPINE 2:一种用于联合数据库框架内协作结构蛋白质组学的系统。
Nucleic Acids Res. 2003 Jun 1;31(11):2833-8. doi: 10.1093/nar/gkg397.
6
A deliberate approach to screening for initial crystallization conditions of biological macromolecules.一种用于筛选生物大分子初始结晶条件的审慎方法。
J Struct Biol. 2003 Apr;142(1):170-9. doi: 10.1016/s1047-8477(03)00048-0.
7
Computational analysis of crystallization trials.结晶试验的计算分析。
Acta Crystallogr D Biol Crystallogr. 2002 Nov;58(Pt 11):1915-23. doi: 10.1107/s0907444902016840. Epub 2002 Oct 21.
8
Towards the automated evaluation of crystallization trials.迈向结晶试验的自动化评估。
Acta Crystallogr D Biol Crystallogr. 2002 Nov;58(Pt 11):1907-14. doi: 10.1107/s0907444902016633. Epub 2002 Oct 21.
9
The Biological Macromolecule Crystallization Database and NASA Protein Crystal Growth Archive.生物大分子结晶数据库和美国国家航空航天局蛋白质晶体生长档案库。
J Res Natl Inst Stand Technol. 1996 May-Jun;101(3):309-20. doi: 10.6028/jres.101.032.