CSI 2.0：化学位移指数的显著改进版本。

CSI 2.0: a significantly improved version of the Chemical Shift Index.

作者信息

Hafsa Noor E, Wishart David S

机构信息

Department of Computing Science, University of Alberta, Edmonton, Canada.

出版信息

J Biomol NMR. 2014 Nov;60(2-3):131-46. doi: 10.1007/s10858-014-9863-x. Epub 2014 Oct 2.

DOI:10.1007/s10858-014-9863-x

PMID:25273503

Abstract

Protein chemical shifts have long been used by NMR spectroscopists to assist with secondary structure assignment and to provide useful distance and torsion angle constraint data for structure determination. One of the most widely used methods for secondary structure identification is called the Chemical Shift Index (CSI). The CSI method uses a simple digital chemical shift filter to locate secondary structures along the protein chain using backbone (13)C and (1)H chemical shifts. While the CSI method is simple to use and easy to implement, it is only about 75-80% accurate. Here we describe a significantly improved version of the CSI (2.0) that uses machine-learning techniques to combine all six backbone chemical shifts ((13)Cα, (13)Cβ, (13)C, (15)N, (1)HN, (1)Hα) with sequence-derived features to perform far more accurate secondary structure identification. Our tests indicate that CSI 2.0 achieved an average identification accuracy (Q3) of 90.56% for a training set of 181 proteins in a repeated tenfold cross-validation and 89.35% for a test set of 59 proteins. This represents a significant improvement over other state-of-the-art chemical shift-based methods. In particular, the level of performance of CSI 2.0 is equal to that of standard methods, such as DSSP and STRIDE, used to identify secondary structures via 3D coordinate data. This suggests that CSI 2.0 could be used both in providing accurate NMR constraint data in the early stages of protein structure determination as well as in defining secondary structure locations in the final protein model(s). A CSI 2.0 web server (http://csi.wishartlab.com) is available for submitting the input queries for secondary structure identification.

摘要

长期以来，核磁共振光谱学家一直使用蛋白质化学位移来辅助二级结构归属，并为结构测定提供有用的距离和扭转角约束数据。二级结构识别中使用最广泛的方法之一称为化学位移指数（CSI）。CSI方法使用简单的数字化学位移滤波器，通过主链（13）C和（1）H化学位移来定位蛋白质链上的二级结构。虽然CSI方法使用简单且易于实现，但其准确率仅约为75 - 80%。在此，我们描述了一种显著改进的CSI（2.0）版本，它使用机器学习技术将所有六个主链化学位移（（13）Cα、（13）Cβ、（13）C、（15）N、（1）HN、（1）Hα）与序列衍生特征相结合，以进行更准确的二级结构识别。我们的测试表明，在重复的十折交叉验证中，对于181个蛋白质的训练集，CSI 2.0的平均识别准确率（Q3）为90.56%，对于59个蛋白质的测试集为89.35%。这相对于其他基于化学位移的先进方法有显著改进。特别是，CSI 2.0的性能水平与用于通过三维坐标数据识别二级结构的标准方法（如DSSP和STRIDE）相当。这表明CSI 2.0既可以在蛋白质结构测定的早期阶段提供准确的核磁共振约束数据，也可以在最终的蛋白质模型中定义二级结构位置。一个CSI 2.0网络服务器（http://csi.wishartlab.com）可用于提交二级结构识别的输入查询。

相似文献

CSI 2.0: a significantly improved version of the Chemical Shift Index.

J Biomol NMR. 2014 Nov;60(2-3):131-46. doi: 10.1007/s10858-014-9863-x. Epub 2014 Oct 2.

CSI 3.0: a web server for identifying secondary and super-secondary structure in proteins using NMR chemical shifts.

Nucleic Acids Res. 2015 Jul 1;43(W1):W370-7. doi: 10.1093/nar/gkv494. Epub 2015 May 15.

SHIFTX2: significantly improved protein chemical shift prediction.

J Biomol NMR. 2011 May;50(1):43-57. doi: 10.1007/s10858-011-9478-4. Epub 2011 Mar 30.

EFG-CS: Predicting chemical shifts from amino acid sequences with protein structure prediction using machine learning and deep learning models.

Protein Sci. 2024 Aug;33(8):e5096. doi: 10.1002/pro.5096.

Accurate prediction of protein torsion angles using chemical shifts and sequence homology.

Magn Reson Chem. 2006 Jul;44 Spec No:S158-67. doi: 10.1002/mrc.1832.

Practical use of chemical shift databases for protein solid-state NMR: 2D chemical shift maps and amino-acid assignment with secondary-structure information.

J Biomol NMR. 2013 Jun;56(2):155-67. doi: 10.1007/s10858-013-9732-z. Epub 2013 Apr 28.

Toward direct determination of conformations of protein building units from multidimensional NMR experiments VI: chemical shift analysis of his to gain 3D structure and protonation state information.

J Comput Chem. 2005 Oct;26(13):1307-17. doi: 10.1002/jcc.20266.

Nearest-neighbor effects on backbone alpha and beta carbon chemical shifts in proteins.

J Biomol NMR. 2007 Nov;39(3):247-57. doi: 10.1007/s10858-007-9193-3.

Accurate and automated classification of protein secondary structure with PsiCSI.

Protein Sci. 2003 Feb;12(2):288-95. doi: 10.1110/ps.0222303.

Accessible surface area from NMR chemical shifts.

J Biomol NMR. 2015 Jul;62(3):387-401. doi: 10.1007/s10858-015-9957-0. Epub 2015 Jun 16.

引用本文的文献

Evolutionary Adaptations in Biliverdin Reductase B: Insights into Coenzyme Dynamics and Catalytic Efficiency.

Int J Mol Sci. 2024 Dec 10;25(24):13233. doi: 10.3390/ijms252413233.

Backbone chemical shift and secondary structure assignments for mouse siderocalin.

Biomol NMR Assign. 2024 Jun;18(1):79-84. doi: 10.1007/s12104-024-10171-9. Epub 2024 Apr 2.

Structures, Interactions and Activity of the N-Terminal Truncated Variants of Antimicrobial Peptide Thanatin.

Antibiotics (Basel). 2024 Jan 12;13(1):74. doi: 10.3390/antibiotics13010074.

Recognition and coacervation of G-quadruplexes by a multifunctional disordered region in RECQ4 helicase.

Nat Commun. 2023 Oct 24;14(1):6751. doi: 10.1038/s41467-023-42503-z.

Identifying structural and dynamic changes during the Biliverdin Reductase B catalytic cycle.

Front Mol Biosci. 2023 Aug 14;10:1244587. doi: 10.3389/fmolb.2023.1244587. eCollection 2023.

The influence of random-coil chemical shifts on the assessment of structural propensities in folded proteins and IDPs.

RSC Adv. 2023 Mar 31;13(15):10182-10203. doi: 10.1039/d3ra00977g. eCollection 2023 Mar 27.

Human and Bacterial Toll-Interleukin Receptor Domains Exhibit Distinct Dynamic Features and Functions.

Molecules. 2022 Jul 14;27(14):4494. doi: 10.3390/molecules27144494.

NMR resonance assignments of the DNA binding domain of mouse Junctophilin-2.

Biomol NMR Assign. 2022 Oct;16(2):273-279. doi: 10.1007/s12104-022-10091-6. Epub 2022 Jun 4.

Merging NMR Data and Computation Facilitates Data-Centered Research.

Front Mol Biosci. 2022 Jan 17;8:817175. doi: 10.3389/fmolb.2021.817175. eCollection 2021.

The Inherent Dynamics and Interaction Sites of the SARS-CoV-2 Nucleocapsid N-Terminal Region.

J Mol Biol. 2021 Jul 23;433(15):167108. doi: 10.1016/j.jmb.2021.167108. Epub 2021 Jun 20.

本文引用的文献

Protein backbone and sidechain torsion angles predicted from NMR chemical shifts using artificial neural networks.

J Biomol NMR. 2013 Jul;56(3):227-41. doi: 10.1007/s10858-013-9741-y. Epub 2013 Jun 2.

Advances, interactions, and future developments in the CNS, Phenix, and Rosetta structural biology software systems.

Annu Rev Biophys. 2013;42:265-87. doi: 10.1146/annurev-biophys-083012-130253. Epub 2013 Feb 28.

Determination of secondary structure populations in disordered states of proteins using nuclear magnetic resonance chemical shifts.

Biochemistry. 2012 Mar 20;51(11):2224-31. doi: 10.1021/bi3001825. Epub 2012 Mar 6.

Identification of helix capping and b-turn motifs from NMR chemical shifts.

J Biomol NMR. 2012 Mar;52(3):211-32. doi: 10.1007/s10858-012-9602-0.

Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Mol Syst Biol. 2011 Oct 11;7:539. doi: 10.1038/msb.2011.75.

Protein sequence comparison and fold recognition: progress and good-practice benchmarking.

Curr Opin Struct Biol. 2011 Jun;21(3):404-11. doi: 10.1016/j.sbi.2011.03.005. Epub 2011 Mar 31.

SHIFTX2: significantly improved protein chemical shift prediction.

J Biomol NMR. 2011 May;50(1):43-57. doi: 10.1007/s10858-011-9478-4. Epub 2011 Mar 30.

Interpreting protein chemical shift data.

Prog Nucl Magn Reson Spectrosc. 2011 Feb;58(1-2):62-87. doi: 10.1016/j.pnmrs.2010.07.004. Epub 2010 Aug 5.

Characterization of protein secondary structure from NMR chemical shifts.

Prog Nucl Magn Reson Spectrosc. 2009 Apr 5;54(3-4):141-165. doi: 10.1016/j.pnmrs.2008.06.002.

DANGLE: A Bayesian inferential method for predicting protein backbone dihedral angles and secondary structure.

J Magn Reson. 2010 Feb;202(2):223-33. doi: 10.1016/j.jmr.2009.11.008. Epub 2009 Dec 16.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

CSI 2.0：化学位移指数的显著改进版本。

CSI 2.0: a significantly improved version of the Chemical Shift Index.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献