Hekkelman Maarten L, Salmoral Daniel Álvarez, Perrakis Anastassis, Joosten Robbie P
Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands.
Oncode Institute, Utrecht, The Netherlands.
Protein Sci. 2025 Aug;34(8):e70208. doi: 10.1002/pro.70208.
Protein secondary structure annotation is essential for understanding protein architecture, serving as a cornerstone for structural classification, alignment, visualization, and machine learning applications. The Define Secondary Structure of Proteins (DSSP) algorithm has long been the standard for assigning secondary structure elements such as α-helices, β-sheets, and loops in protein models. Here, we introduce DSSP version 4, which recapitulates DSSP functionality in a modern computational framework, extending also to the detection of left-handed κ-helices (Poly-Proline II helices). To align with the Findable, Accessible, Interoperable, Reusable principles, DSSP 4 adopts mmCIF as its primary input and output format, while retaining compatibility with legacy Protein Data Bank (PDB) and DSSP formats. We applied this updated tool to analyze the distribution of secondary structure elements across the PDB, differentiating structures from diverse experimental methods, revealing insights into the prevalence and length of secondary structure elements, including the newly annotated κ-helices. The DSSP 4 software, databank, and server are freely accessible from https://pdb-redo.eu/dssp, ensuring broad utility and interoperability in structural biology research.
蛋白质二级结构注释对于理解蛋白质结构至关重要,是结构分类、比对、可视化及机器学习应用的基石。蛋白质二级结构定义(DSSP)算法长期以来一直是在蛋白质模型中分配二级结构元件(如α螺旋、β折叠和环)的标准。在此,我们介绍DSSP版本4,它在现代计算框架中重现了DSSP功能,还扩展到了左手κ螺旋(多聚脯氨酸II螺旋)的检测。为符合可查找、可访问、可互操作、可重用原则,DSSP 4采用mmCIF作为其主要输入和输出格式,同时保留与传统蛋白质数据库(PDB)和DSSP格式的兼容性。我们应用这个更新后的工具来分析PDB中二级结构元件的分布,区分来自不同实验方法的结构,揭示二级结构元件(包括新注释的κ螺旋)的普遍性和长度的见解。DSSP 4软件、数据库和服务器可从https://pdb - redo.eu/dssp免费获取,确保在结构生物学研究中的广泛实用性和互操作性。