Johns Hopkins University, Baltimore, MD, USA.
Methods Mol Biol. 2025;2867:1-17. doi: 10.1007/978-1-0716-4196-5_1.
The protein folding problem dates back to Pauling's insights almost a century ago, but the first venture into actual protein structure was the Pauling-Corey-Brandson α-helix in 1951, a proposed model that was confirmed almost immediately using X-ray crystallography. Many subsequent efforts to predict protein helices from the amino acid sequence met with only partial success, as discussed here. Surprisingly, in 2021, these efforts were superseded by deep-learning artificial intelligence, especially AlphaFold2, a machine learning program based on neural nets. This approach can predict most protein structures successfully at or near atomic resolution. Deservedly, deep-learning artificial intelligence was named Science magazine's 2021 "breakthrough of the year." Today, ~200 million predicted protein structures can be downloaded from the AlphaFold2 Protein Structure Database. Deep learning represents a deep conundrum because these successfully predicted macromolecular structures are based on methods that are completely devoid of a hypothesis or of any physical chemistry. Perhaps we are now poised to transcend five centuries of reductive science.
蛋白质折叠问题可以追溯到保罗ing 的洞察近一个世纪前,但第一次涉足实际的蛋白质结构是鲍林-科里-布兰森在 1951 年的α-螺旋,一个提出的模型,几乎立即使用 X 射线晶体学证实。许多随后的努力预测蛋白质螺旋从氨基酸序列只遇到了部分成功,这里讨论。令人惊讶的是,在 2021 年,这些努力被深度学习人工智能所取代,尤其是 AlphaFold2,一种基于神经网络的机器学习程序。这种方法可以成功地预测大多数蛋白质结构在原子分辨率或接近原子分辨率。当之无愧的,深度学习人工智能被评为《科学》杂志 2021 年的“年度突破”。今天,大约 2 亿个预测的蛋白质结构可以从 AlphaFold2 蛋白质结构数据库下载。深度学习代表了一个深刻的难题,因为这些成功预测的大分子结构是基于完全没有假设或任何物理化学的方法。也许我们现在正准备超越五个世纪的还原性科学。