Authors: Brooks H. Perkins-Jechow, Juan Pablo Iglesias Ahualli, Huyen Thuc Nhu, Alireza Omidi, Chunchao Li, Jorge A. Holguin-Cruz, Daeahn Cho, Dokyun Na, Nawar Malhis, Jennifer M. Bui, Jörg Gsponer
Published in Nature Communications Chemistry, 2025
Abstract
Many proteins function by toggling between distinct conformations, yet most structure predictors have been trained on data that do not capture this conformational diversity. Here, we benchmarked AlphaFold2, AlphaFold3, and recent variants on autoinhibited proteins, a class of allosterically regulated, often multi-domain proteins that exist in equilibrium between active and autoinhibited states. Our analyses show that AlphaFold2 fails to reproduce the experimental structures of many autoinhibited proteins, which is reflected in reduced confidence scores. This contrasts sharply with its high-accuracy, high-confidence predictions of non-autoinhibited multi-domain proteins. When tested for its ability to capture conformational diversity, we found that AlphaFold2 performs better when combined with uniform subsampling of sequence alignments rather than local subsampling. BioEmu and AlphaFold3 improve upon these results, yet still struggle to accurately reproduce details of experimental structures. Together, our study underscores the persistent challenges of predicting protein structures shaped by complex energy landscapes.