Research Highlights

Cyclophilin A catalyses proline isomerization by an electrostatic handle mechanism. Proc. Natl. Acad. Sci. USA 111, 10203-10208 (2014).

One of the most widespread molecular switches in biochemical pathways is based on the isomerization of the amino acid proline, a process that normally is facilitated by enzymes known as 'proline isomerases'. We show that cyclophilin A, one of the most common proline isomerases, acts by a simple mechanism, which we describe as an 'electrostatic handle'. In this mechanism, the enzyme creates an electrostatic environment in its catalytic site that rotates a peptide bond in the substrate by pulling the electric dipole associated with the carbonyl group preceding the peptide bond itself. Our results thus identify a specific mechanism by which electrostatics is exploited in enzyme catalysis.

Statistical mechanics of the denatured state of a protein using replica-averaged metadynamics. J. Am. Chem. Soc. 136, 8982-8991 (2014).

The characterization of denatured states of proteins is challenging because the lack of permanent structure in these states makes it difficult to apply to them standard methods of structural biology. In this work we use all-atom replica-averaged metadynamics (RAM) simulations with NMR chemical shift restraints to determine an ensemble of structures representing an acid-denatured state of the 86-residue protein ACBP. This approach has enabled us to reach convergence in the free energy landscape calculations, obtaining an ensemble of structures in relatively accurate agreement with independent experimental data used for validation. By observing at atomistic resolution the transient formation of native and non-native structures in this acid-denatured state of ACBP, we rationalize the effects of single-point mutations on the folding rate, stability, and transition-state structures of this protein, thus characterizing the role of the unfolded state in determining the folding process.

Widespread aggregation and neurodegenerative diseases are associated with supersaturated proteins. Cell Reports 5, 781-790 (2013).

The maintenance of protein solubility is a fundamental aspect of cellular homeostasis because protein aggregation is associated with a wide variety of human diseases. Numerous proteins unrelated in sequence and structure, however, can misfold and aggregate, and widespread aggregation can occur in living systems under stress or aging. A crucial question in this context is why only certain proteins appear to aggregate readily in vivo, whereas others do not. We identify here the proteins most vulnerable to aggregation as those whose cellular concentrations are high relative to their solubilities. We find that these supersaturated proteins represent a metastable subproteome involved in pathological aggregation during stress and aging and are overrepresented in biochemical processes associated with neurodegenerative disorders. Consequently, such cellular processes become dysfunctional when the ability to keep intrinsically supersaturated proteins soluble is compromised. Thus, the simultaneous analysis of abundance and solubility can rationalize the diverse cellular pathologies linked to neurodegenerative diseases and aging.

Atomic structure and hierarchical assembly of a cross-beta amyloid fibril. PNAS 110, 5468-5473 (2013).

The cross-beta amyloid form of peptides and proteins represents an archetypal and widely accessible structure consisting of ordered arrays of beta-sheet filaments. These complex aggregates have remarkable chemical and physical properties, and the conversion of normally soluble functional forms of proteins into amyloid structures is linked to many debilitating human diseases, including several common forms of age-related dementia. Despite their importance, however, cross-beta amyloid fibrils have proved to be recalcitrant to detailed structural analysis. By combining structural constraints from a series of experimental techniques spanning five orders of magnitude in length scale - including magic angle spinning nuclear magnetic resonance spectroscopy, X-ray fiber diffraction, cryo-electron microscopy, scanning transmission electron microscopy, and atomic force microscopy - we report the atomic-resolution (0.5 A) structures of three amyloid polymorphs formed by an 11-residue peptide. These structures reveal the details of the packing interactions by which the constituent beta-strands are assembled hierarchically into protofilaments, filaments, and mature fibrils.

Using chemical shifts as replica-averaged structural restraints in molecular dynamics simulations to characterise the dynamics of proteins. J. Am. Chem. Soc. 134, 3968-3971 (2012).

Following the recognition that NMR chemical shifts can be used for protein structure determination, rapid advances have recently been made in methods for extending this strategy for proteins and protein complexes of increasing size and complexity. A remaining major challenge is to develop approaches to exploit the information contained in the chemical shifts about conformational fluctuations in native states of proteins. In this work we show that it is possible to determine an ensemble of conformations representing the free energy surface of RNase A using chemical shifts as replica-averaged restraints in molecular dynamics simulations. Analysis of this surface indicates that chemical shifts can be used to characterize the conformational equilibrium.

Structure of an intermediate state in protein folding and aggregation. Science 336, 362-368 (2012).

Protein-folding intermediates have been implicated in amyloid fibril formation involved in neurodegenerative disorders. However, the structural mechanisms by which intermediates initiate fibrillar aggregation have remained largely elusive. To gain insight, we used relaxation dispersion nuclear magnetic resonance spectroscopy to determine the structure of a low-populated, on-pathway folding intermediate of the A39V/N53P/V55L (A, Ala; V, Val; N, Asn; P, Pro; L, Leu) Fyn SH3 domain. The carboxyl terminus remains disordered in this intermediate, thereby exposing the aggregation-prone amino-terminal β strand. Accordingly, mutants lacking the carboxyl terminus and thus mimicking the intermediate fail to safeguard the folding route and spontaneously form fibrillar aggregates. The structure provides a detailed characterization of the non-native interactions stabilizing an aggregation-prone intermediate under native conditions and insight into how such an intermediate can derail folding and initiate fibrillation.

Amyloid-like aggregates sequester numerous metastable proteins with essential cellular functions. Cell 144, 67-78 (2011).

Protein aggregation is linked with neurodegeneration and numerous other diseases by mechanisms that are not well understood. Here, we have analyzed the gain-of-function toxicity of artificial beta sheet proteins that were designed to form amyloid-like fibrils. Using quantitative proteomics, we found that the toxicity of these proteins in human cells correlates with the capacity of their aggregates to promote aberrant protein interactions and to deregulate the cytosolic stress response. The endogenous proteins that are sequestered by the aggregates share distinct physicochemical properties: They are relatively large in size and significantly enriched in predicted unstructured regions, features that are strongly linked with multifunctionality. Many of the interacting proteins occupy essential hub positions in cellular protein networks, with key roles in chromatin organization, transcription, translation, maintenance of cell architecture and protein quality control. We suggest that amyloidogenic aggregation targets a metastable subproteome, thereby causing multifactorial toxicity and, eventually, the collapse of essential cellular functions.

Characterization of the structure of a misfolded intermediate populated during the folding process of a PDZ domain. Nature Struct. Mol. Biol. 17, 1431-1437 (2010).

Incorrectly folded states transiently populated during the protein folding process are potentially prone to aggregation and have been implicated in a range of misfolding disorders that include Alzheimer's and Parkinson's diseases. Despite their importance, however, the structures of these states and the mechanism of their formation have largely escaped detailed characterization because of their short-lived nature. Here we present the structures of all the major states involved in the folding process of a PDZ domain, which include an off-pathway misfolded intermediate. By using a combination of kinetic, protein engineering, biophysical and computational techniques, we show that the misfolded intermediate is characterized by an alternative packing of the N-terminal β-hairpin onto an otherwise native-like scaffold. Our results suggest a mechanism of formation of incorrectly folded transient compact states by which misfolded structural elements are assembled together with more extended native-like regions.

An analytical solution to the kinetics of filament assembly. Science, 326, 1533-1537 (2009).

We present an analytical treatment of a set of coupled kinetic equations that governs the self-assembly of filamentous molecular structures. Application to the case of protein aggregation demonstrates that the kinetics of amyloid growth can often be dominated by secondary rather than by primary nucleation events. Our results further reveal a range of general features of the growth kinetics of fragmenting filamentous structures, including the existence of generic scaling laws that provide mechanistic information in contexts ranging from in vitro amyloid growth to the in vivo development of mammalian prion diseases.

Prediction of aggregation-prone regions of structured proteins. J. Mol. Biol. 380, 425-436 (2008).

We present a method for predicting the regions of the sequences of peptides and proteins that are most important in promoting their aggregation and amyloid formation. The method extends previous approaches by allowing such predictions to be carried out for conditions under which the molecules concerned can be folded or contain a significant degree of persistent structure. In order to achieve this result, the method uses only knowledge of the sequence of amino acids to estimate simultaneously both the propensity for folding and aggregation and the way in which these two types of propensity compete. We illustrate the approach by its application to a set of peptides and proteins both associated and not associated with disease. Our results show not only that the regions of a protein with a high intrinsic aggregation propensity can be identified in a robust manner but also that the structural context of such regions in the monomeric form is crucial for determining their actual role in the aggregation process.

Structure determination of protein complexes using NMR chemical shifts. JACS 143, 15990-15996 (2008).

Nuclear magnetic resonance (NMR) spectroscopy provides a range of powerful techniques for determining the structures and the dynamics of proteins. The high-resolution determination of the structures of protein-protein complexes, however, is still a challenging problem for this approach, since it can normally provide only a limited amount of structural information at protein-protein interfaces. We present here the determination using NMR chemical shifts of the structure (PDB code 2K5X) of the cytotoxic endonuclease domain from bacterial toxin colicin (E9) in complex with its cognate immunity protein (Im9). In order to achieve this result, we introduce the CamDock method, which combines a flexible docking procedure with a refinement that exploits the structural information provided by chemical shifts. The results that we report thus indicate that chemical shifts can be used as structural restraints for the determination of the conformations of protein complexes that are difficult to obtain by more standard NMR approaches.

Protein structure determination from NMR chemical shifts. PNAS 104, 9615-9620 (2007).

NMR spectroscopy plays a major role in the determination of the structures and dynamics of proteins and other biological macromolecules. Chemical shifts are the most readily and accurately measurable NMR parameters, and they reflect with great specificity the conformations of native and nonnative states of proteins. We show, using 11 examples of proteins representative of the major structural classes and containing up to 123 residues, that it is possible to use chemical shifts as structural restraints in combination with a conventional molecular mechanics force field to determine the conformations of proteins at a resolution of 2A or better. This strategy should be widely applicable and, subject to further development, will enable quantitative structural analysis to be carried out to address a range of complex biological problems not accessible to current structural techniques.

Correlation between expression levels and aggregation rates of proteins. TiBS 32, 204-206 (2007).

We have found that expression levels of human genes in vivo are remarkably anti-correlated with the aggregation rates of the corresponding proteins measured in vitro by experiment. This result indicates that human proteins have evolved to resist aggregation and to function efficiently, but with almost no margin of safety to respond to genetic and environmental factors that decrease their solubility or increase their concentration in vivo. We speculate that this result, which we call the 'life on the edge' hypothesis, provides a compelling reason for the existence of disorders that are associated with protein aggregation, such as Alzheimer's and Parkinson's diseases.

Simultaneous determination of protein structure and dynamics. Nature 433, 128-132 (2005).

We present a protocol for the experimental determination of ensembles of protein conformations that represent simultaneously the native structure and its associated dynamics. The procedure combines the strengths of nuclear magnetic resonance spectroscopy - for obtaining experimental information at the atomic level about the structural and dynamical features of proteins - with the ability of molecular dynamics simulations to explore a wide range of prot ein conformations. We illustrate the method for human ubiquitin in solution and find that there is considerable conformational heterogeneity throughout the protein structure. The interior atoms of the protein are tightly packed in each individual conformation that contributes to the ensemble but their overall behaviour can be described as having a significant degree of liquid-like character. The protocol is completely general and should lead to significant advances in our ability to understand and utilize the structures of native proteins.