Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2018 Jun 1;74(Pt 6):492-505.
doi: 10.1107/S2059798318007313. Epub 2018 May 30.

Current approaches for the fitting and refinement of atomic models into cryo-EM maps using CCP-EM

Affiliations
Review

Current approaches for the fitting and refinement of atomic models into cryo-EM maps using CCP-EM

Robert A Nicholls et al. Acta Crystallogr D Struct Biol. .

Abstract

Recent advances in instrumentation and software have resulted in cryo-EM rapidly becoming the method of choice for structural biologists, especially for those studying the three-dimensional structures of very large macromolecular complexes. In this contribution, the tools available for macromolecular structure refinement into cryo-EM reconstructions that are available via CCP-EM are reviewed, specifically focusing on REFMAC5 and related tools. Whilst originally designed with a view to refinement against X-ray diffraction data, some of these tools have been able to be repurposed for cryo-EM owing to the same principles being applicable to refinement against cryo-EM maps. Since both techniques are used to elucidate macromolecular structures, tools encapsulating prior knowledge about macromolecules can easily be transferred. However, there are some significant qualitative differences that must be acknowledged and accounted for; relevant differences between these techniques are highlighted. The importance of phases is considered and the potential utility of replacing inaccurate amplitudes with their expectations is justified. More pragmatically, an upper bound on the correlation between observed and calculated Fourier coefficients, expressed in terms of the Fourier shell correlation between half-maps, is demonstrated. The importance of selecting appropriate levels of map blurring/sharpening is emphasized, which may be facilitated by considering the behaviour of the average map amplitude at different resolutions, as well as the utility of simultaneously viewing multiple blurred/sharpened maps. Features that are important for the purposes of computational efficiency are discussed, notably the Divide and Conquer pipeline for the parallel refinement of large macromolecular complexes. Techniques that have recently been developed or improved in Coot to facilitate and expedite the building, fitting and refinement of atomic models into cryo-EM maps are summarized. Finally, a tool for symmetry identification from a given map or coordinate set, ProSHADE, which can identify the point group of a map and thus may be used during deposition as well as during molecular visualization, is introduced.

Keywords: Divide and Conquer; ProSHADE; REFMAC5; cryo-EM; map blurring; map sharpening; model refinement; symmetry detection.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The effect of map blurring on cryo-EM reconstructions. Electrostatic potential maps and atomic models corresponding to the structure of β-­galactosidase (PDB entry 5a1a, EMDB entry EMD-2084; Bartesaghi et al., 2015 ▸). (a) The original map is very noisy, suggesting that it is over-sharpened. (b) Blurring the map using a B value of 40 Å reduces the noise, appearing to make it largely disappear, although the features (signal) in the map remain clear. This produces a clearer map that it will be easier to refine the atomic model against. (c) Applying further blurring (B value of 100 Å) does not help: it causes more of the noise to disappear, but also causes structural details to be lost (for example regarding side chains).
Figure 2
Figure 2
Plot of average structure-factor amplitude against resolution to aid the selection of the optimal blurring/sharpening B value, as shown in the CCP-EM GUI (Burnley et al., 2017 ▸). The data correspond to a cryo-EM reconstruction of β-galactosidase (PDB entry 5a1a, EMDB entry EMD-2084; Bartesaghi et al., 2015 ▸). Different levels of map blurring and sharpening are shown, using an array of B values from −100 Å (blurring) to +100 Å (sharpening) in increments of 20 Å. For the optimally blurred/sharpened map, the average amplitude should decay to zero at the ‘high-resolution limit’ (above which the data contain no information about the true structure). For high levels of map sharpening, the average amplitude persistently increases with resolution: this indicates over-sharpening. For high levels of map blurring, the average amplitude approaches zero rapidly prior to reaching the high-resolution limit, resulting in a loss of high-resolution information. In this case, the optimal average level of blurring over the map is around 40 Å (subjectively determined by manual inspection of the plot), in agreement with the maps shown in Figs. 1 ▸ and 3 ▸.
Figure 3
Figure 3
The utility of simultaneously viewing multiple maps using various levels of map blurring/sharpening. Electrostatic potential maps and atomic models corresponding to β-galactosidase (PDB entry 5a1a, EMDB entry EMD-2084; Bartesaghi et al., 2015 ▸) using different levels of map blurring. (a) In the original/default map, noise is visible in the solvent regions and there is little backbone density visible in the exposed region around residue 732 in chain A. (b) Blurring the map using a B value of 40 Å reduces noise and reveals features in the map. This suggests incorrect modelling of the backbone in this region and indicates where the model backbone should be positioned. (c) Simultaneously viewing an array of maps with different levels of sharpening and blurring further increases map interpretability. Here, maps corresponding to blurring B values of 0, 20, 40, 60, 80 and 100 Å are displayed. In this case, it is evident how the backbone should be traced (from the more blurred maps), and at the same time it is possible to interpret finer structural details such as the orientations of side chains (from the sharper maps).
Figure 4
Figure 4
External restraint visualization and usage during parallelized full-chain real-space atomic model refinement in Coot. In order to mimic a ‘typical’ stage of the model-building and refinement process in which the model is complete yet the conformational geometry is suboptimal, the atomic model corresponding to the RAD51 filament (PDB entry 5jzc, EMDB entry EMD-8183; Short et al., 2016 ▸) was subjected to simple re-refinement, causing overfitting into the map. External restraints were generated by ProSMART using a homologous structural model derived using MX (PDB entry 1n0w; Pellegrini et al., 2002 ▸) as a reference. In Coot, the colouring of interatomic vectors representing restraints indicates local structural conservation; consistent interatomic distances are coloured grey, whilst those that are substantially longer/shorter in the target than in the reference model are coloured red/blue. (a) The re-refined model of 5jzc is shown, along with the interatomic distance restraints that were generated using 1n0w. From observing the restraint colouring it is evident that there are substantial local conformational differences between the two structural models despite the sequence homology. (b) Performing full-chain real-space refinement of the model in Coot results in the local conformation of the model becoming much closer to that of the high-resolution homologue, as indicated by the increased quantity of interatomic vectors coloured grey. Regions containing restraints coloured red/blue indicate persisting differences between the target and reference models. Such regions are indicative of either true differences between the target and reference structures (as is the case in Fig. 5 ▸), suboptimal restraint weighting parameters or errors in the model that require manual resolution. Regardless, the presence of any such regions would warrant closer inspection.
Figure 5
Figure 5
Robustness to outliers when performing atomic model refinement using external restraints. The re-refined atomic model corresponding to the RAD51 filament (PDB entry 5jzc, EMDB entry EMD-8183; Short et al., 2016 ▸), as shown in Fig. 4 ▸(a), is displayed coloured green. External restraints were generated using ProSMART using the homologous structural model derived using MX (PDB entry 1n0w; Pellegrini et al., 2002 ▸) as a reference. The homologous model 1n0w is shown coloured brown superposed onto 5jzc. (a) The red colouring of the restraints reflects differences in the rotameric state of the aligned functional tyrosine between the target and reference models (Tyr293 in 5jzc, Tyr301 in 1n0w). Map density is clearly visible; this side chain is oriented acceptably in the original model, so it would be undesirable for the external restraints to pull this side chain into the conformation observed in the homologous model. (b) The model after real-space re-refinement with external restraints in Coot (as shown in Fig. 4 ▸ b) is coloured blue. Owing to the use of robust estimation, which restricts the influence of outliers, the tyrosine side chain retains its original conformation rather than being pulled out of the map and into the rotameric state observed in the homologous model. However, the geometry of the backbone in this region is improved, adopting a similar conformation to the homologous model as appropriate. Following such injection of prior structural information, the typical protocol would then be to perform full-model refinement using REFMAC5 with jelly-body restraints, which would allow the model to relax into the density whilst ensuring refinement stability.
Figure 6
Figure 6
The Divide and Conquer procedure illustrated using a model of a rotavirus particle (PDB entry 4v7q, EMDB entry EMD-5199; Settembre et al., 2011 ▸) visualized using PyMOL (Schrödinger). (a) The input macromolecule is split into individual chains. (b) For each chain, the adjacent contacting chains are identified and included for the refinement of the target chain. Each such substructure is masked in order to identify an appropriate box size for use in refinement. (c) After parallel refinement of each individual substructure, all of the target chains are extracted and recombined into the macromolecular complex.
Figure 7
Figure 7
Symmetry detection using ProSHADE. The map corresponding to a bacteriophage T4 portal protein (PDB entry 3ja7, EMDB entry EMD-6324; Sun et al., 2015 ▸) visualized using UCSF Chimera (Pettersen et al., 2004 ▸). ProSHADE reports the correct C12 symmetry, with the symmetry axis along the vector (0.02, 0.02, 1.00) with an average cross-correlation between the original structure and the structures rotated along this symmetry axis of 0.443. All symmetry elements present for this instance of the C12 symmetry group are reported, along with the list of alternative symmetries that are also present in the structure (C6, C4, C3 and C2) and the corresponding rotation-function peak heights.

Similar articles

Cited by

References

    1. Afonine, P. V., Grosse-Kunstleve, R. W., Urzhumtsev, A. & Adams, P. D. (2009). J. Appl. Cryst. 42, 607–615. - PMC - PubMed
    1. Afonine, P. V., Headd, J. J., Terwilliger, T. C. & Adams, P. D. (2013). Comput. Crystallogr. Newsl. 4, 43–44. https://www.phenix-online.org/newsletter/CCN_2013_07.pdf.
    1. Afonine, P. V., Moriarty, N. W., Mustyakimov, M., Sobolev, O. V., Terwilliger, T. C., Turk, D., Urzhumtsev, A. & Adams, P. D. (2015). Acta Cryst. D71, 646–666. - PMC - PubMed
    1. Afonine, P. V., Poon, B. K., Read, R. J., Sobolev, O. V., Terwilliger, T. C., Urzhumtsev, A. & Adams, P. D. (2018). Acta Cryst. D74, 531–544. - PMC - PubMed
    1. Bartesaghi, A., Merk, A., Banerjee, S., Matthies, D., Wu, X., Milne, J. L. & Subramaniam, S. (2015). Science, 348, 1147–1151. - PMC - PubMed

Substances

-