Single-particle electron microscopy (cryo-electron microscopy) CS/CME/BioE/Biophys/BMI 279 Nov. 16 and 28, 2017 Ron Dror

Similar documents
Single-particle electron microscopy (cryo-electron microscopy) CS/CME/BioE/Biophys/BMI 279 Nov. 16 and 28, 2017 Ron Dror

THREE-DIMENSIONA L ELECTRON MICROSCOP Y OF MACROMOLECULAR ASSEMBLIE S. Visualization of Biological Molecules in Their Native Stat e.

II: Single particle cryoem - an averaging technique

3. Image formation, Fourier analysis and CTF theory. Paula da Fonseca

Cover Page. The handle holds various files of this Leiden University dissertation

Single Particle Reconstruction Techniques

III: Single particle cryoem - practical approaches

Crystallography & Cryo-electron microscopy

3DEM > GENERAL METHODS. SINGLE PARTICLE RECONSTRUCTION (SPA): Introduction

Fourier transforms and convolution

II: Single particle cryoem - an averaging technique

Vision in the Small: Reconstructing the Structure of Protein Macromolecules from Cryo-Electron Micrographs

Transmission Electron Microscopy 3D Construction by IPET Method TCBG GPU Workshop

Alignment and Other Challenges in Reconstructing Cryotomograms with IMOD

Introduction to Cryo Electron Microscopy/Tomography

Image Alignment and Application of 2D Fast Rotational Matching in Single Particle cryo-electron Microscopy. Yao Cong. Baylor College of Medicine

CHEM-E5225 :Electron Microscopy Imaging I

Structural Information obtained

2D Image Alignment and Classification

Dynamic Reconstruction for Coded Aperture Imaging Draft Unpublished work please do not cite or distribute.

Cryo-electron microscopy Cryo-EM. Garry Taylor

convolution shift invariant linear system Fourier Transform Aliasing and sampling scale representation edge detection corner detection

Refinement into cryo-em maps. Garib Murshudov MRC-LMB, Cambridge, UK

From cryo-em images of protein complexes to high-resolution 3D structures

Initial Model Generation

(Refer Slide Time 00:17) Welcome to the course on Digital Image Processing. (Refer Slide Time 00:22)

EMBO Practical Course on Image Processing for Cryo EM 1-11 September 2015

MAT 17C - DISCUSSION #4, Counting Proteins

Chapter 1 Introduction

BMB/Bi/Ch 173 Winter 2017

Image Registration Lecture 4: First Examples

BME I5000: Biomedical Imaging

Intrinsic Classification of Single Particle Images by Spectral Clustering

Likelihood-based optimization of cryo-em data. Sjors Scheres National Center for Biotechnology CSIC Madrid, Spain

An Intuitive Explanation of Fourier Theory

EE795: Computer Vision and Intelligent Systems

Machine Learning : supervised versus unsupervised

Thomas Abraham, PhD

proteindiffraction.org Select

Final Exam Study Guide

SUPPLEMENTARY INFORMATION

CPSC 425: Computer Vision

specular diffuse reflection.

Lecture 15: Segmentation (Edge Based, Hough Transform)

Introduction to Digital Image Processing

Processing: Introduction and new approaches. Sjors H.W. Scheres NRAMM cryo-em workshop, NYSBC, 1 November 2017

SYSTEM LINEARITY LAB MANUAL: 2 Modifications for P551 Fall 2013 Medical Physics Laboratory

Scattering/Wave Terminology A few terms show up throughout the discussion of electron microscopy:

Local Features: Detection, Description & Matching

Chapter 12 Notes: Optics

Central Slice Theorem

Tomographic Reconstruction

Reading on the Accumulation Buffer: Motion Blur, Anti-Aliasing, and Depth of Field

Motivation. Gray Levels

Assessing the homo- or heterogeneity of noisy experimental data. Kay Diederichs Konstanz, 01/06/2017

Lecture 8. Conical tilt reconstruction

Robust PDF Table Locator

3D Computer Vision. Depth Cameras. Prof. Didier Stricker. Oliver Wasenmüller

Edge and Texture. CS 554 Computer Vision Pinar Duygulu Bilkent University

Michael Moody School of Pharmacy University of London 29/39 Brunswick Square London WC1N 1AX, U.K.

PHYSICS. Chapter 33 Lecture FOR SCIENTISTS AND ENGINEERS A STRATEGIC APPROACH 4/E RANDALL D. KNIGHT

The Paper Project Guide to Confocal Imaging at Home & in the Classroom

Edges, interpolation, templates. Nuno Vasconcelos ECE Department, UCSD (with thanks to David Forsyth)

Blob-Representation of Multidimensional Objects and Surfaces

Edge Detection (with a sidelight introduction to linear, associative operators). Images

Physical or wave optics

Motivation. Intensity Levels

Michael Moody School of Pharmacy University of London 29/39 Brunswick Square London WC1N 1AX, U.K.

Let s start with occluding contours (or interior and exterior silhouettes), and look at image-space algorithms. A very simple technique is to render

Fall 09, Homework 5

SUPPLEMENTARY INFORMATION

CoE4TN4 Image Processing. Chapter 5 Image Restoration and Reconstruction

Fundamentals of Digital Image Processing

Supplementary Information. Structure of a RSC nucleosome Complex and Insights into Chromatin Remodeling

Types of Edges. Why Edge Detection? Types of Edges. Edge Detection. Gradient. Edge Detection

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Selim Aksoy

3D Reconstruction in EM

Lecture 7: Most Common Edge Detectors

All forms of EM waves travel at the speed of light in a vacuum = 3.00 x 10 8 m/s This speed is constant in air as well

Identification of Shielding Material Configurations Using NMIS Imaging

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)

EPSRC Centre for Doctoral Training in Industrially Focused Mathematical Modelling

Optical Ptychography Imaging

Digital Image Processing. Prof. P. K. Biswas. Department of Electronic & Electrical Communication Engineering

EE795: Computer Vision and Intelligent Systems

Batch Processing of Tomograms with IMOD

Edge detection. Winter in Kraków photographed by Marcin Ryczek

Lecture 16: Computer Vision

Lecture 16: Computer Vision

Chapter 15. Light Waves

Practice Exam Sample Solutions

Three-dimensional structure and flexibility of a membrane-coating module of the nuclear pore complex

10.5 Polarization of Light

ADVANCED RECONSTRUCTION FOR ELECTRON MICROSCOPY

Digital Image Processing COSC 6380/4393

Determination of the image orientation

Structural Basis for RNA Processing by the Human. RISC-Loading Complex

Edges and Binary Images

Computer Graphics. Sampling Theory & Anti-Aliasing. Philipp Slusallek

CS4670: Computer Vision

Transcription:

Single-particle electron microscopy (cryo-electron microscopy) CS/CME/BioE/Biophys/BMI 279 Nov. 16 and 28, 2017 Ron Dror 1

Last month s Nobel Prize in Chemistry Awarded to Jacques Dubochet, Joachim Frank and Richard Henderson and "For developing cryo-electron microscopy for the high-resolution structure determination of biomolecules in solution" 2

Nature, Sept. 10, 2015 3

Outline Overview of single-particle electron microscopy (EM) Single-particle EM images are projections Sample preparation Computational reconstruction methods 2D image analysis Image preprocessing Particle picking The images are very noisy. First, clean up the 2D images. Then try to recreate the 3D structure whose projection created those 2D images. Image clustering and class averaging 3D reconstruction Reconstruction with known view angles Structure refinement with unknown view angles Calculating an initial structure Fitting atomic-resolution models to lower-resolution EM structures 4

Overview of single-particle electron microscopy (EM) 5

The basic idea We want the structure of a particle : a molecule (e.g., protein) or a well-defined complex composed of many molecules (e.g., a ribosome) We spread identical particles out on a film, and image them using an electron microscope The images are two-dimensional (2D), and each particle is positioned with a different, unknown orientation. Given enough 2D images of particles, we can computationally reconstruct the 3D shape of the particle Electron beam Particles Images Image from Joachim Frank http://biomachina.org/courses/structures/091.pdf 6

Stanford has recently purchased at least 3 of these machines. A high-end cryo-electron microscope 7

Dramatic recent improvements Single-particle EM has been around for decades, but it has improved dramatically in the last five years due to: Invention of better cameras Until recently, electrons were detected either by photographic film, or by scintillator-based digital cameras which converted electrons to photons for detection New direct-electron detectors can detect electrons directly, substantially improving image resolution and quality Better computational reconstruction techniques Single-particle EM is thus coming into much wider use, and may challenge crystallography as the dominant way to determine experimental structures 8

Comparison to x-ray crystallography Single-particle EM s major advantage over crystallography is that it does not require formation of a crystal Particularly advantageous for large complexes, which are usually difficult to crystallize Also avoids structural artifacts due to packing in a crystal lattice. In EM, particles are in a more natural environment. On the other hand: Single-particle EM s resolution is (typically) lower than that of crystallography Reconstructing structures of small proteins from EM images is difficult, because images from different orientations look similar (i.e., a blob ) Bottom line: single-particle EM is particularly advantageous for large complexes, because: Large complexes tend to be harder to crystallize The computational reconstruction problem in single-particle EM is usually easier to solve for large particles than for small ones 9

Single-particle EM images are projections 10

Single-particle EM uses transmission electron microscopy In transmission electron microscopy, a beam of electrons pass through a thin sample before forming an image Scanning electron microscopy detects surfaces, so it more closely mimics how we normally see. Transmission electron microscopy Scanning electron microsopy http://www.cas.miamioh.edu/~meicenrd/anatomy/ Ch2_Ultrastructure/Tempcell.htm http://www.newscientist.com/data/images/ns/cms/ dn14136/dn14136-1_788.jpg 11

Single-particle EM images are projections Each recorded 2D image is thus a projection of the 3D shape (density) we want to reconstruct That is, we can think of each pixel value in the 2D image as a sum of the values along a line through the 3D sample (in the direction of the electron beam) Likelihood of particle being stopped is proportional to density of the cross section Electron beam Particles Images 12

13 From Joachim Frank, Three-dimensional electron microscopy of macromolecular assemblies: Visualization of biological molecules in their native state, 2006

In transmission EM, the image would look more like an x-ray of the bunny than a shadow of the bunny 14 From Joachim Frank, Three-dimensional electron microscopy of macromolecular assemblies: Visualization of biological molecules in their native state, 2006

Sample preparation 15

Sample preparation To survive in the electron microscope (in a vacuum, under electron bombardment), the particles are usually prepared in one of two ways: Negative staining Coat particles with heavy metal salt crystals This increases contrast (particles are easy to pick out from background) It limits resolution to ~20 Å and can introduce artifacts Vitrification Particles are embedded in ice (vitreous ice: flash frozen, not crystalline) This gives less contrast, but enables much higher resolution (below 4 Å) High-resolution single-particle EM relies on vitrification and is thus referred to as cryo-electron microscopy (cryo-em)

Usually you ll perform negative staining to check your protein sample, make sure everything is okay. Then you ll move to vitreous ice to take high resolution images. Negative stain. Particles are easy to pick out. Vitreous ice. Particles are harder to pick out, even though this is a very good ( easy ) case. Frank, 2006 Cheng, Cell 161:450 (2015) 17

Computational reconstruction methods 18

Overview of computational methods 2D image analysis: First, go from raw image data to higher-resolution 2D projections Image preprocessing Particle picking Image clustering and class averaging 3D reconstruction: Then use these higherresolution projections to build a 3D model Background: Reconstruction with known view angles Structure refinement with unknown view angles Calculating an initial structure Fitting atomic-resolution models to lower-resolution EM structures 19

Overview of computational methods This is a good review, check it out! Cheng et al., Cell 161:438 (2015) 20

Computational reconstruction methods 2D image analysis 21

Particles are very low contrast, meaning they don t stand out very well from the background. Actually, this is an especially clean example! The raw images don t look so good Image from Joachim Frank http://biomachina.org/courses/structures/091.pdf Before attempting any 3D reconstruction, we do several types of processing on the images 22

Computational reconstruction methods 2D image analysis Image preprocessing 23

Image preprocessing Problem 1: The sample tends to move slightly during imaging, blurring the image Solution It wasn t until recently that detectors were fast enough to record a movie. Direct electron detectors are fast enough to record a movie instead of a single image Align the movie frames computationally, then average them together 24

Image preprocessing Problem 2: Overall brightness is often nonuniform (due to uneven illumination or sample thickness) Solution: high-pass filter the image Because uneven illumination is a very low frequency effect. Cheng et al., Cell 161:438 (2015) 25

Image preprocessing There is blurring that happens when an image is recorded on the electron microscope, but we have a good model of how the blurring occurs, so we can partially correct for it. Problem 3: The optics cause the recorded image to be a blurred version of the ideal image This blurring is a convolution, and can thus be expressed as a multiplication in the frequency domain, where the ideal image is multiplied by the contrast transfer function Solution: Estimate parameters of the contrast transfer function, then correct for it Some of the parameters are known (from the optics), while others are estimated from the images Correction is generally done in the frequency domain A typical contrast transfer function, in the frequency domain (zero frequency at the center) Frequencies corresponding to bright pixels here have strong signal under the microscope; frequencies corresponding to dark pixels are poorly detected under the microscope. You re not responsible for the particular form of the contrast transfer function https://en.wikipedia.org/wiki/contrast_transfer_function

Computational reconstruction methods 2D image analysis Particle picking 27

Pick out the particles in the 2D images It s like you snap a bunch of blurry images of an anthill, and then you want to reconstruct the model of an ant. Image from Joachim Frank http://biomachina.org/courses/structures/091.pdf 28

Particle picking results In reality, you ll have snapshots of at least 10,000 different particles, sometimes 100s of thousands. Image from Joachim Frank http://biomachina.org/courses/structures/091.pdf 29

Particle picking methods Particle picking can be difficult, because the images are lowcontrast and noisy Images may also have contaminants that should be ignored A variety of automated and semi-automated methods have been developed For example, matching to templates, or picking out highcontrast regions Often this is still done manually, at least to seed automated methods with suitable templates Contaminant Particle of interest Cheng et al., Cell 2015 Doing a manual annotation helps you make sure your data look okay. This also provides templates that teaches the software what a particle should roughly look like. 30

Computational reconstruction methods 2D image analysis Image clustering and class averaging 31

Averaging similar images reduces noise In practice, the particles under the microscope will be in different orientations, and these orientations aren t told to you in advance, so you need a way to group particles with a similar orientation. (Imagine taking thousands of pictures of a water bottle floating in space your pictures will capture all possible orientations of the water bottle.) This is like grouping smiley faces with similar expressions as shown to the left. Image from Joachim Frank http://biomachina.org/courses/ structures/091.pdf The images in each row above represent the same ideal image but with different corrupting noise If we average the images in each row (in the sense of averaging corresponding pixels), we end up with a less noisy image, because the noise in the different images tends to cancel out Noise will average out because the noise between different images are random and not correlated with each other.

Goal: cluster the particle images into classes of similar images Group together images with similar view angles Then align them to one another and average them together to reduce noise To do this, divide images into several classes (with each class representing a set of similar view angles) We need to determine both what the classes are and which images should be assigned to each class This is a clustering problem Group images such that the images within a group are similar, but images in different groups are different In machine learning terminology, this is unsupervised learning 33

Standard approach: k-means clustering k is decided in advance. Pick k random images as class exemplars Then iterate the following: Assign each image to the closest exemplar Average all the images in each class to determine a new class exemplar On the left is an example of k-means clustering iteration from http://stanford.edu/ ~cpiech/cs221/handouts/kmeans.html, which also gives a good mathematical explanation. 34

Standard approach: k-means clustering k is decided in advance. Pick k random images as class exemplars Then iterate the following: Assign each image to the closest exemplar Average all the images in each class to determine a new class exemplar Notes: In the assignment step, we need to align each particle image against the exemplar images We need to specify the number of classes (k) in advance, or experiment with different values of k k-means clustering is guaranteed to converge, but not guaranteed to find a globally optimal solution Indeed, the solution may depend heavily on the initialization conditions, and may be heavily suboptimal 35

Caveat: Potential model bias in clustering/alignment Number of aligned and averaged images Image from Steve Ludtke http://biomachina.org/courses/structures/091.pdf In this case, the images are just noise, but by selecting images and alignments that best match a given template, we get a class average that looks like the template. This is essentially because the alignment step biases the random pixels towards positions that match the template image of the little girl, so when these images are averaged you reproduce the image of the little girl. 36

Caveat: Potential model bias in clustering/alignment Image from Steve Ludtke http://biomachina.org/courses/structures/091.pdf In this case, the images are noisy versions of one face, but by selecting images and alignments that best match a second face, we get a class average that looks like the second face. This can be a big problem experimentally! In these examples, you can reconstruct an image that wasn t there to begin with. 37

Avoiding these problems A variety of more sophisticated clustering methods ameliorate these problems Some involve modifications to k-means (including the recently developed Iterative Stable Alignment and Clustering method) ISAC essentially computes a whole bunch of different k-means and combines the results to produce a more robust solution. Some involve principal components analysis or other dimensionality reduction techniques 38 You re not responsible for these methods

Class averaging results Cheng, Cell 161:450 (2015) These are considered good class averages (from a high-resolution single-particle EM study) 39

Computational reconstruction methods 3D reconstruction 40

Problem Suppose you re given many projections of a 2D image, and you want to reconstruct the original image. How would you do it? Projections Original image Projections are the sum along the vertical or horizontal line Working with your neighbor, try to come up with one way to do it if you know the view angle for each projection, and another if you don t Don t look at the next few slides. If you already have, try to come up with approaches that are not on those slides. 41

Computational reconstruction methods 3D reconstruction Background: Reconstruction with known view angles 42

Suppose you knew the view angle for each particle image How would you reconstruct the 3D density map from 2D projections? Same problem is encountered in medical imaging (e.g. in CT scans, which are basically 3D x-rays) The simplest approach would be back-projection: reverse the projection process by smearing each projection back across the reconstructed image 43

Back-projection Original image Reconstructions based on different numbers of projections (2, 4, 8, 16, 32) Projections http://www.impactscan.org/slides/impactcourse/basic_principles_of_ct/img12.html The result of back-projection is a blurred version of the original image. How can we fix this? 44

Filtered-back projection It turns out we can fix this problem by applying a specific high-pass filter to each image before back-projection. This is filtered back-projection. 45 http://www.impactscan.org/slides/impactcourse/basic_principles_of_ct/img15.gif

Why does filtered back-projection work? You re not responsible for this To answer this, use the projection slice theorem Projection slice theorem (2D version): The 1D Fourier transform of the 1D projection of a 2D density is equal to the central section perpendicular to the direction of projection of the 2D Fourier transform of the density This theorem holds because each of the 2D sinusoids used in the 2D Fourier transform is constant in one direction 46

Why does filtered back-projection work? You re not responsible for this Back-projection is equivalent to filling in central sections in the Fourier domain The problem is that when reconstructing by back-projection, we overweight the lowfrequency values (in the figure, the density of dots is greatest near the center) To fix this, reduce the weights on lowfrequency components. Ideal filter shape grows linearly with frequency. http://jnm.snmjournals.org/cgi/content-nw/full/42/10/1499/f2 Frank, 2006 Filtered back-projection is a common technique, but there are several alternatives, including direct Fourierdomain reconstruction

This carries over to the 3D case Projection slice theorem (3D version): The 2D Fourier transform of the 2D projection of a 3D density is equal to the central section perpendicular to the direction of projection of the 3D Fourier transform of the density Frank, 2006 You re not responsible for this 48

Computational reconstruction methods 3D reconstruction Structure refinement with unknown view angles 49

Refining a structure If we re not given the view angles for each particle, but we have a decent initial 3D model, then iterate the following steps to improve the model: For each projection (i.e., each class average), find the view angle that best matches the 3D model Given the newly estimated view angles, reconstruct a better 3D model (e.g., using filtered back-projection) This is called 3D projection matching 50

An example Class averages (starting point for reconstruction) Image from Steve Ludtke http://biomachina.org/courses/structures/091.pdf 51

Picking this starting structure is an important task, and how to do it is discussed several slides later. This surface is a contour map. Estimated density is greater than a threshold inside the surface and less than the threshold outside it. Density here corresponds roughly (not precisely) to electron density.

The first iteration was based on a model so it had some detail, but the detail was slightly wrong, so iteration 2 looks more blobby

Final reconstruction Protein: GroEL 6.5 Å resolution Ignore the color coding

A high-resolution single-particle EM structure You can see alpha helixes! And you can see electron densities corresponding to each side-chain. The best EM structures are still not as high-resolution as the best crystal structures, but it s remarkable that you can achieve atomic-resolution structures with this approach. A 3.3 Å resolution EM structure 57 Li et al., Nature Methods 10:584 (2013)

Caveat Structure refinement methods are prone to overfitting Converged model can show features that don t really exist and just reflect noise in the images (analogous to the issue with image clustering) A variety of methods have been developed recently to deal with this issue Some use Bayesian approaches (e.g., RELION software) Some of the most important recent algorithmic developments in single-particle EM are in this area. You re not responsible for these methods. 58

Computational reconstruction methods 3D reconstruction Calculating an initial structure 59

How do we get an initial structural model? Multiple options: You re not responsible for this We might have one from prior experimental work Conduct specialized experiments, often at lower resolution Example: random canonical tilt approach, which requires collecting each image twice, from different angles Direct computational solution Common lines method: relies on the fact that Fourier transforms of different 2D projections share a common line Stochastic hill climbing: a robust projection matching (refinement) approach that often allows random initialization Frank, 2006 60

Computational reconstruction methods 3D reconstruction Fitting atomic-resolution models to lower-resolution EM structure 61

Obtaining atomic-resolution models from lower-resolution EM Often we have high-resolution x- ray crystallography structures of each individual protein in a complex whose low-resolution structure was determined by single-particle EM. We can fit the high-resolution structures into the EM density. 62 Schur et al., Nature 517:505 (2015)

Obtaining atomic-resolution models from lower-resolution EM Approaches based on molecular dynamics simulations can be used to allow the proteins to relax away from their crystallographic structures to better fit the EM density. https://www.youtube.com/watch?v=6knykqcxzfg 63