Single Particle Reconstruction Techniques

Similar documents
Image Alignment and Application of 2D Fast Rotational Matching in Single Particle cryo-electron Microscopy. Yao Cong. Baylor College of Medicine

THREE-DIMENSIONA L ELECTRON MICROSCOP Y OF MACROMOLECULAR ASSEMBLIE S. Visualization of Biological Molecules in Their Native Stat e.

2D Image Alignment and Classification

Initial Model Generation

Single-particle electron microscopy (cryo-electron microscopy) CS/CME/BioE/Biophys/BMI 279 Nov. 16 and 28, 2017 Ron Dror

II: Single particle cryoem - an averaging technique

Supplementary Information. Structure of a RSC nucleosome Complex and Insights into Chromatin Remodeling

Lecture 8. Conical tilt reconstruction

Transmission Electron Microscopy 3D Construction by IPET Method TCBG GPU Workshop

Intrinsic Classification of Single Particle Images by Spectral Clustering

Determination of the image orientation

Random Conical tilt 3D reconstruction

Structural Basis for RNA Processing by the Human. RISC-Loading Complex

Random Conical tilt 3D reconstruction

Single-particle electron microscopy (cryo-electron microscopy) CS/CME/BioE/Biophys/BMI 279 Nov. 16 and 28, 2017 Ron Dror

II: Single particle cryoem - an averaging technique

All rights reserved Image Science Software GmbH Berlin, Germany

Likelihood-based optimization of cryo-em data. Sjors Scheres National Center for Biotechnology CSIC Madrid, Spain

3DEM > GENERAL METHODS. SINGLE PARTICLE RECONSTRUCTION (SPA): Introduction

Structural Information obtained

Vision in the Small: Reconstructing the Structure of Protein Macromolecules from Cryo-Electron Micrographs

From cryo-em images of protein complexes to high-resolution 3D structures

3D Reconstruction in EM

Chapter 1 Introduction

Detecting Consistent Common Lines in Cryo-EM by Voting

Cover Page. The handle holds various files of this Leiden University dissertation

X-RAY crystallography [1], [2] and nuclear magnetic. Computational Approaches for Automatic Structural Analysis of Large Bio-molecular Complexes

Difficult Problems & EMAN2 Introduction

Fundamentals of Digital Image Processing

Introduction to Cryo Electron Microscopy/Tomography

University of Florida CISE department Gator Engineering. Data Preprocessing. Dr. Sanjay Ranka

III: Single particle cryoem - practical approaches

Data Preprocessing. Data Preprocessing

Michael Moody School of Pharmacy University of London 29/39 Brunswick Square London WC1N 1AX, U.K.

3. Image formation, Fourier analysis and CTF theory. Paula da Fonseca

Contents. 1 Test dataset: House Centipede (Scutigera coleoptrata) Hemocyanin 3

FindEM a fast, efficient program for automatic selection of particles from electron micrographs

Crystallography & Cryo-electron microscopy

IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING

Mix-and-Match Tutorial

arxiv: v1 [q-bio.qm] 6 May 2011

Digital Image Processing

Three-dimensional structure and flexibility of a membrane-coating module of the nuclear pore complex

Journal of Structural Biology

JOINT DIRECTION AND VOLUME TOMOGRAPHICAL AB-INITIO RECONSTRUCTION FOR ELECTRON MICROSCOPY

MEDICAL IMAGE ANALYSIS

Digital Image Processing

Lecture 8 Object Descriptors

Blob-Representation of Multidimensional Objects and Surfaces

COMPUTATIONAL CHALLENGES IN HIGH-RESOLUTION CRYO-ELECTRON MICROSCOPY. Thesis by. Peter Anthony Leong. In Partial Fulfillment of the Requirements

An Assembly Automation Approach to Alignment of Noncircular Projections in Electron Microscopy

Maximum-likelihood methods

GENERAL AUTOMATED FLAW DETECTION SCHEME FOR NDE X-RAY IMAGES

Non-Stationary CT Image Noise Spectrum Analysis

Machine Learning Feature Creation and Selection

Machine Learning : supervised versus unsupervised

BMB/Bi/Ch 173 Winter 2017

Machine Learning : Clustering, Self-Organizing Maps

Lecture Topic Projects

Vector Visualization

Lecture 27: Review. Reading: All chapters in ISLR. STATS 202: Data mining and analysis. December 6, 2017

Structure and View Estimation for Tomographic Reconstruction: A Bayesian Approach

Image Analysis, Classification and Change Detection in Remote Sensing

Overview. Spectral Processing of Point- Sampled Geometry. Introduction. Introduction. Fourier Transform. Fourier Transform

Application of optimal sampling lattices on CT image reconstruction and segmentation or three dimensional printing

Equation to LaTeX. Abhinav Rastogi, Sevy Harris. I. Introduction. Segmentation.

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

Correspondence. CS 468 Geometry Processing Algorithms. Maks Ovsjanikov

An easy-to-follow tutorial for anyone new to cistem

Supporting Information

Image Processing in Single Particle Analysis

Ratios and Proportional Relationships (RP) 6 8 Analyze proportional relationships and use them to solve real-world and mathematical problems.

XFEL single particle scattering data classification and assembly

X-ray tomography. X-ray tomography. Applications in Science. X-Rays. Notes. Notes. Notes. Notes

CS 521 Data Mining Techniques Instructor: Abdullah Mueen

Scope and Sequence. Number and Operations. Scope and Sequence. Level 1 Level 2 Level 3. Fractions. Percents

Machine Learning in the Wild. Dealing with Messy Data. Rajmonda S. Caceres. SDS 293 Smith College October 30, 2017

Nature Methods: doi: /nmeth Supplementary Figure 1

Tomographic reconstruction of shock layer flows

Marcel Worring Intelligent Sensory Information Systems

Fourier Transform and Texture Filtering

Data Preprocessing. Why Data Preprocessing? MIT-652 Data Mining Applications. Chapter 3: Data Preprocessing. Multi-Dimensional Measure of Data Quality

Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++

Spectral Classification

Tomography and Subtomogram Averaging

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

The Electron Microscopy Data Bank and OME

Topics. Projections. Review Filtered Backprojection Fan Beam Spiral CT Applications. Bioengineering 280A Principles of Biomedical Imaging

Kapitel 4: Clustering

8 th Grade Mathematics Unpacked Content For the new Common Core standards that will be effective in all North Carolina schools in the

Notes on Single-Particle Reconstruction in Cryogenic Electron Microscopy

Exploratory Data Analysis using Self-Organizing Maps. Madhumanti Ray

Introduction to Medical Imaging. Cone-Beam CT. Klaus Mueller. Computer Science Department Stony Brook University

Based on Raymond J. Mooney s slides

Topics. Review Filtered Backprojection Fan Beam Spiral CT Applications. Bioengineering 280A Principles of Biomedical Imaging

Lecture 3: Geometric and Signal 3D Processing (and some Visualization)

Automatic Particle Selection: Results of A Comparative Study

Journal of Structural Biology 160 (2007) 11 27

Topology representing network enables highly accurate classification of protein images taken by cryo electron-microscope without masking

Thomas Abraham, PhD

Transcription:

T H E U N I V E R S I T Y of T E X A S S C H O O L O F H E A L T H I N F O R M A T I O N S C I E N C E S A T H O U S T O N Single Particle Reconstruction Techniques For students of HI 6001-125 Computational Structural Biology Willy Wriggers, Ph.D. School of Health Information Sciences http://biomachina.org/courses/structures/09.html

Joachim Frank Cryo EM Micrograph of Single Particles

Joachim Frank What is Observed

Main Assumptions 1) All particles in the specimen have identical structure 2) All are linked by 3D rigid body transformations (rotations, translations) 3) Particle images are interpreted as a signal part (= the projection of the common structure) plus noise Important requirement: even angular coverage, without major gaps. Joachim Frank

Joachim Frank How to Get Even Angular Coverage

Joachim Frank Particle Picking

Joachim Frank Particle Picking

Automated Particle Picking Example: CCF-based with local normalization (i) Define a reference (e.g., by averaging projections over full Eulerian range); (ii) Paste reference into array with size matching the size of the micrograph; (iii) Compute CCF via FFT; (iv) Compute locally varying variance of the micrograph via FFT (Roseman, 2003); (v) Local CCF = CCF/local variance (vi) Peak search; (vii) Window particles ranked by peak size; (viii) Fast visual screening. Advantage of local CCF: avoid problems from background variability Joachim Frank

Joachim Frank Performance

Results Many algorithms exist Recent review (current state of the art): Potter C. S. et al. J Struct Biol. (2004)Aug;145:3-14 Joachim Frank

Classification of Images EM images of protein are very noisy and, therefore, the primary process of single-particle analysis is the classification of images according to their Euler angles, the images in each classified group then being averaged to reduce the noise level (Frank et al., 1978; van Heel and Frank, 1981). Classification methods are divided into those that are supervised and those that are unsupervised : Supervised: divide or categorize according to similarity with template or reference. Example for application: projection matching Unsupervised: divide according to intrinsic properties. Example for application: find classes of projections presenting the same view Note: explained in Frank book, chapter 3. Joachim Frank

Joachim Frank Classification

Unsupervised Classification (Clustering) Classification deals with objects in the space in which they are represented. For instance, a 64x64 image is an object in a 4096-dimensional space since in principle each of its pixels can vary independently. Let s say we have 8000 such images. They would form a cloud with 8000 points in this space. Unsupervised classification is a method that is designed to find clusters (regions of cohesiveness) in such a point cloud. Joachim Frank

Dimensonality Reduction Role of Multivariate Statistical Analysis (MSA): find a space ( factor space ) with reduced dimensionality for the representation of the objects. This greatly simplifies classification. Reason for the fact that the space of representation can be much smaller than the original space: resolution limitation (neighborhoods behave the same), and correlations due to the physical origin of the variations (e.g., movement of a structural component is represented by correlated additions and subtractions at the leading and trailing boundaries). MSA very similar to Principal Component Analysis (PCA). Self-Organizing Map (SOM) and Topology-Representing Network (TRN) are neural net based approaches to dimensionality reduction. Joachim Frank

Factor Space / Principal Components: Find new coordinate system, tailored to the data Joachim Frank

Example Brétaudière JP and Frank J (1986) Reconstitution of molecule images analyzed by correspondence analysis: A tool for structural interpretation. J. Microsc. 144, 1-14. Joachim Frank

Joachim Frank Simulated Noisy Data

Eigenimages Extracted from Data (MSA) Joachim Frank Note: explained in Frank book, chapter 4.

Expansion in Factor Space Avrg + F1 Avrg + F1+F2 Avrg + F1+F2+F3 Joachim Frank

Joachim Frank Hierarchical Ascendant Classification

Hierarchical Ascendant Classification Note: Many other clustering techniques exist Joachim Frank

3D Reconstruction Projection Theorem: The 2D Fourier transform of the 2D projection of a 3D density is a central section of the 3D Fourier transform of the density perpendicular to the direction of projection. Joachim Frank

3D Reconstruction Projection Theorem: The 2D Fourier transform of the 2D projection of a 3D density is a central section of the 3D Fourier transform of the density perpendicular to the direction of projection. This holds in Fourier Space.

Angular Reconstitution Real Space: Common Line Projection Theorem Two different 2D projections of the same 3D object always have a 1D line projection in common. van Heel et al, Quarterly Reviews of Biophysics 33, 4 (2000), pp. 307 369.

Sinograms Determine relative orientations with common lines! van Heel et al, Quarterly Reviews of Biophysics 33, 4 (2000), pp. 307 369.

Angular Reconstitution 1) Unsupervised classification, to determine classes of particles exhibiting the same view 2) Unsupervised <-> Reference-free 3) Average images in each class -> class averages 4) Determine common lines between class averages stepwise (van Heel, 1967) simultaneously (Penczek et al., 1996) Issues: unaveraged images are too noisy resolution loss due to implicit use of view range handedness not defined tilt or prior knowledge needed

Reference-Based Projection Matching Reference <-> supervised adapted from an image by Joachim Frank

Joachim Frank Reference-Based Projection Matching

Angular Coverage good poor Joachim Frank

What if Particles are Aligned with Grid? Joachim Frank

Joachim Frank Solution: Tilt of Specimen

Joachim Frank Random Conical Tilt

Random Conical Tilt Premise: all particle exhibit the same view Take same field first at theta ~50 degrees, then at 0 degrees [in this order, to minimize dose] Display both fields side by side Pick each particle in both fields Align particles from 0-degree field This yields azimuths, so that data can be put into the conical geometry Assign azimuths and theta to the tilted particles Proceed with 3D reconstruction Joachim Frank

Random Conical Tilt for Multiple Orientations 1) Find a subset (view class) of particles that lie in the same orientation on the grid: unsupervised classification of 0-degree particles 2) Missing cone problem: do several random conical reconstructions, each from a different subset (view class), find relative orientations, then make reconstruction from merged projections set. Joachim Frank

Joachim Frank Missing Cone Artifacts

Reconstruction Algorithms (a) Fourier interpolation (b) Weighted back-projection (c) Iterative algebraic reconstruction

Fourier Interpolation Obtain samples on a regular Cartesian grid in 3D Fourier space by interpolation between Fourier values on oblique 2D grids (central sections) running through the origin, each grid corresponding to a projection. Joachim Frank

Fourier Interpolation Sample points of adjacent projections are increasingly sparse as we go to higher resolution: Speed (high) versus accuracy (low). Can be used in the beginning phases of a reconstruction project. Joachim Frank

Back Projection Simple back-projection: Sum over back-projection bodies, each obtained by smearing out a projection in the viewing direction: Joachim Frank

Weighted Back Projection Weighted back-projection: as before, but weight the projections first by a function that is tailored to the angular distribution of directions (R* weighting, in X-ray terminology), then inversing the Fourier transform. For general geometries, the weighting function is more complicated, and has to be computed every time. Weighted back-projection is fast, but does not yield the smoothest results. It may show strong artifacts from angular gaps. Joachim Frank

Iterative Algebraic Reconstruction The discrete algebraic projection equation is satisfied, one angle at a time, by adjusting the densities of a starting volume. As iterations proceed, each round produces a better approximation of the object. The algorithm comes in many variants. It allows constraints to be easily implemented. It produces a very smooth reconstruction, and is less affected by angular gaps. Joachim Frank

Joachim Frank Comparison (w/ Missing Cone)

Sources for Limited Resolution Instrumental: partial coherence (envelope function) Particles with different height all considered having same defocus (envelope function) Numerical: interpolations, inaccuracies Failure to exhaust existing information Conformational diversity

Conformational Diversity: Heterogeneous Particle Population Joachim Frank

Example: Low Occupancy of Ternary Complex reconstruction using all data empty ribosome (control) Joachim Frank

Problem Solved by Supervised Classification Joachim Frank

Steve Ludtke The Reconstruction Process

Steve Ludtke Typical Refinement - SPIDER

Steve Ludtke Typical Refinement - IMAGIC

Steve Ludtke Typical Refinement - EMAN

Steve Ludtke Preliminary Model

Steve Ludtke Projections

Steve Ludtke Classification

2D Alignment Resampling the objects in Polar coordinate. The given density objects are expanded in Fourier series: f ( r, β ) = g( r, β ) = m n fˆ gˆ m n ( r) e ( r) e imβ inβ Idea: Rotate both objects, while translate one object along the positive x axis, until match is found. Yao Cong

Yao Cong 2D Alignment

2D Alignment Where: The 2D Fourier transform of correlation function: The correlation function becomes: + = n m mn n m i I e c, ) ( ) ( ) ;, ( ρ ρ φ φ φ φ ) (, β ρ n h r ( ) ( ) rdr r f d r g e e I m n in im mn = ) ( ˆ ˆ ) ( 0 2 0 β ρ β π β ) (, β ρ n h r ( ) ( ) rdr r f d r g e e I m n in im mn = ) ( ˆ ˆ ) ( 0 2 0 β ρ β π β () dr r r f h I n m c m m n r mn ˆ ) ˆ ( 2 ) ( ) ;, ( ˆ, 0 ρ π ρ ρ = = The correlation function is a function of 2 rotations and 1 distance: = 2 ) ; ( ) ( ) ;, ( R ρ φ φ ρ φ φ g f c Here: ( ) ( ) = = n in n m im m e r g r g e r f r f φ β φ β β ρ φ β φ ) ( ˆ ), )( ; ( ) ( ˆ ), )( ( Yao Cong

2D Alignment Particle Image Reference Image Rotate φ f ( r, β ) g ( r, β ) Rotate φ Translate ρ FFT 1D ( r ) f ˆ m FFT 1D ˆ ( r ) f m g n ρ +1 1 h ˆ n r, ρ Precomputing ( m ) BEST FIT For each ρ, update Max. Corre. value + corresp. φ and φ Compute Correlation c ( φ, φ ; ρ ) = FFT Compute Fourier Transform of Correlation n ˆ r, ρ 0 c ( m, n ; ρ ) = 2 π ( ˆ) 1 2 D c (ˆ h ) m f ˆ m () r r dr Yao Cong

Steve Ludtke Class Averages

Steve Ludtke Iteration 1

Steve Ludtke Iteration 2

Steve Ludtke Iteration 3

Steve Ludtke Iteration 4

Steve Ludtke GroEL Reconstruction at 6.5Å

Steve Ludtke Comparison with Xtal Structure

Steve Ludtke Caveat: Model Bias

Steve Ludtke Caveat: Model Bias

Steve Ludtke Caveat: Model Bias

Steve Ludtke Caveat: Model Bias

Caveat: Model Bias better reference model and more iterations! Steve Ludtke

Steve Ludtke CTF Correction

Steve Ludtke CTF Correction

Steve Ludtke CTF Correction

Steve Ludtke CTF Correction - SPIDER

Steve Ludtke CTF Correction - IMAGIC

Steve Ludtke CTF Correction - EMAN

Resources Textbook: Chapters 3,4,5 in: Joachim Frank, Three-Dimensional Electron Microscopy of Macromolecular Assemblies (1996, Academic Press)