Outline. Applications of FFT in Communications. Fundamental FFT Algorithms. FFT Circuit Design Architectures. Conclusions

Similar documents
Chapter 3 Classification of FFT Processor Algorithms

Fast Fourier Transform (FFT) Algorithms

Fast Fourier Transform (FFT)

EE123 Digital Signal Processing

Introduction to FFT Processors

TOPICS PIPELINE IMPLEMENTATIONS OF THE FAST FOURIER TRANSFORM (FFT) DISCRETE FOURIER TRANSFORM (DFT) INVERSE DFT (IDFT) Consulted work:

Design of Efficient Pipelined Radix-2 2 Single Path Delay Feedback FFT

Introduction to FFT Processors. Chih-Wei Liu VLSI Signal Processing Lab Department of Electronics Engineering National Chiao-Tung University

The following algorithms have been tested as a method of converting an I.F. from 16 to 512 MHz to 31 real 16 MHz USB channels:

DESIGN METHODOLOGY. 5.1 General

ELEG 5173L Digital Signal Processing Introduction to TMS320C6713 DSK

Analysis of Radix- SDF Pipeline FFT Architecture in VLSI Using Chip Scope

INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ENGINEERING AND TECHNOLOGY (IJARET) A NEW RADIX-4 FFT ALGORITHM

Computation of DFT. W has a specific structure and because

Computation of DFT. point DFT. W has a specific structure and because

ENT 315 Medical Signal Processing CHAPTER 3 FAST FOURIER TRANSFORM. Dr. Lim Chee Chin

Spectral leakage and windowing

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control

Research Article Design of A Novel 8-point Modified R2MDC with Pipelined Technique for High Speed OFDM Applications

FPGA Implementation of Pipeline Digit-Slicing Multiplier-Less Radix 2 2 DIF SDF Butterfly for Fast Fourier Transform Structure

Appendix D. Controller Implementation

Computer Graphics Hardware An Overview

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation

SPIRAL DSP Transform Compiler:

Announcements. Reading. Project #4 is on the web. Homework #1. Midterm #2. Chapter 4 ( ) Note policy about project #3 missing components

Reconfigurable FFT Processor A Broader Perspective Survey

Elementary Educational Computer

1. SWITCHING FUNDAMENTALS

6. Fast Fourier Transform

Precise Psychoacoustic Correction Method Based on Calculation of JND Level

CTx / CTx-II. Ultra Compact SD COFDM Concealment Transmitters. Features: Options: Accessories: Applications:

Twiddle Factor Transformation for Pipelined FFT Processing

SRx. HD/SD Dual Input Diversity COFDM Receiver. Features. Options. Accessories. Applications

Chapter 3. Floating Point Arithmetic

SRx. HD/SD Dual Input Diversity COFDM Receiver. Features. Options

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design

Generation of Distributed Arithmetic Designs for Reconfigurable Applications

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Part A Datapath Design

Multiprocessors. HPC Prof. Robert van Engelen

APPLICATION NOTE PACE1750AE BUILT-IN FUNCTIONS

Lecture 2. RTL Design Methodology. Transition from Pseudocode & Interface to a Corresponding Block Diagram

ISSN: [Kavitha* et al., (6): 3 March-2017] Impact Factor: 4.116

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

Big-O Analysis. Asymptotics

REAL TIME DIGITAL SIGNAL PROCESSING

Reversible Realization of Quaternary Decoder, Multiplexer, and Demultiplexer Circuits

1&1 Next Level Hosting

Filter design. 1 Design considerations: a framework. 2 Finite impulse response (FIR) filter design

ANN WHICH COVERS MLP AND RBF

Computer Systems - HS

ALU Augmentation for MPEG-4 Repetitive Padding

The Serial Commutator FFT

EEO 401 Digital Signal Processing Prof. Mark Fowler

UNIVERSITY OF MORATUWA

CMSC Computer Architecture Lecture 11: More Caches. Prof. Yanjing Li University of Chicago

FFT: An example of complex combinational circuits

FFT/IFFTProcessor IP Core Datasheet

An Efficient Implementation Method of Fractal Image Compression on Dynamically Reconfigurable Architecture

Panel Methods : Mini-Lecture. David Willis

Multi-Threading. Hyper-, Multi-, and Simultaneous Thread Execution

k (check node degree) and j (variable node degree)

FFT. There are many ways to decompose an FFT [Rabiner and Gold] The simplest ones are radix-2 Computation made up of radix-2 butterflies X = A + BW

Fixed Point Streaming Fft Processor For Ofdm

Big-O Analysis. Asymptotics

ICS Regent. Communications Modules. Module Operation. RS-232, RS-422 and RS-485 (T3150A) PD-6002

Polynomial Functions and Models. Learning Objectives. Polynomials. P (x) = a n x n + a n 1 x n a 1 x + a 0, a n 0

EE260: Digital Design, Spring /16/18. n Example: m 0 (=x 1 x 2 ) is adjacent to m 1 (=x 1 x 2 ) and m 2 (=x 1 x 2 ) but NOT m 3 (=x 1 x 2 )

Efficient Hardware Design for Implementation of Matrix Multiplication by using PPI-SO

CMSC Computer Architecture Lecture 5: Pipelining. Prof. Yanjing Li University of Chicago

FAST FOURIER TRANSFORM (FFT) and inverse fast

SERIAL COMMUNICATION INTERFACE FOR ESA ESTRO

CMSC Computer Architecture Lecture 10: Caches. Prof. Yanjing Li University of Chicago

How do we evaluate algorithms?

Introduction to Digital Signal Processing Systems

Lecture 3. RTL Design Methodology. Transition from Pseudocode & Interface to a Corresponding Block Diagram

CMSC22200 Computer Architecture Lecture 9: Out-of-Order, SIMD, VLIW. Prof. Yanjing Li University of Chicago

Digital Signal Processing. Soma Biswas

Quality of Service. Spring 2018 CS 438 Staff - University of Illinois 1

EE 435. Lecture 26. Data Converters. Architectures. Characterization

Asymmetrical Load-Balancing for Incremental Fast Fourier Transform on Multi-Core Processors

FAST Fourier transform (FFT) is an important signal processing

COARSE ANGLE ROTATION MODE CORDIC BASED SINGLE PROCESSING ELEMENT QR-RLS PROCESSOR

A Note on Least-norm Solution of Global WireWarping

ON CONFIGURATION OF RESIDUE SCALING PROCESS IN PIPELINED RADIX-4 MQRNS FFT PROCESSOR

Low Power Complex Multiplier based FFT Processor

INTRODUCTION TO THE FAST FOURIER TRANSFORM ALGORITHM

End Semester Examination CSE, III Yr. (I Sem), 30002: Computer Organization

ECE4050 Data Structures and Algorithms. Lecture 6: Searching

Computer Architecture. Microcomputer Architecture and Interfacing Colorado School of Mines Professor William Hoff

DSP ELEMENTS IN MAX/MSP

( n+1 2 ) , position=(7+1)/2 =4,(median is observation #4) Median=10lb

AN OPTIMIZATION NETWORK FOR MATRIX INVERSION

SAMPLE VERSUS POPULATION. Population - consists of all possible measurements that can be made on a particular item or procedure.

Introduction to Wireless & Mobile Systems. Chapter 6. Multiple Radio Access Cengage Learning Engineering. All Rights Reserved.

Lazy Type Changes in Object-oriented Database. Shan Ming Woo and Barbara Liskov MIT Lab. for Computer Science December 1999

Math Section 2.2 Polynomial Functions

Chapter 4 The Datapath

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Single-Cycle Disadvantages & Advantages

Bezier curves. Figure 2 shows cubic Bezier curves for various control points. In a Bezier curve, only

Transcription:

FFT Circuit Desig

Outlie Applicatios of FFT i Commuicatios Fudametal FFT Algorithms FFT Circuit Desig Architectures Coclusios

DAB Receiver Tuer OFDM Demodulator Chael Decoder Mpeg Audio Decoder 56/5/ 4/48 poit FFT Packet Demux Cotroller Cotrol Pael 3

LA OFDM System TRASMITTER FEC Coder S/P IFFT 64-pt Guard Iterval Isertio D/A LPF Up Coverter MAC Layer 6Mbps ~ 54Mbps RECEIVER FEC Decoder P/S FFT 64-pt Guard Iterval Removal A/D LPF Dow Coverter 4

ADSL (Discrete Multi-tue) System TRASMITTER Data I S/P QAM ecoders IFFT 5-pt add cyclic prefix P/S D/A + trasmit filter RECEIVER chael Data Out P/S QAM decoders FEQ FFT 5-pt S/P remove cyclic prefix TEQ receive filter + A/D 5

Applicatios of FFT i Commuicatios Comm. LA DAB DVB ADSL VDSL System FFT Size 64 56/5/ 4/48 48/ 89 5 5/4/ 48/496 OFDM DMT 6

Outlie Applicatios of FFT i Commuicatios Fudametal FFT Algorithms FFT Circuit Desig Architectures Coclusios 7

Fudametal FFT Algorithms Discrete Fourier Trasfer Pair Radix- FFT ( ν ) Decimatio-i-time (DIT) Decimatio-i-frequecy (DIF) FFT for composite ( ) Cooley-Tukey Algorithms Radix-r FFT 8

Discrete Fourier Trasform Pair Let DFT x[ ] X[ k] deote a DFP pair. e have X [ k ] x[ ] k, k,,...,, x[ ] X [ k ] k,,,...,, here, e j ( π / ). 9

Observatios k is -periodic. k is cojugate symmetric. Both x[] ad X[k] are -periodic. If x[] is real, the X[k] is cojugate symmetric ad vice versa.

Observatios A direct calculatio requires approximately complex multiplicatios ad additios. FFT algorithms reduce the computatio complexity to the order of log. Algorithms developed for FFT also works for IFFT with oly mior modificatios.

Example: Zero-Paddig (LA) Subcarriers LA 5 sub-carriers: use 64-poit FFT. ull # #.. #6 ull ull ull #-6.. #- # 6 7 37 38 6 63 IFFT 6 7 37 38 6 63 Time Domai Outputs

3 Decimatio-i-Time Radix- FFT,, ] [ ] [ ] [ ] [ / / / / + + + k H[k] G[k] r x r x x k X k kr r k kr r k K Assume is a eve umber.

Observatios G[k] is DFT of eve samples of x[]. H[k] is DFT of odd samples of x[]. G[k] ad H[k] are /-periodic. k+/ - k. 4

DIT Radix- FFT X [ r ] G[r] + r H[r], X [ r + / ] G[r] + ( r + / ) H[r], G[r] r < / -. r H[r], G[r] X[r] r H[r] X[r+/] - r 5

Decimatio-i-Time Radix- FFT Butterfly for Radix- DIT FFT (M) th stage r M th stage - r (M) th stage M th stage r I-place Computatio 6

Decimatio-i-Time Radix- FFT x[] x[] x[4] x[6] x[] x[3] x[5] x[7] First layer decimatio /-poit DFT /-poit DFT G[] G[] G[] G[3] H[] H[] H[] H[3] 3 X[] X[] X[] X[3] X[4] X[5] X[6] X[7] 7

Decimatio-i-Time Radix- FFT x[] X[] x[4] X[] x[] X[] x[6] X[3] x[] X[4] x[5] X[5] x[3] X[6] x[7] 3 X[7] 8

Bit Reversal x[ ] x[ ] x[ ] x[ ] x[ ] x[ ] x[ ] x[ ] x[ ] 9

Decimatio-i-frequecy Radix- FFT,, ]) / [ ] [ ( ] [ ]) / [ ] [ ( ] [,,, ] [ ] [ / ) / ( / ) / ( - / r x x r X x x r X k x k X r r k K K + + + + Assume is a eve umber.

Decimatio-i-frequecy Radix- FFT,, where,, ] [ ] [, ] [ ] [ / ) / ( / ) / ( - / r ]) / x[ (x[] h[] ]) / x[ (x[] g[] h r X g r X r r K + + + +

Decimatio-i-frequecy Radix- FFT Butterfly for Radix- DIF FFT (M) th stage M th stage I-place Computatio

Decimatio-i-frequecy Radix- FFT First layer decimatio x[] g[] X[] x[] x[] x[3] g[] g[] g[3] /-poit DFT X[] X[4] X[6] x[4] x[5] x[6] x[7] h[] h[] h[] h[3] 3 /-poit DFT X[] X[3] X[5] X[7] 3

Decimatio-i-frequecy Radix- FFT x[] X[] x[] X[4] x[] X[] x[3] X[6] x[4] X[] x5] X[5] x[6] X3] x7] 3 X7] 4

Butterfly Compariso Butterfly (decimatio-i-frequecy) (M) th stage M th stage Butterfly (decimatio-i-time) (M) th stage M th stage r 5

6 Cooley-Tukey Algorithm ]. k X[ ] [ ], x[ ] [ : t arrageme - re poit D,,,,,, k k X x k k k k + + + +

7 Cooley-Tukey Algorithms, ] [ ] [ k k k x k X + ], [ k G Twiddle factor ], [ ~ k G

Observatios, / -> st stage of the decimatio i frequecy radix- FFT. /, -> st stage of the decimatio i time radix- FFT. I geeral,. If r -> Radix-r. 8

Radix-3 FFT (DIF) Assume is a multiple of 3. X[ k] X[3r ] X[3r + ] X[3r + ] x[ ] ( /3) ( /3) r ( /3) r k ( x[ ] + x[ + /3] + x[ + /3]) ( x[ ] + x[ + /3] e ( x[ ] + x[ + /3] e jπ 3 jπ 3 r /3 + x[ + /3] e + x[ + /3] e jπ 3 jπ 3 ) ) r /3 r /3 9

Radix-3 FFT (DIF) Butterfly for Radix-3 DIF FFT (M) th stage 3 e jπ j π 3 e M th stage j π 3 e 3 e jπ 3

Radix-4 FFT (DIF) Assume is a multiple of 4. X[4 r] X[4 r+ ] X[4 r+ ] X[4 r+ 3] ( /4) ( /4) r ( /4) r ( /4) r ( x[ ] + x[ + /4] + x[ + /4] + x[ + 3 /4]) r /4 ( x[ ] + ( j) x[ + /4] + ( ) x[ + /4] + jx [ + /4]) ( x[ ] + ( ) x[ + /4] + x[ + /4] + ( ) x[ + 3 /4]) 3 r /4 ( x[ ] + jx [ + /4] + ( ) x[ + /4] + ( j) x[ + 3 /4]) r /4 r /4 3

Radix-4 FFT (DIF) Butterfly for Radix-4 DIF FFT (M) th stage M th stage 3

Split Radix FFT Mix Radix- ad Radix-4 architecture. Compute eve trasform coefficiets based o Radix- strategy ad odd coefficiets based o Radix-4 strategy. Ca perform FFT for ν. 33

Simplify Butterfly Represetatios Radix- Radix-4 34

Split-Radix FFT 35

Computatioal Complexity Method # of Complex Multiplicatios # of Complex Additios DFT () Radix- (/) log log Radix-4 (3/8) log (3/) log The above umbers do ot tell the whole story! Architecture is the key issue to trade of amog performace, cost, hardware complexity, etc. 36

Outlie Applicatios of FFT i Commuicatios Fudametal FFT Algorithms FFT Circuit Desig Architectures Coclusios 37

FFT Architecture Desig Cosideratios Trade-off amog accuracy, speed, hardware complexity, ad power cosumptio best fit architecture should be applicatio depedet. Mai architecture differeces i: Degrees of parallelism umber ad complexity of processig elemets, Cotrol schemes - hardware utilizatio ad data flow cotrol. 38

Degree of Parallelism Oe simple processig uit or multiple simple processig uits x[] X[] x[4] X[] x[] x[6] x[] X[] X[3] X[4] x[5] x[3] x[7] 3 X[5] X[6] X[7] 39

Degree of Parallelism Simple processig uits versus complicate processig uits 4

Memory-based FFT architecture Sigle butterfly or processig elemet. Required memory size. A cotrol uit esures the right data flows to compute FFT. Firmware Like. Low complexity. Low speed. 4

Memory-based FFT Block Diagram Data I Iput Buffer Coefficiets ROM or Geerator Butterfly or Processig Elemet RAM Data Out Cotrol Cotrol Uit 4

Pipelie Architectures FFT Sigal Flow Graph Multiple path delay commutator Sigle path delay commutator Sigle path delay feedback 43

Radix- Sigal Flow Graph (DIT) x[] X[] x[4] x[] x[6] x[] x[5] x[3] x[7] 3 X[] X[] X[3] X[4] X[5] X[6] X[7] BF Buffer BF Buffer BF ROM ROM ROM 44

Radix- Sigal Flow Graph (DIF) x[] x[] x[] x[3] x[4] x5] x[6] x7] 3 X[] X[4] X[] X[6] X[] X[5] X3] X7] BF Buffer BF Buffer BF ROM ROM ROM 45

Multi-Path Delay Commutator Delay Commutator (switch) Butterfly Delay Delay Delay 46

Radix- Multi-Path Delay Commutator x[] x[] x[] x[3] x[4] x5] x[6] x7] 7 6 5 4 3 3 switch 4 5 6 7 3 delay 3 4 5 6 7 butterfly 3 4 5 6 7 X[] X[4] X[] X[6] X[] X[5] X3] X7] delay 3 4 5 6 7 switch 5 4 7 6 3 delay 5 4 7 6 3 5 4 7 6 3 butterfly delay 5 4 7 6 3 switch 6 4 7 5 3 delay 6 4 7 5 3 butterfly 6 4 7 5 3 47

Radix- Multi-Path Delay Commutator 8 4 C BF C BF C BF C BF 4 6 48

Radix-4 Multi-Path Delay Commutator 9 48 3 8 C4 BF4 64 6 3 C4 3 6 BF4 4 8 C4 8 4 BF4 C4 BF4 48 3 56 49

Sigle Path Delay Commutator Delay Commutator Butterfly 5

Radix- Sigle Path Delay Commutator DC BF DC BF DC BF DC BF 6 5

Radix-4 Sigle Path Delay Commutator DC4 BF4 DC4 BF4 DC4 BF4 DC4 BF4 56 5

Sigle Path Delay Feedback Delay Butterfly 53

Radix- Sigle Path Delay Feedback 8 4 BF BF BF BF 6 54

Radix-4 Sigle Path Delay Feedback 64x3 6x3 4x3 x3 BF4 BF4 BF4 BF4 56 55

R SDF 8 64 3 6 8 4 BF I BF II BF I BF II BF I BF II BF I BF II 56 56

Hardware Compariso Architecture Multiplier # Adder # Memory Size Cotrol RMDC RSDF R4MDC R4SDF R4SDC R SDF (log 4 ) (log 4 ) 3(log 4 ) log 4 log 4 log 4 4 log 4 4 log 4 8 log 4 8 log 4 3 log 4 4 log 4 3/- 5/-4 - simple simple simple medium complex simple 57

Coclusios Effect FFT computatio is essetial to may commuicatio applicatios utilizig OFDM or DMT techique. A pipelied FFT architecture is applied where a high real-time performace is required. A memory-based FFT architecture ca be adopted whe cost is more cocered tha speed. A best fit FFT architecture depeds o applicatio specific requiremets to trade off amog accuracy, speed, chip size, power cosumptio, etc. 58