Introduction to FFT Processors
|
|
- Bryce Daniels
- 5 years ago
- Views:
Transcription
1 Itroductio to FFT Processors Chih-ei Liu VLSI Siga Processig Lab Departmet of Eectroics Egieerig atioa Chiao-Tug Uiversity FFT Desig FFT Cosists of a series of compe additios ad compe mutipicatios Agorithm Cooey-Tuey decompositio for power of two egth FFT Architecture Systematic mappig procedure
2 Agorithm Leve Cooey-Tuey decompositio Radi-, decimatio-i-frequecy A A Variats based o CT agorithm Fied radi: Radi-, Radi-, Radi-, Radi- Mied radi: Spit-radi, Radi-, Radi- umber of additio Same for ay mied-radi or fied-radi agorithm. umber of mutipicatio Depeds o the reductio of trivia mutipicatios. - A A Hece, icrease additios FFT Agorithms Review of Radi- r agorithm DIFdecimatio i frequecy ad DITdecimatio i time versio Radi- agorithm Radi- ad Radi- agorithm Radi- ad Radi- agorithm Spit-radi ad Spit-radi
3 FFT Agorithms DFT X e π j,,, a jb * a jb * 5 7 j j j [ a b j b a] [ b a j b a] FFT Agorithms Radi- Agorithm DIF Radi- Agorithm X [ ] X [ ] Butterfy of Radi- Agorithm,, K,. DIF Form
4 FFT Agorithms Radi- Agorithm X [ ] Radi- Agorithm X,,,,; ~ ; [ ] ` {[ ] j [ ]},; ~. FFT Agorithms Butterfy of Radi- Agorithm Data Orderig: Digit Reversed
5 FFT Agorithms Data Orderig of Radi-... Digit-reversed orderig FFT Agorithms Butterfy of radi- Agorithm Data Orderig: Bit Reversed 5
6 FFT Agorithms X Data Orderig of Radi- Bit-reversed orderig m m m m m m X 7 7 } ] 7 5 [ ] {[ ] [ FFT Agorithms DIF Radi- Agorithm,,,,,5,,7;. ~
7 7 } ] 7 5 [ ] {[ } ] 7 5 [ ] {[ FFT Agorithms DIF Radi- Agorithm X,;,,. ~ FFT Agorithms Butterfy of Radi- Agorithm
8 FFT Agorithms Butterfy of Radi- Agorithm FFT Agorithms DIF Spit-Radi Agorithm X [ ] X { j[ ]} X { j[ ]} i X is from to -, ad i X ad X are from to -
9 FFT Agorithms Butterfy of Spit-Radi Agorithm FFT Agorithms Advatage of Radi- Agorithm Low Computatioa Compeity Feibe as radi- agorithm Bit reversed output whe ormay ordered iput 9
10 X X } ] 7 5 [ ] {[ ] [ FFT Agorithms DIF Spit-Radi Agorithm,,5,7 FFT Agorithms Butterfy of Spit-Radi Agorithm
11 Mutipicative Compeity Trivia mutipicatios i FFT Mutipied by Radi-: removed Radi-: ad j partiay removed Spit-radi: ad j removed Radi-:, j, j partiay removed Radi-:, j, j removed Radi- Siga Fow Graph
12 Spit-Radi Siga Fow Graph Mutipicative Compeity Radi- Cost. Mu Cost. Mu Radi- Spit- Radi Radi- Radi How to obtai reguar SR FFT architecture?
13 Architecture Leve Mappig procedure Systoic array techiques Operatio scheduig, resource sharig Pipeie architecture Oe-dimesioa iear array Deay-feedbac vs. Deay-commutator. Sige PE architecture Shared-memory, Sige Processig Eemet PE RMDC Radi- Muti-Path Deay Commutator X X X X X X X X X X9 X5 X X X X7 X5 Z - Z - Z - Z - Z - Z - Z - Deay Commutator or Deay-Switch Switch-Deay
14 st st ad d stages i RMDC Iput pairs : Iput pairs : Iput pairs : Iput pairs : X X 9 X X X X X X X X9 X5 X X X X7 X5 Stage Stage Stage Stage RMDC Radi- Muti-Path Deay Commutator X X X X X 5 X5 X9 7 9 X X X X X X 9 X7 X 5 X5 Stage Stage
15 RrMDC r r r r Iput stage r th stage r r r r r r r r r r r r r r r r Deay Feedbac RSDF RSDF R SDF 5
16 RSDF Radi- Sige-Path Deay Feedbac RSDF vs. RSDF
17 Buffer Styes of pipeie architecture R deay-commutator: iefficiet 5% MEM usage. RMDC R deay-feedbac: % MEM usage.rsdf Sige PE Architecture 7
18 Cocudig Remars The Spit-Radi agorithm has ess computatio compeity, comparig with the fied Radi agorithm. However, its butterfy operatio is irreguar L-shape. The processig speed of pipeie architecture is faster tha sige-pe architecture. However, the sige PE architecture is the most areaefficiet, especiay for og egth FFTIFFT appicatio. Review Traditioa FFT Desig Steps. Give -poit FFT spec., choose fied-radi agorithm. Desig radi-r butterfy, mutipier, etc.. Cascade og r stages to compute poit FFT. Arbitrary radi ca be used Base o Cooey-Tuey decompositio for ay composite umber
19 Probem of Traditioa Approach Caot drive architecture for mied-radi agorithm The processig speed is o oger the critica issue ay more owadays. The chip area ad the power cosumptio domiate the desig quaity. Re-cofigurabe FFTIFFT architecture desig is ecessary for various appicatios. A egth-scaabe ad atecy-specified FFTIFFT core is ecessary. Proposed Soutio e impemet FFT modue by sige PE architecture Mutipe-port Memory Reg Radi-r Butterfy Processig Eemet Reg Pre-fetch buffer 9
20 Desig Issue Performace-eough, Chip area, power cosumptio. Scaabe processig eemet. Limited Storage bocs. Efficiet memory address geerator. Agorithm Leve e adopt spit-radi agorithm to reaize the FFT modue. X X A X j X X j X X j X X j X A A
21 The Kere of Processig Eemet A A A A A A 5 A A 7 A A 9 A A A A A A 5 j j j j j j 9 j j j X X X X X X X X X X 9 X 5 X X X X 7 X 5 Foded Butterfy Uits Comparig with Radi-Radi-, it saves haf memory access times. Feedbac path Mu Mu Mu Mu Butterfy uit Butterfy uit
22 Storage Bocs e use mutipe sige-port memory bas to repace the muti-port memory. The cocept of cofict-free memory. Verte coorig probem Scaabe Memory Address Geerator There must eist a soutio for such verte coorig probem. The best soutio --- The proposed Itereave Rotated Data Aocatio IRDA agorithm. Address GeeratorAG for -egth AT AT AT AT Address Switcher AS Address GeeratorAG for -egth Rotator RAM- RAM- RAM- RAM- RAM- RAM- RAM- RAM-
23 The IRDA Cocept A cofict-free memory bas. Simpe ad egthscaabe desig. The circuar shift rotator RAM-A RAM-B RAM-C RAM-D Legth-Scaabe FFTIFFT Core Address geerator RAM -A RAM -B RAM -C RAM -D Adder Reg Radi- butterfy processig eemet Radi- butterfy processig eemet
24 Further Performace Improvemet Mutipe PEs architecture. pipeie PEs, for eampe. RAM RAM RAM RAM Group RAM RAM 5 RAM RAM 7 Group The Cached-FFT Agorithm
25 Overview. Iput data are oaded ito a -word mai memory.. C of the words are oaded ito the cache.. As may butterfies as possibe are computed usig the data i the cache.. Processed data i the cache are fushed to mai memory. 5. Steps - are repeated uti a words have bee processed oce.. Steps -5 are repeated uti the FFT has bee competed. Processor cache Mai Memory Resut X X X X X X 5 X X 7 X X 9 X X X X X X 5 5
26 , E, Radi- Cached-FFT
Introduction to FFT Processors. Chih-Wei Liu VLSI Signal Processing Lab Department of Electronics Engineering National Chiao-Tung University
Itroductio to FFT Processors Chihei Liu VLSI Siga Processig Lab Departmet of Eectroics Egieerig atioa ChiaoTug Uiversity FFT Desig FFT Cosists of a series of compe additios ad compe mutipicatios Agorithm
More informationOutline. Applications of FFT in Communications. Fundamental FFT Algorithms. FFT Circuit Design Architectures. Conclusions
FFT Circuit Desig Outlie Applicatios of FFT i Commuicatios Fudametal FFT Algorithms FFT Circuit Desig Architectures Coclusios DAB Receiver Tuer OFDM Demodulator Chael Decoder Mpeg Audio Decoder 56/5/ 4/48
More informationChapter 3 Classification of FFT Processor Algorithms
Chapter Classificatio of FFT Processor Algorithms The computatioal complexity of the Discrete Fourier trasform (DFT) is very high. It requires () 2 complex multiplicatios ad () complex additios [5]. As
More informationDesign of Efficient Pipelined Radix-2 2 Single Path Delay Feedback FFT
IOSR Joural of VLSI ad Sigal Processig IOSR-JVSP Volume Issue Ver. I May-Ju. 0 PP 88-9 e-iss: 9 00 p-iss o. : 9 97 www.iosrjourals.org Desig of Efficiet Pipelied Radi- Sigle Path Delay Feedbac FFT isha
More informationFast Fourier Transform (FFT) Algorithms
Fast Fourier Trasform FFT Algorithms Relatio to the z-trasform elsewhere, ozero, z x z X x [ ] 2 ~ elsewhere,, ~ e j x X x x π j e z z X X π 2 ~ The DFS X represets evely spaced samples of the z- trasform
More informationFast Fourier Transform (FFT)
IC Desig Lab. Fast Fourier Trasfor FFT, - e e π π G Twiddle Factor : e π Radi- DIT FFT: IC Desig Lab. π cos si π Syetry Property Periodicity Property u u u * Cougate Syetric Property a a b b a b b a b
More informationEE123 Digital Signal Processing
Last Time EE Digital Sigal Processig Lecture 7 Block Covolutio, Overlap ad Add, FFT Discrete Fourier Trasform Properties of the Liear covolutio through circular Today Liear covolutio with Overlap ad add
More informationCOMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Part A Datapath Design
COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter The Processor Part A path Desig Itroductio CPU performace factors Istructio cout Determied by ISA ad compiler. CPI ad
More informationComputation of DFT. W has a specific structure and because
DSP 7 Computatio of DFT CTU EE Computatio of DFT Efficiet algoithms fo computig DFT Fast Fouie Tasfom. a Compute oly a few poits out of all poits b Compute all poits hat ae the efficiecy citeia? umbe of
More informationRecursion. A problem solving technique where an algorithm is defined in terms of itself. A recursive method is a method that calls itself
Recursio 1 A probem sovig techique where a agorithm is defied i terms of itsef A recursive method is a method that cas itsef A recursive agorithm breaks dow the iput or the search space ad appies the same
More informationFPGA Implementation of Pipeline Digit-Slicing Multiplier-Less Radix 2 2 DIF SDF Butterfly for Fast Fourier Transform Structure
FPGA Implemetatio of Pipelie Slicig MultiplierLess Radix DIF SDF Butterfly for Fast Fourier Trasform Structure Yaza Samir Algabi,. Rozita Teymourzadeh, Masuri Othma, Md Shabiul Islam Istitute of MicroEgieerig
More informationCSC 220: Computer Organization Unit 11 Basic Computer Organization and Design
College of Computer ad Iformatio Scieces Departmet of Computer Sciece CSC 220: Computer Orgaizatio Uit 11 Basic Computer Orgaizatio ad Desig 1 For the rest of the semester, we ll focus o computer architecture:
More informationComputation of DFT. point DFT. W has a specific structure and because
DSP 5 Spig Computatio of DFT CTU EE Computatio of DFT Efficiet algoithms fo computig DFT Fast Fouie Tasfom. a Compute oly a few poits out of all poits b Compute all poits hat ae the efficiecy citeia? umbe
More informationELEG 5173L Digital Signal Processing Introduction to TMS320C6713 DSK
Departmet of Electrical Egieerig Uiversity of Arasas ELEG 5173L Digital Sigal Processig Itroductio to TMS320C6713 DSK Dr. Jigia Wu wuj@uar.edu ANALOG V.S DIGITAL 2 Aalog sigal processig ASP Aalog sigal
More informationReversible Realization of Quaternary Decoder, Multiplexer, and Demultiplexer Circuits
Egieerig Letters, :, EL Reversible Realizatio of Quaterary Decoder, Multiplexer, ad Demultiplexer Circuits Mozammel H.. Kha, Member, ENG bstract quaterary reversible circuit is more compact tha the correspodig
More informationLower Bounds for Sorting
Liear Sortig Topics Covered: Lower Bouds for Sortig Coutig Sort Radix Sort Bucket Sort Lower Bouds for Sortig Compariso vs. o-compariso sortig Decisio tree model Worst case lower boud Compariso Sortig
More informationCMSC Computer Architecture Lecture 12: Virtual Memory. Prof. Yanjing Li University of Chicago
CMSC 22200 Computer Architecture Lecture 12: Virtual Memory Prof. Yajig Li Uiversity of Chicago A System with Physical Memory Oly Examples: most Cray machies early PCs Memory early all embedded systems
More information6.081 March Lecture 6
Review exampe : Aut Zeda s ba accout growig from some iitia baace y [ ] y[.05y[ y[ ].05y[ Natura frequecy is.05 soutio grows as.05 Review exampe : Your robot from ab d d right d eft ] δtθ[ θ[ ] θ[ δtk
More informationMorgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5
Morga Kaufma Publishers 26 February, 28 COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 5 Set-Associative Cache Architecture Performace Summary Whe CPU performace icreases:
More informationThe following algorithms have been tested as a method of converting an I.F. from 16 to 512 MHz to 31 real 16 MHz USB channels:
DBE Memo#1 MARK 5 MEMO #18 MASSACHUSETTS INSTITUTE OF TECHNOLOGY HAYSTACK OBSERVATORY WESTFORD, MASSACHUSETTS 1886 November 19, 24 Telephoe: 978-692-4764 Fax: 781-981-59 To: From: Mark 5 Developmet Group
More informationImprovement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation
Improvemet of the Orthogoal Code Covolutio Capabilities Usig FPGA Implemetatio Naima Kaabouch, Member, IEEE, Apara Dhirde, Member, IEEE, Saleh Faruque, Member, IEEE Departmet of Electrical Egieerig, Uiversity
More informationMINKOWSKI SET OPERATORS. Daniela VELICHOVÁ
Te Joura of Pois Society for Geometry ad Egieerig Grapics Voume 4 (013), 45-49 45 MINKOWSKI SET OPERATORS Daiea VELICHOVÁ Sova Uiversity of Tecoogy, Istitute of Matematics ad Pysics, Nám. Sobody 17, 81
More informationCopyright 2016 Ramez Elmasri and Shamkant B. Navathe
Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 22 Database Recovery Techiques Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio Recovery algorithms Recovery cocepts Write-ahead
More informationLU Decomposition Method
SOLUTION OF SIMULTANEOUS LINEAR EQUATIONS LU Decompositio Method Jamie Traha, Autar Kaw, Kevi Marti Uiversity of South Florida Uited States of America kaw@eg.usf.edu http://umericalmethods.eg.usf.edu Itroductio
More informationBehavioral Modeling in Verilog
Behavioral Modelig i Verilog COE 202 Digital Logic Desig Dr. Muhamed Mudawar Kig Fahd Uiversity of Petroleum ad Mierals Presetatio Outlie Itroductio to Dataflow ad Behavioral Modelig Verilog Operators
More informationINTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ENGINEERING AND TECHNOLOGY (IJARET) A NEW RADIX-4 FFT ALGORITHM
ITERATIOAL JOURAL OF ADVACED REEARC I EIEERI AD TECOLO (IJARET Iteratioal Joural of Advaced Research i Egieerig ad Techology (IJARET I 976 68(Prit I 976 699(Olie Volume Issue 3 April (3 IAEME I 976-68
More informationOptimizing Triangular Parabolic Fuzzy EOQ Model with Shortage Using Nearest Interval Approximation
Iteratioa Joura o Future Revoutio i Computer Sciece & Commuicatio Egieerig ISSN: 454-448 Voume: 3 Issue: 9 96 Optimizig Triaguar Paraboic Fuzzy EO Mode with Shortage Usig Nearest Iterva Approximatio Faritha
More informationCS425 Project Assignment
CS425 Project Assigmet Tue the TPC-H queries i a database of your choice What is TPC-H? Idustry stadard bechmark for testig database performace for compex SQL Decisio support Aaytics Ad-hoc TPC-H specificatio
More informationMessage Integrity and Hash Functions. TELE3119: Week4
Message Itegrity ad Hash Fuctios TELE3119: Week4 Outlie Message Itegrity Hash fuctios ad applicatios Hash Structure Popular Hash fuctios 4-2 Message Itegrity Goal: itegrity (ot secrecy) Allows commuicatig
More informationSPIRAL DSP Transform Compiler:
SPIRAL DSP Trasform Compiler: Applicatio Specific Hardware Sythesis Peter A. Milder (peter.milder@stoybroo.edu) Fraz Frachetti, James C. Hoe, ad Marus Pueschel Departmet of ECE Caregie Mello Uiversity
More informationParallel Polygon Approximation Algorithm Targeted at Reconfigurable Multi-Ring Hardware
Parallel Polygo Approximatio Algorithm Targeted at Recofigurable Multi-Rig Hardware M. Arif Wai* ad Hamid R. Arabia** *Califoria State Uiversity Bakersfield, Califoria, USA **Uiversity of Georgia, Georgia,
More informationInternational Journal of Pure and Applied Sciences and Technology
It J Pure App Sci Techo 6( (0 pp7-79 Iteratioa Joura of Pure ad Appied Scieces ad Techoogy ISS 9-607 Avaiabe oie at wwwijopaasati Research Paper Reatioship Amog the Compact Subspaces of Rea Lie ad their
More informationChapter 10 : Concurrency Control
Chapter 10 : Cocurrecy Cotro modified from: Database System Cocepts, 6 th Ed. Siberschatz, Korth ad Sudarsha See www.db-book.com for coditios o re-use Ituitio of Lock-based Protocos Trasactios have to
More informationCopyright 2016 Ramez Elmasri and Shamkant B. Navathe
Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 18 Strategies for Query Processig Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio DBMS techiques to process a query Scaer idetifies
More informationCounting the Number of Minimum Roman Dominating Functions of a Graph
Coutig the Number of Miimum Roma Domiatig Fuctios of a Graph SHI ZHENG ad KOH KHEE MENG, Natioal Uiversity of Sigapore We provide two algorithms coutig the umber of miimum Roma domiatig fuctios of a graph
More informationANN WHICH COVERS MLP AND RBF
ANN WHICH COVERS MLP AND RBF Josef Boští, Jaromír Kual Faculty of Nuclear Scieces ad Physical Egieerig, CTU i Prague Departmet of Software Egieerig Abstract Two basic types of artificial eural etwors Multi
More informationEE University of Minnesota. Midterm Exam #1. Prof. Matthew O'Keefe TA: Eric Seppanen. Department of Electrical and Computer Engineering
EE 4363 1 Uiversity of Miesota Midterm Exam #1 Prof. Matthew O'Keefe TA: Eric Seppae Departmet of Electrical ad Computer Egieerig Uiversity of Miesota Twi Cities Campus EE 4363 Itroductio to Microprocessors
More informationA Modified Multiband U Shaped and Microcontroller Shaped Fractal Antenna
al Joural o Recet ad Iovatio Treds i Computig ad Commuicatio ISSN: 221-8169 A Modified Multibad U Shaped ad Microcotroller Shaped Fractal Atea Shweta Goyal 1, Yogedra Kumar Katiyar 2 1 M.tech Scholar,
More informationEE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control
EE 459/500 HDL Based Digital Desig with Programmable Logic Lecture 13 Cotrol ad Sequecig: Hardwired ad Microprogrammed Cotrol Refereces: Chapter s 4,5 from textbook Chapter 7 of M.M. Mao ad C.R. Kime,
More informationImproving Information Retrieval System Security via an Optimal Maximal Coding Scheme
Improvig Iformatio Retrieval System Security via a Optimal Maximal Codig Scheme Dogyag Log Departmet of Computer Sciece, City Uiversity of Hog Kog, 8 Tat Chee Aveue Kowloo, Hog Kog SAR, PRC dylog@cs.cityu.edu.hk
More informationCongestion Avoidance Mechanism for Optical Label Switched Networks: A Dynamic QoS-based Approach
Cogestio Avoidace Mechaism for Optica Labe Switched Networks: A Dyamic QoS-based Approach Yassie Khifi, Noureddie Boudriga ad Mohammad S. Obaidat Uiversity of November 7th at Carthage, Tuisia ad Momouth
More informationEE260: Digital Design, Spring /16/18. n Example: m 0 (=x 1 x 2 ) is adjacent to m 1 (=x 1 x 2 ) and m 2 (=x 1 x 2 ) but NOT m 3 (=x 1 x 2 )
EE26: Digital Desig, Sprig 28 3/6/8 EE 26: Itroductio to Digital Desig Combiatioal Datapath Yao Zheg Departmet of Electrical Egieerig Uiversity of Hawaiʻi at Māoa Combiatioal Logic Blocks Multiplexer Ecoders/Decoders
More informationLecture 1: Introduction and Strassen s Algorithm
5-750: Graduate Algorithms Jauary 7, 08 Lecture : Itroductio ad Strasse s Algorithm Lecturer: Gary Miller Scribe: Robert Parker Itroductio Machie models I this class, we will primarily use the Radom Access
More informationCMSC Computer Architecture Lecture 11: More Caches. Prof. Yanjing Li University of Chicago
CMSC 22200 Computer Architecture Lecture 11: More Caches Prof. Yajig Li Uiversity of Chicago Lecture Outlie Caches 2 Review Memory hierarchy Cache basics Locality priciples Spatial ad temporal How to access
More informationComputer Systems - HS
What have we leared so far? Computer Systems High Level ENGG1203 2d Semester, 2017-18 Applicatios Sigals Systems & Cotrol Systems Computer & Embedded Systems Digital Logic Combiatioal Logic Sequetial Logic
More informationA New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method
A ew Morphological 3D Shape Decompositio: Grayscale Iterframe Iterpolatio Method D.. Vizireau Politehica Uiversity Bucharest, Romaia ae@comm.pub.ro R. M. Udrea Politehica Uiversity Bucharest, Romaia mihea@comm.pub.ro
More informationCOMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Single-Cycle Disadvantages & Advantages
COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 4 The Processor Pipeliig Sigle-Cycle Disadvatages & Advatages Clk Uses the clock cycle iefficietly the clock cycle must
More informationAlgorithm Design Techniques. Divide and conquer Problem
Algorithm Desig Techiques Divide ad coquer Problem Divide ad Coquer Algorithms Divide ad Coquer algorithm desig works o the priciple of dividig the give problem ito smaller sub problems which are similar
More informationComputational Geometry
Computatioal Geometry Chapter 4 Liear programmig Duality Smallest eclosig disk O the Ageda Liear Programmig Slides courtesy of Craig Gotsma 4. 4. Liear Programmig - Example Defie: (amout amout cosumed
More informationData Structures and Algorithms Part 1.4
1 Data Structures ad Algorithms Part 1.4 Werer Nutt 2 DSA, Part 1: Itroductio, syllabus, orgaisatio Algorithms Recursio (priciple, trace, factorial, Fiboacci) Sortig (bubble, isertio, selectio) 3 Sortig
More informationA Study on the Performance of Cholesky-Factorization using MPI
A Study o the Performace of Cholesky-Factorizatio usig MPI Ha S. Kim Scott B. Bade Departmet of Computer Sciece ad Egieerig Uiversity of Califoria Sa Diego {hskim, bade}@cs.ucsd.edu Abstract Cholesky-factorizatio
More informationMultiprocessors. HPC Prof. Robert van Engelen
Multiprocessors Prof. Robert va Egele Overview The PMS model Shared memory multiprocessors Basic shared memory systems SMP, Multicore, ad COMA Distributed memory multicomputers MPP systems Network topologies
More informationMaster Informatics Eng. 2017/18. A.J.Proença. Memory Hierarchy. (most slides are borrowed) AJProença, Advanced Architectures, MiEI, UMinho, 2017/18 1
Advaced Architectures Master Iformatics Eg. 2017/18 A.J.Proeça Memory Hierarchy (most slides are borrowed) AJProeça, Advaced Architectures, MiEI, UMiho, 2017/18 1 Itroductio Programmers wat ulimited amouts
More informationLecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming
Lecture Notes 6 Itroductio to algorithm aalysis CSS 501 Data Structures ad Object-Orieted Programmig Readig for this lecture: Carrao, Chapter 10 To be covered i this lecture: Itroductio to algorithm aalysis
More informationUNIVERSITY OF MORATUWA
UNIVERSITY OF MORATUWA FACULTY OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING B.Sc. Egieerig 2014 Itake Semester 2 Examiatio CS2052 COMPUTER ARCHITECTURE Time allowed: 2 Hours Jauary 2016
More informationComputing a k-sparse n-length Discrete Fourier Transform using at most 4k samples and O(k log k) complexity
2013 IEEE Iteratioal Symposium o Iformatio Theory Computig a k-sparse -legth Discrete Fourier Trasform usig at most 4k samples ad O(k log k) complexity Sameer Pawar ad Kaa Ramchadra Dept of Electrical
More informationChapter 5: CPU Scheduling
Chapter 5: CPU Scheduig Chapter 5: CPU Scheduig Basic Cocepts Scheduig Criteria Scheduig Agorithms Mutipe-Processor Scheduig Rea-Time Scheduig Thread Scheduig Operatig Systems Exampes Java Thread Scheduig
More informationLecture 5. Counting Sort / Radix Sort
Lecture 5. Coutig Sort / Radix Sort T. H. Corme, C. E. Leiserso ad R. L. Rivest Itroductio to Algorithms, 3rd Editio, MIT Press, 2009 Sugkyukwa Uiversity Hyuseug Choo choo@skku.edu Copyright 2000-2018
More informationCS473-Algorithms I. Lecture 2. Asymptotic Notation. CS 473 Lecture 2 1
CS473-Algorithms I Lecture Asymptotic Notatio CS 473 Lecture 1 O-otatio (upper bouds) f() = O(g()) if positive costats c, 0 such that e.g., = O( 3 ) 0 f() cg(), 0 c 3 c c = 1 & 0 = or c = & 0 = 1 Asymptotic
More informationMulti-response optimization in drilling using Taguchi s quality loss function
Idia Joura of Egieerig & Materias Scieces Vo. 3, December 006, pp. 484-488 Muti-respose optimizatio i driig usig aguchi s quaity oss fuctio V N Gaitode a*, S R Karik b, B Achyutha c & B Siddeswarappa d
More informationLecturers: Sanjam Garg and Prasad Raghavendra Feb 21, Midterm 1 Solutions
U.C. Berkeley CS170 : Algorithms Midterm 1 Solutios Lecturers: Sajam Garg ad Prasad Raghavedra Feb 1, 017 Midterm 1 Solutios 1. (4 poits) For the directed graph below, fid all the strogly coected compoets
More informationAutomatic Generation of Polynomial-Basis Multipliers in GF (2 n ) using Recursive VHDL
Automatic Geeratio of Polyomial-Basis Multipliers i GF (2 ) usig Recursive VHDL J. Nelso, G. Lai, A. Teca Abstract Multiplicatio i GF (2 ) is very commoly used i the fields of cryptography ad error correctig
More informationEnhancing Efficiency of Software Fault Tolerance Techniques in Satellite Motion System
Joural of Iformatio Systems ad Telecommuicatio, Vol. 2, No. 3, July-September 2014 173 Ehacig Efficiecy of Software Fault Tolerace Techiques i Satellite Motio System Hoda Baki Departmet of Electrical ad
More informationRedundancy Allocation for Series Parallel Systems with Multiple Constraints and Sensitivity Analysis
IOSR Joural of Egieerig Redudacy Allocatio for Series Parallel Systems with Multiple Costraits ad Sesitivity Aalysis S. V. Suresh Babu, D.Maheswar 2, G. Ragaath 3 Y.Viaya Kumar d G.Sakaraiah e (Mechaical
More informationMOST of the advanced signal processing algorithms are
A edited versio of this work was publiched i IEEE TRANS. ON CIRCUITS AND SYSTEMS II, VOL. 6, NO. 9, SEPT 05 DOI:0.09/TCSII.05.35753 High-Throughput FPGA Implemetatio of QR Decompositio Sergio D. Muñoz
More informationCIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19
CIS Data Structures ad Algorithms with Java Sprig 09 Stacks, Queues, ad Heaps Moday, February 8 / Tuesday, February 9 Stacks ad Queues Recall the stack ad queue ADTs (abstract data types from lecture.
More informationCOMP Parallel Computing. PRAM (1): The PRAM model and complexity measures
COMP 633 - Parallel Computig Lecture 2 August 24, 2017 : The PRAM model ad complexity measures 1 First class summary This course is about parallel computig to achieve high-er performace o idividual problems
More informationFilter design. 1 Design considerations: a framework. 2 Finite impulse response (FIR) filter design
Filter desig Desig cosideratios: a framework C ı p ı p H(f) Aalysis of fiite wordlegth effects: I practice oe should check that the quatisatio used i the implemetatio does ot degrade the performace of
More informationCourse Site: Copyright 2012, Elsevier Inc. All rights reserved.
Course Site: http://cc.sjtu.edu.c/g2s/site/aca.html 1 Computer Architecture A Quatitative Approach, Fifth Editio Chapter 2 Memory Hierarchy Desig 2 Outlie Memory Hierarchy Cache Desig Basic Cache Optimizatios
More informationRecursion. Recursion. Mathematical induction: example. Recursion. The sum of the first n odd numbers is n 2 : Informal proof: Principle:
Recursio Recursio Jordi Cortadella Departmet of Computer Sciece Priciple: Reduce a complex problem ito a simpler istace of the same problem Recursio Itroductio to Programmig Dept. CS, UPC 2 Mathematical
More informationMulti-Threading. Hyper-, Multi-, and Simultaneous Thread Execution
Multi-Threadig Hyper-, Multi-, ad Simultaeous Thread Executio 1 Performace To Date Icreasig processor performace Pipeliig. Brach predictio. Super-scalar executio. Out-of-order executio. Caches. Hyper-Threadig
More informationAPPLICATION NOTE PACE1750AE BUILT-IN FUNCTIONS
APPLICATION NOTE PACE175AE BUILT-IN UNCTIONS About This Note This applicatio brief is iteded to explai ad demostrate the use of the special fuctios that are built ito the PACE175AE processor. These powerful
More informationComputer Graphics Hardware An Overview
Computer Graphics Hardware A Overview Graphics System Moitor Iput devices CPU/Memory GPU Raster Graphics System Raster: A array of picture elemets Based o raster-sca TV techology The scree (ad a picture)
More informationModified Self-Organizing Map for Optical Flow Clustering System
Proceedigs of the 7th WSEAS Iteratioa Coferece o Siga, Speech ad Image Processig, Beijig, Chia, September -7, 7 6 Modified Sef-Orgaizig Map for Optica Fow Custerig System B. DOUNGCHATOM, P. KUMSAWAT, K.
More informationAnalysis of Radix- SDF Pipeline FFT Architecture in VLSI Using Chip Scope
Analysis of Radix- SDF Pipeline FFT Architecture in VLSI Using Chip Scope G. Mohana Durga 1, D.V.R. Mohan 2 1 M.Tech Student, 2 Professor, Department of ECE, SRKR Engineering College, Bhimavaram, Andhra
More informationk (check node degree) and j (variable node degree)
A Parallel Turbo Decodig Message Passig Architecture for Array LDPC Codes Kira Guam, Pakaj Bhagawat, Weihuag Wag, Gwa Choi, Mark Yeary * Dept. of Electrical Egieerig, Texas A&M Uiversity, College Statio,
More informationImproved Random Graph Isomorphism
Improved Radom Graph Isomorphism Tomek Czajka Gopal Paduraga Abstract Caoical labelig of a graph cosists of assigig a uique label to each vertex such that the labels are ivariat uder isomorphism. Such
More informationConvolutional Neural Network Architectures for Signals Supported on Graphs
IEEE TRANSACTIONS ON SIGNAL PROCESSING (ACCEPTED) 1 Covoutioa Neura Network Architectures for Sigas Supported o Graphs Ferado Gama, Atoio G. Marques, Geert Leus, ad Aejadro Ribeiro arxiv:1805.00165v2 eess.sp
More informationLecture 6. Lecturer: Ronitt Rubinfeld Scribes: Chen Ziv, Eliav Buchnik, Ophir Arie, Jonathan Gradstein
068.670 Subliear Time Algorithms November, 0 Lecture 6 Lecturer: Roitt Rubifeld Scribes: Che Ziv, Eliav Buchik, Ophir Arie, Joatha Gradstei Lesso overview. Usig the oracle reductio framework for approximatig
More informationOnes Assignment Method for Solving Traveling Salesman Problem
Joural of mathematics ad computer sciece 0 (0), 58-65 Oes Assigmet Method for Solvig Travelig Salesma Problem Hadi Basirzadeh Departmet of Mathematics, Shahid Chamra Uiversity, Ahvaz, Ira Article history:
More informationOperating System Concepts. Operating System Concepts
Chapter 4: Mass-Storage Systems Logical Disk Structure Logical Disk Structure Disk Schedulig Disk Maagemet RAID Structure Disk drives are addressed as large -dimesioal arrays of logical blocks, where the
More informationCOARSE ANGLE ROTATION MODE CORDIC BASED SINGLE PROCESSING ELEMENT QR-RLS PROCESSOR
7th Europea Sigal Processig Coferece (EUSIPCO 9 Glasgow, Scotlad, August 4-8, 9 COARSE ANGLE ROTATION MODE CORDIC BASED SINGLE PROCESSING ELEMENT QR-RLS PROCESSOR Qiag Gao, Louise Crockett ad Robert Stewart
More informationEfficient Hough transform on the FPGA using DSP slices and block RAMs
Efficiet Hough trasform o the FPGA usig DSP slices ad block RAMs Xi Zhou, Norihiro Tomagou, Yasuaki Ito, ad Koji Nakao Departmet of Iformatio Egieerig Hiroshima Uiversity Kagamiyama 1-4-1, Higashi Hiroshima,
More informationBig-O Analysis. Asymptotics
Big-O Aalysis 1 Defiitio: Suppose that f() ad g() are oegative fuctios of. The we say that f() is O(g()) provided that there are costats C > 0 ad N > 0 such that for all > N, f() Cg(). Big-O expresses
More informationT Shaped Fractal Geomerty Based Micro Strip Patch Antenna
ISSN 2395-1621 T Shaped Fractal Geomerty Based Micro Strip Patch Atea #1 Dr. Swapil Lahudkar, #2 Satosh Sigh, #3 Akit Yadav, #4 Pooja Kad 2 sk.sigh260@gmail.com #1 Prof. Departmet of Electroics ad Telecommuicatio
More informationLINPACK Benchmark Optimizations on a Virtual Processor Grid
LINPACK Bechmark Optimizatios o a Virtual Processor Grid Edward Aderso, Mayard Bradt ad Chao Yag, Cray Ic. ABSTRACT: The "massively parallel" LINPACK bechmark is a familiar measure of highperformace computig
More informationK-NET bus. When several turrets are connected to the K-Bus, the structure of the system is as showns
K-NET bus The K-Net bus is based o the SPI bus but it allows to addressig may differet turrets like the I 2 C bus. The K-Net is 6 a wires bus (4 for SPI wires ad 2 additioal wires for request ad ackowledge
More informationA New Approach To Scheduling Parallel Programs Using Task Duplication
A New Approach To Schedulig Parallel Programs Usig Task Duplicatio Ishfaq Ahmad ad Yu-Kwog Kwok Departmet of Computer Sciece Hog Kog Uiversity of Sciece ad Techology, Clear Water Bay, Kowloo, Hog Kog Abstract
More informationOLYMPIAD 2 MOCK TEST
More Practice - More Score This readies the studets for cofidety tackig differet types of questios ad prepares them adequatey for ay Regioa / Natioa / Iteratioa Oympiad. The saiet features of this series
More informationAppendix D. Controller Implementation
COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Appedix D Cotroller Implemetatio Cotroller Implemetatios Combiatioal logic (sigle-cycle); Fiite state machie (multi-cycle, pipelied);
More informationState-space feedback 6 challenges of pole placement
State-space feedbac 6 challeges of pole placemet J Rossiter Itroductio The earlier videos itroduced the cocept of state feedbac ad demostrated that it moves the poles. x u x Kx Bu It was show that whe
More informationStructuring Redundancy for Fault Tolerance. CSE 598D: Fault Tolerant Software
Structurig Redudacy for Fault Tolerace CSE 598D: Fault Tolerat Software What do we wat to achieve? Versios Damage Assessmet Versio 1 Error Detectio Iputs Versio 2 Voter Outputs State Restoratio Cotiued
More informationLecture 13: Validation
Lecture 3: Validatio Resampli methods Holdout Cross Validatio Radom Subsampli -Fold Cross-Validatio Leave-oe-out The Bootstrap Bias ad variace estimatio Three-way data partitioi Itroductio to Patter Recoitio
More informationThe Magma Database file formats
The Magma Database file formats Adrew Gaylard, Bret Pikey, ad Mart-Mari Breedt Johaesburg, South Africa 15th May 2006 1 Summary Magma is a ope-source object database created by Chris Muller, of Kasas City,
More informationEfficient Hardware Design for Implementation of Matrix Multiplication by using PPI-SO
Efficiet Hardware Desig for Implemetatio of Matrix Multiplicatio by usig PPI-SO Shivagi Tiwari, Niti Meea Dept. of EC, IES College of Techology, Bhopal, Idia Assistat Professor, Dept. of EC, IES College
More informationA Selected Primer on Computer Vision: Geometric and Photometric Stereo & Structured Light
A Seected Primer o Computer Visio: Geometric ad Photometric Stereo & Structured Light CS334 Sprig 2012 Daie G. Aiaga Departmet of Computer Sciece Purdue Uiversit Defiitios Camera geometr (=motio) Give
More informationFREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS
FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS Prosejit Bose Evagelos Kraakis Pat Mori Yihui Tag School of Computer Sciece, Carleto Uiversity {jit,kraakis,mori,y
More informationWEBSITE STRUCTURE IMPROVEMENT USING ANT COLONY TECHNIQUE
WEBSITE STRUCTURE IMPROVEMENT USING ANT COLONY TECHNIQUE Wiwik Aggraei 1, Agyl Ardi Rahmadi 1, Radityo Prasetyo Wibowo 1 1 Iformatio System Departmet, Faculty of Iformatio Techology, Istitut Tekologi Sepuluh
More informationLoad balanced Parallel Prime Number Generator with Sieve of Eratosthenes on Cluster Computers *
Load balaced Parallel Prime umber Geerator with Sieve of Eratosthees o luster omputers * Soowook Hwag*, Kyusik hug**, ad Dogseug Kim* *Departmet of Electrical Egieerig Korea Uiversity Seoul, -, Rep. of
More informationSERIAL COMMUNICATION INTERFACE FOR ESA ESTRO
Bulleti E708 rev0 7/06/0 SERIAL COMMUNICATION INTERFACE FOR - SERIES FEATURES Supply voltage: 90 40vac Supply frequecy: 40 70 Hz Max. absorbtio: 40W Operatig temperature: 0 50 C Storage temperature: -0
More information