APPLICATION SPECIFIC PROCESSORS
|
|
- Arthur Dalton
- 6 years ago
- Views:
Transcription
1 APPLICATION SPECIFIC PROCESSORS
2 THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE VLSI, COMPUTER ARCHITECTURE AND DIGITAL SIGNAL PROCESSING Consulting Editor Jonathan Allen Other books in the series: QUICK-TURNAROUND ASIC DESIGN IN VHDL: Core-Based Behavioral Synthesis M.S. Romdhane, V.K. Madisetti, J.W. Hines ISBN: ADVANCED CONCEPTS IN ADAPTIVE SIGNAL PROCESSING W. Kenneth Jenkins, Andrew W. Hull, Jeffrey C. Strait ISBN: SOFTWARE SYNTIIESIS FROM DATAFLOW GRAPHS Shuvra S. Bhattacharyya, Praveen K. Murthy, Edward A. Lee ISBN: AUTOMATIC SPEECH AND SPEAKER RECOGNITION: Advanced Topics, Chin-Hui Lee, Kuldip K. Paliwal ISBN: BINARY DECISION DIAGRAMS AND APPLICATIONS FOR VLSI CAD, Shin-ichi Minato ISBN: ROBUSTNESS IN AUTOMATIC SPEECH RECOGNITION, Jean-Claude Junqua, Jean-Paul Haton ISBN: HIGH-PERFORMANCE DIGITAL VLSI CIRCUIT DESIGN, Richard X. Gu, Khaled M. Sharaf, Mohamed I. Elmasry ISBN: LOW POWER DESIGN METHODOLOGIES, Jan M. Rabaey, Massoud Pedram ISBN: MODERN METHODS OF SPEECH PROCESSING, Ravi P. Ramachandran ISBN: LOGIC SYNTHESIS FOR FIELD-PROGRAMMABLE GATE ARRAYS, Rajeev Murgai, Robert K. Brayton ISBN: CODE GENERATION FOR EMBEDDED PROCESSORS, P. Marwedel, G. Goossens ISBN: DIGITAL TIMING MACROMODELING FOR VLSI DESIGN VERIFICATION, Jeong Taek Kong, David Overhauser ISBN: DIGIT-SERIAL COMPUTATION, Richard Hartley, Keshab K. Parhi ISBN: FORMAL SEMANTICS FOR VHDL, Carlos Delgado Kloos, Peter T. Breuer ISBN: ON OPTIMAL INTERCONNECTIONS FOR VLSI, Andrew B. Kahng, Gabriel Robins ISBN:
3 APPLICATION SPECIFIC PROCESSORS Edited by Earl E. Swartzlander, Jr. University of Texas at Austin KLUWER ACADEMIC PUBLISHERS Boston / Dordrecht / London
4 Distributors for North America: Kluwer Academic Publishers 101 Philip Drive Assinippi Park Norwell, Massachusetts USA Distributors for all other countries: Kluwer Academic Publishers Group Distribution Centre Post Office Box AH Dordrecht, THE NETHERLANDS Library of Congress Cataloging-in-Publication Data A C.I.P. Catalogue record for this book is available from the Library of Congress. ISBN-13: DOl: / e-isbn-13: Copyright «:l 1997 by Kluwer Academic Publishers Softcover reprint of the hardcover 1st edition 1997 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means, mechanical, photo-copying, recording, or otherwise, without the prior written permission of the publisher, Kluwer Academic Publishers, 101 Philip Drive, Assinippi Park, Norwell, Massachusetts Printed on acid-free paper.
5 TABLE OF CONTENTS Preface... i x 1. Variable Precision, Interval Arithmetic Processors... 1 Michael J. Schulte 1.1 Introduction Variable-Precision, Interval Arithmetic Previous Research Processor Implementation Area, Delay and Execution Time Estimates Variable-Precision, Interval Arithmetic Algorithms Conclusions Modeling the Power Consumption of CMOS Arithmetic Elernents Thomas K. Callaway 2.1 Introduction Previous Research Parallel Adders Parallel Multipliers Conclusions Fault Tolerant Arithmetic Yuang-Ming Hsu 3.1 Introduction Previous Research The Time Shared TMR Technique VLSI Designs and Performance Evaluations Conclusions... 85
6 vi Table of Contents 4. Low Power Digital Multipliers Edwin de Angel 4.1 Introduction Related Research Digital Multipliers CMOS Multipliers Combinational Self-Timed Multipliers with Bypassing Logic Results s. A Unified View of CORDIC Processor Design Shaoyun Wang and Vincenzo Piuri 5.1 Introduction The CORDIC Algorithm Combined Architectures Pipelined Architectures Architectural Evaluation Design Guidelines and Conclusions Multidimensional Systolic Arrays for Computing Discrete Fourier Transforms and Discrete Cosine Transforms Hyesook Lim 6.1 Introduction Multidimensional OFf and ocr by Multidimensional Systolic Array Fast Fourier Transform Computation by Multidimensional Systolic Array Prime-Factor Decomposed Computation by Multidimensional Systolic Array Conclusions
7 Application Specific Processors vii 7. Parallel Implementation of a Fast Third-Order Volterra Filtering Algorithm Hercule Kwan 7.1 Introduction Volterra Filtering in the Time and Frequency Domain Parallel Implementation on DSPS Performance Evaluation Applications to Nonlinear Communication Channels Future Research Design and Implementation of an Interface Control Unit for Rapid Prototyping Mohammad S. Khan 8.1 Introduction Related Work Interface Control Unit ICU Protocol Hardware Design of the ICU Conclusions Index... 2S 1
8 PREFACE Application specific processors are not a new idea. For example, difference engines such as those of Muller, Babbage and the Scheutzs are excellent examples of very early digital application specific processors. These machines were designed for the efficient production of numerical tables. These are stand alone application specific processors. Such processors are very different from geneml purpose computers. General purpose computers sacrifice performance in order to achieve flexibility and genemlity. In contrast, application specific processors are optimized for their intended application, often achieving orders of magnitude improvement in performance. Two decades ago, the minicomputer gained wide acceptance by providing economical computing with adequate performance for a wide variety of applications. Because the hardware and software programming environment design costs can be amortized over a large production run, they are reduced quite dramatically. Of course, minicomputer based systems require the development of custom software which is often as expensive as custom hardware. With the development of highly automated Computer Aided Design systems for the design of VLSI circuits, the design of Application Specific Integrated Circuits (ASIC) has become an attractive way to achieve performance approaching that of custom cmfted custom VLSI with very modest cost. It seems clear that the high level aspects of VLSI CAD technology can be applied (perhaps with higher level "system" extensions) to create "system compilers" that will greatly simplify the design and development of application specific processors. This paradigm drastically reduces the risk and cost to create application specific processors. APPLICATION SPECIFIC PROCESSORS From the earliest times, most high performance signal processors have been realized with application specific processors. The explanation is that application specific processors can be tailored to exactly match the (usually very demanding) application requirements. The result is that no "processing power" is wasted for ix
9 x Preface unnecessary capabilities and maximum perfonnance is achieved. A disadvantage is that such processors have been expensive to design since each is a unique design that is customized to tlle specific application. In the last decade, computer aided design systems have been developed to facilitate the development of application specific integrated circuits. The success of such ASIC CAD systems suggests that it should be possible to streamline the process of application specific processor design. Based on experience is developing VLSI chips and signal processing systems, I believe there are three rules that should guide the development of application specific processors: (1) use only as much arithmetic as necessary, (2) use data interconnections that match tlle algoritlun and (3) use programmability sparingly. The first rule is to use only as much arithmetic as necessary. In earlier application specific processors, the use of minimal aritlunetic was absolutely crucial as large wordsize floating point aritlunetic was prohibitively complex. The advent of VLSI has relaxed this constraint somewhat, but it remains obvious that fixed point arithmetic with small wordsize should be used if possible. The penalty of floating point arithmetic is particularly significant for addition where floating point requires an initial alignment and a final normalization in addition to tlle basic addition operation. The second rule is to use data interconnections that match the algoritllm. In a general purpose computer, such as tlle von Neumann machine shown on Figure I, a large memory holds most of the data and serves as the interconnection media for the succession of aritlunetic operations tllat are performed on data. This is an inefficient process. For example, contrast the effort required to perform an operation on two numbers using a special purpose implementation Witll the effort required with a general purpose processor. For tlle special purpose implementation, data are latched into tlle input registers, tlle result is computed and then latched into the output register. With the general purpose processor at least four steps are required (load first data, load second data, perform tlle operation, and store the result). The special purpose solution avoids tlle need to generate tllfee addresses and to read two data from the memory and write two data to the memory. Instead of passing data through a memory, application specific processors connect from one processor to tlle next as required for tlle algorithm. In cases where data is not in the correct order, small multi-port memories or shift register queues can be used to provide the necessary reordering. The final rule is to use programmability sparingly. It often may seem attractive to use a fast programmable processor or a network of programmable processors to provide the necessary computational capability. Upon close examination, we
10 Earl Swartzlander xi generally discover that this is extremely inefficient. The parts of an algorithm that are stable and that involve fixed processing sequences could have been implemented much more efficiently with custom (non programmable) processors. The remaining portion of the algorithm which requires the flexibility of a programmable processor can be executed at much lower speeds. Thus it is attractive to use an application specific combination of fixed processors with a programmable processor. For this hybrid combination, the fixed processor provides the computational "horsepower" while the programmable processor provides the "steering." DISCUSSION When application specific processors offer such attractive performance, why does anyone ever use a general purpose processor? There are two reasons: (1) general purpose processor hardware and supporting software programming environment development costs can be amortized over a potentially long production run which reduces the cost to any individual user and (2) the risk of fundamental defects in the design is eliminated (at least for all users after the first!) Of course, the second point indicates that only the first user gets state of the art performance. The problem remains that general purpose processors generally offer inadequate throughput for many "interesting" problems. The "general purpose" supercomputers that do achieve high throughputs are not true general purpose machines and are extremely expensive (in cost, power consumption, size, etc.) relative to their performance. The software development process introduces an additional cost factor I1mt is often overlooked in comparisons of application specific versus genera) purpose processors. One solution to U1C low throughput attainable Witll a general purpose processor is to use a large number of them. Connecting a general purpose host processor to an array of processing elements provides reasonably high throughput for many problems. This "semi-application specific" approach has an advantage in that the existing host computer programming environment can be used, which simplifies tlle programming task. The host processor communicates Witll an interface that provides data buffering and control to provide data (via a communication network) to the processor array and to capture data from it. A wide variety of regular geometries have been used as required for specific classes of applications. The processors used in the arrays range from Transputers (advanced computers with flexible communication interfaces) to specialized single bit processing elements. The more advanced processors like the Transputer offer a well developed programming environment which facilitates tlle development of programs for the processing elemems.
11 xii Preface Problems with the semi-application specific array processors include: (1) the difficulty of writing software for an array of processors that interact with each other, (2) the inefficiency of parallel processing, (3) the hardware complexity of the data communication network, and (4) the high complexity of coherently sharing global data amongst a multiplicity of processing elements. It may be noted tlmt most parallel processing systems do not achieve a speed-up commensurate with the number of processors. In the future, the remarkable advances in productivity achieved for VLSI circuits with advanced CAD and silicon compilers will be extended to application specific computing systems. Such syntllesis or "system compilation" will significantly automate the design process for application specific processors. Specifically the system compiler will handle the design of networks of processors where both the tailored network and tlle processor design are optimized for the specific application. A hierarchy of simulators will be used to verify the performance and to confirm correctness of the highly interactive software. Is this chip level CAD, system compilation or both? OUTLINE OF THE BOOK This book consists of eight chapters which provide a mixture of techniques and examples that relate to application specific processing. The inclusion of techniques is expected to suggest additional research and to assist tllose who are faced with the requirement to implement efficient application specific processors. The examples illustrate the application of the concepts and demonstrate tlle efficiency tllat can be achieved via application specific processors. The chapters were written by members and fonner members of tlle application specific processing group at tlle University of Texas at Austin. The first five chapters relate to specialized aritlunetic which often is the key to achieving high perfonnance in application specific processors. The next two chapters focus on signal processing systems, and the final chapter examines the interconnection of possibly disparate elements to create systems. The first chapter, "Variable-Precision, Interval Aritllmetic Processors" is by Michael J. Schulte. This chapter presents tlle design of a processor that efficiently implements interval arithmetic. Here data values are represented by tlle endpoints of intervals that contain tlle coitect value. As data is processed, the intervals lengthen until they are so wide tllat no information is provided about tlle data. Thus part of the attraction of interval arithmetic is that it provides a built in
12 Earl Swartzlander xiii accuracy monitor for all data. Until this work interval arithmetic has been orders of magnitude slower than conventional arithmetic, so it has not been widely used. The second chapter is "Modeling the Power Consumption of CMOS Arithmetic Elements" by Thomas K. Callaway. It compares a number of standard adder and multiplier circuits in terms of their area, delay and the average number of gate transitions that each requires per arithmetic operation. For static CMOS circuits the power consumption is approximately proportional to the number of gate transitions. Thus the average number of gate transitions gives a good approximate estimate of the power consumption. Next is "Fault Tolerant Arithmetic" by Yuang-Ming Hsu. This chapter surveys techniques that facilitate tlle construction of fault tolerant arithmetic processors. Techniques in tlle areas of hardware redundancy, information redundancy and time redundancy are all considered. The time redundant techniques offer attractive performance with a modest complexity overhead in comparison to standard arithmetic. Chapter 4 is "Low Power Digital Multipliers" by Edwin de Angel. It presents several techniques at the algorithm and circuit level that can be employed individually or in combination to substantially reduce the power required by an array implementation of a radix-4 Booth multiplier. Most of the techniques have the effect of increasing the speed. The fifth chapter is "A Unified View of CORDIC Processor Design" by Shaoyun Wang and Vincenzo Piuri. This chapter examines the well known CORDIC algorithm for evaluating sines, cosines and other trigonometric functions. It shows that many of the iterations can be performed in parallel. The next chapter is "Multidimensional Systolic Anays for Computing Discrete Fourier Transforms and Discrete Cosine Transforms" by Hyesook Lim. This work concerns the combination of two semi-systolic arrays to produce a systolic system. This work is a good example of situations where humans have tlle advantage over even the best CAD systems. The fundamental idea presented here is the result of insight that is not likely to be automated in the foreseeable future. The seventh chapter is "Parallel Implementation of a Fast Third-Order Volterra Filtering Algoritllm" by Hercule Kwan. Volterra filters are of great theoretical value in analyzing phenomena is diverse fields ranging from ocean wave shapes and multipatll distortion. The computational loads are exu emely high suggesting the need to consider application specific processing implementations. This chapter reports on some of tlle early work in developing a multiprocessor implementation of Volterra filters.
13 xiv Preface The final chapter is "Design and Implementation of an Interface Control Unit for Rapid Prototyping" by Mohammad S. Khan. This work addresses the need to interconnect multiple processors to implement a system. The processors can all be of the same type (homogeneous) or can be a variety of specialized processors (heterogeneous). Having a generic processor interface is expected to greatly simplify the development of large application specific processing systems. Earl E. Swartzlander, Jr., Austin, Texas
14 APPLICATION SPECIFIC PROCESSORS
TIME-CONSTRAINED TRANSACTION MANAGEMENT. Real-Time Constraints in Database Transaction Systems
TIME-CONSTRAINED TRANSACTION MANAGEMENT Real-Time Constraints in Database Transaction Systems The Kluwer International Series on ADV ANCES IN DATABASE SYSTEMS Other books in the Series: Series Editor Ahmed
More informationINFORMATION RETRIEVAL SYSTEMS: Theory and Implementation
INFORMATION RETRIEVAL SYSTEMS: Theory and Implementation THE KLUWER INTERNATIONAL SERIES ON INFORMATION RETRIEVAL Series Editor W. Bruce Croft University of Massachusetts Amherst, MA 01003 Also in the
More informationFINITE FIELDS FOR COMPUTER SCIENTISTS AND ENGINEERS
FINITE FIELDS FOR COMPUTER SCIENTISTS AND ENGINEERS THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE INFORMATION THEORY Consulting Editor Robert G. Gallager FINITE FIELDS FOR COMPUTER
More informationHIGH-LEVEL SYNTHESIS FOR REAL-TIME DIGITAL SIGNAL PROCESSING
HIGH-LEVEL SYNTHESIS FOR REAL-TIME DIGITAL SIGNAL PROCESSING THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE VLSI, COMPUTER ARCHITECfURE AND DIGITAL SIGNAL PROCESSING Latest Titles
More informationThe VHDL Handbook. David R. Coelho Vantage Analysis Systems, Inc. Kluwer Academic Publishers. KALA llrporation
The VHDL Handbook The VHDL Handbook by David R. Coelho Vantage Analysis Systems, Inc. ~. " Kluwer Academic Publishers KALA llrporation Distributors for North America: Kluwer Academic Publishers 101 Philip
More informationPARALLEL ARCHITECTURES AND PARALLEL ALGORITHMS FOR INTEGRATED VISION SYSTEMS
PARALLEL ARCHITECTURES AND PARALLEL ALGORITHMS FOR INTEGRATED VISION SYSTEMS THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE ROBOTICS: VISION, MANIPULATION AND SENSORS Consulting Editor:
More informationTopological Structure and Analysis of Interconnection Networks
Topological Structure and Analysis of Interconnection Networks Network Theory and Applications Volume 7 Managing Editors: Ding-Zhu Du, University of Minnesota, U.S.A. and Cauligi Raghavendra, University
More informationMULTIMEDIA TOOLS AND APPLICATIONS
MULTIMEDIA TOOLS AND APPLICATIONS THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE MULTIMEDIA SYSTEMS AND APPLICATIONS Recently Published Titles: Consulting Editor Borko Furht Florida
More informationClustering and Information Retrieval
Clustering and Information Retrieval Network Theory and Applications Volume 11 Managing Editors: Ding-ZhuDu University o/minnesota, U.S.A. Cauligi Raghavendra University 0/ Southern Califorina, U.S.A.
More informationDUE to the high computational complexity and real-time
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 3, MARCH 2005 445 A Memory-Efficient Realization of Cyclic Convolution and Its Application to Discrete Cosine Transform Hun-Chen
More informationController Synthesis for Hardware Accelerator Design
ler Synthesis for Hardware Accelerator Design Jiang, Hongtu; Öwall, Viktor 2002 Link to publication Citation for published version (APA): Jiang, H., & Öwall, V. (2002). ler Synthesis for Hardware Accelerator
More informationFuzzy Modeling for Control.,,i.
Fuzzy Modeling for Control,,i. INTERNATIONAL SERIES IN INTELLIGENT TECHNOLOGIES Prof. Dr. Dr. h.c. Hans-Jiirgen Zimmermann, Editor European Laboratory for Intelligent Techniques Engineering Aachen, Germany
More informationFPGA IMPLEMENTATION OF CORDIC ALGORITHM ARCHITECTURE
FPGA IMPLEMENTATION OF CORDIC ALGORITHM ARCHITECTURE Ramanpreet Kaur 1, Parminder Singh Jassal 2 Department of Electronics And Communication Engineering, Punjabi University Yadavindra College of Engineering,
More informationFundamentals of Operating Systems. Fifth Edition
Fundamentals of Operating Systems Fifth Edition Fundamentals of Operating Systems A.M. Lister University of Queensland R. D. Eager University of Kent at Canterbury Fifth Edition Springer Science+Business
More informationAlgorithms and Parallel Computing
Algorithms and Parallel Computing Algorithms and Parallel Computing Fayez Gebali University of Victoria, Victoria, BC A John Wiley & Sons, Inc., Publication Copyright 2011 by John Wiley & Sons, Inc. All
More informationTHE VERILOG? HARDWARE DESCRIPTION LANGUAGE
THE VERILOG? HARDWARE DESCRIPTION LANGUAGE THE VERILOGf HARDWARE DESCRIPTION LANGUAGE by Donald E. Thomas Carnegie Mellon University and Philip R. Moorby Cadence Design Systems, Inc. SPRINGER SCIENCE+BUSINESS
More informationFUZZY DATABASES Principles and Applications
FUZZY DATABASES Principles and Applications INTERNATIONAL SERIES IN INTELLIGENT TECHNOLOGIES Prof. Dr. Dr. h.c. Hans-Jiirgen Zimmermann, Editor European Laboratory for Intelligent Techniques Engineering
More informationUML for SOC Design GRANT MARTIN WOLFGANG MÜLLER. Edited by. Tensilica Inc., Santa Clara, CA, USA. and. University of Paderborn, Germany
UML FOR SOC DESIGN UML for SOC Design Edited by GRANT MARTIN Tensilica Inc., Santa Clara, CA, USA and WOLFGANG MÜLLER University of Paderborn, Germany A C.I.P. Catalogue record for this book is available
More informationPhilip Andrew Simpson. FPGA Design. Best Practices for Team-based Reuse. Second Edition
FPGA Design Philip Andrew Simpson FPGA Design Best Practices for Team-based Reuse Second Edition Philip Andrew Simpson San Jose, CA, USA ISBN 978-3-319-17923-0 DOI 10.1007/978-3-319-17924-7 ISBN 978-3-319-17924-7
More informationHardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University
Hardware Design Environments Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University Outline Welcome to COE 405 Digital System Design Design Domains and Levels of Abstractions Synthesis
More informationPerformance Analysis of CORDIC Architectures Targeted by FPGA Devices
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Performance Analysis of CORDIC Architectures Targeted by FPGA Devices Guddeti Nagarjuna Reddy 1, R.Jayalakshmi 2, Dr.K.Umapathy
More informationARCHITECTURE AND CAD FOR DEEP-SUBMICRON FPGAs
ARCHITECTURE AND CAD FOR DEEP-SUBMICRON FPGAs THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE ARCHITECTURE AND CAD FOR DEEP-SUBMICRON FPGAs Vaughn Betz Jonathan Rose Alexander Marquardt
More informationASSIGNMENT PROBLEMS IN PARALLEL AND DISTRIBUTED COMPUTING
ASSIGNMENT PROBLEMS IN PARALLEL AND DISTRIBUTED COMPUTING THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE PARALLEL PROCESSING AND FIFTH GENERATION COMPUTING Consulting Editor Doug DeGroot
More informationINPUT/OUTPUT IN PARALLEL AND DISTRIBUTED COMPUTER SYSTEMS
INPUT/OUTPUT IN PARALLEL AND DISTRIBUTED COMPUTER SYSTEMS THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE INPUT/OUTPUT IN PARALLEL AND DISTRIBUTED COMPUTER SYSTEMS edited by RaviJain
More informationVIDEO CODING. The Second Generation Approach
VIDEO CODING The Second Generation Approach VIDEO CODING The Second Generation Approach EDITED BY luis Torres Department of Signal Theory and Communications, Universitat Politecnica de Catalunya Barcelona,
More informationLow-Power FIR Digital Filters Using Residue Arithmetic
Low-Power FIR Digital Filters Using Residue Arithmetic William L. Freking and Keshab K. Parhi Department of Electrical and Computer Engineering University of Minnesota 200 Union St. S.E. Minneapolis, MN
More informationLinear Programming: Mathematics, Theory and Algorithms
Linear Programming: Mathematics, Theory and Algorithms Applied Optimization Volume 2 The titles published in this series are listed at the end of this volume. Linear Programming: Mathematics, Theory and
More informationTHE DESIGNER S GUIDE TO VERILOG-AMS
THE DESIGNER S GUIDE TO VERILOG-AMS THE DESIGNER S GUIDE BOOK SERIES Consulting Editor Kenneth S. Kundert Books in the series: The Designer s Guide to Verilog-AMS ISBN: 1-00-80-1 The Designer s Guide to
More informationRETARGETABLE CODE GENERATION FOR DIGITAL SIGNAL PROCESSORS
RETARGETABLE CODE GENERATION FOR DIGITAL SIGNAL PROCESSORS RETARGETABLE CODE GENERATION FOR DIGITAL SIGNAL PROCESSORS Rainer LEUPERS University of Dortmund Department of Computer Science Dortmund, Germany
More informationADAPTIVE HYPERTEXT AND HYPERMEDIA
ADAPTIVE HYPERTEXT AND HYPERMEDIA ADAPTIVE HYPERTEXT AND HYPERMEDIA Edited by Peter Brusilovsky Carnegie Mellon University Alfred Kobsa GMDFIT German National Research Centre for Information Technology
More informationEnergy Efficient Microprocessor Design
Energy Efficient Microprocessor Design Energy Efficient Microprocessor Design by Thomas D. Burd Robert W. Brodersen with Contributions Irom Trevor Pering Anthony Stratakos Berkeley Wireless Research Center
More informationAnalysis of Different Multiplication Algorithms & FPGA Implementation
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 4, Issue 2, Ver. I (Mar-Apr. 2014), PP 29-35 e-issn: 2319 4200, p-issn No. : 2319 4197 Analysis of Different Multiplication Algorithms & FPGA
More informationPERFORMANCE ANALYSIS OF REAL-TIME EMBEDDED SOFTWARE
PERFORMANCE ANALYSIS OF REAL-TIME EMBEDDED SOFTWARE PERFORMANCE ANALYSIS OF REAL-TIME EMBEDDED SOFTWARE Yau-Tsun Steven Li Monterey Design Systems, Inc. Sharad Malik Princeton University ~. " SPRINGER
More informationOn-Line Error Detecting Constant Delay Adder
On-Line Error Detecting Constant Delay Adder Whitney J. Townsend and Jacob A. Abraham Computer Engineering Research Center The University of Texas at Austin whitney and jaa @cerc.utexas.edu Parag K. Lala
More informationEfficient Double-Precision Cosine Generation
Efficient Double-Precision Cosine Generation Derek Nowrouzezahrai Brian Decker William Bishop dnowrouz@uwaterloo.ca bjdecker@uwaterloo.ca wdbishop@uwaterloo.ca Department of Electrical and Computer Engineering
More informationHonorary Professor Supercomputer Education and Research Centre Indian Institute of Science, Bangalore
COMPUTER ORGANIZATION AND ARCHITECTURE V. Rajaraman Honorary Professor Supercomputer Education and Research Centre Indian Institute of Science, Bangalore T. Radhakrishnan Professor of Computer Science
More informationAn Introduction to Programming with IDL
An Introduction to Programming with IDL Interactive Data Language Kenneth P. Bowman Department of Atmospheric Sciences Texas A&M University AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD PARIS SAN
More informationComputer Architecture
Computer Architecture Pipelined and Parallel Processor Design Michael J. Flynn Stanford University Technische Universrtat Darmstadt FACHBEREICH INFORMATIK BIBLIOTHEK lnventar-nr.: Sachgebiete: Standort:
More informationApplication-Specific Mesh-based Heterogeneous FPGA Architectures
Application-Specific Mesh-based Heterogeneous FPGA Architectures Husain Parvez H abib Mehrez Application-Specific Mesh-based Heterogeneous FPGA Architectures Husain Parvez Habib Mehrez Université Pierre
More informationVLSI Implementation of Low Power Area Efficient FIR Digital Filter Structures Shaila Khan 1 Uma Sharma 2
IJSRD - International Journal for Scientific Research & Development Vol. 3, Issue 05, 2015 ISSN (online): 2321-0613 VLSI Implementation of Low Power Area Efficient FIR Digital Filter Structures Shaila
More informationAll MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes
MSEE Curriculum All MSEE students are required to take the following two core courses: 3531-571 Linear systems 3531-507 Probability and Random Processes The course requirements for students majoring in
More informationDesign and Analysis of Kogge-Stone and Han-Carlson Adders in 130nm CMOS Technology
Design and Analysis of Kogge-Stone and Han-Carlson Adders in 130nm CMOS Technology Senthil Ganesh R & R. Kalaimathi 1 Assistant Professor, Electronics and Communication Engineering, Info Institute of Engineering,
More informationCHAPTER 1 INTRODUCTION
CHAPTER 1 INTRODUCTION Rapid advances in integrated circuit technology have made it possible to fabricate digital circuits with large number of devices on a single chip. The advantages of integrated circuits
More informationCREATIVE ASSERTION AND CONSTRAINT METHODS FOR FORMAL DESIGN VERIFICATION
CREATIVE ASSERTION AND CONSTRAINT METHODS FOR FORMAL DESIGN VERIFICATION Joseph Richards SGI, High Performance Systems Development Mountain View, CA richards@sgi.com Abstract The challenges involved in
More informationCOE 561 Digital System Design & Synthesis Introduction
1 COE 561 Digital System Design & Synthesis Introduction Dr. Aiman H. El-Maleh Computer Engineering Department King Fahd University of Petroleum & Minerals Outline Course Topics Microelectronics Design
More informationECE 637 Integrated VLSI Circuits. Introduction. Introduction EE141
ECE 637 Integrated VLSI Circuits Introduction EE141 1 Introduction Course Details Instructor Mohab Anis; manis@vlsi.uwaterloo.ca Text Digital Integrated Circuits, Jan Rabaey, Prentice Hall, 2 nd edition
More informationINTRUSION DETECTION AND CORRELATION. Challenges and Solutions
INTRUSION DETECTION AND CORRELATION Challenges and Solutions Advances in Information Security Sushil Jajodia Consulting editor Center for Secure Information Systems George Mason University Fairfax, VA
More informationSPECC: SPECIFICATION LANGUAGE AND METHODOLOGY
SPECC: SPECIFICATION LANGUAGE AND METHODOLOGY SPECC: SPECIFICATION LANGUAGE AND METHODOLOGY Daniel D. Gajski Jianwen Zhu Rainer Dömer Andreas Gerstlauer Shuqing Zhao University of California, Irvine SPRINGER
More informationHigh Speed Special Function Unit for Graphics Processing Unit
High Speed Special Function Unit for Graphics Processing Unit Abd-Elrahman G. Qoutb 1, Abdullah M. El-Gunidy 1, Mohammed F. Tolba 1, and Magdy A. El-Moursy 2 1 Electrical Engineering Department, Fayoum
More informationStructure of Computer Systems
Structure of Computer Systems Structure of Computer Systems Baruch Zoltan Francisc Technical University of Cluj-Napoca Computer Science Department U. T. PRES Cluj-Napoca, 2002 CONTENTS PREFACE... xiii
More informationFILTER SYNTHESIS USING FINE-GRAIN DATA-FLOW GRAPHS. Waqas Akram, Cirrus Logic Inc., Austin, Texas
FILTER SYNTHESIS USING FINE-GRAIN DATA-FLOW GRAPHS Waqas Akram, Cirrus Logic Inc., Austin, Texas Abstract: This project is concerned with finding ways to synthesize hardware-efficient digital filters given
More informationProf. Steven Nowick. Chair, Computer Engineering Program
Prof. Steven Nowick (nowick@cs.columbia.edu) Chair, Computer Engineering Program Overview of 4000-/6000-Level Comp Eng Courses Selective survey of some key computer engineering courses Focus: COMS (i.e.
More informationDIGITAL VS. ANALOG SIGNAL PROCESSING Digital signal processing (DSP) characterized by: OUTLINE APPLICATIONS OF DIGITAL SIGNAL PROCESSING
1 DSP applications DSP platforms The synthesis problem Models of computation OUTLINE 2 DIGITAL VS. ANALOG SIGNAL PROCESSING Digital signal processing (DSP) characterized by: Time-discrete representation
More informationOverview. CSE372 Digital Systems Organization and Design Lab. Hardware CAD. Two Types of Chips
Overview CSE372 Digital Systems Organization and Design Lab Prof. Milo Martin Unit 5: Hardware Synthesis CAD (Computer Aided Design) Use computers to design computers Virtuous cycle Architectural-level,
More informationThe Verilog Hardware Description Language, Fifth Edition
The Verilog Hardware Description Language, Fifth Edition The Verilog Hardware Description Language, Fifth Edition Donald E. Thomas ECE Department Carnegie Mellon University Pittsburgh, PA Philip R. Moorby
More informationHardware Software Codesign of Embedded Systems
Hardware Software Codesign of Embedded Systems Rabi Mahapatra Texas A&M University Today s topics Course Organization Introduction to HS-CODES Codesign Motivation Some Issues on Codesign of Embedded System
More informationVLSI Design Of a Novel Pre Encoding Multiplier Using DADDA Multiplier. Guntur(Dt),Pin:522017
VLSI Design Of a Novel Pre Encoding Multiplier Using DADDA Multiplier 1 Katakam Hemalatha,(M.Tech),Email Id: hema.spark2011@gmail.com 2 Kundurthi Ravi Kumar, M.Tech,Email Id: kundurthi.ravikumar@gmail.com
More informationLow Power and Memory Efficient FFT Architecture Using Modified CORDIC Algorithm
Low Power and Memory Efficient FFT Architecture Using Modified CORDIC Algorithm 1 A.Malashri, 2 C.Paramasivam 1 PG Student, Department of Electronics and Communication K S Rangasamy College Of Technology,
More informationGraphics Programming in c++
Graphics Programming in c++ Springer London Berlin Heidelberg New York Barcelona Budapest Hong Kong Milan Paris Santa Clara Singapore Tokyo Mark Walmsley Graphics Programming in c++ Writing Graphics Applications
More informationImplementation of Efficient Modified Booth Recoder for Fused Sum-Product Operator
Implementation of Efficient Modified Booth Recoder for Fused Sum-Product Operator A.Sindhu 1, K.PriyaMeenakshi 2 PG Student [VLSI], Dept. of ECE, Muthayammal Engineering College, Rasipuram, Tamil Nadu,
More informationVERILOG QUICKSTART. James M. Lee Cadence Design Systems, Inc. SPRINGER SCIENCE+BUSINESS MEDIA, LLC
VERILOG QUICKSTART VERILOG QUICKSTART by James M. Lee Cadence Design Systems, Inc. ~. " SPRINGER SCIENCE+BUSINESS MEDIA, LLC ISBN 978-1-4613-7801-3 ISBN 978-1-4615-6113-2 (ebook) DOI 10.1007/978-1-4615-6113-2
More informationHIGH PERFORMANCE QUATERNARY ARITHMETIC LOGIC UNIT ON PROGRAMMABLE LOGIC DEVICE
International Journal of Advances in Applied Science and Engineering (IJAEAS) ISSN (P): 2348-1811; ISSN (E): 2348-182X Vol. 2, Issue 1, Feb 2015, 01-07 IIST HIGH PERFORMANCE QUATERNARY ARITHMETIC LOGIC
More informationFundamentals of Operating Systems
Fundamentals of Operating Systems Macmillan Computer Science Series Consulting Editor Professor F. H. Sumner, University of Manchester S. T. Allworth, Introduction to Real-time Software Design Ian O. Angell,
More informationOptimization of Vertical and Horizontal Beamforming Kernels on the PowerPC G4 Processor with AltiVec Technology
Optimization of Vertical and Horizontal Beamforming Kernels on the PowerPC G4 Processor with AltiVec Technology EE382C: Embedded Software Systems Final Report David Brunke Young Cho Applied Research Laboratories:
More informationOPTIMIZATION OF AREA COMPLEXITY AND DELAY USING PRE-ENCODED NR4SD MULTIPLIER.
OPTIMIZATION OF AREA COMPLEXITY AND DELAY USING PRE-ENCODED NR4SD MULTIPLIER. A.Anusha 1 R.Basavaraju 2 anusha201093@gmail.com 1 basava430@gmail.com 2 1 PG Scholar, VLSI, Bharath Institute of Engineering
More informationPipelined Quadratic Equation based Novel Multiplication Method for Cryptographic Applications
, Vol 7(4S), 34 39, April 204 ISSN (Print): 0974-6846 ISSN (Online) : 0974-5645 Pipelined Quadratic Equation based Novel Multiplication Method for Cryptographic Applications B. Vignesh *, K. P. Sridhar
More informationEE878 Special Topics in VLSI. Computer Arithmetic for Digital Signal Processing
EE878 Special Topics in VLSI Computer Arithmetic for Digital Signal Processing Part 6c High-Speed Multiplication - III Spring 2017 Koren Part.6c.1 Array Multipliers The two basic operations - generation
More informationHigh Level Abstractions for Implementation of Software Radios
High Level Abstractions for Implementation of Software Radios J. B. Evans, Ed Komp, S. G. Mathen, and G. Minden Information and Telecommunication Technology Center University of Kansas, Lawrence, KS 66044-7541
More informationMULTIMEDIA DATABASE MANAGEMENT SYSTEMS
MULTIMEDIA DATABASE MANAGEMENT SYSTEMS THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE MULTIMEDIA SYSTEMS AND APPLICATIONS Recently Published Titles: Consulting Editor Borko Furht Florida
More informationDistributed Virtual Reality Computation
Jeff Russell 4/15/05 Distributed Virtual Reality Computation Introduction Virtual Reality is generally understood today to mean the combination of digitally generated graphics, sound, and input. The goal
More informationHardware Software Codesign of Embedded System
Hardware Software Codesign of Embedded System CPSC489-501 Rabi Mahapatra Mahapatra - Texas A&M - Fall 00 1 Today s topics Course Organization Introduction to HS-CODES Codesign Motivation Some Issues on
More informationAn Overview of the Ptolemy Project. Organizational
An Overview of the Ptolemy Project Edward A. Lee Professor and Principal Investigator UC Berkeley Dept. of EECS Copyright 1997, The Regents of the University of California All rights reserved. Organizational
More informationInternational Journal for Research in Applied Science & Engineering Technology (IJRASET) IIR filter design using CSA for DSP applications
IIR filter design using CSA for DSP applications Sagara.K.S 1, Ravi L.S 2 1 PG Student, Dept. of ECE, RIT, Hassan, 2 Assistant Professor Dept of ECE, RIT, Hassan Abstract- In this paper, a design methodology
More informationA Quadruple Precision and Dual Double Precision Floating-Point Multiplier
A Quadruple Precision and Dual Double Precision Floating-Point Multiplier Ahmet Akkaş Computer Engineering Department Koç University 3445 Sarıyer, İstanbul, Turkey ahakkas@kuedutr Michael J Schulte Department
More informationGroupware and the World Wide Web
Groupware and the World Wide Web Edited by Richard Bentley, Uwe Busbach, David Kerr & Klaas Sikkel German National Research Center for Information Technology, Institutefor Applied Information Technology
More informationFixed-Width Recursive Multipliers
Fixed-Width Recursive Multipliers Presented by: Kevin Biswas Supervisors: Dr. M. Ahmadi Dr. H. Wu Department of Electrical and Computer Engineering University of Windsor Motivation & Objectives Outline
More informationImplementation of a Unified DSP Coprocessor
Vol. (), Jan,, pp 3-43, ISS: 35-543 Implementation of a Unified DSP Coprocessor Mojdeh Mahdavi Department of Electronics, Shahr-e-Qods Branch, Islamic Azad University, Tehran, Iran *Corresponding author's
More informationARCHITECTURAL DESIGN OF 8 BIT FLOATING POINT MULTIPLICATION UNIT
ARCHITECTURAL DESIGN OF 8 BIT FLOATING POINT MULTIPLICATION UNIT Usha S. 1 and Vijaya Kumar V. 2 1 VLSI Design, Sathyabama University, Chennai, India 2 Department of Electronics and Communication Engineering,
More informationBibliography. Measuring Software Reuse, Jeffrey S. Poulin, Addison-Wesley, Practical Software Reuse, Donald J. Reifer, Wiley, 1997.
Bibliography Books on software reuse: 1. 2. Measuring Software Reuse, Jeffrey S. Poulin, Addison-Wesley, 1997. Practical Software Reuse, Donald J. Reifer, Wiley, 1997. Formal specification and verification:
More informationHardware/Software Co-design
Hardware/Software Co-design Zebo Peng, Department of Computer and Information Science (IDA) Linköping University Course page: http://www.ida.liu.se/~petel/codesign/ 1 of 52 Lecture 1/2: Outline : an Introduction
More informationLOGIC SYNTHESIS FOR FIELD-PROGRAMMABLE GATE ARRAYS
LOGIC SYNTHESIS FOR FIELD-PROGRAMMABLE GATE ARRAYS THE KLUWER INTERNATIONAL SERIES IN ENGINEERING AND COMPUTER SCIENCE VLSI, COMPUTER ARCHITECTURE AND DIGITAL SIGNAL PROCESSING Consulting Editor Jonathan
More informationProgrammable Logic Devices II
São José February 2015 Prof. Hoeller, Prof. Moecke (http://www.sj.ifsc.edu.br) 1 / 28 Lecture 01: Complexity Management and the Design of Complex Digital Systems Prof. Arliones Hoeller arliones.hoeller@ifsc.edu.br
More informationRECENTLY, researches on gigabit wireless personal area
146 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 55, NO. 2, FEBRUARY 2008 An Indexed-Scaling Pipelined FFT Processor for OFDM-Based WPAN Applications Yuan Chen, Student Member, IEEE,
More informationChapter 5: ASICs Vs. PLDs
Chapter 5: ASICs Vs. PLDs 5.1 Introduction A general definition of the term Application Specific Integrated Circuit (ASIC) is virtually every type of chip that is designed to perform a dedicated task.
More informationHardware Modeling using Verilog Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur
Hardware Modeling using Verilog Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture 01 Introduction Welcome to the course on Hardware
More informationSystem Design and Methodology/ Embedded Systems Design (Modeling and Design of Embedded Systems)
Design&Methodologies Fö 1&2-1 Design&Methodologies Fö 1&2-2 Course Information Design and Methodology/ Embedded s Design (Modeling and Design of Embedded s) TDTS07/TDDI08 Web page: http://www.ida.liu.se/~tdts07
More informationSoft-Core Embedded Processor-Based Built-In Self- Test of FPGAs: A Case Study
Soft-Core Embedded Processor-Based Built-In Self- Test of FPGAs: A Case Study Bradley F. Dutton, Graduate Student Member, IEEE, and Charles E. Stroud, Fellow, IEEE Dept. of Electrical and Computer Engineering
More informationHigh Performance Computer Architecture Prof. Ajit Pal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur
High Performance Computer Architecture Prof. Ajit Pal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture - 23 Hierarchical Memory Organization (Contd.) Hello
More informationDESIGN AND ANALYSIS OF COMPETENT ARITHMETIC AND LOGIC UNIT FOR RISC PROCESSOR
DESIGN AND ANALYSIS OF COMPETENT ARITHMETIC AND LOGIC UNIT FOR RISC PROCESSOR M. Priyanka 1 and T. Ravi 2 1 M.Tech VLSI Design, Sathyabama University, Chennai, Tamil Nadu, India 2 Department of Electronics
More informationDESIGN AND IMPLEMENTATION OF VLSI SYSTOLIC ARRAY MULTIPLIER FOR DSP APPLICATIONS
International Journal of Computing Academic Research (IJCAR) ISSN 2305-9184 Volume 2, Number 4 (August 2013), pp. 140-146 MEACSE Publications http://www.meacse.org/ijcar DESIGN AND IMPLEMENTATION OF VLSI
More informationHigh Quality 4D Development. David Adams
David Adams Introduction 1 Chapter 1 - About This Training 3 What To Expect... 3 Course Organization... 3 Course Contents... 3 About The Materials... 5 Conventions Used in This Manual... 7 Phones & Pagers...
More informationDatabase Optimization
Database Optimization June 9 2009 A brief overview of database optimization techniques for the database developer. Database optimization techniques include RDBMS query execution strategies, cost estimation,
More informationDigital Signal Processing for Multimedia Systems
Digital Signal Processing for Multimedia Systems edited by Keshab K. Parhi University of Minnesota Minneapolis, Minnesota Takao Nishitani NEC Corporation Sagamihara, Japan MARCEL MARCEL DEKKER, INC. NEW
More informationDigital Signal Processing System Design: LabVIEW-Based Hybrid Programming Nasser Kehtarnavaz
Digital Signal Processing System Design: LabVIEW-Based Hybrid Programming Nasser Kehtarnavaz Digital Signal Processing System Design: LabVIEW-Based Hybrid Programming by Nasser Kehtarnavaz University
More informationSoftware Synthesis Trade-offs in Dataflow Representations of DSP Applications
in Dataflow Representations of DSP Applications Shuvra S. Bhattacharyya Department of Electrical and Computer Engineering, and Institute for Advanced Computer Studies University of Maryland, College Park
More informationDIGITAL DESIGN TECHNOLOGY & TECHNIQUES
DIGITAL DESIGN TECHNOLOGY & TECHNIQUES CAD for ASIC Design 1 INTEGRATED CIRCUITS (IC) An integrated circuit (IC) consists complex electronic circuitries and their interconnections. William Shockley et
More informationEE382V: System-on-a-Chip (SoC) Design
EE382V: System-on-a-Chip (SoC) Design Lecture 8 HW/SW Co-Design Sources: Prof. Margarida Jacome, UT Austin Andreas Gerstlauer Electrical and Computer Engineering University of Texas at Austin gerstl@ece.utexas.edu
More informationLinköping University Post Print. epuma: a novel embedded parallel DSP platform for predictable computing
Linköping University Post Print epuma: a novel embedded parallel DSP platform for predictable computing Jian Wang, Joar Sohl, Olof Kraigher and Dake Liu N.B.: When citing this work, cite the original article.
More informationINTERCONNECT TESTING WITH BOUNDARY SCAN
INTERCONNECT TESTING WITH BOUNDARY SCAN Paul Wagner Honeywell, Inc. Solid State Electronics Division 12001 State Highway 55 Plymouth, Minnesota 55441 Abstract Boundary scan is a structured design technique
More informationDesign of a Multiplier Architecture Based on LUT and VHBCSE Algorithm For FIR Filter
African Journal of Basic & Applied Sciences 9 (1): 53-58, 2017 ISSN 2079-2034 IDOSI Publications, 2017 DOI: 10.5829/idosi.ajbas.2017.53.58 Design of a Multiplier Architecture Based on LUT and VHBCSE Algorithm
More information