TODAY, new applications, e.g., multimedia or advanced

Size: px
Start display at page:

Download "TODAY, new applications, e.g., multimedia or advanced"

Transcription

1 584 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 45, NO. 5, MAY 1998 A Formal Technique for Hardware Interface Design Adel Baganne, Jean-Luc Philippe, and Eric Martin, Member, IEEE Abstract In this paper, we consider the problem of hardware interface design in a codesign approach for real-time digital signal processing (DSP) applications. We refer to the hardware component as ASICS (Applied Specific Integrated Circuits) and the software component as processors. We describe a formal technique to communication synthesis starting from hardware I/O transfer sequences computed by a high level synthesis tool, like GAUT. The original nature of our work is the fact that a communication interface is generated at the same time as the hardware module which leads to better performance and optimization and ensures communication data coherency. Our design strategy starts from the hardware I/O transfer sequences computed by GAUT. It incorporates some interface specification (I/O transfer order, timing constraints) obtained by any cosynthesis tool. The proposed allocation procedure of necessary storage components needed for data communication between hardware software components assigns for each I/O data a time interval at which its transfer could occur. As an illustration, we present a mixed implementation of the GMDF alpha algorithm, an adaptive filter well suited to acoustic echo cancellation, on both ASIC and TS320C40 DSP. Index Terms Communication synthesis, DSP systems, hardware/software codesign, interface generation, real time. I. INTRODUCTION TODAY, new applications, e.g., multimedia or advanced mobile communication systems are emerging. They need implementation of complex real-time digital signal processing (DSP) algorithms under both time, cost, and power constraints in a very short time-to-market, Hardware Software (H/S) Codesign has been widely recognized as a solution to the new requirements of system design [1], [4]. Heterogeneous implementation of applications on both hardware and software components provides significant gain of time and better performance. Traditionally, the main steps in the codesign methodology are system modeling and specification, analysis of constraints and requirements, H/S partitioning and scheduling, hardware and software module generation, H/S interface synthesis, cosimulation, and system integration [1]. This paper deals with the problem of H/S interface synthesis task which is a key issue in codesign. It specifies how the software and hardware components communicate. Interface synthesis can be divided into two major subtasks, namely communication synthesis and H/S synchronization. Communication synthesis consists in determining: 1) the communication protocol used for data transfer between com- Manuscript received October 10, 1997; revised February 25, This paper was recommended by Guest Editors F. Maloberti and W. C. Siu. The authors are with the LESTER Laboratory, Department of Electrical Engineering, Université de Bretagne Sud, Saint Haouen 56100, Lorient, France ( Adel.Baganne@univ-ubs.fr). Publisher Item Identifier S (98) ponents (blocking or nonblocking, master slave, special protocols [7]); 2) timing requirements for each data transfer (transfer delay estimation, date of transfer, and 3) the transfer mode between hardware and software (Memory mapped I/O scheme, looped DMA transfers, direct transfer by point-to-point link, etc.). The H/S synchronization subtask refers to making data transfer happen in a specified time order by task and communication scheduling, or more especially, coordinating the real time presentation of data and maintaining the time-ordered relation among hardware and software components. We present in this paper a design methodology of a hardware interface that supports functional constraints description of I/O data exchanged between hardware and software. We refer to the hardware component as ASICS (Applied Specific Integrated Circuits) and the software component as processors. Our work domain is the real time implementation of DSP and image applications. These applications are characterized by a parameter called the iteration period. All computations are repeated every time. Real time implementation of these applications on mixed H/S architecture should satisfy the following periodic constraints: 1) execution timing constraints on each component and 2) I/O data transfer sequences between both components with their timing specifications. Such constraints can be obtained by any cosynthesis tool [6]. We consider the hardware interface synthesis task as a part of the ASIC generation process by means of a high level synthesis (HLS) tool GAUT [3]. This tool involves several techniques such as behavioral transformations or hardware selection that allows a large exploration of the hardware design process. Our hardware design methodology is based on the following points: Processing unit (PU) of the ASIC is the first functional unit synthesized by GAUT because it undergoes the most important constraints from the real time execution constraints. the hardware interface is synthesized after PU and takes into account both I/O PU constraints and Processors I/O constraints specified by any cosynthesis tool used for system partitioning. Our design method allows the overlapping of computation and I/O communication which leads to better performances in execution time and hardware cost reduction. II. PREVIOUS WORK Few works have addressed the problem of interface synthesis in a global way. Most of them can be classified into two main categories. The first category aims to incorporate interface timing constraints into the scheduling of operations during high level synthesis [7], [9]. In [7], the authors presented a /98$ IEEE

2 BAGANNE et al.: A FORMAL TECHNIQUE FOR HARDWARE INTERFACE DESIGN 585 technique that attempts interface optimization by minimizing the number of blocking communication between two processes and by merging their communication channels. These processes could be rescheduled to satisfy the timing requirements. In [9] authors examined the bus and protocol generation problem after the partitioning step. They present a channel grouping technique based on average rate estimations for each communication channel. The second category addressed the interface synthesis problem between standard components which have incompatible protocols [4], [6], [10]. In [5], the authors described a template-based technique for transducers synthesis directly from timing diagrams. They used timing diagrams and signal Transitions Graphs (STG) for protocols specifications and the hardware interface is synthesized with asynchronous logic. In our case, we propose a new approach for hardware interface design that allows high overlapping between computation and I/O communication. The interface design is carried out after hardware module generation. The main objective of our synthesis is the reduction of both I/O communication overhead and hardware cost. The organization of this paper is as follows. In Section III, we briefly describe the main steps of our system design flow and we state the assumptions we make about our system target architecture and hardware design by Gaut HLS. In Section IV, we give our formulation of the hardware interface synthesis problem. In Section V, a generic architecture of the interface is presented and discussed. In Section VI, we introduce a timing and communication model for the I/O data transferred between the ASIC and the processors. In Section VII, we present our design technique for analysis and synthesis of the interface buffers under I/O timing requirements. In Section VIII, we present some results relative to the mixed implementation of the GMDF alpha algorithm [13], an adaptive filter well suited to acoustic echo cancellation, on an ASIC and a TMSC40 DSP. III. SYSTEM DESIGN OVERVIEW Our system target architecture consists of an ASIC (hardware component) and a general purpose processor (software component). Fig. 1 illustrates the design flow of both hardware and software components. We consider the design of real time application that is described at the behavioral level by some specification language [1] such as SDL, Statecharts, etc. We assume that a cosynthesis tool such as [6], [8] is used to partition the specification into three components: software, hardware, and interface. The software partition is composed of threads ( ) which are sets of computations that execute in deterministic time and other sets of computation that execute in nondeterministic time (such as, loops and synchronization tasks). A control-data driven scheduler is used in the software that allows both hardware and software to schedule threads over time and to exchange I/O data [8]. The hardware partition is specified with VHDL at behavioral level and constitutes the entry point to the high level synthesis tool GAUT. From this description, the timing constraints (computation delays) imposed after partitioning, and a technological driven library (standard cells, FPGA), the hardware architecture will Fig. 1. Hardware module design overview. be synthesized by GAUT HLS. This architecture is based on four functional units (see Fig. 2): the Processing Unit (PU), the Control Unit (CU), the Memory Unit (MU), and the Communication Unit (UCOM) that implements the H/S interface in the hardware side. The generic architecture topology of the PU is based on elementary cells including an operator, registers and interconnecting components. The memory unit implements registers, memory banks and their associated address generators, as well as channel for data transfer. After the PU synthesis step, GAUT produces a set of functional constraints relative to the PU I/O transfer sequences: the production and consumption dates of the I/O data. This sequence thus constitutes the background and the schedule of conditions for the ASIC s interface synthesis procedure. Our design process of the ASIC s interface requires some knowledge (software part) about I/O communication data exchanged with the processor such as I/O busses number, data transfer delays, and partial or total ordering of data transfer. These information could be obtained by some synthesis process such as Bus generation [10] (bus structure synthesis, by trading off the width of the bus and the processes communication over it) or protocol generation (how to define the exact mechanism of data transfer over the bus) [9]. Therefore, the H/S interface specification is assumed to be further annotated with detailed information. The following lists a description style of such information.

3 586 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 45, NO. 5, MAY 1998 How to generate the smallest amount of hardware (datapath and control logic) that still meets the I/O sequence constraints (timing requirement, data transfer order) and allows a high overlapping between computation and I/O communication? The aims are the following [see Fig. 3(c)]: mixing and demixing of data busses of processor toward PU busses: it concerns to ensure all the necessary link between the Processor and the PU busses. synthesis of I/O data storage under IOCC constraints. Fig. 2. Generic architecture of the ASIC. 1) The ASIC-processor interconnection is realized by direct links. Communications between both components are based on the message passing model [8] and are buffered in order to reduce synchronization delays. 2) The communication protocol is slave master where the software component (Processor) works as a master. The hardware and software modules interacts with an interrupt driven mechanism. The software running on processor is capable of communicating with the ASIC and performing an interrupt-driven I/O. 3) The communication channels between threads and hardware partition are mapped to I/O busses such as in [10]. Therefore bus transactions between the processor and the ASIC could be described at the lowest level: I/O data structure (scalar, vector) and their timing constraints are known. In the rest of the paper, we will refer to these constraints as IOCC (Input Output Cosynthesis Constraints). Therefore, these information constitute another entry point to our ASIC s interface design. IV. PROBLEM FORMULATION Let us consider a mixed implementation of a DSP application where the hardware component consists in a FFT implementation. For simplicity, we consider a four points FFT. Fig. 3(a) shows a target architecture composed by an ASIC (hardware component) and a processor (software component). At each iteration, an I/O sequence is transferred over the bus B between both components. Each communication data may be described by different ways: 1) transaction order and 2) timing constraints determined by some communication synthesis process [10], [11]. These constraints can include strict timing specifications (fixed dates of transfers, transfer deadlines) (see Section VI-A) or only just I/O data transaction order. As mentioned previously our architectural synthesis tool GAUT starts ASIC design cycle by the PU and gives the consumption and gives the production and consumption dates of I/O data. and are the fully specified I/O sequences transferred, respectively, over B1 and B2 (internal busses of the PU). The problem of the hardware interface generation can be explained in the following way: V. GENERIC MODEL OF THE HARDWARE INTERFACE The hardware interface model that we have adopted can be seen in Fig. 3(d). It is composed of datapath and an FSM-based controller. The datapath of the interface includes three major components: buffers for storing I/O data, busses, and interconnection components such as multiplexers, demultiplexers, and tristate buffers. The storage buffers are composed of FIFO s, LIFO s, and registers. Busses included in the interface have two types: external and internal. External busses have two categories. The first one is I/O busses which represent the physical links between ASIC and processor. Their number and the bus width are fixed by the designer or determined by some bus generation tasks such as [10]. The second category is the Interface-Processing Unit busses. Their number is determined by the synthesis step of the processing unit. However, it should be greater or equal to the maximum number of simultaneous data transfer between the interface and PU. The FSM based controller implements the communication protocol defined by the cosynthesis tool and generates the adequate control signals for the datapath components. VI. MODELING AND SPECIFICATION OF I/O SEQUENCES In this section, we will consider that processor and ASIC have the same reference of time. In the following sections, we will describe the timing specification of the ASIC-processor I/O data transfer. A. ASIC-Processor I/O Transfer Sequences In our model (see Fig. 4) the processor has busses connected to the ASIC. As mentioned previously, the set ordering of communication data is provided by a cosynthesis tool and therefore known in advance. Let us have an ordered set of all I/O data transferred over busses in one iteration period. Some of the data item may be structured (vectors). Let us define the set as an ordered set of data subsets: where denotes the ordered sets of data subsets (vectors) transferred over the th bus. the single subset transferred over the is defined as a set of ordered scalar th bus

4 BAGANNE et al.: A FORMAL TECHNIQUE FOR HARDWARE INTERFACE DESIGN 587 (a) (b) (c) (d) Fig. 3. (a) Example: hardware FFT implementation. (b) ASIC-processor data transfer sequences. (c) Hardware interface synthesis problem. (d) Generic model of the hardware interface. Fig. 4. Topology model of the hardware interface. Let be the common reference of time and let and be, respectively, the start and the end of transfer of the th data item on the th bus (see Fig. 5). The communication time for transfer is defined as. Note that our a priori knowledge only concerns the mutual time constraints of data, and not the time values themselves, i.e.,. However, our model can be refined and extended to cases where all data ordering and their timing constraints are fully satisfied. Timing constraints are then introduced to define upper and lower bounds between the start times of two data transfers. The following equations express the main ( ) timing constraints that can be specified for each pair of the exchanged data unit: where and where Fig. 5. Timing specifications of I/O processor data. and define, respectively, the minimum and maximum delays between two consecutive subset transfers over the th bus. defines the minimum delay between transfer starts of the data units and. B. Processing Unit I/O Transfer We have mentioned previously that all the PU I/O transfer sequences are fully specified by GAUT HLS. The hardware interface exchanges data with the PU over internal busses (see Fig. 4). Let be the set of PU I/O busses and an ordered I/O sequence transferred over the th bus.

5 588 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 45, NO. 5, MAY 1998 TABLE I ASIC S I/O DATA DESCRIPTION Definition 3: The Data Test Function (DTF) of the module is defined by the Boolean function : where if the I/O data can share the resource with all data of the sequence. Definition 4: The Sequence Storage Function (SSF) of the module is defined by : where represents the time interval at which the transfer of data, assumed to be stored in, can be occurred. Each input data (toward the ASIC) that belongs to is described by the following information: (Label, Receive step, First read step, Last read step, Source, Destination). The label is basically an integer identifier. The receive step [denoted as ] is the control step (c-step) at which is received by the communication unit from the processor. The first read step [denoted as ] is the c-step of the first use of by the PU. Similarly, the last read step [denoted as ] is the c-step of the last use of by the PU. The source and destination are integer values to represent the I/O ports of the UCOM or the PU that produces and uses the data. Each output data (toward the processor) that belongs to is described by the following information: (Label, Production step, Emission step, Source, Destination). The production step [denoted as ] is the c-step at which is produced by the PU. The Emission step is the c-step at which is sent to the processor. The Other information of output data have the same meaning of those of input data. All variable descriptions presented above are resumed in X Table I. C. Definitions Let be the execution time constraint of the hardware module. Let be a storage module, ( may be a Register, FIFO, or LIFO). Let and be, respectively, an input and output data that belong to the PU I/O data sequence. Definition 1: The PU lifetime of I/O data is defined by for input data and for output data. To keep homogeneous notations, we give an equivalent notation to as. Definition 2: The UCOM lifetime of I/O data is defined by for input data and for output data. Note that. VII. I/O DATA STORAGE AND OPTIMIZATION In this section, we are interested in allocation and mapping of I/O data sequences to specific storage modules which are queues (FIFO) stacks (LIFO) and registers. The allocation problem of I/O data may be solved by means of traditional techniques used for resources sharing [2] if all I/O timing are fully specified. However a full I/O timing specification can hardly be obtained because H/S communications synthesis is based on execution-time estimations, and not guaranteed to be cycle-accurate. The allocation procedure we propose here starts from the PU I/O sequence timing (fully specified by GAUT HLS) and incorporates the I/O constraints presented in Section III. Our goal is to assign, for each I/O data, a time interval at which the transfer to/from the processor could occur. This approach gives the designer a fine grain description of data transfer timing and such information could be exploited to refine and improve the granularity of system partitioning. We have developed temporal transformations SSF (, see Definition 4) based on functional properties of each storage module. Each function takes into account conditions that should be fulfilled by each pair of data to be allocated to the same storage component. For example, conditions relative to data storage in the same register must have disjoint lifetimes (the two intervals have empty intersection). All data to be allocated to FIFO or LIFO buffers must have the same type, e.g., all of them are input data or output data and must have disjoint lifetimes. Here are the main steps of our allocation procedure: First, given the PU I/O sequence and its relative constraints (I/O timing requirements and data ordering), we determine which data that can be grouped in the same storage module. Table II shows PU conditions that should be fulfilled by each pair of data to be allocated to the same storage component (Register, FIFO, LIFO). This step produces a set of subsequences that could be stored into the module is the DTF function of the module. Second, for each subsequences, some accessing constraints of the storage component are added. These constraints (denoted as access conditions in Table II) express the relationship between input and output sequence relative to each storage module. For example, the input sequence of Register or Fifo must be the same as the output sequence. Third, we apply the SSF function on each data subsequence. This function takes into account all the

6 BAGANNE et al.: A FORMAL TECHNIQUE FOR HARDWARE INTERFACE DESIGN 589 TABLE II TIMING CONDITIONS RELATIVE TO STORAGE MODULE ALLOCATION Fig. 6. DFG of GMDF algorithm. TABLE III SSF FUNCTIONS: TRANSFORMATIONS AND DERIVED CONSTRAINTS conditions discussed above and produce for each data a time interval in which the data transfer could occur. In Table III, we describe the SSF transformation of each storage module and its respective derived constraints. These derived constraints can be merged with some user constraints and therefore the allocation procedure carries out, by means of some recursive heuristics we have developed, the following tasks: 1) constraints analysis and checking; 2) I/O buffer sizing; 3) determination for each I/O data a time interval. VIII. DESIGN EXAMPLE: ACOUSTIC ECHO CANCELLER GMDF ALPHA In this example, we consider a mixed implementation of the acoustic echo canceler GMDF algorithm on heterogeneous architecture based on an ASIC and a TMS320C40 DSP. Previous implementations of this algorithm on two DSP C40 are described in [13]. A. Acoustic Echo Cancellation The real time implementation of acoustic echo cancellation algorithms is a challenging problem, especially in teleconference application, because in this context, the acoustic impulse response is very long (1024 taps) and the sampling frequency is high (16 khz). The GMDF algorithm (standing for generalized multi-delay adaptive filter with oversampling factor )isa frequency domain block algorithm where a block of input data is processed at one time, producing a block of outputs. The analysis of the learning behavior and performance of GMDF algorithm can be found in [13]. Fig. 6 shows the data flow graph (DFG) of the GMDF. This DFG includes computation tasks (shaded nodes) and some waiting delays, which are equal to a multiple of the sampling period. Note that computation of the convolution (tasks ) and the filter coefficients updating (tasks ) need the storage of complex values corresponding to FFT s of past blocks of reference signal samples. Note also, that set of tasks such as and may be executed in parallel. B. GMDF Partitioning and Scheduling The available time to meet the real time execution of GMDF is DSP cycles (1 cycle = 50 ns). We used a manual partitioning where the FFT is executed on the dedicated hardware component. All the other tasks are executed on the TMS320C40. The ASIC-processor interconnection is realized by one point-to-point link.

7 590 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 45, NO. 5, MAY 1998 Fig. 8. Hardware interface of the ASIC (FFT). Fig. 8 shows the hardware interface buffers obtained by our allocation procedure. They are composed of two FIFO memories (123 and 112 points) and eight registers. These buffers carry out the storage of 1024 I/O data relative to each FFT execution. The proposed design offers a significant gain of I/O buffer sizing (420%). Fig. 7. Scheduling and execution of the GMDF. TABLE IV HARDWARE DESIGN RESULTS: ASIC ARCHITECTURE IX. CONCLUSION We have presented in this paper a method for hardware interface synthesis taking into account two types of design constraints. The first one consists in some interface specifications produced by a cosynthesis tool. The second one is represented by I/O transfer sequence constraints resulting from the ASIC s processing unit design by a high-level synthesis tool GAUT. Our hardware interface design ensures: 1) real time execution of DSP applications and 2) I/O data coherence, by means of formal transformations we have developed, between processor and PU I/O transfer sequences. We described a global modeling of data transactions between the software (Processor) and hardware (ASIC) modules and we presented a generic model of on-chip hardware interface. The necessary storage components of the interface are composed of queues, stacks, and registers. The allocation procedure we described is based on timing transformation relative to each storage component. The proposed synthesis method allows I/O synchronization delay reduction, a high overlapping between computation and I/O communication and leads to good performance in I/O buffer sizing and optimization. REFERENCES The FFT (real, 512 points) execution time imposed to the ASIC design is 4000 cycles (0, 2 ms). Fig. 7 presents the GMDF scheduling on both DSP and ASIC produced by SYNDEX CAD tool [12]. This tool offers a rapid prototyping of DSP applications on TMS320C40 based architectures. The temporal diagram obtained shows the H/S communications (oblique lines), and tasks scheduling for each component. The total number of cycles of algorithm execution is approximately Table IV shows the hardware architecture of the ASIC obtained by GAUT tool. The component library is extracted from the Compass library (16 bits CMOS 1- m standard cells). [1] G. De Micheli and M. Sami, Hardware/Software Codesign. Amsterdam, The Netherlands: Kluwer Academic, [2] D. D. Gajski and N. Dutt, High Level Synthesis. Amsterdam, The Netherlands: Kluwer Academic, [3] E. Martin, O. Sentieys, H. Dubois, and J. L. Philippe, GAUT, An architectural synthesis tool for dedicated signal processors, in Proc. EURO-DAC, 1993, pp [4] Toward a multi-formalism framework for architectural synthesis: The ASAR project members, in Proc. 3rd Int, Workshop on H/S Codesign, Grenoble, France, Sept [5] G. Boriello and R. Katz, Synthesis of interface transducer logic, in Proc. ICCAD, 1987, pp [6] P. H. Chou, R. B Ortega, and G. Boriello, The Chinook hardware/software co-synthesis system, in Proc. ISSS, 1995, pp [7] D. Filo, D. C. Ku, C. N. Coelho, and G. De Micheli, Interface optimization for concurrent systems under timing constraints, IEEE Trans. VLSI Syst., vol. 1, pp , Sept

8 BAGANNE et al.: A FORMAL TECHNIQUE FOR HARDWARE INTERFACE DESIGN 591 [8] R. K. Gupta, C. N. Coelho, and G. De Micheli, Synthesis and simulation of digital systems containing interacting hardware and software components, in Proc. DAC, 1992, pp [9] S. Narayan and D. D Gajski, Protocol generation for communication channels, in Proc. DAC, 1994, pp [10], Synthesis of system level bus interfaces, in Proc. DAC, 1994, pp [11] C. N. Coelho, Jr. et al., Redesigning hardware software systems, in Proc. 3rd Int. Workshop on H/S Codesign, Grenoble, France, Sept [12] Y. Sorel, Massively parallel computing systems with real time constraints: The algorithm architecture adequation, in Proc. Massively Parallel Computing Systems Conf., Italy, 1994, pp [13] A. Baganne, A. Gilloire, E. Martin, P. Martineau, and J. P. Thomas, Implementation methods of an acoustic echo cancellation algorithm GMDF, in Proc. 4th Int. Workshop on Acoustic Echo and Noise Control, Roros, Norway, 1995, pp Adel Baganne was born in Tunisia in He received the Ph.D. degree in signal processing and telecommunications at the University of Rennes, France, in 1997, the Engineer degree in electronics from the National Superior Engineering School in Angers (ESEO), France, in 1993, and the Masters degree in computer architecture and signal processing from the University of Rennes in He is presently with the Université de Bretagne Sud, Lorient, France, where he is an Assistant Professor. His research interests, inside the LESTER Laboratory, include communication synthesis and hardware generation for codesign, computer architecture, VLSI design for fast signal processing, and CAD tools. Jean-Luc Philippe received the Ph.D. degree in signal processing from the University of Rennes in Since 1992, he had been an Assistant Professor at ENSSAT, University of Rennes, conducting a research group in CAD for VLSI design. Currently, he is Professor in computer science at the Universitéde Bretagne Sud, Lorient, France. His research interests include signal and image processing systems, VLSI, and high level synthesis. Eric Martin (M 93) was born in He was Holder of the agregation at the Ecole Normale Supérieure of Cachan, France, where he received the Ph.D. degree. He founded the LESTER Lab (Laboratoire d Electronique des Systèmes Temps Réels), Université de Bretagne Sud, Lorient, France, in 1994, which he Heads currently as Professor. His research interests are VLSI, high level synthesis, and the conception of dedicated architecture to signal, image processing, and telecommunication.

Scheduling with Bus Access Optimization for Distributed Embedded Systems

Scheduling with Bus Access Optimization for Distributed Embedded Systems 472 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 8, NO. 5, OCTOBER 2000 Scheduling with Bus Access Optimization for Distributed Embedded Systems Petru Eles, Member, IEEE, Alex

More information

Hardware/Software Co-design

Hardware/Software Co-design Hardware/Software Co-design Zebo Peng, Department of Computer and Information Science (IDA) Linköping University Course page: http://www.ida.liu.se/~petel/codesign/ 1 of 52 Lecture 1/2: Outline : an Introduction

More information

Hardware Software Codesign of Embedded System

Hardware Software Codesign of Embedded System Hardware Software Codesign of Embedded System CPSC489-501 Rabi Mahapatra Mahapatra - Texas A&M - Fall 00 1 Today s topics Course Organization Introduction to HS-CODES Codesign Motivation Some Issues on

More information

Managing Dynamic Reconfiguration Overhead in Systems-on-a-Chip Design Using Reconfigurable Datapaths and Optimized Interconnection Networks

Managing Dynamic Reconfiguration Overhead in Systems-on-a-Chip Design Using Reconfigurable Datapaths and Optimized Interconnection Networks Managing Dynamic Reconfiguration Overhead in Systems-on-a-Chip Design Using Reconfigurable Datapaths and Optimized Interconnection Networks Zhining Huang, Sharad Malik Electrical Engineering Department

More information

Data Wordlength Optimization for FPGA Synthesis

Data Wordlength Optimization for FPGA Synthesis Data Wordlength Optimization for FPGA Synthesis Nicolas HERVÉ, Daniel MÉNARD and Olivier SENTIEYS IRISA University of Rennes 6, rue de Kerampont 22300 Lannion, France {first-name}.{name}@irisa.fr Abstract

More information

FPGA-Based derivative module for bioimpedance signal

FPGA-Based derivative module for bioimpedance signal 16 FPGA-Based derivative module for bioimpedance signal Sofienne Mansouri1, Halima Mahjoubi2 and Ridha Ben Salah3, Biophysics Research unit (Faculty of Medicine, Sousse, TUNISIA), Tunisia Summary Input

More information

Hardware Software Codesign of Embedded Systems

Hardware Software Codesign of Embedded Systems Hardware Software Codesign of Embedded Systems Rabi Mahapatra Texas A&M University Today s topics Course Organization Introduction to HS-CODES Codesign Motivation Some Issues on Codesign of Embedded System

More information

Hardware, Software and Mechanical Cosimulation for Automotive Applications

Hardware, Software and Mechanical Cosimulation for Automotive Applications Hardware, Software and Mechanical Cosimulation for Automotive Applications P. Le Marrec, C.A. Valderrama, F. Hessel, A.A. Jerraya TIMA Laboratory 46 Avenue Felix Viallet 38031 Grenoble France fphilippe.lemarrec,

More information

FILTER SYNTHESIS USING FINE-GRAIN DATA-FLOW GRAPHS. Waqas Akram, Cirrus Logic Inc., Austin, Texas

FILTER SYNTHESIS USING FINE-GRAIN DATA-FLOW GRAPHS. Waqas Akram, Cirrus Logic Inc., Austin, Texas FILTER SYNTHESIS USING FINE-GRAIN DATA-FLOW GRAPHS Waqas Akram, Cirrus Logic Inc., Austin, Texas Abstract: This project is concerned with finding ways to synthesize hardware-efficient digital filters given

More information

EE382V: System-on-a-Chip (SoC) Design

EE382V: System-on-a-Chip (SoC) Design EE382V: System-on-a-Chip (SoC) Design Lecture 8 HW/SW Co-Design Sources: Prof. Margarida Jacome, UT Austin Andreas Gerstlauer Electrical and Computer Engineering University of Texas at Austin gerstl@ece.utexas.edu

More information

Co-synthesis and Accelerator based Embedded System Design

Co-synthesis and Accelerator based Embedded System Design Co-synthesis and Accelerator based Embedded System Design COE838: Embedded Computer System http://www.ee.ryerson.ca/~courses/coe838/ Dr. Gul N. Khan http://www.ee.ryerson.ca/~gnkhan Electrical and Computer

More information

Electronic System Level to Hw/Sw Design Flow

Electronic System Level to Hw/Sw Design Flow Electronic System Level to Hw/Sw Design Flow S. HUET P. BOMEL E. CASSEAU LESTER Laboratory LESTER Laboratory LESTER Laboratory UBS University UBS University UBS University BP 92116 BP 92116 BP 92116 56321

More information

The Design of Mixed Hardware/Software Systems

The Design of Mixed Hardware/Software Systems The Design of Mixed Hardware/Software Systems Jay K. Adams Synopsys, Inc. 700 East Middlefield Road Mountain View, CA 94043 jka@synopsys.com Donald E. Thomas Deptartment of Electrical and Computer Engineering

More information

System Level Design, a VHDL Based Approach.

System Level Design, a VHDL Based Approach. System Level Design, a VHDL Based Approach. Joris van den Hurk and Edwin Dilling Product Concept and Application Laboratory Eindhoven (PCALE) Philips Semiconductors, The Netherlands Abstract A hierarchical

More information

Datapath Allocation. Zoltan Baruch. Computer Science Department, Technical University of Cluj-Napoca

Datapath Allocation. Zoltan Baruch. Computer Science Department, Technical University of Cluj-Napoca Datapath Allocation Zoltan Baruch Computer Science Department, Technical University of Cluj-Napoca e-mail: baruch@utcluj.ro Abstract. The datapath allocation is one of the basic operations executed in

More information

Abstract A SCALABLE, PARALLEL, AND RECONFIGURABLE DATAPATH ARCHITECTURE

Abstract A SCALABLE, PARALLEL, AND RECONFIGURABLE DATAPATH ARCHITECTURE A SCALABLE, PARALLEL, AND RECONFIGURABLE DATAPATH ARCHITECTURE Reiner W. Hartenstein, Rainer Kress, Helmut Reinig University of Kaiserslautern Erwin-Schrödinger-Straße, D-67663 Kaiserslautern, Germany

More information

1 Introduction Data format converters (DFCs) are used to permute the data from one format to another in signal processing and image processing applica

1 Introduction Data format converters (DFCs) are used to permute the data from one format to another in signal processing and image processing applica A New Register Allocation Scheme for Low Power Data Format Converters Kala Srivatsan, Chaitali Chakrabarti Lori E. Lucke Department of Electrical Engineering Minnetronix, Inc. Arizona State University

More information

Concepts for Model Compilation in Hardware/Software Codesign

Concepts for Model Compilation in Hardware/Software Codesign Concepts for Model Compilation in Hardware/Software Codesign S. Schulz, and J.W. Rozenblit Dept. of Electrical and Computer Engineering The University of Arizona Tucson, AZ 85721 USA sschulz@ece.arizona.edu

More information

Exploration of Hardware/Software Design Space through a Codesign of Robot Arm Controller

Exploration of Hardware/Software Design Space through a Codesign of Robot Arm Controller Exploration of /Software Design Space through a Codesign of Robot Arm Mohamed Abid 1, Adel Changuel, Ahmed Jerraya Laboratoire TIMA/INPG, System-level Synthesis Group, 46 Av. Félix Viallet, 38031 Grenoble

More information

Synthesis of DSP Systems using Data Flow Graphs for Silicon Area Reduction

Synthesis of DSP Systems using Data Flow Graphs for Silicon Area Reduction Synthesis of DSP Systems using Data Flow Graphs for Silicon Area Reduction Rakhi S 1, PremanandaB.S 2, Mihir Narayan Mohanty 3 1 Atria Institute of Technology, 2 East Point College of Engineering &Technology,

More information

High Performance Computing. University questions with solution

High Performance Computing. University questions with solution High Performance Computing University questions with solution Q1) Explain the basic working principle of VLIW processor. (6 marks) The following points are basic working principle of VLIW processor. The

More information

Accelerating DSP Applications in Embedded Systems with a Coprocessor Data-Path

Accelerating DSP Applications in Embedded Systems with a Coprocessor Data-Path Accelerating DSP Applications in Embedded Systems with a Coprocessor Data-Path Michalis D. Galanis, Gregory Dimitroulakos, and Costas E. Goutis VLSI Design Laboratory, Electrical and Computer Engineering

More information

Area/Delay Estimation for Digital Signal Processor Cores

Area/Delay Estimation for Digital Signal Processor Cores Area/Delay Estimation for Digital Signal Processor Cores Yuichiro Miyaoka Yoshiharu Kataoka, Nozomu Togawa Masao Yanagisawa Tatsuo Ohtsuki Dept. of Electronics, Information and Communication Engineering,

More information

SpecC Methodology for High-Level Modeling

SpecC Methodology for High-Level Modeling EDP 2002 9 th IEEE/DATC Electronic Design Processes Workshop SpecC Methodology for High-Level Modeling Rainer Dömer Daniel D. Gajski Andreas Gerstlauer Center for Embedded Computer Systems Universitiy

More information

/$ IEEE

/$ IEEE IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 56, NO. 1, JANUARY 2009 81 Bit-Level Extrinsic Information Exchange Method for Double-Binary Turbo Codes Ji-Hoon Kim, Student Member,

More information

High-Level Synthesis

High-Level Synthesis High-Level Synthesis 1 High-Level Synthesis 1. Basic definition 2. A typical HLS process 3. Scheduling techniques 4. Allocation and binding techniques 5. Advanced issues High-Level Synthesis 2 Introduction

More information

Hardware-Software Codesign. 1. Introduction

Hardware-Software Codesign. 1. Introduction Hardware-Software Codesign 1. Introduction Lothar Thiele 1-1 Contents What is an Embedded System? Levels of Abstraction in Electronic System Design Typical Design Flow of Hardware-Software Systems 1-2

More information

Codesign Framework. Parts of this lecture are borrowed from lectures of Johan Lilius of TUCS and ASV/LL of UC Berkeley available in their web.

Codesign Framework. Parts of this lecture are borrowed from lectures of Johan Lilius of TUCS and ASV/LL of UC Berkeley available in their web. Codesign Framework Parts of this lecture are borrowed from lectures of Johan Lilius of TUCS and ASV/LL of UC Berkeley available in their web. Embedded Processor Types General Purpose Expensive, requires

More information

Ptolemy Seamlessly Supports Heterogeneous Design 5 of 5

Ptolemy Seamlessly Supports Heterogeneous Design 5 of 5 In summary, the key idea in the Ptolemy project is to mix models of computation, rather than trying to develop one, all-encompassing model. The rationale is that specialized models of computation are (1)

More information

A Top-down Hardware/Software Co-Simulation Method for Embedded Systems Based Upon a Component Logical Bus Architecture

A Top-down Hardware/Software Co-Simulation Method for Embedded Systems Based Upon a Component Logical Bus Architecture A Top-down / Co-Simulation Method for Embedded Systems Based Upon a Architecture Mitsuhiro YASUDA Barry SHACKLEFORD Fumio SUZUKI Katsuhiko SEO Hisao KOIZUMI Mitsubishi Electric Corporation, Hewlett-Packard

More information

Hardware/ Software Partitioning

Hardware/ Software Partitioning Hardware/ Software Partitioning Peter Marwedel TU Dortmund, Informatik 12 Germany Marwedel, 2003 Graphics: Alexandra Nolte, Gesine 2011 年 12 月 09 日 These slides use Microsoft clip arts. Microsoft copyright

More information

HW SW Partitioning. Reading. Hardware/software partitioning. Hardware/Software Codesign. CS4272: HW SW Codesign

HW SW Partitioning. Reading. Hardware/software partitioning. Hardware/Software Codesign. CS4272: HW SW Codesign CS4272: HW SW Codesign HW SW Partitioning Abhik Roychoudhury School of Computing National University of Singapore Reading Section 5.3 of textbook Embedded System Design Peter Marwedel Also must read Hardware/software

More information

A Complete Data Scheduler for Multi-Context Reconfigurable Architectures

A Complete Data Scheduler for Multi-Context Reconfigurable Architectures A Complete Data Scheduler for Multi-Context Reconfigurable Architectures M. Sanchez-Elez, M. Fernandez, R. Maestre, R. Hermida, N. Bagherzadeh, F. J. Kurdahi Departamento de Arquitectura de Computadores

More information

Design of a System-on-Chip Switched Network and its Design Support Λ

Design of a System-on-Chip Switched Network and its Design Support Λ Design of a System-on-Chip Switched Network and its Design Support Λ Daniel Wiklund y, Dake Liu Dept. of Electrical Engineering Linköping University S-581 83 Linköping, Sweden Abstract As the degree of

More information

Automated RTR Temporal Partitioning for Reconfigurable Embedded Real-Time System Design

Automated RTR Temporal Partitioning for Reconfigurable Embedded Real-Time System Design Automated RTR Temporal Partitioning for Reconfigurable Embedded Real-Time System Design C. Tanougast, Y. Berviller, P. Brunet and S. Weber L. I. E. N. Laboratoire d Instrumentation Electronique de Nancy

More information

ARITHMETIC operations based on residue number systems

ARITHMETIC operations based on residue number systems IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 53, NO. 2, FEBRUARY 2006 133 Improved Memoryless RNS Forward Converter Based on the Periodicity of Residues A. B. Premkumar, Senior Member,

More information

ISSN (Online), Volume 1, Special Issue 2(ICITET 15), March 2015 International Journal of Innovative Trends and Emerging Technologies

ISSN (Online), Volume 1, Special Issue 2(ICITET 15), March 2015 International Journal of Innovative Trends and Emerging Technologies VLSI IMPLEMENTATION OF HIGH PERFORMANCE DISTRIBUTED ARITHMETIC (DA) BASED ADAPTIVE FILTER WITH FAST CONVERGENCE FACTOR G. PARTHIBAN 1, P.SATHIYA 2 PG Student, VLSI Design, Department of ECE, Surya Group

More information

Fixed Point LMS Adaptive Filter with Low Adaptation Delay

Fixed Point LMS Adaptive Filter with Low Adaptation Delay Fixed Point LMS Adaptive Filter with Low Adaptation Delay INGUDAM CHITRASEN MEITEI Electronics and Communication Engineering Vel Tech Multitech Dr RR Dr SR Engg. College Chennai, India MR. P. BALAVENKATESHWARLU

More information

DUE to the high computational complexity and real-time

DUE to the high computational complexity and real-time IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 3, MARCH 2005 445 A Memory-Efficient Realization of Cyclic Convolution and Its Application to Discrete Cosine Transform Hun-Chen

More information

C- BASED RAPID PROTOTYPING FOR DIGITAL SIGNAL PROCESSING

C- BASED RAPID PROTOTYPING FOR DIGITAL SIGNAL PROCESSING C- BASED RAPID PROTOTYPING FOR DIGITAL SIGNAL PROCESSING Emmanuel Casseau, Bertrand Le Gal, Pierre Bomel, Christophe Jego, Sylvain Huet, Eric Martin To cite this version: Emmanuel Casseau, Bertrand Le

More information

Runtime Adaptation of Application Execution under Thermal and Power Constraints in Massively Parallel Processor Arrays

Runtime Adaptation of Application Execution under Thermal and Power Constraints in Massively Parallel Processor Arrays Runtime Adaptation of Application Execution under Thermal and Power Constraints in Massively Parallel Processor Arrays Éricles Sousa 1, Frank Hannig 1, Jürgen Teich 1, Qingqing Chen 2, and Ulf Schlichtmann

More information

Energy Aware Optimized Resource Allocation Using Buffer Based Data Flow In MPSOC Architecture

Energy Aware Optimized Resource Allocation Using Buffer Based Data Flow In MPSOC Architecture ISSN (Online) : 2319-8753 ISSN (Print) : 2347-6710 International Journal of Innovative Research in Science, Engineering and Technology Volume 3, Special Issue 3, March 2014 2014 International Conference

More information

Introduction to Electronic Design Automation. Model of Computation. Model of Computation. Model of Computation

Introduction to Electronic Design Automation. Model of Computation. Model of Computation. Model of Computation Introduction to Electronic Design Automation Model of Computation Jie-Hong Roland Jiang 江介宏 Department of Electrical Engineering National Taiwan University Spring 03 Model of Computation In system design,

More information

Scalable Performance Scheduling for Hardware-Software Cosynthesis

Scalable Performance Scheduling for Hardware-Software Cosynthesis Scalable Performance Scheduling for Hardware-Software Cosynthesis Thomas Benner, Rolf Ernst and Achim Österling Institut für Datenverarbeitungsanlagen Hans-Sommer-Str. 66 D 38106 Braunschweig, Germany

More information

Hardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University

Hardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University Hardware Design Environments Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University Outline Welcome to COE 405 Digital System Design Design Domains and Levels of Abstractions Synthesis

More information

THE orthogonal frequency-division multiplex (OFDM)

THE orthogonal frequency-division multiplex (OFDM) 26 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 57, NO. 1, JANUARY 2010 A Generalized Mixed-Radix Algorithm for Memory-Based FFT Processors Chen-Fong Hsiao, Yuan Chen, Member, IEEE,

More information

A VLSI Architecture for H.264/AVC Variable Block Size Motion Estimation

A VLSI Architecture for H.264/AVC Variable Block Size Motion Estimation Journal of Automation and Control Engineering Vol. 3, No. 1, February 20 A VLSI Architecture for H.264/AVC Variable Block Size Motion Estimation Dam. Minh Tung and Tran. Le Thang Dong Center of Electrical

More information

A New Approach to Execution Time Estimations in a Hardware/Software Codesign Environment

A New Approach to Execution Time Estimations in a Hardware/Software Codesign Environment A New Approach to Execution Time Estimations in a Hardware/Software Codesign Environment JAVIER RESANO, ELENA PEREZ, DANIEL MOZOS, HORTENSIA MECHA, JULIO SEPTIÉN Departamento de Arquitectura de Computadores

More information

Design and Implementation of CVNS Based Low Power 64-Bit Adder

Design and Implementation of CVNS Based Low Power 64-Bit Adder Design and Implementation of CVNS Based Low Power 64-Bit Adder Ch.Vijay Kumar Department of ECE Embedded Systems & VLSI Design Vishakhapatnam, India Sri.Sagara Pandu Department of ECE Embedded Systems

More information

Synthesis of Complicated Asynchronous Control Circuits Using Template Based Technique

Synthesis of Complicated Asynchronous Control Circuits Using Template Based Technique Synthesis of Complicated Asynchronous Control Circuits Using Template Based Technique Sufian Sudeng and Arthit Thongtak Abstract this paper proposes an approach for complicated asynchronous controller

More information

Rapid Prototyping System for Teaching Real-Time Digital Signal Processing

Rapid Prototyping System for Teaching Real-Time Digital Signal Processing IEEE TRANSACTIONS ON EDUCATION, VOL. 43, NO. 1, FEBRUARY 2000 19 Rapid Prototyping System for Teaching Real-Time Digital Signal Processing Woon-Seng Gan, Member, IEEE, Yong-Kim Chong, Wilson Gong, and

More information

MOST computations used in applications, such as multimedia

MOST computations used in applications, such as multimedia IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 13, NO. 9, SEPTEMBER 2005 1023 Pipelining With Common Operands for Power-Efficient Linear Systems Daehong Kim, Member, IEEE, Dongwan

More information

Effective Memory Access Optimization by Memory Delay Modeling, Memory Allocation, and Slack Time Management

Effective Memory Access Optimization by Memory Delay Modeling, Memory Allocation, and Slack Time Management International Journal of Computer Theory and Engineering, Vol., No., December 01 Effective Memory Optimization by Memory Delay Modeling, Memory Allocation, and Slack Time Management Sultan Daud Khan, Member,

More information

Karthik Narayanan, Santosh Madiraju EEL Embedded Systems Seminar 1/41 1

Karthik Narayanan, Santosh Madiraju EEL Embedded Systems Seminar 1/41 1 Karthik Narayanan, Santosh Madiraju EEL6935 - Embedded Systems Seminar 1/41 1 Efficient Search Space Exploration for HW-SW Partitioning Hardware/Software Codesign and System Synthesis, 2004. CODES + ISSS

More information

HW/SW Co-design. Design of Embedded Systems Jaap Hofstede Version 3, September 1999

HW/SW Co-design. Design of Embedded Systems Jaap Hofstede Version 3, September 1999 HW/SW Co-design Design of Embedded Systems Jaap Hofstede Version 3, September 1999 Embedded system Embedded Systems is a computer system (combination of hardware and software) is part of a larger system

More information

Integrating Communication Protocol Selection with Partitioning in Hardware/Software Codesign

Integrating Communication Protocol Selection with Partitioning in Hardware/Software Codesign Integrating Communication Protocol Selection with Partitioning in Hardware/Software Codesign Peter Voigt Knudsen and Jan Madsen Department of Information Technology, Technical University of Denmark pvk@it.dtu.dk,

More information

AVSynDEx Methodology For Fast Prototyping of Multi-C6x DSP Architectures

AVSynDEx Methodology For Fast Prototyping of Multi-C6x DSP Architectures AVSynDEx Methodology For Fast Prototyping of Multi-C6x DSP Architectures Jean-François NEZAN, Virginie FRESSE, Olivier DEFORGES, Michael RAULET CNRS UMR IETR (Institut en Electronique et Télecommunications

More information

Re-configurable VLIW processor for streaming data

Re-configurable VLIW processor for streaming data International Workshop NGNT 97 Re-configurable VLIW processor for streaming data V. Iossifov Studiengang Technische Informatik, FB Ingenieurwissenschaften 1, FHTW Berlin. G. Megson School of Computer Science,

More information

A Unified Model for Co-simulation and Co-synthesis of Mixed Hardware/Software Systems

A Unified Model for Co-simulation and Co-synthesis of Mixed Hardware/Software Systems A Unified Model for Co-simulation and Co-synthesis of Mixed Hardware/Software Systems C. A. Valderrama 1 A. Changuel P.V. Raghavan M. Abid 2 T. Ben Ismail A. A. Jerraya TIMA / INPG, System-Level Synthesis

More information

COE 561 Digital System Design & Synthesis Introduction

COE 561 Digital System Design & Synthesis Introduction 1 COE 561 Digital System Design & Synthesis Introduction Dr. Aiman H. El-Maleh Computer Engineering Department King Fahd University of Petroleum & Minerals Outline Course Topics Microelectronics Design

More information

THE EUROPEAN DESIGN AND TEST CONFERENCE 1995 Paris,France 6-9 March 1995

THE EUROPEAN DESIGN AND TEST CONFERENCE 1995 Paris,France 6-9 March 1995 THE EUROPEAN DESIGN AND TEST CONFERENCE 1995 Paris,France 6-9 March 1995 A UNIFIED MODEL FOR CO-SIMULATION AND CO-SYNTHESIS OF MIXED HARDWARE/SOFTWARE SYSTEMS Authors: C. A. Valderrama, A. Changuel, P.V.

More information

Low energy and High-performance Embedded Systems Design and Reconfigurable Architectures

Low energy and High-performance Embedded Systems Design and Reconfigurable Architectures Low energy and High-performance Embedded Systems Design and Reconfigurable Architectures Ass. Professor Dimitrios Soudris School of Electrical and Computer Eng., National Technical Univ. of Athens, Greece

More information

Software Synthesis Trade-offs in Dataflow Representations of DSP Applications

Software Synthesis Trade-offs in Dataflow Representations of DSP Applications in Dataflow Representations of DSP Applications Shuvra S. Bhattacharyya Department of Electrical and Computer Engineering, and Institute for Advanced Computer Studies University of Maryland, College Park

More information

Design Space Exploration in System Level Synthesis under Memory Constraints

Design Space Exploration in System Level Synthesis under Memory Constraints Design Space Exploration in System Level Synthesis under Constraints Radoslaw Szymanek and Krzysztof Kuchcinski Dept. of Computer and Information Science Linköping University Sweden radsz@ida.liu.se Abstract

More information

Incorporating the Controller Eects During Register Transfer Level. Synthesis. Champaka Ramachandran and Fadi J. Kurdahi

Incorporating the Controller Eects During Register Transfer Level. Synthesis. Champaka Ramachandran and Fadi J. Kurdahi Incorporating the Controller Eects During Register Transfer Level Synthesis Champaka Ramachandran and Fadi J. Kurdahi Department of Electrical & Computer Engineering, University of California, Irvine,

More information

Digital System Design

Digital System Design Digital System Design Analog time varying signals that can take on any value across a continuous range of voltage, current or other metric Digital signals are modeled with two states, 0 or 1 underneath

More information

All MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes

All MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes MSEE Curriculum All MSEE students are required to take the following two core courses: 3531-571 Linear systems 3531-507 Probability and Random Processes The course requirements for students majoring in

More information

FPGA based Design of Low Power Reconfigurable Router for Network on Chip (NoC)

FPGA based Design of Low Power Reconfigurable Router for Network on Chip (NoC) FPGA based Design of Low Power Reconfigurable Router for Network on Chip (NoC) D.Udhayasheela, pg student [Communication system],dept.ofece,,as-salam engineering and technology, N.MageshwariAssistant Professor

More information

A Low Power Asynchronous FPGA with Autonomous Fine Grain Power Gating and LEDR Encoding

A Low Power Asynchronous FPGA with Autonomous Fine Grain Power Gating and LEDR Encoding A Low Power Asynchronous FPGA with Autonomous Fine Grain Power Gating and LEDR Encoding N.Rajagopala krishnan, k.sivasuparamanyan, G.Ramadoss Abstract Field Programmable Gate Arrays (FPGAs) are widely

More information

Low Power and Memory Efficient FFT Architecture Using Modified CORDIC Algorithm

Low Power and Memory Efficient FFT Architecture Using Modified CORDIC Algorithm Low Power and Memory Efficient FFT Architecture Using Modified CORDIC Algorithm 1 A.Malashri, 2 C.Paramasivam 1 PG Student, Department of Electronics and Communication K S Rangasamy College Of Technology,

More information

Cosimulation of ITRON-Based Embedded Software with SystemC

Cosimulation of ITRON-Based Embedded Software with SystemC Cosimulation of ITRON-Based Embedded Software with SystemC Shin-ichiro Chikada, Shinya Honda, Hiroyuki Tomiyama, Hiroaki Takada Graduate School of Information Science, Nagoya University Information Technology

More information

DTNS: a Discrete Time Network Simulator for C/C++ Language Based Digital Hardware Simulations

DTNS: a Discrete Time Network Simulator for C/C++ Language Based Digital Hardware Simulations DTNS: a Discrete Time Network Simulator for C/C++ Language Based Digital Hardware Simulations KIMMO KUUSILINNA, JOUNI RIIHIMÄKI, TIMO HÄMÄLÄINEN, and JUKKA SAARINEN Digital and Computer Systems Laboratory

More information

SPARK: A Parallelizing High-Level Synthesis Framework

SPARK: A Parallelizing High-Level Synthesis Framework SPARK: A Parallelizing High-Level Synthesis Framework Sumit Gupta Rajesh Gupta, Nikil Dutt, Alex Nicolau Center for Embedded Computer Systems University of California, Irvine and San Diego http://www.cecs.uci.edu/~spark

More information

Linköping University Post Print. epuma: a novel embedded parallel DSP platform for predictable computing

Linköping University Post Print. epuma: a novel embedded parallel DSP platform for predictable computing Linköping University Post Print epuma: a novel embedded parallel DSP platform for predictable computing Jian Wang, Joar Sohl, Olof Kraigher and Dake Liu N.B.: When citing this work, cite the original article.

More information

Hardware Software Partitioning of Multifunction Systems

Hardware Software Partitioning of Multifunction Systems Hardware Software Partitioning of Multifunction Systems Abhijit Prasad Wangqi Qiu Rabi Mahapatra Department of Computer Science Texas A&M University College Station, TX 77843-3112 Email: {abhijitp,wangqiq,rabi}@cs.tamu.edu

More information

Reprogrammable Controller Design From High- Level Specification.

Reprogrammable Controller Design From High- Level Specification. Reprogrammable Controller Design From High- Level Specification. M. BENMOHAMMED 1, 1 Computer Science Department, University of Cne, 25000 Constantine, ALGERIE. Email : ibnmyahoo.fr M. BOURAHLA 2, and

More information

FIR Filter Synthesis Algorithms for Minimizing the Delay and the Number of Adders

FIR Filter Synthesis Algorithms for Minimizing the Delay and the Number of Adders 770 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 48, NO. 8, AUGUST 2001 FIR Filter Synthesis Algorithms for Minimizing the Delay and the Number of Adders Hyeong-Ju

More information

A framework for automatic generation of audio processing applications on a dual-core system

A framework for automatic generation of audio processing applications on a dual-core system A framework for automatic generation of audio processing applications on a dual-core system Etienne Cornu, Tina Soltani and Julie Johnson etienne_cornu@amis.com, tina_soltani@amis.com, julie_johnson@amis.com

More information

Controller Synthesis for Hardware Accelerator Design

Controller Synthesis for Hardware Accelerator Design ler Synthesis for Hardware Accelerator Design Jiang, Hongtu; Öwall, Viktor 2002 Link to publication Citation for published version (APA): Jiang, H., & Öwall, V. (2002). ler Synthesis for Hardware Accelerator

More information

ECE519 Advanced Operating Systems

ECE519 Advanced Operating Systems IT 540 Operating Systems ECE519 Advanced Operating Systems Prof. Dr. Hasan Hüseyin BALIK (10 th Week) (Advanced) Operating Systems 10. Multiprocessor, Multicore and Real-Time Scheduling 10. Outline Multiprocessor

More information

DESIGN AND PERFORMANCE ANALYSIS OF CARRY SELECT ADDER

DESIGN AND PERFORMANCE ANALYSIS OF CARRY SELECT ADDER DESIGN AND PERFORMANCE ANALYSIS OF CARRY SELECT ADDER Bhuvaneswaran.M 1, Elamathi.K 2 Assistant Professor, Muthayammal Engineering college, Rasipuram, Tamil Nadu, India 1 Assistant Professor, Muthayammal

More information

The Chinook Hardware/Software Co-Synthesis System

The Chinook Hardware/Software Co-Synthesis System The Chinook HardwareSoftware Co-Synthesis System Pai H. Chou Ross B. Ortega Gaetano Borriello Department of Computer Science & Engineering University of Washington Seattle, WA 98195-2350 Abstract Designers

More information

Interface Specification and Synthesis for VHDL Processes *

Interface Specification and Synthesis for VHDL Processes * Interface Specification and Synthesis for VHDL Processes * P. Gutberlet W. Rosenstiel 1 Forschungszentrum Informatik (FZI) Haid-und-Neu-Str. 10-14, D-76131 Karlsruhe, Germany 1 FZI and University of Tübingen

More information

THE focus of design effort on higher levels of abstraction,

THE focus of design effort on higher levels of abstraction, 84 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 6, NO. 1, MARCH 1998 SpecSyn: An Environment Supporting the Specify-Explore-Refine Paradigm for Hardware/Software System Design

More information

A New Decomposition of Boolean Functions

A New Decomposition of Boolean Functions A New Decomposition of Boolean Functions Elena Dubrova Electronic System Design Lab Department of Electronics Royal Institute of Technology Kista, Sweden elena@ele.kth.se Abstract This paper introduces

More information

Design Issues in Hardware/Software Co-Design

Design Issues in Hardware/Software Co-Design Volume-2, Issue-1, January-February, 2014, pp. 01-05, IASTER 2013 www.iaster.com, Online: 2347-6109, Print: 2348-0017 ABSTRACT Design Issues in Hardware/Software Co-Design R. Ganesh Sr. Asst. Professor,

More information

An Integrated Cosimulation Environment for Heterogeneous Systems Prototyping

An Integrated Cosimulation Environment for Heterogeneous Systems Prototyping An Integrated Cosimulation Environment for Heterogeneous Systems Prototyping Yongjoo Kim, Kyuseok Kim, Youngsoo Shin, Taekyoon Ahn, and Kiyoung Choi School of Electrical Engineering Seoul National University

More information

Digital Hardware-/Softwaresystems Specification

Digital Hardware-/Softwaresystems Specification Digital Hardware-/Softwaresystems Specification Seminar Architecture & Design Methods for Embedded Systems Summer Term 2006 University of Stuttgart Faculty of Computer Science, Electrical Engineering and

More information

Hardware/Software Co-Design/Co-Verification

Hardware/Software Co-Design/Co-Verification Hardware/Software Co-Design/Co-Verification Sungho Kang Yonsei University Outline Introduction Co-design Methodology Partitioning Scheduling Co-Simulation Systems Timed Co-simulation Multimedia Examples

More information

Unit 2: High-Level Synthesis

Unit 2: High-Level Synthesis Course contents Unit 2: High-Level Synthesis Hardware modeling Data flow Scheduling/allocation/assignment Reading Chapter 11 Unit 2 1 High-Level Synthesis (HLS) Hardware-description language (HDL) synthesis

More information

COMPLEX embedded systems with multiple processing

COMPLEX embedded systems with multiple processing IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 12, NO. 8, AUGUST 2004 793 Scheduling and Mapping in an Incremental Design Methodology for Distributed Real-Time Embedded Systems

More information

VERY large scale integration (VLSI) design for power

VERY large scale integration (VLSI) design for power IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 7, NO. 1, MARCH 1999 25 Short Papers Segmented Bus Design for Low-Power Systems J. Y. Chen, W. B. Jone, Member, IEEE, J. S. Wang,

More information

INTERFACE SYNTHESIS. process A. process A1 variable MEM : intarray ;. process A2. process A1 variable MEM : intarray ; procedure send_ch1(...

INTERFACE SYNTHESIS. process A. process A1 variable MEM : intarray ;. process A2. process A1 variable MEM : intarray ; procedure send_ch1(... Protocol Generation for Communication Channels y Sanjiv Narayan Daniel D. Gajski Viewlogic Systems Inc. Dept. of Computer Science Marlboro, MA 01752 Univ. of California, Irvine, CA 92717 Abstract System-level

More information

Optimization of Run-time Reconfigurable Embedded Systems

Optimization of Run-time Reconfigurable Embedded Systems Optimization of Run-time Reconfigurable Embedded Systems Michael Eisenring and Marco Platzner Swiss Federal Institute of Technology (ETH) Zurich, Switzerland {eisenring platzner}@tik.ee.ethz.ch Abstract.

More information

Chhattisgarh Swami Vivekanand Technical University, Bhilai

Chhattisgarh Swami Vivekanand Technical University, Bhilai Sr. No. Chhattisgarh Swami Vivekanand Technical University, Bhilai SCHEME OF MASTER OF TECHNOLOGY Electronics & Telecommunication Engineering (VLSI & Embedded System Design) Board Of Studies Code M. Tech.

More information

Software-Level Power Estimation

Software-Level Power Estimation Chapter 3 Software-Level Power Estimation The main goal of this chapter is to define a methodology for the static and dynamic estimation of the power consumption at the software-level to be integrated

More information

Tradeoff Analysis and Architecture Design of a Hybrid Hardware/Software Sorter

Tradeoff Analysis and Architecture Design of a Hybrid Hardware/Software Sorter Tradeoff Analysis and Architecture Design of a Hybrid Hardware/Software Sorter M. Bednara, O. Beyer, J. Teich, R. Wanka Paderborn University D-33095 Paderborn, Germany bednara,beyer,teich @date.upb.de,

More information

Reduction of Periodic Broadcast Resource Requirements with Proxy Caching

Reduction of Periodic Broadcast Resource Requirements with Proxy Caching Reduction of Periodic Broadcast Resource Requirements with Proxy Caching Ewa Kusmierek and David H.C. Du Digital Technology Center and Department of Computer Science and Engineering University of Minnesota

More information

Hardware-Software Codesign. 1. Introduction

Hardware-Software Codesign. 1. Introduction Hardware-Software Codesign 1. Introduction Lothar Thiele 1-1 Contents What is an Embedded System? Levels of Abstraction in Electronic System Design Typical Design Flow of Hardware-Software Systems 1-2

More information

Carry-Free Radix-2 Subtractive Division Algorithm and Implementation of the Divider

Carry-Free Radix-2 Subtractive Division Algorithm and Implementation of the Divider Tamkang Journal of Science and Engineering, Vol. 3, No., pp. 29-255 (2000) 29 Carry-Free Radix-2 Subtractive Division Algorithm and Implementation of the Divider Jen-Shiun Chiang, Hung-Da Chung and Min-Show

More information