Frantz LOHIER 1,2, Lionel LACASSAGNE 1,2, Pr. Patrick GARDA 2
|
|
- Paul Walker
- 5 years ago
- Views:
Transcription
1 A New Mehodology o Opimize DMA Daa Caching: Applicaion owards he Real-ime Execuion of an MRF-based Moion Deecion Algorihm on a muli-processor DSP Franz LOHER 1,2, Lionel LACASSAGNE 1,2, Pr. Parick GARDA 2 franz@lohier.com, lionel@lis.jussieu.fr, garda@lis.jussieu.fr 1 Elecronique, nformaique, Applicaions (EA) Burospace, Ba. 4, Roue de Gisy, Bièvres Cedex, France ABSTRACT A novel approach o opimise daa caching based on a muli-dimensional DMA is used o implemen a robus moion deecion algorihm. Qualiaively, we demonsrae he adequacy of his generic approach owards complex low-level image algorihms under real-ime consrains. Quaniaively, he C80 implemenaion improves by 4 former published performance for his algorihm. 1. nroducion 1.1 A Novel approach o opimize 2D /O sreams While analyzing he recen archiecural evoluions, i appears ha on he one hand : RSC and DSP cores are converging owards combined VLW/SMD archiecures ; DMA co-processors now sand as he only major archiecural difference beween he RSC and DSP world. They offer more predicabiliy a he cos of a more difficul programming sage (hey feaure selfmodifying parameers and muli-dimensions suppor as in [4]). Curren programming mehodological rends, on he oher hand, are sill limied : 2 Laboraoire des nsrumens e Sysèmes (LS) Universié Pierre e Marie Curie (UPMC), 4 place Jussieu B.C. 252, Paris Cedex 05, France DSP developmen ools are lacking whereas implemenaion is raising in complexiy (SMD/sofware pipelining, C8x s VLW operaions, advanced DMAs); DSP flexible simulaion/implemenaion plaform (such as MaLab/Simulink, Polemy or Hypercepion) are no suied for 2D processing. For hese applicaions, he 1D synchronous daa flow (SDF) represenaion domain seems he mos maure ; The code generaed from hese plaforms isn opimal and adding new processing kernels is ofen lacking a flexible re-usable framework. To ackle hese issues, we recenly inroduced a SDF oriened programming mehodology [2][3]. seeks he enhancemen of daa localiy by chaining he execuion of processing operaors (i.e nodes) hence minimising he global amoun of parallel DMA ransfers. This approach grounds on emplaes which se up a generic framework o expand node s libraries and compose complex processing chains dynamically. Templaes derive from he srucuring elemen required/produced by algorihmic operaors in he sense ha hey also gaher node s implemenaion consrains. These addiional mulipliciy consrains relae o opimizaion echniques such as he use of SMD operaion, loop
2 unrolling or sofware pipelining [1][3]. These echniques maximise he usage of processor s resources hrough he execuion of several loop ieraions in parallel. n a muli-processor conex, he ulimae goal of our approach is o se up all he synchronous DMA requess from a generic chain s descripion, enhancing daa localiy while mainaining insrucion caching low. Based on his mehodology we have implemened a complee low-level image processing library (more han 60 nodes) on he C80. This library feaures dynamic parsing of chains descripion (nodes emplae, image sizes, daa cache sizes, number of processors), auomaic DMA requess generaion and synchronisaion wih processing. The design space isn auomaically explored owards an opimal use of heerogeneous compuaional resources and communicaion channels as wih plaforms such as he Syndex generic approach [7]. nsead, pariioning on homogenous resource is user-guided bu reconfigurable a run-ime and since i relies on opimized 2D /O sreams (in erms of daa localiy enhancemen combined wih advanced opimizing implemenaion echniques), i improves he parallelizaion efficiency. 1.2 The moion deecion algorihm To illusrae he use of our library, we describe he implemenaion of he robus Markov Random Fields (MRF) based moion deecion algorihm proposed in [6]. Saring from an image difference, he CM (eraed Condiional Mode algorihm) is used o compue he resuling binary label picure in an ieraive manner. acs as a low-pass moion filer o locally minimize a spaioemporal energy model. eraions are performed a pixel level ( sie recursive ) and he suggesed number of ieraions is 4. npu(+1) AbsDiff O(+1) npu() x x 2 (Thres.) σ 2 (+1) Buffering (delay) 8 bis daa sream 1 bi daa sream Single in/ou value E(+1) sep1 (4PPs: 6ms) O() Alhough his algorihm is sub-opimal, i nowadays involves a challenging compuaional load when seeking real-ime (RT) processing on large images. Also, for his algorihm, he RT performance sand as an imporan crierion owards a good deecion efficiency (especially when fas moion consiue he sequence). A complee descripion of he approach is deailed in [2]. As a synopsis of i, a pre-possessing phase merges he absolue difference of 2 images, he variance of i and he binary hresholding of he absolue daa (he iniial labels E +1 ). This phase is followed by he CM regularizing algorihm as shown in figure The archiecural mapping The C8x has 1 general purpose RSC processor (MP) and 4 advanced SMD/VLW DSPs (or PPs for Parallel Processors) [4]. This suies he processorfarm model where he MP synchronizes he E() σ 2 () E(-1) C M (Num. er. ) E() sep2 (4PPs, 4 passes: 74ms) FG. 1 : Synopsis of he MRF algorihm 8 bis binrarised resul
3 PPs represening he bulk processing power. The pre-processing and he CM passes require jus 2 image scans. For each scan, we use an SPMD pariioning scheme. The image is spli among he 4 PPs and since each sub-region doesn fi in he PP s general purpose inernal daa memory, he daa are brough using muliple DMA requess while double buffering. 2.1 mplemening he pre-processing The pre-processing sage is done in a single pass hanks o a chain connecing several nodes hrough heir corresponding emplaes. Templaes' geomery are mosly described by 4 parameers. w h corresponds o he minimum amoun of daa ha mus be presen in inernal memory. This quaniy ofen exceeds he surface of he operaors' srucuring elemen because we inegrae size requiremens ha raise while implemening generic opimizaion echniques owards an efficien algorihmical mapping ono hardware resources. Here, he goal is o lower he developmen burden and he granulariy of each node by assuming we have a minimum amoun of daa ha maches he number of parallel ieraions processed in one occurrence of he nodes' loop kernel. Following he same goals, we also impose mulipliciy consrains on he number of addiional daa ha can be presen in inernal memory for he horizonal and verical direcion and described hanks o parameers w s and h s respecively. These emplaes allow he composing of complex processing chains which, ogeher wih he reduced granulariy of nodes, allows o enhance daa localiy and reduce parallel DMA ransfer as well as insrucion caching owards improved realime performance. Each node has a synchronized inpu and oupu emplaes whose parameers are deailed on figure 2. npu(+1) npu() Exernal buffer nernal processing chain (1) AbsDiff 2 1,1,1 6 1,2,1 16 2,8,1 Templae parameers: w h, w s, h s min. size nernal (inpu) emplae geomery sep h s (2) x (3) x 2 (4) (Thres.) nernal (oupu) emplae geomery 2,1,2,1 O(+1) (8 bis daa) E(+1) (1 bi daa) Processing node Templae geomery Nex and since he processed images do no fi in inernal memory, we need o esimae and minimise he number of all inpu and oupu parallel DMA requess beween each synchronized execuion of he chain. This is based on he exac surface of daa we can process and depends on he overall size consrains ha are propagaed along each pah of he chain ha connecs o an exernal buffer. Propagaion is based on emplae parameers and he solving of simple and independen diophanine equaions (for he ver. and horz. direcion). These equaions appear as we ry o synchronize he amoun of daa produced by a node wih he number of daa consumed by he nex node in he chain. This synchronizaion mechanism is depiced wih figure 3. Recursively, he gained synchronizaion consans (which appear has 2 congruences: β ο φ(φ) e γ ο τ( Γ)) are propagaed beween he oupu and inpu of each node unil he leading node of each pah is reached. This procedure enables he merging of he called "virualemplae" which feaures new parameers w s FG. 2 : The pre-processing chain h w
4 Synchronisaion consrains β O φ ( Φ ) 0(1) ()/(+)1 (16,8) (2,1) Abs Diff O β = β O γ = γ Thres ( w, w ) = ( w ( h, h ) = ( h s s + w + h φ, w τ, h Γ Φ ) ) γ O τ ( Γ) 1(1) β Ι/Ο, γ Ι/Ο : number of horizonal/verical emplae ha fi in he inernal caching buffer of size S-w' h' O O O w + ws β = w + ws β O O O h + hs γ = h + hs γ gahering all he size consrains (w ' h ',w s ',h s '). Then, according o he original buffer size, we esimae he combined number of horizonal (β) and verical (γ) virual emplaes we can cache in an inernal memory space of S-w 'xh ' byes. A complee descripion of his procedure can be found in [2]. As a synopsis of our approach, able 1 shows (β,γ ) couples we gain while applying his procedure on he firs sep of he moion deecion algorihm wih W=H=256 and S=2048 for all he inpu and oupu buffers. Min(β) and Min(γ) are hen used o synchronize all he nodes as well as o pariion daa among all he processors (in a SPMD fashion). The subsequen ransparen and dynamic programming of DMA requess allows o synchronize all inpu and oupu ransfers (for all he processors) owards sofware managed daa caches. TAB. 1 : Generaing & synchronizing DMA requess Pahs Virual emplae (w, w s,h,h s) AbsDiff Thres 16,8,2, AbsDiff x 16,8,1, AbsDiff x 2 16,8,1, O Thres AbsDiff 2,1,2, Mulidimensional suppor allows he DMA o address any region of ineres and is arihmeic capabiliies permis o implemen he double buffering scheme wih minimum processors involvemen. This grealy favors FG 3. Geing he virual emplae β γ insrucion cache coherency and hence, performance. Wih he combined use of our mehodology and well known generic opimizaion echniques o implemen he nodes in assembly language (sofware pipelining, SMD operaions), he preprocessing phase achieves 6 ms for a image wih 4 PPs working in parallel a 40 Mhz. 2.2 mplemening he CM node Thanks o he dynamic infrasrucure of our C80 library implemening our generic DMA caching mehodology, he sysem is reconfigured a runime o run he CM. This sep is implemened as a single-node chain. The same /O generaing/opimizing algorihm we described is used bu no deailed here. nsead, he various involved emplaes are shown in Figure 4. To lower he number of DMA requess required, some exernal buffer s daa are organized sequenially. This allows o define he CM node as diadic (feauring jus 2 inpu physical buffers). Also, here is no synchronizaion regarding he processed daa ha are shared beween 2 processors and despie he exising daa dependency inroduced by he sie recursive updaes. This simplificaion has very lile impac on he moion deecion efficiency and permis SPMD pariioning wih maximum performance. The CM node is wrien in VLW algebraic assembly language. We ook advanage of
5 Single inpu DMA reques E(-1) 8 bis binrarised resul O() 32 3,32,1 4 3,4,1 4 3,4,1 4 ier. CM 32 1,32,1 4 1,4,1 E() 4 1,4,1 E() Single oupu DMA reques FG. 4 : he CM single-node chain E(+1) he new code compacor (ppca) deailed in [5] o auomaically gaher assembly operaions ino 64 bis VLW insrucions. Regiser allocaion was also done by ppca ha efficienly compaced 313 operaions ino 199 VLW insrucions. Since he core of he node requires more regiser resources han available (each PP feaures 8 daa regisers and 2 ses of 5 address and 3 index regisers), we manually insered regiser spilling operaions based on ppca feedback logs. We inroduced he use of 3 LUTs (iniialised by he MP) o fasen calculaion whereas he sie recursive version of he CM algorihm prevened us from using he SMD capabiliies of he ALU (bu his version of he CM demonsraes faser converging). The core of he kernel requires 40 VLW insrucions per pixel which is 4 imes faser han wha PP s opimising compiler produces on a C version of he same algorihm. On images, a 40 Mhz, we measured 74 ms for he 4 CM passes which is very close o he opimisic opimum: (num. pass)/(40mhz 4(num. PP))= 65 ms. 3. Conclusions This paper inroduces wo imporan resuls. Qualiaively, we demonsrae ha our mehodology is generic enough o cope wih a complex low-level image algorihm. is also flexible enough o be dynamic and independen of he image size and number of processors. Mos imporanly, wih respec o generic opimizaion echniques, i opimises 2D /O sreams and allows good speed-up owards RT performance. Quaniaively and hanks o he framework of our general c80 image processing library, we gain a speed facor of 4 wih respec o previously published processing duraion for he same algorihm [6]. When considering he 60 Mhz version of he device, we can increase he processing rae up o 18 images per second (i/s) for images. The use of more efficien memory, like synchronous dynamic RAM (we used DRAM), would even furher increase ha rae. Pre-prop.& CM LS-UPMC/EA LS-NPG 128x128 - DSP 96002: 15 i/s Cnaps 256 PEs: 10 i/s 256x Mhz C80: 12 i/s - REFERENCES [1] F. Lohier, L. Lacassagne, Pr. P. Garda, Programming Technique for Real Time Sofware mplemenaion of Opimal Edge Deecors. Proc. of DSPWORLD 98, Spring Conference [2] F. Lohier, L. Lacassagne, Pr. P. Garda, A Generic Mehodology for he Sofware Managing of Caches in Muli-Processors DSP Archiecures. EEE n. Conference on Acousics, Speech and Signal Processing. CASSP 99. [3] Aricles [1]&[2] are online a [4] TMS320C8X: Sysem-Level Synopsis, Texas nsrumens SPRU113b. [5] Jihong Kim, Graham Shor, Performance Evaluaion of Regiser Allocaor for he Advanced DSP of TMS320C80. Proc. of CASSP 98 [6] A. Caplier, F. Luhon, C. Dumonier, Real ime mplemenaions of an MRF-based Moion Deecion Algorihm, Journal of Real Time maging [7] Y. Sorel, Massively Parallel Compuing Sysems wih RT Consrains : The Algorihm Archiecure Adequaion Mehodology, Proc. Massively Parallel Compuing Sysems schia aly, Mai 1994.
Y. Tsiatouhas. VLSI Systems and Computer Architecture Lab
CMOS INEGRAED CIRCUI DESIGN ECHNIQUES Universiy of Ioannina Clocking Schemes Dep. of Compuer Science and Engineering Y. siaouhas CMOS Inegraed Circui Design echniques Overview 1. Jier Skew hroughpu Laency
More informationCS 152 Computer Architecture and Engineering. Lecture 7 - Memory Hierarchy-II
CS 152 Compuer Archiecure and Engineering Lecure 7 - Memory Hierarchy-II Krse Asanovic Elecrical Engineering and Compuer Sciences Universiy of California a Berkeley hp://www.eecs.berkeley.edu/~krse hp://ins.eecs.berkeley.edu/~cs152
More informationMORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES
MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES B. MARCOTEGUI and F. MEYER Ecole des Mines de Paris, Cenre de Morphologie Mahémaique, 35, rue Sain-Honoré, F 77305 Fonainebleau Cedex, France Absrac. In image
More informationSTEREO PLANE MATCHING TECHNIQUE
STEREO PLANE MATCHING TECHNIQUE Commission III KEY WORDS: Sereo Maching, Surface Modeling, Projecive Transformaion, Homography ABSTRACT: This paper presens a new ype of sereo maching algorihm called Sereo
More informationOptimal Crane Scheduling
Opimal Crane Scheduling Samid Hoda, John Hooker Laife Genc Kaya, Ben Peerson Carnegie Mellon Universiy Iiro Harjunkoski ABB Corporae Research EWO - 13 November 2007 1/16 Problem Track-mouned cranes move
More informationNetwork management and QoS provisioning - QoS in Frame Relay. . packet switching with virtual circuit service (virtual circuits are bidirectional);
QoS in Frame Relay Frame relay characerisics are:. packe swiching wih virual circui service (virual circuis are bidirecional);. labels are called DLCI (Daa Link Connecion Idenifier);. for connecion is
More informationImplementing Ray Casting in Tetrahedral Meshes with Programmable Graphics Hardware (Technical Report)
Implemening Ray Casing in Terahedral Meshes wih Programmable Graphics Hardware (Technical Repor) Marin Kraus, Thomas Erl March 28, 2002 1 Inroducion Alhough cell-projecion, e.g., [3, 2], and resampling,
More informationSimple Network Management Based on PHP and SNMP
Simple Nework Managemen Based on PHP and SNMP Krasimir Trichkov, Elisavea Trichkova bsrac: This paper aims o presen simple mehod for nework managemen based on SNMP - managemen of Cisco rouer. The paper
More informationFIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS
FIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS Mohammed A. Aseeri and M. I. Sobhy Deparmen of Elecronics, The Universiy of Ken a Canerbury Canerbury, Ken, CT2
More informationOutline. EECS Components and Design Techniques for Digital Systems. Lec 06 Using FSMs Review: Typical Controller: state
Ouline EECS 5 - Componens and Design Techniques for Digial Sysems Lec 6 Using FSMs 9-3-7 Review FSMs Mapping o FPGAs Typical uses of FSMs Synchronous Seq. Circuis safe composiion Timing FSMs in verilog
More informationA time-space consistency solution for hardware-in-the-loop simulation system
Inernaional Conference on Advanced Elecronic Science and Technology (AEST 206) A ime-space consisency soluion for hardware-in-he-loop simulaion sysem Zexin Jiang a Elecric Power Research Insiue of Guangdong
More informationChapter 4 Sequential Instructions
Chaper 4 Sequenial Insrucions The sequenial insrucions of FBs-PLC shown in his chaper are also lised in secion 3.. Please refer o Chaper, "PLC Ladder diagram and he Coding rules of Mnemonic insrucion",
More informationPART 1 REFERENCE INFORMATION CONTROL DATA 6400 SYSTEMS CENTRAL PROCESSOR MONITOR
. ~ PART 1 c 0 \,).,,.,, REFERENCE NFORMATON CONTROL DATA 6400 SYSTEMS CENTRAL PROCESSOR MONTOR n CONTROL DATA 6400 Compuer Sysems, sysem funcions are normally handled by he Monior locaed in a Peripheral
More informationCAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL
CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL Klečka Jan Docoral Degree Programme (1), FEEC BUT E-mail: xkleck01@sud.feec.vubr.cz Supervised by: Horák Karel E-mail: horak@feec.vubr.cz
More informationUser Adjustable Process Scheduling Mechanism for a Multiprocessor Embedded System
Proceedings of he 6h WSEAS Inernaional Conference on Applied Compuer Science, Tenerife, Canary Islands, Spain, December 16-18, 2006 346 User Adjusable Process Scheduling Mechanism for a Muliprocessor Embedded
More informationScheduling. Scheduling. EDA421/DIT171 - Parallel and Distributed Real-Time Systems, Chalmers/GU, 2011/2012 Lecture #4 Updated March 16, 2012
EDA421/DIT171 - Parallel and Disribued Real-Time Sysems, Chalmers/GU, 2011/2012 Lecure #4 Updaed March 16, 2012 Aemps o mee applicaion consrains should be done in a proacive way hrough scheduling. Schedule
More informationQuick Verification of Concurrent Programs by Iteratively Relaxed Scheduling
Quick Verificaion of Concurren Programs by Ieraively Relaxed Scheduling Parick Mezler, Habib Saissi, Péer Bokor, Neeraj Suri Technische Univerisä Darmsad, Germany {mezler, saissi, pbokor, suri}@deeds.informaik.u-darmsad.de
More informationSam knows that his MP3 player has 40% of its battery life left and that the battery charges by an additional 12 percentage points every 15 minutes.
8.F Baery Charging Task Sam wans o ake his MP3 player and his video game player on a car rip. An hour before hey plan o leave, he realized ha he forgo o charge he baeries las nigh. A ha poin, he plugged
More informationImage segmentation. Motivation. Objective. Definitions. A classification of segmentation techniques. Assumptions for thresholding
Moivaion Image segmenaion Which pixels belong o he same objec in an image/video sequence? (spaial segmenaion) Which frames belong o he same video sho? (emporal segmenaion) Which frames belong o he same
More informationThe Impact of Product Development on the Lifecycle of Defects
The Impac of Produc Developmen on he Lifecycle of Rudolf Ramler Sofware Compeence Cener Hagenberg Sofware Park 21 A-4232 Hagenberg, Ausria +43 7236 3343 872 rudolf.ramler@scch.a ABSTRACT This paper invesigaes
More informationLess Pessimistic Worst-Case Delay Analysis for Packet-Switched Networks
Less Pessimisic Wors-Case Delay Analysis for Packe-Swiched Neworks Maias Wecksén Cenre for Research on Embedded Sysems P O Box 823 SE-31 18 Halmsad maias.wecksen@hh.se Magnus Jonsson Cenre for Research
More informationShortest Path Algorithms. Lecture I: Shortest Path Algorithms. Example. Graphs and Matrices. Setting: Dr Kieran T. Herley.
Shores Pah Algorihms Background Seing: Lecure I: Shores Pah Algorihms Dr Kieran T. Herle Deparmen of Compuer Science Universi College Cork Ocober 201 direced graph, real edge weighs Le he lengh of a pah
More informationLet s get physical - EDA Tools for Mobility
Le s ge physical - EDA Tools for Mobiliy Aging and Reliabiliy Communicaion Mobile and Green Mobiliy - Smar and Safe Frank Oppenheimer OFFIS Insiue for Informaion Technology OFFIS a a glance Applicaion-oriened
More informationCENG 477 Introduction to Computer Graphics. Modeling Transformations
CENG 477 Inroducion o Compuer Graphics Modeling Transformaions Modeling Transformaions Model coordinaes o World coordinaes: Model coordinaes: All shapes wih heir local coordinaes and sies. world World
More information4. Minimax and planning problems
CS/ECE/ISyE 524 Inroducion o Opimizaion Spring 2017 18 4. Minima and planning problems ˆ Opimizing piecewise linear funcions ˆ Minima problems ˆ Eample: Chebyshev cener ˆ Muli-period planning problems
More informationAn Adaptive Spatial Depth Filter for 3D Rendering IP
JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, VOL.3, NO. 4, DECEMBER, 23 175 An Adapive Spaial Deph Filer for 3D Rendering IP Chang-Hyo Yu and Lee-Sup Kim Absrac In his paper, we presen a new mehod
More informationComputer representations of piecewise
Edior: Gabriel Taubin Inroducion o Geomeric Processing hrough Opimizaion Gabriel Taubin Brown Universiy Compuer represenaions o piecewise smooh suraces have become vial echnologies in areas ranging rom
More informationA GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER
A GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER ABSTRACT Modern graphics cards for compuers, and especially heir graphics processing unis (GPUs), are designed for fas rendering of graphics.
More informationTest - Accredited Configuration Engineer (ACE) Exam - PAN-OS 6.0 Version
Tes - Accredied Configuraion Engineer (ACE) Exam - PAN-OS 6.0 Version ACE Exam Quesion 1 of 50. Which of he following saemens is NOT abou Palo Alo Neworks firewalls? Sysem defauls may be resored by performing
More informationCOMP26120: Algorithms and Imperative Programming
COMP26120 ecure C3 1/48 COMP26120: Algorihms and Imperaive Programming ecure C3: C - Recursive Daa Srucures Pee Jinks School of Compuer Science, Universiy of Mancheser Auumn 2011 COMP26120 ecure C3 2/48
More informationAutomatic Calculation of Coverage Profiles for Coverage-based Testing
Auomaic Calculaion of Coverage Profiles for Coverage-based Tesing Raimund Kirner 1 and Waler Haas 1 Vienna Universiy of Technology, Insiue of Compuer Engineering, Vienna, Ausria, raimund@vmars.uwien.ac.a
More informationCOSC 3213: Computer Networks I Chapter 6 Handout # 7
COSC 3213: Compuer Neworks I Chaper 6 Handou # 7 Insrucor: Dr. Marvin Mandelbaum Deparmen of Compuer Science York Universiy F05 Secion A Medium Access Conrol (MAC) Topics: 1. Muliple Access Communicaions:
More informationQuantitative macro models feature an infinite number of periods A more realistic (?) view of time
INFINIE-HORIZON CONSUMPION-SAVINGS MODEL SEPEMBER, Inroducion BASICS Quaniaive macro models feaure an infinie number of periods A more realisic (?) view of ime Infinie number of periods A meaphor for many
More informationA GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER
A GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER Gusaf Hendeby, Jeroen D. Hol, Rickard Karlsson, Fredrik Gusafsson Deparmen of Elecrical Engineering Auomaic Conrol Linköping Universiy,
More informationCoded Caching with Multiple File Requests
Coded Caching wih Muliple File Requess Yi-Peng Wei Sennur Ulukus Deparmen of Elecrical and Compuer Engineering Universiy of Maryland College Park, MD 20742 ypwei@umd.edu ulukus@umd.edu Absrac We sudy a
More informationAnalysis of Various Types of Bugs in the Object Oriented Java Script Language Coding
Indian Journal of Science and Technology, Vol 8(21), DOI: 10.17485/ijs/2015/v8i21/69958, Sepember 2015 ISSN (Prin) : 0974-6846 ISSN (Online) : 0974-5645 Analysis of Various Types of Bugs in he Objec Oriened
More informationMOBILE COMPUTING 3/18/18. Wi-Fi IEEE. CSE 40814/60814 Spring 2018
MOBILE COMPUTING CSE 40814/60814 Spring 2018 Wi-Fi Wi-Fi: name is NOT an abbreviaion play on Hi-Fi (high fideliy) Wireless Local Area Nework (WLAN) echnology WLAN and Wi-Fi ofen used synonymous Typically
More informationMOBILE COMPUTING. Wi-Fi 9/20/15. CSE 40814/60814 Fall Wi-Fi:
MOBILE COMPUTING CSE 40814/60814 Fall 2015 Wi-Fi Wi-Fi: name is NOT an abbreviaion play on Hi-Fi (high fideliy) Wireless Local Area Nework (WLAN) echnology WLAN and Wi-Fi ofen used synonymous Typically
More informationCombinatorial Optimization for Embedded System Design. Luca Benini
Combinaorial Opimizaion for Embedded Sysem Design Luca Benini Work in cooperaion wih Michela Milano s group Embedded Sysems A rough definiion Any compuing sysem which is no a compuer Large variey of devices
More informationRestorable Dynamic Quality of Service Routing
QOS ROUTING Resorable Dynamic Qualiy of Service Rouing Murali Kodialam and T. V. Lakshman, Lucen Technologies ABSTRACT The focus of qualiy-of-service rouing has been on he rouing of a single pah saisfying
More informationMOTION DETECTORS GRAPH MATCHING LAB PRE-LAB QUESTIONS
NME: TE: LOK: MOTION ETETORS GRPH MTHING L PRE-L QUESTIONS 1. Read he insrucions, and answer he following quesions. Make sure you resae he quesion so I don hae o read he quesion o undersand he answer..
More informationFast Real-Time LIDAR Processing on FPGAs
Fas Real-Time LIDAR Processing on FPGAs K. Shih, A. Balachandran, K. Nagarajan, B. Holland, C. Slaon, A. George NSF Cener for High-Performance Reconfigurable Compuing (CHREC) Deparmen of Elecrical and
More informationChapter 8 LOCATION SERVICES
Disribued Compuing Group Chaper 8 LOCATION SERVICES Mobile Compuing Winer 2005 / 2006 Overview Mobile IP Moivaion Daa ransfer Encapsulaion Locaion Services & Rouing Classificaion of locaion services Home
More informationSTRING DESCRIPTIONS OF DATA FOR DISPLAY*
SLAC-PUB-383 January 1968 STRING DESCRIPTIONS OF DATA FOR DISPLAY* J. E. George and W. F. Miller Compuer Science Deparmen and Sanford Linear Acceleraor Cener Sanford Universiy Sanford, California Absrac
More informationsource managemen, naming, proecion, and service provisions. This paper concenraes on he basic processor scheduling aspecs of resource managemen. 2 The
Virual Compuers A New Paradigm for Disribued Operaing Sysems Banu Ozden y Aaron J. Goldberg Avi Silberschaz z 600 Mounain Ave. AT&T Bell Laboraories Murray Hill, NJ 07974 Absrac The virual compuers (VC)
More informationVideo Content Description Using Fuzzy Spatio-Temporal Relations
Proceedings of he 4s Hawaii Inernaional Conference on Sysem Sciences - 008 Video Conen Descripion Using Fuzzy Spaio-Temporal Relaions rchana M. Rajurkar *, R.C. Joshi and Sananu Chaudhary 3 Dep of Compuer
More informationWeb System for the Remote Control and Execution of an IEC Application
Web Sysem for he Remoe Conrol and Execuion of an IEC 61499 Applicaion Oana ROHAT, Dan POPESCU Faculy of Auomaion and Compuer Science, Poliehnica Universiy, Splaiul Independenței 313, Bucureși, 060042,
More informationOptimizing the Processing Performance of a Smart DMA Controller for LTE Terminals
Opimizing he Processing Performance of a Smar DMA Conroller for LTE Terminals David Szczesny, Sebasian Hessel, Shadi Traboulsi, Aila Bilgic Insiue for Inegraed Sysems, Ruhr-Universiä Bochum D-78 Bochum,
More informationLearning in Games via Opponent Strategy Estimation and Policy Search
Learning in Games via Opponen Sraegy Esimaion and Policy Search Yavar Naddaf Deparmen of Compuer Science Universiy of Briish Columbia Vancouver, BC yavar@naddaf.name Nando de Freias (Supervisor) Deparmen
More informationNEWTON S SECOND LAW OF MOTION
Course and Secion Dae Names NEWTON S SECOND LAW OF MOTION The acceleraion of an objec is defined as he rae of change of elociy. If he elociy changes by an amoun in a ime, hen he aerage acceleraion during
More informationNRMI: Natural and Efficient Middleware
NRMI: Naural and Efficien Middleware Eli Tilevich and Yannis Smaragdakis Cener for Experimenal Research in Compuer Sysems (CERCS), College of Compuing, Georgia Tech {ilevich, yannis}@cc.gaech.edu Absrac
More informationParallel and Distributed Systems for Constructive Neural Network Learning*
Parallel and Disribued Sysems for Consrucive Neural Nework Learning* J. Flecher Z. Obradovi School of Elecrical Engineering and Compuer Science Washingon Sae Universiy Pullman WA 99164-2752 Absrac A consrucive
More informationWORKSHOP SAFETY IN MOBILE APPLICATION
WORKSHOP SAFETY IN MOBILE APPLICATION Renaa Mondelaers Seven Bellens SICK SEW Cerified Funcional Safey Applicaion Exper Technology Leader Smar Facory CFSAE by SGS/TÜV Saar MOBILE APPLICATION AVAILABLE
More informationVisually Summarizing the Web using Internal Images and Keyphrases
Visually Summarizing he Web using Inernal Images and Keyphrases M.V.Gedam, S. A. Taale Deparmen of compuer engineering, PUNE Universiy Vidya Praishhan s College of Engg., India Absrac Visual summarizaion
More information4 Error Control. 4.1 Issues with Reliable Protocols
4 Error Conrol Jus abou all communicaion sysems aemp o ensure ha he daa ges o he oher end of he link wihou errors. Since i s impossible o build an error-free physical layer (alhough some shor links can
More informationFisheye Lens Distortion Correction on Multicore and Hardware Accelerator Platforms
Fiheye Len Diorion Correcion on Mulicore and Hardware Acceleraor Plaform Konani Dalouka 1 Chrio D. Anonopoulo 1 Nikolao Sek M. Bella 1 Chai 1 Deparmen of Compuer and Communicaion Engineering Univeriy of
More informationDynamic Route Planning and Obstacle Avoidance Model for Unmanned Aerial Vehicles
Volume 116 No. 24 2017, 315-329 ISSN: 1311-8080 (prined version); ISSN: 1314-3395 (on-line version) url: hp://www.ijpam.eu ijpam.eu Dynamic Roue Planning and Obsacle Avoidance Model for Unmanned Aerial
More informationAUTOMATIC 3D FACE REGISTRATION WITHOUT INITIALIZATION
Chaper 3 AUTOMATIC 3D FACE REGISTRATION WITHOUT INITIALIZATION A. Koschan, V. R. Ayyagari, F. Boughorbel, and M. A. Abidi Imaging, Roboics, and Inelligen Sysems Laboraory, The Universiy of Tennessee, 334
More informationCS 152 Computer Architecture and Engineering. Lecture 6 - Memory
CS 152 Compuer Archiecure and Engineering Lecure 6 - Memory Krse Asanovic Elecrical Engineering and Compuer Sciences Universiy of California a Berkeley hp://www.eecs.berkeley.edu/~krse hp://ins.eecs.berkeley.edu/~cs152
More informationRule-Based Multi-Query Optimization
Rule-Based Muli-Query Opimizaion Mingsheng Hong Dep. of Compuer cience Cornell Universiy mshong@cs.cornell.edu Johannes Gehrke Dep. of Compuer cience Cornell Universiy johannes@cs.cornell.edu Mirek Riedewald
More informationVoltair Version 2.5 Release Notes (January, 2018)
Volair Version 2.5 Release Noes (January, 2018) Inroducion 25-Seven s new Firmware Updae 2.5 for he Volair processor is par of our coninuing effors o improve Volair wih new feaures and capabiliies. For
More informationC 1. Last Time. CSE 490/590 Computer Architecture. Cache I. Branch Delay Slots (expose control hazard to software)
CSE 490/590 Compuer Archiecure Cache I Seve Ko Compuer Sciences and Engineering Universiy a Buffalo Las Time Pipelining hazards Srucural hazards hazards Conrol hazards hazards Sall Bypass Conrol hazards
More informationMARSS Reference Sheet
MARSS Reference Shee The defaul MARSS model (form="marxss") is wrien as follows: x = B x 1 + u + C c + w where w MVN( Q ) y = Z x + a + D d + v where v MVN( R ) x 1 MVN(π Λ) or x MVN(π Λ) c and d are inpus
More informationManaging XML Versions and Replicas in a P2P Context
Managing XML Versions and Replicas in a P2P Conex Deise de Brum Saccol1 1,2, Nina Edelweiss 2, Renaa de Maos Galane 2,4, Carlo Zaniolo 3 2 Insiuo de Informáica - Universidade Federal do Rio Grande do Sul
More informationMATH Differential Equations September 15, 2008 Project 1, Fall 2008 Due: September 24, 2008
MATH 5 - Differenial Equaions Sepember 15, 8 Projec 1, Fall 8 Due: Sepember 4, 8 Lab 1.3 - Logisics Populaion Models wih Harvesing For his projec we consider lab 1.3 of Differenial Equaions pages 146 o
More informationUtility-Based Hybrid Memory Management
Uiliy-Based Hybrid Memory Managemen Yang Li Saugaa Ghose Jongmoo Choi Jin Sun Hui Wang Onur Mulu Carnegie Mellon Universiy Dankook Universiy Beihang Universiy ETH Zürich While he memory fooprins of cloud
More informationMIC2569. Features. General Description. Applications. Typical Application. CableCARD Power Switch
CableCARD Power Swich General Descripion is designed o supply power o OpenCable sysems and CableCARD hoss. These CableCARDs are also known as Poin of Disribuion (POD) cards. suppors boh Single and Muliple
More informationPerformance Evaluation of Implementing Calls Prioritization with Different Queuing Disciplines in Mobile Wireless Networks
Journal of Compuer Science 2 (5): 466-472, 2006 ISSN 1549-3636 2006 Science Publicaions Performance Evaluaion of Implemening Calls Prioriizaion wih Differen Queuing Disciplines in Mobile Wireless Neworks
More informationDesign Alternatives for a Thin Lens Spatial Integrator Array
Egyp. J. Solids, Vol. (7), No. (), (004) 75 Design Alernaives for a Thin Lens Spaial Inegraor Array Hala Kamal *, Daniel V azquez and Javier Alda and E. Bernabeu Opics Deparmen. Universiy Compluense of
More informationParallel Multigrid Preconditioning on Graphics Processing Units (GPUs) for Robust Power Grid Analysis
Design Auomaion Group Parallel Muligrid Precondiioning on Graphics Processing Unis (GPUs) for Robus Power Grid Analysis Zhuo Feng Michigan Technological Universiy Zhiyu Zeng Texas A&M Universiy 200 ACM/EDAC/IEEE
More informationReal Time Integral-Based Structural Health Monitoring
Real Time Inegral-Based Srucural Healh Monioring The nd Inernaional Conference on Sensing Technology ICST 7 J. G. Chase, I. Singh-Leve, C. E. Hann, X. Chen Deparmen of Mechanical Engineering, Universiy
More informationSelective Offloading in Mobile Edge Computing for the Green Internet of Things
EDGE COMPUTING FOR THE INTERNET OF THINGS Selecive Offloading in Mobile Edge Compuing for he Green Inerne of Things Xinchen Lyu, Hui Tian, Li Jiang, Alexey Vinel, Sabia Maharjan, Sein Gjessing, and Yan
More informationArchitectural Design and Implementation of 2008 ZigBee Specification on TKNlS.4 Layer
rchiecural Design and mplemenaion of 2008 ZigBee Specificaion on TKNlS.4 Layer nh-vu Dinh-Due Universiy of nfonnaion Technology Ho Chi Minh ciy, Vie Nam anhvu@ui.edu.vn Ngoe-nLam HCMC Universiy of Technology
More informationHidden Markov Model and Chapman Kolmogrov for Protein Structures Prediction from Images
Hidden Markov Model and Chapman Kolmogrov for Proein Srucures Predicion from Images Md.Sarwar Kamal 1, Linkon Chowdhury 2, Mohammad Ibrahim Khan 2, Amira S. Ashour 3, João Manuel R.S. Tavares 4, Nilanjan
More informationA Petri Net Approach for. Performance Oriented Parallel Program Design. A. Ferscha. Institut fur Statistik und Informatik, Universitat Wien
A eri Ne Approach for erformance Oriened arallel rogram Design A. Ferscha Insiu fur Saisik und Informaik, Universia Wien Lenaugasse 2/8, A-1080 Vienna, AUSTRIA Tel.: +43 1 408 63 66 5, Fax: +43 1 408 63
More informationAnnouncements. TCP Congestion Control. Goals of Today s Lecture. State Diagrams. TCP State Diagram
nnouncemens TCP Congesion Conrol Projec #3 should be ou onigh Can do individual or in a eam of 2 people Firs phase due November 16 - no slip days Exercise good (beer) ime managemen EE 122: Inro o Communicaion
More informationCHANGE DETECTION - CELLULAR AUTOMATA METHOD FOR URBAN GROWTH MODELING
CHANGE DETECTION - CELLULAR AUTOMATA METHOD FOR URBAN GROWTH MODELING Sharaf Alkheder, Jun Wang and Jie Shan Geomaics Engineering, School of Civil Engineering, Purdue Universiy 550 Sadium Mall Drive, Wes
More informationOpen Access Research on an Improved Medical Image Enhancement Algorithm Based on P-M Model. Luo Aijing 1 and Yin Jin 2,* u = div( c u ) u
Send Orders for Reprins o reprins@benhamscience.ae The Open Biomedical Engineering Journal, 5, 9, 9-3 9 Open Access Research on an Improved Medical Image Enhancemen Algorihm Based on P-M Model Luo Aijing
More informationA Bayesian Approach to Video Object Segmentation via Merging 3D Watershed Volumes
A Bayesian Approach o Video Objec Segmenaion via Merging 3D Waershed Volumes Yu-Pao Tsai 1,3, Chih-Chuan Lai 1,2, Yi-Ping Hung 1,2, and Zen-Chung Shih 3 1 Insiue of Informaion Science, Academia Sinica,
More informationMotion Level-of-Detail: A Simplification Method on Crowd Scene
Moion Level-of-Deail: A Simplificaion Mehod on Crowd Scene Absrac Junghyun Ahn VR lab, EECS, KAIST ChocChoggi@vr.kais.ac.kr hp://vr.kais.ac.kr/~zhaoyue Recen echnological improvemen in characer animaion
More informationRobust Visual Tracking for Multiple Targets
Robus Visual Tracking for Muliple Targes Yizheng Cai, Nando de Freias, and James J. Lile Universiy of Briish Columbia, Vancouver, B.C., Canada, V6T 1Z4 {yizhengc, nando, lile}@cs.ubc.ca Absrac. We address
More informationService Oriented Solution Modeling and Variation Propagation Analysis based on Architectural Building Blocks
Carnegie Mellon Universiy From he SelecedWorks of Jia Zhang Ocober, 203 Service Oriened Soluion Modeling and Variaion Propagaion Analysis based on Archiecural uilding locks Liang-Jie Zhang Jia Zhang Available
More informationProbabilistic Detection and Tracking of Motion Discontinuities
Probabilisic Deecion and Tracking of Moion Disconinuiies Michael J. Black David J. Flee Xerox Palo Alo Research Cener 3333 Coyoe Hill Road Palo Alo, CA 94304 fblack,fleeg@parc.xerox.com hp://www.parc.xerox.com/fblack,fleeg/
More informationA NEW APPROACH FOR 3D MODELS TRANSMISSION
A NEW APPROACH FOR 3D MODELS TRANSMISSION A. Guarnieri a, F. Piroi a, M. Ponin a, A. Veore a a CIRGEO, Inerdep. Research Cener of Carography, Phoogrammery, Remoe Sensing and GIS Universiy of Padova, Agripolis
More informationAn efficient approach to improve throughput for TCP vegas in ad hoc network
Inernaional Research Journal of Engineering and Technology (IRJET) e-issn: 395-0056 Volume: 0 Issue: 03 June-05 www.irje.ne p-issn: 395-007 An efficien approach o improve hroughpu for TCP vegas in ad hoc
More informationSpline Curves. Color Interpolation. Normal Interpolation. Last Time? Today. glshademodel (GL_SMOOTH); Adjacency Data Structures. Mesh Simplification
Las Time? Adjacency Daa Srucures Spline Curves Geomeric & opologic informaion Dynamic allocaion Efficiency of access Mesh Simplificaion edge collapse/verex spli geomorphs progressive ransmission view-dependen
More informationEECS 487: Interactive Computer Graphics
EECS 487: Ineracive Compuer Graphics Lecure 7: B-splines curves Raional Bézier and NURBS Cubic Splines A represenaion of cubic spline consiss of: four conrol poins (why four?) hese are compleely user specified
More informationConstant-Work-Space Algorithms for Shortest Paths in Trees and Simple Polygons
Journal of Graph Algorihms and Applicaions hp://jgaa.info/ vol. 15, no. 5, pp. 569 586 (2011) Consan-Work-Space Algorihms for Shores Pahs in Trees and Simple Polygons Tesuo Asano 1 Wolfgang Mulzer 2 Yajun
More informationLecture 18: Mix net Voting Systems
6.897: Advanced Topics in Crypography Apr 9, 2004 Lecure 18: Mix ne Voing Sysems Scribed by: Yael Tauman Kalai 1 Inroducion In he previous lecure, we defined he noion of an elecronic voing sysem, and specified
More informationA Matching Algorithm for Content-Based Image Retrieval
A Maching Algorihm for Conen-Based Image Rerieval Sue J. Cho Deparmen of Compuer Science Seoul Naional Universiy Seoul, Korea Absrac Conen-based image rerieval sysem rerieves an image from a daabase using
More informationCS 152 Computer Architecture and Engineering. Lecture 6 - Memory
CS 152 Compuer Archiecure and Engineering Lecure 6 - Memory Krse Asanovic Elecrical Engineering and Compuer Sciences Universiy of California a Berkeley hp://www.eecs.berkeley.edu/~krse hp://ins.eecs.berkeley.edu/~cs152
More informationUSBFC (USB Function Controller)
USBFC () EIFUFAL501 User s Manual Doc #: 88-02-E01 Revision: 2.0 Dae: 03/24/98 (USBFC) 1. Highlighs... 4 1.1 Feaures... 4 1.2 Overview... 4 1.3 USBFC Block Diagram... 5 1.4 USBFC Typical Sysem Block Diagram...
More informationCPUs and GPUs with Many Cores
Mulimedia Signal Processing on CPUs and GPUs wih Many Cores Yen-Kuang Chen Inel Corporaion y.k.chen@ieee.org Summary Muli-Core is Becoming Mainsream Archiecure: Number of cores per chip will grow quickly
More informationDetection and segmentation of moving objects in highly dynamic scenes
Deecion and segmenaion of moving objecs in highly dynamic scenes Aurélie Bugeau Parick Pérez INRIA, Cenre Rennes - Breagne Alanique Universié de Rennes, Campus de Beaulieu, 35 042 Rennes Cedex, France
More informationImproving the Efficiency of Dynamic Service Provisioning in Transport Networks with Scheduled Services
Improving he Efficiency of Dynamic Service Provisioning in Transpor Neworks wih Scheduled Services Ralf Hülsermann, Monika Jäger and Andreas Gladisch Technologiezenrum, T-Sysems, Goslarer Ufer 35, D-1585
More informationRepresenting Non-Manifold Shapes in Arbitrary Dimensions
Represening Non-Manifold Shapes in Arbirary Dimensions Leila De Floriani,2 and Annie Hui 2 DISI, Universiy of Genova, Via Dodecaneso, 35-646 Genova (Ialy). 2 Deparmen of Compuer Science, Universiy of Maryland,
More informationPo,,ll. I Appll I APP2 I I App3 I. Illll Illlllll II Illlll Illll Illll Illll Illll Illll Illll Illll Illll Illll Illll Illlll Illl Illl Illl
Illll Illlllll II Illlll Illll Illll Illll Illll Illll Illll Illll Illll Illll Illll Illlll Illl Illl Illl US 20110153728A1 (19) nied Saes (12) Paen Applicaion Publicaion (10) Pub. No.: S 2011/0153728
More informationResearch Article Particle Filtering: The Need for Speed
Hindawi Publishing Corporaion EURASIP Journal on Advances in Signal Processing Volume 2010, Aricle ID 181403, 9 pages doi:10.1155/2010/181403 Research Aricle Paricle Filering: The Need for Speed Gusaf
More informationA pipeline polish string computer*
A pipeline polish sring compuer* by GERARD G. BAILLE and JEAN P. SCHOELLKOPF Co--,np'ueT ATcli-iecure Group Grenoble "Universiy, France ABSTRACT This paper describes a new compuer organizaion which allows
More informationImproving Occupancy Grid FastSLAM by Integrating Navigation Sensors
Improving Occupancy Grid FasSLAM by Inegraing Navigaion Sensors Chrisopher Weyers Sensors Direcorae Air Force Research Laboraory Wrigh-Paerson AFB, OH 45433 Gilber Peerson Deparmen of Elecrical and Compuer
More information