Accelerating Multiprocessor Simulation with a Memory Timestamp Record
|
|
- Anna Riley
- 5 years ago
- Views:
Transcription
1 Aelerating Multiproessor Simulation with a Memory Timestamp Reord Kenneth Barr Heidi Pan Mihael Zhang Krste Asanovi Marh, 5 Massahusetts Institute of Tehnology
2 Intelligent sampling gives est speed-auray tradeoff for uniproessors (Yi, HPCA `5) Single sample Fastforward + single sample Fastforward + Warmup + sample detailed ISA only detailed ISA only measure d e t a i l e d ignored ignored ignored Seletive Sampling (SimPoints) Statistial Sampling Statistial sampling w/ Fast Funtional Warming (SMARTS, FFW) ISA+µarh Memory Timestamp Reord ISA+MTR Update Reonstrut ahes Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
3 Snapshots amortize fast-forwarding, ut require slow warming or ind to a partiular µarh ISA only snapshots: Slow due to warmup, ut allows any µarh ISA+µarh snapshots: Fast (less warmup), ut tied to µarh MTR snapshots: Fast, NOT tied to µarh, supports multiproessors Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
4 Multiproessors simulation is espeially slow More ores More state/omplexity Long, omplex simulations CPU CPU CPUn $ $ $ Memory Diretory Full system, threaded apps More variaility More simulation CPUs Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 4 time
5 For full-system simulations of ommerial workloads, sutle variation matters! (Alameldeen and Wood, ) CPU 4 Time =.5 All produe same result, eah has different runtime DRAM refresh Hard disk arrangement delays DMA Inoming paket interrupts appliation Loking order reversed Time =.8 Time =. Proesses migrate Is our new gizmo a suess? Maye OS just ordered threads differently! Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 5
6 What is the Memory Timestamp Reord (MTR)? MTR is astrat piture of an multiproessor s oherene state CPUn- Last Writetime Last Writer N- Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 6
7 What is the Memory Timestamp Reord (MTR)? MTR is astrat piture of an multiproessor s oherene state Allow quik update during fast forwarding Fill in onrete ahes and diretory prior to sampling CPUn- Last Writetime Last Writer CPU CPU CPUn $ $ $ N- Memory Diretory Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 7
8 MTR example: update MTR ontains one entry per memory lok; loality keeps it sparse. New aess times overwrite old (self-ompressing) MTR: a d e CPUn- Memory Trae: Time 4 Last Writetime CPU Last Writer Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 8
9 MTR example: update MTR ontains one entry per memory lok; loality keeps it sparse. New aess times overwrite old (self-ompressing) MTR: a d e CPUn- Memory Trae: Time 4 Last Writetime Read a CPU Last Writer Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 9
10 MTR example: update MTR ontains one entry per memory lok; loality keeps it sparse. New aess times overwrite old (self-ompressing) MTR: a d e CPUn- Memory Trae: Time 4 Last Writetime Read a Read e CPU Last Writer Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
11 MTR example: update MTR ontains one entry per memory lok; loality keeps it sparse. New aess times overwrite old (self-ompressing) MTR: a d e CPUn- Memory Trae: Time 4 Last Writetime Read a Read e Read CPU Last Writer Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
12 MTR example: update MTR ontains one entry per memory lok; loality keeps it sparse. New aess times overwrite old (self-ompressing) MTR: a d e CPUn- Memory Trae: Time 4 Last Writetime Read a Read e Read Read CPU Last Writer Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
13 MTR example: update MTR ontains one entry per memory lok; loality keeps it sparse. New aess times overwrite old (self-ompressing) MTR: a d e CPUn- 4 Memory Trae: Time 4 Last Writetime Read a Read e Read Read CPU CPU Write Last Writer Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
14 . Coalese: determining orret ahe tags MTR: CPUn- Last Writetime Last Writer Cahe: Way Way Set Set Set Set Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 4
15 . Coalese: determining orret ahe tags MTR: CPUn- Last Writetime Last Writer Cahe: Way Way Set Set Set Set Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 5
16 . Coalese: determining orret ahe tags MTR: CPUn- Last Writetime Last Writer Cahe: Way Way Set Set Set Set Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 6
17 MTR example: oalese Choose organization One set, two ways Coalese Determine whih loks map to same set Only ways most reent timestamps are present. Chek validity later. CPUn- Last Writetime Last Writer a 4 CPU d e Way Way Way Way Set Set Set Set Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 7
18 MTR example: oalese Choose organization One set, two ways Coalese Determine whih loks map to same set Only ways most reent timestamps are present. Chek validity later. CPUn- Last Writetime Last Writer a 4 CPU d e What are the ontents of ahe? Way Way Set Set Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 8
19 MTR example: oalese Choose organization One set, two ways Coalese Determine whih loks map to same set Only ways most reent timestamps are present. Chek validity later. CPUn- Last Writetime Last Writer a 4 CPU d e What are the ontents of ahe? Way Way Set Set a Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 9
20 MTR example: oalese Choose organization One set, two ways Coalese Determine whih loks map to same set Only ways most reent timestamps are present. Chek validity later. CPUn- Last Writetime Last Writer a 4 CPU d e What are the ontents of ahe? Way Way Set Set a Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
21 MTR example: oalese Choose organization One set, two ways Coalese Determine whih loks map to same set Only ways most reent timestamps are present. Chek validity later. CPUn- Last Writetime Last Writer a 4 CPU d e What are the ontents of ahe? Way Way Set a Set Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
22 MTR example: oalese Choose organization One set, two ways Coalese Determine whih loks map to same set Only ways most reent timestamps are present. Chek validity later. CPUn- Last Writetime Last Writer a 4 CPU d e What are the ontents of ahe? Way Way Set e Set Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
23 MTR example: oalese Choose organization One set, two ways Coalese Determine whih loks map to same set Only ways most reent timestamps are present. Chek validity later. CPUn- Last Writetime Last Writer a 4 CPU d e What are the ontents of ahe? CPU? Way Way Way Way Set e Set Set Set 4 Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
24 Fixup: determine orret status its Way Way Set Set Set Set Cahe Way Way Set Set Set Set Cahe Way Way Set Set Set Set Cahe n- Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 4
25 MTR example: fixup Reads prior to a write are invalid, valid writes are dirty, et CPUn- Last Writetime Last Writer a 4 CPU d e Whih ahe has the most reent opy of? Way Way Way Way Set e Set Set Set 4 invalid Valid, dirty Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 5
26 MTR example: diretory reonstrution CPUn- Last Writetime Last Writer a 4 CPU d e a d e State S M S I S Sharers CPU Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 6
27 The MTR supports many popular organizations and protools Snoopy or diretory-ased Multilevel ahes Inlusive Exlusive Time-ased replaement poliy Strit LRU Cahe deay Invalidate, Update, Update-Invalidate MSI, MESI, MOESI Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 7
28 Evits annot e reorded in the MTR, ut many an e inferred MTR: address CPU Writetime Writer n+k n CASE A: writes reads = dirty CASE B: writes writes eviting reads = lean n Time n+k Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 8
29 Evaluation / Results
30 Detailed, full-system, exeution-driven, x86, SMP simulation CPU N- CPU CPU Stall Detailed Memory System Magi Memory Main Memory Bohs Memory Timestamp Reord SMP Bohs provides: devies (allowing an OS), x86 Deoding and Exeution, Magi Memory Detailed Mode Enale Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
31 Detailed, full-system, exeution-driven, x86, SMP simulation CPU N- CPU CPU Stall Detailed Memory System Magi Memory Main Memory Bohs Memory Timestamp Reord SMP Bohs provides: devies (allowing an OS), x86 Deoding and Exeution, Magi Memory Detailed Mode Enale Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
32 Reonstruting with the MTR CPU N- CPU CPU Stall Detailed Memory System Magi Memory Main Memory Bohs Memory Timestamp Reord Memory Timestamp Reord: allows swithing etween funtional fast-fwd and detailed simulation Detailed Mode Enale Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
33 Our detailed memory model an stall a proessor s exeution ased on timing models CPU N- CPU CPU Stall Detailed Memory System Magi Memory Main Memory Bohs Memory Timestamp Reord Detailed Memory System provides: ahe oherene, network, DRAM timing, stall signal Detailed Mode Enale Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5.
34 Benhmarks! NASA Advaned Superomputing Parallel Benhmarks: sientifi (omp. fluid dynamis) OpenMP (loop iterations in parallel) Fortran OS enhmarks denh: (Sama) several lients making file-entri system alls Apahe: several lients hammer we server (via loopak interfae) Cilk: hekers: AI searh plies in parallel uses spawn/syn primitives (dynami thread reation/sheduling) Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 4
35 We ompare MTR to full detailed and a naïve implementation of SMP fast forwarding. Baseline : full detailed simulation (overnight) Baseline : naïve funtional fast forwarding (FFW) Funtional simulation of ISA Cahe / diretory state kept aurate Tag heks, replaement poliy enfored Diretory onsulted and updated On miss/oherene miss, invalidate outstanding opies Omits network messages, queues, latenies (present in detailed mode) Hypothesis Both FFW and MTR should e aurate and fast MTR should e faster than FFW To e useful, FFW and MTR must answer questions in the same way as a detailed model, ut faster Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 5
36 Full-system experiments must respet system variation, or risk inorret predition! Methodology Every k yles hoose vitim proessor Vitim will run 5% slower to emulate variation Note: variation has MUCH larger effet during fast mode Bar shows the median of eight runs, with tiks for min and max. Eah run is a valid result! Ideal: range for fast runs should e within range of (all possile) detailed. Can t draw onlusions aout disrepany until we understand distriution denh Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 6 Cahe miss rate (%) : : :
37 Repliating detailed -mode stats less ruial than aurate answers to design questions Assume oserved = atual With respet to reply message types, the MSI vs. MESI hange is dramati. All fast-fwd ars move with the detailed ar. Movement eyond range of detailed runs Disover evits to more losely math detailed run Or, tune vitim/slowdown (no amig. resolution) (no amig. resolution) writeak rep Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 7
38 Runtime (normalized to FFW :) MTR averages up to.45x faster than FFW FFW (mg) : : : MTR (mg) : : : Detailed Simulation Detailed Warming Fast to detailed Fast Forward MTR spends less time in fast forward MTR does less work in ommon ase Time saved in fast forward time less than MTR transition ost MTR has ostlier transition, ut Reonstrution sales with touhed lines not total aesses Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 8
39 Conlusion: Memory Timestamp Reord provides fast, aurate, and flexile SMP simulation MTR.45X faster than funtional warming 7.7X faster than our detailed simulator Eliminates need to regenerate snapshots Answers arhitetural questions similar to detailed simulation Future work Simultaneous multiple-onfiguration simulation MTR ompression for disk snapshots Parallelized update and reonstrution Barr, Pan, Zhang, and Asanović. ISPASS. Marh, 5. 9
Summarizing Multiprocessor Program Execution with Versatile, Microarchitecture-Independent Snapshots
Summarizing Multiprocessor Program Execution with Versatile, Microarchitecture-Independent Snapshots Kenneth C. Barr Thesis Defense August 25, 2006 Massachusetts Institute of Technology My thesis, a bird
More informationAnnouncements. Lecture Caching Issues for Multi-core Processors. Shared Vs. Private Caches for Small-scale Multi-core
Announements Your fous should be on the lass projet now Leture 17: Cahing Issues for Multi-ore Proessors This week: status update and meeting A short presentation on: projet desription (problem, importane,
More informationWhat are Cycle-Stealing Systems Good For? A Detailed Performance Model Case Study
What are Cyle-Stealing Systems Good For? A Detailed Performane Model Case Study Wayne Kelly and Jiro Sumitomo Queensland University of Tehnology, Australia {w.kelly, j.sumitomo}@qut.edu.au Abstrat The
More information8 Instruction Selection
8 Instrution Seletion The IR ode instrutions were designed to do exatly one operation: load/store, add, subtrat, jump, et. The mahine instrutions of a real CPU often perform several of these primitive
More informationCA Privileged Access Manager 3.x Proven Implementation Professional Exam (CAT-661) Study Guide Version 1.0
Exam (CAT-661) Study Guide Version 1.0 PROPRIETARY AND CONFIDENTIAL INFMATION 2018 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer use only. No unauthorized
More informationThis fact makes it difficult to evaluate the cost function to be minimized
RSOURC LLOCTION N SSINMNT In the resoure alloation step the amount of resoures required to exeute the different types of proesses is determined. We will refer to the time interval during whih a proess
More informationDirect-Mapped Caches
A Case for Diret-Mapped Cahes Mark D. Hill University of Wisonsin ahe is a small, fast buffer in whih a system keeps those parts, of the ontents of a larger, slower memory that are likely to be used soon.
More informationParametric Abstract Domains for Shape Analysis
Parametri Abstrat Domains for Shape Analysis Xavier RIVAL (INRIA & Éole Normale Supérieure) Joint work with Bor-Yuh Evan CHANG (University of Maryland U University of Colorado) and George NECULA (University
More informationOn - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2
On - Line Path Delay Fault Testing of Omega MINs M. Bellos, E. Kalligeros, D. Nikolos,2 & H. T. Vergos,2 Dept. of Computer Engineering and Informatis 2 Computer Tehnology Institute University of Patras,
More information- 1 - S 21. Directory-based Administration of Virtual Private Networks: Policy & Configuration. Charles A Kunzinger.
- 1 - S 21 Diretory-based Administration of Virtual Private Networks: Poliy & Configuration Charles A Kunzinger kunzinge@us.ibm.om - 2 - Clik here Agenda to type page title What is a VPN? What is VPN Poliy?
More informationCOSSIM An Integrated Solution to Address the Simulator Gap for Parallel Heterogeneous Systems
COSSIM An Integrated Solution to Address the Simulator Gap for Parallel Heterogeneous Systems Andreas Brokalakis Synelixis Solutions Ltd, Greee brokalakis@synelixis.om Nikolaos Tampouratzis Teleommuniation
More informationArchitecture and Performance of the Hitachi SR2201 Massively Parallel Processor System
Arhiteture and Performane of the Hitahi SR221 Massively Parallel Proessor System Hiroaki Fujii, Yoshiko Yasuda, Hideya Akashi, Yasuhiro Inagami, Makoto Koga*, Osamu Ishihara*, Masamori Kashiyama*, Hideo
More informationAutomated System for the Study of Environmental Loads Applied to Production Risers Dustin M. Brandt 1, Celso K. Morooka 2, Ivan R.
EngOpt 2008 - International Conferene on Engineering Optimization Rio de Janeiro, Brazil, 01-05 June 2008. Automated System for the Study of Environmental Loads Applied to Prodution Risers Dustin M. Brandt
More informationWhy then another BFT protocol? Zyzzyva. Simplify, simplify. Simplify, simplify. Complex decision tree hampers BFT adoption. H.D. Thoreau. H.D.
Why then another BFT protool? Yes No Zyzzyva Yes No Yes No Comple deision tree hampers BFT adoption Simplify, simplify H.D. Thoreau Simplify, simplify H.D. Thoreau Yes No Yes No Yes Yes No One protool
More informationPartial Character Decoding for Improved Regular Expression Matching in FPGAs
Partial Charater Deoding for Improved Regular Expression Mathing in FPGAs Peter Sutton Shool of Information Tehnology and Eletrial Engineering The University of Queensland Brisbane, Queensland, 4072, Australia
More informationAccommodations of QoS DiffServ Over IP and MPLS Networks
Aommodations of QoS DiffServ Over IP and MPLS Networks Abdullah AlWehaibi, Anjali Agarwal, Mihael Kadoh and Ahmed ElHakeem Department of Eletrial and Computer Department de Genie Eletrique Engineering
More informationCA Test Data Manager 4.x Implementation Proven Professional Exam (CAT-681) Study Guide Version 1.0
Implementation Proven Professional Study Guide Version 1.0 PROPRIETARY AND CONFIDENTIAL INFORMATION 2017 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer
More informationAllocating Rotating Registers by Scheduling
Alloating Rotating Registers by Sheduling Hongbo Rong Hyunhul Park Cheng Wang Youfeng Wu Programming Systems Lab Intel Labs {hongbo.rong,hyunhul.park,heng..wang,youfeng.wu}@intel.om ABSTRACT A rotating
More informationCA Identity Suite 14.x Implementation Proven Professional Exam (CAT-760) Study Guide Version 1.1
Study Guide Version 1.1 PROPRIETARY AND CONFIDENTIAL INFORMATION 2018 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer use only. No unauthorized use,
More informationCompilation Lecture 11a. Register Allocation Noam Rinetzky. Text book: Modern compiler implementation in C Andrew A.
Compilation 0368-3133 Leture 11a Text book: Modern ompiler implementation in C Andrew A. Appel Register Alloation Noam Rinetzky 1 Registers Dediated memory loations that an be aessed quikly, an have omputations
More informationZ8530 Programming Guide
Z8530 Programming Guide Alan Cox alan@redhat.om Z8530 Programming Guide by Alan Cox Copyright 2000 by Alan Cox This doumentation is free software; you an redistribute it and/or modify it under the terms
More informationCA API Management 8.x Implementation Proven Professional Exam (CAT-560) Study Guide Version 1.1
Exam (CAT-560) Study Guide Version 1.1 PROPRIETARY AND CONFIDENTIAL INFORMATION 2016 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer use only. No unauthorized
More informationCOST PERFORMANCE ASPECTS OF CCD FAST AUXILIARY MEMORY
COST PERFORMANCE ASPECTS OF CCD FAST AUXILIARY MEMORY Dileep P, Bhondarkor Texas Instruments Inorporated Dallas, Texas ABSTRACT Charge oupled devies (CCD's) hove been mentioned as potential fast auxiliary
More informationXML Data Streams. XML Stream Processing. XML Stream Processing. Yanlei Diao. University of Massachusetts Amherst
XML Stream Proessing Yanlei Diao University of Massahusetts Amherst XML Data Streams XML is the wire format for data exhanged online. Purhase orders http://www.oasis-open.org/ommittees/t_home.php?wg_abbrev=ubl
More informationDECODING OF ARRAY LDPC CODES USING ON-THE FLY COMPUTATION Kiran Gunnam, Weihuang Wang, Euncheol Kim, Gwan Choi, Mark Yeary *
DECODING OF ARRAY LDPC CODES USING ON-THE FLY COMPUTATION Kiran Gunnam, Weihuang Wang, Eunheol Kim, Gwan Choi, Mark Yeary * Dept. of Eletrial Engineering, Texas A&M University, College Station, TX-77840
More informationMenu. X + /X=1 and XY+X /Y = X(Y + /Y) = X
Menu K-Maps and Boolean Algera >Don t ares >5 Variale Look into my... 1 Karnaugh Maps - Boolean Algera We have disovered that simplifiation/minimization is an art. If you see it, GREAT! Else, work at it,
More informationChapter 2: Introduction to Maple V
Chapter 2: Introdution to Maple V 2-1 Working with Maple Worksheets Try It! (p. 15) Start a Maple session with an empty worksheet. The name of the worksheet should be Untitled (1). Use one of the standard
More informationAlgorithms, Mechanisms and Procedures for the Computer-aided Project Generation System
Algorithms, Mehanisms and Proedures for the Computer-aided Projet Generation System Anton O. Butko 1*, Aleksandr P. Briukhovetskii 2, Dmitry E. Grigoriev 2# and Konstantin S. Kalashnikov 3 1 Department
More information1 Disjoint-set data structure.
CS 124 Setion #4 Union-Fin, Greey Algorithms 2/20/17 1 Disjoint-set ata struture. 1.1 Operations Disjoint-set ata struture enale us to effiiently perform operations suh as plaing elements into sets, querying
More informationEvaluation of Benchmark Performance Estimation for Parallel. Fortran Programs on Massively Parallel SIMD and MIMD. Computers.
Evaluation of Benhmark Performane Estimation for Parallel Fortran Programs on Massively Parallel SIMD and MIMD Computers Thomas Fahringer Dept of Software Tehnology and Parallel Systems University of Vienna
More informationA Dictionary based Efficient Text Compression Technique using Replacement Strategy
A based Effiient Text Compression Tehnique using Replaement Strategy Debashis Chakraborty Assistant Professor, Department of CSE, St. Thomas College of Engineering and Tehnology, Kolkata, 700023, India
More informationCA PPM 14.x Implementation Proven Professional Exam (CAT-222) Study Guide Version 1.2
CA PPM 14.x Implementation Proven Professional Exam (CAT-222) Study Guide Version 1.2 PROPRIETARY AND CONFIDENTIAL INFMATION 2016 CA. All rights reserved. CA onfidential & proprietary information. For
More informationFolding. Hardware Mapped vs. Time multiplexed. Folding by N (N=folding factor) Node A. Unfolding by J A 1 A J-1. Time multiplexed/microcoded
Folding is verse of Unfolding Node A A Folding by N (N=folding fator) Folding A Unfolding by J A A J- Hardware Mapped vs. Time multiplexed l Hardware Mapped vs. Time multiplexed/mirooded FI : y x(n) h
More informationMulticore Software and Applications
Multiore Software and Appliations Nir Shavit MIT CSAIL Where do we ome from? What are we? Where are we going? Paul Gauguin [Image Credit: Museum of Fine Arts Boston] 64 Cores Today, 1000 in Near Future
More informationA Hybrid Neuro-Genetic Approach to Short-Term Traffic Volume Prediction
A Hybrid Neuro-Geneti Approah to Short-Term Traffi Volume Predition 1. Introdution Shahriar Afandizadeh 1,*, Jalil Kianfar 2 Reeived: January 2003, Revised: July 2008, Aepted: January 2009 Abstrat: This
More informationPBFT: A Byzantine Renaissance. The Setup. What could possibly go wrong? The General Idea. Practical Byzantine Fault-Tolerance (CL99, CL00)
PBFT: A Byzantine Renaissane Pratial Byzantine Fault-Tolerane (CL99, CL00) first to be safe in asynhronous systems live under weak synhrony assumptions -Byzantine Paos! The Setup Crypto System Model Asynhronous
More informationTriangles. Learning Objectives. Pre-Activity
Setion 3.2 Pre-tivity Preparation Triangles Geena needs to make sure that the dek she is building is perfetly square to the brae holding the dek in plae. How an she use geometry to ensure that the boards
More informationmahines. HBSP enhanes the appliability of the BSP model by inorporating parameters that reet the relative speeds of the heterogeneous omputing omponen
The Heterogeneous Bulk Synhronous Parallel Model Tiani L. Williams and Rebea J. Parsons Shool of Computer Siene University of Central Florida Orlando, FL 32816-2362 fwilliams,rebeag@s.uf.edu Abstrat. Trends
More informationCA Service Desk Manager 14.x Implementation Proven Professional Exam (CAT-181) Study Guide Version 1.3
Exam (CAT-181) Study Guide Version 1.3 PROPRIETARY AND CONFIDENTIAL INFORMATION 2017 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer use only. No unauthorized
More informationConstructing Transaction Serialization Order for Incremental. Data Warehouse Refresh. Ming-Ling Lo and Hui-I Hsiao. IBM T. J. Watson Research Center
Construting Transation Serialization Order for Inremental Data Warehouse Refresh Ming-Ling Lo and Hui-I Hsiao IBM T. J. Watson Researh Center July 11, 1997 Abstrat In typial pratie of data warehouse, the
More informationCA Single Sign-On 12.x Proven Implementation Professional Exam (CAT-140) Study Guide Version 1.5
Study Guide Version 1.5 PROPRIETARY AND CONFIDENTIAL INFORMATION 2018 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer use only. No unauthorized use,
More informationLearning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract
CS 9 Projet Final Report: Learning Convention Propagation in BeerAdvoate Reviews from a etwork Perspetive Abstrat We look at the way onventions propagate between reviews on the BeerAdvoate dataset, and
More informationOvidSP Quick Reference Card
OvidSP Quik Referene Card Searh in any of several dynami modes, ombine results, apply limits, use improved researh tools, develop strategies, save searhes, set automati alerts and RSS feeds, share results...
More informationIncremental Mining of Partial Periodic Patterns in Time-series Databases
CERIAS Teh Report 2000-03 Inremental Mining of Partial Periodi Patterns in Time-series Dataases Mohamed G. Elfeky Center for Eduation and Researh in Information Assurane and Seurity Purdue University,
More informationEfficient and scalable trie-based algorithms for computing set containment relations
Effiient and salale trie-ased algorithms for omputing set ontainment relations Yongming Luo #1, George H. L. Flether #2, Jan Hidders 3, Paul De Bra #4 # Eindhoven University of Tehnology, The Netherlands
More informationReevaluating the overhead of data preparation for asymmetric multicore system on graphics processing
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS VOL. 10, NO. 7, Jul. 2016 3231 Copyright 2016 KSII Reevaluating the overhead of data preparation for asymmetri multiore system on graphis proessing
More informationPost-K Supercomputer with Fujitsu's Original CPU, A64FX Powered by Arm ISA
Post-K Superomputer with Fujitsu's Original CPU, A64FX Powered by Arm ISA Toshiyuki Shimizu Nov. 15th, 2018 Post-K is under development, information in these slides is subjet to hange without notie 0 Agenda
More information13.1 Numerical Evaluation of Integrals Over One Dimension
13.1 Numerial Evaluation of Integrals Over One Dimension A. Purpose This olletion of subprograms estimates the value of the integral b a f(x) dx where the integrand f(x) and the limits a and b are supplied
More informationSVC-DASH-M: Scalable Video Coding Dynamic Adaptive Streaming Over HTTP Using Multiple Connections
SVC-DASH-M: Salable Video Coding Dynami Adaptive Streaming Over HTTP Using Multiple Connetions Samar Ibrahim, Ahmed H. Zahran and Mahmoud H. Ismail Department of Eletronis and Eletrial Communiations, Faulty
More informationA Novel Timestamp Ordering Approach for Co-existing Traditional and Cooperative Transaction Processing
A Novel Timestamp Ordering Approah for Co-existing Traditional and Cooperative Transation Proessing Author Sun, Chengzheng, Zhang, Y., Kambayashi, Y., Yang, Y. Published 1998 Conferene Title Proeedings
More informationCA Agile Requirements Designer 2.x Implementation Proven Professional Exam (CAT-720) Study Guide Version 1.0
Exam (CAT-720) Study Guide Version 1.0 PROPRIETARY AND CONFIDENTIAL INFORMATION 2017 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer use only. No unauthorized
More informationSSD Based First Layer File System for the Next Generation Super-computer
SSD Based First Layer File System for the Next Generation Super-omputer Shinji Sumimoto, Ph.D. Next Generation Tehnial Computing Unit FUJITSU LIMITED Sept. 24 th, 2018 0 Outline of This Talk A64FX: High
More informationCapturing Large Intra-class Variations of Biometric Data by Template Co-updating
Capturing Large Intra-lass Variations of Biometri Data by Template Co-updating Ajita Rattani University of Cagliari Piazza d'armi, Cagliari, Italy ajita.rattani@diee.unia.it Gian Lua Marialis University
More informationUser-level Fairness Delivered: Network Resource Allocation for Adaptive Video Streaming
User-level Fairness Delivered: Network Resoure Alloation for Adaptive Video Streaming Mu Mu, Steven Simpson, Arsham Farshad, Qiang Ni, Niholas Rae Shool of Computing and Communiations, Lanaster University
More informationZippy - A coarse-grained reconfigurable array with support for hardware virtualization
Zippy - A oarse-grained reonfigurable array with support for hardware virtualization Christian Plessl Computer Engineering and Networks Lab ETH Zürih, Switzerland plessl@tik.ee.ethz.h Maro Platzner Department
More informationAutomatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425)
Automati Physial Design Tuning: Workload as a Sequene Sanjay Agrawal Mirosoft Researh One Mirosoft Way Redmond, WA, USA +1-(425) 75-357 sagrawal@mirosoft.om Eri Chu * Computer Sienes Department University
More informationRegister Allocation III. Interference Graph Allocators. Coalescing. Granularity of Allocation (Renumber step in Briggs) Chaitin
Register Alloation III Last time Register alloation aross funtion alls Toay Register alloation options Interferene Graph Alloators Chaitin Briggs CS553 Leture Register Alloation III 1 CS553 Leture Register
More informationRouting Protocols for Wireless Ad Hoc Networks Hybrid routing protocols Theofanis Kilinkaridis
Routing Protools for Wireless Ad Ho Networks Hyrid routing protools Theofanis Kilinkaridis tkilinka@.hut.fi Astrat This paper presents a partiular group of routing protools that aim to omine the advantages
More informationIntroduction to Multiprocessors (Part II) Cristina Silvano Politecnico di Milano
Introduction to Multiprocessors (Part II) Cristina Silvano Politecnico di Milano Outline The problem of cache coherence Snooping protocols Directory-based protocols Prof. Cristina Silvano, Politecnico
More informationFast Elliptic Curve Algorithm of Embedded Mobile Equipment
Send Orders for Reprints to reprints@benthamsiene.net 8 The Open Eletrial & Eletroni Engineering Journal, 0, 7, 8-4 Fast Ellipti Curve Algorithm of Embedded Mobile Equipment Open Aess Lihong Zhang *, Shuqian
More informationDesign and Analysis of a Robust Pipelined Memory System
Design and Analysis of a obust Pipelined Memory System Hao ang 1 Haiquan (Chuk) Zhao 2 Bill Lin 1 Jun (Jim) Xu 2 1 Department of Eletrial and Computer Engineering, University of California, San Diego Email
More informationExploiting Enriched Contextual Information for Mobile App Classification
Exploiting Enrihed Contextual Information for Mobile App Classifiation Hengshu Zhu 1 Huanhuan Cao 2 Enhong Chen 1 Hui Xiong 3 Jilei Tian 2 1 University of Siene and Tehnology of China 2 Nokia Researh Center
More informationRegister Allocation III. Interference Graph Allocators. Computing the Interference Graph (in MiniJava compiler)
Register Alloation III Announements Reommen have interferene graph onstrution working by Monay Last leture Register alloation aross funtion alls Toay Register alloation options Interferene Graph Alloators
More informationQuery Evaluation Overview. Query Optimization: Chap. 15. Evaluation Example. Cost Estimation. Query Blocks. Query Blocks
Query Evaluation Overview Query Optimization: Chap. 15 CS634 Leture 12 SQL query first translated to relational algebra (RA) Atually, some additional operators needed for SQL Tree of RA operators, with
More informationAdobe Certified Associate
Adobe Certified Assoiate About the Adobe Certified Assoiate (ACA) Program The Adobe Certified Assoiate (ACA) program is for graphi designers, Web designers, video prodution designers, and digital professionals
More informationDesign Implications for Enterprise Storage Systems via Multi-Dimensional Trace Analysis
Design Impliations for Enterprise Storage Systems via Multi-Dimensional Trae Analysis Yanpei Chen, Kiran Srinivasan, Garth Goodson, Randy Katz University of California, Berkeley, NetApp In. {yhen2, randy}@ees.berkeley.edu,
More informationApproximate logic synthesis for error tolerant applications
Approximate logi synthesis for error tolerant appliations Doohul Shin and Sandeep K. Gupta Eletrial Engineering Department, University of Southern California, Los Angeles, CA 989 {doohuls, sandeep}@us.edu
More informationPROJECT PERIODIC REPORT
FP7-ICT-2007-1 Contrat no.: 215040 www.ative-projet.eu PROJECT PERIODIC REPORT Publishable Summary Grant Agreement number: ICT-215040 Projet aronym: Projet title: Enabling the Knowledge Powered Enterprise
More informationTime delay estimation of reverberant meeting speech: on the use of multichannel linear prediction
University of Wollongong Researh Online Faulty of Informatis - apers (Arhive) Faulty of Engineering and Information Sienes 7 Time delay estimation of reverberant meeting speeh: on the use of multihannel
More informationCA Release Automation 5.x Implementation Proven Professional Exam (CAT-600) Study Guide Version 1.1
Exam (CAT-600) Study Guide Version 1.1 PROPRIETARY AND CONFIDENTIAL INFORMATION 2016 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer use only. No unauthorized
More informationThe recursive decoupling method for solving tridiagonal linear systems
Loughborough University Institutional Repository The reursive deoupling method for solving tridiagonal linear systems This item was submitted to Loughborough University's Institutional Repository by the/an
More informationReal-Time Control for a Turbojet Engine
A Multiproessor mplementation of Real-Time Control for a Turbojet Engine Phillip L. Shaffer ABSTRACT: A real-time ontrol program for a turbojet engine has been implemented on a four-proessor omputer, ahieving
More informationA Dual-Hamiltonian-Path-Based Multicasting Strategy for Wormhole-Routed Star Graph Interconnection Networks
A Dual-Hamiltonian-Path-Based Multiasting Strategy for Wormhole-Routed Star Graph Interonnetion Networks Nen-Chung Wang Department of Information and Communiation Engineering Chaoyang University of Tehnology,
More informationDETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT
DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT 1 ZHANGGUO TANG, 2 HUANZHOU LI, 3 MINGQUAN ZHONG, 4 JIAN ZHANG 1 Institute of Computer Network and Communiation Tehnology,
More informationCA Unified Infrastructure Management 8.x Implementation Proven Professional Exam (CAT-540) Study Guide Version 1.1
Management 8.x Implementation Proven Professional Exam (CAT-540) Study Guide Version 1.1 PROPRIETARY AND CONFIDENTIAL INFORMATION 2017 CA. All rights reserved. CA onfidential & proprietary information.
More informationVerifying Interaction Protocol Compliance of Service Orchestrations
Verifying Interation Protool Compliane of Servie Orhestrations Andreas Shroeder and Philip Mayer Ludwig-Maximilians-Universität Münhen, Germany {shroeda, mayer}@pst.ifi.lmu.de Abstrat. An important aspet
More information5.2.1 Ant, indispensable Ant
5.2.1 Ant, indispensale Ant Apahe s Ant produt (http://ant.apahe.org/) is a uild tool that lets you easily ompile and test appliations (among other things). It is the de fato standard for uilding Java
More informationCA PPM 15.x Proven Implementation Professional Exam (CAT-223) Study Guide Version 1.3
CA PPM 15.x Proven Implementation Professional Exam (CAT-223) Study Guide Version 1.3 PROPRIETARY AND CONFIDENTIAL INFORMATION 2018 CA. All rights reserved. CA onfidential & proprietary information. For
More informationMultiple Assignments
Two Outputs Conneted Together Multiple Assignments Two Outputs Conneted Together if (En1) Q
More informationHigh Speed Area Efficient VLSI Architecture for DCT using Proposed CORDIC Algorithm
International Journal of Innovative Researh in Siene, Engineering and Tehnology Website: www.ijirset.om High Speed Area Effiient VLSI Arhiteture for DCT using Proposed CORDIC Algorithm Deepnarayan Sinha
More informationA Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering
A Novel Bit Level Time Series Representation with Impliation of Similarity Searh and lustering hotirat Ratanamahatana, Eamonn Keogh, Anthony J. Bagnall 2, and Stefano Lonardi Dept. of omputer Siene & Engineering,
More informationRuntime Support for OOLs Part II Comp 412
COMP 412 FALL 2017 Runtime Support for OOLs Part II Comp 412 soure IR Front End Optimizer Bak End IR target Copright 2017, Keith D. Cooper & Linda Torzon, all rights reserved. Students enrolled in Comp
More informationSpeeding up Consensus by Chasing Fast Decisions
Speeding up Consensus by Chasing Fast Deisions Balaji Arun, Sebastiano Peluso, Roberto Palmieri, Giuliano Losa, Binoy Ravindran ECE, Virginia Teh, USA {balajia,peluso,robertop,giuliano.losa,binoy}@vt.edu
More informationInternational Journal of Advancements in Research & Technology, Volume 3, Issue 3, March-2014 ISSN
International Journal of Advanements in Researh & Tehnology, Volume 3, Issue 3, Marh-204 ISSN 2278-773 47 Phrase Based Doument Retrieving y Comining Suffix Tree index data struture and Boyer- Moore faster
More informationCOMBINATION OF INTERSECTION- AND SWEPT-BASED METHODS FOR SINGLE-MATERIAL REMAP
Combination of intersetion- and swept-based methods for single-material remap 11th World Congress on Computational Mehanis WCCM XI) 5th European Conferene on Computational Mehanis ECCM V) 6th European
More informationDefinitions Homework. Quine McCluskey Optimal solutions are possible for some large functions Espresso heuristic. Definitions Homework
EECS 33 There be Dragons here http://ziyang.ees.northwestern.edu/ees33/ Teaher: Offie: Email: Phone: L477 Teh dikrp@northwestern.edu 847 467 2298 Today s material might at first appear diffiult Perhaps
More informationEfficient Implementation of Beam-Search Incremental Parsers
Effiient Implementation of Beam-Searh Inremental Parsers Yoav Goldberg Dept. of Computer Siene Bar-Ilan University Ramat Gan, Tel Aviv, 5290002 Israel yoav.goldberg@gmail.om Kai Zhao Liang Huang Graduate
More informationInterconnection Styles
Interonnetion tyles oftware Design Following the Export (erver) tyle 2 M1 M4 M5 4 M3 M6 1 3 oftware Design Following the Export (Client) tyle e 2 e M1 M4 M5 4 M3 M6 1 e 3 oftware Design Following the Export
More information次世代スーパーコンピュータ向け ファイルシステムについて
Gfarm シンポジウム 2018 次世代スーパーコンピュータ向け ファイルシステムについて Shinji Sumimoto, Ph.D. Next Generation Tehnial Computing Unit FUJITSU LIMITED Ot. 26 th, 2018 0 Outline of This Talk A64FX: High Performane Arm CPU Next Generation
More informationWORKSHOP 20 CREATING PCL FUNCTIONS
WORKSHOP 20 CREATING PCL FUNCTIONS WS20-1 WS20-2 Problem Desription This exerise involves reating two PCL funtions that an be used to easily hange the view of a model. The PCL funtions are reated by reording
More informationAutomatic Generation of Transaction-Level Models for Rapid Design Space Exploration
Automati Generation of Transation-Level Models for Rapid Design Spae Exploration Dongwan Shin, Andreas Gerstlauer, Junyu Peng, Rainer Dömer and Daniel D. Gajski Center for Embedded Computer Systems University
More informationRecursion examples: Problem 2. (More) Recursion and Lists. Tail recursion. Recursion examples: Problem 2. Recursion examples: Problem 3
Reursion eamples: Problem 2 (More) Reursion and s Reursive funtion to reverse a string publi String revstring(string str) { if(str.equals( )) return str; return revstring(str.substring(1, str.length()))
More informationMultiple-Criteria Decision Analysis: A Novel Rank Aggregation Method
3537 Multiple-Criteria Deision Analysis: A Novel Rank Aggregation Method Derya Yiltas-Kaplan Department of Computer Engineering, Istanbul University, 34320, Avilar, Istanbul, Turkey Email: dyiltas@ istanbul.edu.tr
More informationNONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION. Ken Sauer and Charles A. Bouman
NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION Ken Sauer and Charles A. Bouman Department of Eletrial Engineering, University of Notre Dame Notre Dame, IN 46556, (219) 631-6999 Shool of
More informationAn Edge-based Clustering Algorithm to Detect Social Circles in Ego Networks
JOURNAL OF COMPUTERS, VOL. 8, NO., OCTOBER 23 2575 An Edge-based Clustering Algorithm to Detet Soial Cirles in Ego Networks Yu Wang Shool of Computer Siene and Tehnology, Xidian University Xi an,77, China
More informationQuad copter Control Using Android Smartphone
International Journal of Researh (IJR) e-issn: 2348-6848, p- ISSN: 2348-795X Volume 3, Issue 05, Marh 2016 Available at http://internationaljournalofresearh.org Quad opter Control Using Android Smartphone
More informationPipelined Multipliers for Reconfigurable Hardware
Pipelined Multipliers for Reonfigurable Hardware Mithell J. Myjak and José G. Delgado-Frias Shool of Eletrial Engineering and Computer Siene, Washington State University Pullman, WA 99164-2752 USA {mmyjak,
More informationParallel Block-Layered Nonbinary QC-LDPC Decoding on GPU
Parallel Blok-Layered Nonbinary QC-LDPC Deoding on GPU Huyen Thi Pham, Sabooh Ajaz and Hanho Lee Department of Information and Communiation Engineering, Inha University, Inheon, 42-751, Korea Abstrat This
More informationMultiprocessor Cache Coherence. Chapter 5. Memory System is Coherent If... From ILP to TLP. Enforcing Cache Coherence. Multiprocessor Types
Chapter 5 Multiprocessor Cache Coherence Thread-Level Parallelism 1: read 2: read 3: write??? 1 4 From ILP to TLP Memory System is Coherent If... ILP became inefficient in terms of Power consumption Silicon
More informationLayout Compliance for Triple Patterning Lithography: An Iterative Approach
Layout Compliane for Triple Patterning Lithography: An Iterative Approah Bei Yu, Gilda Garreton, David Z. Pan ECE Dept. University of Texas at Austin, Austin, TX, USA Orale Las, Orale Corporation, Redwood
More information