Partition and Compose: Parallel Complex Event Processing. Martin Hirzel, IBM Research Tuesday, 17 July 2012 DEBS

Size: px
Start display at page:

Download "Partition and Compose: Parallel Complex Event Processing. Martin Hirzel, IBM Research Tuesday, 17 July 2012 DEBS"

Transcription

1 Partition and Compose: Parallel Complex Event Processing Martin Hirzel, IBM Research Tuesday, 17 July 2012 DEBS 1

2 ? CEP = Stream Processing? Event (Stream) Processing Aggregate Enrich Join Filter Parse Complex Event Processing Use pattern over simple events to detect and report composite events CEP as an in a streaming language? 2

3 Background: SPL IBM Streams Processing Language SPL is the language for InfoSphere Streams (IBM Product) This paper is based on System S = research branch of InfoSphere Streams 3

4 Scenario: Financial analysis Series of rising peaks and troughs Deep drop below start of match M-shape (double-top) stock pattern Source: 4

5 M-Shape pattern in SPL Composite events Simple events Regular expression Key Aggregation Operator only, no extensions to SPL syntax 5

6 Regular expressions Pattern language familiar from string matching 6

7 Aggregations Operator-specific intrinsic functions 7

8 Matching semantics Standard regular expression semantics Non-greedy (right-minimal) Partition-isolated (Partition-)Contiguous Non-overlapping (submit longest: left-maximal) 8

9 Implementation overview MatchRegex invocation Upstream instance param, output Simple events MatchRegex generator MatchRegex instance Automaton Composite events At compile-time At runtime Downstream instance All C++ s in SPL are code generators 9

10 Automaton. rise+ drop+ rise+ drop* deep Update and filter partial match Create new partial match rise 2 rise rise 0. 1 drop 3 drop rise 4 deep drop 5 6 deep drop Report completed match and flush NFA (non-deterministic finite automaton) 10

11 Partitioning :SimpleEvent :PartitionMap ts symbol price size seqnum key 0..* :PartialMatch state aggr rise 2 rise rise 0. 1 drop 3 drop rise drop 4 5 drop deep deep 6 11

12 Generated C++ code Incremental aggregation rise 2 rise rise 0. 1 drop 3 drop rise drop 4 5 drop deep deep 6 12

13 Paralleli- zation Schneider et al. [PACT 12] Up-stream Simple events :SimpleEvent symbol Composite MatchRegex events PartitionMap key Down-stream Parallelize Simple events MatchRegex (replica 0) PartitionMap Composite events Up-stream key for hash-split :SimpleEvent symbol key for partition map MatchRegex (replica 1) PartitionMap MatchRegex (replica 2) PartitionMap Down-stream 13

14 Safety and determinism SPL compiler checks Syntax and names in expressions Expression and function types MatchRegex checks Syntax and names in regular expression pattern Starting predicate aggregation-free Auto-parallelizer checks Partitioning Absence of stateful expressions Sequence numbers and pulses Enables simple output validation with diff 14

15 Data sets and benchmarks 15

16 Absolute throughput in events per second Large speedup when low sequential throughput 16

17 Speedups 1 Machine x 8 Cores 4 Machines x 8 Cores = 32 Motivates elasticity and auto-width controller 17

18 Related work 2000 today Engine / language Complex events Parallelism NiagaraCQ / XML-QL Algebraic No SQL-TS Back-tracking No Amit Back-tracking No NFA b / SASE Automaton No MATCH_RECOGNIZE ANSI proposal No EventScript Automaton No Cayuga / CEL Automaton Yes, by hand EventJava Index data structures Yes, per task [Woods,Teubner VLDB] Automaton Yes, on FPGA This paper Automaton Yes, partitioned 18

19 Conclusions CEP as an SPL Use CEP for pattern matching Use other s for filtering, enrichment, parsing, joining, etc. Up to 830K events/second Incremental aggregation C++ code generation Parallelism (up to 14x speedup) 19

20 Backup 20

21 Shuffle in twitter02 and twitter03 ParseTweet (replica 0) MatchRegex (replica 0) Source ParseTweet (replica 1) MatchRegex (replica 1) Down-stream Raw tweets as XML documents ParseTweet (replica 2) Tweets as simple events MatchRegex (replica 2) Composite events 21

Partition and Compose: Parallel Complex Event Processing

Partition and Compose: Parallel Complex Event Processing Partition and Compose: Parallel Complex Event Processing Martin Hirzel IBM T.J. Watson Research Center hirzel@us.ibm.com ABSTRACT Complex event processing uses patterns to detect composite in streams of

More information

Extending a General-Purpose Streaming System for XML

Extending a General-Purpose Streaming System for XML Extending a General-Purpose Streaming System for XML Mark Mendell, Howard Nasgaard Eric Bouillet Martin Hirzel, Bugra Gedik IBM Canada IBM Ireland IBM USA EDBT 2012 1 General-Purpose Streaming System Streams

More information

Parallel Query Optimisation

Parallel Query Optimisation Parallel Query Optimisation Contents Objectives of parallel query optimisation Parallel query optimisation Two-Phase optimisation One-Phase optimisation Inter-operator parallelism oriented optimisation

More information

Testing Properties of Dataflow Program Operators

Testing Properties of Dataflow Program Operators Testing Properties of Dataflow Program Operators Zhihong Xu Martin Hirzel Gregg Rothermel Kun-Lung Wu U. of Nebraska IBM Research U. of Nebraska IBM Research Presentation at ASE, November 2013 Big Data

More information

State-based Event Detection Optimization for Complex Event Processing

State-based Event Detection Optimization for Complex Event Processing Sensors & Transducers 214 by IFSA Publishing, S. L. http://www.sensorsportal.com State-based Event Detection Optimization for Complex Event Processing 1 Shanglian PENG, 2 Haoxia LIU, 3 Xiaolin GUO, 1 Jia

More information

Lazy Evaluation Methods for Detecting Complex Events

Lazy Evaluation Methods for Detecting Complex Events Lazy Evaluation Methods for Detecting Complex Events Ilya Kolchinsky Technion, Israel Institute of Technology Izchak Sharfman Technion, Israel Institute of Technology Assaf Schuster Technion, Israel Institute

More information

Transactum Business Process Manager with High-Performance Elastic Scaling. November 2011 Ivan Klianev

Transactum Business Process Manager with High-Performance Elastic Scaling. November 2011 Ivan Klianev Transactum Business Process Manager with High-Performance Elastic Scaling November 2011 Ivan Klianev Transactum BPM serves three primary objectives: To make it possible for developers unfamiliar with distributed

More information

DEEP DIVE INTO 12c MATCH_RECOGNIZE

DEEP DIVE INTO 12c MATCH_RECOGNIZE DEEP DIVE INTO 12c MATCH_RECOGNIZE Getting deep inside the SQL pattern matching process in Database 12c @ASQLBarista oracle-big-data.blogspot.com Last Updated June 2017 Version 12 Who Am I Keith Laker

More information

Semantic Optimization of Preference Queries

Semantic Optimization of Preference Queries Semantic Optimization of Preference Queries Jan Chomicki University at Buffalo http://www.cse.buffalo.edu/ chomicki 1 Querying with Preferences Find the best answers to a query, instead of all the answers.

More information

Hardware-accelerated regular expression matching with overlap handling on IBM PowerEN processor

Hardware-accelerated regular expression matching with overlap handling on IBM PowerEN processor Kubilay Atasu IBM Research Zurich 23 May 2013 Hardware-accelerated regular expression matching with overlap handling on IBM PowerEN processor Kubilay Atasu, Florian Doerfler, Jan van Lunteren, and Christoph

More information

Towards a Universal Stream Processing System Robert Soulé Cornell University

Towards a Universal Stream Processing System Robert Soulé Cornell University 1 Towards a Universal Stream Processing System Robert Soulé Cornell University 2 Data Crisis 2.5 quintillion bytes every day 90% of the world s data was created in the last 2 years 3 How Big is Your Data?

More information

Systems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2012/13

Systems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2012/13 Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2012/13 Data Stream Processing Topics Model Issues System Issues Distributed Processing Web-Scale Streaming 3 Data Streams Continuous

More information

Spreadsheets for Stream Processing with Unbounded Windows and Partitions

Spreadsheets for Stream Processing with Unbounded Windows and Partitions Spreadsheets for Stream Processing with Unbounded Windows and Partitions Martin Hirzel Rodric Rabbah Philippe Suter Olivier Tardieu Mandana Vaziri IBM T.J. Watson Research Center {hirzel,rabbah,psuter,tardieu,mvaziri}@us.ibm.com

More information

Challenges in Data Stream Processing

Challenges in Data Stream Processing Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Challenges in Data Stream Processing Corso di Sistemi e Architetture per Big Data A.A. 2016/17 Valeria

More information

Viper: Communication-Layer Determinism and Scaling in Low-Latency Stream Processing

Viper: Communication-Layer Determinism and Scaling in Low-Latency Stream Processing Viper: Communication-Layer Determinism and Scaling in Low-Latency Stream Processing Ivan Walulya, Yiannis Nikolakopoulos, Vincenzo Gulisano Marina Papatriantafilou and Philippas Tsigas Auto-DaSP 2017 Chalmers

More information

HYRISE In-Memory Storage Engine

HYRISE In-Memory Storage Engine HYRISE In-Memory Storage Engine Martin Grund 1, Jens Krueger 1, Philippe Cudre-Mauroux 3, Samuel Madden 2 Alexander Zeier 1, Hasso Plattner 1 1 Hasso-Plattner-Institute, Germany 2 MIT CSAIL, USA 3 University

More information

SPL: An Extensible Language for Distributed Stream Processing

SPL: An Extensible Language for Distributed Stream Processing SPL: An Extensible Language for Distributed Stream Processing MARTIN HIRZEL and SCOTT SCHNEIDER, IBM Thomas J. Watson Research Center BUĞRA GEDİK, Bilkent University Big data is revolutionizing how all

More information

Imperial College of Science, Technology and Medicine Department of Computing. Distributing Complex Event Detection Kristian A.

Imperial College of Science, Technology and Medicine Department of Computing. Distributing Complex Event Detection Kristian A. Imperial College of Science, Technology and Medicine Department of Computing Distributing Complex Event Detection Kristian A. Nagy MEng Individual project report Supervised by Dr. Peter Pietzuch June 2012

More information

High-Performance Event Processing Bridging the Gap between Low Latency and High Throughput Bernhard Seeger University of Marburg

High-Performance Event Processing Bridging the Gap between Low Latency and High Throughput Bernhard Seeger University of Marburg High-Performance Event Processing Bridging the Gap between Low Latency and High Throughput Bernhard Seeger University of Marburg common work with Nikolaus Glombiewski, Michael Körber, Marc Seidemann 1.

More information

Language Support for Stream. Processing. Robert Soulé

Language Support for Stream. Processing. Robert Soulé Language Support for Stream Processing Robert Soulé Introduction Proposal Work Plan Outline Formalism Translators & Abstractions Optimizations Related Work Conclusion 2 Introduction Stream processing is

More information

Extending XQuery with Window Functions

Extending XQuery with Window Functions Extending XQuery with Window Functions Tim Kraska ETH Zurich September 25, 2007 Motivation XML is the data format for communication data (RSS, Atom, Web Services) meta data, logs (XMI, schemas, config

More information

Efficient Pattern Matching over Event Streams

Efficient Pattern Matching over Event Streams Efficient Pattern Matching over Event Streams Jagrati Agrawal, Yanlei Diao, Daniel Gyllstrom, and Neil Immerman Department of Computer Science University of Massachusetts Amherst Amherst, MA, USA {jagrati,

More information

Towards Expressive Publish/Subscribe Systems

Towards Expressive Publish/Subscribe Systems Towards Expressive Publish/Subscribe Systems Alan Demers, Johannes Gehrke, Mingsheng Hong, Mirek Riedewald, and Walker White Cornell University, Department of Computer Science {ademers,johannes,mshong,mirek,wmwhite}@cs.cornell.edu

More information

Building User-defined Runtime Adaptation Routines for Stream Processing Applications

Building User-defined Runtime Adaptation Routines for Stream Processing Applications Building User-defined Runtime Adaptation Routines for Stream Processing Applications Gabriela Jacques-Silva, Buğra Gedik, Rohit Wagle, Kun-Lung Wu, Vibhore Kumar Thomas J. Watson Research Center, IBM Research,

More information

Spring 2017 EXTERNAL SORTING (CH. 13 IN THE COW BOOK) 2/7/17 CS 564: Database Management Systems; (c) Jignesh M. Patel,

Spring 2017 EXTERNAL SORTING (CH. 13 IN THE COW BOOK) 2/7/17 CS 564: Database Management Systems; (c) Jignesh M. Patel, Spring 2017 EXTERNAL SORTING (CH. 13 IN THE COW BOOK) 2/7/17 CS 564: Database Management Systems; (c) Jignesh M. Patel, 2013 1 Motivation for External Sort Often have a large (size greater than the available

More information

Computation and Communication Scalable Algorithms I. Daniel Keren, Arnon Lazerson, Mickey Gabel, Ilya Kolchinsky, Tsachi Sharfman, Assaf Schuster

Computation and Communication Scalable Algorithms I. Daniel Keren, Arnon Lazerson, Mickey Gabel, Ilya Kolchinsky, Tsachi Sharfman, Assaf Schuster Project Deliverable D6.2 Distribution Scalable Data Analytics Scalable Algorithms, Software Frameworks and Visualisation ICT-2013.4.2a FP7-619435 / SPEEDD Public http://speedd-project.eu/ Computation and

More information

YFilter: an XML Stream Filtering Engine. Weiwei SUN University of Konstanz

YFilter: an XML Stream Filtering Engine. Weiwei SUN University of Konstanz YFilter: an XML Stream Filtering Engine Weiwei SUN University of Konstanz 1 Introduction Data exchange between applications: use XML Messages processed by an XML Message Broker Examples Publish/subscribe

More information

Towards a Universal Stream Processing Platform

Towards a Universal Stream Processing Platform 1 Towards a Universal Stream Processing Platform Robert Soulé, Martin Hirzel, Robert Grimm, Buğra Gedik New York University and IBM Research 2 Streaming Is Everywhere 3 Streaming Needs Languages Developers

More information

81067AE Development Environment Introduction in Microsoft

81067AE Development Environment Introduction in Microsoft Microsoft Course Modules for Microsoft Training Online: 1. Development Environment Lesson 1: Object Designer. Lesson 2: 7 Objects & The Logical Database. Lesson 3: Managing Objects. Lesson 4: Properties

More information

Ti Parallel Computing PIPELINING. Michał Roziecki, Tomáš Cipr

Ti Parallel Computing PIPELINING. Michał Roziecki, Tomáš Cipr Ti5317000 Parallel Computing PIPELINING Michał Roziecki, Tomáš Cipr 2005-2006 Introduction to pipelining What is this What is pipelining? Pipelining is an implementation technique in which multiple instructions

More information

What happens. 376a. Database Design. Execution strategy. Query conversion. Next. Two types of techniques

What happens. 376a. Database Design. Execution strategy. Query conversion. Next. Two types of techniques 376a. Database Design Dept. of Computer Science Vassar College http://www.cs.vassar.edu/~cs376 Class 16 Query optimization What happens Database is given a query Query is scanned - scanner creates a list

More information

Joint Entity Resolution

Joint Entity Resolution Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute

More information

Computation and Communication Scalable Algorithms II. Daniel Keren, Mickey Gabel, Ilya Kolchinsky, Assaf Schuster

Computation and Communication Scalable Algorithms II. Daniel Keren, Mickey Gabel, Ilya Kolchinsky, Assaf Schuster Project Deliverable D6.4 Distribution Scalable Data Analytics Scalable Algorithms, Software Frameworks and Visualisation ICT-2013.4.2a FP7-619435 / SPEEDD Public http://speedd-project.eu/ Computation and

More information

Extending XQuery with Window Functions

Extending XQuery with Window Functions Extending XQuery with Window Functions Irina Botan, Peter M. Fischer, Dana Florescu*, Donald Kossmann, Tim Kraska, Rokas Tamosevicius ETH Zurich, Oracle* Elevator Pitch Version of this Talk XQuery can

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe CHAPTER 19 Query Optimization Introduction Query optimization Conducted by a query optimizer in a DBMS Goal: select best available strategy for executing query Based on information available Most RDBMSs

More information

IQ for DNA. Interactive Query for Dynamic Network Analytics. Haoyu Song. HUAWEI TECHNOLOGIES Co., Ltd.

IQ for DNA. Interactive Query for Dynamic Network Analytics. Haoyu Song.   HUAWEI TECHNOLOGIES Co., Ltd. IQ for DNA Interactive Query for Dynamic Network Analytics Haoyu Song www.huawei.com Motivation Service Provider s pain point Lack of real-time and full visibility of networks, so the network monitoring

More information

Storm. Distributed and fault-tolerant realtime computation. Nathan Marz Twitter

Storm. Distributed and fault-tolerant realtime computation. Nathan Marz Twitter Storm Distributed and fault-tolerant realtime computation Nathan Marz Twitter Storm at Twitter Twitter Web Analytics Before Storm Queues Workers Example (simplified) Example Workers schemify tweets and

More information

Multi-threaded Queries. Intra-Query Parallelism in LLVM

Multi-threaded Queries. Intra-Query Parallelism in LLVM Multi-threaded Queries Intra-Query Parallelism in LLVM Multithreaded Queries Intra-Query Parallelism in LLVM Yang Liu Tianqi Wu Hao Li Interpreted vs Compiled (LLVM) Interpreted vs Compiled (LLVM) Interpreted

More information

Comprehensive Guide to Evaluating Event Stream Processing Engines

Comprehensive Guide to Evaluating Event Stream Processing Engines Comprehensive Guide to Evaluating Event Stream Processing Engines i Copyright 2006 Coral8, Inc. All rights reserved worldwide. Worldwide Headquarters: Coral8, Inc. 82 Pioneer Way, Suite 106 Mountain View,

More information

Impala. A Modern, Open Source SQL Engine for Hadoop. Yogesh Chockalingam

Impala. A Modern, Open Source SQL Engine for Hadoop. Yogesh Chockalingam Impala A Modern, Open Source SQL Engine for Hadoop Yogesh Chockalingam Agenda Introduction Architecture Front End Back End Evaluation Comparison with Spark SQL Introduction Why not use Hive or HBase?

More information

Real-time data processing with Apache Flink

Real-time data processing with Apache Flink Real-time data processing with Apache Flink Gyula Fóra gyfora@apache.org Flink committer Swedish ICT Stream processing Data stream: Infinite sequence of data arriving in a continuous fashion. Stream processing:

More information

Tutorial: Stream Processing Optimizations

Tutorial: Stream Processing Optimizations Tutorial: Stream Processing Optimizations Scott Schneider IM Watson Research Center, Yorktown Heights, NY, US scott.a.s@us.ibm.com Martin Hirzel IM Watson Research Center, Yorktown Heights, NY, US hirzel@us.ibm.com

More information

Towards Expressive Publish/Subscribe Systems

Towards Expressive Publish/Subscribe Systems Towards Expressive Publish/Subscribe Systems Alan Demers, Johannes Gehrke, Mingsheng Hong, Mirek Riedewald, and Walker White Cornell University, Department of Computer Science {ademers, johannes, mshong,

More information

How VoltDB does Transactions

How VoltDB does Transactions TEHNIL NOTE How does Transactions Note to readers: Some content about read-only transaction coordination in this document is out of date due to changes made in v6.4 as a result of Jepsen testing. Updates

More information

Apache Flink- A System for Batch and Realtime Stream Processing

Apache Flink- A System for Batch and Realtime Stream Processing Apache Flink- A System for Batch and Realtime Stream Processing Lecture Notes Winter semester 2016 / 2017 Ludwig-Maximilians-University Munich Prof Dr. Matthias Schubert 2016 Introduction to Apache Flink

More information

Heckaton. SQL Server's Memory Optimized OLTP Engine

Heckaton. SQL Server's Memory Optimized OLTP Engine Heckaton SQL Server's Memory Optimized OLTP Engine Agenda Introduction to Hekaton Design Consideration High Level Architecture Storage and Indexing Query Processing Transaction Management Transaction Durability

More information

ActiveSheets. Stream Processing with a Spreadsheet ECOOP 14

ActiveSheets. Stream Processing with a Spreadsheet ECOOP 14 Stream Processing with a Spreadsheet ECOOP 14 Mandana Vaziri, Olivier Tardieu, Rodric Rabbah, Philippe Suter, Martin Hirzel IBM T.J. Watson Research Center Introduction Continuous data streams Domains:

More information

Impala Intro. MingLi xunzhang

Impala Intro. MingLi xunzhang Impala Intro MingLi xunzhang Overview MPP SQL Query Engine for Hadoop Environment Designed for great performance BI Connected(ODBC/JDBC, Kerberos, LDAP, ANSI SQL) Hadoop Components HDFS, HBase, Metastore,

More information

Module 4. Implementation of XQuery. Part 0: Background on relational query processing

Module 4. Implementation of XQuery. Part 0: Background on relational query processing Module 4 Implementation of XQuery Part 0: Background on relational query processing The Data Management Universe Lecture Part I Lecture Part 2 2 What does a Database System do? Input: SQL statement Output:

More information

XML Data Stream Processing: Extensions to YFilter

XML Data Stream Processing: Extensions to YFilter XML Data Stream Processing: Extensions to YFilter Shaolei Feng and Giridhar Kumaran January 31, 2007 Abstract Running XPath queries on XML data steams is a challenge. Current approaches that store the

More information

NiagaraCQ: A Scalable Continuous Query System for Internet Databases

NiagaraCQ: A Scalable Continuous Query System for Internet Databases NiagaraCQ: A Scalable Continuous Query System for Internet Databases Jianjun Chen David J. DeWitt Feng Tian Yuan Wang Computer Sciences Department University of Wisconsin-Madison {jchen, dewitt, ftian,

More information

COURSE: DATA STRUCTURES USING C & C++ CODE: 05BMCAR17161 CREDITS: 05

COURSE: DATA STRUCTURES USING C & C++ CODE: 05BMCAR17161 CREDITS: 05 COURSE: DATA STRUCTURES USING C & C++ CODE: 05BMCAR17161 CREDITS: 05 Unit 1 : LINEAR DATA STRUCTURES Introduction - Abstract Data Types (ADT), Arrays and its representation Structures, Stack, Queue, Circular

More information

Red Fox: An Execution Environment for Relational Query Processing on GPUs

Red Fox: An Execution Environment for Relational Query Processing on GPUs Red Fox: An Execution Environment for Relational Query Processing on GPUs Haicheng Wu 1, Gregory Diamos 2, Tim Sheard 3, Molham Aref 4, Sean Baxter 2, Michael Garland 2, Sudhakar Yalamanchili 1 1. Georgia

More information

Big Data. Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich

Big Data. Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich Big Data Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich Data Stream Processing Overview Introduction Models and languages Query processing systems Distributed stream processing Real-time analytics

More information

Concurrent execution of an analytical workload on a POWER8 server with K40 GPUs A Technology Demonstration

Concurrent execution of an analytical workload on a POWER8 server with K40 GPUs A Technology Demonstration Concurrent execution of an analytical workload on a POWER8 server with K40 GPUs A Technology Demonstration Sina Meraji sinamera@ca.ibm.com Berni Schiefer schiefer@ca.ibm.com Tuesday March 17th at 12:00

More information

Faster FAST Multicore Acceleration of

Faster FAST Multicore Acceleration of Faster FAST Multicore Acceleration of Streaming Financial Data Virat Agarwal, David A. Bader, Lin Dan, Lurng-Kuo Liu, Davide Pasetto, Michael Perrone, Fabrizio Petrini Financial Market Finance can be defined

More information

SQL Server 2014 Performance Tuning and Optimization

SQL Server 2014 Performance Tuning and Optimization SQL Server 2014 Performance Tuning and Optimization 55144B; 5 Days, Instructor-led Course Description This course is designed to give the right amount of Internals knowledge, and wealth of practical tuning

More information

Efficient Event Processing through Reconfigurable Hardware for Algorithmic Trading. University of Toronto

Efficient Event Processing through Reconfigurable Hardware for Algorithmic Trading. University of Toronto Efficient Event Processing through Reconfigurable Hardware for Algorithmic Trading Martin Labrecque Harsh Singh Warren Shum Hans-Arno Jacobsen University of Toronto Algorithm Trading Examples of Financial

More information

VASim: An Open Virtual Automata Simulator for Automata Processing Research

VASim: An Open Virtual Automata Simulator for Automata Processing Research University of Virginia Technical Report #CS2016-03 VASim: An Open Virtual Automata Simulator for Automata Processing Research J. Wadden 1, K. Skadron 1 We present VASim, an open, extensible virtual automata

More information

HadoopDB: An open source hybrid of MapReduce

HadoopDB: An open source hybrid of MapReduce HadoopDB: An open source hybrid of MapReduce and DBMS technologies Azza Abouzeid, Kamil Bajda-Pawlikowski Daniel J. Abadi, Avi Silberschatz Yale University http://hadoopdb.sourceforge.net October 2, 2009

More information

CSE 544: Principles of Database Systems

CSE 544: Principles of Database Systems CSE 544: Principles of Database Systems Anatomy of a DBMS, Parallel Databases 1 Announcements Lecture on Thursday, May 2nd: Moved to 9am-10:30am, CSE 403 Paper reviews: Anatomy paper was due yesterday;

More information

7. Query Processing and Optimization

7. Query Processing and Optimization 7. Query Processing and Optimization Processing a Query 103 Indexing for Performance Simple (individual) index B + -tree index Matching index scan vs nonmatching index scan Unique index one entry and one

More information

55144-SQL Server 2014 Performance Tuning and Optimization

55144-SQL Server 2014 Performance Tuning and Optimization 55144-SQL Server 2014 Performance Tuning and Optimization Course Number: M55144 Category: Technical - Microsoft Duration: 5 day Overview This course is designed to give the right amount of Internals knowledge,

More information

Designing for Performance. Patrick Happ Raul Feitosa

Designing for Performance. Patrick Happ Raul Feitosa Designing for Performance Patrick Happ Raul Feitosa Objective In this section we examine the most common approach to assessing processor and computer system performance W. Stallings Designing for Performance

More information

Scalable Streaming Analytics

Scalable Streaming Analytics Scalable Streaming Analytics KARTHIK RAMASAMY @karthikz TALK OUTLINE BEGIN I! II ( III b Overview Storm Overview Storm Internals IV Z V K Heron Operational Experiences END WHAT IS ANALYTICS? according

More information

Formal Languages and Compilers Lecture VI: Lexical Analysis

Formal Languages and Compilers Lecture VI: Lexical Analysis Formal Languages and Compilers Lecture VI: Lexical Analysis Free University of Bozen-Bolzano Faculty of Computer Science POS Building, Room: 2.03 artale@inf.unibz.it http://www.inf.unibz.it/ artale/ Formal

More information

Dynamic Expressivity with Static Optimization for Streaming Languages

Dynamic Expressivity with Static Optimization for Streaming Languages Dynamic Expressivity with Static Optimization for Streaming Languages Robert Soulé Michael I. Gordon Saman marasinghe Robert Grimm Martin Hirzel ornell MIT MIT NYU IM DES 2013 1 Stream (FIFO queue) Operator

More information

Oracle Event Processing Extreme Performance on Sparc T5

Oracle Event Processing Extreme Performance on Sparc T5 Oracle Event Processing Extreme Performance on Sparc T5 An Oracle Event Processing (OEP) Whitepaper ORACLE WHITE PAPER AUGUST 2014 Table of Contents Introduction 2 OEP Architecture 2 Server Architecture

More information

Hardware Acceleration of Database Operations

Hardware Acceleration of Database Operations Hardware Acceleration of Database Operations Jared Casper and Kunle Olukotun Pervasive Parallelism Laboratory Stanford University Database machines n Database machines from late 1970s n Put some compute

More information

April Copyright 2013 Cloudera Inc. All rights reserved.

April Copyright 2013 Cloudera Inc. All rights reserved. Hadoop Beyond Batch: Real-time Workloads, SQL-on- Hadoop, and the Virtual EDW Headline Goes Here Marcel Kornacker marcel@cloudera.com Speaker Name or Subhead Goes Here April 2014 Analytic Workloads on

More information

ait: WORST-CASE EXECUTION TIME PREDICTION BY STATIC PROGRAM ANALYSIS

ait: WORST-CASE EXECUTION TIME PREDICTION BY STATIC PROGRAM ANALYSIS ait: WORST-CASE EXECUTION TIME PREDICTION BY STATIC PROGRAM ANALYSIS Christian Ferdinand and Reinhold Heckmann AbsInt Angewandte Informatik GmbH, Stuhlsatzenhausweg 69, D-66123 Saarbrucken, Germany info@absint.com

More information

MapReduce Spark. Some slides are adapted from those of Jeff Dean and Matei Zaharia

MapReduce Spark. Some slides are adapted from those of Jeff Dean and Matei Zaharia MapReduce Spark Some slides are adapted from those of Jeff Dean and Matei Zaharia What have we learnt so far? Distributed storage systems consistency semantics protocols for fault tolerance Paxos, Raft,

More information

Compiling Path Queries

Compiling Path Queries Compiling Path Queries Princeton University Srinivas Narayana Mina Tahmasbi Jen Rexford David Walker Management = Measure + Control Network Controller Measure Control Software-Defined Networking (SDN)

More information

PSD3A Principles of Compiler Design Unit : I-V. PSD3A- Principles of Compiler Design

PSD3A Principles of Compiler Design Unit : I-V. PSD3A- Principles of Compiler Design PSD3A Principles of Compiler Design Unit : I-V 1 UNIT I - SYLLABUS Compiler Assembler Language Processing System Phases of Compiler Lexical Analyser Finite Automata NFA DFA Compiler Tools 2 Compiler -

More information

Hybrid Shipping Architectures: A Survey

Hybrid Shipping Architectures: A Survey Hybrid Shipping Architectures: A Survey Ivan Bowman itbowman@acm.org http://plg.uwaterloo.ca/~itbowman CS748T 14 Feb 2000 Outline Partitioning query processing Partitioning client code Optimization of

More information

Intermediate Representations

Intermediate Representations Compiler Design 1 Intermediate Representations Compiler Design 2 Front End & Back End The portion of the compiler that does scanning, parsing and static semantic analysis is called the front-end. The translation

More information

PASS4TEST. IT Certification Guaranteed, The Easy Way! We offer free update service for one year

PASS4TEST. IT Certification Guaranteed, The Easy Way!   We offer free update service for one year PASS4TEST \ http://www.pass4test.com We offer free update service for one year Exam : C2090-303 Title : IBM InfoSphere DataStage v9.1 Vendors : IBM Version : DEMO Get Latest & Valid C2090-303 Exam's Question

More information

Segregating Data Within Databases for Performance Prepared by Bill Hulsizer

Segregating Data Within Databases for Performance Prepared by Bill Hulsizer Segregating Data Within Databases for Performance Prepared by Bill Hulsizer When designing databases, segregating data within tables is usually important and sometimes very important. The higher the volume

More information

DATABASES AND THE CLOUD. Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland

DATABASES AND THE CLOUD. Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland DATABASES AND THE CLOUD Gustavo Alonso Systems Group / ECC Dept. of Computer Science ETH Zürich, Switzerland AVALOQ Conference Zürich June 2011 Systems Group www.systems.ethz.ch Enterprise Computing Center

More information

Jaql. Kevin Beyer, Vuk Ercegovac, Eugene Shekita, Jun Rao, Ning Li, Sandeep Tata. IBM Almaden Research Center

Jaql. Kevin Beyer, Vuk Ercegovac, Eugene Shekita, Jun Rao, Ning Li, Sandeep Tata. IBM Almaden Research Center Jaql Running Pipes in the Clouds Kevin Beyer, Vuk Ercegovac, Eugene Shekita, Jun Rao, Ning Li, Sandeep Tata IBM Almaden Research Center http://code.google.com/p/jaql/ 2009 IBM Corporation Motivating Scenarios

More information

A modern programming language focused on integration

A modern programming language focused on integration A modern programming language focused on integration Sanjiva Weerawarana, Ph.D. Founder, Chairman & Chief Architect; WSO2 James Clark Independent Joint work with Sameera Jayasoma, WSO2; Hasitha Aravinda,

More information

Topic 1, Volume A QUESTION NO: 1 In your ETL application design you have found several areas of common processing requirements in the mapping specific

Topic 1, Volume A QUESTION NO: 1 In your ETL application design you have found several areas of common processing requirements in the mapping specific Vendor: IBM Exam Code: C2090-303 Exam Name: IBM InfoSphere DataStage v9.1 Version: Demo Topic 1, Volume A QUESTION NO: 1 In your ETL application design you have found several areas of common processing

More information

Scaling for Humongous amounts of data with MongoDB

Scaling for Humongous amounts of data with MongoDB Scaling for Humongous amounts of data with MongoDB Alvin Richards Technical Director, EMEA alvin@10gen.com @jonnyeight alvinonmongodb.com From here... http://bit.ly/ot71m4 ...to here... http://bit.ly/oxcsis

More information

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Copyright 2012, Oracle and/or its affiliates. All rights reserved. 1 Oracle Partitioning für Einsteiger Hermann Bär Partitioning Produkt Management 2 Disclaimer The goal is to establish a basic understanding of what can be done with Partitioning I want you to start thinking

More information

Exploiting the OpenPOWER Platform for Big Data Analytics and Cognitive. Rajesh Bordawekar and Ruchir Puri IBM T. J. Watson Research Center

Exploiting the OpenPOWER Platform for Big Data Analytics and Cognitive. Rajesh Bordawekar and Ruchir Puri IBM T. J. Watson Research Center Exploiting the OpenPOWER Platform for Big Data Analytics and Cognitive Rajesh Bordawekar and Ruchir Puri IBM T. J. Watson Research Center 3/17/2015 2014 IBM Corporation Outline IBM OpenPower Platform Accelerating

More information

Kafka Streams: Hands-on Session A.A. 2017/18

Kafka Streams: Hands-on Session A.A. 2017/18 Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Kafka Streams: Hands-on Session A.A. 2017/18 Matteo Nardelli Laurea Magistrale in Ingegneria Informatica

More information

8 Introduction to Distributed Computing

8 Introduction to Distributed Computing CME 323: Distributed Algorithms and Optimization, Spring 2017 http://stanford.edu/~rezab/dao. Instructor: Reza Zadeh, Matroid and Stanford. Lecture 8, 4/26/2017. Scribed by A. Santucci. 8 Introduction

More information

Control Hazards. Prediction

Control Hazards. Prediction Control Hazards The nub of the problem: In what pipeline stage does the processor fetch the next instruction? If that instruction is a conditional branch, when does the processor know whether the conditional

More information

C# 6.0 in a nutshell / Joseph Albahari & Ben Albahari. 6th ed. Beijin [etc.], cop Spis treści

C# 6.0 in a nutshell / Joseph Albahari & Ben Albahari. 6th ed. Beijin [etc.], cop Spis treści C# 6.0 in a nutshell / Joseph Albahari & Ben Albahari. 6th ed. Beijin [etc.], cop. 2016 Spis treści Preface xi 1. Introducing C# and the.net Framework 1 Object Orientation 1 Type Safety 2 Memory Management

More information

Lecture Notes to Big Data Management and Analytics Winter Term 2017/2018 Apache Flink

Lecture Notes to Big Data Management and Analytics Winter Term 2017/2018 Apache Flink Lecture Notes to Big Data Management and Analytics Winter Term 2017/2018 Apache Flink Matthias Schubert, Matthias Renz, Felix Borutta, Evgeniy Faerman, Christian Frey, Klaus Arthur Schmid, Daniyal Kazempour,

More information

BP-Mon: Monitoring Business Processes with Queries

BP-Mon: Monitoring Business Processes with Queries BP-Mon: Monitoring Business Processes with Queries Catriel Beeri, Anat Eyal, Tova Milo, Alon Pilberg Hebrew University Tel-Aviv University Outline Introduction to Business Processes Overview of BP-Mon

More information

High-Performance Holistic XML Twig Filtering Using GPUs. Ildar Absalyamov, Roger Moussalli, Walid Najjar and Vassilis Tsotras

High-Performance Holistic XML Twig Filtering Using GPUs. Ildar Absalyamov, Roger Moussalli, Walid Najjar and Vassilis Tsotras High-Performance Holistic XML Twig Filtering Using GPUs Ildar Absalyamov, Roger Moussalli, Walid Najjar and Vassilis Tsotras Outline! Motivation! XML filtering in the literature! Software approaches! Hardware

More information

HANA Performance. Efficient Speed and Scale-out for Real-time BI

HANA Performance. Efficient Speed and Scale-out for Real-time BI HANA Performance Efficient Speed and Scale-out for Real-time BI 1 HANA Performance: Efficient Speed and Scale-out for Real-time BI Introduction SAP HANA enables organizations to optimize their business

More information

Implementation of Lexical Analysis

Implementation of Lexical Analysis Implementation of Lexical Analysis Outline Specifying lexical structure using regular expressions Finite automata Deterministic Finite Automata (DFAs) Non-deterministic Finite Automata (NFAs) Implementation

More information

FAQ (Basic) Sybase CEP Option R4

FAQ (Basic) Sybase CEP Option R4 FAQ (Basic) Sybase CEP Option R4 DOCUMENT ID: DC01023-01-0400-01 LAST REVISED: February 2010 Copyright 2010 by Sybase, Inc. All rights reserved. This publication pertains to Sybase software and to any

More information

A Examcollection.Premium.Exam.47q

A Examcollection.Premium.Exam.47q A2090-303.Examcollection.Premium.Exam.47q Number: A2090-303 Passing Score: 800 Time Limit: 120 min File Version: 32.7 http://www.gratisexam.com/ Exam Code: A2090-303 Exam Name: Assessment: IBM InfoSphere

More information

SQL. The Basics Advanced Manipulation Constraints Authorization 1. 1

SQL. The Basics Advanced Manipulation Constraints Authorization 1. 1 SQL The Basics Advanced Manipulation Constraints Authorization 1. 1 Table of Contents SQL 0 Table of Contents 0/1 Parke Godfrey 0/2 Acknowledgments 0/3 SQL: a standard language for accessing databases

More information

Introduction. 1. Introduction. Overview Query Processing Overview Query Optimization Overview Query Execution 3 / 591

Introduction. 1. Introduction. Overview Query Processing Overview Query Optimization Overview Query Execution 3 / 591 1. Introduction Overview Query Processing Overview Query Optimization Overview Query Execution 3 / 591 Query Processing Reason for Query Optimization query languages like SQL are declarative query specifies

More information

Teleport Messaging for. Distributed Stream Programs

Teleport Messaging for. Distributed Stream Programs Teleport Messaging for 1 Distributed Stream Programs William Thies, Michal Karczmarek, Janis Sermulins, Rodric Rabbah and Saman Amarasinghe Massachusetts Institute of Technology PPoPP 2005 http://cag.lcs.mit.edu/streamit

More information

Index. caps method, 180 Character(s) base, 161 classes

Index. caps method, 180 Character(s) base, 161 classes A Abjads, 160 Abstract syntax tree (AST), 3 with action objects, 141 143 definition, 135 Action method for integers converts, 172 173 S-Expressions, 171 Action objects ASTs, 141 142 defined, 137.made attribute,

More information