Apache Flink Big Data Stream Processing

Size: px

Start display at page:

Download "Apache Flink Big Data Stream Processing"

Aileen Kathleen Hart
6 years ago
Views:

Apache Flink Big Data Stream Processing Tilmann Rabl Berlin Big Data Center www.dima.tu-berlin.

2 Agenda Disclaimer: I am neither a Flink developer nor affiliated with data Artisans. 2 2 DIMA 2017

3 Agenda Flink Primer Background & APIs (-> Polystore functionality) Execution Engine Some key features Stream Processing with Apache Flink Key features With slides from data Artisans, Volker Markl, Asterios Katsifodimos 3 3 DIMA 2017

5 Stratosphere: General Purpose Programming + Database Execution Draws on Database Technology Adds Draws on MapReduce Technology Relational Algebra Declarativity Query Optimization Robust Out-of-core Iterations Advanced Dataflows General APIs Native Streaming Scalability User-defined Functions Complex Data Types Schema on Read 5 DIMA 2017

6 The APIs Stream- & Batch Processing Analytics Stream SQL Table API (dynamic tables) Stateful Event-Driven Applications 6 DataStream API (streams, windows) Process Function (events, state, time) Berlin Big Data Center All Rights Reserved 6 DIMA 2017

7 Process Function class MyFunction extends ProcessFunction[MyEvent, Result] { // declare state to use in the program lazy val state: ValueState[CountWithTimestamp] = getruntimecontext().getstate( ) def processelement(event: MyEvent, ctx: Context, out: Collector[Result]): Unit = { // work with event and state (event, state.value) match { } out.collect( ) // emit events state.update( ) // modify state } // schedule a timer callback ctx.timerservice.registereventtimetimer(event.timestamp + 500) } def ontimer(timestamp: Long, ctx: OnTimerContext, out: Collector[Result]): Unit = { // handle callback when event-/processing- time instant is reached } Berlin Big Data Center All Rights Reserved 7 DIMA

8 Data Stream API val lines: DataStream[String] = env.addsource( new FlinkKafkaConsumer09<>( )) val events: DataStream[Event] = lines.map((line) => parse(line)) val stats: DataStream[Statistic] = stream.keyby("sensor").timewindow(time.seconds(5)).sum(new MyAggregationFunction()) stats.addsink(new RollingSink(path)) Berlin Big Data Center All Rights Reserved 8 DIMA

10 What can I do with it? Stream processing Batch processing Machine Learning at scale Complex event processing Graph Analysis Flink An engine that can natively support all these workloads Berlin Big Data Center All Rights Reserved 10 DIMA 2017

11 Flink in the Analytics Ecosystem Applications & Languages Hive Mahout Cascading Pig Giraph Crunch Data processing engines MapReduce Spark Storm Flink Tez App and resource management Yarn Mesos Storage, streams HDFS HBase Kafka Berlin Big Data Center All Rights Reserved 11 DIMA

12 Where in my cluster does Flink fit? Gathering Integration Analysis Server logs Upstream systems Trxn logs Sensor logs - Gather and backup streams - Offer streams for consumption - Provide stream recovery - Analyze and correlate streams - Create derived streams and state - Provide these to upstream systems 12 DIMA 2017

13 Architecture Hybrid MapReduce and MPP database runtime Pipelined/Streaming engine Complete DAG deployed Worker 1 Worker 2 Job Manager Worker 3 Worker DIMA 2017

14 Flink Execution Model Flink program = DAG* of operators and intermediate streams Operator = computation + state Intermediate streams = logical stream of records DIMA 2017

15 Technology inside Flink case class Path (from: Long, to: Long) val tc = edges.iterate(10) { paths: DataSet[Path] => val next = paths.join(edges).where("to").equalto("from") { (path, edge) => Path(path.from, edge.to) }.union(paths).distinct() next } Program Type extraction stack Cost-based optimizer Pre-flight (Client) Map Filter DataSourc e orders.tbl build HT GroupRed sort forward Join Hybrid Hash probe hash-part [0] hash-part [0] DataSourc e lineitem.tbl Dataflow Graph Memory manager Out-of-core algorithms deploy operators Recovery metadata Batch & streaming State & checkpoints Workers track intermediate results Task scheduling Master Berlin Big Data Center All Rights Reserved 15 DIMA 2017

16 Rich set of operators Map, Reduce, Join, CoGroup, Union, Iterate, Delta Iterate, Filter, FlatMap, GroupReduce, Project, Aggregate, Distinct, Vertex-Update, Accumulators, DIMA 2017

17 Effect of optimization Execution Plan A Hash vs. Sort Partition vs. Broadcast Caching Reusing partition/sort Run on a sample on the laptop Execution Plan B Run on large files on the cluster Execution Plan C Run a month later after the data evolved DIMA 2017

Flink Optimizer Transitive Closure replace Co-locate DISTINCT + JOIN Iterate Forward HDF S Hybrid Hash Join Group new Distinc Reduce (Sorted (on

Loop-invariant data cached in memory What you write is not what is executed No need to hardcode execution strategies Flink Optimizer decides:

18 Flink Optimizer Transitive Closure replace Co-locate DISTINCT + JOIN Iterate Forward HDF S Hybrid Hash Join Group new Distinc Reduce (Sorted (on [0])) Paths Join Union t Hash Partition on [1] Co-locate JOIN + UNION Hash Partition on [1] Step function Hash Partition on [0] paths Loop-invariant data cached in memory What you write is not what is executed No need to hardcode execution strategies Flink Optimizer decides: Pipelines and dam/barrier placement Sort- vs. hash- based execution Data exchange (partition vs. broadcast) Data partitioning steps In-memory caching DIMA 2017

19 Scale Out DIMA 2017

20 Stream Processing with Flink 20 DIMA 2017

21 8 Requirements of Big Streaming Keep the data moving Streaming architecture Integrate stored and streaming data Hybrid stream and batch Declarative access E.g. StreamSQL, CQL Data safety and availability Fault tolerance, durable state Handle imperfections Late, missing, unordered items Automatic partitioning and scaling Distributed processing Predictable outcomes Consistency, event time Instantaneous processing and response The 8 Requirements of Real-Time Stream Processing Stonebraker et al DIMA 2017

22 8 Requirements of Streaming Systems Keep the data moving Streaming architecture Integrate stored and streaming data Hybrid stream and batch see StreamSQL Declarative access E.g. StreamSQL, CQL Data safety and availability Fault tolerance, durable state Handle imperfections Late, missing, unordered items Automatic partitioning and scaling Distributed processing Predictable outcomes Consistency, event time Instantaneous processing and response The 8 Requirements of Real-Time Stream Processing Stonebraker et al DIMA 2017

23 How to keep data moving? Discretized Streams (mini-batch) while (true) { // get next few records // issue batch computation } Stream discretizer Job Job Job Job Native streaming while (true) { // process next record } Long-standing operators DIMA 2017

Declarative Access Stream SQL Stream / Table Duality Table without Primary Key

data item is received Processing time System time when data item is

25 Handle Imperfections - Event Time et al. Event time Data item production time Ingestion time System time when data item is received Processing time System time when data item is processed Typically, these do not match! In practice, streams are unordered! Image: Tyler Akidau DIMA 2017

Time: Event Time Example Event Time Episode IV Episode V Episode VI Episode I Episode II Episode III Episode VII

27 Flink s Windowing Windows can be any combination of (multiple) triggers & evictions Arbitrary tumbling, sliding, session, etc. windows can be constructed. Common triggers/evictions part of the API Time (processing vs. event time), Count Even more flexibility: define your own UDF trigger/eviction Examples: datastream.windowall(tumblingeventtimewindows.of(time.seconds(5))); datastream.keyby(0).window(tumblingeventtimewindows.of(time.seconds(5))); Flink will handle event time, ordering, etc DIMA 2017

Example Analysis: Windowed Aggregation (2) StockPrice(HDP, 23.8) StockPrice(SPX, 2113.9) StockPrice(FTSE, 6931.7) StockPrice(HDP, 23.8) StockPrice(HDP, 26.6) (1) (3) StockPrice(SPX, 2113.

28 Example Analysis: Windowed Aggregation (2) StockPrice(HDP, 23.8) StockPrice(SPX, ) StockPrice(FTSE, ) StockPrice(HDP, 23.8) StockPrice(HDP, 26.6) (1) (3) StockPrice(SPX, ) StockPrice(FTSE, ) StockPrice(HDP, 26.6) (4) StockPrice(SPX, ) StockPrice(FTSE, ) StockPrice(HDP, 25.2) (1) (2) (3) (4) val windowedstream = stockstream.window(time.of(10, SECONDS)).every(Time.of(5, SECONDS)) val lowest = windowedstream.minby("price") val maxbystock = windowedstream.groupby("symbol").maxby("price") val rollingmean = windowedstream.groupby("symbol").mapwindow(mean _) 28 DIMA 2017

29 Data Safety and Availability Ensure that operators see all events At least once Solved by replaying a stream from a checkpoint No good for correct results Ensure that operators do not perform duplicate updates to their state Exactly once Several solutions Ensure the job can survive failure DIMA 2017

30 Lessons Learned from Batch batch-2 batch-1 If a batch computation fails, simply repeat computation as a transaction Transaction rate is constant Can we apply these principles to a true streaming execution? DIMA

31 Taking Snapshots the naïve way t1 t2 Initial approach (e.g., Naiad) Pause execution on t1,t2,.. Collect state Restore execution execution snapshots DIMA

32 Asynchronous Snapshots in Flink t1 snapshotting t2 snapshotting Propagating markers/barriers snap - t1 Full or incremental snap - t DIMA 2017

33 Conclusion Apache Flink! The case for Flink as a stream processor Ideal basis for polystore computations Full feature big data streaming engine DIMA 2017

Practical Big Data Processing An Overview of Apache Flink

Practical Big Data Processing An Overview of Apache Flink Tilmann Rabl Berlin Big Data Center www.dima.tu-berlin.de bbdc.berlin rabl@tu-berlin.de With slides from Volker Markl and data artisans 1 2013