Deep dive into analytics using Aggregation. Boaz

Size: px
Start display at page:

Download "Deep dive into analytics using Aggregation. Boaz"

Transcription

1 Deep dive into analytics using Aggregation Boaz

2 Elasticsearch an end-to-end search and analytics platform.

3 full text search highlighted search snippets search-as-you-type did-you-mean suggestions Copyright Elasticsearch Copying, publishing and/or distributing without written permission is strictly prohibited.

4 combines full text search with geolocation uses more-like-this to find related questions and answers Copyright Elasticsearch Copying, publishing and/or distributing without written permission is strictly prohibited.

5 search repositories, users, issues, pull requests search 130 billion lines of code track all alerts, events, logs Copyright Elasticsearch Copying, publishing and/or distributing without written permission is strictly prohibited.

6 index and analyse 5TB of log data every day Copyright Elasticsearch Copying, publishing and/or distributing without written permission is strictly prohibited.

7 combine visitor logs with social network data real-time feedback to editors Copyright Elasticsearch Copying, publishing and/or distributing without written permission is strictly prohibited.

8 Copyright Elasticsearch Copying, publishing and/or distributing without written permission is strictly prohibited.

9 Feature summary Fully-featured search Relevance-ranked text search Scalable search High-performance geo, temporal, numeric range and key lookup Highlighting Support for complex document types (nested structures) * Spelling suggestions Powerful query DSL * Standing queries * Real-time results * Extensible via plugins *! Powerful faceting/analysis Summarise large sets by any combinations of time, geo, category and more. * Kibana visualisation tool *! * Features we see as differentiators Management Simple and robust deployments * REST APIs for handling all aspects of administration/monitoring * Marvel console for monitoring and administering clusters * Special features to manage the life cycle of content * Integration Hadoop (MapRed,Hive, Pig, Cascading..)* Client libraries (Python, Java, Ruby, javascript ) Data connectors (Twitter, JMS ) Logstash ETL framework * Support Development and Production support with tiered levels Support staff are the core developers of the product *

10 Let s talk data data === json

11 { "created_at": "Wed Sep 03 11:43: ", "id": , text": "Enroute to Dublin for a day of #elasticsearch. A shoutout for friendly chats or both ;)", "user": { "id": , "name": "Boaz Leskes", "screen_name": "bleskes", "location": "Amsterdam", "description": "Coder at Elasticsearch", "time_zone": "Amsterdam",, "geo": null, "retweet_count": 1, "entities": { "hashtags": [ { "text": elasticsearch" ], "symbols": [], "urls": [], "user_mentions": [ { "screen_name": "DevOpsIreland", { "screen_name": nosqlmatters" ], "favorited": false, "retweeted": false, "lang": "en"

12 { "dt": " T02:01:48.026Z", "url": " "querystring": "", "host": " "path": "/film/2014/mar/03/oscars-2014-winners-list", "section": "film", "platform": "r2", "useragent": { "type": "Browser", "family": "Safari 5.1.9", "os": "OS X ", "device": "Personal computer", "documentreferrer": " "browser": { "id": "ga6ruflhwnqvwdt0rw4r78fg", "isnew": false, "referringhost": "theguardian.com", "referringpath": "/world", "iscontent": true, "contentpublicationdate": " ", "countrycode": "US", "countryname": "United States", "location": { "lonlat": [ , ]

13 Data can be anything Questions Code Logs Credit card transactions Click logs

14 aggregations == 50km view == patterns

15 aggregations == 50km view == patterns insights

16 a simple UI element

17 or more complex

18 .. even more complex?

19 Underpinnings back to search

20 Powerful search engine GET tweets/tweet/_search { "query": { "filtered": { "query": { "match": { "text": "jumping", "filter": { "range": { "created_at": { "from": " T17:16:29+00:00", "to": "now"

21 Inverted index nosql 128 New York lat=6.9 lon=50 F

22 Our goal results

23 Counting results Cameras 6 8 Optics Accessories 153 Lenses

24 Counting results Cameras Optics Accessories Lenses 1 2 0

25 Field data nosql New York 4 lat=6.9 lon= F

26 Introducing aggregations analysis lego

27 Before there were facets facets are awesome the serve well and long but they are not scalable from a functionality perspective

28 Analysis lego

29 Buckets and metrics results Cameras Optics Accessories Lenses buckets metrics

30 Buckets and metrics results Cameras 8 Optics Accessories Lenses buckets metrics

31 json style GET localhost:9200/_search { "aggs" : { "countries" : { "terms" : { "field" : "country", "aggs" : { "subjects" : { "terms" : { field" : "subject", "aggs" : { "avg_score" : { "avg" : { "field" : "score" { "hits" : {..., "aggregations" : { "countries" : { "buckets" : [ { "key" : "USA", "doc_count" : 5 "aggregations" : { "subjects" : "buckets" : [ { "key" : "Mathematics", "doc_count" : 3, "aggregations" : { "avg_score" : { "value" : 87.5,... ],... ]

32 measure all the things doc count (free!) avg min max stats extended stats cardinality percentiles sum count

33 Buckets global filter missing terms range date histogram geo distance nested geohash grid significant terms date range ip range histogram

34 Example - range bucket GET localhost:9200/grades/grade/_search { "aggs" : { "age_groups" : { "range" : { "field" : "age", "ranges" : [ { "from" : 5, "to" : 10, { "from" : 10 ], "aggs" : { "avg_grade" : { "avg" : { "field" : "grade" ' "age_groups": { "buckets": [ { "from": 5, "to": 10, "doc_count": 911, "avg_grade": { "value": , { "from": 10, "doc_count": 2276, "avg_grade": { "value": ]!

35 Example - histogram GET grades/grade/_search { "aggs" : { "grades_distribution" : { "histogram" : { "field" : "grade", "interval" : 10

36 Example - histogram GET grades/grade/_search { "aggregations": { "aggs" : { "grades_distribution": { "grades_distribution" "buckets": [ : { "histogram" { : { "field" "key": 60, : "grade", "doc_count": 467 "interval" : 10, { ] "key": 70, "doc_count": 873, { "key": 80, "doc_count": 930, { "key": 90, "doc_count":

37 Significant terms analytics as search

38 Common crimes GET ukcrimes/_search { "query": { "aggregations" : { "map" : { "geohash_grid" : { "field":"location", "precision":5,, "aggregations":{ "most_popular_crime_type":{ "terms":{ "field" : "crime_type", "size" : 1

39 Geo-what?

40 Geo-what?

41 Geo-what?

42 The common terms problem

43 Uncommonly common GET ukcrimes/_search { "query": {, "aggregations" : { "map" : { "geohash_grid" : { "field":"location", "precision":5, "aggregations":{ "most_popular_crime_type":{ "significant_terms":{ "field" : "crime_type", "size" : 1

44 Uncommonly common

45 Demo!

46 Copyright Elasticsearch Copying, publishing and/or distributing without written permission is strictly prohibited

New features in Elasticsearch 1.0

New features in Elasticsearch 1.0 New features in Elasticsearch 1.0 @lucacavanna what is elasticsearch? RESTful analytics document oriented schema-free search Lucene open source real-time distributed JSON Copyright Elasticsearch 2014.

More information

Turbocharge your MySQL analytics with ElasticSearch. Guillaume Lefranc Data & Infrastructure Architect, Productsup GmbH Percona Live Europe 2017

Turbocharge your MySQL analytics with ElasticSearch. Guillaume Lefranc Data & Infrastructure Architect, Productsup GmbH Percona Live Europe 2017 Turbocharge your MySQL analytics with ElasticSearch Guillaume Lefranc Data & Infrastructure Architect, Productsup GmbH Percona Live Europe 2017 About the Speaker Guillaume Lefranc Data Architect at Productsup

More information

Realtime visitor analysis with Couchbase and Elasticsearch

Realtime visitor analysis with Couchbase and Elasticsearch Realtime visitor analysis with Couchbase and Elasticsearch Jeroen Reijn @jreijn #nosql13 About me Jeroen Reijn Software engineer Hippo @jreijn http://blog.jeroenreijn.com About Hippo Visitor Analysis OneHippo

More information

Goal of this document: A simple yet effective

Goal of this document: A simple yet effective INTRODUCTION TO ELK STACK Goal of this document: A simple yet effective document for folks who want to learn basics of ELK (Elasticsearch, Logstash and Kibana) without any prior knowledge. Introduction:

More information

ADVANCED DATABASES CIS 6930 Dr. Markus Schneider. Group 5 Ajantha Ramineni, Sahil Tiwari, Rishabh Jain, Shivang Gupta

ADVANCED DATABASES CIS 6930 Dr. Markus Schneider. Group 5 Ajantha Ramineni, Sahil Tiwari, Rishabh Jain, Shivang Gupta ADVANCED DATABASES CIS 6930 Dr. Markus Schneider Group 5 Ajantha Ramineni, Sahil Tiwari, Rishabh Jain, Shivang Gupta WHAT IS ELASTIC SEARCH? Elastic Search Elasticsearch is a search engine based on Lucene.

More information

Amazon Search Services. Christoph Schmitter

Amazon Search Services. Christoph Schmitter Amazon Search Services Christoph Schmitter csc@amazon.de What we'll cover Overview of Amazon Search Services Understand the difference between Cloudsearch and Amazon ElasticSearch Service Q&A Amazon Search

More information

Building a Scalable Recommender System with Apache Spark, Apache Kafka and Elasticsearch

Building a Scalable Recommender System with Apache Spark, Apache Kafka and Elasticsearch Nick Pentreath Nov / 14 / 16 Building a Scalable Recommender System with Apache Spark, Apache Kafka and Elasticsearch About @MLnick Principal Engineer, IBM Apache Spark PMC Focused on machine learning

More information

Professional Data - Wrestling Techniques Using Elasticsearch's Aggregation Framework. Mark 18/6/2015

Professional Data - Wrestling Techniques Using Elasticsearch's Aggregation Framework. Mark 18/6/2015 Professional Data - Wrestling Techniques Using Elasticsearch's Aggregation Framework Mark Harwood @elasticmark 18/6/2015 Some brief background How search moved into analytics 2 Search interface 1.0 search

More information

E l a s t i c s e a r c h F e a t u r e s. Contents

E l a s t i c s e a r c h F e a t u r e s. Contents Elasticsearch Features A n Overview Contents Introduction... 2 Location Based Search... 2 Search Social Media(Twitter) data from Elasticsearch... 4 Query Boosting in Elasticsearch... 4 Machine Learning

More information

Table 1 The Elastic Stack use cases Use case Industry or vertical market Operational log analytics: Gain real-time operational insight, reduce Mean Ti

Table 1 The Elastic Stack use cases Use case Industry or vertical market Operational log analytics: Gain real-time operational insight, reduce Mean Ti Solution Overview Cisco UCS Integrated Infrastructure for Big Data with the Elastic Stack Cisco and Elastic deliver a powerful, scalable, and programmable IT operations and security analytics platform

More information

by Cisco Intercloud Fabric and the Cisco

by Cisco Intercloud Fabric and the Cisco Expand Your Data Search and Analysis Capability Across a Hybrid Cloud Solution Brief June 2015 Highlights Extend Your Data Center and Cloud Build a hybrid cloud from your IT resources and public and providerhosted

More information

Combining Solr and Elasticsearch to Improve Autosuggestion on Mobile Local Search. Toan Vinh Luu, PhD Senior Search Engineer local.

Combining Solr and Elasticsearch to Improve Autosuggestion on Mobile Local Search. Toan Vinh Luu, PhD Senior Search Engineer local. Combining Solr and Elasticsearch to Improve Autosuggestion on Mobile Local Search Toan Vinh Luu, PhD Senior Search Engineer local.ch AG In this talk Autosuggestion feature Autosuggestion architecture Evaluation

More information

Amazon Elasticsearch Service

Amazon Elasticsearch Service Amazon Elasticsearch Service Fully managed, reliable, and scalable Elasticsearch service. Have Your Frontend & Monitor It Too Scalable Log Analytics Inside a VPC Lab Instructions Contents Lab Overview...

More information

Elasticsearch. Presented by: Steve Mayzak, Director of Systems Engineering Vince Marino, Account Exec

Elasticsearch. Presented by: Steve Mayzak, Director of Systems Engineering Vince Marino, Account Exec Elasticsearch Presented by: Steve Mayzak, Director of Systems Engineering Vince Marino, Account Exec What about Elasticsearch the Company?! Support 100s of Companies in Production environments Training

More information

Log Analytics with Amazon Elasticsearch Service. Christoph Schmitter

Log Analytics with Amazon Elasticsearch Service. Christoph Schmitter Log Analytics with Amazon Elasticsearch Service Christoph Schmitter (csc@amazon.de) What we'll cover Understanding Elasticsearch capabilities Elasticsearch, the technology Aggregations; ad-hoc analysis

More information

Elasticsearch Search made easy

Elasticsearch Search made easy Elasticsearch Search made easy Alexander Reelsen Agenda Why is search complex? Installation & initial setup Importing data Searching data Replication & Sharding Plugin-based

More information

Geo Capabilities in Elasticsearch Nicholas

Geo Capabilities in Elasticsearch Nicholas Geo Capabilities in Elasticsearch Nicholas Knize @nknize May 3, 2018 1 Nicholas Knize Elasticsearch & Apache Lucene Geo Software Guy Elastic 2 Housekeeping & Logistics Slides and recording will be available

More information

Big Data Analytics Tools. Applied to ATLAS Event Data

Big Data Analytics Tools. Applied to ATLAS Event Data Big Data Analytics Tools Applied to ATLAS Event Data Ilija Vukotic University of Chicago CHEP 2016, San Francisco Idea Big Data technologies have proven to be very useful for storage, visualization and

More information

About the Tutorial. Audience. Prerequisites. Copyright and Disclaimer. Logstash

About the Tutorial. Audience. Prerequisites. Copyright and Disclaimer. Logstash About the Tutorial is an open-source, centralized, events and logging manager. It is a part of the ELK (ElasticSearch,, Kibana) stack. In this tutorial, we will understand the basics of, its features,

More information

Ninja Level Infrastructure Monitoring. Defensive Approach to Security Monitoring and Automation

Ninja Level Infrastructure Monitoring. Defensive Approach to Security Monitoring and Automation Ninja Level Infrastructure Monitoring Defensive Approach to Security Monitoring and Automation 1 DEFCON 24 06 th August 2016, Saturday 10:00-14:00 Madhu Akula & Riyaz Walikar Appsecco.com 2 About Automation

More information

How to Design REST API? The Twitter Example Written Date : March 23, 2015

How to Design REST API? The Twitter Example Written Date : March 23, 2015 Written Date : March 23, 2015 REST (REpresentational State Transfer), an architectural style for web services, is getting more and more popular in recent years. Many leading vendors have opened the doors

More information

Esri and MarkLogic: Location Analytics, Multi-Model Data

Esri and MarkLogic: Location Analytics, Multi-Model Data Esri and MarkLogic: Location Analytics, Multi-Model Data Ben Conklin, Industry Manager, Defense, Intel and National Security, Esri Anthony Roach, Product Manager, MarkLogic James Kerr, Technical Director,

More information

CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION

CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION Hands-on Session NoSQL DB Donato Summa THE CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION 1 Summary Elasticsearch How to get Elasticsearch up and running ES data organization

More information

run your own search engine. today: Cablecar

run your own search engine. today: Cablecar run your own search engine. today: Cablecar Robert Kowalski @robinson_k http://github.com/robertkowalski Search nobody uses that, right? Services on the Market Google Bing Yahoo ask Wolfram Alpha Baidu

More information

Wrangling Logs with Logstash and ElasticSearch

Wrangling Logs with Logstash and ElasticSearch Wrangling Logs with Logstash and ElasticSearch Nate Jones & David Castro Media Temple OSCON 2012 Why are we here? Size Quantity Efficiency Access Locality Method Filtering Grokability Noise Structure Metrics

More information

MySQL for Developers. Duration: 5 Days

MySQL for Developers. Duration: 5 Days Oracle University Contact Us: 0800 891 6502 MySQL for Developers Duration: 5 Days What you will learn This MySQL for Developers training teaches developers how to develop console and web applications using

More information

Microsoft Perform Data Engineering on Microsoft Azure HDInsight.

Microsoft Perform Data Engineering on Microsoft Azure HDInsight. Microsoft 70-775 Perform Data Engineering on Microsoft Azure HDInsight http://killexams.com/pass4sure/exam-detail/70-775 QUESTION: 30 You are building a security tracking solution in Apache Kafka to parse

More information

Open Source Search. Andreas Pesenhofer. max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria

Open Source Search. Andreas Pesenhofer. max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria Open Source Search Andreas Pesenhofer max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria max.recall information systems max.recall is a software and consulting company enabling

More information

MySQL for Developers. Duration: 5 Days

MySQL for Developers. Duration: 5 Days Oracle University Contact Us: Local: 0845 777 7 711 Intl: +44 845 777 7 711 MySQL for Developers Duration: 5 Days What you will learn This MySQL for Developers training teaches developers how to develop

More information

Pig A language for data processing in Hadoop

Pig A language for data processing in Hadoop Pig A language for data processing in Hadoop Antonino Virgillito THE CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION Apache Pig: Introduction Tool for querying data on Hadoop

More information

Integration Service. Admin Console User Guide. On-Premises

Integration Service. Admin Console User Guide. On-Premises Kony MobileFabric TM Integration Service Admin Console User Guide On-Premises Release 7.3 Document Relevance and Accuracy This document is considered relevant to the Release stated on this title page and

More information

ARCHITECTURE ARCHITECTURE OVERVIEW

ARCHITECTURE ARCHITECTURE OVERVIEW ARCHITECTURE ARCHITECTURE OVERVIEW The personalization of the customer experience is in every marketer s mind and this requirement has strong impacts on customer data integration, across channels and applications.

More information

Building A Billion Spatio-Temporal Object Search and Visualization Platform

Building A Billion Spatio-Temporal Object Search and Visualization Platform 2017 2 nd International Symposium on Spatiotemporal Computing Harvard University Building A Billion Spatio-Temporal Object Search and Visualization Platform Devika Kakkar, Benjamin Lewis Goal Develop a

More information

The Billion Object Platform (BOP): a system to lower barriers to support big, streaming, spatio-temporal data sources

The Billion Object Platform (BOP): a system to lower barriers to support big, streaming, spatio-temporal data sources FOSS4G 2017 Boston The Billion Object Platform (BOP): a system to lower barriers to support big, streaming, spatio-temporal data sources Devika Kakkar and Ben Lewis Harvard Center for Geographic Analysis

More information

High Performance NoSQL with MongoDB

High Performance NoSQL with MongoDB High Performance NoSQL with MongoDB History of NoSQL June 11th, 2009, San Francisco, USA Johan Oskarsson (from http://last.fm/) organized a meetup to discuss advances in data storage which were all using

More information

Introduction to ELK stack

Introduction to ELK stack Introduction to ELK stack 巨量資料處理 搜尋 及分析工具介紹 計資中心網路組邵喻美 madeline@ntu.edu.tw 1 Topics Why big data tool for network traffic and log analysis What is ELK stack, and why choose it ELK stack intro ELK use cases

More information

Powering Monitoring Analytics with ELK stack

Powering Monitoring Analytics with ELK stack Powering Monitoring Analytics with ELK stack Abdelkader Lahmadi, Frédéric Beck To cite this version: Abdelkader Lahmadi, Frédéric Beck. Powering Monitoring Analytics with ELK stack. 9th International Conference

More information

Percona Live September 21-23, 2015 Mövenpick Hotel Amsterdam

Percona Live September 21-23, 2015 Mövenpick Hotel Amsterdam Percona Live 2015 September 21-23, 2015 Mövenpick Hotel Amsterdam MongoDB, Elastic, and Hadoop: The What, When, and How Kimberly Wilkins Principal Engineer/Database Denizen ObjectRocket/Rackspace kimberly@objectrocket.com

More information

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Cy Erbay Senior Director Striim Executive Summary Striim is Uniquely Qualified to Solve the Challenges of Real-Time

More information

EPHP a tool for learning the basics of PHP development. Nick Whitelegg School of Media Arts and Technology Southampton Solent University

EPHP a tool for learning the basics of PHP development. Nick Whitelegg School of Media Arts and Technology Southampton Solent University EPHP a tool for learning the basics of PHP development Nick Whitelegg School of Media Arts and Technology Southampton Solent University My background Lecturer at Southampton Solent University since 2003

More information

microsoft

microsoft 70-775.microsoft Number: 70-775 Passing Score: 800 Time Limit: 120 min Exam A QUESTION 1 Note: This question is part of a series of questions that present the same scenario. Each question in the series

More information

DURATION : 03 DAYS. same along with BI tools.

DURATION : 03 DAYS. same along with BI tools. AWS REDSHIFT TRAINING MILDAIN DURATION : 03 DAYS To benefit from this Amazon Redshift Training course from mildain, you will need to have basic IT application development and deployment concepts, and good

More information

@InfluxDB. David Norton 1 / 69

@InfluxDB. David Norton  1 / 69 @InfluxDB David Norton (@dgnorton) david@influxdb.com 1 / 69 Instrumenting a Data Center 2 / 69 3 / 69 4 / 69 The problem: Efficiently monitor hundreds or thousands of servers 5 / 69 The solution: Automate

More information

ArcGIS for Server: Administration and Security. Amr Wahba

ArcGIS for Server: Administration and Security. Amr Wahba ArcGIS for Server: Administration and Security Amr Wahba awahba@esri.com Agenda ArcGIS Server architecture Distributing and scaling components Implementing security Monitoring server logs Automating server

More information

Processing 11 billions events a day with Spark. Alexander Krasheninnikov

Processing 11 billions events a day with Spark. Alexander Krasheninnikov Processing 11 billions events a day with Spark Alexander Krasheninnikov Badoo facts 46 languages 10M Photos added daily 320M registered users 190 countries 21M daily active users 3000+ servers 2 data-centers

More information

How to choose the right approach to analytics and reporting

How to choose the right approach to analytics and reporting SOLUTION OVERVIEW How to choose the right approach to analytics and reporting A comprehensive comparison of the open source and commercial versions of the OpenText Analytics Suite In today s digital world,

More information

CIS 612 Advanced Topics in Database Big Data Project Lawrence Ni, Priya Patil, James Tench

CIS 612 Advanced Topics in Database Big Data Project Lawrence Ni, Priya Patil, James Tench CIS 612 Advanced Topics in Database Big Data Project Lawrence Ni, Priya Patil, James Tench Abstract Implementing a Hadoop-based system for processing big data and doing analytics is a topic which has been

More information

Kibana, Grafana and Zeppelin on Monitoring data

Kibana, Grafana and Zeppelin on Monitoring data Kibana, Grafana and Zeppelin on Monitoring data Internal group presentaion Ildar Nurgaliev OpenLab Summer student Presentation structure About IT-CM-MM Section and myself Visualisation with Kibana 4 and

More information

Integration Service. Admin Console User Guide. On-Premises

Integration Service. Admin Console User Guide. On-Premises Kony Fabric Integration Service Admin Console User Guide On-Premises Release V8 SP1 Document Relevance and Accuracy This document is considered relevant to the Release stated on this title page and the

More information

User guide for GEM-TREND

User guide for GEM-TREND User guide for GEM-TREND 1. Requirements for Using GEM-TREND GEM-TREND is implemented as a java applet which can be run in most common browsers and has been test with Internet Explorer 7.0, Internet Explorer

More information

Ingest. Aaron Mildenstein, Consulting Architect Tokyo Dec 14, 2017

Ingest. Aaron Mildenstein, Consulting Architect Tokyo Dec 14, 2017 Ingest Aaron Mildenstein, Consulting Architect Tokyo Dec 14, 2017 Data Ingestion The process of collecting and importing data for immediate use 2 ? Simple things should be simple. Shay Banon Elastic{ON}

More information

Ingest. David Pilato, Developer Evangelist Paris, 31 Janvier 2017

Ingest. David Pilato, Developer Evangelist Paris, 31 Janvier 2017 Ingest David Pilato, Developer Evangelist Paris, 31 Janvier 2017 Data Ingestion The process of collecting and importing data for immediate use in a datastore 2 ? Simple things should be simple. Shay Banon

More information

EPiServer Find Advanced Session. Patrick van Kleef Mari Jørgensen

EPiServer Find Advanced Session. Patrick van Kleef Mari Jørgensen EPiServer Find Advanced Session Patrick van Kleef Mari Jørgensen Introduction Patrick van Kleef Macaw EPiServer experience EPiServer MVP Blogs Presentations Forum www.patrickvankleef.com Agenda Unified

More information

Curriculum Guide. ThingWorx

Curriculum Guide. ThingWorx Curriculum Guide ThingWorx Live Classroom Curriculum Guide Introduction to ThingWorx 8 ThingWorx 8 User Interface Development ThingWorx 8 Platform Administration ThingWorx 7.3 Fundamentals Applying Machine

More information

Adobe Marketing Cloud Best Practices Implementing Adobe Target using Dynamic Tag Management

Adobe Marketing Cloud Best Practices Implementing Adobe Target using Dynamic Tag Management Adobe Marketing Cloud Best Practices Implementing Adobe Target using Dynamic Tag Management Contents Best Practices for Implementing Adobe Target using Dynamic Tag Management.3 Dynamic Tag Management Implementation...4

More information

MQ Monitoring on Cloud

MQ Monitoring on Cloud MQ Monitoring on Cloud Suganya Rane Digital Automation, Integration & Cloud Solutions Agenda Metrics & Monitoring Monitoring Options AWS ElasticSearch Kibana MQ CloudWatch on AWS Prometheus Grafana MQ

More information

Streaming Data: The Opportunity & How to Work With It

Streaming Data: The Opportunity & How to Work With It Streaming Data: The Opportunity & How to Work With It Roger Barga, GM Amazon Kinesis April 2016 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Interest in and demand for stream

More information

Data Driven Performance Repository to Classify and Retrieve Storage Tuning Profiles Ismael Solis Moreno IBM

Data Driven Performance Repository to Classify and Retrieve Storage Tuning Profiles Ismael Solis Moreno IBM Data Driven Performance Repository to Classify and Retrieve Storage Tuning Profiles Ismael Solis Moreno IBM 2018 Storage Developer Conference. IBM Corporation. All Rights Reserved. 1 Agenda The challenge

More information

Making Sense of your Data

Making Sense of your Data Making Sense of your Data Building A Custom DataSource for Grafana with Vert.x Gerald Mücke DevCon5 GmbH @gmuecke About me 3 IT Consultant & Java Specialist at DevCon5 (CH) Focal Areas Tool-assisted quality

More information

Improving Drupal search experience with Apache Solr and Elasticsearch

Improving Drupal search experience with Apache Solr and Elasticsearch Improving Drupal search experience with Apache Solr and Elasticsearch Milos Pumpalovic Web Front-end Developer Gene Mohr Web Back-end Developer About Us Milos Pumpalovic Front End Developer Drupal theming

More information

/ Cloud Computing. Recitation 7 October 10, 2017

/ Cloud Computing. Recitation 7 October 10, 2017 15-319 / 15-619 Cloud Computing Recitation 7 October 10, 2017 Overview Last week s reflection Project 3.1 OLI Unit 3 - Module 10, 11, 12 Quiz 5 This week s schedule OLI Unit 3 - Module 13 Quiz 6 Project

More information

Getting to know. by Michelle Darling August 2013

Getting to know. by Michelle Darling August 2013 Getting to know by Michelle Darling mdarlingcmt@gmail.com August 2013 Agenda: What is Cassandra? Installation, CQL3 Data Modelling Summary Only 15 min to cover these, so please hold questions til the end,

More information

Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp.

Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. 17-18 March, 2018 Beijing Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020 Today, 80% of organizations

More information

Apache Lucene 4. Robert Muir

Apache Lucene 4. Robert Muir Apache Lucene 4 Robert Muir Agenda Overview of Lucene Conclusion Resources Q & A Download of Lucene: core/ analysis/ queryparser/ highlighter/ suggest/ expressions/ join/ memory/ codecs/... core/ Lucene

More information

Develop and test your Mobile App faster on AWS

Develop and test your Mobile App faster on AWS Develop and test your Mobile App faster on AWS Carlos Sanchiz, Solutions Architect @xcarlosx26 #AWSSummit 2016, Amazon Web Services, Inc. or its Affiliates. All rights reserved. The best mobile apps are

More information

Introduction to Big Data. NoSQL Databases. Instituto Politécnico de Tomar. Ricardo Campos

Introduction to Big Data. NoSQL Databases. Instituto Politécnico de Tomar. Ricardo Campos Instituto Politécnico de Tomar Introduction to Big Data NoSQL Databases Ricardo Campos Mestrado EI-IC Análise e Processamento de Grandes Volumes de Dados Tomar, Portugal, 2016 Part of the slides used in

More information

Big Data, Complex Data

Big Data, Complex Data Big Data, Complex Data Managing Data and Complexity in Graph Databases Peter Neubauer Neo Technology #neo4j @peterneubauer peter@neotechnology.com Data size NOSQL data models Key-value stores Bigtable

More information

Prototyping Data Intensive Apps: TrendingTopics.org

Prototyping Data Intensive Apps: TrendingTopics.org Prototyping Data Intensive Apps: TrendingTopics.org Pete Skomoroch Research Scientist at LinkedIn Consultant at Data Wrangling @peteskomoroch 09/29/09 1 Talk Outline TrendingTopics Overview Wikipedia Page

More information

Big Data and Hadoop. Course Curriculum: Your 10 Module Learning Plan. About Edureka

Big Data and Hadoop. Course Curriculum: Your 10 Module Learning Plan. About Edureka Course Curriculum: Your 10 Module Learning Plan Big Data and Hadoop About Edureka Edureka is a leading e-learning platform providing live instructor-led interactive online training. We cater to professionals

More information

TIBCO Complex Event Processing Evaluation Guide

TIBCO Complex Event Processing Evaluation Guide TIBCO Complex Event Processing Evaluation Guide This document provides a guide to evaluating CEP technologies. http://www.tibco.com Global Headquarters 3303 Hillview Avenue Palo Alto, CA 94304 Tel: +1

More information

ElasticIntel. Scalable Threat Intel Aggregation in AWS

ElasticIntel. Scalable Threat Intel Aggregation in AWS ElasticIntel Scalable Threat Intel Aggregation in AWS Presenter: Matt Jane Obligatory Who I Am slide.. Builder/Automator I put things in clouds Open Source Advocate

More information

Application monitoring with BELK. Nishant Sahay, Sr. Architect Bhavani Ananth, Architect

Application monitoring with BELK. Nishant Sahay, Sr. Architect Bhavani Ananth, Architect Application monitoring with BELK Nishant Sahay, Sr. Architect Bhavani Ananth, Architect Why logs Business PoV Input Data Analytics User Interactions /Behavior End user Experience/ Improvements 2017 Wipro

More information

Facebook Insights User guide

Facebook Insights User guide Facebook Insights User guide 2 Overview Facebook Insights brings the page administrators valuable perspective on the performance of the pages they manage. Analytics allows integration of Insights data

More information

End User Monitoring. AppDynamics Pro Documentation. Version Page 1

End User Monitoring. AppDynamics Pro Documentation. Version Page 1 End User Monitoring AppDynamics Pro Documentation Version 4.1.1 Page 1 End User Monitoring....................................................... 4 Browser Real User Monitoring.............................................

More information

In this brief tutorial, we will be explaining the basics of Elasticsearch and its features.

In this brief tutorial, we will be explaining the basics of Elasticsearch and its features. About the Tutorial is a real-time distributed and open source full-text search and analytics engine. It is used in Single Page Application (SPA) projects. is open source developed in Java and used by many

More information

Axibase Time-Series Database. Non-relational database for storing and analyzing large volumes of metrics collected at high-frequency

Axibase Time-Series Database. Non-relational database for storing and analyzing large volumes of metrics collected at high-frequency Axibase Time-Series Database Non-relational database for storing and analyzing large volumes of metrics collected at high-frequency What is a Time-Series Database? A time series database (TSDB) is a software

More information

The Power of Twitter Data. Mark Callahan Data Product

The Power of Twitter Data. Mark Callahan Data Product The Power of Twitter Data Mark Callahan Data Product Marketing @MDCin303 Imagine what s possible when you know what the whole world is thinking about any topic, at any time What Is Unique About Twitter

More information

WHY AND HOW TO LEVERAGE THE POWER AND SIMPLICITY OF SQL ON APACHE FLINK - FABIAN HUESKE, SOFTWARE ENGINEER

WHY AND HOW TO LEVERAGE THE POWER AND SIMPLICITY OF SQL ON APACHE FLINK - FABIAN HUESKE, SOFTWARE ENGINEER WHY AND HOW TO LEVERAGE THE POWER AND SIMPLICITY OF SQL ON APACHE FLINK - FABIAN HUESKE, SOFTWARE ENGINEER ABOUT ME Apache Flink PMC member & ASF member Contributing since day 1 at TU Berlin Focusing on

More information

RethinkDB. Niharika Vithala, Deepan Sekar, Aidan Pace, and Chang Xu

RethinkDB. Niharika Vithala, Deepan Sekar, Aidan Pace, and Chang Xu RethinkDB Niharika Vithala, Deepan Sekar, Aidan Pace, and Chang Xu Content Introduction System Features Data Model ReQL Applications Introduction Niharika Vithala What is a NoSQL Database Databases that

More information

SAS STUDIO. JUNE 2014 PRESENTER: MARY HARDING Education SAS Canada. Copyr i g ht 2014, SAS Ins titut e Inc. All rights res er ve d.

SAS STUDIO. JUNE 2014 PRESENTER: MARY HARDING Education SAS Canada. Copyr i g ht 2014, SAS Ins titut e Inc. All rights res er ve d. JUNE 2014 PRESENTER: MARY HARDING Education SAS Canada NEW SAS PROGRAMMING ENVIRONMENT Available Consistent Assistive AVAILABLE THROUGH ALL MODERN WEB BROWSERS Available Consistent Assistive ONE INTERFACE

More information

Build, Deploy & Operate Intelligent Chatbots with Amazon Lex

Build, Deploy & Operate Intelligent Chatbots with Amazon Lex Build, Deploy & Operate Intelligent Chatbots with Amazon Lex Ian Massingham AWS Technical Evangelist @IanMmmm aws.amazon.com/lex 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

More information

How we built a highly scalable Machine Learning platform using Apache Mesos

How we built a highly scalable Machine Learning platform using Apache Mesos How we built a highly scalable Machine Learning platform using Apache Mesos Daniel Sârbe Development Manager, BigData and Cloud Machine Translation @ SDL Co-founder of BigData/DataScience Meetup Cluj,

More information

The Observatory Tool Dashboard Guide

The Observatory Tool Dashboard Guide To paraphrase Donella Meadows, we can't impose our will on Internet. We can listen to what Internet tells us, and discover how its properties and our values can work together to bring forth something much

More information

Search Engines and Time Series Databases

Search Engines and Time Series Databases Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Search Engines and Time Series Databases Corso di Sistemi e Architetture per Big Data A.A. 2017/18

More information

Hortonworks DataPlane Service

Hortonworks DataPlane Service Data Steward Studio Administration () docs.hortonworks.com : Data Steward Studio Administration Copyright 2016-2017 Hortonworks, Inc. All rights reserved. Please visit the Hortonworks Data Platform page

More information

Blurring the Line Between Developer and Data Scientist

Blurring the Line Between Developer and Data Scientist Blurring the Line Between Developer and Data Scientist Notebooks with PixieDust va barbosa va@us.ibm.com Developer Advocacy IBM Watson Data Platform WHY ARE YOU HERE? More companies making bet-the-business

More information

Package elasticsearchr

Package elasticsearchr Type Package Version 0.2.2 Package elasticsearchr March 29, 2018 Title A Lightweight Interface for Interacting with Elasticsearch from R Date 2018-03-29 Author Alex Ioannides Maintainer Alex Ioannides

More information

Storm. Distributed and fault-tolerant realtime computation. Nathan Marz Twitter

Storm. Distributed and fault-tolerant realtime computation. Nathan Marz Twitter Storm Distributed and fault-tolerant realtime computation Nathan Marz Twitter Storm at Twitter Twitter Web Analytics Before Storm Queues Workers Example (simplified) Example Workers schemify tweets and

More information

Integrating Splunk with AWS services:

Integrating Splunk with AWS services: Integrating Splunk with AWS services: Using Redshi+, Elas0c Map Reduce (EMR), Amazon Machine Learning & S3 to gain ac0onable insights via predic0ve analy0cs via Splunk Patrick Shumate Solutions Architect,

More information

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper. Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...

More information

Analyze Bug Statistics using Kibana Dashboard and Get Voice Alerts

Analyze Bug Statistics using Kibana Dashboard and Get Voice Alerts Analyze Bug Statistics using Kibana Dashboard and Get Voice Alerts Kibana Dashboard Elast Alert Sensiple Notification System Abstract This white paper describes how Kibana Dashboard can be used to analyze

More information

Contents. Getting Set Up Contents 2

Contents. Getting Set Up Contents 2 Getting Set Up Contents 2 Contents Getting Set Up... 3 Best Practices...3 Installing the JAR File... 3 Configuring Community Manager Reports...4 Configure the Analytics Database...4 Enable the Analytics

More information

BEYOND THE RDBMS: WORKING WITH RELATIONAL DATA IN MARKLOGIC

BEYOND THE RDBMS: WORKING WITH RELATIONAL DATA IN MARKLOGIC BEYOND THE RDBMS: WORKING WITH RELATIONAL DATA IN MARKLOGIC Rob Rudin, Solutions Specialist, MarkLogic Agenda Introduction The problem getting relational data into MarkLogic Demo how to do this SLIDE:

More information

12/05/2017. Geneva ServiceNow Security Management

12/05/2017. Geneva ServiceNow Security Management 12/05/2017 Security Management Contents... 3 Security Incident Response...3 Security Incident Response overview... 3 Get started with Security Incident Response... 6 Security incident creation... 40 Security

More information

Data Mining with Elastic

Data Mining with Elastic 2017 IJSRST Volume 3 Issue 3 Print ISSN: 2395-6011 Online ISSN: 2395-602X Themed Section: Science and Technology Data Mining with Elastic Mani Nandhini Sri, Mani Nivedhini, Dr. A. Balamurugan Sri Krishna

More information

Social Network Analytics on Cray Urika-XA

Social Network Analytics on Cray Urika-XA Social Network Analytics on Cray Urika-XA Mike Hinchey, mhinchey@cray.com Technical Solutions Architect Cray Inc, Analytics Products Group April, 2015 Agenda 1. Introduce platform Urika-XA 2. Technology

More information

Zombie Apocalypse Workshop

Zombie Apocalypse Workshop Zombie Apocalypse Workshop Building Serverless Microservices Danilo Poccia @danilop Paolo Latella @LatellaPaolo September 22 nd, 2016 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

More information

TURN DATA INTO ACTIONABLE INSIGHTS. Google Analytics Workshop

TURN DATA INTO ACTIONABLE INSIGHTS. Google Analytics Workshop TURN DATA INTO ACTIONABLE INSIGHTS Google Analytics Workshop The Value of Analytics Google Analytics is more than just numbers and stats. It tells the story of how people are interacting with your brand

More information

Deep Dive on AWS CodeStar

Deep Dive on AWS CodeStar Deep Dive on AWS CodeStar with AWS CI/CD workflow Tara E. Walker Technical Evangelist @taraw June 28, 2017 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda What is DevOps

More information

API Gateway Analytics

API Gateway Analytics API Gateway Analytics API Gateway Analytics April 2016 Copyright Copyright 2016 Akana, Inc. All rights reserved. Trademarks Akana, SOA Software, Policy Manager, Portfolio Manager, Repository Manager, Service

More information