Data Analytics with MATLAB
|
|
- Patrick Butler
- 6 years ago
- Views:
Transcription
1 Data Analytics with MATLAB Tackling the Challenges of Big Data Adrienne James, PhD MathWorks 7 th October The MathWorks, Inc. 1
2 Big Data in Industry ENERGY Asset Optimization FINANCE Market Risk, Regulatory AUTO Fleet Data Analysis AERO Maintenance, reliability Medical Devices Patient Outcomes 2
3 PROCESSING OPTIONS MATLAB RESTful interface to Cluster MATLAB Hadoop Streaming NoSQL connector (e.g. mongo) MATLAB / Java App accessing Cluster MATLAB Map-Reduce Components 3
4 Key takeaways New functions for analysing data that does not fit in memory on your desktop datastore mapreduce & that can scale for use with Hadoop Additional techniques for predictive modelling with large data Work with large data in memory on a cluster (spmd) Deploy predictive models Bring MATLAB analytics to the Web Share analytics with a wider community of users 4
5 How big is big? What characterises big data? Wikipedia Any collection of data sets so large and complex that it becomes difficult to process using traditional data processing applications. Volume : amount of data Velocity : speed at which data is generated or needs to be analysed Variety : range of data types/data sources 5
6 Considerations: Large Data Analytics Data Characteristics 1. Size & type of data? 2. Where is your data? 3. What hardware do you have access to? 4. Analysis Characteristics? 6
7 Example: Airline Delay Analysis Data BTS/RITA Airline On-Time Statistics 123.5M records, 29 fields Analysis Tasks Calculate delay patterns Visualize summaries Estimate & evaluate predictive models 7
8 Considerations: Large Data Analytics Airline Data Characteristics 1. Size & type of data? CSV Data 22 files 12GB 8
9 Considerations: Large Data Analytics Data Characteristics 1. Size & type of data? 2. Where is my data? Small subset available locally Entire data set stored elsewhere 9
10 Big Data Analysis with MATLAB start on the desktop Prototype Access Explore Share/Deploy Scale Work on your desktop Start simple Basic statistics Explore data 10
11 Demo: Exploring departure delays using datastore Explore approaches pre- & post- Start with a small subset What happens as the data size grows?. until eventually it does not fit in memory on your desktop machine datastore 11
12 Access & explore bigger data on the desktop more easily datastore Easily specify data set Single text file (or collection of text files) Database (using Database Toolbox) Preview data structure and format Customise data to import using column names Incrementally read subsets of the data airdata = datastore('*.csv'); airdata.selectedvariables = {'Distance', 'ArrDelay }; data = read(airdata); 12
13 Images datastore extends Data Access Landscape imread, SystemObjects streaming data ImageAdapter readtable pre- Binary files Databases Text files.mat files Import Tool fread, database post- load textscan memmapfile database. ODBCConnection textscan, readtable +programming matfile API datastore SMALL Increasing Data Size 13
14 Considerations: Large Data Analytics Data Characteristics 1. Size & type of data? 2. Where is your data? Small subset available locally Entire data set stored elsewhere 3. What hardware do you have access to? 4. Analysis Characteristics Initially, simple statistics & data exploration 14
15 Big Data Analysis with MATLAB Prototype Access Explore Share/Deploy Scale Start locally and then.. Scale to a cluster 15
16 What is? A Big Data Platform Datastore HDFS Node Data Map Reduce Node Data Map Reduce Node Data Map Reduce 16
17 A bit of audience participation mapreduce. 17
18 Introducing the mapreduce programming framework Example: Input files Intermediate files (local disk) Output files National popularity contest Steve 11% Emily 58% David 31% Newspaper pages For each page how many times do Steve, Emily and David get mentioned? Total mentions 18
19 mapreduce concept group counts Input files Intermediate files (local disk) Output files Map Reduce 19
20 Demo: Exploring mapreduce 20
21 Explore and Analyze Data on Hadoop Datastore HDFS MATLAB Distributed Computing Server Node Data Map Reduce Node Data Map Reduce MATLAB MapReduce Code Node Data Map Reduce Hadoop ds = datastore('hdfs://myserver:7867/data/file1.txt'); 21
22 Considerations: Large Data Analytics Data Characteristics 1. Size & type of data? 2. Where is your data? 3. What hardware do you have access to? Cluster 4. Analysis Characteristics Explore predictive modelling 22
23 Big Data Analysis with MATLAB Prototype Access Explore Share/Deploy Scale Options for more involved algorithms. may require all data in memory multiple iterations Scale to a cluster 23
24 Data Analytics Landscape COMPLEX More programming effort required iterative all data needed in memory at once Algorithm complexity easily partitioned; independent tasks SIMPLE vectorisation Built-in numerical & statistical algorithms gpuarray parfor spmd distributed arrays mapreduce SMALL Increasing Data Size 24
25 Instructions Reduced Data Working with more complex algorithms with data in memory on a cluster MDCS Client 25
26 Demo: Predictive Modelling Logistic Regression & Neural Networks 10 busiest airport origins & 7 largest airline carriers Explore & compare prediction quality of two models to predict flights delayed for more than 20 minutes Randomly partition data into test and training sets (cvpartition) Model #1: Logistic Regression Model #2: Neural Network Predictor Variables: DayOfWeek,Origin,Airline,DepTime,Distance 26
27 Single Program, Multiple Data Lab 1 Lab 2 Lab 3 Lab 4 >> mycode >> mycode >> mycode >> mycode 27
28 Single Program, Multiple Data Cluster Parallel Pool Lab 1 a = rand; Client spmd a = rand; end Lab 2 Lab 3 a = rand; a = rand; Lab 4 a = rand; 28
29 Explore Big Data Prototype Access Explore Share/Deploy Scale Subset data by filtering or variable selection and gain insight with visualization 29
30 Highlights: Airline Delay Analysis Start small Scale up Quick prototyping on large data Interactive exploration Interspersed visualizations Predictive modelling with large data 30
31 Deploy Prototype Access Explore Share/Deploy Scale Hadoop Desktop Enterprise Web 31
32 Web Analytics: Analysis of traffic around Paris 32
33 Predictive Data Analytics Load Demand Forecasting 33
34 Demo Station: 34
35 MATLAB on Hadoop Two modes of operation Execute mapreduce on Hadoop from your MATLAB desktop using MATLAB Distributed Computing Server Extends your desktop environment for use with Hadoop Execute algorithms within Hadoop MapReduce on data stored in HDFS Create standalone applications or libraries for deploying to production instances of Hadoop Locked down package for use in production environments Integration of MATLAB analytics with operational systems 35
36 Key takeaways New functions for analysing data that does not fit in memory on your desktop datastore mapreduce & that can scale for use with Hadoop Additional techniques for predictive modelling with large data Work with large data in memory on a cluster (spmd) Deploy predictive models Bring MATLAB analytics to the Web Share analytics with a wider community of users 36
37 New Big Data Capabilities in MATLAB Memory and Data Access 64-bit processors Memory Mapped Variables Disk Variables Databases Datastores Programming Constructs Streaming Block Processing Parallel-for loops GPU Arrays SPMD and Distributed Arrays MapReduce Platforms Desktop (Multicore, GPU) Clusters Cloud Computing (MDCS on EC2) Hadoop 37
38 Additional Resources MathWorks Web Site Big Data With MATAB: MapReduce & Hadoop: Machine Learning with MATLAB: A selection of user stories LiquidNet: Lean Data Analysis: The Awesome Data Dexterity of MATLAB Desktop Ruuki Metals: Steel Manufacturing Process Analytics CEESAR: Data Processing Framework Supporting Large Scale Driving Data Analysis Daimler AG: Analyzing Test Data from a Worldwide Fleet of Fuel Cell Vehicles 38
39 Thank You 39
Data Analytics with MATLAB. Tackling the Challenges of Big Data
Data Analytics with MATLAB Tackling the Challenges of Big Data How big is big? What characterises big data? Any collection of data sets so large and complex that it becomes difficult to process using traditional
More informationBIG DATA: Data Analytics with MATLAB Christophe POUILLOT Senior Consultant MathWorks
BIG DATA: Data Analytics with MATLAB Christophe POUILLOT Senior Consultant MathWorks christophe.pouillot@mathworks.fr 2014 The MathWorks, Inc. 1 Definition of Big Data Data so large and complex that it
More informationTackling Big Data Using MATLAB
Tackling Big Data Using MATLAB Alka Nair Application Engineer 2015 The MathWorks, Inc. 1 Building Machine Learning Models with Big Data Access Preprocess, Exploration & Model Development Scale up & Integrate
More information2015 The MathWorks, Inc. 1
2015 The MathWorks, Inc. 1 What s New in Release 2015a and 2014b Young Joon Lee Principal Application Engineer 2015 The MathWorks, Inc. 2 Agenda New Features Graphics and Data Design Performance Design
More informationGetting Started with MATLAB Francesca Perino
Getting Started with MATLAB Francesca Perino francesca.perino@mathworks.it 2014 The MathWorks, Inc. 1 Agenda MATLAB Intro Importazione ed esportazione Programmazione in MATLAB Tecniche per la velocizzazione
More informationParallel and Distributed Computing with MATLAB The MathWorks, Inc. 1
Parallel and Distributed Computing with MATLAB 2018 The MathWorks, Inc. 1 Practical Application of Parallel Computing Why parallel computing? Need faster insight on more complex problems with larger datasets
More informationBig Data con MATLAB. Lucas García The MathWorks, Inc. 1
Big Data con MATLAB Lucas García 2015 The MathWorks, Inc. 1 Agenda Introduction Remote Arrays in MATLAB Tall Arrays for Big Data Scaling up Summary 2 Architecture of an analytics system Data from instruments
More informationScaling MATLAB. for Your Organisation and Beyond. Rory Adams The MathWorks, Inc. 1
Scaling MATLAB for Your Organisation and Beyond Rory Adams 2015 The MathWorks, Inc. 1 MATLAB at Scale Front-end scaling Scale with increasing access requests Back-end scaling Scale with increasing computational
More informationWhat's New in MATLAB for Engineering Data Analytics?
What's New in MATLAB for Engineering Data Analytics? Will Wilson Application Engineer MathWorks, Inc. 2017 The MathWorks, Inc. 1 Agenda Data Types Tall Arrays for Big Data Machine Learning (for Everyone)
More informationMATLAB. Senior Application Engineer The MathWorks Korea The MathWorks, Inc. 2
1 Senior Application Engineer The MathWorks Korea 2017 The MathWorks, Inc. 2 Data Analytics Workflow Business Systems Smart Connected Systems Data Acquisition Engineering, Scientific, and Field Business
More informationWhat s New MATLAB and Simulink
What s New MATLAB and Simulink Ascension Vizinho-Coutry Application Engineer Manager MathWorks Ascension.Vizinho-Coutry@mathworks.fr Daniel Martins Application Engineer MathWorks Daniel.Martins@mathworks.fr
More informationBehind Today s Trends The Technologies Driving Change. Paul Smith Director Consulting Services
Behind Today s Trends The Technologies Driving Change Paul Smith Director Consulting Services Industry 4.0 Big Data Wearable Tech Cloud Computing Internet of Things MOOC Trends from 2009 Social Computing
More informationParallel and Distributed Computing with MATLAB Gerardo Hernández Manager, Application Engineer
Parallel and Distributed Computing with MATLAB Gerardo Hernández Manager, Application Engineer 2018 The MathWorks, Inc. 1 Practical Application of Parallel Computing Why parallel computing? Need faster
More informationWorking with Large Sets of Images in MATLAB Just Got Easier Avi Nehemiah
Working with Large Sets of Images in MATLAB Just Got Easier Avi Nehemiah 2015 The MathWorks, Inc. 1 Challenges Posed by Large Sets of Images 1. How do I import several thousand images into MATLAB? 2. Can
More informationIntegrate MATLAB Analytics into Enterprise Applications
Integrate Analytics into Enterprise Applications Aurélie Urbain MathWorks Consulting Services 2015 The MathWorks, Inc. 1 Data Analytics Workflow Data Acquisition Data Analytics Analytics Integration Business
More informationNavigating Big Data with MATLAB
Navigating Big Data with MATLAB Isaac Noh Application Engineer 2015 The MathWorks, Inc. 1 How big is big? What does Big Data even mean? Big data is a term for data sets that are so large or complex that
More informationAnalyzing Fleet Data with MATLAB and Spark
Analyzing Fleet Data with MATLAB and Spark Christoph Stockhammer 2018 The MathWorks, Inc. 1 What does Fleet mean? A Fleet is any group of things that can generate data and that you would like to look at
More informationIntegrate MATLAB Analytics into Enterprise Applications
Integrate Analytics into Enterprise Applications Dr. Roland Michaely 2015 The MathWorks, Inc. 1 Data Analytics Workflow Access and Explore Data Preprocess Data Develop Predictive Models Integrate Analytics
More informationIntegrating Advanced Analytics with Big Data
Integrating Advanced Analytics with Big Data Ian McKenna, Ph.D. Senior Financial Engineer 2017 The MathWorks, Inc. 1 The Goal SCALE! 2 The Solution tall 3 Agenda Introduction to tall data Case Study: Predicting
More informationIntegrating MATLAB Analytics into Business-Critical Applications Marta Wilczkowiak Senior Applications Engineer MathWorks
Integrating MATLAB Analytics into Business-Critical Applications Marta Wilczkowiak Senior Applications Engineer MathWorks 2015 The MathWorks, Inc. 1 Problem statement Democratization: Is it possible to
More informationTechnical Computing with MATLAB
Technical Computing with MATLAB University Of Bath Seminar th 19 th November 2010 Adrienne James (Application Engineering) 1 Agenda Introduction to MATLAB Importing, visualising and analysing data from
More informationScaling up MATLAB Analytics Marta Wilczkowiak, PhD Senior Applications Engineer MathWorks
Scaling up MATLAB Analytics Marta Wilczkowiak, PhD Senior Applications Engineer MathWorks 2013 The MathWorks, Inc. 1 Agenda Giving access to your analytics to more users Handling larger problems 2 When
More informationIntegrate MATLAB Analytics into Enterprise Applications
Integrate Analytics into Enterprise Applications Lyamine Hedjazi 2015 The MathWorks, Inc. 1 Data Analytics Workflow Preprocessing Data Business Systems Build Algorithms Smart Connected Systems Take Decisions
More informationEmbedded Technosolutions
Hadoop Big Data An Important technology in IT Sector Hadoop - Big Data Oerie 90% of the worlds data was generated in the last few years. Due to the advent of new technologies, devices, and communication
More informationIntroduction to MATLAB application deployment
Introduction to application deployment Antti Löytynoja, Application Engineer 2015 The MathWorks, Inc. 1 Technical Computing with Products Access Explore & Create Share Options: Files Data Software Data
More informationParallel Computing with MATLAB
Parallel Computing with MATLAB CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University
More informationChallenges for Data Driven Systems
Challenges for Data Driven Systems Eiko Yoneki University of Cambridge Computer Laboratory Data Centric Systems and Networking Emergence of Big Data Shift of Communication Paradigm From end-to-end to data
More informationWearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life
Wearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life Ch.Srilakshmi Asst Professor,Department of Information Technology R.M.D Engineering College, Kavaraipettai,
More informationOracle Big Data Connectors
Oracle Big Data Connectors Oracle Big Data Connectors is a software suite that integrates processing in Apache Hadoop distributions with operations in Oracle Database. It enables the use of Hadoop to process
More informationApplication Development and Deployment With MATLAB
Application Development and Deployment With Jean-Philippe Villaréal Application Engineer Applications Engineering Group MathWorks Benelux June 11, 2015 2015 The MathWorks, Inc. 1 Typical Industry Challenges
More informationProcess Big Data in MATLAB Using MapReduce
Process Big Data in MATLAB Using MapReduce This example shows how to use the datastore and mapreduce functions to process a large amount of file-based data. The MapReduce algorithm is a mainstay of many
More informationMit MATLAB auf der Überholspur Methoden zur Beschleunigung von MATLAB Anwendungen
Mit MATLAB auf der Überholspur Methoden zur Beschleunigung von MATLAB Anwendungen Frank Graeber Application Engineering MathWorks Germany 2013 The MathWorks, Inc. 1 Speed up the serial code within core
More informationAnalyzing Big Data with Microsoft R
Analyzing Big Data with Microsoft R 20773; 3 days, Instructor-led Course Description The main purpose of the course is to give students the ability to use Microsoft R Server to create and run an analysis
More informationDATA SCIENCE USING SPARK: AN INTRODUCTION
DATA SCIENCE USING SPARK: AN INTRODUCTION TOPICS COVERED Introduction to Spark Getting Started with Spark Programming in Spark Data Science with Spark What next? 2 DATA SCIENCE PROCESS Exploratory Data
More informationSharing and Deploying MATLAB Applications
Sharing and Deploying Applications Dr. Roland Michaely Applications Engineer 2015 The MathWorks, Inc. 1 ICICI Securities Develops Online Financial Planning and Advisory Platform Challenge Launch a scalable
More informationOverview. Audience profile. At course completion. Course Outline. : 20773A: Analyzing Big Data with Microsoft R. Course Outline :: 20773A::
Module Title Duration : 20773A: Analyzing Big Data with Microsoft R : 3 days Overview The main purpose of the course is to give students the ability to use Microsoft R Server to create and run an analysis
More informationSpecialist ICT Learning
Specialist ICT Learning APPLIED DATA SCIENCE AND BIG DATA ANALYTICS GTBD7 Course Description This intensive training course provides theoretical and technical aspects of Data Science and Business Analytics.
More informationWebinar Series TMIP VISION
Webinar Series TMIP VISION TMIP provides technical support and promotes knowledge and information exchange in the transportation planning and modeling community. Today s Goals To Consider: Parallel Processing
More informationAccelerate Big Data Insights
Accelerate Big Data Insights Executive Summary An abundance of information isn t always helpful when time is of the essence. In the world of big data, the ability to accelerate time-to-insight can not
More informationHDInsight > Hadoop. October 12, 2017
HDInsight > Hadoop October 12, 2017 2 Introduction Mark Hudson >20 years mixing technology with data >10 years with CapTech Microsoft Certified IT Professional Business Intelligence Member of the Richmond
More informationIBM Data Science Experience White paper. SparkR. Transforming R into a tool for big data analytics
IBM Data Science Experience White paper R Transforming R into a tool for big data analytics 2 R Executive summary This white paper introduces R, a package for the R statistical programming language that
More informationBehind Today s Trends The Technologies Driving Change. Jason Ghidella Simulink Product Manager MathWorks
Behind Today s Trends The Technologies Driving Change Jason Ghidella Simulink Product Manager MathWorks Industry 4.0 Big Data Wearable Tech Cloud Computing Internet of Things MOOC 3 In prior years Smart
More informationWindows 10 IoT Overview. Microsoft Corporation
Windows 10 IoT Overview Microsoft Corporation 25 $7.2 BILLION TRILLION Connected things will by 2020 be in use by 2020 worldwide market for IoT solutions IDC: Worldwide and Regional Internet of Things
More informationOracle Big Data Science
Oracle Big Data Science Tim Vlamis and Dan Vlamis Vlamis Software Solutions 816-781-2880 www.vlamis.com @VlamisSoftware Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri
More informationMulticore Computer, GPU 및 Cluster 환경에서의 MATLAB Parallel Computing 기능
Multicore Computer, GPU 및 Cluster 환경에서의 MATLAB Parallel Computing 기능 성호현 MathWorks Korea 2012 The MathWorks, Inc. 1 A Question to Consider Do you want to speed up your algorithms? If so Do you have a multi-core
More informationSFO15-TR6: Hadoop on ARM
SFO15-TR6: Hadoop on ARM Presented by Nachiket Bhoyar Steve Capper Date Wednesday 23 September 2015 Nachiket Bhoyar Steve Capper Event SFO15 Agenda 1. Quick intro to Hadoop stack. 2. Summary of our work.
More information2^48 - keine Angst vor großen Datensätzen in MATLAB
2^48 - keine Angst vor großen Datensätzen in MATLAB 9. July 2014 Rainer Mümmler Application Engineering Group 2014 The MathWorks, Inc. 1 Challenges with Large Data Sets Out of memory Running out of address
More informationThe Evolution of Big Data Platforms and Data Science
IBM Analytics The Evolution of Big Data Platforms and Data Science ECC Conference 2016 Brandon MacKenzie June 13, 2016 2016 IBM Corporation Hello, I m Brandon MacKenzie. I work at IBM. Data Science - Offering
More information2013 AWS Worldwide Public Sector Summit Washington, D.C.
2013 AWS Worldwide Public Sector Summit Washington, D.C. EMR for Fun and for Profit Ben Butler Sr. Manager, Big Data butlerb@amazon.com @bensbutler Overview 1. What is big data? 2. What is AWS Elastic
More informationApache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context
1 Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes
More informationOptimizing Your Analytics Life Cycle with SAS & Teradata. Rick Lower
Optimizing Your Analytics Life Cycle with SAS & Teradata Rick Lower 1 Agenda The Analytic Life Cycle Common Problems SAS & Teradata solutions Analytical Life Cycle Exploration Explore All Your Data Preparation
More informationWhat s New in MATLAB May 16, 2017
What s New in MATLAB May 16, 2017 2017 The MathWorks, Inc. 1 Agenda MATLAB Foundation Working with Data Building & Sharing MATLAB Applications Application Specific Enhancements Summary and Wrap-up 2 Agenda
More informationAPI, DEVOPS & MICROSERVICES
API, DEVOPS & MICROSERVICES RAPID. OPEN. SECURE. INNOVATION TOUR 2018 April 26 Singapore 1 2018 Software AG. All rights reserved. For internal use only THE NEW ARCHITECTURAL PARADIGM Microservices Containers
More informationINTRODUCTION TO MATLAB PARALLEL COMPUTING TOOLBOX
INTRODUCTION TO MATLAB PARALLEL COMPUTING TOOLBOX Keith Ma ---------------------------------------- keithma@bu.edu Research Computing Services ----------- help@rcs.bu.edu Boston University ----------------------------------------------------
More informationOracle Big Data Science IOUG Collaborate 16
Oracle Big Data Science IOUG Collaborate 16 Session 4762 Tim and Dan Vlamis Tuesday, April 12, 2016 Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri Developed 200+ Oracle
More informationAdvanced Software Development with MATLAB
Advanced Software Development with MATLAB From research and prototype to production 2017 The MathWorks, Inc. 1 What Are Your Software Development Concerns? Accuracy Compatibility Cost Developer Expertise
More informationIntroducing SAS Model Manager 15.1 for SAS Viya
ABSTRACT Paper SAS2284-2018 Introducing SAS Model Manager 15.1 for SAS Viya Glenn Clingroth, Robert Chu, Steve Sparano, David Duling SAS Institute Inc. SAS Model Manager has been a popular product since
More informationUsing Parallel Computing Toolbox to accelerate the Video and Image Processing Speed. Develop parallel code interactively
Using Parallel Computing Toolbox to accelerate the Video and Image Processing Speed Presenter: Claire Chuang TeraSoft Inc. Agenda Develop parallel code interactively parallel applications for faster processing
More informationMATLAB is a multi-paradigm numerical computing environment fourth-generation programming language. A proprietary programming language developed by
1 MATLAB is a multi-paradigm numerical computing environment fourth-generation programming language. A proprietary programming language developed by MathWorks In 2004, MATLAB had around one million users
More informationAn Introduction to Apache Spark
An Introduction to Apache Spark 1 History Developed in 2009 at UC Berkeley AMPLab. Open sourced in 2010. Spark becomes one of the largest big-data projects with more 400 contributors in 50+ organizations
More informationIntelligence for the connected world How European First-Movers Manage IoT Analytics Projects Successfully
Intelligence for the connected world How European First-Movers Manage IoT Analytics Projects Successfully Thomas Rohrmann, Michael Probst Analytics Experience 2016, Rome #analyticsx C opyr i g ht 2016,
More informationBig Data and Object Storage
Big Data and Object Storage or where to store the cold and small data? Sven Bauernfeind Computacenter AG & Co. ohg, Consultancy Germany 28.02.2018 Munich Volume, Variety & Velocity + Analytics Velocity
More informationPervasive DataRush TM
Pervasive DataRush TM Parallel Data Analysis with KNIME www.pervasivedatarush.com Company Overview Global Software Company Tens of thousands of users across the globe Americas, EMEA, Asia ~230 employees
More informationFit für die MATLAB EXPO
Fit für die MATLAB EXPO Eine kurze Einführung in MATLAB Michael Glaßer 2015 The MathWorks, Inc. 1 Hinweise für Betrachter der PDF Version Die Folien sind eher eine unterstützende Zusammenfassung Der Vortrag
More informationOverview and Practical Application of Machine Learning in Pricing
Overview and Practical Application of Machine Learning in Pricing 2017 CAS Spring Meeting May 23, 2017 Duncan Anderson and Claudine Modlin (Willis Towers Watson) Mark Richards (Allstate Insurance Company)
More informationSimplifier la mise en production d applications MATLAB. Marc Wolff Application Engineer MathWorks 1
Simplifier la mise en production d applications MATLAB Marc Wolff Application Engineer MathWorks marc.wolff@mathworks.fr 1 What if you could turn a MATLAB application into an interactive standalone application?
More informationDeep Learning mit PowerAI - Ein Überblick
Stephen Lutz Deep Learning mit PowerAI - Open Group Master Certified IT Specialist Technical Sales IBM Cognitive Infrastructure IBM Germany Ein Überblick Stephen.Lutz@de.ibm.com What s that? and what s
More informationGoogle GSuite Intro Demo of GSuite and GCP integration
Google GSuite Intro Demo of GSuite and GCP integration May 2017 Sara Djelassi - Sales Steve Mansfield - PSO 7 Cloud products with 1 billion users ML is core to differentiating Google services Search Search
More informationBig Data with Hadoop Ecosystem
Diógenes Pires Big Data with Hadoop Ecosystem Hands-on (HBase, MySql and Hive + Power BI) Internet Live http://www.internetlivestats.com/ Introduction Business Intelligence Business Intelligence Process
More informationSpotfire Data Science with Hadoop Using Spotfire Data Science to Operationalize Data Science in the Age of Big Data
Spotfire Data Science with Hadoop Using Spotfire Data Science to Operationalize Data Science in the Age of Big Data THE RISE OF BIG DATA BIG DATA: A REVOLUTION IN ACCESS Large-scale data sets are nothing
More informationBig Data and Hadoop. Course Curriculum: Your 10 Module Learning Plan. About Edureka
Course Curriculum: Your 10 Module Learning Plan Big Data and Hadoop About Edureka Edureka is a leading e-learning platform providing live instructor-led interactive online training. We cater to professionals
More informationTalend Big Data Sandbox. Big Data Insights Cookbook
Overview Pre-requisites Setup & Configuration Hadoop Distribution Download Demo (Scenario) Overview Pre-requisites Setup & Configuration Hadoop Distribution Demo (Scenario) About this cookbook What is
More informationR Language for the SQL Server DBA
R Language for the SQL Server DBA Beginning with R Ing. Eduardo Castro, PhD, Principal Data Analyst Architect, LP Consulting Moderated By: Jose Rolando Guay Paz Thank You microsoft.com idera.com attunity.com
More informationMATLAB 에서작업한응용프로그램의공유 : App 에서부터웹서비스까지
MATLAB 에서작업한응용프로그램의공유 : App 에서부터웹서비스까지 Application Engineer 엄준상 2013 The MathWorks, Inc. 1 Application Deployment with MATLAB Suppliers MATLAB Author Clients Organization Group Members Collaborators 2
More information2/26/2017. Originally developed at the University of California - Berkeley's AMPLab
Apache is a fast and general engine for large-scale data processing aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes Low latency: sub-second
More informationDetect, Diagnose and Solve Problems with Application Insights
Detect, Diagnose and Solve Problems with Application Insights Vishesh Oberoi Technical Evangelist, Microsoft @ovishesh visho@microsoft.com The Cloud for Modern Business Vishesh Oberoi Technical Evangelist,
More informationEvolving To The Big Data Warehouse
Evolving To The Big Data Warehouse Kevin Lancaster 1 Copyright Director, 2012, Oracle and/or its Engineered affiliates. All rights Insert Systems, Information Protection Policy Oracle Classification from
More informationCloud Computing 3. CSCI 4850/5850 High-Performance Computing Spring 2018
Cloud Computing 3 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning
More informationMixApart: Decoupled Analytics for Shared Storage Systems. Madalin Mihailescu, Gokul Soundararajan, Cristiana Amza University of Toronto and NetApp
MixApart: Decoupled Analytics for Shared Storage Systems Madalin Mihailescu, Gokul Soundararajan, Cristiana Amza University of Toronto and NetApp Hadoop Pig, Hive Hadoop + Enterprise storage?! Shared storage
More informationMit MATLAB auf der Überholspur Methoden zur Beschleunigung von MATLAB Anwendungen
Mit MATLAB auf der Überholspur Methoden zur Beschleunigung von MATLAB Anwendungen Michael Glaßer Application Engineering MathWorks Germany 2014 The MathWorks, Inc. 1 Key Takeaways 1. Speed up your serial
More informationLogging Reservoir Evaluation Based on Spark. Meng-xin SONG*, Hong-ping MIAO and Yao SUN
2017 2nd International Conference on Wireless Communication and Network Engineering (WCNE 2017) ISBN: 978-1-60595-531-5 Logging Reservoir Evaluation Based on Spark Meng-xin SONG*, Hong-ping MIAO and Yao
More informationMCT620 Distributed Systems Module Handbook
MCT620 Distributed Systems Module Handbook Master of Science in Software Engineering & Database Technologies (MScSED) Diploma in Software Engineering Table of Contents 1 Module Details 2 1.1 Module Description
More informationBig Data The end of Data Warehousing?
Big Data The end of Data Warehousing? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Big data, data warehousing, advanced analytics, Hadoop, unstructured data Introduction If there was an Unwort
More informationBringing Data to Life
Bringing Data to Life Data management and Visualization Techniques Benika Hall Rob Harrison Corporate Model Risk March 16, 2018 Introduction Benika Hall Analytic Consultant Wells Fargo - Corporate Model
More informationRandom Forests for Big Data
Random Forests for Big Data R. Genuer a, J.-M. Poggi b, C. Tuleau-Malot c, N. Villa-Vialaneix d a Bordeaux University c Nice University b Orsay University d INRA Toulouse October 27, 2017 CNAM, Paris Outline
More informationAnalyzing Flight Data
IBM Analytics Analyzing Flight Data Jeff Carlson Rich Tarro July 21, 2016 2016 IBM Corporation Agenda Spark Overview a quick review Introduction to Graph Processing and Spark GraphX GraphX Overview Demo
More informationJava in the world of Software AG JCP EC May 2018
Java in the world of Software AG JCP EC May 2018 Georgi Stanev Architect Software AG 2017 Software AG. All rights reserved. History of the Software AG 1969 The concept for an adaptable and extremely versatile
More informationPutting it all together: Creating a Big Data Analytic Workflow with Spotfire
Putting it all together: Creating a Big Data Analytic Workflow with Spotfire Authors: David Katz and Mike Alperin, TIBCO Data Science Team In a previous blog, we showed how ultra-fast visualization of
More informationArcGIS Enterprise: Architecture & Deployment. Anthony Myers
ArcGIS Enterprise: Architecture & Deployment Anthony Myers 1 2 3 4 5 Web GIS Overview of ArcGIS Enterprise Federation & Hosted Server Deployment Patterns Implementation 1 Web GIS ArcGIS Enabling GIS for
More informationBig Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours
Big Data Hadoop Developer Course Content Who is the target audience? Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Complete beginners who want to learn Big Data Hadoop Professionals
More informationBringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security
Bringing OpenStack to the Enterprise An enterprise-class solution ensures you get the required performance, reliability, and security INTRODUCTION Organizations today frequently need to quickly get systems
More informationBIG DATA COURSE CONTENT
BIG DATA COURSE CONTENT [I] Get Started with Big Data Microsoft Professional Orientation: Big Data Duration: 12 hrs Course Content: Introduction Course Introduction Data Fundamentals Introduction to Data
More informationBig Data. Big Data Analyst. Big Data Engineer. Big Data Architect
Big Data Big Data Analyst INTRODUCTION TO BIG DATA ANALYTICS ANALYTICS PROCESSING TECHNIQUES DATA TRANSFORMATION & BATCH PROCESSING REAL TIME (STREAM) DATA PROCESSING Big Data Engineer BIG DATA FOUNDATION
More informationOverview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::
Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized
More informationIntelligent Enterprise meets Science of Where. Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018
Intelligent Enterprise meets Science of Where Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018 Value The Esri & SAP journey Customer Impact Innovation Track Record Customer
More informationFlash Storage Complementing a Data Lake for Real-Time Insight
Flash Storage Complementing a Data Lake for Real-Time Insight Dr. Sanhita Sarkar Global Director, Analytics Software Development August 7, 2018 Agenda 1 2 3 4 5 Delivering insight along the entire spectrum
More informationBig Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018
Big Data com Hadoop Impala, Hive e Spark VIII Sessão - SQL Bahia 03/03/2018 Diógenes Pires Connect with PASS Sign up for a free membership today at: pass.org #sqlpass Internet Live http://www.internetlivestats.com/
More information2015 The MathWorks, Inc. 1
2015 The MathWorks, Inc. 1 웨어러블디바이스의신호분석 Senior Application Engineer 김종남 2015 The MathWorks, Inc. 2 Agenda Internet Of Things Signal Analytics and Classification : On data from wareable and mobile device
More informationWelcome to the New Era of Cloud Computing
Welcome to the New Era of Cloud Computing Aaron Kimball The web is replacing the desktop 1 SDKs & toolkits are there What about the backend? Image: Wikipedia user Calyponte 2 Two key concepts Processing
More informationProcessing of big data with Apache Spark
Processing of big data with Apache Spark JavaSkop 18 Aleksandar Donevski AGENDA What is Apache Spark? Spark vs Hadoop MapReduce Application Requirements Example Architecture Application Challenges 2 WHAT
More information