What's New in MATLAB for Engineering Data Analytics?

Similar documents
Tackling Big Data Using MATLAB

Analyzing Fleet Data with MATLAB and Spark

Integrate MATLAB Analytics into Enterprise Applications

Scaling MATLAB. for Your Organisation and Beyond. Rory Adams The MathWorks, Inc. 1

What s New in MATLAB May 16, 2017

Integrate MATLAB Analytics into Enterprise Applications

Integrate MATLAB Analytics into Enterprise Applications

Navigating Big Data with MATLAB

MATLAB. Senior Application Engineer The MathWorks Korea The MathWorks, Inc. 2

Integrating Advanced Analytics with Big Data

Advanced Software Development with MATLAB

Big Data con MATLAB. Lucas García The MathWorks, Inc. 1

Sharing and Deploying MATLAB Programs Sundar Umamaheshwaran Amit Doshi Application Engineer-Technical Computing

MATLAB 에서작업한응용프로그램의공유 : App 에서부터웹서비스까지

Applying Supervised Learning

Data Analytics with MATLAB. Tackling the Challenges of Big Data

Parallel and Distributed Computing with MATLAB Gerardo Hernández Manager, Application Engineer

Integrating MATLAB Analytics into Business-Critical Applications Marta Wilczkowiak Senior Applications Engineer MathWorks

Parallel and Distributed Computing with MATLAB The MathWorks, Inc. 1

2015 The MathWorks, Inc. 1

Introduction to MATLAB application deployment

Driving Efficiency with MATLAB and Simulink. Centurion Lake Hotel 25 May 2017

Data Analytics with MATLAB

What s New for MATLAB David Willingham

MATLAB is a multi-paradigm numerical computing environment fourth-generation programming language. A proprietary programming language developed by

System Requirements & Platform Availability by Product for R2016b

Sharing and Deploying MATLAB Applications

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

Application Development and Deployment With MATLAB

Design Challenges for Sensor Data Analytics in Internet of Things (IoT)

Tutorial on Machine Learning Tools

Optimizing and Accelerating Your MATLAB Code

MATH36032 Problem Solving by Computer. Data Science

Putting it all together: Creating a Big Data Analytic Workflow with Spotfire

Introduction to Deep Learning in Signal Processing & Communications with MATLAB

Getting Started with MATLAB Francesca Perino

Simplifier la mise en production d applications MATLAB. Marc Wolff Application Engineer MathWorks 1

개발과정에서의 MATLAB 과 C 의연동 ( 영상처리분야 )

Fit für die MATLAB EXPO

What s New MATLAB and Simulink

POINT-CLOUD PROCESSING USING HDL CODER. April 17th 2018

Přehled novinek v SQL Server 2016

BIG DATA COURSE CONTENT

What s New in Computational Finance

Overview. Audience profile. At course completion. Course Outline. : 20773A: Analyzing Big Data with Microsoft R. Course Outline :: 20773A::

What s New in MATLAB and Simulink The MathWorks, Inc. 1

IBM Data Science Experience White paper. SparkR. Transforming R into a tool for big data analytics

About Intellipaat. About the Course. Why Take This Course?

What s New in MATLAB and Simulink

Simulink as Your Enterprise Simulation Platform

BIG DATA: Data Analytics with MATLAB Christophe POUILLOT Senior Consultant MathWorks

Process Big Data in MATLAB Using MapReduce

Decision models for the Digital Economy

As a reference, please find a version of the Machine Learning Process described in the diagram below.

2015 The MathWorks, Inc. 1

Analyzing Big Data with Microsoft R

##SQLSatMadrid. Project [Vélib by Cortana]

SAS (Statistical Analysis Software/System)

microsoft

2015 The MathWorks, Inc. 1

What s New in MATLAB and Simulink Young Joon Lee Principal Application Engineer

Modern Data Warehouse The New Approach to Azure BI

Reasons to Migrate from ProClarity to Pyramid Analytics

Deploying MATLAB Applications in Excel, Java, and.net Environments

Embarquez votre Intelligence Artificielle (IA) sur CPU, GPU et FPGA

Index A, B, C. Rank() function, steps, 199 Cloud services, 2 Comma-separated value (CSV), 27

Introduction to MATLAB for Finance

MACHINE LEARNING Example: Google search

Vertica Now. Quarterly Customer Webinar. January 19, 2017

Deep Learning for Computer Vision with MATLAB By Jon Cherrie

Scalable Machine Learning in R. with H2O

MathWorks Products and Prices North America January 2018

Technical Computing with MATLAB

컴퓨터비전의최신기술 : Deep Learning, 3D Vision and Embedded Vision

Hardware-Software Co-Design and Prototyping on SoC FPGAs Puneet Kumar Prateek Sikka Application Engineering Team

ODL based AI/ML for Networks Prem Sankar Gopannan, Ericsson YuLing Chen, Cisco

Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context

MCSE Cloud Platform & Infrastructure CLOUD PLATFORM & INFRASTRUCTURE.

Developing Optimization Algorithms for Real-World Applications

Applied Machine Learning

MATLAB Introduction. Ron Ilizarov Application Engineer

Flash Storage Complementing a Data Lake for Real-Time Insight

MATLAB as a Financial Engineering Development Platform Delivering Financial / Quantitative Models to the Enterprise Eugene McGoldrick

CloudExpo November 2017 Tomer Levi

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

Microsoft. Exam Questions Perform Data Engineering on Microsoft Azure HDInsight (beta) Version:Demo

2015 The MathWorks, Inc. 1

Stages of Data Processing

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)

Oracle Big Data Science IOUG Collaborate 16

Déploiement embarqué et connectivité hardware avec MATLAB et Simulink

STREAMLINED CERTIFICATION PATHS

Spotfire Data Science with Hadoop Using Spotfire Data Science to Operationalize Data Science in the Age of Big Data

Oracle Big Data Science

Data Science Bootcamp Curriculum. NYC Data Science Academy

Audio Signal Processing in MATLAB Youssef Abdelilah Senior Product Manager

Visual Traffic Jam Analysis based on Trajectory Data

Data Science. Data Analyst. Data Scientist. Data Architect

Exam Questions

Transcription:

What's New in MATLAB for Engineering Data Analytics? Will Wilson Application Engineer MathWorks, Inc. 2017 The MathWorks, Inc. 1

Agenda Data Types Tall Arrays for Big Data Machine Learning (for Everyone) Deploying your Analytics 2

Example Use Case: Vehicle Log (MDF) File Analysis MDF (Measurement Data Format) is the de facto standard for measurement data in the automotive industry. Official ASAM standard Typically file extensions include:.mdf,.mf4, &.dat. Goal: Use MATLAB to process and analyze MDF data. Considerations: Data are MDF format, could be many files Data is messy May or may not know what you are looking for Compute statistics, report format 3

MATLAB Language Enhancements Expressing more types of data naturally Numeric datetime duration double, single, logical categorical calendarduration str Heterogeneous str timetable str structure cell table str Text c h char {c h} cell string str string tall 4

Agenda Data Types Tall Arrays for Big Data Machine Learning (for Everyone) Deploying your Analytics 5

Key Concept - MATLAB datastore A datastore is an object for reading a single file or a collection of files or data. Idea of properties. Data dependent images, tabular text, user defined. Onramp to Big Data. Properties of the datastore 6

tall arrays New data type designed for data that doesn t fit into memory Many rows (hence tall ) Looks like a normal MATLAB array Supports numeric types, tables, datetimes, strings, etc Supports several hundred functions for basic math, stats, indexing, etc. Statistics and Machine Learning Toolbox support (clustering, classification, etc.) 7

tall arrays Single Machine Memory tall array Process Single Machine Memory Automatically breaks data up into small chunks that fit in memory Tall arrays scan through the dataset one chunk at a time Processing code for tall arrays is the same as ordinary arrays 8

tall arrays Single Machine Memory tall array Process Single Machine Memory With Parallel Computing Toolbox, process several chunks at once Process Single Machine Memory Can scale up to clusters with MATLAB Distributed Computing Server Cluster of Machines Memory Process Single Machine Memory Process Single Machine Memory 9

Big Data Workflow With Tall Data Types Access Data Tall Data Types Exploration & Pre-processing Machine Learning Text Spreadsheet (Excel) Database (SQL) Custom Reader table cell double numeric cellstr datetime categorical Numeric functions Basic stats reductions Date/Time capabilities Categorical String processing Table wrangling Missing Data handling Summary visualizations: Histogram/histogram2 Kernel density plot Bin-scatter Linear Model Logistic Regression Discriminant analysis K-means PCA Random data sampling Summary statistics Datastores for common types of structured data Tall versions of commonly used MATLAB data types Hundreds of pre-built functions Key statistics and machine learning algorithms MATLAB framework for data that does not fit into memory 10

Example Use Case: Create a Predictive Model Goal: Contribute to a Ride Sharing project by creating a model to predict the cost of a Taxi Ride in New York City. Considerations: Raw data are.csv taxi ride log files File size ranges from 22 26MB The full data set contains > 2 million rows Start with linear regression (to facilitate prediction) Scale up initial work RIDE SHARING 11

Agenda Data Types Tall Arrays for Big Data Machine Learning (for Everyone) Deploying your Analytics 12

When Might you Consider Machine Learning? Problem is too complex for hand written rules or equations Because algorithms can learn complex nonlinear relationships Speech Recognition Object Recognition Engine Health Monitoring Program needs to adapt with changing data update as more data becomes available Weather Forecasting Energy Load Forecasting Stock Market Prediction Program needs to scale learn efficiently from very large data sets IoT Analytics Taxi Availability Airline Flight Delays 13

Statistics and Machine Learning Toolbox Making Machine Learning Easy and Accessible Classification Learner App I would have never attempted machine learning if this app was not available. 1-click parallel computing Big data algorithms (using tall arrays) C code generation for predictive models (requires MATLAB Coder) New methods for feature selection and hyperparameter tuning Regression Learner App 14

Example Use Case: Day-Ahead Load Forecasting Goal: Create and implement a tool for easy and accurate computation of day-ahead system load forecast Considerations: Multiple data sources Significant data clean up is required Predictive model must be accurate Easily deploy to production environment 15

Agenda Data Types Tall Arrays for Big Data Machine Learning (for Everyone) Deploying your Analytics 16

Integrate Analytics with Systems Embedded Hardware Enterprise Systems C, C++ HDL PLC Standalone Application Excel Add-in Hadoop/ Spark C/C++ Java ++ Python.NET MATLAB Production Server MATLAB Runtime 17

Technology Stack for Enterprise Integration Many possible solutions. MathWorks can help! Data Analytics Business System Databases Visualization Cloud Storage Azure Blob Azure SQL MATLAB Production Server Web Request Broker IoT Custom App Public Cloud Private Cloud 18

Key Takeaways MATLAB data types enable you to more efficiently tackle Data Analytics problems. tall Arrays for out of memory data sets. Use MATLAB apps to get started (or do more) Machine Learning. MATLAB based Analytics run where you need them to - Embedded or Enterprise IT systems. 19

2017 The MathWorks, Inc. MATLAB and Simulink are registered trademarks of The MathWorks, Inc. See www.mathworks.com/trademarks for a list of additional trademarks. Other product or brand names may be trademarks or registered trademarks of their respective holders. 2017 The MathWorks, Inc. 20