DRYAD / DRYADLINQ OVERVIEW. Xavier Pillons, Principal Program Manager, Technical Computing Customer Advocate Team
1 DRYAD / DRYADLINQ OVERVIEW
Xavier Pillons, Principal Program Manager, Technical Computing Customer Advocate Team
2 Data Intensive Scalable Computing (DISC) Market
Customer needs for DISC lie on a spectrum.
One extreme is analytics targeting a traditional data warehouse. The analyst knows the cube he or she wants to build, and the analyst knows the data sources.
The other extreme is analyzing raw unstructured data. The analyst does not know exactly what the data contains, nor what cube would be justified. The analyst needs to do ad-hoc analyses that may never be run again.
HPC+Dryad targets the raw unstructured data extreme.
3 HPC+Dryad complements Microsoft's existing data platform assets
Microsoft already has great data platform assets, including PowerPivot, SQL Server Integration Services (SSIS), and Parallel Data Warehouse (PDW).
HPC+Dryad's focus on raw unstructured data analytics enables new solutions that incorporate multiple assets.
E.g., analyze raw unstructured data using HPC+Dryad, then pipe the results to SSIS and apply the rest of the BI stack.
4 How is Microsoft responding?
5 Dryad is a technology MS uses internally to address the need Hadoop addresses for Yahoo
Dryad has powered Microsoft Search for 5 years on thousands of servers.
Dryad is at the cutting edge of research on distributed computing.
    Multiple publications at top-tier research conferences.
Dryad is a proven MS internal technology and HPC is making it available to customers.
    Dryad is already available as a CTP.
6 Dryad / DryadLinq / DSC
Stack layers, top to bottom (reconstructed from the slide diagram):
Tools: Visual Studio, Excel, etc.; Visual Studio for C#/LINQ
Languages and Libraries: C#, C++; C#/LINQ for Dryad
Distributed runtimes: MPI, SOA, Dryad
Cluster and cloud services: HPC provisioning, management, etc.; Distributed Storage Catalog
Platform: Windows Server, Azure
Callout: Bind individual NTFS shares together to support the Dryad distributed runtime.
7 Dryad allows HPC to solve a new class of problems
CPU Oriented
    MPI: Optimize CPU utilization for tightly coupled problems like climate modeling, car crash simulation, etc.
    SOA: Optimize CPU utilization for loosely coupled problems like financial product pricing, etc.
Data Oriented
    Dryad: Optimize for data locality rather than CPU utilization to support jobs that are primarily bound on disk I/O.
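To make "optimize for data locality" concrete, here is a minimal Python sketch of a locality-first placement policy. It is not Dryad's scheduler: the function name `place_tasks`, the node names, and the slot counts are all hypothetical, and the greedy policy is only one possible way to prefer nodes that already hold the data.

```python
def place_tasks(partitions, free_slots):
    """Assign each input partition to a node, preferring a node that stores it.

    partitions: dict mapping partition name -> list of nodes holding a replica
    free_slots: dict mapping node name -> number of tasks it can still accept
    Returns a dict mapping partition name -> (node, is_local).
    """
    slots = dict(free_slots)  # copy so the caller's dict is not mutated
    placement = {}
    for part, replicas in partitions.items():
        # First choice: a replica holder with a free slot (no network read).
        local = [node for node in replicas if slots.get(node, 0) > 0]
        if local:
            node, is_local = local[0], True
        else:
            # Fallback: the node with the most free slots; data must be copied.
            node = max(slots, key=lambda n: slots[n])
            if slots[node] <= 0:
                raise RuntimeError("no free slots left in the cluster")
            is_local = False
        slots[node] -= 1
        placement[part] = (node, is_local)
    return placement


# Hypothetical cluster: three nodes with one slot each, three log partitions.
partitions = {"log0": ["n1"], "log1": ["n1"], "log2": ["n2"]}
free_slots = {"n1": 1, "n2": 1, "n3": 1}
placement = place_tasks(partitions, free_slots)
local_count = sum(is_local for _, is_local in placement.values())
```

In this example only `log0` runs where its data lives. The greedy policy sends `log1` to `n2`, which then forces `log2` off its own node, so a scheduler that truly optimizes for locality would need to look ahead rather than place partitions one at a time.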
8 2-D Piping
Unix Pipes: 1-D
    grep | sed | sort | awk | perl
Dryad: 2-D (the number after each stage is its count of parallel instances)
    grep x1000 | sed x500 | sort x1000 | awk x500 | perl x50
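The 2-D idea can be sketched in Python as a pipeline whose stages each run over many partitions at once. This is an illustrative analogue only, using a local thread pool instead of a cluster; the names `run_stage`, `grep_stage`, and `sort_stage` and the sample data are hypothetical.

```python
import heapq
from concurrent.futures import ThreadPoolExecutor


def run_stage(func, partitions, workers=4):
    """Apply one pipeline stage to every partition in parallel."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(func, partitions))


def grep_stage(lines):
    """Keep only the lines that mention 'error' (stands in for grep)."""
    return [line for line in lines if "error" in line]


def sort_stage(lines):
    """Sort the lines within one partition (stands in for sort)."""
    return sorted(lines)


# Two input partitions: the second dimension of the pipeline.
partitions = [
    ["b error", "a ok", "a error"],
    ["d error", "c error", "e ok"],
]

# First dimension: stages run one after another, each across all partitions.
for stage in (grep_stage, sort_stage):
    partitions = run_stage(stage, partitions)

# A final stage of width 1 merges the sorted partitions into one output.
merged = list(heapq.merge(*partitions))
```

The first two stages keep the same width here. Changing width between stages, as in the slide's 1000 to 500 step, needs a repartitioning step, which the final merge shows in its simplest form (width 2 to width 1).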
9 A Dryad Job
Directed acyclic graph (DAG)
    Inputs
    Processing vertices
    Channels (file, fifo, pipe)
    Outputs
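As a rough illustration of running a job expressed as a DAG, the Python sketch below executes vertices in dependency order with the standard-library `graphlib` module (Python 3.9+). It is not the Dryad runtime: `run_dag` and the vertex names are hypothetical, and in-memory values stand in for file, fifo, or pipe channels.

```python
from graphlib import TopologicalSorter


def run_dag(vertices, edges, inputs):
    """Run every vertex once all of its upstream results are available.

    vertices: dict name -> function taking a list of upstream outputs
    edges:    dict name -> list of upstream names (inputs or other vertices)
    inputs:   dict name -> input data
    Returns a dict holding every input and every vertex output.
    """
    results = dict(inputs)
    # static_order() yields each node after all of its predecessors.
    for name in TopologicalSorter(edges).static_order():
        if name in results:
            continue  # an input node: nothing to compute
        upstream = [results[parent] for parent in edges.get(name, [])]
        results[name] = vertices[name](upstream)
    return results


# Two inputs, two parallel sort vertices, and one merge vertex as the output.
inputs = {"in0": [3, 1], "in1": [4, 2]}
vertices = {
    "sort0": lambda ups: sorted(ups[0]),
    "sort1": lambda ups: sorted(ups[0]),
    "merge": lambda ups: sorted(ups[0] + ups[1]),
}
edges = {"sort0": ["in0"], "sort1": ["in1"], "merge": ["sort0", "sort1"]}
results = run_dag(vertices, edges, inputs)
```

Because the graph is acyclic, a topological order always exists. A cycle would make `TopologicalSorter` raise `graphlib.CycleError`.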
10 HPC + Dryad Job Overview
Steps (numbered as in the slide diagram):
1 Submit Dryad Job.
2a A Dryad job starts 1 basic task assigning a node as the DGM.
2b The Dryad job also starts a set of parametric sweep tasks across the rest of the nodes as DVH.
3a Dryad Graph Manager starts/stops Dryad Vertices.
3b Dryad Vertices read and write files.
Diagram components: Application that calls Dryad APIs, DSC, HPC Head Node, HPC Compute Nodes, Dryad Graph Manager (DGM), Dryad Vertex Host (DVH).
11 Explaining DryadLINQ by example: extract Ulfar's favorite web pages from many web log files
LINQ query transformed into computation graph. Stage labels from the slide diagram: Input, Compute, Compute and resort, Compute and resort, Output.

    var logentries =
        from line in logs
        where !line.StartsWith("#")
        select new LogEntry(line);

    var user =
        from access in logentries
        where access.user.EndsWith(@"\ulfar")
        select access;

    var accesses =
        from access in user
        group access by access.page into pages
        select new UserPageCount("ulfar", pages.Key, pages.Count());

    var htmaccesses =
        from access in accesses
        where access.page.EndsWith(".htm")
        orderby access.count descending
        select access;
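For readers who want to trace the query on concrete data, the following is a single-machine Python analogue of the four query steps. It is only a sketch: the log format (`user page` separated by whitespace), the sample lines, and the names `parse`, `log_entries`, and `htm_accesses` are assumptions, and nothing here is distributed the way DryadLINQ would distribute it.

```python
from collections import Counter

# Hypothetical log lines: a comment header, then "user page" records.
logs = [
    "#Fields: user page",
    r"REDMOND\ulfar /index.htm",
    r"REDMOND\ulfar /index.htm",
    r"REDMOND\ulfar /papers/dryad.pdf",
    r"REDMOND\alice /index.htm",
    r"REDMOND\ulfar /about.htm",
]


def parse(line):
    """Split one log line into a record (stands in for new LogEntry(line))."""
    user, page = line.split()
    return {"user": user, "page": page}


# Step 1: drop comment lines and parse the rest.
log_entries = [parse(line) for line in logs if not line.startswith("#")]

# Step 2: keep only the accesses made by the user of interest.
user_entries = [e for e in log_entries if e["user"].endswith(r"\ulfar")]

# Step 3: group by page and count accesses per page.
page_counts = Counter(e["page"] for e in user_entries)

# Step 4: keep .htm pages, most visited first.
htm_accesses = sorted(
    ((page, count) for page, count in page_counts.items() if page.endswith(".htm")),
    key=lambda item: item[1],
    reverse=True,
)
```

Steps 1 and 2 touch each line independently, so they can run per partition. The grouping and ordering steps need records for the same page brought together, which is consistent with the "Compute and resort" labels on the slide.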
12 More on HPC + Dryad mechanics
Steps (numbered as in the slide diagram):
1 The application that calls Dryad APIs publishes to a share:
    1. binaries for Dryad job
    2. XML description of Dryad graph
2a A Dryad job starts 1 basic task assigning a node as the DGM.
2b The Dryad job also starts a set of parametric sweep tasks across the rest of the nodes as DVH.
3a DGM reads XML description of Dryad graph from share, calls DSC to locate files referenced in XML.
3b DVH loads binaries for this Dryad job from share, executes them according to commands from DGM.
Diagram components: Application that calls Dryad APIs, DSC, HPC Head Node, HPC Compute Nodes, Dryad Graph Manager (DGM), Dryad Vertex Host (DVH).
13 Takeaways
Microsoft will compete with Hadoop using HPC+Dryad in the Data Intensive Scalable Computing market on both the cluster and cloud.
Microsoft's DISC value proposition is about the overall stack, not about Dryad alone.
Dryad name subject to change before commercial release.
DryadLinq/DSC available as a CTP.
14 Questions?
15 © 2010 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.