Can Enterprise Storage Fix Hadoop? PRESENTATION TITLE GOES HERE John Webster Senior Partner Evaluator Group
|
|
- Arleen Phillips
- 6 years ago
- Views:
Transcription
1 Can Enterprise Storage Fix Hadoop? PRESENTATIN TITLE GES HERE John Webster Senior Partner Evaluator Group
2 Agenda What is the Internet Data Center and how is it different from Enterprise Data Center? How is the Apache Software Foundation (ASF) addressing the issues? What needs fixing from the perspective of Enterprise Storage vendors and the Enterprise Storage world? What are the proposed fixes? Can Hadoop fix Enterprise Storage? Can the Internet Data Center/Enterprise Data Center Chasm be Crossed? FYI: I will use vendor names and products as examples only no explicit or implied endorsements 4/13/
3 The Data Center Chasm Internet Data Center Enterpris e Data Center 4/13/
4 Defining the Data Center Chasm Internet Data Center Embraces open source Automates IT Comfortable with systems that run in failure mode Cheep and deep hardware inefficiency not an obvious issue More willing to build their own systems and self-support Manages storage (often JBD) from a systems perspective Enterprise Data Center Prefers proprietary but learning open source Approaches IT automation conservatively Doesn t get failure mode Hardware efficiency-conscious More willing to buy from proprietary vendors and deal with them for support Sees value in storage environment as a place for data and storage management 4 4/13/2014 4
5 4/13/
6 What has the ASF Fixed in HDFS? NameNode SPF NameNode active/standby failover support Snapshot Read-only Copy on Write (CW) included in latest v2 Beta (2.1.0) NFS support Support for NFSv3 in latest v2 Beta (2.1.0) DR Support Distributed Copy (distcp) 4/13/
7 What Needs to be Fixed the Enterprise Storage Vendor Perspective Hadoop NameNode is a single point of failure in V1. Manual failover in v2 (Beta). JobTracker is also a single point of failure For data integrity and protection, HDFS creates three full clone copies of data 3x the storage for each file slow and inefficient If all three copies are corrupted, you re still hosed (reload and start over) 60% of Enterprise Hadoop projects fail or are put on hold Steep learning curve six months is not uncommon for those that actually go from pilot to production No storage tiering Limited (if any) ways to respond to corporate security and data governance policies Difficult to move between cloud and data center Fundamentally a batch process Data in/out processes can take longer than the actual query process Inability to dis-aggregate storage from compute so that the two can be scaled independently Dearth of applications built on top Dearth of people available in the job market to run this beast and the ones that can go for big bucks.and more leading some analysts to believe that Big Data has entered the trough of disillusion 7
8 What Needs to be Fixed the Enterprise Storage Vendor Perspective Hadoop NameNode is a single point of failure JobTracker is also a single point of failure For data integrity and protection, HDFS creates three full clone copies of data 3x the storage for each file slow and inefficient If all three copies are corrupted, you re still hosed (reload and start over) 60% of Enterprise Hadoop projects fail or are put on hold Steep learning curve six months is not uncommon for those that actually go from pilot to production No storage tiering Limited (if any) ways to respond to corporate security and data governance policies Difficult to move between cloud and data center Fundamentally a batch process Data in/out processes can take longer than the actual query process Inability to dis-aggregate storage from compute so that the two can be scaled independently Dearth of applications built on top Dearth of people available in the job market to run this beast and the ones that can go for big bucks.and more leading some analysts to believe that Big Data has entered the trough of disillusion 8
9 Storage in Shared Nothing Network Layer 1-10 Gb Ethernet B8GMR3 nly the Ethernet network is shared 1 Link 2 3 Link 4 5 Link 6 7 Link 8 Pwr Console Active Active Active Active Active Compute Layer Commodity Servers Storage Layer 6-12 disks in each server typically JBD C N T R L DAS N D E 1 N D E 2 DAS DAS DAS DAS N D E 3 Scale to thousands of nodes N D E n In Hadoop, Control = Name Node; Node 1,2 = Data Node 9
10 Hadoop External Storage EMC Isilon Example Shared storage replaces nodelevel DAS HDFS implemented as over the wire protocol on nefs Isilon cluster nodes emulate NameNodes and DataNodes NameNode SPF eliminated Decoupled storage and compute layers Data services, data protection, and DR by nefs Analytics on data in place i.e. minimal if any data moving 4/13/
11 Hadoop External Storage NetApp Example MAPREDUCE Hadoop 10GbE Namenode Secondary Namenode 10GbE Datanodes/ Tasktracke rs FAS Series Data NTAP NFS Metadata Store E-Series Data Stores 4 separate, sharednothing partitions per chassis 6Gb SAS, Direct Connect Source: NetApp Preserves shared nothing architecture and HDFS Decouple compute and storage Hardware RAID: reduction in copies from 3 to 2 NameNode metadata in separate array for faster NameNode recovery DataNode drive failures do not blacklist the DataNode Apply built-in enterprise data and storage management functionality to Hadoop data 4/13/
12 Shared Storage as Secondary Storage Network Layer B8GMR3 1 Link 2 3 Link 4 5 Link 6 7 Link 8 Active Active Active Active Pwr Active Console Compute Layer C N T R L N D E N D E N D E N D E Primary Storage Layer 1 n Secondar y Storage Layer SAN/NAS Secondary Storage Layer Data mirrored or migrated from primary to secondary storage Storage services also live here 12
13 Progression of Yahoo! Source: Yahoo! 4/13/
14 Can Hadoop Fix Enterprise Storage? 4/13/
15 Can Hadoop Fix Enterprise Storage? Modern Enterprise Storage Issues Inflexible and non-elastic Siloed Proprietary pposite of Cheap and deep Bound to three-year product life cycles Developed for the traditional enterprise data center environment Doesn t offer performance at scale and low cost and all at the same time 4/13/
16 Is Hadoop a new Storage Platform? No Yes It s a distributed computing platform for analytics HDFS - Embedded, distributed file system (like scale-out NAS) Data protection and management built-in (like Enterprise Storage) Storage performance at scale and low cost and with native intelligence and all at the same time Growing use case as data repository for existing enterprise BI and Data Warehousing apps the Data Lake 16
17 What Does the Enterprise Want from Big Data? If we could harness all of our data, we would be a much stronger business. * * From CompTIA survey where two thirds of respondents either agreed or strongly agreed with the statement 4/13/
18 Can the Chasm Be Crossed? Internet Data Center Enterpris e Data Center 4/13/
19 Is a new computing paradigm lurking behind the Hadoop hype? 19
20 Summary Hadoop is crossing the chasm A more pragmatic approach to integrating shared storage with Hadoop is emerging The Hadoop Holy Grail: perational (transactional) processing with real time analytics 20
BUSINESS DATA LAKE FADI FAKHOURI, SR. SYSTEMS ENGINEER, ISILON SPECIALIST. Copyright 2016 EMC Corporation. All rights reserved.
BUSINESS DATA LAKE FADI FAKHOURI, SR. SYSTEMS ENGINEER, ISILON SPECIALIST 1 UNSTRUCTURED DATA GROWTH 75% 78% 80% 2015 71 EB 2016 106 EB 2017 133 EB Total Capacity Shipped, Worldwide % of Unstructured Data
More informationMixApart: Decoupled Analytics for Shared Storage Systems. Madalin Mihailescu, Gokul Soundararajan, Cristiana Amza University of Toronto and NetApp
MixApart: Decoupled Analytics for Shared Storage Systems Madalin Mihailescu, Gokul Soundararajan, Cristiana Amza University of Toronto and NetApp Hadoop Pig, Hive Hadoop + Enterprise storage?! Shared storage
More information5 Fundamental Strategies for Building a Data-centered Data Center
5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse
More informationNetApp Solutions for Hadoop Reference Architecture
White Paper NetApp Solutions for Hadoop Reference Architecture Gus Horn, Iyer Venkatesan, NetApp April 2014 WP-7196 Abstract Today s businesses need to store, control, and analyze the unprecedented complexity,
More informationTITLE: PRE-REQUISITE THEORY. 1. Introduction to Hadoop. 2. Cluster. Implement sort algorithm and run it using HADOOP
TITLE: Implement sort algorithm and run it using HADOOP PRE-REQUISITE Preliminary knowledge of clusters and overview of Hadoop and its basic functionality. THEORY 1. Introduction to Hadoop The Apache Hadoop
More informationTaming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems
1 Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems The Defacto Choice For Convergence 2 ABSTRACT & SPEAKER BIO Dealing with enormous data growth is a key challenge for
More informationLecture 11 Hadoop & Spark
Lecture 11 Hadoop & Spark Dr. Wilson Rivera ICOM 6025: High Performance Computing Electrical and Computer Engineering Department University of Puerto Rico Outline Distributed File Systems Hadoop Ecosystem
More informationBIG DATA STRATEGY FOR TODAY AND TOMORROW RYAN SAYRE, EMC ISILON CTO-AT-LARGE, EMEA
BIG DATA STRATEGY FOR TODAY AND TOMORROW RYAN SAYRE, EMC ISILON CTO-AT-LARGE, EMEA 1 Unstructured Data Growth 67% 74% 80% 2013 2015 2017 37 EB 71 EB 133 EB Total Capacity Shipped, Worldwide Unstructured
More informationTECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF EMC ISILON ONEFS 7.1.1
TECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF EMC ISILON ONEFS 7.1.1 ABSTRACT This introductory white paper provides a technical overview of the new and improved enterprise grade features introduced
More informationInternational Journal of Advance Engineering and Research Development. A Study: Hadoop Framework
Scientific Journal of Impact Factor (SJIF): e-issn (O): 2348- International Journal of Advance Engineering and Research Development Volume 3, Issue 2, February -2016 A Study: Hadoop Framework Devateja
More informationRethink Storage: The Next Generation Of Scale- Out NAS
Rethink Storage: The Next Generation Of Scale- Out NAS Norbert Funke EMEA Product Marketing Manager, EMC Isilon 1 Agenda EMC Isilon Momentum What Is The Difference? EMC Isilon Versus Traditional Architectures
More informationCloudera Exam CCA-410 Cloudera Certified Administrator for Apache Hadoop (CCAH) Version: 7.5 [ Total Questions: 97 ]
s@lm@n Cloudera Exam CCA-410 Cloudera Certified Administrator for Apache Hadoop (CCAH) Version: 7.5 [ Total Questions: 97 ] Question No : 1 Which two updates occur when a client application opens a stream
More informationApache Hadoop 3. Balazs Gaspar Sales Engineer CEE & CIS Cloudera, Inc. All rights reserved.
Apache Hadoop 3 Balazs Gaspar Sales Engineer CEE & CIS balazs@cloudera.com 1 We believe data can make what is impossible today, possible tomorrow 2 We empower people to transform complex data into clear
More informationAutomatic-Hot HA for HDFS NameNode Konstantin V Shvachko Ari Flink Timothy Coulter EBay Cisco Aisle Five. November 11, 2011
Automatic-Hot HA for HDFS NameNode Konstantin V Shvachko Ari Flink Timothy Coulter EBay Cisco Aisle Five November 11, 2011 About Authors Konstantin Shvachko Hadoop Architect, ebay; Hadoop Committer Ari
More informationNext Generation Storage for The Software-Defned World
` Next Generation Storage for The Software-Defned World John Hofer Solution Architect Red Hat, Inc. BUSINESS PAINS DEMAND NEW MODELS CLOUD ARCHITECTURES PROPRIETARY/TRADITIONAL ARCHITECTURES High up-front
More informationHadoop/MapReduce Computing Paradigm
Hadoop/Reduce Computing Paradigm 1 Large-Scale Data Analytics Reduce computing paradigm (E.g., Hadoop) vs. Traditional database systems vs. Database Many enterprises are turning to Hadoop Especially applications
More informationTECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF DELL EMC ISILON ONEFS 8.0
WHITE PAPER TECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF DELL EMC ISILON ONEFS 8.0 Abstract This introductory white paper provides a technical overview of the new and improved enterprise grade features
More informationNAS for Server Virtualization Dennis Chapman Senior Technical Director NetApp
NAS for Server Virtualization Dennis Chapman Senior Technical Director NetApp Agenda The Landscape has Changed New Customer Requirements The Market has Begun to Move Comparing Performance Results Storage
More informationBIG DATA READY WITH ISILON JEUDI 19 NOVEMBRE Bertrand OUNANIAN: Advisory System Engineer
BIG DATA READY WITH ISILON JEUDI 19 NOVEMBRE 2015 Bertrand OUNANIAN: Advisory System Engineer Unstructured Data Growth Total Capacity Shipped Worldwide % of Unstructured Data 67% 74% 80% 2013 37 EB 2015
More informationMapReduce. U of Toronto, 2014
MapReduce U of Toronto, 2014 http://www.google.org/flutrends/ca/ (2012) Average Searches Per Day: 5,134,000,000 2 Motivation Process lots of data Google processed about 24 petabytes of data per day in
More informationSoluzioni integrate con vsphere La virtualizzazione abilita il percorso evolutivo di innovazione dell'it
Soluzioni integrate con vsphere La virtualizzazione abilita il percorso evolutivo di innovazione dell'it Matteo Montuori Systems Engineer, VMware mmontuori@vmware.com 2010 VMware Inc. All rights reserved
More informationCloud Bursting: Top Reasons Your Organization will Benefit. Scott Jeschonek Director of Cloud Products Avere Systems
Cloud Bursting: Top Reasons Your Organization will Benefit Scott Jeschonek Director of Cloud Products Avere Systems Agenda Define Cloud Bursting Benefits of using Cloud Bursting Identify Cloud Bursting
More informationHadoop File System S L I D E S M O D I F I E D F R O M P R E S E N T A T I O N B Y B. R A M A M U R T H Y 11/15/2017
Hadoop File System 1 S L I D E S M O D I F I E D F R O M P R E S E N T A T I O N B Y B. R A M A M U R T H Y Moving Computation is Cheaper than Moving Data Motivation: Big Data! What is BigData? - Google
More informationDell EMC Isilon All-Flash
Enterprise Strategy Group Getting to the bigger truth. ESG Lab Validation Dell EMC Isilon All-Flash Scale-out All-flash Storage for Demanding Unstructured Data Workloads By Tony Palmer, Senior Lab Analyst
More informationQuobyte The Data Center File System QUOBYTE INC.
Quobyte The Data Center File System QUOBYTE INC. The Quobyte Data Center File System All Workloads Consolidate all application silos into a unified highperformance file, block, and object storage (POSIX
More informationMicrosoft Analytics Platform System (APS)
Microsoft Analytics Platform System (APS) The turnkey modern data warehouse appliance Matt Usher, Senior Program Manager @ Microsoft About.me @two_under Senior Program Manager 9 years at Microsoft Visual
More informationMicrosoft SQL Server HA and DR with DVX
Microsoft SQL Server HA and DR with DVX 385 Moffett Park Dr. Sunnyvale, CA 94089 844-478-8349 www.datrium.com Technical Report Introduction A Datrium DVX solution allows you to start small and scale out.
More informationA Review Approach for Big Data and Hadoop Technology
International Journal of Modern Trends in Engineering and Research www.ijmter.com e-issn No.:2349-9745, Date: 2-4 July, 2015 A Review Approach for Big Data and Hadoop Technology Prof. Ghanshyam Dhomse
More informationTopics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples
Hadoop Introduction 1 Topics Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples 2 Big Data Analytics What is Big Data?
More informationCPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University
CPSC 426/526 Cloud Computing Ennan Zhai Computer Science Department Yale University Recall: Lec-7 In the lec-7, I talked about: - P2P vs Enterprise control - Firewall - NATs - Software defined network
More informationBigData and Map Reduce VITMAC03
BigData and Map Reduce VITMAC03 1 Motivation Process lots of data Google processed about 24 petabytes of data per day in 2009. A single machine cannot serve all the data You need a distributed system to
More informationHADOOP 3.0 is here! Dr. Sandeep Deshmukh Sadepach Labs Pvt. Ltd. - Let us grow together!
HADOOP 3.0 is here! Dr. Sandeep Deshmukh sandeep@sadepach.com Sadepach Labs Pvt. Ltd. - Let us grow together! About me BE from VNIT Nagpur, MTech+PhD from IIT Bombay Worked with Persistent Systems - Life
More informationHDFS What is New and Futures
HDFS What is New and Futures Sanjay Radia, Founder, Architect Suresh Srinivas, Founder, Architect Hortonworks Inc. Page 1 About me Founder, Architect, Hortonworks Part of the Hadoop team at Yahoo! since
More informationVxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS
VxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS VMware customers trust their infrastructure to vsan #1 Leading SDS Vendor >10,000 >100 83% vsan Customers Countries Deployed Critical Apps
More informationModern Data Warehouse The New Approach to Azure BI
Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics
More informationEMC ISILON HARDWARE PLATFORM
EMC ISILON HARDWARE PLATFORM Three flexible product lines that can be combined in a single file system tailored to specific business needs. S-SERIES Purpose-built for highly transactional & IOPSintensive
More informationMixing and matching virtual and physical HPC clusters. Paolo Anedda
Mixing and matching virtual and physical HPC clusters Paolo Anedda paolo.anedda@crs4.it HPC 2010 - Cetraro 22/06/2010 1 Outline Introduction Scalability Issues System architecture Conclusions & Future
More informationVendor: Cloudera. Exam Code: CCA-505. Exam Name: Cloudera Certified Administrator for Apache Hadoop (CCAH) CDH5 Upgrade Exam.
Vendor: Cloudera Exam Code: CCA-505 Exam Name: Cloudera Certified Administrator for Apache Hadoop (CCAH) CDH5 Upgrade Exam Version: Demo QUESTION 1 You have installed a cluster running HDFS and MapReduce
More informationTHE EMC ISILON STORY. Big Data In The Enterprise. Deya Bassiouni Isilon Regional Sales Manager Emerging Africa, Egypt & Lebanon.
THE EMC ISILON STORY Big Data In The Enterprise Deya Bassiouni Isilon Regional Sales Manager Emerging Africa, Egypt & Lebanon August, 2012 1 Big Data In The Enterprise Isilon Overview Isilon Technology
More informationPart 1: Indexes for Big Data
JethroData Making Interactive BI for Big Data a Reality Technical White Paper This white paper explains how JethroData can help you achieve a truly interactive interactive response time for BI on big data,
More informationHow Apache Hadoop Complements Existing BI Systems. Dr. Amr Awadallah Founder, CTO Cloudera,
How Apache Hadoop Complements Existing BI Systems Dr. Amr Awadallah Founder, CTO Cloudera, Inc. Twitter: @awadallah, @cloudera 2 The Problems with Current Data Systems BI Reports + Interactive Apps RDBMS
More informationA Multilevel Secure MapReduce Framework for Cross-Domain Information Sharing in the Cloud
Calhoun: The NPS Institutional Archive Faculty and Researcher Publications Faculty and Researcher Publications 2013-03 A Multilevel Secure MapReduce Framework for Cross-Domain Information Sharing in the
More informationHadoop and HDFS Overview. Madhu Ankam
Hadoop and HDFS Overview Madhu Ankam Why Hadoop We are gathering more data than ever Examples of data : Server logs Web logs Financial transactions Analytics Emails and text messages Social media like
More information2014 年 3 月 13 日星期四. From Big Data to Big Value Infrastructure Needs and Huawei Best Practice
2014 年 3 月 13 日星期四 From Big Data to Big Value Infrastructure Needs and Huawei Best Practice Data-driven insight Making better, more informed decisions, faster Raw Data Capture Store Process Insight 1 Data
More informationSTORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp
STORAGE CONSOLIDATION WITH IP STORAGE David Dale, NetApp SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individuals may use this material in
More informationWe are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info
We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info START DATE : TIMINGS : DURATION : TYPE OF BATCH : FEE : FACULTY NAME : LAB TIMINGS : PH NO: 9963799240, 040-40025423
More informationGain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved.
Gain Insights From Unstructured Data Using Pivotal HD 1 Traditional Enterprise Analytics Process 2 The Fundamental Paradigm Shift Internet age and exploding data growth Enterprises leverage new data sources
More informationSOLUTION BRIEF Fulfill the promise of the cloud
SOLUTION BRIEF Fulfill the promise of the cloud NetApp Solutions for Amazon Web Services Fulfill the promise of the cloud NetApp Cloud Volumes Service for AWS: Move and manage more workloads faster Many
More informationThe Best Storage for Virtualized Environments
The Best Storage for Virtualized Environments Paul Kessler Asia Pacific Solutions Marketing Alliances, NetApp Nov.4,2008 The Best Storage for Virtualized Environments Paul Kessler Solutions Marketing &
More informationBuilding a Multi-protocol, analytics-enabled, Data Lake with Isilon
Building a Multi-protocol, analytics-enabled, Data Lake with Isilon Ahmad Muammar @muammara #EMCForum 1 Trends 2 3 Big Data X in T 4 Unstructured Data Growth 67% 74% 80% 2013 2015 2017 37 EB 71 EB 133
More informationBoost your data protection with NetApp + Veeam. Schahin Golshani Technical Partner Enablement Manager, MENA
Boost your data protection with NetApp + Veeam Schahin Golshani Technical Partner Enablement Manager, MENA NetApp Product Strategy Market-leading innovations, that are NetApp Confidential Limited Use 3
More informationBest Practices for Deploying Hadoop Workloads on HCI Powered by vsan
Best Practices for Deploying Hadoop Workloads on HCI Powered by vsan Chen Wei, ware, Inc. Paudie ORiordan, ware, Inc. #vmworld HCI2038BU #HCI2038BU Disclaimer This presentation may contain product features
More informationStorage for HPC, HPDA and Machine Learning (ML)
for HPC, HPDA and Machine Learning (ML) Frank Kraemer, IBM Systems Architect mailto:kraemerf@de.ibm.com IBM Data Management for Autonomous Driving (AD) significantly increase development efficiency by
More informationELASTIC DATA PLATFORM
SERVICE OVERVIEW ELASTIC DATA PLATFORM A scalable and efficient approach to provisioning analytics sandboxes with a data lake ESSENTIALS Powerful: provide read-only data to anyone in the enterprise while
More informationDisclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme
STO1794BE Evolution of vsan Vijay Ramachandran Sr.Director, Product Management Christos Karamanolis VMware Fellow & CTO of Storage & Availability #VMworld #STO1794BE Disclaimer This presentation may contain
More informationSolution Brief. Bridging the Infrastructure Gap for Unstructured Data with Object Storage. 89 Fifth Avenue, 7th Floor. New York, NY 10003
89 Fifth Avenue, 7th Floor New York, NY 10003 www.theedison.com @EdisonGroupInc 212.367.7400 Solution Brief Bridging the Infrastructure Gap for Unstructured Data with Object Storage Printed in the United
More informationFlexPod. The Journey to the Cloud. Technical Presentation. Presented Jointly by NetApp and Cisco
FlexPod The Journey to the Cloud Technical Presentation Presented Jointly by NetApp and Cisco Agenda Alliance Highlights Introducing FlexPod One Shared Vision and Journey FlexPod for the Oracle base base
More informationDELL EMC ISILON SCALE-OUT NAS PRODUCT FAMILY Unstructured data storage made simple
SCALE-OUT NAS PRODUCT FAMILY Unstructured data storage made simple ESSENTIALS Simple storage management designed for ease of use Massive scalability of capacity and performance Unmatched efficiency to
More informationMI-PDB, MIE-PDB: Advanced Database Systems
MI-PDB, MIE-PDB: Advanced Database Systems http://www.ksi.mff.cuni.cz/~svoboda/courses/2015-2-mie-pdb/ Lecture 10: MapReduce, Hadoop 26. 4. 2016 Lecturer: Martin Svoboda svoboda@ksi.mff.cuni.cz Author:
More informationHDP Security Overview
3 HDP Security Overview Date of Publish: 2018-07-15 http://docs.hortonworks.com Contents HDP Security Overview...3 Understanding Data Lake Security... 3 What's New in This Release: Knox... 5 What's New
More informationHDP Security Overview
3 HDP Security Overview Date of Publish: 2018-07-15 http://docs.hortonworks.com Contents HDP Security Overview...3 Understanding Data Lake Security... 3 What's New in This Release: Knox... 5 What's New
More informationEsgynDB Enterprise 2.0 Platform Reference Architecture
EsgynDB Enterprise 2.0 Platform Reference Architecture This document outlines a Platform Reference Architecture for EsgynDB Enterprise, built on Apache Trafodion (Incubating) implementation with licensed
More informationDistributed File Systems II
Distributed File Systems II To do q Very-large scale: Google FS, Hadoop FS, BigTable q Next time: Naming things GFS A radically new environment NFS, etc. Independence Small Scale Variety of workloads Cooperation
More informationDELL EMC ISILON SCALE-OUT NAS PRODUCT FAMILY
DATA SHEET DELL EMC ISILON SCALE-OUT NAS PRODUCT FAMILY Unstructured data storage made simple ESSENTIALS Simple storage management designed for ease of use Massive scalability of capacity and performance
More informationHyperconverged Infrastructure: Cost-effectively Simplifying IT to Improve Business Agility at Scale
Enterprise Strategy Group Getting to the bigger truth. White Paper Hyperconverged Infrastructure: Cost-effectively Simplifying IT to Improve Business Agility at Scale By Mike Leone, ESG Senior Analyst;
More informationConfiguring Ports for Big Data Management, Data Integration Hub, Enterprise Information Catalog, and Intelligent Data Lake 10.2
Configuring s for Big Data Management, Data Integration Hub, Enterprise Information Catalog, and Intelligent Data Lake 10.2 Copyright Informatica LLC 2016, 2017. Informatica, the Informatica logo, Big
More informationBig Data Analytics. Izabela Moise, Evangelos Pournaras, Dirk Helbing
Big Data Analytics Izabela Moise, Evangelos Pournaras, Dirk Helbing Izabela Moise, Evangelos Pournaras, Dirk Helbing 1 Big Data "The world is crazy. But at least it s getting regular analysis." Izabela
More informationSTORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp
STORAGE CONSOLIDATION WITH IP STORAGE David Dale, NetApp SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individuals may use this material in
More informationDistributed Systems. CS422/522 Lecture17 17 November 2014
Distributed Systems CS422/522 Lecture17 17 November 2014 Lecture Outline Introduction Hadoop Chord What s a distributed system? What s a distributed system? A distributed system is a collection of loosely
More informationHADOOP FRAMEWORK FOR BIG DATA
HADOOP FRAMEWORK FOR BIG DATA Mr K. Srinivas Babu 1,Dr K. Rameshwaraiah 2 1 Research Scholar S V University, Tirupathi 2 Professor and Head NNRESGI, Hyderabad Abstract - Data has to be stored for further
More informationUsing Self-Protecting Storage to Lower Backup TCO
Economic Insight Paper Using Self-Protecting Storage to Lower Backup TCO A TCO comparison of NetApp s integrated data protection solution vs. a traditional backup to an external PBBA By John Webster, Sr.
More informationBig Data and Object Storage
Big Data and Object Storage or where to store the cold and small data? Sven Bauernfeind Computacenter AG & Co. ohg, Consultancy Germany 28.02.2018 Munich Volume, Variety & Velocity + Analytics Velocity
More informationFrom Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019
From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways
More informationA brief history on Hadoop
Hadoop Basics A brief history on Hadoop 2003 - Google launches project Nutch to handle billions of searches and indexing millions of web pages. Oct 2003 - Google releases papers with GFS (Google File System)
More information1Z Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions
1Z0-449 Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions Table of Contents Introduction to 1Z0-449 Exam on Oracle Big Data 2017 Implementation Essentials... 2 Oracle 1Z0-449
More informationAt-Scale Data Centers & Demand for New Architectures
Allen Samuels At-Scale Data Centers & Demand for New Architectures Software Architect, Software and Systems Solutions June 4, 2015 1 Forward-Looking Statements During our meeting today we may make forward-looking
More informationBig Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018
Big Data com Hadoop Impala, Hive e Spark VIII Sessão - SQL Bahia 03/03/2018 Diógenes Pires Connect with PASS Sign up for a free membership today at: pass.org #sqlpass Internet Live http://www.internetlivestats.com/
More informationDecentralized Distributed Storage System for Big Data
Decentralized Distributed Storage System for Big Presenter: Wei Xie -Intensive Scalable Computing Laboratory(DISCL) Computer Science Department Texas Tech University Outline Trends in Big and Cloud Storage
More informationEvaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization
Evaluating Cloud Storage Strategies James Bottomley; CTO, Server Virtualization Introduction to Storage Attachments: - Local (Direct cheap) SAS, SATA - Remote (SAN, NAS expensive) FC net Types - Block
More informationIBM Spectrum Scale vs EMC Isilon for IBM Spectrum Protect Workloads
89 Fifth Avenue, 7th Floor New York, NY 10003 www.theedison.com @EdisonGroupInc 212.367.7400 IBM Spectrum Scale vs EMC Isilon for IBM Spectrum Protect Workloads A Competitive Test and Evaluation Report
More informationRedefining Enterprise Storage: EMC Storage Strategy
Redefining Enterprise Storage: EMC Storage Strategy 1 BILLIONS OF USERS MILLIONS OF APPS Mobile Cloud Big Data Social Mobile Devices HUNDREDS OF MILLIONS OF USERS TENS OF THOUSANDS OF APPS LAN/Internet
More informationSoftware Defined Storage
Software Defined Storage Riyadh Forum 2016 By: Ayman El Marazky Sr. System Engineer software defined solutions The CIO Dilemma IT Budget $ By 2020, the digital universe of data will go to 44 ZB Modern
More informationData Storage Infrastructure at Facebook
Data Storage Infrastructure at Facebook Spring 2018 Cleveland State University CIS 601 Presentation Yi Dong Instructor: Dr. Chung Outline Strategy of data storage, processing, and log collection Data flow
More informationHadoop. copyright 2011 Trainologic LTD
Hadoop Hadoop is a framework for processing large amounts of data in a distributed manner. It can scale up to thousands of machines. It provides high-availability. Provides map-reduce functionality. Hides
More informationDisclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme
VIRT1351BE New Architectures for Virtualizing Spark and Big Data Workloads on vsphere Justin Murray Mohan Potheri VMworld 2017 Content: Not for publication #VMworld #VIRT1351BE Disclaimer This presentation
More informationWhat is the maximum file size you have dealt so far? Movies/Files/Streaming video that you have used? What have you observed?
Simple to start What is the maximum file size you have dealt so far? Movies/Files/Streaming video that you have used? What have you observed? What is the maximum download speed you get? Simple computation
More informationA Survey on Big Data
A Survey on Big Data D.Prudhvi 1, D.Jaswitha 2, B. Mounika 3, Monika Bagal 4 1 2 3 4 B.Tech Final Year, CSE, Dadi Institute of Engineering & Technology,Andhra Pradesh,INDIA ---------------------------------------------------------------------***---------------------------------------------------------------------
More informationCloud Computing. Hwajung Lee. Key Reference: Prof. Jong-Moon Chung s Lecture Notes at Yonsei University
Cloud Computing Hwajung Lee Key Reference: Prof. Jong-Moon Chung s Lecture Notes at Yonsei University Cloud Computing Cloud Introduction Cloud Service Model Big Data Hadoop MapReduce HDFS (Hadoop Distributed
More informationSGI Hadoop Based on Intel Xeon Processor E5 Family. Getting Started Guide
SGI Hadoop Based on Intel Xeon Processor E5 Family Getting Started Guide 007-5875-001 COPYRIGHT 2013 Silicon Graphics International Corp. All rights reserved; provided portions may be copyright in third
More informationMultiprotocol Locking and Lock Failover in OneFS Aravind Srinivasan EMC, Isilon Storage Division
Multiprotocol Locking and Lock Failover in OneFS Aravind Srinivasan EMC, Isilon Storage Division 2013 Storage Developer Conference. Insert Your Company Name. All Rights Reserved. Agenda Overview OneFS
More informationEMC STORAGE STRATEGY. Copyright 2015 EMC Corporation. All rights reserved.
EMC STORAGE STRATEGY 1 CREATING A MAJOR PAIN POINT FOR CIO S(BI-MODAL IT) MANAGE RISK INVEST REDUCE COST WORKLOAD CHARACTERISTICS Performance Traditional Apps Sequential Stability Reliability Greater Impact
More informationCONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM
CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications
More informationKubernetes Integration with Virtuozzo Storage
Kubernetes Integration with Virtuozzo Storage A Technical OCTOBER, 2017 2017 Virtuozzo. All rights reserved. 1 Application Container Storage Application containers appear to be the perfect tool for supporting
More informationStarWind Virtual SAN Free
#1 HyperConverged Appliance for SMB and ROBO StarWind Virtual SAN Free Value Proposition For virtualization admins, IT geeks, bloggers, students or those who look to build and maintain virtualization infrastructure
More informationXcellis Technical Overview: A deep dive into the latest hardware designed for StorNext 5
TECHNOLOGY BRIEF Xcellis Technical Overview: A deep dive into the latest hardware designed for StorNext 5 ABSTRACT Xcellis represents the culmination of over 15 years of file system and data management
More informationBig Data with Hadoop Ecosystem
Diógenes Pires Big Data with Hadoop Ecosystem Hands-on (HBase, MySql and Hive + Power BI) Internet Live http://www.internetlivestats.com/ Introduction Business Intelligence Business Intelligence Process
More informationCS370 Operating Systems
CS370 Operating Systems Colorado State University Yashwant K Malaiya Spring 2018 Lecture 24 Mass Storage, HDFS/Hadoop Slides based on Text by Silberschatz, Galvin, Gagne Various sources 1 1 FAQ What 2
More informationEmerging Technologies for HPC Storage
Emerging Technologies for HPC Storage Dr. Wolfgang Mertz CTO EMEA Unstructured Data Solutions June 2018 The very definition of HPC is expanding Blazing Fast Speed Accessibility and flexibility 2 Traditional
More informationThin Provisioning. ESG Storage Innovations Series Focus on 3PAR. By Tony Asaro Senior Analyst April 2006
Thin Provisioning ESG Storage Innovations Series Focus on 3PAR By Tony Asaro Senior Analyst April 2006 Copyright 2006, Enterprise Strategy Group, Inc. All Rights Reserved Thin Provisioning ESG believes
More informationMapR Enterprise Hadoop
2014 MapR Technologies 2014 MapR Technologies 1 MapR Enterprise Hadoop Top Ranked Cloud Leaders 500+ Customers 2014 MapR Technologies 2 Key MapR Advantage Partners Business Services APPLICATIONS & OS ANALYTICS
More information