Oracle Big Data Fundamentals Ed 2

Similar documents
Oracle Big Data Fundamentals Ed 1

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours

1Z Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions

Oracle Big Data Fundamentals

Oracle Big Data. A NA LYT ICS A ND MA NAG E MENT.

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a)

Innovatus Technologies

Oracle Big Data Fundamentals

Big Data Analytics using Apache Hadoop and Spark with Scala

Big Data Hadoop Stack

Introduction to the Hadoop Ecosystem - 1

The Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou

Securing the Oracle BDA - 1

Quick Deployment Step-by-step instructions to deploy Oracle Big Data Lite Virtual Machine

CERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI)

Big Data Syllabus. Understanding big data and Hadoop. Limitations and Solutions of existing Data Analytics Architecture

Big Data Architect.

Hadoop. Course Duration: 25 days (60 hours duration). Bigdata Fundamentals. Day1: (2hours)

Blended Learning Outline: Cloudera Data Analyst Training (171219a)

Certified Big Data and Hadoop Course Curriculum

Introduction to the Oracle Big Data Appliance - 1

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training::

Big Data Hadoop Course Content

Hadoop. Introduction / Overview

Configuring and Deploying Hadoop Cluster Deployment Templates

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::

Certified Big Data Hadoop and Spark Scala Course Curriculum

Big Data with Hadoop Ecosystem

Oracle BDA: Working With Mammoth - 1

Question: 1 You need to place the results of a PigLatin script into an HDFS output directory. What is the correct syntax in Apache Pig?

We are ready to serve Latest Testing Trends, Are you ready to learn?? New Batches Info

MapR Enterprise Hadoop

Hadoop Development Introduction

Big Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018

Techno Expert Solutions An institute for specialized studies!

HADOOP COURSE CONTENT (HADOOP-1.X, 2.X & 3.X) (Development, Administration & REAL TIME Projects Implementation)

Quick Deployment Step- by- step instructions to deploy Oracle Big Data Lite Virtual Machine

Hadoop & Big Data Analytics Complete Practical & Real-time Training

Gain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved.

Hadoop course content

Delving Deep into Hadoop Course Contents Introduction to Hadoop and Architecture

Hadoop Overview. Lars George Director EMEA Services

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect

Oracle Big Data Appliance

This is a brief tutorial that explains how to make use of Sqoop in Hadoop ecosystem.

Hadoop. Introduction to BIGDATA and HADOOP

Oracle 1Z Oracle Big Data 2017 Implementation Essentials.

Expert Lecture plan proposal Hadoop& itsapplication

Apache Spark and Scala Certification Training

Oracle Big Data Connectors

SOLUTION TRACK Finding the Needle in a Big Data Innovator & Problem Solver Cloudera

SQT03 Big Data and Hadoop with Azure HDInsight Andrew Brust. Senior Director, Technical Product Marketing and Evangelism

BIG DATA COURSE CONTENT

Hadoop An Overview. - Socrates CCDH

Introduction to BigData, Hadoop:-

Cloud Computing & Visualization

Oracle Data Integrator 12c: Integration and Administration

1z0-449.exam. Number: 1z0-449 Passing Score: 800 Time Limit: 120 min File Version: Oracle. 1z0-449

MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS

Hadoop 2.x Core: YARN, Tez, and Spark. Hortonworks Inc All Rights Reserved

Oracle Big Data Appliance

Exam Questions CCA-505

Oracle Big Data Appliance

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context

Oracle Data Integrator 12c: Integration and Administration

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Enabling Universal Authorization Models using Sentry

Spatial Analytics Built for Big Data Platforms

Oracle Public Cloud Machine

Building an Integrated Big Data & Analytics Infrastructure September 25, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle

Oracle Cloud Using Oracle Big Data Cloud Service. Release

BIG DATA ANALYTICS USING HADOOP TOOLS APACHE HIVE VS APACHE PIG

Introduction into Big Data analytics Lecture 3 Hadoop ecosystem. Janusz Szwabiński

<Insert Picture Here> Introduction to Big Data Technology

DATA SCIENCE USING SPARK: AN INTRODUCTION

Hortonworks PR PowerCenter Data Integration 9.x Administrator Specialist.

Microsoft Big Data and Hadoop

Things Every Oracle DBA Needs to Know about the Hadoop Ecosystem. Zohar Elkayam

Exam Questions 1z0-449

Introduction to Big-Data

Cloudera Manager Quick Start Guide

Hortonworks Data Platform

Cloudera Introduction

Hadoop-PR Hortonworks Certified Apache Hadoop 2.0 Developer (Pig and Hive Developer)

Stages of Data Processing

Modern ETL Tools for Cloud and Big Data. Ken Beutler, Principal Product Manager, Progress Michael Rainey, Technical Advisor, Gluent Inc.

1 Big Data Hadoop. 1. Introduction About this Course About Big Data Course Logistics Introductions

How Apache Hadoop Complements Existing BI Systems. Dr. Amr Awadallah Founder, CTO Cloudera,

Oracle GoldenGate for Big Data

An exceedingly high-level overview of ambient noise processing with Spark and Hadoop

Hortonworks University. Education Catalog 2018 Q1

Cloudera Introduction

Oracle Big Data Connectors


Exam Questions

Oracle Big Data Science

A complete Hadoop Development Training Program.

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk

Data Analytics Job Guarantee Program

Big Data and Hadoop. Course Curriculum: Your 10 Module Learning Plan. About Edureka

Transcription:

Oracle University Contact Us: 1.800.529.0165 Oracle Big Data Fundamentals Ed 2 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, you learn about big data, the technologies used in processing big data and Oracle's solution to handle big data. You also learn to use Oracle Big Data Appliance to process big data, and obtain a hands-on experience in using Oracle Big Data Lite VM. You identify how to acquire the raw data from a variety of sources, and learn to use HDFS and Oracle NoSQL Database to store the data. You learn about data integration options available in Oracle Big Data. These include Oracle Big Data Connectors to move data to and from Oracle Database, Oracle Data Integrator and Oracle GoldenGate for Big Data which provide integration and synchronization capabilities for data unification of relational and Hadoop data, and Oracle Big Data SQL, which enables dynamic, integrated access for all of your data big data, whether it is stored in HDFS, NoSQL, or Oracle Database. Finally, you learn how to analyze your big data using Oracle Big Data SQL, Oracle Advance Analytics, and Oracle Big Data Spatial and Graph. Learn To: Define Big Data. Describe Oracle's Integrated Big Data Solution and its components. Define Cloudera's distribution of Hadoop and its core components and the Hadoop ecosystem. Use the Hadoop Distributed File System (HDFS). Acquire big data using the Command Line Interface, Flume, and Oracle NoSQL Database. Process big data using MapReduce, YARN, Hive, Oracle XQuery for Hadoop, Solr, and Spark. Integrate big data and warehouse data using Sqoop, Oracle Big Data Connectors, Copy to Hadoop, Oracle Data Integrator, and Oracle GoldenGate for big data, and Oracle Big Data SQL. Analyze big data using Oracle Big Data SQL, Oracle Big Data Spatial and Graph, and Oracle Advanced Analytics technologies. Use and manage Oracle Big Data Appliance. Identify the key features and benefits of Oracle Big Data Cloud Service. Identify the key features and benefits of Oracle Big Data Cloud Service - Compute Edition. Benefits To You You will benefit from this course as you define the term big data and discuss Oracle s Big Data solution and use cases. You learn about Apache Hadoop and its core components: HDFS, YARN, and MapReduce. You will also learn about some of the major projects in the Hadoop ecosystem. You will learn how to acquire data into HDFS and Oracle NoSQL Database by using CLI, Flume, and Kafka. To process the data stored in HDFS, you run MapReduce and Spark jobs. Copyright 2017, Oracle. All rights reserved. Page 1

You also explore a range of analysis options, including Oracle Advanced Analytics (OAA) (comprised of Oracle Data Mining and Oracle R Enterprise), and Oracle Big Data Spatial and Graph. You will learn about the Oracle Big Data Appliance, Oracle Big Data Cloud Service, and Oracle Big Data Cloud Service - Compute Edition. You will study case scenarios where Oracle Big Data stands as the perfect solution. Audience Application Developers Database Administrators Database Developers Related Training Required Prerequisites Database Basics and Administration Suggested Prerequisites Exposure to Big Data Course Objectives Define Big Data Describe Oracle's Integrated Big Data Solution and its components Define Cloudera's distribution of Hadoop and its core components and the Hadoop ecosystem Use the Hadoop Distributed File System (HDFS) Acquire big data using the Command Line Interface, Flume, and Oracle NoSQL Database Process big data using MapReduce, YARN, Hive, Oracle XQuery for Hadoop, Solr, and Spark Integrate big data and warehouse data using Sqoop, Oracle Big Data Connectors, Copy to Hadoop, Oracle Data Integrator, and Oracle GoldenGate for big data, and Oracle Big Data SQL Analyze big data using Oracle Big Data SQL, Oracle Big Data Spatial and Graph, and Oracle Advanced Analytics technologies Use and manage Oracle Big Data Appliance Identify the key features and benefits of Oracle Big Data Cloud Service Identify the key features and benefits of Oracle Big Data Cloud Service - Compute Edition Copyright 2017, Oracle. All rights reserved. Page 2

Course Topics Introduction Questions About You Course Objectives Course Road Map Oracle Big Data Lite (BDLite) Virtual Machine (VM) Home Page Starting the Oracle BDLite VM and accessing the Practice Files Reviewing the Available Big Data Documentation, Tutorials, and Other Resources Introducing Oracle Big Data Strategy Characteristics of Big Data Importance of Big Data Big Data Opportunities: Some Examples Big Data Challenges Big Data implementation examples Oracle strategy for Big Data: combining Big Data Processing Engines: Hadoop / NoSQL / RDBMS Using Oracle Big Data Lite Virtual Machine and Movieplex Application Oracle Big Data Lite VM Used in this Course Oracle Big Data Lite VM Home Page Sections Reviewing the Deployment Guide Downloading and installing Oracle VM VirtualBox and its Extension Pack Downloading and Running 7-zip Files to create Virtual Box Appliance File Importing the Appliance File Staring the Big Data Lite VM and Starting and Stopping Services Introducing the Oracle Movieplex Case Study Introduction to the Big Data Ecosystem Computer Clusters and Distributed Computing Apache Hadoop Types of Analysis That Use Hadoop Types of Data Generated Apache Hadoop Core Components: HDFS, MapReduce (MR1), and YARN (MR2) Apache Hadoop Ecosystem Cloudera s Distribution Including Apache Hadoop (CDH) CDH Architecture and Components Introduction to the Hadoop Distributed File System Hadoop Distributed Filesystem (HDFS) Design Principles, Characteristics, and Key Definitions Sample Hadoop High Availability (HA) Cluster HDFS Files and Blocks Active and Standby Daemons (Services) Functions DataNodes (DN) Daemons Functions Writing a File to HDFS: Example Interacting With Data Stored in HDFS: Hue, Hadoop Client, WebHDFS, and HttpFS Acquire Data using CLI, Fuse, Flume, and Kafka Reviewing the Command Line Interface (CLI) Viewing File System Contents Using the CLI FS Shell Commands Loading Data Using the CLI Copyright 2017, Oracle. All rights reserved. Page 3

Overview of FuseDFS What is Flume? Kafka topics Additional Resources Acquire and Access Data Using Oracle NoSQL Database What is a NoSQL Database RDBMS Compared to NoSQL HDFS Compared to NoSQL Define Oracle NoSQL Database Oracle NoSQL models: Key-Value and Table Acquiring and Accessing Data in a NoSQL DB Accessing the CLIs (Data, Admin, SQL) Accessing the KVStore Introduction to MapReduce and YARN Processing Frameworks MapReduce Framework Features, Benefits, and Jobs Parallel Processing with MapReduce Word Count Examples Data Locality Optimization in Hadoop Submitting and Monitoring a MapReduce Job YARN Architecture, Features, and Daemons YARN Application Workflow Hadoop Basic Cluster: MapReduce 1 Versus YARN (MR 2) Resource Management Using Yarn Job Scheduling in YARN First In, First Out (FIFO) Scheduler, Capacity Scheduler, and Fair Scheduler Cloudera Manager Resource Management Features Static Service Pools Working with the Fair Scheduler Cloudera Manager Dynamic Resource Management: Example Submitting and Monitoring a MapReduce Job Using YARN Using the YARN application Command Overview of Apache Spark Benefits of Using Spark Spark Architecture Spark Application Components: Driver, Master, Cluster Manager, and Executors Running a Spark Application on YARN (yarn-cluster Mode) Resilient Distributed Dataset (RDD) Spark Interactive Shells: spark-shell and pyspark Word Count Example by Using Interactive Scala Monitoring Spark Jobs Using YARN's ResourceManager Web UI Overview of Apache Hive What is Hive? Use Case: Storing Clickstream Data Hadoop Architecture How is Data Stored in HDFS? Organizing and Describing Data With Hive Big Data SQL on Top of Hive Data Copyright 2017, Oracle. All rights reserved. Page 4

Defining Tables Over HDFS Hive Queries Overview of Cloudera Impala Overview of Cloudera Impala Hadoop: Some Data Access/Processing Options Cloudera Impala Cloudera Impala: Key Features Cloudera Impala: Supported Data Formats Cloudera Impala: Programming Interfaces How Impala Fits Into the Hadoop Ecosystem How Impala Works with Hive Using Oracle XQuery for Hadoop XML Review Oracle XQuery for Hadoop (OXH) OXH Features OXH Data Flow Using OXH: Installation, Functions, Adapters, and Configuration Properties Running an OXH Query XQuery Transformation and Basic Filtering Viewing the Completed Query in YARN's ResourceManager Overview of Solr Overview of Solr Apache Solr (Cloudera Search) Cloudera Search: Key Capabilities Cloudera Search: Features Cloudera Search Tasks Indexing in Cloudera Search Types of Indexing The solrctl Command Integrating Your Big Data Unifying Data: A Typical Requirement Comparing Big Data Processing Engines Introducing Data Unification Options When To Use These Options? Batch Loading Options Apache Sqoop Oracle Loader for Hadoop Oracle Copy to Hadoop Using Oracle SQL Connector for HDFS Batch and Dynamic Loading: Oracle SQL Connector for HDFS OSCH Architecture Using OSCH Features Parallelism and Performance Performance Tuning Key Benefits Copyright 2017, Oracle. All rights reserved. Page 5

Loading: Choosing a Connector Using Oracle Data Integrator and Oracle GoldenGate for Big Data ETL and Synchronization: Oracle Data Integrator ODI s Declarative Design ODI Knowledge Modules (KMs)Simpler Physical Design / Shorter Implementation Time Using ODI with Big Data Heterogeneous Integration with Hadoop Environments Using ODI Studio ODI Studio Components: Overview ODI Studio: Big Data Knowledge Modules Oracle GoldenGate for Big Data Using Oracle Big Data SQL Barriers to Effective Big Data Adoption Overcoming Big Data Barriers Oracle Big Data SQL: The Hybrid Solution Benefits: Virtualizes data access across Oracle Database, Hadoop and NoSQL stores Using Oracle Big Data SQL Query Performance Overview Deployment Options Using Oracle Big Data Spatial and Graph Graph and Spatial Analysis: All About Relationships What is Oracle Big Data Spatial and Graph (BDSG)? Strategy (supported platforms, etc) BDSG: Graph Analysis Oracle BDSG: Spatial Analysis Multimedia Analytics Framework Deployment Options for Oracle BDSG Additional Resources Using Oracle Advanced Analytics Oracle Advanced Analytics (OAA) OAA: Oracle Data Mining OAA: Oracle R Enterprise Oracle Big Data Deployment Options Introduction to the Oracle Big Data Appliance Running the Oracle BDA Configuration Generation Utility Oracle BDA Mammoth Software Deployment Bundle Using the Oracle BDA mammoth Utility BDA Hardware and Integrated and Optional Software Administering and Securing the Oracle BDA Introduction to the Oracle Big Data Cloud Service Introduction to the Oracle Big Data Cloud Service Compute Edition Copyright 2017, Oracle. All rights reserved. Page 6