Big Data Architect www.austech.edu.au
WHAT IS BIG DATA ARCHITECT? A big data architecture is designed to handle the ingestion, processing, and analysis of data that is too large or complex for traditional database systems. The threshold at which organizations enter the big data realm differs, depending on the capabilities of the users and their tools. BENEFITS OF BIG DATA ARCHITECT Extract information from extensive networking or web logs. Process massive datasets over 100GB in size. Willing to invest in a big data project, including third-party products to optimize your environment. Store large amounts of unstructured data that need to summarize or transform into a structured format for better analytics. Analyze multiple large data sources, including structured and unstructured. Analyze big data for business needs, such as analyzing store sales by season and advertising, applying sentiment analysis to social media posts, or investigating email for suspicious communication patterns. WHO SHOULD ATTEND? Data Science & Big Data professionals Software developers Business Intelligence professionals Information architects Project Managers Those looking to be a Big Data architect
LEARNING OBJECTIVES Big Data Hadoop Architect Program will help you master skills and tools like Cassandra Architecture Data Model Creation Database Interfaces Advanced Architecture Spark Scala RDD Spark SQL Spark Streaming Spark ML GraphX Replication Sharding Scalability Hadoop clusters Storm Architecture Ingestion Zookeeper Kafka Architecture
COURSE CONTENTS Big Data Hadoop and Spark Developer Introduction to Big data and Hadoop Ecosystem HDFS and YARN MapReduce and Sqoop Basics of Hive and Impala Types of Data Formats Advanced Hive Concept and Data File Partitioning Apache Flume and HBase Pig Basics of Apache Spark RDDs in Spark Implementation of Spark Applications Spark Parallel Processing Spark RDD Optimization Techniques Spark Algorithm Spark SQL Apache Spark and Scala Course Overview Introduction to Spark Introduction to Programming in Scala Using RDD for Creating Applications in Spark Running SQL Queries Using Spark SQL Spark Streaming Spark ML Programming Spark GraphX Programming MongoDB Developer and Administrator Course Introduction Introduction to NoSQL databases MongoDB A Database for the Modern Web CRUD Operations in MongoDB Indexing and Aggregation Replication and Sharding Developing Java and Node JS Application with MongoDB Administration of MongoDB Cluster Operations Apache Storm, Kafka Introduction Big Data Overview Introduction to Storm Installation and Configuration Storm Advanced Concepts Storm Interfaces Storm Trident Apache Kafka architecture Producers and consumers Advanced Kafka Understanding Internals
COURSE CONTENT Apache Cassandra Course Overview Apache Cassandra L1 Overview Big Data and NoSQL Database Introduction to Cassandra Cassandra Architecture Cassandra Installation and Configuration Cassandra Data Model Cassandra Interfaces Cassandra Advanced Architecture Apache Ecosystem around Cassandra
AUSTRALIA 5, Everage Street, Moonee Ponds, Victoria, 3039 Melbourne, Ph +61 3 8371 0000 Fax +61 3 8371 0099 Email learn@austech.edu.au