HOMEWORK 8. M. Neumann. Due: THU 29 MAR PM. Getting Started SUBMISSION INSTRUCTIONS


CSE427S HOMEWORK 8
M. Neumann
Due: THU 29 MAR PM

Getting Started

Update your SVN repository. When needed, you will find additional materials for homework x in the folder hwx. So, for the current assignment the folder is hw8.

Hint: You can check your submission to the SVN repository by viewing wustl.edu/repositories/<yourwustlkey>/cse427s_sp18 in a web browser.

SUBMISSION INSTRUCTIONS

WRITTEN:
- all written work needs to be submitted electronically in pdf format [1] via GRADESCOPE
- provide the following information on every page of your pdf file:
  - name
  - student ID
  - wustlkey (your wustlkey indicates the location (SVN repository) of your code submissions; without this information we will not be able to grade your code!)
- start every problem on a new page
- FOR GROUPS: make a group submission on GRADESCOPE and provide the following information for all group members on every page of your pdf file: names, student IDs, and location of code submissions (one student's wustlkey) [2]

CODE:
- code needs to be submitted via SVN repository commit (detailed submission instructions are provided whenever code submissions are required)
- make sure to always use the required file name(s) and submission format(s)
- comment your code to receive maximum credit

[1] Please type your solutions or use clear handwriting. If we cannot read your answer, we cannot give you credit, nor will we be able to meet any regrade requests concerning your writing.
[2] It is sufficient to commit code to one SVN repository. If you do not specify the repository for your group's code submission clearly, we will only check the first repository, sorting wustlkeys alphabetically.

Preparation

1. Complete Lab 8: Data Ingest with FLUME, as you will need the data.
2. If you have not yet executed this command, run it in the command line:
   $ ~/training_materials/dev1/scripts/training_setup_dev1.sh

Problem 1: ETL with SPARK (45%)

In this problem you will parse a set of activation records in XML format to extract the account numbers and model names.

Spark is commonly used for ETL (Extract/Transform/Load) operations. Sometimes data is stored in line-oriented records, like the web logs in the previous exercise, but sometimes the data is in a multi-line format that must be processed as a whole file. In this exercise you will practice working with file-based instead of line-based formats.

You can do this problem using the spark shell. Include all required commands in your written answers. The data for this problem can be found in your local file system under:
   ~/training_materials/dev1/data/activations

Review the activations data. We will be interested in the following information: accountnumber and model. Each XML file contains data for all the devices activated by customers during a specific month. Put the data into the loudacre folder in HDFS.

HINT: consult the API documentation for RDD operations on the Spark API page you bookmarked in Lab 8. Review the list of available methods of the RDD class.

(a) Create an RDD from the activations dataset, where the entire content of each XML file is a single RDD element. What information is stored in the first value and what in the second value of the tuples in the resulting RDD?

(b) Which SPARK function can be used to map the information in the second value to separate RDD elements? Create a new RDD of separate activation records. You can use the getactivations function provided in the stub file in the hw8 folder in your SVN repository. getactivations takes an XML string, parses it, and returns a collection of XML records.

(c) Extract the account number and device model for each activation record, and save the list to a text file formatted as account_number:model. Store the file in /loudacre/account-models in HDFS. You can use the provided getaccount and getmodel functions to find the account number and device model values from the activation record.

Include all required pyspark or Scala commands in your written answer in hw8.pdf.
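The shape of the data flow in Problem 1 can be tried out locally before touching the cluster. The following is a minimal sketch in plain Python, not a Spark program and not the stub file: the XML layout, the tag names `activations`, `activation`, `account-number` and `model`, the file names, and the helper functions `parse_activations`, `account_of` and `model_of` are all assumptions made for illustration. A list of (file name, file content) pairs imitates whole-file elements, and the nested comprehension plays the role of a flat-map that turns one element per file into one element per record.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data: (file name, whole file content) pairs.
# The XML layout is an assumption made for illustration only.
SAMPLE_FILES = [
    ("2013-01.xml",
     "<activations>"
     "<activation><account-number>101</account-number><model>Model A</model></activation>"
     "<activation><account-number>102</account-number><model>Model B</model></activation>"
     "</activations>"),
    ("2013-02.xml",
     "<activations>"
     "<activation><account-number>103</account-number><model>Model A</model></activation>"
     "</activations>"),
]


def parse_activations(xml_text):
    """Hypothetical stand-in for getactivations: XML string -> list of records."""
    return list(ET.fromstring(xml_text).iter("activation"))


def account_of(record):
    """Hypothetical stand-in for getaccount."""
    return record.find("account-number").text


def model_of(record):
    """Hypothetical stand-in for getmodel."""
    return record.find("model").text


def format_records(files):
    """Flatten whole-file elements into one 'account_number:model' string per record."""
    return [
        account_of(rec) + ":" + model_of(rec)
        for _name, content in files            # one element per file
        for rec in parse_activations(content)  # many records per file
    ]


if __name__ == "__main__":
    for line in format_records(SAMPLE_FILES):
        print(line)
```

The point of the sketch is only the one-to-many step: each file yields a variable number of records, so a plain one-to-one map would produce a list of lists rather than a flat list of records.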

Problem 2: Joining Datasets with SPARK (40%)

In this problem you will explore the Loudacre web server log files using key-value Pair RDDs. After adding some new data containing Loudacre user account information to HDFS, you will join the two RDDs.

You can do this problem using the spark shell. Include all required commands in your written answers. After completing Lab 8 (FLUME) you will find the weblogs data in HDFS under:
   /loudacre/weblogs

HINT: You will be reducing and joining large datasets, which can take a lot of time. You may wish to develop your spark commands using a smaller dataset, consisting of only a few of the web log files, rather than all of them. One way of getting a sample is to use a wildcard while loading the data; for instance, textFile("/loudacre/weblogs/*6") would include only files whose names end with the digit 6. Use the full dataset to produce the results to be included in your submission!

(a) Load the weblogs data into an RDD. How many records are there in the weblogs data? Filter the RDD to only contain HTML requests. How many records for HTML requests are in the weblogs data?

(b) Create a Pair RDD with the user ID as the key, and the integer 1 as the value. The user ID is the third field in each line. Then, sum the values for each user ID. Your RDD data will be similar to:
   (userid,5)
   (userid,7)
   (userid,2)
   ...

(c) Determine how many users visited the site for each frequency. That is, how many users visited once, twice, three times and so on. How many users visited the site once, 7 times, and 12 times?

Preparation: We will perform another simulation process to ingest data into HDFS similar to the one we did in the lab. Now, we will use SQOOP to get data from an RDBMS and put it into HDFS. First, we have to create the database in mysql. Enter the following command in the terminal:
   $ mysql -u root -p
When prompted, enter the password cloudera. Then enter:
   mysql> CREATE DATABASE loudacre;
   mysql> quit

Put the data into this database:
   $ cd ~/training_materials/dev1/data
   $ mysql -u root -p loudacre < loudacre.sql
When prompted, enter the password cloudera.

Now, import the accounts data from the SQL database into the /loudacre/accounts folder in HDFS using SQOOP by executing the following command in the terminal:
   $ sqoop import \
       --connect jdbc:mysql://localhost/loudacre \
       --username root --password cloudera \
       --table accounts \
       --target-dir /loudacre/accounts \
       --null-non-string '\\N'

(d) Create an RDD based on the accounts data consisting of key/value-array pairs: (userid, [values...]). The results will look something like this:
   (userid1,[userid1, :04:08,\N,Cheryl,West,4905 Olive Street,San Francisco,CA,...])
   (userid2,[userid2, :05:07,\N,Elizabeth,Kerns,4703 Eva Pearl Street,Richmond,CA,...])
   (userid3,[userid3, :12:12, :42:36,Melissa,Roman,3539 James Martin Circle,Oakland,CA,...])

(e) Join this Pair RDD with the set of user-id/hit-count pairs calculated in part (b). The result will look something like this:
   (userid1,([userid1, :04:08,\N,Cheryl,West,4905 Olive Street,San Francisco,CA,...],4))
   (userid2,([userid2, :05:07,\N,Elizabeth,Kerns,4703 Eva Pearl Street,Richmond,CA,...],8))
   (userid3,([userid3, :12:12, :42:36,Melissa,Roman,3539 James Martin Circle,Oakland,CA,...],1))

Display the user ID, hit count, first name (the value at index 3, counting from 0) and last name (the value at index 4) for the first 5 elements. The result for the above input would look like this:
   userid1 4 Cheryl West
   userid2 8 Elizabeth Kerns
   userid3 1 Melissa Roman

HINT: you can use for-loops in the SPARK shell.

Include all required pyspark or Scala commands in your written answer in hw8.pdf.
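The key-value steps in Problem 2 (count per key, count of counts, join on key) can be checked on a handful of records before running them on the full dataset. The following is a minimal sketch in plain Python, not Spark code: the log lines, the account rows, and the helper names `hits_per_user`, `users_per_frequency` and `inner_join` are all made up for illustration, and the log format is simplified so that only the position of the user ID (third whitespace-separated field) matches the problem statement.

```python
from collections import Counter

# Hypothetical log lines; only the third field (the user ID) matters here.
LOG_LINES = [
    "10.0.0.1 - 128 GET /a.html",
    "10.0.0.2 - 94 GET /b.html",
    "10.0.0.1 - 128 GET /c.html",
    "10.0.0.3 - 77 GET /a.html",
    "10.0.0.1 - 128 GET /d.html",
]

# Hypothetical account rows keyed by user ID:
# [userid, created, closed, first name, last name]
ACCOUNTS = {
    "128": ["128", "created-1", "\\N", "Ann", "Lee"],
    "94": ["94", "created-2", "\\N", "Bob", "Ray"],
    "5": ["5", "created-3", "\\N", "Cy", "Fox"],
}


def hits_per_user(lines):
    """Conceptually emit (user_id, 1) per line, then sum the ones per key."""
    counts = Counter()
    for line in lines:
        user_id = line.split()[2]  # third field
        counts[user_id] += 1
    return dict(counts)


def users_per_frequency(hit_counts):
    """Re-key by hit count and count how many users share each count."""
    return dict(Counter(hit_counts.values()))


def inner_join(left, right):
    """Keep only keys present on both sides and pair up their values."""
    return {k: (left[k], right[k]) for k in left if k in right}


if __name__ == "__main__":
    hits = hits_per_user(LOG_LINES)
    for user_id, (values, count) in inner_join(ACCOUNTS, hits).items():
        print(user_id, count, values[3], values[4])
```

Note that the join in this sketch drops user "77" (no account row) and account "5" (no hits); whether that matches the join you run on the cluster is worth verifying against the API documentation rather than assuming.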

Problem 3: SPARK Job Execution (15%)

(a) Describe what pipelining means in the context of a SPARK job execution. What is its benefit?

(b) Give an example of two operations that can be pipelined together.

(c) Give an example of two operations that cannot be pipelined together.

Bonus Problem (5% up to a max. of 100%) - no group work!

Write a review for this homework and store it in the file hw8_review.txt provided in your SVN repository (and commit your changes). This file should only include the review, no other information such as name, wustlkey, etc. Remember that you are not graded for the content of your review, solely for its completion.

You can only earn bonus points if you write at least 50 words. Bonus points are given to the owner of the repository only (no group work!).

Submit your review in the file hw8_review.txt provided in the hw8 folder in your SVN repository. To commit the file run:
   $ svn commit -m 'hw8 review submission' .

HOMEWORK 9. M. Neumann. Due: THU 8 NOV PM. Getting Started SUBMISSION INSTRUCTIONS

HOMEWORK 9. M. Neumann. Due: THU 8 NOV PM. Getting Started SUBMISSION INSTRUCTIONS CSE427S HOMEWORK 9 M. Neumann Due: THU 8 NOV 2018 4PM Getting Started Update your SVN repository. When needed, you will find additional materials for homework x in the folder hwx. So, for the current assignment

More information

HOMEWORK 8. M. Neumann. Due: THU 1 NOV PM. Getting Started SUBMISSION INSTRUCTIONS

HOMEWORK 8. M. Neumann. Due: THU 1 NOV PM. Getting Started SUBMISSION INSTRUCTIONS CSE427S HOMEWORK 8 M. Neumann Due: THU 1 NOV 2018 4PM Getting Started Update your SVN repository. When needed, you will find additional materials for homework x in the folder hwx. So, for the current assignment

More information

HOMEWORK 9. M. Neumann. Due: THU 5 APR PM. Getting Started SUBMISSION INSTRUCTIONS

HOMEWORK 9. M. Neumann. Due: THU 5 APR PM. Getting Started SUBMISSION INSTRUCTIONS CSE427S HOMEWORK 9 M. Neumann Due: THU 5 APR 2018 1PM Getting Started Update your SVN repository. When needed, you will find additional materials for homework x in the folder hwx. So, for the current assignment

More information

HOMEWORK 7. M. Neumann. Due: THU 8 MAR PM. Getting Started SUBMISSION INSTRUCTIONS

HOMEWORK 7. M. Neumann. Due: THU 8 MAR PM. Getting Started SUBMISSION INSTRUCTIONS CSE427S HOMEWORK 7 M. Neumann Due: THU 8 MAR 2018 1PM Getting Started Update your SVN repository. When needed, you will find additional materials for homework x in the folder hwx. So, for the current assignment

More information

HOMEWORK 5. M. Neumann. Due: THU 22 FEB PM. Getting Started SUBMISSION INSTRUCTIONS

HOMEWORK 5. M. Neumann. Due: THU 22 FEB PM. Getting Started SUBMISSION INSTRUCTIONS CSE427S HOMEWORK 5 M. Neumann Due: THU 22 FEB 2018 1PM Getting Started Update your SVN repository. When needed, you will find additional materials for homework x in the folder hwx. So, for the current

More information

HOMEWORK M. M. Neumann. Due: THU 26 APR PM. Getting Started SUBMISSION INSTRUCTIONS

HOMEWORK M. M. Neumann. Due: THU 26 APR PM. Getting Started SUBMISSION INSTRUCTIONS CSE427S HOMEWORK M M. Neumann Due: THU 26 APR 2018 1PM Getting Started Update your SVN repository. When needed, you will find additional materials for homework x in the folder hwx. So, for the current

More information

Developer Training for Apache Spark and Hadoop: Hands-On Exercises

Developer Training for Apache Spark and Hadoop: Hands-On Exercises 201611 Developer Training for Apache Spark and Hadoop: Hands-On Exercises General Notes... 3 Hands-On Exercise: Query Hadoop Data with Apache Impala... 6 Hands-On Exercise: Access HDFS with the Command

More information

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a)

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Cloudera s Developer Training for Apache Spark and Hadoop delivers the key concepts and expertise need to develop high-performance

More information

LAB 0: LINUX COMMAND LINE AND SVN

LAB 0: LINUX COMMAND LINE AND SVN CSE427S LAB 0: LINUX COMMAND LINE AND SVN M. Neumann Due: TUE 23 JAN 2018 1PM No group work. The submission for this lab needs to be done via SVN repository commit. The completion of this tutorial counts

More information

CERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI)

CERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI) CERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI) The Certificate in Software Development Life Cycle in BIGDATA, Business Intelligence and Tableau program

More information

Homework 3: Relational Database Design Theory (100 points)

Homework 3: Relational Database Design Theory (100 points) CS 122A: Introduction to Data Management Spring 2018 Homework 3: Relational Database Design Theory (100 points) Due Date: Wed, Apr 25 (5:00 PM) Submission All HW assignments should be turned in with a

More information

FINAL PROJECT #3: GEO-LOCATION CLUSTERING IN SPARK

FINAL PROJECT #3: GEO-LOCATION CLUSTERING IN SPARK CSE427S FINAL PROJECT #3: GEO-LOCATION CLUSTERING IN SPARK M. Neumann Due: NO EXTENSION FRI 4 MAY 2018 (MIDNIGHT) Project Goal In this project you and your group will interactively get to know SPARK and

More information

HHH Instructional Computing Fall

HHH Instructional Computing Fall Quick Start Guide for School Web Lockers Teacher log-on is the same as for Infinite Campus Student log-on is the same initial log on to the network except no school year is required before their user name

More information

In this exercise, you will import orders table from MySQL database. into HDFS. Get acquainted with some of basic commands of Sqoop

In this exercise, you will import orders table from MySQL database. into HDFS. Get acquainted with some of basic commands of Sqoop Practice Using Sqoop Data Files: ~/labs/sql/retail_db.sql MySQL database: retail_db In this exercise, you will import orders table from MySQL database into HDFS. Get acquainted with some of basic commands

More information

Homework 2: Translating between the ER and relational models Due at 5pm on Wednesday, April 25

Homework 2: Translating between the ER and relational models Due at 5pm on Wednesday, April 25 CS 461, Databases Systems, Spring 2018 Description Homework 2: Translating between the ER and relational models Due at 5pm on Wednesday, April 25 This assignment covers translation between the two models

More information

DATABASE SYSTEMS. Introduction to MySQL. Database System Course, 2016

DATABASE SYSTEMS. Introduction to MySQL. Database System Course, 2016 DATABASE SYSTEMS Introduction to MySQL Database System Course, 2016 AGENDA FOR TODAY Administration Database Architecture on the web Database history in a brief Databases today MySQL What is it How to

More information

Using the New UCOP UAT Validation Reports For Undergraduate Admissions

Using the New UCOP UAT Validation Reports For Undergraduate Admissions To access the new UAT validation reports in Cognos for Course Enrollment data, please follow the instructions outlined below: 1. Copy and paste the following link into your browser - http://data.ucop.edu.

More information

Developer Training for Apache Spark and Hadoop: Hands-On Exercises

Developer Training for Apache Spark and Hadoop: Hands-On Exercises 201709c Developer Training for Apache Spark and Hadoop: Hands-On Exercises Table of Contents General Notes... 1 Hands-On Exercise: Starting the Exercise Environment (Local VM)... 5 Hands-On Exercise: Starting

More information

Real-time Data Engineering in the Cloud Exercise Guide

Real-time Data Engineering in the Cloud Exercise Guide Real-time Data Engineering in the Cloud Exercise Guide Jesse Anderson 2017 SMOKING HAND LLC ALL RIGHTS RESERVED Version 1.12.a9779239 1 Contents 1 Lab Notes 3 2 Kafka HelloWorld 6 3 Streaming ETL 8 4 Advanced

More information

Blended Learning Outline: Cloudera Data Analyst Training (171219a)

Blended Learning Outline: Cloudera Data Analyst Training (171219a) Blended Learning Outline: Cloudera Data Analyst Training (171219a) Cloudera Univeristy s data analyst training course will teach you to apply traditional data analytics and business intelligence skills

More information

Oracle Big Data Fundamentals Ed 2

Oracle Big Data Fundamentals Ed 2 Oracle University Contact Us: 1.800.529.0165 Oracle Big Data Fundamentals Ed 2 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, you learn about big data, the technologies

More information

Algorithm Design and Analysis Homework #4

Algorithm Design and Analysis Homework #4 Algorithm Design and Analysis Homework #4 Due: 14:20, December 6, 2012 Homework submission instructions Submit your programming assignment (problem 1) to the Judgegirl System (http://katrina.csie.ntu.edu.tw/judgegirl/).

More information

CS 2316 Homework 9a Login Due: Friday, November 2nd, before 11:55 PM Out of 100 points. Premise

CS 2316 Homework 9a Login Due: Friday, November 2nd, before 11:55 PM Out of 100 points. Premise CS 2316 Homework 9a Login Due: Friday, November 2nd, before 11:55 PM Out of 100 points Files to submit: 1. HW9.py This is an INDIVIDUAL assignment! Collaboration at a reasonable level will not result in

More information

Data Structure and Algorithm Homework #3 Due: 2:20pm, Tuesday, April 9, 2013 TA === Homework submission instructions ===

Data Structure and Algorithm Homework #3 Due: 2:20pm, Tuesday, April 9, 2013 TA   === Homework submission instructions === Data Structure and Algorithm Homework #3 Due: 2:20pm, Tuesday, April 9, 2013 TA email: dsa1@csientuedutw === Homework submission instructions === For Problem 1, submit your source code, a Makefile to compile

More information

Lab 1 Introduction to UNIX and C

Lab 1 Introduction to UNIX and C Name: Lab 1 Introduction to UNIX and C This first lab is meant to be an introduction to computer environments we will be using this term. You must have a Pitt username to complete this lab. NOTE: Text

More information

Step 1. Final Grade Roster Submission. IT Department Printed on 05/21/2012 Page 1. Steps to Submit Grades. Step 1. Step 2. Step 3

Step 1. Final Grade Roster Submission. IT Department Printed on 05/21/2012 Page 1. Steps to Submit Grades. Step 1. Step 2. Step 3 /Notes 1. Log into CUNYfirst Enter your username and password AND Click on the Go button icon 2. From the Enterprise Menu, select the HR/Campus Solutions link IT Department Printed on 05/21/2012 Page 1

More information

Homework #1: SSH. Step 1: From the start button (lower left hand corner) type Secure. Then click on the Secure Shell File Transfer Client.

Homework #1: SSH. Step 1: From the start button (lower left hand corner) type Secure. Then click on the Secure Shell File Transfer Client. Homework #1: SSH Due WEEK 3 at the BEGINNING of lab CSE 3, Spring 2018 A. The program Some students had trouble using this program in the past. It isn t too bad if you just take a few minutes to read and

More information

MATH 1MP3 Homework #4 Due: 11:59pm, Wednesday, March 6.

MATH 1MP3 Homework #4 Due: 11:59pm, Wednesday, March 6. MATH 1MP3 Homework #4 Due: 11:59pm, Wednesday, March 6. Important notes: To start the assignment, download the Jupyter notebook file assignment 4 template.ipynb found here: https://ms.mcmaster.ca/~matt/1mp3/homework/assignment_4_template.

More information

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training::

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training:: Module Title Duration : Cloudera Data Analyst Training : 4 days Overview Take your knowledge to the next level Cloudera University s four-day data analyst training course will teach you to apply traditional

More information

Assignment 1: Plz Tell Me My Password

Assignment 1: Plz Tell Me My Password Assignment 1: Plz Tell Me My Password Part I Due: 11:59 pm, September 23, 2013 Part II Due : 11:59 pm, September 27, 2013 1 Introduction The objective of this assignment is to gain experience in: socket

More information

Hue Application for Big Data Ingestion

Hue Application for Big Data Ingestion Hue Application for Big Data Ingestion August 2016 Author: Medina Bandić Supervisor(s): Antonio Romero Marin Manuel Martin Marquez CERN openlab Summer Student Report 2016 1 Abstract The purpose of project

More information

CSE100 Principles of Programming with C++

CSE100 Principles of Programming with C++ 1 Instructions You may work in pairs (that is, as a group of two) with a partner on this lab project if you wish or you may work alone. If you work with a partner, only submit one lab project with both

More information

by the evening of Tuesday, Feb 6

by the evening of Tuesday, Feb 6 Homework 1 Due 14 February Handout 6 CSCI 334: Spring 2018 Notes This homework has three types of problems: Self Check: You are strongly encouraged to think about and work through these questions, and

More information

How to change your password - Students

How to change your password - Students - Students Introduction Upon joining the University, you will have been issued a password to access our systems. You are required to change this password immediately to ensure your continued access to

More information

Registration, Data Download, and Creating/ Restoring Backups

Registration, Data Download, and Creating/ Restoring Backups Registration, Data Download, and Creating/ Restoring Backups E Lab Users Register on the Armond Dalton Resources Website... Download the Three Initial Company Backup Files... Load Waren Sport Supply Dataset

More information

Claims Loss Run Report

Claims Loss Run Report About the Claims Loss Run Report The Claims Loss Run report gives you the ability to build custom reports from the data collected via the Claims modules with financial information pertinent to actuary

More information

Laboratory Exercise #0

Laboratory Exercise #0 Laboratory Exercise #0 This assignment focuses on the mechanics of installing and using Python. The deadline for Mimir submission is 11:59 PM on Monday, January 8. 1. Complete the steps given below to

More information

DATABASE SYSTEMS. Introduction to MySQL. Database System Course, 2016

DATABASE SYSTEMS. Introduction to MySQL. Database System Course, 2016 DATABASE SYSTEMS Introduction to MySQL Database System Course, 2016 AGENDA FOR TODAY Administration Database Architecture on the web Database history in a brief Databases today MySQL What is it How to

More information

MariaDB ColumnStore PySpark API Usage Documentation. Release d1ab30. MariaDB Corporation

MariaDB ColumnStore PySpark API Usage Documentation. Release d1ab30. MariaDB Corporation MariaDB ColumnStore PySpark API Usage Documentation Release 1.2.3-3d1ab30 MariaDB Corporation Mar 07, 2019 CONTENTS 1 Licensing 1 1.1 Documentation Content......................................... 1 1.2

More information

Running Kmeans Spark on EC2 Documentation

Running Kmeans Spark on EC2 Documentation Running Kmeans Spark on EC2 Documentation Pseudo code Input: Dataset D, Number of clusters k Output: Data points with cluster memberships Step1: Read D from HDFS as RDD Step 2: Initialize first k data

More information

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Big Data Hadoop Developer Course Content Who is the target audience? Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Complete beginners who want to learn Big Data Hadoop Professionals

More information

CSE547: Machine Learning for Big Data Spring Problem Set 1. Please read the homework submission policies.

CSE547: Machine Learning for Big Data Spring Problem Set 1. Please read the homework submission policies. CSE547: Machine Learning for Big Data Spring 2019 Problem Set 1 Please read the homework submission policies. 1 Spark (25 pts) Write a Spark program that implements a simple People You Might Know social

More information

Big Data Analytics using Apache Hadoop and Spark with Scala

Big Data Analytics using Apache Hadoop and Spark with Scala Big Data Analytics using Apache Hadoop and Spark with Scala Training Highlights : 80% of the training is with Practical Demo (On Custom Cloudera and Ubuntu Machines) 20% Theory Portion will be important

More information

CS 1803 Pair Homework 4 Greedy Scheduler (Part I) Due: Wednesday, September 29th, before 6 PM Out of 100 points

CS 1803 Pair Homework 4 Greedy Scheduler (Part I) Due: Wednesday, September 29th, before 6 PM Out of 100 points CS 1803 Pair Homework 4 Greedy Scheduler (Part I) Due: Wednesday, September 29th, before 6 PM Out of 100 points Files to submit: 1. HW4.py This is a PAIR PROGRAMMING Assignment: Work with your partner!

More information

CS 315 Software Design Homework 3 Preconditions, Postconditions, Invariants Due: Sept. 29, 11:30 PM

CS 315 Software Design Homework 3 Preconditions, Postconditions, Invariants Due: Sept. 29, 11:30 PM CS 315 Software Design Homework 3 Preconditions, Postconditions, Invariants Due: Sept. 29, 11:30 PM Objectives Defining a wellformed method to check class invariants Using assert statements to check preconditions,

More information

CS 215 Software Design Homework 3 Due: February 28, 11:30 PM

CS 215 Software Design Homework 3 Due: February 28, 11:30 PM CS 215 Software Design Homework 3 Due: February 28, 11:30 PM Objectives Specifying and checking class invariants Writing an abstract class Writing an immutable class Background Polynomials are a common

More information

DELIVERY SERVICE WEB SITE (DSWEB) PROCEDURES MANUAL

DELIVERY SERVICE WEB SITE (DSWEB) PROCEDURES MANUAL DELIVERY SERVICE WEB SITE (DSWEB) BGE - SUPPLIER EXCHANGE OF INFORMATION FOR GAS CHOICE PROGRAMS Table of Contents I. Introduction... 3 II. System Requirements... 3 III. Requesting Access to DSWeb... 3

More information

Implement an ADT while using Subversion

Implement an ADT while using Subversion 1 Objectives Learn to use Subversion Implement an ADT while using Subversion In this lab, you learn about the version control tool called Subversion and you will implement a Java class given an interface.

More information

First, let's make sure we have all of the starter code downloaded. MAC (Go to the second part of the tutorial if you are using windows)

First, let's make sure we have all of the starter code downloaded. MAC (Go to the second part of the tutorial if you are using windows) CSE 167 HW 0 - Due Thur. Jan 18th at 11:59 p.m. This homework will help you set up OpenGL on your computer. First, let's make sure we have all of the starter code downloaded. https://github.com/ht413/cse167startercode

More information

CSE 444: Database Internals. Lecture 23 Spark

CSE 444: Database Internals. Lecture 23 Spark CSE 444: Database Internals Lecture 23 Spark References Spark is an open source system from Berkeley Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. Matei

More information

Using RANCID. Contents. 1 Introduction Goals Notes Install rancid Add alias Configure rancid...

Using RANCID. Contents. 1 Introduction Goals Notes Install rancid Add alias Configure rancid... Using RANCID Contents 1 Introduction 2 1.1 Goals................................. 2 1.2 Notes................................. 2 2 Install rancid 2 2.1 Add alias............................... 3 2.2 Configure

More information

Hyundai Training and Certification System [TACS] Learner User Guide

Hyundai Training and Certification System [TACS] Learner User Guide Hyundai Training and Certification System [TACS] Learner User Guide REV. 2010-01-29 TACS USER GUIDE - TABLE OF CONTENTS 1. OVERVIEW 4 2. ACCESSING HYUNDAI TACS / LOGIN 4 3. LOG OUT 7 4. LEARNER REGISTRATION

More information

CMSC Introduction to Database Systems

CMSC Introduction to Database Systems CMSC 23500 Introduction to Database Systems Department of Computer Science University of Chicago Spring 2009 Quarter Dates: March 30 through June 2, 2009 Lectures: TuTh 12:00-1:20 in Ryerson 277 Labs:

More information

CMSC 201 Spring 2018 Lab 01 Hello World

CMSC 201 Spring 2018 Lab 01 Hello World CMSC 201 Spring 2018 Lab 01 Hello World Assignment: Lab 01 Hello World Due Date: Sunday, February 4th by 8:59:59 PM Value: 10 points At UMBC, the GL system is designed to grant students the privileges

More information

Installing WordPress CMS

Installing WordPress CMS Installing WordPress CMS Extract the contents of the wordpress zip file to D:/public_html/wordpress folder as shown in diagram 1. D:/public_html/wordpress is a virtual domain controlled by Apache Web server

More information

CE151 ASSIGNMENT

CE151 ASSIGNMENT Set by: Mike Sanderson Credit: 20% of total module mark Deadline: 11.59.59, Monday 8 December Submission of this assignment will be via the online submission system; your programs may be tested during

More information

Hadoop Development Introduction

Hadoop Development Introduction Hadoop Development Introduction What is Bigdata? Evolution of Bigdata Types of Data and their Significance Need for Bigdata Analytics Why Bigdata with Hadoop? History of Hadoop Why Hadoop is in demand

More information

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect Big Data Big Data Analyst INTRODUCTION TO BIG DATA ANALYTICS ANALYTICS PROCESSING TECHNIQUES DATA TRANSFORMATION & BATCH PROCESSING REAL TIME (STREAM) DATA PROCESSING Big Data Engineer BIG DATA FOUNDATION

More information

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development::

Overview. Prerequisites. Course Outline. Course Outline :: Apache Spark Development:: Title Duration : Apache Spark Development : 4 days Overview Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized

More information

Talend Big Data Sandbox. Big Data Insights Cookbook

Talend Big Data Sandbox. Big Data Insights Cookbook Overview Pre-requisites Setup & Configuration Hadoop Distribution Download Demo (Scenario) Overview Pre-requisites Setup & Configuration Hadoop Distribution Demo (Scenario) About this cookbook What is

More information

Oakland University Obtaining Your 1098-T Electronically

Oakland University Obtaining Your 1098-T Electronically Accessing a student 1098-T is easy - simply go to tra.vangent.com, click on First Time Student and follow the instructions. 1. Open a web browser (such as Internet Explorer, Safari, Chrome, Firefox, etc.

More information

CMSC 201 Spring 2017 Lab 01 Hello World

CMSC 201 Spring 2017 Lab 01 Hello World CMSC 201 Spring 2017 Lab 01 Hello World Assignment: Lab 01 Hello World Due Date: Sunday, February 5th by 8:59:59 PM Value: 10 points At UMBC, our General Lab (GL) system is designed to grant students the

More information

Talend Big Data Sandbox. Big Data Insights Cookbook

Talend Big Data Sandbox. Big Data Insights Cookbook Overview Pre-requisites Setup & Configuration Hadoop Distribution Download Demo (Scenario) Overview Pre-requisites Setup & Configuration Hadoop Distribution Demo (Scenario) About this cookbook What is

More information
