Big Data Exercises. Fall 2016 Week 0 ETH Zurich

Size: px
Start display at page:

Download "Big Data Exercises. Fall 2016 Week 0 ETH Zurich"

Transcription

1 Big Data Exercises Fall 2016 Week 0 ETH Zurich 1. Jupyter Basics Welcome to this Jupyter notebook. Jupyter is a web-based open-source tool based on Python that allows you to run python (and other types of) code, visualize results and discuss results, and organize everything into notebooks like this one. We use the notebook server on Microsoft Azure, but you can also install your own. A notebook is organized in cells. Cells of this notebook contain Python code (but other cell types exists). To run a cell, select it, then press ctrl+enter. Try it out! In [1]: print("hello World") Hello World By default, the last expression is printed. Like this: In [2]: maxx = 10 [x * x for x in range(maxx)] Out[2]: [0, 1, 4, 9, 16, 25, 36, 49, 64, 81] You can also edit the text. Just double-click on a cell. It's made with markdown code. After you are done editing, press ctrl+enter We will do about half of the exercises with Jupyter. You will learn most things as we go. The notebook you are seing is your copy stored on your account (including the output of the cells). Write your answers inline and save regularly. (Also save a copy somewhere else from time to time this service is still beta.) Read more: Jupyter ( Learn Python ( Python documentation ( 2. Bash Scripts Code blocks by default are executed using a python interpreter (for a python notebook, such as this one). Other languages can be used with annotations. For instance, a code block can be converted into a bash code block using %%bash at the beginning: In [3]: %%bash echo "Test File" > test_file cat test_file Test File Note: do not expect files you write into this file system to be durable. The sandboxed environment may be reset and data lost when notebook is closed. The notebook files themselves are durable though. Bash commands can be also inlined using exclamation mark (!) infront of the bash line

2 In [7]: print("first I modify the file.")!echo "Hello World" > test_file print("then I check its content:") print()!cat test_file First I modify the file. Then I check its content: Hello World 3. Extensions As part of this course you will use Jupyter to interact with various systems and interfaces (e.g. SQL, Map Reduce, Spark). To use these, you will need to install certain Python and Jupyter extensions. Note, that as like the file system the extensions will not be durable and need to be rerun when the notebook server is restarted Setting up a SQL connection As part of preparation for the next week's exercise, let's setup a connection to a sample relational database (hosted from this course's Azure account) First set the access variables (make sure you execute the next code block by running ctrl+enter) In [8]: server='ethbigdata2017' user='student' password='bigdata17' database='' Next, install the PyMSSQL ( python package. Run the following cell and make sure that the output confirms that the installation was successfull. In [9]:!pip install pymssql==2.1.2 Collecting pymssql==2.1.2 Using cached pymssql tar.gz Building wheels for collected packages: pymssql Running setup.py bdist_wheel for pymssql... done Stored in directory: /home/nbuser/.cache/pip/wheels/6d/25/5d/3c673e8cdcbaffd5b ed62a09 13c380814da16b00f0 Successfully built pymssql Installing collected packages: pymssql Found existing installation: pymssql Uninstalling pymssql-2.1.1: Successfully uninstalled pymssql Successfully installed pymssql Running a SQL Query Then we run a first query against our server (following this tutorial ( f=255&mspperror= ) from the Azure website). Make sure that running the following cell successfully prints some rows.

3 In [10]: import pymssql import os os.environ['tdsver'] = '7.3' conn = pymssql.connect(server=server + '.database.windows.net', user=user + '@' + server, password=password, database=database) cursor = conn.cursor(as_dict=true) cursor.execute('select TOP 10 Id, DisplayName FROM Users') row = cursor.fetchone() while row: print(row) row = cursor.fetchone() {'Id': -1, 'DisplayName': 'Community'} {'Id': 1, 'DisplayName': 'Geoff Dalgas'} {'Id': 2, 'DisplayName': 'Kasra Rahjerdi'} {'Id': 3, 'DisplayName': 'Adam'} {'Id': 4, 'DisplayName': 'Arie Litovsky'} {'Id': 5, 'DisplayName': 'Brian Nickel'} {'Id': 6, 'DisplayName': 'Jeremy T'} {'Id': 7, 'DisplayName': 'Tom Medley'} {'Id': 8, 'DisplayName': 'LessPop_MoreFizz'} {'Id': 9, 'DisplayName': 'Nick Craver'} 3.3. Inlining SQL It is also possible to inline SQL code. To do this install the following Jupyter extension: In [11]:!git clone \ cd ipython-sql && LC_CTYPE="C.UTF-8" python setup.py -q install fatal: destination path 'ipython-sql' already exists and is not an empty directory. **At this point, we need to restart the notebook kernel!** To do that, go to Kernel in the menu at the top, then click on Restart. You need to run the cell with the connection details again after that. We can now load the extension and establish a connection to our database from above. Run the following cell and make sure the output says "Connected: <connection string>". In [12]: %load_ext sql connection_string = 'mssql+pymssql://{user}@{server}:{password}@{server}.database.windows.net:1433/{database }'.format( server=server, user=user, password=password, database=database) print(connection_string) %sql $connection_string mssql+pymssql://student@ethbigdata2017:bigdata17@ethbigdata2017.database.windows.net:1433/beer. stackexchange.com Out[12]: 'Connected: student@ethbigdata2017@' Now we can use the %sql and %%sql magic words to run SQL directly. %%sql makes a cell a SQL cell. A SQL cell can run an arbitrary number of SQL statements and displays the result of the last one of them.

4 In [13]: %%sql SELECT TOP 10 Id, DisplayName FROM Users Out[13]: Id DisplayName -1 Community 1 Geoff Dalgas 2 Kasra Rahjerdi 3 Adam 4 Arie Litovsky 5 Brian Nickel 6 Jeremy T 7 Tom Medley 8 LessPop_MoreFizz 9 Nick Craver The %sql magic words lets us run SQL statements in a regular cell. Again, the result of the last statement is displayed. In [14]: print("run a query!") %sql SELECT TOP 10 Id, DisplayName FROM Users; Run a query! Out[14]: Id DisplayName -1 Community 1 Geoff Dalgas 2 Kasra Rahjerdi 3 Adam 4 Arie Litovsky 5 Brian Nickel 6 Jeremy T 7 Tom Medley 8 LessPop_MoreFizz 9 Nick Craver 3.4. Plotting results Matplotlib can also be used to plot results. Next is a plot of a sample query that finds the number of created users per year.

5 In [15]: %matplotlib inline import matplotlib.pyplot as plt result = %sql SELECT YEAR(CreationDate) as CreationYear, COUNT(*) as YearCount \ FROM Users \ GROUP BY YEAR(CreationDate) \ ORDER BY YEAR(CreationDate) ASC; # Print the result in tabular form print(result) # Convert the result to a Pandas data frame df = result.dataframe() # Extract x and y values for a plot x = df['creationyear'].tolist() y = df['yearcount'].tolist() # Plot the distribution of registrations per year fig, ax = plt.subplots() ax.bar(range(len(df.index)), y, tick_label=[int(i) for i in x], align='center') ax.set_xlabel('creation Year') ax.set_ylabel('number of Users') CreationYear YearCount Out[15]: <matplotlib.text.text at 0x7fe5f9bf7d30>

COSC 490 Computational Topology

COSC 490 Computational Topology COSC 490 Computational Topology Dr. Joe Anderson Fall 2018 Salisbury University Course Structure Weeks 1-2: Python and Basic Data Processing Python commonly used in industry & academia Weeks 3-6: Group

More information

Lecture 3: Processing Language Data, Git/GitHub. LING 1340/2340: Data Science for Linguists Na-Rae Han

Lecture 3: Processing Language Data, Git/GitHub. LING 1340/2340: Data Science for Linguists Na-Rae Han Lecture 3: Processing Language Data, Git/GitHub LING 1340/2340: Data Science for Linguists Na-Rae Han Objectives What do linguistic data look like? Homework 1: What did you process? How does collaborating

More information

Containers. Pablo F. Ordóñez. October 18, 2018

Containers. Pablo F. Ordóñez. October 18, 2018 Containers Pablo F. Ordóñez October 18, 2018 1 Welcome Song: Sola vaya Interpreter: La Sonora Ponceña 2 Goals Containers!= ( Moby-Dick ) Containers are part of the Linux Kernel Make your own container

More information

About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer

About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer i About the Tutorial Project is a comprehensive software suite for interactive computing, that includes various packages such as Notebook, QtConsole, nbviewer, Lab. This tutorial gives you an exhaustive

More information

LECTURE 22. Numerical and Scientific Computing Part 2

LECTURE 22. Numerical and Scientific Computing Part 2 LECTURE 22 Numerical and Scientific Computing Part 2 MATPLOTLIB We re going to continue our discussion of scientific computing with matplotlib. Matplotlib is an incredibly powerful (and beautiful!) 2-D

More information

CSE 101 Introduction to Computers Development / Tutorial / Lab Environment Setup

CSE 101 Introduction to Computers Development / Tutorial / Lab Environment Setup CSE 101 Introduction to Computers Development / Tutorial / Lab Environment Setup Purpose: The purpose of this lab is to setup software that you will be using throughout the term for learning about Python

More information

JUPYTER (IPYTHON) NOTEBOOK CHEATSHEET

JUPYTER (IPYTHON) NOTEBOOK CHEATSHEET JUPYTER (IPYTHON) NOTEBOOK CHEATSHEET About Jupyter Notebooks The Jupyter Notebook is a web application that allows you to create and share documents that contain executable code, equations, visualizations

More information

Pandas plotting capabilities

Pandas plotting capabilities Pandas plotting capabilities Pandas built-in capabilities for data visualization it's built-off of matplotlib, but it's baked into pandas for easier usage. It provides the basic statistic plot types. Let's

More information

CircuitPython with Jupyter Notebooks

CircuitPython with Jupyter Notebooks CircuitPython with Jupyter Notebooks Created by Brent Rubell Last updated on 2018-08-22 04:08:47 PM UTC Guide Contents Guide Contents Overview What's a Jupyter Notebook? The Jupyter Notebook is an open-source

More information

Data Science Essentials

Data Science Essentials Data Science Essentials Lab 2 Working with Summary Statistics Overview In this lab, you will learn how to use either R or Python to compute and understand the basics of descriptive statistics. Descriptive

More information

ME30_Lab1_18JUL18. August 29, ME 30 Lab 1 - Introduction to Anaconda, JupyterLab, and Python

ME30_Lab1_18JUL18. August 29, ME 30 Lab 1 - Introduction to Anaconda, JupyterLab, and Python ME30_Lab1_18JUL18 August 29, 2018 1 ME 30 Lab 1 - Introduction to Anaconda, JupyterLab, and Python ME 30 ReDev Team 2018-07-18 Description and Summary: This lab introduces Anaconda, JupyterLab, and Python.

More information

As I begin to write this, I m returning on the plane from PyCon 2013,

As I begin to write this, I m returning on the plane from PyCon 2013, DAVID BEAZLEY David Beazley is an open source developer and author of the Python Essential Reference (4th Edition, Addison-Wesley, 2009) and Python Cookbook (3rd Edition, O Reilly & Associates, 2013).

More information

KNIME Python Integration Installation Guide. KNIME AG, Zurich, Switzerland Version 3.7 (last updated on )

KNIME Python Integration Installation Guide. KNIME AG, Zurich, Switzerland Version 3.7 (last updated on ) KNIME Python Integration Installation Guide KNIME AG, Zurich, Switzerland Version 3.7 (last updated on 2019-02-05) Table of Contents Introduction.....................................................................

More information

Platform Migrator Technical Report TR

Platform Migrator Technical Report TR Platform Migrator Technical Report TR2018-990 Munir Contractor mmc691@nyu.edu Christophe Pradal christophe.pradal@inria.fr Dennis Shasha shasha@cs.nyu.edu May 12, 2018 CONTENTS: 1 Abstract 4 2 Platform

More information

Programming with Python

Programming with Python Programming with Python EOAS Software Carpentry Workshop September 21st, 2016 https://xkcd.com/353 Getting started For our Python introduction we re going to pretend to be a researcher studying inflammation

More information

Lecture 6: more pandas (and git/github) LING 1340/2340: Data Science for Linguists Na-Rae Han

Lecture 6: more pandas (and git/github) LING 1340/2340: Data Science for Linguists Na-Rae Han Lecture 6: more pandas (and git/github) LING 1340/2340: Data Science for Linguists Na-Rae Han Objectives git and GitHub: Let's be more disciplined! Python's pandas library Tools: Git and GitHub Jupyter

More information

ipython-gremlin Documentation

ipython-gremlin Documentation ipython-gremlin Documentation Release 0.0.4 David M. Brown Mar 16, 2017 Contents 1 Releases 3 2 Requirements 5 3 Dependencies 7 4 Installation 9 5 Getting Started 11 5.1 Contribute................................................

More information

ARTIFICIAL INTELLIGENCE AND PYTHON

ARTIFICIAL INTELLIGENCE AND PYTHON ARTIFICIAL INTELLIGENCE AND PYTHON DAY 1 STANLEY LIANG, LASSONDE SCHOOL OF ENGINEERING, YORK UNIVERSITY WHAT IS PYTHON An interpreted high-level programming language for general-purpose programming. Python

More information

IPython Cypher Documentation

IPython Cypher Documentation IPython Cypher Documentation Release 1.0.0 Javier de la Rosa December 11, 2016 Contents 1 Releases 3 2 Requirements 5 3 Dependencies 7 4 Installation 9 5 Getting Started 11 6 Configuration 13 7 Contents

More information

Installation and Introduction to Jupyter & RStudio

Installation and Introduction to Jupyter & RStudio Installation and Introduction to Jupyter & RStudio CSE 4/587 Data Intensive Computing Spring 2017 Prepared by Jacob Condello 1 Anaconda/Jupyter Installation 1.1 What is Anaconda? Anaconda is a freemium

More information

Introduction to Python for Scientific Computing

Introduction to Python for Scientific Computing 1 Introduction to Python for Scientific Computing http://tinyurl.com/cq-intro-python-20151022 By: Bart Oldeman, Calcul Québec McGill HPC Bart.Oldeman@calculquebec.ca, Bart.Oldeman@mcgill.ca Partners and

More information

Using jupyter notebooks on Blue Waters. Roland Haas (NCSA / University of Illinois)

Using jupyter notebooks on Blue Waters.   Roland Haas (NCSA / University of Illinois) Using jupyter notebooks on Blue Waters https://goo.gl/4eb7qw Roland Haas (NCSA / University of Illinois) Email: rhaas@ncsa.illinois.edu Jupyter notebooks 2/18 interactive, browser based interface to Python

More information

Scientific computing platforms at PGI / JCNS

Scientific computing platforms at PGI / JCNS Member of the Helmholtz Association Scientific computing platforms at PGI / JCNS PGI-1 / IAS-1 Scientific Visualization Workshop Josef Heinen Outline Introduction Python distributions The SciPy stack Julia

More information

windrose Documentation Lionel Roubeyrie & Sebastien Celles

windrose Documentation Lionel Roubeyrie & Sebastien Celles Lionel Roubeyrie & Sebastien Celles Sep 04, 2018 Contents: 1 Install 3 1.1 Requirements............................................... 3 1.2 Install latest release version via pip...................................

More information

5 File I/O, Plotting with Matplotlib

5 File I/O, Plotting with Matplotlib 5 File I/O, Plotting with Matplotlib Bálint Aradi Course: Scientific Programming / Wissenchaftliches Programmieren (Python) Installing some SciPy stack components We will need several Scipy components

More information

Getting started with the Spyder IDE

Getting started with the Spyder IDE Getting started with the Spyder IDE Claudius Gräbner 1,2 1 Johannes Kepler University 2 ZOE. Institute for Future-Fit Economies Version 1.0 of July 18, 2018 Abstract Here I provide you with some basic

More information

Practical Statistics for Particle Physics Analyses: Introduction to Computing Examples

Practical Statistics for Particle Physics Analyses: Introduction to Computing Examples Practical Statistics for Particle Physics Analyses: Introduction to Computing Examples Louis Lyons (Imperial College), Lorenzo Moneta (CERN) IPMU, 27-29 March 2017 Introduction Hands-on session based on

More information

Genome data analysis in Python

Genome data analysis in Python Genome data analysis in Python A brief tutorial on the use of jupyter notebooks and the python data analysis library pandas for genomic data analysis. Workshop on Population and Speciation Genomics, Český

More information

PYTHON DATA VISUALIZATIONS

PYTHON DATA VISUALIZATIONS PYTHON DATA VISUALIZATIONS from Learning Python for Data Analysis and Visualization by Jose Portilla https://www.udemy.com/learning-python-for-data-analysis-and-visualization/ Notes by Michael Brothers

More information

Installation Manual and Quickstart Guide

Installation Manual and Quickstart Guide JuliaPro (v0.6.2.2) Installation Manual and Quickstart Guide Contents 1. Objective 2. Prerequisites 2.1. Installation of Xcode command line tools 3. Installing JuliaPro 4. Using the JuliaPro Command Prompt

More information

INTRODUCTION TO DATABASES IN PYTHON. Filtering and Targeting Data

INTRODUCTION TO DATABASES IN PYTHON. Filtering and Targeting Data INTRODUCTION TO DATABASES IN PYTHON Filtering and Targeting Data Where Clauses In [1]: stmt = select([census]) In [2]: stmt = stmt.where(census.columns.state == 'California') In [3]: results = connection.execute(stmt).fetchall()

More information

CME 193: Introduction to Scientific Python Lecture 1: Introduction

CME 193: Introduction to Scientific Python Lecture 1: Introduction CME 193: Introduction to Scientific Python Lecture 1: Introduction Nolan Skochdopole stanford.edu/class/cme193 1: Introduction 1-1 Contents Administration Introduction Basics Variables Control statements

More information

Getting Started with Python

Getting Started with Python Getting Started with Python A beginner course to Python Ryan Leung Updated: 2018/01/30 yanyan.ryan.leung@gmail.com Links Tutorial Material on GitHub: http://goo.gl/grrxqj 1 Learning Outcomes Python as

More information

Windows. Everywhere else

Windows. Everywhere else Git version control Enable native scrolling Git is a tool to manage sourcecode Never lose your coding progress again An empty folder 1/30 Windows Go to your programs overview and start Git Bash Everywhere

More information

Spyder Documentation. Release 3. Pierre Raybaut

Spyder Documentation. Release 3. Pierre Raybaut Spyder Documentation Release 3 Pierre Raybaut Aug 31, 2017 Contents 1 Overview 3 2 Installation 5 2.1 Installing on Windows Vista/7/8/10................................... 5 2.2 Installing on MacOS X..........................................

More information

callgraph Documentation

callgraph Documentation callgraph Documentation Release 1.0.0 Oliver Steele Jun 15, 2018 Contents: 1 Jupyter / IPython Usage 3 2 Decorator Usage 5 3 Development 7 4 Acknowledgements 9 5 License 11 6 API 13 Python Module Index

More information

Pandas and Friends. Austin Godber Mail: Source:

Pandas and Friends. Austin Godber Mail: Source: Austin Godber Mail: godber@uberhip.com Twitter: @godber Source: http://github.com/desertpy/presentations What does it do? Pandas is a Python data analysis tool built on top of NumPy that provides a suite

More information

Introduction to Python

Introduction to Python Introduction to Python Ryan Gutenkunst Molecular and Cellular Biology University of Arizona Before we start, fire up your Amazon instance, open a terminal, and enter the command sudo apt-get install ipython

More information

PyQ Documentation. Release 3.8. Enlightenment Research, LLC.

PyQ Documentation. Release 3.8. Enlightenment Research, LLC. PyQ Documentation Release 3.8 Enlightenment Research, LLC. November 21, 2016 Contents 1 Quickstart 3 2 Table of Contents 5 2.1 Installation................................................ 5 2.1.1 OS Support...........................................

More information

Data Acquisition and Processing

Data Acquisition and Processing Data Acquisition and Processing Adisak Sukul, Ph.D., Lecturer,, adisak@iastate.edu http://web.cs.iastate.edu/~adisak/bigdata/ Topics http://web.cs.iastate.edu/~adisak/bigdata/ Data Acquisition Data Processing

More information

Matplotlib Python Plotting

Matplotlib Python Plotting Matplotlib Python Plotting 1 / 6 2 / 6 3 / 6 Matplotlib Python Plotting Matplotlib is a Python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive

More information

Instituto Politécnico de Tomar. Python. Introduction. Ricardo Campos. Licenciatura ITM Técnicas Avançadas de Programação Abrantes, Portugal, 2018

Instituto Politécnico de Tomar. Python. Introduction. Ricardo Campos. Licenciatura ITM Técnicas Avançadas de Programação Abrantes, Portugal, 2018 Instituto Politécnico de Tomar Python Introduction Ricardo Campos Licenciatura ITM Técnicas Avançadas de Programação Abrantes, Portugal, 2018 This presentation was developed by Ricardo Campos, Professor

More information

VIP Documentation. Release Carlos Alberto Gomez Gonzalez, Olivier Wertz & VORTEX team

VIP Documentation. Release Carlos Alberto Gomez Gonzalez, Olivier Wertz & VORTEX team VIP Documentation Release 0.8.9 Carlos Alberto Gomez Gonzalez, Olivier Wertz & VORTEX team Feb 17, 2018 Contents 1 Introduction 3 2 Documentation 5 3 Jupyter notebook tutorial 7 4 TL;DR setup guide 9

More information

tutorial : modeling synaptic plasticity

tutorial : modeling synaptic plasticity tutorial : modeling synaptic plasticity Computational Neuroscience by the Mediterranean Winter School, Jan 20th, 2016 Michael Graupner Université Paris Descartes CNRS UMR 8118, Paris, France michael.graupner@parisdescartes.fr

More information

Python for Oracle. Oracle Database Great for Data Store. Critical for Business Operations. Performance? Run in laptop? But how do you share it?

Python for Oracle. Oracle Database Great for Data Store. Critical for Business Operations. Performance? Run in laptop? But how do you share it? Python for Oracle Arup Nanda Longtime Oracle Technologist And Python Explorer Oracle Database Great for Data Store Critical for Business Operations Performance? Run in laptop? But how do you share it?

More information

Scientific Python. 1 of 10 23/11/ :00

Scientific Python.   1 of 10 23/11/ :00 Scientific Python Neelofer Banglawala Kevin Stratford nbanglaw@epcc.ed.ac.uk kevin@epcc.ed.ac.uk Original course authors: Andy Turner Arno Proeme 1 of 10 23/11/2015 00:00 www.archer.ac.uk support@archer.ac.uk

More information

Django-CSP Documentation

Django-CSP Documentation Django-CSP Documentation Release 3.0 James Socol, Mozilla September 06, 2016 Contents 1 Installing django-csp 3 2 Configuring django-csp 5 2.1 Policy Settings..............................................

More information

Django Localized Recurrence Documentation

Django Localized Recurrence Documentation Django Localized Recurrence Documentation Release 3.2.0 Erik Swanson Jan 03, 2019 Contents 1 Table of Contents 3 1.1 Installation................................................ 3 1.2 Quickstart and Basic

More information

[%]%async_run. an IPython notebook* magic for asynchronous (code) cell execution. Valerio Maggio Researcher

[%]%async_run. an IPython notebook* magic for asynchronous (code) cell execution. Valerio Maggio Researcher [%]%async_run an IPython notebook* magic for asynchronous (code) cell execution Valerio Maggio Researcher valeriomaggio@gmail.com @leriomaggio Premises Jupyter Notebook Jupyter Notebook Jupyter Notebook

More information

Agenda. - Final Project Info. - All things Git. - Make sure to come to lab for Python next week

Agenda. - Final Project Info. - All things Git. - Make sure to come to lab for Python next week Lab #8 Git Agenda - Final Project Info - All things Git - Make sure to come to lab for Python next week Final Project Low Down The Projects are Creative AI, Arduino, Web Scheduler, ios and Connect 4 Notes

More information

Installation Manual and Quickstart Guide

Installation Manual and Quickstart Guide JuliaPro (v0.6.2.2) Installation Manual and Quickstart Guide Contents 1. Objective 2. Prerequisites 3. Installing JuliaPro 4. Using the JuliaPro Command Prompt 4.1. Configuring PyCall and IJulia for use

More information

RIT REST API Tutorial

RIT REST API Tutorial RIT User Guide Build 1.00 RIT REST API Tutorial Table of Contents Introduction... 2 Python/Environment Setup... 3 Rotman Interactive Trader Install... 3 Text Editor... 3 Python Distribution... 3 Verifying

More information

For examples, documentation, tutorials, etc, see Astropy at ( # For retrieving an image from a URL

For examples, documentation, tutorials, etc, see Astropy at   (  # For retrieving an image from a URL Astronomy example 1 Downloading images and writing FITS files For examples, documentation, tutorials, etc, see Astropy at http://www.astropy.org (http://www.astropy.org) In [1]: import scipy as sp import

More information

Exercise #1: ANALYZING SOCIAL MEDIA AND CUSTOMER SENTIMENT WITH APACHE NIFI AND HDP SEARCH INTRODUCTION CONFIGURE AND START SOLR

Exercise #1: ANALYZING SOCIAL MEDIA AND CUSTOMER SENTIMENT WITH APACHE NIFI AND HDP SEARCH INTRODUCTION CONFIGURE AND START SOLR Exercise #1: ANALYZING SOCIAL MEDIA AND CUSTOMER SENTIMENT WITH APACHE NIFI AND HDP SEARCH INTRODUCTION We will use Solr and the LucidWorks HDP Search to view our streamed data in real time to gather insights

More information

LSST software stack and deployment on other architectures. William O Mullane for Andy Connolly with material from Owen Boberg

LSST software stack and deployment on other architectures. William O Mullane for Andy Connolly with material from Owen Boberg LSST software stack and deployment on other architectures William O Mullane for Andy Connolly with material from Owen Boberg Containers and Docker Packaged piece of software with complete file system it

More information

1. BASICS OF PYTHON. JHU Physics & Astronomy Python Workshop Lecturer: Mubdi Rahman

1. BASICS OF PYTHON. JHU Physics & Astronomy Python Workshop Lecturer: Mubdi Rahman 1. BASICS OF PYTHON JHU Physics & Astronomy Python Workshop 2017 Lecturer: Mubdi Rahman HOW IS THIS WORKSHOP GOING TO WORK? We will be going over all the basics you need to get started and get productive

More information

ModBot Software Documentation 4CAD April 30, 2018

ModBot Software Documentation 4CAD April 30, 2018 Password to the Raspberry Pi: 4cadinc ModBot Software Documentation 4CAD April 30, 2018 Main Installations In Pi BIOS, enable i2c and camera Ruby Version Manager (downloadable at rvm.io) MySQL server and

More information

MATPLOTLIB. Python for computational science November 2012 CINECA.

MATPLOTLIB. Python for computational science November 2012 CINECA. MATPLOTLIB Python for computational science 19 21 November 2012 CINECA m.cestari@cineca.it Introduction (1) plotting the data gives us visual feedback in the working process Typical workflow: write a python

More information

Session 1: Introduction to Python from the Matlab perspective. October 9th, 2017 Sandra Diaz

Session 1: Introduction to Python from the Matlab perspective. October 9th, 2017 Sandra Diaz Session 1: Introduction to Python from the Matlab perspective October 9th, 2017 Sandra Diaz Working with examples in this course Git repository Work: Exercises we will be interactively working on Slides

More information

python 01 September 16, 2016

python 01 September 16, 2016 python 01 September 16, 2016 1 Introduction to Python adapted from Steve Phelps lectures - (http://sphelps.net) 2 Python is interpreted Python is an interpreted language (Java and C are not). In [1]: 7

More information

Homework 01 : Deep learning Tutorial

Homework 01 : Deep learning Tutorial Homework 01 : Deep learning Tutorial Introduction to TensorFlow and MLP 1. Introduction You are going to install TensorFlow as a tutorial of deep learning implementation. This instruction will provide

More information

Dealing with Data Especially Big Data

Dealing with Data Especially Big Data Dealing with Data Especially Big Data INFO-GB-2346.01 Fall 2017 Professor Norman White nwhite@stern.nyu.edu normwhite@twitter Teaching Assistant: Frenil Sanghavi fps241@stern.nyu.edu Administrative Assistant:

More information

Bash Tutorial. ASL Fall 2017 Week 2

Bash Tutorial. ASL Fall 2017 Week 2 Bash Tutorial ASL Fall 2017 Week 2 Comments from previous years I had to stay up all night to run experiments! I cannot work on this from home.. It took me more than 40hrs./week to work on this. Solution

More information

Chapter 4. Unix Tutorial. Unix Shell

Chapter 4. Unix Tutorial. Unix Shell Chapter 4 Unix Tutorial Users and applications interact with hardware through an operating system (OS). Unix is a very basic operating system in that it has just the essentials. Many operating systems,

More information

Nonlinear curve-fitting example

Nonlinear curve-fitting example Nonlinear curve-fitting example Implementation of curve-fitting in Python. Compare with results of Mathematica for same data sets: see pythontest.nb. In [1]: import scipy as sp from scipy.optimize import

More information

SAS and Python: The Perfect Partners in Crime

SAS and Python: The Perfect Partners in Crime Paper 2597-2018 SAS and Python: The Perfect Partners in Crime Carrie Foreman, Amadeus Software Limited ABSTRACT Python is often one of the first languages that any programmer will study. In 2017, Python

More information

pandas & ggplot quick analysis with python and friends Vincent Warmerdam Data

pandas & ggplot quick analysis with python and friends Vincent Warmerdam Data pandas & ggplot quick analysis with python and friends Vincent Warmerdam Data Scientist @fishnets88 vincentwarmerdam@godatadriven.com GoDataDriven PROUDLY PART OF THE XEBIA GROUP Who is this guy? - Data

More information

Python With Data Science

Python With Data Science Course Overview This course covers theoretical and technical aspects of using Python in Applied Data Science projects and Data Logistics use cases. Who Should Attend Data Scientists, Software Developers,

More information

Math 3316, Fall 2016 Due Nov. 3, 2016

Math 3316, Fall 2016 Due Nov. 3, 2016 Math 3316, Fall 2016 Due Nov. 3, 2016 Project 3 Polynomial Interpolation The first two sections of this project will be checked in lab the week of Oct. 24-26 this completion grade will count for 10% of

More information

The Python interpreter

The Python interpreter The Python interpreter Daniel Winklehner, Remi Lehe US Particle Accelerator School (USPAS) Summer Session Self-Consistent Simulations of Beam and Plasma Systems S. M. Lund, J.-L. Vay, D. Bruhwiler, R.

More information

Data Science Essentials Lab 5 Transforming Data

Data Science Essentials Lab 5 Transforming Data Data Science Essentials Lab 5 Transforming Data Overview In this lab, you will learn how to use tools in Azure Machine Learning along with either Python or R to integrate, clean and transform data. Collectively,

More information

Using bash. Administrative Shell Scripting COMP2101 Fall 2017

Using bash. Administrative Shell Scripting COMP2101 Fall 2017 Using bash Administrative Shell Scripting COMP2101 Fall 2017 Bash Background Bash was written to replace the Bourne shell The Bourne shell (sh) was not a good candidate for rewrite, so bash was a completely

More information

Utilization of Python in clinical study by SASPy

Utilization of Python in clinical study by SASPy Paper TT10 Utilization of Python in clinical study by SASPy Yuichi Nakajima, Novartis Pharma K.K, Tokyo, Japan ABSTRACT Python is the one of the most popular programming languages in recent years. It is

More information

django-dynamic-db-router Documentation

django-dynamic-db-router Documentation django-dynamic-db-router Documentation Release 0.1.1 Erik Swanson August 24, 2016 Contents 1 Table of Contents 3 1.1 Installation................................................ 3 1.2 Quickstart................................................

More information

Installation Manual and Quickstart Guide

Installation Manual and Quickstart Guide JuliaPro (v0.6.2.1) Installation Manual and Quickstart Guide Contents 1. Objective 2. Prerequisites 2.1. System Library Requirements 2.1.1 Prerequisites for Installation on CentOS 7 2.1.2 Prerequisites

More information

Python Packaging. Jakub Wasielak

Python Packaging. Jakub Wasielak Python Packaging Jakub Wasielak http://blog.pykonik.org/ http://koderek.edu.pl/ facebook.com/startechkrk https://pl.pycon.org/2017/ What? Why? Architecture https://packaging.python.org/current/ Installation

More information

Chris Calloway for Triangle Python Users Group at Caktus Group December 14, 2017

Chris Calloway for Triangle Python Users Group at Caktus Group December 14, 2017 Chris Calloway for Triangle Python Users Group at Caktus Group December 14, 2017 What Is Conda Cross-platform Language Agnostic Package Manager Dependency Manager Environment Manager Package Creator Command

More information

Introduction to Computer Vision Laboratories

Introduction to Computer Vision Laboratories Introduction to Computer Vision Laboratories Antonino Furnari furnari@dmi.unict.it www.dmi.unict.it/~furnari/ Computer Vision Laboratories Format: practical session + questions and homeworks. Material

More information

Parsl: Developing Interactive Parallel Workflows in Python using Parsl

Parsl: Developing Interactive Parallel Workflows in Python using Parsl Parsl: Developing Interactive Parallel Workflows in Python using Parsl Kyle Chard (chard@uchicago.edu) Yadu Babuji, Anna Woodard, Zhuozhao Li, Ben Clifford, Ian Foster, Dan Katz, Mike Wilde, Justin Wozniak

More information

Data Science with Python Course Catalog

Data Science with Python Course Catalog Enhance Your Contribution to the Business, Earn Industry-recognized Accreditations, and Develop Skills that Help You Advance in Your Career March 2018 www.iotintercon.com Table of Contents Syllabus Overview

More information

Data Analysis R&D. Jim Pivarski. February 5, Princeton University DIANA-HEP

Data Analysis R&D. Jim Pivarski. February 5, Princeton University DIANA-HEP Data Analysis R&D Jim Pivarski Princeton University DIANA-HEP February 5, 2018 1 / 20 Tools for data analysis Eventual goal Query-based analysis: let physicists do their analysis by querying a central

More information

RHCE BOOT CAMP. System Administration

RHCE BOOT CAMP. System Administration RHCE BOOT CAMP System Administration NAT CONFIGURATION NAT Configuration, eth0 outside, eth1 inside: sysctl -w net.ipv4.ip_forward=1 >> /etc/sysctl.conf iptables -t nat -A POSTROUTING -o eth0 -j MASQUERADE

More information

Jupyter and TMVA. Attila Bagoly (Eötvös Loránd University, Hungary) Mentors: Sergei V. Gleyzer Enric Tejedor Saavedra

Jupyter and TMVA. Attila Bagoly (Eötvös Loránd University, Hungary) Mentors: Sergei V. Gleyzer Enric Tejedor Saavedra Jupyter and TMVA Attila Bagoly (Eötvös Loránd University, Hungary) Mentors: Sergei V. Gleyzer Enric Tejedor Saavedra 1 Motivation Jupyter notebook: Interactive coding environment Document: HTML, Markdown

More information

Interactive Mode Python Pylab

Interactive Mode Python Pylab Short Python Intro Gerald Schuller, Nov. 2016 Python can be very similar to Matlab, very easy to learn if you already know Matlab, it is Open Source (unlike Matlab), it is easy to install, and unlike Matlab

More information

PROJECT INFRASTRUCTURE AND BASH INTRODUCTION MARKUS PILMAN<

PROJECT INFRASTRUCTURE AND BASH INTRODUCTION MARKUS PILMAN< PROJECT INFRASTRUCTURE AND BASH INTRODUCTION MARKUS PILMAN< MPILMAN@INF.ETHZ.CH> ORGANIZATION Tutorials on Tuesdays - Sometimes, will be announced In General: no exercise sessions (unless you get an email

More information

Core Python is small by design

Core Python is small by design Core Python is small by design One of the key features of Python is that the actual core language is fairly small. This is an intentional design feature to maintain simplicity. Much of the powerful functionality

More information

MariaDB ColumnStore PySpark API Usage Documentation. Release d1ab30. MariaDB Corporation

MariaDB ColumnStore PySpark API Usage Documentation. Release d1ab30. MariaDB Corporation MariaDB ColumnStore PySpark API Usage Documentation Release 1.2.3-3d1ab30 MariaDB Corporation Mar 07, 2019 CONTENTS 1 Licensing 1 1.1 Documentation Content......................................... 1 1.2

More information

Index. Bessel function, 51 Big data, 1. Cloud-based version-control system, 226 Containerization, 30 application, 32 virtualize processes, 30 31

Index. Bessel function, 51 Big data, 1. Cloud-based version-control system, 226 Containerization, 30 application, 32 virtualize processes, 30 31 Index A Amazon Web Services (AWS), 2 account creation, 2 EC2 instance creation, 9 Docker, 13 IP address, 12 key pair, 12 launch button, 11 security group, 11 stable Ubuntu server, 9 t2.micro type, 9 10

More information

INTRODUCTION TO DATA VISUALIZATION WITH PYTHON. Visualizing Regressions

INTRODUCTION TO DATA VISUALIZATION WITH PYTHON. Visualizing Regressions INTRODUCTION TO DATA VISUALIZATION WITH PYTHON Visualizing Regressions Seaborn https://stanford.edu/~mwaskom/software/seaborn/ Recap: Pandas DataFrames Labelled tabular data structure Labels on rows: index

More information

Reproducibility and Extensibility in Scientific Research. Jessica Forde

Reproducibility and Extensibility in Scientific Research. Jessica Forde Reproducibility and Extensibility in Scientific Research Jessica Forde Project Jupyter @projectjupyter @mybinderteam Project Jupyter IPython Jupyter Notebook Architecture of JupyterHub Overview The problem

More information

Unix basics exercise MBV-INFX410

Unix basics exercise MBV-INFX410 Unix basics exercise MBV-INFX410 In order to start this exercise, you need to be logged in on a UNIX computer with a terminal window open on your computer. It is best if you are logged in on freebee.abel.uio.no.

More information

Lab 5 - Repetition. September 26, 2018

Lab 5 - Repetition. September 26, 2018 Lab 5 - Repetition September 26, 2018 1 ME 30 Lab 5 - Repetition ME 30 ReDev Team Description and Summary: This lab introduces the programming concept of repetition, also called looping, where some operations

More information

classy - The Python wrapper

classy - The Python wrapper classy - The Python wrapper Thomas Tram Institute of Gravitation and Cosmology October 27, 2014 T. Tram (ICG) Lecture 7 : Wrapper October 27, 2014 1 / 15 Compiled and interpreted languages Compiled languages

More information

Discrete-Event Simulation and Performance Evaluation

Discrete-Event Simulation and Performance Evaluation Discrete-Event Simulation and Performance Evaluation 01204525 Wireless Sensor Networks and Internet of Things Chaiporn Jaikaeo (chaiporn.j@ku.ac.th) Department of Computer Engineering Kasetsart University

More information

HW0 v3. October 2, CSE 252A Computer Vision I Fall Assignment 0

HW0 v3. October 2, CSE 252A Computer Vision I Fall Assignment 0 HW0 v3 October 2, 2018 1 CSE 252A Computer Vision I Fall 2018 - Assignment 0 1.0.1 Instructor: David Kriegman 1.0.2 Assignment Published On: Tuesday, October 2, 2018 1.0.3 Due On: Tuesday, October 9, 2018

More information

Pulp Python Support Documentation

Pulp Python Support Documentation Pulp Python Support Documentation Release 1.0.1 Pulp Project October 20, 2015 Contents 1 Release Notes 3 1.1 1.0 Release Notes............................................ 3 2 Administrator Documentation

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Python and Notebooks Dr. David Koop Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively.

More information

Webgurukul Programming Language Course

Webgurukul Programming Language Course Webgurukul Programming Language Course Take One step towards IT profession with us Python Syllabus Python Training Overview > What are the Python Course Pre-requisites > Objectives of the Course > Who

More information

Catbook Workshop: Intro to NodeJS. Monde Duinkharjav

Catbook Workshop: Intro to NodeJS. Monde Duinkharjav Catbook Workshop: Intro to NodeJS Monde Duinkharjav What is NodeJS? NodeJS is... A Javascript RUNTIME ENGINE NOT a framework NOT Javascript nor a JS package It is a method for running your code in Javascript.

More information

Contents. Note: pay attention to where you are. Note: Plaintext version. Note: pay attention to where you are... 1 Note: Plaintext version...

Contents. Note: pay attention to where you are. Note: Plaintext version. Note: pay attention to where you are... 1 Note: Plaintext version... Contents Note: pay attention to where you are........................................... 1 Note: Plaintext version................................................... 1 Hello World of the Bash shell 2 Accessing

More information