Pandas - an open source library for fast data analysis, cleaning and preparation

Size: px
Start display at page:

Download "Pandas - an open source library for fast data analysis, cleaning and preparation"

Transcription

1 Pandas - an open source library for fast data analysis, cleaning and preparation In [1]: import numpy as np In [2]: import pandas as pd In [4]: labels = ["a", "b", "c"] In [5]: my_data = [10,20,30] In [6]: arr = np.array(my_data) In [7]: d = {"a":10, "b":20, "c":30} In [9]: print(labels) print(my_data) print(arr) print(d) ['a', 'b', 'c'] [10, 20, 30] [ ] {'a': 10, 'b': 20, 'c': 30} In [10]: pd.series(data = my_data) Out[10]: dtype: int64

2 In [12]: pd.series(data = my_data, index = labels) Out[12]: a 10 b 20 c 30 dtype: int64 In [13]: pd.series(my_data, labels) Out[13]: a 10 b 20 c 30 dtype: int64 In [14]: pd.series(arr) Out[14]: dtype: int32 In [15]: pd.series(arr, labels) Out[15]: a 10 b 20 c 30 dtype: int32 In [16]: d Out[16]: {'a': 10, 'b': 20, 'c': 30} In [17]: pd.series(d) Out[17]: a 10 b 20 c 30 dtype: int64

3 In [18]: pd.series(data = labels) Out[18]: 0 a 1 b 2 c dtype: object In [19]: pd.series(data=[sum,print]) Out[19]: 0 <built-in function sum> 1 <built-in function print> dtype: object In [20]: ser1 = pd.series(data = [1,2,3,4], index = ["USA", "Germany", "USSR", "Japan"]) ser1 Out[20]: USA 1 Germany 2 USSR 3 Japan 4 dtype: int64 In [21]: ser2 = pd.series(data = [1,2,5,4], index = ["USA", "Germany", "Italy", "Japan"]) ser2 Out[21]: USA 1 Germany 2 Italy 5 Japan 4 dtype: int64 In [22]: ser1[0] Out[22]: 1 In [23]: ser1["usa"] Out[23]: 1

4 In [24]: ser2["italy"] Out[24]: 5 In [25]: ser3 = pd.series(data = labels) ser3 Out[25]: 0 a 1 b 2 c dtype: object In [26]: Out[26]: ser3[0] 'a' In [28]: ser1 Out[28]: USA 1 Germany 2 USSR 3 Japan 4 dtype: int64 In [29]: ser2 Out[29]: USA 1 Germany 2 Italy 5 Japan 4 dtype: int64 In [31]: ser1 + ser2 # it does not find a match, so it produces NaN Out[31]: Germany 4.0 Italy NaN Japan 8.0 USA 2.0 USSR NaN dtype: float64

5 Data Frames In [10]: from numpy.random import randn In [32]: np.random.seed(101) In [37]: = pd.dataframe(randn(5,4), ["Row1", "Row2", "Row3", "Row4", "Row5"], ["Col1", "Col2", "Col3", "Col4"]) In [38]: Out[38]: Row Row Row In [39]: ["Col2"] Out[39]: Row Row Row Row Row Name: Col2, dtype: float64 In [40]: Out[40]: type(["col2"]) pandas.core.series.series

6 In [41]: Out[41]: type() pandas.core.frame.dataframe In [44]:.Col4 Out[44]: Row Row Row Row Row Name: Col4, dtype: float64 In [45]: Out[45]: [["Col1","Col2"]] Col1 Col2 Row Row Row Row Row In [46]: ["NewCol"] = ["Col1"] + ["Col2"]

7 In [47]: Out[47]: NewCol Row Row Row In [52]:.drop("NewCol", axis = 1, inplace = True) In [53]: Out[53]: Row Row Row In [57]:.drop("Row5", axis = 0, inplace = True)

8 In [58]: Out[58]: Row Row In [59]:.shape Out[59]: (4, 4) There are two ways to index rows In [61]:.loc["Row1"] Out[61]: Col Col Col Col Name: Row1, dtype: float64 In [62]:.iloc[2] Out[62]: Col Col Col Col Name: Row3, dtype: float64 Subsets of rows and columns

9 Just like Numpy In [63]:.loc["Row1","Col3"] Out[63]: In [65]:.loc[["Row1", "Row2"], ["Col1", "Col3"]] Out[65]: Col1 Col3 Row Row Conditional Selection in data frames In [66]: > 0 Out[66]: Row1 True True False False Row2 False True True True Row3 True True True True Row4 False False False True

10 In [67]: [ > 0] Out[67]: Row NaN NaN Row2 NaN Row4 NaN NaN NaN In [68]: # or bool = > 0 In [69]: Out[69]: bool Row1 True True False False Row2 False True True True Row3 True True True True Row4 False False False True In [70]: [bool] Out[70]: Row NaN NaN Row2 NaN Row4 NaN NaN NaN

11 In [71]: Out[71]: Row Row In [73]: ["Col1"] > 0 Out[73]: Row1 True Row2 False Row3 True Row4 False Name: Col1, dtype: bool In [75]: [["Col1"] > 0] Out[75]: In [76]: Out[76]: Row Row

12 In [77]: [["Col4"] < 0] Out[77]: In [78]: result = [["Col1"] > 0] In [79]: result Out[79]: In [80]: result["col2"] Out[80]: Row Row Name: Col2, dtype: float64 In [81]: # or [["Col1"] > 0]["Col2"] Out[81]: Row Row Name: Col2, dtype: float64 In [82]: [["Col1"] > 0][["Col2", "Col3"]] Out[82]: Col2 Col3 Row Row

13 In [83]: [(["Col1"] > 0) and (["Col2"] > 1)] ValueError Traceback (most recent call last) <ipython-input-83-6c94b9b71d90> in <module>() ----> 1 [(["Col1"] > 0) and (["Col2"] > 1)] D:\Training\PythonforDataScience\lib\site-packages\pandas\core\generic.py in nonzero (self) 953 raise ValueError("The truth value of a {0} is ambiguous. " 954 "Use a.empty, a.bool(), a.item(), a.any() or a.all()." --> 955.format(self. class. name )) bool = nonzero ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all(). You need to write & for multiple conditions in data frames In [84]: [(["Col1"] > 0) & (["Col2"] > 1)] Out[84]: The same with or In [85]: [(["Col1"] > 0) (["Col2"] > 1)] Out[85]:

14 Index In [86]: Out[86]: Row Row In [87]: Out[87]:.reset_index() index 0 1 Row Row In [91]: new_ind = "CA NY WY OR".split() In [92]: Out[92]: new_ind ['CA', 'NY', 'WY', 'OR'] In [93]: ["States"] = new_ind

15 In [94]: Out[94]: States CA Row NY WY Row OR In [95]: Out[95]:.set_index("States") States CA NY WY OR In [96]: Out[96]: States CA Row NY WY Row OR In [3]: outside = ["G1", "G1", "G1", "G2", "G2", "G2"]

16 In [4]: inside = [1,2,3,1,2,3] In [5]: hier_index = list(zip(outside,inside)) In [6]: hier_index = pd.multiindex.from_tuples(hier_index) In [7]: hier_index Out[7]: MultiIndex(levels=[['G1', 'G2'], [1, 2, 3]], labels=[[0, 0, 0, 1, 1, 1], [0, 1, 2, 0, 1, 2]]) List function with zip function create the following: In [8]: list(zip(outside,inside)) Out[8]: [('G1', 1), ('G1', 2), ('G1', 3), ('G2', 1), ('G2', 2), ('G2', 3)] In [11]: = pd.dataframe(randn(6,2),hier_index,["a","b"]) In [12]: Out[12]: A B G G

17 In [13]: Out[13]:.loc["G1"] A B In [14]:.loc["G1"].loc[1] Out[14]: A B Name: 1, dtype: float64 In [15]: Out[15]:.index.names FrozenList([None, None]) In [16]:.index.names = ["Groups", "Numbers"] In [17]: Out[17]: A B Groups Numbers G G

18 Indexing G2,2,B value, meaning In [19]:.loc["G2"].loc[2].loc["B"] Out[19]: In [20]: # or loc["G1"].loc[3].loc["B"] Out[20]: Cross section In [21]: Out[21]: A B Groups Numbers G G

19 In [22]: Out[22]:.loc["G1"] Numbers A B In [23]: Out[23]:.xs("G1") Numbers A B In [26]:.xs(1,level = "Numbers") # grabbing the ones from both groups Out[26]: A B Groups G G

INTERMEDIATE PYTHON FOR DATA SCIENCE. Comparison Operators

INTERMEDIATE PYTHON FOR DATA SCIENCE. Comparison Operators INTERMEDIATE PYTHON FOR DATA SCIENCE Comparison Operators Numpy Recap In [1]: import numpy as np In [2]: np_height = np.array([1.73, 1.68, 1.71, 1.89, 1.79]) In [3]: np_weight = np.array([65.4, 59.2, 63.6,

More information

Python (version 3.6) for R Users: Stat Modules I

Python (version 3.6) for R Users: Stat Modules I Python (version 3.6) for R Users: Stat Modules I CMU MSP 36601, Fall 2017, Howard Seltman 1. Use the numpy module to get vector, matrix, and array functionality as well as linear algebra. The official

More information

MERGING DATAFRAMES WITH PANDAS. Appending & concatenating Series

MERGING DATAFRAMES WITH PANDAS. Appending & concatenating Series MERGING DATAFRAMES WITH PANDAS Appending & concatenating Series append().append(): Series & DataFrame method Invocation: s1.append(s2) Stacks rows of s2 below s1 Method for Series & DataFrames concat()

More information

Pandas. Data Manipulation in Python

Pandas. Data Manipulation in Python Pandas Data Manipulation in Python 1 / 27 Pandas Built on NumPy Adds data structures and data manipulation tools Enables easier data cleaning and analysis import pandas as pd 2 / 27 Pandas Fundamentals

More information

CSC Advanced Scientific Computing, Fall Numpy

CSC Advanced Scientific Computing, Fall Numpy CSC 223 - Advanced Scientific Computing, Fall 2017 Numpy Numpy Numpy (Numerical Python) provides an interface, called an array, to operate on dense data buffers. Numpy arrays are at the core of most Python

More information

Pandas. Data Manipulation in Python

Pandas. Data Manipulation in Python Pandas Data Manipulation in Python 1 / 26 Pandas Built on NumPy Adds data structures and data manipulation tools Enables easier data cleaning and analysis import pandas as pd 2 / 26 Pandas Fundamentals

More information

CLEANING DATA IN PYTHON. Data types

CLEANING DATA IN PYTHON. Data types CLEANING DATA IN PYTHON Data types Prepare and clean data Cleaning Data in Python Data types In [1]: print(df.dtypes) name object sex object treatment a object treatment b int64 dtype: object There may

More information

Introducing Python Pandas

Introducing Python Pandas Introducing Python Pandas Based on CBSE Curriculum Class -11 By- Neha Tyagi PGT CS KV 5 Jaipur II Shift Jaipur Region Neha Tyagi, KV 5 Jaipur II Shift Introduction Pandas or Python Pandas is a library

More information

What is Data Science?

What is Data Science? What is Data Science? Data science ~ computer science + mathematics/statistics + visualization Outline of a data science project Harvesting Cleaning Analyzing Visualizing Publishing Actively used Python

More information

Python Pandas- II Dataframes and Other Operations

Python Pandas- II Dataframes and Other Operations Python Pandas- II Dataframes and Other Operations Based on CBSE Curriculum Class -11 By- Neha Tyagi PGT CS KV 5 Jaipur II Shift Jaipur Region Neha Tyagi, KV 5 Jaipur II Shift Introduction In last chapter,

More information

Chapter 1 : Informatics Practices. Class XII ( As per CBSE Board) Advance operations on dataframes (pivoting, sorting & aggregation/descriptive

Chapter 1 : Informatics Practices. Class XII ( As per CBSE Board) Advance operations on dataframes (pivoting, sorting & aggregation/descriptive Chapter 1 : Informatics Practices Class XII ( As per CBSE Board) Advance operations on dataframes (pivoting, sorting & aggregation/descriptive statistics) Pivoting - dataframe DataFrame -It is a 2-dimensional

More information

IMPORTING & MANAGING FINANCIAL DATA IN PYTHON. Read, inspect, & clean data from csv files

IMPORTING & MANAGING FINANCIAL DATA IN PYTHON. Read, inspect, & clean data from csv files IMPORTING & MANAGING FINANCIAL DATA IN PYTHON Read, inspect, & clean data from csv files Import & clean data Ensure that pd.dataframe() is same as csv source file Stock exchange listings: amex-listings.csv

More information

Pandas and Friends. Austin Godber Mail: Source:

Pandas and Friends. Austin Godber Mail: Source: Austin Godber Mail: godber@uberhip.com Twitter: @godber Source: http://github.com/desertpy/presentations What does it do? Pandas is a Python data analysis tool built on top of NumPy that provides a suite

More information

Scientific Programming. Lecture A07 Pandas

Scientific Programming. Lecture A07 Pandas Scientific Programming Lecture A07 Pandas Alberto Montresor Università di Trento 2018/10/19 Acknowledgments: Stefano Teso, Pandas Documentation http://disi.unitn.it/~teso/courses/sciprog/python_pandas.html

More information

MACHINE LEARNING WITH THE EXPERTS: SCHOOL BUDGETS. Introducing the challenge

MACHINE LEARNING WITH THE EXPERTS: SCHOOL BUDGETS. Introducing the challenge MACHINE LEARNING WITH THE EXPERTS: SCHOOL BUDGETS Introducing the challenge Introducing the challenge Learn from the expert who won DrivenData s challenge Natural language processing Feature engineering

More information

SAS and Python: The Perfect Partners in Crime

SAS and Python: The Perfect Partners in Crime Paper 2597-2018 SAS and Python: The Perfect Partners in Crime Carrie Foreman, Amadeus Software Limited ABSTRACT Python is often one of the first languages that any programmer will study. In 2017, Python

More information

INTRODUCTION TO DATA VISUALIZATION WITH PYTHON. Visualizing time series

INTRODUCTION TO DATA VISUALIZATION WITH PYTHON. Visualizing time series INTRODUCTION TO DATA VISUALIZATION WITH PYTHON Visualizing time series Introduction to Data Visualization with Python Datetimes & time series In [1]: type(weather) Out[1]: pandas.core.frame.dataframe In

More information

Scientific Programming. Lecture A08 Numpy

Scientific Programming. Lecture A08 Numpy Scientific Programming Lecture A08 Alberto Montresor Università di Trento 2018/10/25 Acknowledgments: Stefano Teso, Documentation http://disi.unitn.it/~teso/courses/sciprog/python_appendices.html https://docs.scipy.org/doc/numpy-1.13.0/reference/

More information

Computer Science 121. Scientific Computing Winter 2016 Chapter 8 Loops

Computer Science 121. Scientific Computing Winter 2016 Chapter 8 Loops Computer Science 121 Scientific Computing Winter 2016 Chapter 8 Loops Loops: Motivation We've already seen algorithms (findsmallest, Fibonacci, factorial) that repeat the same steps over and over Without

More information

Problem Based Learning 2018

Problem Based Learning 2018 Problem Based Learning 2018 Introduction to Machine Learning with Python L. Richter Department of Computer Science Technische Universität München Monday, Jun 25th L. Richter PBL 18 1 / 21 Overview 1 2

More information

file:///users/jacobperricone/desktop/stanford/cme193_s18/cme193/lectures/presentations/lecture6.slides.html?print-pdf=true#/ 1/112

file:///users/jacobperricone/desktop/stanford/cme193_s18/cme193/lectures/presentations/lecture6.slides.html?print-pdf=true#/ 1/112 CME 193 Introduction to Scienti c Python Spring 2018 Lecture 6 Pandas file:///users/jacobperricone/desktop/stanford/cme193_s18/cme193/lectures/presentations/lecture6.slides.html?print-pdf=true#/ 1/112

More information

WordEmbeddingLoader Documentation

WordEmbeddingLoader Documentation WordEmbeddingLoader Documentation Release 0.2.0 Yuta Koreeda Aug 14, 2017 Modules 1 Issues with encoding 3 2 Development 5 3 CHANGELOG 7 3.1 v0.2.................................................... 7

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Reading Data Dr. David Koop Data Frame A dictionary of Series (labels for each series) A spreadsheet with column headers Has an index shared with each series Allows

More information

Dhavide Aruliah Director of Training, Anaconda

Dhavide Aruliah Director of Training, Anaconda PARALLEL COMPUTING WITH DASK Understanding Computer Storage & Dhavide Aruliah Director of Training, Anaconda Big Data What is "Big Data"? "Data > one machine" Storage Units: Bytes, Kilobytes, Megabytes,...

More information

DATA VISUALIZATION. In [2]: %matplotlib inline. import numpy as np import pandas as pd. import matplotlib.pyplot as plt

DATA VISUALIZATION. In [2]: %matplotlib inline. import numpy as np import pandas as pd. import matplotlib.pyplot as plt DATA VISUALIZATION In [2]: %matplotlib inline import numpy as np import pandas as pd import matplotlib.pyplot as plt In [3]: # CASE 1: Potasium Ion Channel Kv1.2 # the input file has been generated by

More information

Part VI. Scientific Computing in Python. Tobias Neckel: Scripting with Bash and Python Compact Max-Planck, February 16-26,

Part VI. Scientific Computing in Python. Tobias Neckel: Scripting with Bash and Python Compact Max-Planck, February 16-26, Part VI Scientific Computing in Python Compact Course @ Max-Planck, February 16-26, 2015 81 More on Maths Module math Constants pi and e Functions that operate on int and float All return values float

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Data Frames Dr. David Koop pandas Contains high-level data structures and manipulation tools designed to make data analysis fast and easy in Python Built on top of

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Data Frames Dr. David Koop List, Array, or Series? [[1,2,3],[4,5,6]] 2 List, Array, or Series? [[1,2,3],[4,5,6]] 3 List, Array, or Series? Which should I use to store

More information

Exercise: Introduction to NumPy arrays

Exercise: Introduction to NumPy arrays Exercise: Introduction to NumPy arrays Aim: Introduce basic NumPy array creation and indexing Issues covered: Importing NumPy Creating an array from a list Creating arrays of zeros or ones Understanding

More information

Python Data Analysis Library

Python Data Analysis Library What is pandas? Pandas: Panel data system Python data analysis library, built on top of numpy Open Sourced by AQR Capital Management, LLC in late 2009 30.000 lines of tested Python/Cython code More details

More information

Python for Data Analysis. Prof.Sushila Aghav-Palwe Assistant Professor MIT

Python for Data Analysis. Prof.Sushila Aghav-Palwe Assistant Professor MIT Python for Data Analysis Prof.Sushila Aghav-Palwe Assistant Professor MIT Four steps to apply data analytics: 1. Define your Objective What are you trying to achieve? What could the result look like? 2.

More information

DATA STRUCTURE AND ALGORITHM USING PYTHON

DATA STRUCTURE AND ALGORITHM USING PYTHON DATA STRUCTURE AND ALGORITHM USING PYTHON Common Use Python Module II Peter Lo Pandas Data Structures and Data Analysis tools 2 What is Pandas? Pandas is an open-source Python library providing highperformance,

More information

ARTIFICIAL INTELLIGENCE AND PYTHON

ARTIFICIAL INTELLIGENCE AND PYTHON ARTIFICIAL INTELLIGENCE AND PYTHON DAY 1 STANLEY LIANG, LASSONDE SCHOOL OF ENGINEERING, YORK UNIVERSITY WHAT IS PYTHON An interpreted high-level programming language for general-purpose programming. Python

More information

Interfacing With Other Programming Languages Using Cython

Interfacing With Other Programming Languages Using Cython Lab 19 Interfacing With Other Programming Languages Using Cython Lab Objective: Learn to interface with object files using Cython. This lab should be worked through on a machine that has already been configured

More information

MANIPULATING TIME SERIES DATA IN PYTHON. How to use Dates & Times with pandas

MANIPULATING TIME SERIES DATA IN PYTHON. How to use Dates & Times with pandas MANIPULATING TIME SERIES DATA IN PYTHON How to use Dates & Times with pandas Date & Time Series Functionality At the root: data types for date & time information Objects for points in time and periods

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Data Frames Dr. David Koop 2D Indexing [W. McKinney, Python for Data Analysis] 2 Boolean Indexing names == 'Bob' gives back booleans that represent the elementwise

More information

pyblock Documentation

pyblock Documentation pyblock Documentation Release 0.4 James Spencer Apr 02, 2018 Contents 1 Installation 3 2 pyblock API 5 3 pyblock tutorial 11 4 References 17 5 Indices and tables 19 Bibliography 21 Python Module Index

More information

Control Structures 1 / 17

Control Structures 1 / 17 Control Structures 1 / 17 Structured Programming Any algorithm can be expressed by: Sequence - one statement after another Selection - conditional execution (not conditional jumping) Repetition - loops

More information

Numpy fast array interface

Numpy fast array interface NUMPY Numpy fast array interface Standard Python is not well suitable for numerical computations lists are very flexible but also slow to process in numerical computations Numpy adds a new array data type

More information

Series. >>> import numpy as np >>> import pandas as pd

Series. >>> import numpy as np >>> import pandas as pd 7 Pandas I: Introduction Lab Objective: Though NumPy and SciPy are powerful tools for numerical computing, they lack some of the high-level functionality necessary for many data science applications. Python

More information

df2gspread Documentation

df2gspread Documentation df2gspread Documentation Release Eduard Trott Apr 05, 2017 Contents 1 df2gspread 3 1.1 Description................................................ 3 1.2 Status...................................................

More information

Exam #3, Form 3 A CSE 231 Fall 2015 (1) DO NOT OPEN YOUR EXAM BOOKLET UNTIL YOU HAVE BEEN TOLD TO BEGIN.

Exam #3, Form 3 A CSE 231 Fall 2015 (1) DO NOT OPEN YOUR EXAM BOOKLET UNTIL YOU HAVE BEEN TOLD TO BEGIN. Name: Section: Date: INSTRUCTIONS: (1) DO NOT OPEN YOUR EXAM BOOKLET UNTIL YOU HAVE BEEN TOLD TO BEGIN. (2) Please fill in the requested information at the top of this exam booklet. (3) Use a #2 pencil

More information

NumPy quick reference

NumPy quick reference John W. Shipman 2016-05-30 12:28 Abstract A guide to the more common functions of NumPy, a numerical computation module for the Python programming language. This publication is available in Web form1 and

More information

Introductory Scientific Computing with Python

Introductory Scientific Computing with Python Introductory Scientific Computing with Python More plotting, lists and FOSSEE Department of Aerospace Engineering IIT Bombay SciPy India, 2015 December, 2015 FOSSEE (FOSSEE IITB) Interactive Plotting 1

More information

NumPy. Daniël de Kok. May 4, 2017

NumPy. Daniël de Kok. May 4, 2017 NumPy Daniël de Kok May 4, 2017 Introduction Today Today s lecture is about the NumPy linear algebra library for Python. Today you will learn: How to create NumPy arrays, which store vectors, matrices,

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Arrays and Series Dr. David Koop Exception Example def divide(mylist, x,y): newlist = [] try: z = x // y below, mid, above = \ mylist[:z], mylist[z], mylist[z+1:]

More information

Data Science and Machine Learning Essentials

Data Science and Machine Learning Essentials Data Science and Machine Learning Essentials Lab 2B Transforming Data with Scripts By Graeme Malcolm and Stephen Elston Overview In this lab, you will learn how to use Python or R to manipulate and analyze

More information

NumPy is suited to many applications Image processing Signal processing Linear algebra A plethora of others

NumPy is suited to many applications Image processing Signal processing Linear algebra A plethora of others Introduction to NumPy What is NumPy NumPy is a Python C extension library for array-oriented computing Efficient In-memory Contiguous (or Strided) Homogeneous (but types can be algebraic) NumPy is suited

More information

Q1. Write code to Import an entire module named as Calculator.py in your program.*1+

Q1. Write code to Import an entire module named as Calculator.py in your program.*1+ KENDRIYA VIDYALAYA, IIT POWAI, MUMBAI REGION 2 nd Unit Test 2018-19 Class 11 th (Eleventh) Subject : Informatics Practices(065) Max.Mark : 50 Time : 1½ hrs. Instructions : (i) All ques. Are compulsory

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Arrays Dr. David Koop Class Example class Rectangle: def init (self, x, y, w, h): self.x = x self.y = y self.w = w self.h = h def set_corner(self, x, y): self.x =

More information

The UOB Python Lectures: Part 3 - Python for Data Analysis

The UOB Python Lectures: Part 3 - Python for Data Analysis The UOB Python Lectures: Part 3 - Python for Data Analysis Hesham al-ammal University of Bahrain Small Data BIG Data Data Scientist s Tasks Interacting with the outside world Reading and writing with a

More information

Python Working with files. May 4, 2017

Python Working with files. May 4, 2017 Python Working with files May 4, 2017 So far, everything we have done in Python was using in-memory operations. After closing the Python interpreter or after the script was done, all our input and output

More information

Chapter 5 : Informatics Practices. Class XII ( As per CBSE Board) Numpy - Array. New Syllabus Visit : python.mykvs.in for regular updates

Chapter 5 : Informatics Practices. Class XII ( As per CBSE Board) Numpy - Array. New Syllabus Visit : python.mykvs.in for regular updates Chapter 5 : Informatics Practices Class XII ( As per CBSE Board) Numpy - Array New Syllabus 2019-20 NumPy stands for Numerical Python.It is the core library for scientific computing in Python. It consist

More information

windrose Documentation Lionel Roubeyrie & Sebastien Celles

windrose Documentation Lionel Roubeyrie & Sebastien Celles Lionel Roubeyrie & Sebastien Celles Sep 04, 2018 Contents: 1 Install 3 1.1 Requirements............................................... 3 1.2 Install latest release version via pip...................................

More information

DSC 201: Data Analysis & Visualization

DSC 201: Data Analysis & Visualization DSC 201: Data Analysis & Visualization Classes & Arrays Dr. David Koop Sets Sets are like dictionaries but without any values: s = {'MA', 'RI', 'CT', 'NH'}; t = {'MA', 'NY', 'NH'} {} is an empty dictionary,

More information

Ch.5: Array computing and curve plotting (Part 1)

Ch.5: Array computing and curve plotting (Part 1) Ch.5: Array computing and curve plotting (Part 1) Joakim Sundnes 1,2 Hans Petter Langtangen 1,2 Simula Research Laboratory 1 University of Oslo, Dept. of Informatics 2 Sep 20, 2017 (Adjusted) Plan for

More information

The NumPy Array: A Structure for Efficient Numerical Computation

The NumPy Array: A Structure for Efficient Numerical Computation The NumPy Array: A Structure for Efficient Numerical Computation Presented at the G-Node Autumn School on Advanced Scientific Programming in Python, held in Kiel, Germany Stéfan van der Walt UC Berkeley

More information

A Look Back at Arithmetic Operators: the Increment and Decrement

A Look Back at Arithmetic Operators: the Increment and Decrement A Look Back at Arithmetic Operators: the Increment and Decrement Spring Semester 2016 Programming and Data Structure 27 Increment (++) and Decrement (--) Both of these are unary operators; they operate

More information

datascience Documentation

datascience Documentation datascience Documentation Release 0.8.1 John DeNero, David Culler, Alvin Wan, and Sam Lau September 26, 2016 Contents 1 Start Here: datascience Tutorial 3 1.1 Getting Started..............................................

More information

File Input/Output in Python. October 9, 2017

File Input/Output in Python. October 9, 2017 File Input/Output in Python October 9, 2017 Moving beyond simple analysis Use real data Most of you will have datasets that you want to do some analysis with (from simple statistics on few hundred sample

More information

Derek Bridge School of Computer Science and Information Technology University College Cork

Derek Bridge School of Computer Science and Information Technology University College Cork CS4618: rtificial Intelligence I Vectors and Matrices Derek Bridge School of Computer Science and Information Technology University College Cork Initialization In [1]: %load_ext autoreload %autoreload

More information

INTERMEDIATE PYTHON FOR DATA SCIENCE. while loop

INTERMEDIATE PYTHON FOR DATA SCIENCE. while loop INTERMEDIATE PYTHON FOR DATA SCIENCE while loop if-elif-else control.py Goes through construct only once z = 6 if z % 2 == 0 : True print(z is divisible by 2) Executed elif z % 3 == 0 : print(z is divisible

More information

cosmos_python_ Python as calculator May 31, 2018

cosmos_python_ Python as calculator May 31, 2018 cosmos_python_2018 May 31, 2018 1 Python as calculator Note: To convert ipynb to pdf file, use command: ipython nbconvert cosmos_python_2015.ipynb --to latex --post pdf In [3]: 1 + 3 Out[3]: 4 In [4]:

More information

CS 2316 Exam 3 ANSWER KEY

CS 2316 Exam 3 ANSWER KEY CS 2316 Exam 3 Practice ANSWER KEY Failure to properly fill in the information on this page will result in a deduction of up to 5 points from your exam score. Signing signifies you are aware of and in

More information

pyopendata Documentation

pyopendata Documentation pyopendata Documentation Release 0.0.3.dev sinhrks October 26, 2014 Contents 1 What s New 3 1.1 0.0.2................................................... 3 1.2 0.0.1...................................................

More information

Lab 10 - Ridge Regression and the Lasso in Python

Lab 10 - Ridge Regression and the Lasso in Python Lab 10 - Ridge Regression and the Lasso in Python March 9, 2016 This lab on Ridge Regression and the Lasso is a Python adaptation of p. 251-255 of Introduction to Statistical Learning with Applications

More information

Hands-on Machine Learning for Cybersecurity

Hands-on Machine Learning for Cybersecurity Hands-on Machine Learning for Cybersecurity James Walden 1 1 Center for Information Security Northern Kentucky University 11th Annual NKU Cybersecurity Symposium Highland Heights, KY October 11, 2018 Topics

More information

Toolboxes for Data Scientists

Toolboxes for Data Scientists Toolboxes for Data Scientists 2 2.1 Introduction In this chapter, first we introduce some of the tools that data scientists use. The toolbox of any data scientist, as for any kind of programmer, is an

More information

Software Carpentry. Nicola Chiapolini. Physik-Institut University of Zurich. June 16, 2015

Software Carpentry. Nicola Chiapolini. Physik-Institut University of Zurich. June 16, 2015 Nicola Chiapolini, June 16, 2015 1 / 45 Software Carpentry Nicola Chiapolini Physik-Institut University of Zurich June 16, 2015 Based on a talk by Pietro Berkes This work is licensed under the Creative

More information

5. Using the precendence rules for arithmetic operators, determine which statement is true.

5. Using the precendence rules for arithmetic operators, determine which statement is true. 1. Which statement assigns the value 3 to the variable mynum and prints the value of mynum to the command window? (a) mynum == 2 + 1; (b) mynum = 2 + 1; (c) mynum == 3 (d) mynum = 2 + 1 (e) mynum = 3;

More information

PYTHON DATA VISUALIZATIONS

PYTHON DATA VISUALIZATIONS PYTHON DATA VISUALIZATIONS from Learning Python for Data Analysis and Visualization by Jose Portilla https://www.udemy.com/learning-python-for-data-analysis-and-visualization/ Notes by Michael Brothers

More information

NumPy. Arno Proeme, ARCHER CSE Team Attributed to Jussi Enkovaara & Martti Louhivuori, CSC Helsinki

NumPy. Arno Proeme, ARCHER CSE Team Attributed to Jussi Enkovaara & Martti Louhivuori, CSC Helsinki NumPy Arno Proeme, ARCHER CSE Team aproeme@epcc.ed.ac.uk Attributed to Jussi Enkovaara & Martti Louhivuori, CSC Helsinki Reusing this material This work is licensed under a Creative Commons Attribution-

More information

Implement NN using NumPy

Implement NN using NumPy Implement NN using NumPy Hantao Zhang Deep Learning with Python Reading: https://www.tutorialspoint.com/numpy/ Recommendation for Using Python Install anaconda on your PC. If you already have installed

More information

dyplot Documentation Release Tsung-Han Yang

dyplot Documentation Release Tsung-Han Yang dyplot Documentation Release Tsung-Han Yang February 25, 2016 Contents 1 Motivation 3 2 Introduction 5 3 Tutorials 7 3.1 Plot three series.............................................. 7 3.2 Pie Chart.................................................

More information

Thomas Vincent Head of Data Science, Getty Images

Thomas Vincent Head of Data Science, Getty Images VISUALIZING TIME SERIES DATA IN PYTHON Clean your time series data Thomas Vincent Head of Data Science, Getty Images The CO2 level time series A snippet of the weekly measurements of CO2 levels at the

More information

Part VI. Scientific Computing in Python. Alfredo Parra : Scripting with Python Compact Max-PlanckMarch 6-10,

Part VI. Scientific Computing in Python. Alfredo Parra : Scripting with Python Compact Max-PlanckMarch 6-10, Part VI Scientific Computing in Python Compact Course @ Max-PlanckMarch 6-10, 2017 63 Doing maths in Python Standard sequence types (list, tuple,... ) Can be used as arrays Can contain different types

More information

Case study: accessing financial data

Case study: accessing financial data Case study: accessing financial data Prof. Mauro Gaspari: gaspari@cs.unibo.it Methods for accessing databases What methods exist to access financial databases? Basically there are several approaches to

More information

Week 2. Relational Operators. Block or compound statement. if/else. Branching & Looping. Gaddis: Chapters 4 & 5. CS 5301 Spring 2018.

Week 2. Relational Operators. Block or compound statement. if/else. Branching & Looping. Gaddis: Chapters 4 & 5. CS 5301 Spring 2018. Week 2 Branching & Looping Gaddis: Chapters 4 & 5 CS 5301 Spring 2018 Jill Seaman 1 Relational Operators l relational operators (result is bool): == Equal to (do not use =)!= Not equal to > Greater than

More information

Python Crash Course Numpy, Scipy, Matplotlib

Python Crash Course Numpy, Scipy, Matplotlib Python Crash Course Numpy, Scipy, Matplotlib That is what learning is. You suddenly understand something you ve understood all your life, but in a new way. Doris Lessing Steffen Brinkmann Max-Planck-Institut

More information

MANIPULATING TIME SERIES DATA IN PYTHON. Compare Time Series Growth Rates

MANIPULATING TIME SERIES DATA IN PYTHON. Compare Time Series Growth Rates MANIPULATING TIME SERIES DATA IN PYTHON Compare Time Series Growth Rates Comparing Stock Performance Stock price series: hard to compare at different levels Simple solution: normalize price series to start

More information

Introduction to Artificial Neural Networks and Deep Learning

Introduction to Artificial Neural Networks and Deep Learning Introduction to Artificial Neural Networks and Deep Learning A Practical Guide with Applications in Python Sebastian Raschka This book is for sale at http://leanpub.com/ann-and-deeplearning This version

More information

python-samplerate Documentation

python-samplerate Documentation python-samplerate Documentation Release 0.1.0+4.ga9b5d2a Tino Wagner February 24, 2017 Contents 1 Installation 3 2 Usage 5 3 See also 7 4 License 9 5 API documentation 11 5.1 samplerate module documentation....................................

More information

CS 112 Introduction to Computing II. Wayne Snyder Computer Science Department Boston University

CS 112 Introduction to Computing II. Wayne Snyder Computer Science Department Boston University 9/5/6 CS Introduction to Computing II Wayne Snyder Department Boston University Today: Arrays (D and D) Methods Program structure Fields vs local variables Next time: Program structure continued: Classes

More information

datascience Documentation

datascience Documentation datascience Documentation Release 0.7.1 John DeNero, David Culler, Alvin Wan, and Sam Lau September 14, 2016 Contents 1 Start Here: datascience Tutorial 3 1.1 Getting Started..............................................

More information

Session 04: Introduction to Numpy

Session 04: Introduction to Numpy Session 04: Introduction to Numpy October 9th, 2017 Wouter Klijn Overview Introduction Hello world Arrays Creating Interacting Copying Differences with Matlab Matrixes vs Array Why Why not Matlib module

More information

CS 2316 Exam 3. Practice. Failure to properly fill in the information on this page will result in a deduction of up to 5 points from your exam score.

CS 2316 Exam 3. Practice. Failure to properly fill in the information on this page will result in a deduction of up to 5 points from your exam score. CS 2316 Exam 3 Practice Name (print clearly): T-Square ID (gtg, gth, msmith3, etc): Section (e.g., B1): Signature: Failure to properly fill in the information on this page will result in a deduction of

More information

Search. The Nearest Neighbor Problem

Search. The Nearest Neighbor Problem 3 Nearest Neighbor Search Lab Objective: The nearest neighbor problem is an optimization problem that arises in applications such as computer vision, pattern recognition, internet marketing, and data compression.

More information

Exceptions in Python. AMath 483/583 Lecture 27 May 27, Exceptions in Python. Exceptions in Python

Exceptions in Python. AMath 483/583 Lecture 27 May 27, Exceptions in Python. Exceptions in Python AMath 483/583 Lecture 27 May 27, 2011 Today: Python exception handling Python plus Fortran: f2py Next week: More Python plus Fortran Visualization Parallel IPython Read: Class notes and references If you

More information

Introduction to Python: The Multi-Purpose Programming Language. Robert M. Porsch June 14, 2017

Introduction to Python: The Multi-Purpose Programming Language. Robert M. Porsch June 14, 2017 Introduction to Python: The Multi-Purpose Programming Language Robert M. Porsch June 14, 2017 What is Python Python is Python is a widely used high-level programming language for general-purpose programming

More information

Introduction to Python

Introduction to Python Introduction to Python Michael Krisper Thomas Wurmitzer March 22, 2014 Michael Krisper, Thomas Wurmitzer Introduction to Python March 22, 2014 1 / 27 Schedule Tutorium Dates & Deadlines Submission System

More information

pvextractor Documentation

pvextractor Documentation pvextractor Documentation Release 0.0.dev289 Adam Ginsburg and Thomas Robitaille Sep 12, 2017 Contents I Extracting slices programmatically 3 1 Defining a path 5 2 Extracting a slice 7 3 Saving the slice

More information

Scientific Computing with Python and CUDA

Scientific Computing with Python and CUDA Scientific Computing with Python and CUDA Stefan Reiterer High Performance Computing Seminar, January 17 2011 Stefan Reiterer () Scientific Computing with Python and CUDA HPC Seminar 1 / 55 Inhalt 1 A

More information

A. Python Crash Course

A. Python Crash Course A. Python Crash Course Agenda A.1 Installing Python & Co A.2 Basics A.3 Data Types A.4 Conditions A.5 Loops A.6 Functions A.7 I/O A.8 OLS with Python 2 A.1 Installing Python & Co You can download and install

More information

Chapter 1. 1 Computer-Aided Logic Design. 1.1 Introduction. 1.2 General Philosophy of Problem Specification and Solution

Chapter 1. 1 Computer-Aided Logic Design. 1.1 Introduction. 1.2 General Philosophy of Problem Specification and Solution 1 Computer-Aided Logic Design 1.1 Introduction Hardware components of computers are physical models of logical reasoning. Procedures based on ligical disciplines of mathematics are used to design these

More information

Math 1MP3, final exam

Math 1MP3, final exam Math 1MP3, final exam 23 April 2015 Please write your name and student number on this test and on your answer sheet You have 120 minutes No external aids (calculator, textbook, notes) Please number your

More information

CS 115 Lecture 8. Selection: the if statement. Neil Moore

CS 115 Lecture 8. Selection: the if statement. Neil Moore CS 115 Lecture 8 Selection: the if statement Neil Moore Department of Computer Science University of Kentucky Lexington, Kentucky 40506 neil@cs.uky.edu 24 September 2015 Selection Sometime we want to execute

More information

The SciPy Stack. Jay Summet

The SciPy Stack. Jay Summet The SciPy Stack Jay Summet May 1, 2014 Outline Numpy - Arrays, Linear Algebra, Vector Ops MatPlotLib - Data Plotting SciPy - Optimization, Scientific functions TITLE OF PRESENTATION 2 What is Numpy? 3rd

More information

Robot Vision Systems Lecture 8: Python wrappers in OpenCV

Robot Vision Systems Lecture 8: Python wrappers in OpenCV Robot Vision Systems Lecture 8: Python wrappers in OpenCV Michael Felsberg michael.felsberg@liu.se Why Python Wrappers Assume a small library based on OpenCV Python interface for Testing Distribution Prototyping

More information

Built-in Types of Data

Built-in Types of Data Built-in Types of Data Types A data type is set of values and a set of operations defined on those values Python supports several built-in data types: int (for integers), float (for floating-point numbers),

More information

Lab 16 - Multiclass SVMs and Applications to Real Data in Python

Lab 16 - Multiclass SVMs and Applications to Real Data in Python Lab 16 - Multiclass SVMs and Applications to Real Data in Python April 7, 2016 This lab on Multiclass Support Vector Machines in Python is an adaptation of p. 366-368 of Introduction to Statistical Learning

More information