Lecture 6: more pandas (and git/github) LING 1340/2340: Data Science for Linguists Na-Rae Han

Similar documents
Lecture 3: Processing Language Data, Git/GitHub. LING 1340/2340: Data Science for Linguists Na-Rae Han

Lecture 2: Data in Linguistics, Git/GitHub, Jupyter Notebook. LING 1340/2340: Data Science for Linguists Na-Rae Han

Getting the files for the first time...2. Making Changes, Commiting them and Pull Requests:...5. Update your repository from the upstream master...

Tips on how to set up a GitHub account:

Effective Software Development and Version Control

The Old World. Have you ever had to collaborate on a project by

Human-Computer Interaction Design

Version Control with Git ME 461 Fall 2018

Human-Computer Interaction Design

Lab 08. Command Line and Git

Git Tutorial. André Sailer. ILD Technical Meeting April 24, 2017 CERN-EP-LCD. ILD Technical Meeting, Apr 24, 2017 A. Sailer: Git Tutorial 1/36

LSST software stack and deployment on other architectures. William O Mullane for Andy Connolly with material from Owen Boberg

Git Setup Help using GitKraken (CSE 154)

Object Oriented Programming. Week 1 Part 2 Git and egit

CSE 344: Section 1 Git Setup for HW Introduction to SQLite. September 28, 2017

How to version control like a pro: a roadmap to your reproducible & collaborative research

Welcome to Bootcamp2015 s documentation!

Git Guide. Meher Krishna Patel. Created on : Octorber, 2017 Last updated : October, More documents are freely available at PythonDSP

Git, the magical version control

Software Development I

Git. CSCI 5828: Foundations of Software Engineering Lecture 02a 08/27/2015

CSE 331 Software Design & Implementation

CSE 344: Section 1 Git Setup for HW Introduction to SQLite

Agenda. - Final Project Info. - All things Git. - Make sure to come to lab for Python next week

Windows. Everywhere else

Version control. with git and GitHub. Karl Broman. Biostatistics & Medical Informatics, UW Madison

Intro to Github. Jessica Young


CSE 391 Lecture 9. Version control with Git

How to git with proper etiquette

CircuitPython with Jupyter Notebooks

What is git? Distributed Version Control System (VCS); Created by Linus Torvalds, to help with Linux development;

Software Revision Control for MASS. Git Installation / Configuration / Use

KTH Royal Institute of Technology SEMINAR 2-29 March Simone Stefani -

Django MFA Documentation

Github/Git Primer. Tyler Hague

Version control with git and Rstudio. Remko Duursma

Tutorial 2 GitHub Tutorial

Table of Contents. Concepts

CPSC 491. Lecture 19 & 20: Source Code Version Control. VCS = Version Control Software SCM = Source Code Management

L Modeling and Simulating Social Systems with MATLAB

SECTION 2: HW3 Setup.

699DR git/github Tutorial

FEEG Applied Programming 3 - Version Control and Git II

Using GitHub for scientific research

Lab 01 How to Survive & Introduction to Git. Web Programming DataLab, CS, NTHU

Midterm Next Week. Last year s midterm (which didn t include remotes):

Git GitHub & secrets

SECTION 2: Loop Reasoning & HW3 Setup

EECS 470 Lab 4. Version Control System. Friday, 31 st January, 2014

SECTION 2: Loop Reasoning & HW3 Setup

Working with GIT. Florido Paganelli Lund University MNXB Florido Paganelli MNXB Working with git 1/47

Git. Presenter: Haotao (Eric) Lai Contact:

IC Documentation. Release 0.1. IC team

Git version control with Eclipse (EGit) Tutorial

Esri on GitHub: How to Participate in Open Source Projects

Grading Rubric Homework 1

Git Resolve Conflict Using Mine Command Line

Effective Programming Practices for Economists. 7. Version Control, Part II: Git with a Shared Repository

Visualizing Git Workflows. A visual guide to 539 workflows

Git. SSE2034: System Software Experiment 3, Fall 2018, Jinkyu Jeong

Introduction to Git and Github

Using GitHub and SourceTree to work with DITA TC repositories

Programming for Data Science Syllabus

G E T T I N G S TA R T E D W I T H G I T

CSCE UVM Hands-on Session-1 Pre-Work

Setting up GitHub Version Control with Qt Creator*

How to be a 1337 h4ck3r: Git + Linux

TDDC88 Lab 4 Software Configuration Management

CSE 332: Data Structures and Parallelism Winter 2019 Setting Up Your CSE 332 Environment

Chapter 5. Version Control: Git

Using git to download and update BOUT++

Adafruit WebIDE. Created by Tyler Cooper. Last updated on :29:47 PM UTC

Git and GitHub. Dan Wysocki. February 12, Dan Wysocki Git and GitHub February 12, / 48

Version control system (VCS)

Lab 4: Shell Scripting

Lab 4: Bash Scripting

GitHub Classroom. Click Sign up in to top right of the page, and the following dialog is displayed.

Tizen/Artik IoT Practice Part 4 Open Source Development

Use git rm to remove files from workspace

Revision control. INF5750/ Lecture 2 (Part I)

Recap: Developer's Environment


Introduction to Git and GitHub for Writers Workbook February 23, 2019 Peter Gruenbaum

Setting up PyCharm Professional

TangeloHub Documentation

Version Control for Fun and Profit

2 Initialize a git repository on your machine, add a README file, commit and push

Using GitHub to Share with SparkFun a

CS 520: VCS and Git. Intermediate Topics Ben Kushigian

Introduction to Python

How to Archive s in Outlook 2007

A L A TEX-oriented intro to Git

Overview. 1. Install git and create a Github account 2. What is git? 3. How does git work? 4. What is GitHub? 5. Quick example using git and GitHub

Versioning with git. Moritz August Git/Bash/Python-Course for MPE. Moritz August Versioning with Git

Intermediate Programming, Spring Misha Kazhdan

Working in Teams CS 520 Theory and Practice of Software Engineering Fall 2018

A short tutorial on Git. Servesh Muralidharan 4 March 2014

Accessing OSIRIS and using OSIRIS through GitHub

CMSC 201 Fall 2016 Homework 6 Functions

Transcription:

Lecture 6: more pandas (and git/github) LING 1340/2340: Data Science for Linguists Na-Rae Han

Objectives git and GitHub: Let's be more disciplined! Python's pandas library Tools: Git and GitHub Jupyter Notebook Markdown 9/13/2017 2

Pull trouble? Let's shoot. You are trying to pull from upstream, and git says conflicts! Fatal, even! Chances are your local file is one that needs to be discarded. Pay attention to the conflict message. Which file is causing trouble? Copy that local file into a location outside of the repo, just in case Then, do: git rm troublefile git commit -m "trouble making file gone" Also: keep checking git status in between git pull upstream master ( this step might not even be necessary) After that, if needed, put back your original file with a different name Still stuck? Google. Look up stack overflow. Email Na-Rae with a screenshot. 9/14/2017 3

Some house rules for happy collabo-gitting To add files for committing, use individual file names. Use: git add file1 file2 Do not add and commit unnecessary files/dirs! To delete previously committed files, do git-delete, not just delete. rm filename (or drag file to trash bin), immediately followed by git rm filename or just do git rm filename Keep files small (especially for shared repos) GitHub has 100MB file size limit. In general, keep your data files < 3MB. Stop using: git add * git add. git rm tells git to stop tracking the file. It also deletes the file itself if it still exists. 9/14/2017 4

Many ways of git add/rm Different calls have different behaviors. New files Modified files Deleted files Files beginning with. Notes git add file1 file2 Use TAB completion git add * git add. AVOID! Only use like git add *.txt Any changes in the directory. AVOID! git add -A Likewise. AVOID! git add -u Any updates. git rm file1 If file1 exists, delete and stop tracking. 9/13/2017 5 Opposite of 'git add'. Use with deleted files or to delete files. * Files and directories work the same.

Class-Practice-Repo: we'll keep it going Let's not resort to "scrap fork/repo and start fresh" again. Keeping to a few git/github house rules will hopefully keep our Class-Practice-Repo relatively complication-free. Before: Now: Lost continuity. 9/14/2017 6

git is better in color (actually, everything is) Windows folks are using git bash, which has nice colorized git output Mac users: there are ways to customize your Terminal and git Brianna will demonstrate her setup 9/14/2017 7

Dealing with a repository We now have some private repos: HW2-Repo, Licensed-Data-Sets If you get an error while cloning them, you might need to alter your clone URL. This is the usual one (copied through GitHub's green button): git clone https://github.com/name/repo.git If that fails, try this one instead: git clone https://username@github.com/name/repo.git Forces fresh credential check Changed1: you should put your actual user name string in the command line Changed2: I originally had password in the line as well, but apparently that is ill-advised (will be recorded in git history). Leaving it out makes git prompt for your password, which is better. 9/14/2017 8

pandas practice, continued Activity 3 50 mins 50 Years of Pop Music http://kaylinwalker.com/50-years-of-pop-music/ Download CSV file 'billboard_lyrics_1964-2015.csv' In Class-Practice-Repo, activity3 folder: Move or copy the CSV file into the directory. You will find pop_music_lyrics.ipynb Rename it pop_music_lyrics_yourname.ipynb and work on it. 9/13/2017 9

Homework 2: Process ETS Corpus http://www.pitt.edu/~naraehan/ling1340/hw2.html Corpus distributed via private GitHub repo "Licensed-Data- Sets" https://github.com/data-science-for-linguists/licensed-data-sets No need to fork: clone directly. (Why?) If you have trouble cloning, see slide #8. 9/13/2017 10

Wrapping up To-do 5: due Tuesday. Ultimate pandas notebook. Also: visualization. HW2: Process ETS Corpus Due next Tuesday THURSDAY Don't wait -- get started this weekend! This HW is in period! Project ideas: you should start thinking. Learn: pandas matplotlib, visualization 9/13/2017 11