Tutorial 1: Using Excel to find unique values in a list

Similar documents
Advanced Formulas and Functions in Microsoft Excel

Microsoft Excel Lookup Functions - Reference Guide

Excel VLOOKUP. An EMIS Coordinator s Friend

Excel 2. Module 2 Formulas & Functions

1.a) Go to it should be accessible in all browsers

Become strong in Excel (2.0) - 5 Tips To Rock A Spreadsheet!

Instructions on Adding Zeros to the Comtrade Data

Safari ODBC on Microsoft 2010

MODULE VI: MORE FUNCTIONS

exam. Number: Passing Score: 800 Time Limit: 120 min MICROSOFT Excel 2013 Expert Part One.

Ahmad Al-Rjoub Excel Tutorial 7. Using Advanced Functions, Conditional Formatting, and Filtering

VLOOKUP Function Purpose (Mac Guide)

Microsoft Excel 2007

Separate Text Across Cells The Convert Text to Columns Wizard can help you to divide the text into columns separated with specific symbols.

How to use the Vlookup function in Excel

IF & VLOOKUP Function

Pivot Table Project. Objectives. By the end of this lesson, you will be able to:

Tutorial 2. Building a Database and Defining Table Relationships

Intermediate Excel Training Course Content

Excel & Business Math Video/Class Project #33 VLOOKUP Function for Incentive Pay: Commissions and Piecework

Open you WordPad/NotePad File in Excel. How to Move Text to Columns (You can see all data in Column A)

Excel as a Tool to Troubleshoot SIS Data for EMIS Reporting

Getting Started with Excel

Printing Monthly Eligible List for Providers

Programs for American Fidelity WorxTime

Excel: Tips and Tricks Speaker: Marlene Groh, CCE, ICCE Date: June 13, 2018 Time: 2:00 to 3:00 & 3:30 to 4:30 Session Number: & 27097

Formulas, LookUp Tables and PivotTables Prepared for Aero Controlex

FSFOA EXCEL INSTRUCTIONS. Tips and Shortcuts

UPDATING E-AUTOMATE VENDOR ITEM NUMBERS WITH NEW SN REFERENCE NUMBERS

NUMERICAL COMPUTING For Finance Using Excel. Interpolation

My Top 5 Formulas OutofhoursAdmin

CMPF124 Microsoft Excel Tutorial

Advanced Excel. IMFOA Conference. April 11, :15 pm 4:15 pm. Presented By: Chad Jarvi, CPA President, Civic Systems

Introduction to Microsoft Excel

Microsoft Excel Prepare Test Session File

Excel Level 3 - Advanced

2. create the workbook file

Microsoft Office Excel 2007: Advanced Unit 02 - Lookups and Data Tables

Excel Tips for Compensation Practitioners Weeks 9-12 Working with Lookup Formulae

University of North Dakota PeopleSoft Finance Tip Sheets. Utilizing the Query Download Feature

Comparing and linking tables of data using VLOOKUP

Key concepts through Excel Basic videos 01 to 25

Troubleshooting in Microsoft Excel 2002

Excel. Spreadsheet functions

Tutorial 1: Exploring the UCSC Genome Browser

Excel Shortcuts Increasing YOUR Productivity

MS EXCEL: TABLES, FORMATS, FUNCTIONS AND MACROS

Candy is Dandy Project (Project #12)

Patricia Andrada Quick Guide Excel 2010 Data Management-July 2011 Page 1

DESCRIPTION 1 TO DEFINE A NAME 2. USING RANGE NAMES 2 Functions 4 THE IF FUNCTION 4 THE VLOOKUP FUNCTION 5 THE HLOOKUP FUNCTION 6

Excel & Business Math Video/Class Project #39 Create Invoices in Excel with Data Validation Drop-down, VLOOKUP & IF Functions

How Commercial Off-the-Shelf (COTS) Business Intelligence (BI) Tools Can Improve Financial Management Analysis

2. In Video #6, we used Power Query to append multiple Text Files into a single Proper Data Set:

Streamlined Reporting with

NHS e-referral Service

Quick Guide for Excel 2015 Data Management November 2015 Training:

Vlookup and Sumif Formulas to assist summarizing queried data

More Skills 12 Create Web Queries and Clear Hyperlinks

IITS Workshop Creating a Gradebook in Microsoft Office Excel 2007

Introduction to Information Technology

Excel Formulas 2018 Cindy Kredo Page 1 of 23

Some useful shortcut keys applicable for both Excel and Word (16 to 19 is only for Excel): Sr.No. Shortcut Keys Description

Business Process Procedures

What is a VLOOKUP? Source

Excel for Data Visualization

Using Excel for a Gradebook: Advanced Gradebook Formulas

Tutorial 7. Working With Excel s Editing and Web Tools. Review

ADVANCED EXCEL: LOOKUP FUNCTIONS

Excel: Linking sheets and summary sheets (Mac OS)

MICROSOFT OFFICE APPLICATIONS

Microsoft Office Excel Create a worksheet group. A worksheet group. Tutorial 6 Working With Multiple Worksheets and Workbooks

EXCEL WORKSHOP II ADVANCED FORMULAS AND FUNCTIONS IN EXCEL

Technology Applications for the Financial Aid Office. Wes Brothers Director of Financial Aid Ohio Christian University

MS Excel How To Use VLOOKUP In Microsoft Excel

3 Excel Tips for Marketing Efficiency

INSERT SUBTOTALS Database Exercise Sort the Data Department Department Data Tab Sort and Filter Group

Microsoft MOS-EXP. Microsoft Excel 2002 Core.

What is a spreadsheet?

Advanced formula construction

Excel Expert Microsoft Excel 2010

PHLI Instruction (734) Introduction. Lists.

Service Line Export and Pivot Table Report (Windows Excel 2010)

Excel Lesson 1 Microsoft Excel Basics

2. Take a few minutes to look around the site. The goal is to familiarize yourself with a few key components of the NCBI.

MICROSOFT EXCEL TUTORIAL HANDOUT

EXCELLING WITH ANALYSIS AND VISUALIZATION

6. In the last Import Wizard dialog box, click Finish. Saving Excel Data in CSV File Format

VLOOKUP() takes three mandatory parameters and one default/optional parameter:

Data Service Center May, Compiled by: Katey Semmel Donna Frieze

Formulas and Functions

Using Advanced Formulas and 9 Securing Workbooks

Chart Wizard: Step 1 (Chart Types)

Excel Flash Fill. Excel Flash Fill Example

Spreadsheet Functions

Creating and Using Genome Assemblies Tutorial

A Brief Word About Your Exam

CEU Online System, The Friday Center for Continuing Education, UNC-Chapel Hill How to Obtain Participant IDs for Awarding of CEUs

1. Right-click the worksheet tab you want to rename. The worksheet menu appears. 2. Select Rename.

WORKING WITH LOOKUP TABLES

Using Advanced Formulas

Transcription:

Tutorial 1: Using Excel to find unique values in a list It is not uncommon to have a list of data that contains redundant values. Genes with multiple transcript isoforms is one example. If you are only interested in the genes and not the different transcripts, then you will probably want to filter the list to remove the redundant values. I did a search of the UCSC human genome browser with the query colon cancer and got back >500 matches. I created a text file listing the first 500 matches. You can download this data from the Exercise 1 home page by clicking on the link ListofGenesfromUCSC.txt. The file has 2 columns: Gene Name and Chromosome Location. You will filter on Gene Name. Once you ve downloaded the text file, do the following: Open Excel and from within Excel open the text document. If the file you want to open is greyed out, change the drop down menu to Enable: All Readable Documents. Double-click the file you want to open and this should bring up the Text Import Wizard It should recognize it as delimited. Click the Next button to define the delimiters. By default, Excel assumes a.txt file is tab-delimited Click Next and then Finish to finish the import. Advanced filter: Select the column of gene names Click on the Data menu and select Advanced filter (if you get a warning about being unable to determine which row contains column labels and you have a column header in row 1, just click OK). Check the radio button Copy to another location This should move our mouse to the Copy to text box. Select a column (not Columns A-C) Check the box Unique records only Click the OK button. This should produce a list of 208 genes from the original 500 genes. BCHM 6280 2017 Excel Tutorial Page 1 of 5

Tutorial 2: Using Excel to manage text data An issue common to gene names or gene identifiers is slight variations that can prevent their identification via a database lookup. An example is that as gene or transcript records are reviewed by curators, they are often given an appended number such as NM_0012345.1 or NM_0012345.3 indicating which version they are. The base identifier of NM_0012345 is the same between them but if your list has the appended version number, the database lookup or Excel lookup won t recognize the two as being the same record. In this example, there are two Excel files available from the Exercise 2 homepage: ExpressionData.xlsx and GeneInfo.xlsx The ExpressionData file has two columns. The first has Ensembl GeneIDs with the version number. The second column contains gene expression information in the form of Log2 ratio of treatment/control. The GeneInfo file has four columns. The first has Ensemble GeneIDs, but as the stable identifier rather than as a version. The remaining columns have the gene symbol, NCBI Gene ID and gene description. You want to be able to bring in information from the GeneInfo file into the ExpressionData file but at the moment, they do not share the same identifiers. To correct this, you will use a text-related function called LEFT to change the GeneIDs in the ExpressionData file to match those in the GeneInfo file. 1. Insert a column to the left of the GeneID column in the ExpressionData file. 2. In cell A2, type = and select the LEFT function 3. Select cell B2 for the text box in the FormulaBuilder dialog box 4. Tab to the num_chars box and type in 15 5. This should return the ENSG## up to the. as it was originially 6. Select the newly generated ID in A2, then copy down to the end of the column. Type Ctrl-D to copy the function down the rest of the column. 7. Then Edit->copy the newly generated IDs and use Edit->Paste->Special->Values to replace the formula with values. 8. Now you can use the two files in the next section to bring the data from GeneInfo into the ExpressionData file Tutorial 3: Using Excel to compare lists of data. A very common problem in bioinformatics or information processing of any kind is having multiple lists of data that you want to compare to each other. In Excel is a function called VLOOKUP that makes this easy to do. It is also useful for transferring data from 1 worksheet to another. For this part of the tutorial, you will use the GeneInfo and your modified ExpressionData file from the previous section. You can delete the column from the ExpressionData file that had the GeneIDs with version number in them. In this part of the tutorial, you will bring in the Gene Name and NCBI Gene ID into the ExpressionData file. BCHM 6280 2017 Excel Tutorial Page 2 of 5

Last updated: May, 2017 Open both worksheets in Excel. o In the ExpressionData file, insert a column between columns 1 and 2. o In the second row of column 2 (cell B2), type and = sign. Then go to the drop down menu in the upper left of the worksheet, find the function VLOOKUP and select it. If you do not see VLOOKUP on the main menu, scroll down to more functions which opens a dialog box with all of the available Excel functions. Under lookup and reference you will find VLOOKUP. Figure 1: Inserting a VLOOKUP function into column 2 of ExpressionData worksheet. o Once you ve inserted the function, you must fill out the arguments for the function using the dialog box that opens up. Select cell A2 as the lookup value. o Then click into the box Table_array. Go up to the window menu and select GeneInfor_ExcelTutorial.xlsx as shown in Figure 2. Figure 2: Selecting second worksheet for as table_array in the VLOOKUP function. o This will activate GeneInfo.xlsx. BCHM 6280 2017 Excel Tutorial Page 3 of 5

Last updated: May, 2017 o Select the first 2 columns of GeneInfo.xlsx. o Tab or click on the box Col_index_num. This tells the argument which column of data to bring over to the first worksheet. Type in a 2. o In the final box, Range_lookup, type false. If A2 in the ExpressionData worksheet matches A2 in GeneInfo worksheet, then the value from column 2 of GeneInfo will be entered into cell B2 of ExpressionData. If the 2 cells do not match, it will fill in N/A. o To fill in the rest of the column, select from cell B2 through then end of the data and under the Edit menu, select Fill Down or use the keyboard shortcut of Ctl+D. Figure 3: Filling in the rest of the column with the same function. Figure 5: Filling in the rest of the column with the same function. When you are done, your ExpressionData worksheet should look like that shown in Figure 4: Figure 4: GeneExpression worksheet after completing VLOOKUP BCHM 6280 2017 Excel Tutorial Page 4 of 5

At this point, the data in column 2 is still linked to the GeneInfo worksheet. You can see this if you click on one of the gene names and look at what is displayed in the text box at the top of the sheet. You do not want to leave your file like that, otherwise every time you open it will go through the data lookup function again. To avoid this, select the entire column, copy it and then do a Edit->Paste Special and select values in the Paste special dialog box. This will replace the function with the value of the function. After you complete that, click on a gene name. You should see just the gene name displayed in the text box at the top. Figure 5: GeneExpression worksheet after copying and paste special with values To bring in the NCBI gene ID, just insert another column in the ExpressionData worksheet and repeat the VLOOKUP process bringing in column 3 data from GeneInfo rather than column 2. BCHM 6280 2017 Excel Tutorial Page 5 of 5