Identifying Updated Metadata and Images from a Content Provider

Similar documents
Tab-Delimited File and Compound Objects - Documents, Postcards, and Cubes. (Not Monographs)

Microsoft Access 2010

Business Process Procedures

Microsoft Access 2013

Microsoft Access 2013

Create Mailing Labels using SUPER and Mail Merge (Word 2010)

Excel Tips for Compensation Practitioners Month 1

URGENT: MEDICAL DEVICE CORRECTION

University of North Dakota PeopleSoft Finance Tip Sheets. Utilizing the Query Download Feature

More Skills 11 Export Queries to Other File Formats

Relativity. User s Guide. Contents are the exclusive property of Municipal Software, Inc. Copyright All Rights Reserved.

Group Administrator. ebills csv file formatting by class level. User Guide

CEU Online System, The Friday Center for Continuing Education, UNC-Chapel Hill How to Obtain Participant IDs for Awarding of CEUs

Microsoft Excel Microsoft Excel

Successmaker Student and Teacher Imports

How to Import Part Numbers to Proman

IMPORTING A STUDENT LIST FROM SYNERGY INTO A GOOGLE CONTACT LIST

GAZIANTEP UNIVERSITY INFORMATICS SECTION SEMETER

Excel Format cells Number Percentage (.20 not 20) Special (Zip, Phone) Font

Sort, Filter, Pivot Table

3/31/2016. Spreadsheets. Spreadsheets. Spreadsheets and Data Management. Unit 3. Can be used to automatically

Open. Select the database and click. Print. Set printing options using the dropdown menus, then click the

How to Archive s in Outlook 2007

CSV Roll Documentation

ADVANCED INQUIRIES IN ALBEDO: PART 2 EXCEL DATA PROCESSING INSTRUCTIONS

Become strong in Excel (2.0) - 5 Tips To Rock A Spreadsheet!

Objective: Class Activities

Microsoft Office Excel 2003

Topic 4D: Import and Export Contacts

SPREADSHEET (Excel 2007)

How to Transfer Your Contact Information Into Microsoft Outlook 2010

Patricia Andrada Quick Guide Excel 2010 Data Management-July 2011 Page 1

WEEK NO. 12 MICROSOFT EXCEL 2007

IMPORTING A STUDENT LIST FROM SYNERGY INTO A GOOGLE CONTACT LIST

Advanced Excel for EMIS Coordinators

Reporting Excel Tutorial

Skills Funding Agency

Office 2016 Excel Basics 25 Video/Class Project #37 Excel Basics 25: Power Query (Get & Transform Data) to Convert Bad Data into Proper Data Set

Microsoft Access 2013

Making Excel Work for Your Tribal Community

Excel Level Three. You can also go the Format, Column, Width menu to enter the new width of the column.

Using Excel to Troubleshoot EMIS Data

6. In the last Import Wizard dialog box, click Finish. Saving Excel Data in CSV File Format

1. AUTO CORRECT. To auto correct a text in MS Word the text manipulation includes following step.

Contents Part I: Background Information About This Handbook... 2 Excel Terminology Part II: Advanced Excel Tasks...

Interfacing with MS Office Conference 2017

Importing Local Contacts from Thunderbird

Quick Guide for Excel 2015 Data Management November 2015 Training:

2. create the workbook file

Data. Selecting Data. Sorting Data

Excel Formulas 2018 Cindy Kredo Page 1 of 23

Technical White Paper

Open Excel by following the directions listed below: Click on Start, select Programs, and the click on Microsoft Excel.

Basic tasks in Excel 2013

GiftWorks Import Guide Page 2

Microsoft Power Tools for Data Analysis #04: Power Query: Import Multiple Excel Files & Combine (Append) into Proper Data Set.

Objective 1: Familiarize yourself with basic database terms and definitions. Objective 2: Familiarize yourself with the Access environment.

StatTrak Address Manager Business Edition User Manual

UNIT ONE: The Worksheet. Workbook Window Excel Worksheet Fill handle Automatic fill Column widths Opening a file Saving a file

EXCEL BASICS: MICROSOFT OFFICE 2010

Excel Shortcuts Increasing YOUR Productivity

Mail Merge Quick Reference Guide

Advanced Excel. Click Computer if required, then click Browse.

Prepared By: Graeme Hilson. U3A Nunawading

Excel Tools Features... 1 Comments... 2 List Comments Formatting... 3 Center Across... 3 Hide Blank Rows... 3 Lists... 3 Sheet Links...

HOW TO USE THE EXPORT FEATURE IN LCL

Advanced Excel Charts : Tables : Pivots

Manual Physical Inventory Upload Created on 3/17/2017 7:37:00 AM

Code Plug Management: Contact List Import/Export. Version 1.0, Dec 16, 2015

Importing and Exporting Data

EXCEL BASICS: MICROSOFT OFFICE 2007

Excel Level 1

INTRODUCTION ACCESS 2010

Microsoft How to Series

Importing from Blackboard Learn Grade Center Data to Banner 9 User Learning Scenarios

Excel. Dashboard Creation. Microsoft # KIRSCHNER ROAD KELOWNA, BC V1Y4N TOLL FREE:

Welcome to Cole On-line Help system!

7 CREATING QUERY WITH QUERY WIZARD AND QUERY DESIGNER

OPENING A LEADS.TXT FILE IN EXCEL 2010

Session 10 MS Word. Mail Merge

How to Excel - Part 2

One time Setup Paragon4 MLS (ex. NNVR MLS)

Calendar Guide: Exchange (Outlook) -> Google. How to manually transfer your Exchange (Outlook) calendar over to Google Calendar

Excel Tools for Internal Auditing

ClickFORMS Quick Start Guide

Streamlined Reporting with

Basic Excel. Helen Mills OME-RESA

Creating a Directory with a Mail Merge from an Excel Document

10 Ways To Efficiently Analyze Your Accounting Data in Excel

Microsoft Excel XP. Intermediate

SUM - This says to add together cells F28 through F35. Notice that it will show your result is

Introducing Microsoft Office Specialist Excel Module 1. Adobe Captivate Wednesday, May 11, 2016

EXCEL 2010 TIPS & TRICKS

Lesson 4: Auditing and Additional Formulas. Return to the FastCourse Excel 2007 Level 3 book page

How to Import a Text File into Gorilla 4

Pivot Tables, Lookup Tables and Scenarios

Microsoft Excel 2010

Advanced Excel Selecting and Navigating Cells

Intermediate Excel 2016

Guide to Importing Data

Transcription:

University of Iowa Libraries Staff Publications 4-8-2010 Identifying Updated Metadata and Images from a Content Provider Wendy Robertson University of Iowa 2010 Wendy C Robertson Comments Includes presenter's notes Hosted by Iowa Research Online. For more information please contact: lib-ir@uiowa.edu.

Identifying Updated Metadata and Images from a Content Provider Wendy Robertson The University of Iowa Libraries http://ir.uiowa.edu/lib_pubs/48/ 1

Overview Using MS Access to identify new items Using MS Excel to identify fields with changes Using MS Access to identify changes to images Using MS Excel to clean up data according to controlled terms Questions/Discussion 2

http://digital.lib.uiowa.edu/uima/ 3

UIMA Data from 10/02/09 4

Creating an Access Database 1. Open MS Access. 2. Click blank database. 3. Give your database a name. 4. Click Create. 5

Import Data from Provider 1. Go to External Data tab. 2. In the Import section, choose Excel. 3. Browse to the data file and click open. 6

Import Data from Provider 4. Leave Import the source data into a new table in the current database selected. 5. Click OK. 7

Import Data from Provider 6. Select correct worksheet and click Next. 8

Import Data from Provider 7. Select First row contains column headings and click Next. 9

Import Data from Provider 8. You can change settings for fields or skip fields. Generally you will be best off using data type text. 10

Import Data from Provider 9. Uncheck Let Access add primary key. 11

Import Data from Provider 10. Name the table clearly. 11. Click Finish. 12. Click Close. 13. If you get any import errors, take a look at them. They probably are due to the wrong data type in step 8. 12

Export Current Metadata from CDM You all probably know how to do this from the web administration for the collection. Export as tab delimited, returning field names in first record. 13

Import CDM Export.txt to Access Follow the same basic steps as for Excel except on step 2 select text file. 14

Import CDM Export.txt to Access Leave delimited selected, click next, leave tab selected. v 15

16

Import CDM Export.txt to Access To preserve special characters, on the last import screen, click Advanced. 17

Import CDM Export.txt to Access Change Code page to Unicode. 18

Create a Query to Find New Items 1. On the Create tab, click Query Design. 19

Create a Query to Find New Items 2. Choose your two tables and click Add and then Close. 20

Create a Query to Find New Items 3. Move the scroll bars for each table to show the unique number for the record. You need to have some sort of unique value to match the two tables. This should be your CDM identifier. It may be called something different in the file from your content provider. 21

Create a Query to Find New Items 4. Highlight one of the unique numbers (identifier) and then drag to the identifier in the other window this will join the tables. 22

Two tables linked. 23

Create a Query to Find New Items 5. Click the line between the two tables, making it bold. 6. Double click the line to open Join properties. 7. Select the option that gives you ALL the records from the new data and only those from the export where the values are equal. 8. Click OK. 24

Selecting the right option is key to finding the new records. 25

Notice the line now has an arrow at one end. 26

Create a Query to Find New Items 9. Drag the * at the top of the box with the new data to the section at the bottom. The * represents all fields. This means the result of the query will include all your fields. 10. Click the word identifier in the CDM export data and drag it to the lower part of the window. This will add the identifier field to the query. 27

28

Create a Query to Find New Items 11. In the criteria line under the identifier for the existing CDM data, type is null. This will make a query that looks for everything in the new data that does NOT match an identifier in the CDM export, i.e. newly added items. 29

Create a Query to Find New Items 12. Select the make table option and give the new table a name. 13.Click the red exclamation point labeled run 30

Create a Query to Find New Items 14. Access will warn you that you are about to paste records into a new table. Click yes. 15. Highlight the table name and right click. 16. Select Export and then Excel and save the spreadsheet where you want. 17. Click Close. 31

32

Spreadsheet of new items. 33

Create a Query to Find Updates Follow the steps above, except: 1. On step 7, in join properties select Only include rows where the joined fields from both tables are equal. This creates a line with no arrows. 34

Create a Query to Find Updates 2. You also want to include all the fields from both tables (i.e. drag the * for both tables). 3. You don t need criteria because the join itself is limiting the results to records in both tables. 35

Create a Query to Find Updates 36

Query results viewed in Access. Note all fields are in results. 37

Create a Query to Find Updates Optionally: Drag each field you want to compare from both tables into the query to order them how you want. 38

Identifying Changed Fields 1. Open the updated spreadsheet in Excel. 2. Add a column next to the first pair of values. 3. Label the column something clear like title differences. 39

Identifying Changed Fields 4. Type the following logical function into the cell: =if(exact(b2,c2), "","difference") exact(b2,c2) is a case-sensitive test to see if the values match the part between the commas is the value that will result from a true test (nothing) the part after the 2 nd comma is the value that will result from a false test 40

Identifying Changed Fields 5. Fill the value down the spreadsheet. 41

Identifying Changed Fields 6. Repeat with additional pairs of values. 7. If you make routine changes to the content provider s data, you will need to go through those changes again before comparison. We split dimensions into separate fields so we would need to split the new data in the same way to compare it with our CDM data. 42

Identifying Changed Fields We invert the author s name in CDM. This formula un-inverts the name which needs to be done for the comparison to work: =IF(ISNUMBER(SEARCH(",",E2)), (CONCATENATE(RIGHT(E2, (LEN(E2)-(SEARCH(",",E2)+1)))," ", LEFT(E2,(SEARCH(",",E2)-1)))),E2) 43

Identifying Changed Fields Fill the formula down. The E2 cell reference will automatically increase by one, keeping the formula correct for every row. 44

Identifying Changed Fields 8. You can count how many corrections you need to make with the countif function. =COUNTIF(D2:D1524,"difference") 45

Identifying Changed Fields 9. Using a filter, you can show only the fields with a difference. Highlight a column or a continuous range of columns (or the whole sheet) and select data and filter. 46

Identifying Changed Fields 10. To get a simple list of controlled terms that have changed, filter to display only the value difference, copy the before and after columns, and paste on a new worksheet. 47

Identifying Changed Fields 12.With the copied columns highlighted, go to the data tab and select remove duplicates. 48

Identifying Updated Images 1. Navigate to the folder with new images in Windows explorer, leaving the folder names in the right window. 49

Identifying Updated Images 2. Holding down shift, right click on the folder with the images and choose Open command window here. 50

Identifying Updated Images 3. Enter this command: dir /s/n > filenames.txt dir will return a directory listing /s includes subfolders /n includes additional information like filesize > means output the results filenames.txt is the name of file that will be created 51

Identifying Updated Images 4. Open a new worksheet in Excel and, using get external data, import text using the fixed width option, making all fields text. 52

Identifying Updated Images 5. Delete all the lines that don t contain a filename. Sorting by column D moves all these lines together. Delete column D. 6. Add column headings. 7. Save this file (Excel or delimited txt) with a clear name. 8. Repeat these steps with the older images. 53

Identifying Updated Images 9. Import both name files into Access. 10. Create a make table query use the old and new image name tables. 54

Identifying Updated Images 11. Join the tables using the filenames, including all the fields from the new table, and use the query criteria is null for the old name. This will identify all the new images. 55

Identifying Updated Images 12. Make a 2 nd table with only the old items. 13. Export both tables to excel and sort by filename. Compare the files manually to identify updated images with different names. 56

Identifying Updated Images 14. To identify changed images with the same name, make a table query joining on the name, size, date and time. This will identify images that have had no change. 57

Identifying Updated Images 15. Make a new query with the old images, the new images, and the exact matches, joining them on the file name. 58

Identifying Updated Images 16. Export this table to find all the images you need to update in CDM. 59

Changing Provider Vocabulary 1. Create an Excel file mapping the content provider s terms to your terms. Put the incorrect terms in column A and the correct terms in B. 2. Make sure each incorrect term appears only once in the mapping. a) Highlight column A. 60

Changing Provider Vocabulary b) On the Home tab, click Conditional Formatting, Highlight cells rule, Duplicate Value. Click OK. 61

Changing Provider Vocabulary c) Select the entire worksheet and sort column A by cell color, with pink on top. d) This will put all duplicate values at the top for you to fix. 62

Changing Provider Vocabulary 3. Open the file with the new data from the content provider. 4. Save a backup (in case something goes wrong). 5. Sort the table by the column you will be editing. 63

Changing Provider Vocabulary 6. Filter the column to remove blank values. 64

Changing Provider Vocabulary 7. Copy all the terms into the third column of the mapping spreadsheet. 8. Paste this formula into the 4 th column: =IFERROR((INDEX(B$2:B$100, MATCH(C2,A$2:A$100,0))),"no mapping") 9. Change the 100 to the last line of your pairs of mapped terms. 65

Changing Provider Vocabulary The last line of pairs is on row 558 so formula changed accordingly. 66

Changing Provider Vocabulary 10. Fill the formula down. The $ keeps the range of numbers constant while the cell value automatically increases. 67

Changing Provider Vocabulary 11. Filter the column with new terms to find the value no mapping. 68

Changing Provider Vocabulary 12. Note the terms needing to be mapped. 13. Remove the filter. 14. Add the new terms. 69

Changing Provider Vocabulary 15. Add the mapped values. 16. In Cell D2, correct the formula to include the new lines and fill down. 70

Changing Provider Vocabulary 17. Copy the values and paste into the original data file. 18. Repeat with other fields as needed. 71

Questions? Other ways to identify and work with updates? wendy-robertson@uiowa.edu http://ir.uiowa.edu/lib_pubs/48/ 72

This presentation by Wendy Robertson is licensed under a Creative Commons Attribution 3.0 United States License 73