Data Science Centre (DSC) Data Preparation Policy: Guidelines for managing and preparing your data for statistical analysis
|
|
- Jessie Norton
- 5 years ago
- Views:
Transcription
1 Data Science Centre (DSC) Data Preparation Policy: Guidelines for managing and preparing your data for statistical analysis Dr Fiona Boland, RCSI Data Science Centre & HRB Centre for Primary Care Research Version: 1.0 Date: October 2017 Table of Contents 1. Sharing your data with the DSC Introduction Choosing a file format Organising and entering data Codebook / Data dictionary Backups Checking your data data cleaning Data protection Useful resources Appendix A Saving your Excel spreadsheet as a CSV file ~ 1 ~
2 1. Sharing your data with the DSC Providing the necessary documents and ensuring data is in an organised manner will reduce the likelihood of errors and facilitate the most efficient and timely analysis. The following files should be provided to the DSC: 1. A brief description of the study design and research questions. 2. A structured and organised raw data file in an appropriate file format ensure it is structured as outlined in section 4 below. 3. A code book describing each variable and possible values see section 5 below. Please note that the DSC will return any data file(s) not adhering to the standards set out in this policy for further formatting and/or cleaning. 2. Introduction An important aspect of any project is recording, organising and preparing your data for statistical analysis. When carefully planned and implemented from the beginning of a project it can save you a lot of time and hassle. Furthermore, often data may need to be shared with a colleague or the Data Science Centre (DSC). If the DSC is receiving the data, they should be able to work with it in the format in which they receive it. This will speed up the amount of time required for analysis and obtaining results. The aim of these guidelines is to provide researchers with information and advice on how to manage and prepare research data for statistical analysis and / or sharing. 3. Choosing a file format In planning a research project, it is important that you consider which file formats you will use to store and prepare your data for statistical analysis. You should consider what formats will be easiest to share with the DSC and /or colleagues for future projects. You can enter your data directly into a statistical package or you can use a spreadsheet (such as Excel). Entering data directly into a statistical package is preferable, you can label the variables within your data file (variable labels) and label the values for your variables (value labels). ~ 2 ~
3 Variable Labels: Variable labels are composed of a few words that describe what a variable represents. If the variable labels are properly formatted, they will show in output tables and graphs, instead of variable names. Value Labels: Value labels are labels for coded variables in a data set. For example, "Gender" may be coded 0 (Males) and 1 (Females). Excel Please save your Excel spreadsheet as a comma separated values' or CSV file. See Appendix A for saving your Excel spreadsheet as a CSV file. SPSS and Stata You can also use SPSS or Stata to enter your data. This allows you to specify missing values, attach labels to numeric values and several other useful features to speed up data entry. You can send the DSC the SPSS file (.sav) or the Stata file (.dta). R You can send the DSC the R data file (.Rdata). Other statistical packages Data can be read from just about any statistical package, but please check with the DSC beforehand. 4. Organising and entering data To ensure that your data can be read by a statistical package, and analysed promptly, you need to follow a couple of simple rules: 1. Just one spreadsheet: Keep all your data in one spreadsheet; do not use multiple spreadsheets. Do not use different spreadsheets for different groups of participants or different time points. For example, if you are collecting data at three separate time points, do not use a worksheet for each time point. ~ 3 ~
4 Bad Example: Here there is an excel file with a spreadsheet for each time point. Good Example: There is only one spreadsheet where the variables Weight_1, Weight_2 and Weight_3 correspond to weight at time 1, 2 and 3 respectively. ~ 4 ~
5 2. Be consistent: Enter your data in a consistent manner (variable names, labels, spelling and codes for variables, dates, etc.). Please note that capitalisation is acceptable as long as it is used consistently throughout the data file. If in doubt please use lower case only. Examples: a. If you are collecting data at various time points, for example weight over ten weeks, and you begin labelling using weight_wk1 for week 1 and weight_wk2 for week 2, continue in this manner. Do not change to week3_weight or wk10 weight. b. Gender: If you pick Male and Female, do not occasionally use M or F. c. Dates: Pick a format and stick to it throughout the spreadsheet. The main thing is to be consistent but if in doubt use dd/mm/yyyy, where dd is a 2 digit day, mm is a 2 digit month and yyyy is a 4 digit year. d. Subject identifiers: If you begin by using PT001 for patient 1, use this format throughout. Bad Example: Good Example: Inconsistent date formats Inconsistent ID numbers Inconsistent coding for Sex Consistent formats for all three variables; ID, DOB and Sex. ID DOB Sex PT001 31/12/1966 f PT Jun-82 m 3 17/03/1977 Female PT004 10/11/1972 F 5 27/04/1960 male 6 23/05/1973 female 7 13/06/1974 M PT008 05/07/1962 M 9 20-Sep-84 Male PT010 08/10/1989 female ID DOB Sex PT001 31/12/1966 female PT002 01/06/1982 male PT003 17/03/1977 female PT004 10/11/1972 female PT005 27/04/1960 male PT006 23/05/1973 female PT007 13/06/1974 male PT008 05/07/1962 male PT009 20/09/1984 male PT010 08/10/1989 female ~ 5 ~
6 3. Row 1 = variable names: The first row of the spreadsheet should always contain the names of the variables. Make sure that each variable has a unique and informative name. Try not to use a name that is very long. Keep the names of the variables in the first row only - do not use the second row as some sort of continuation. The second row of the spreadsheet should contain data on the first case. As a general rule, variable names should not begin with a number, contain spaces or contain arithmetic symbols like +, >, %,? or!. Use underscores in place of spaces or /. For example, if you have several time points you could use walk24, walk48, walk72 or weight_1, weight_7 etc. Be careful not to leave spaces at the beginning or end of variable names, for example walk24 is different to walk24 Bad Example: 1. Variable names and information about the variables spans the first 3 rows. 2. Date of birth and Usual Activities contain spaces and Self/care contains a /. Section 1 EQ5D EQ5D EQ5D ID Date of Birth Sex Mobility Self/care Usual Activities /12/1966 female /06/1982 male /03/1977 female /11/1972 female /04/1960 male /05/1973 female /06/1974 male /07/1962 male /09/1984 male /10/1989 female ~ 6 ~
7 Good Example: ID DOB Sex EQ5D_Mobility EQ5D_Selfcare EQ5D_UsualActivities /12/1966 female /06/1982 male /03/1977 female /11/1972 female /04/1960 male /05/1973 female /06/1974 male /07/1962 male /09/1984 male /10/1989 female One row = one case: Each row of the spreadsheet should contain all the data from one case (usually a person or animal). Have a unique ID number for each case, to cross-link paper and computer records and to check for duplicates or omissions. 5. One column = one variable: A variable contains one item of information only. a. Do not put things like '120/80' into a spreadsheet. 120/80 is two things, systolic and diastolic pressure. They must be recorded in separate columns. Bad Example: ID Sex Age Systolic/Diastolic / / /79 Good Example: ID Sex Age Systolic Diastolic ~ 7 ~
8 b. If you have an item/question where multiple responses can be ticked (e.g. survey), put each response into a separate column, coded yes/no or 1/0 (as opposed to one column listing the options people selected). Generally, 0 (zero) is used for absent and 1 for present. Bad Example: In a survey Q1 has 5 possible options that can be selected and the question states to select all that apply. PT001 selected options 1 and 2 for Q1 and PT002 selected options 2 and 4. Recording the data in the following manner makes it impossible to analyse the data. ID Sex Age Q1 PT ,2 PT ,4 PT ,3,4 Good Example: In the following table a column is assigned to each option in Q1 (i.e. Q1_1 for option 1, Q1_2 for option 2 and so on.) If PT001 selected options 1 and 2, then Q1_1 and Q1_2 are denoted with a 1. PT001 did not select options 3 or 4; hence, Q1_3 and Q1_4 are denoted with a 0. Similarly for PT002 and PT003. ID Sex Age Q1_1 Q1_2 Q1_3 Q1_4 PT PT PT ~ 8 ~
9 6. Cases and controls, or different groups of subjects: If you have, for example, cases and controls, put them all in the same spreadsheet. Use a column to record which are cases and which are controls. Do not put them in different places (e.g. different spreadsheets). Similarly if you have different groups of subjects, the same thing applies. Use one column to identify the group that a subject belongs to. ID Sex Age Group BMI Patient_type Walking_aid wheel rollator Walking stick Walking stick 7. ID numbers: Each case in the spreadsheet should have a unique identifier, so that errors in the spreadsheet can be traced and corrected. 8. Pair numbers (matched data only): If your study has matched pairs of cases and controls, each pair should have an identifying number. This is different from the unique identifier for each case. It is also different from the group identifier. It tells the statistics package which case and control make up a matched pair. In other words, you need two ID numbers, an ID number for each participant and an ID number for the pair that the participant belongs to. ~ 9 ~
10 9. Do not place text in columns that contain numbers: If a variable contains numbers, do not put extra text into it. For instance, do not write 'm' or metres to indicate metres. If you need to record the unit of measurement try to incorporate it into the variable name. Bad Example: Distance 22m metres m Good Example: Distance_m Units: Keep the same units throughout. For example, if you are recording minutes, don't put "1 hour 10 minutes" - convert this to minutes (70). The relevant unit for each variable should be recorded in the codebook/data dictionary (see section 4). Bad Example: Time 10 mins 1 hour 2 hours 5mins Good Example: Time_mins ~ 10 ~
11 11. Missing data: If information is missing just leave the cell blank. Do not put in things like 'NA' or 'Unknown' or?. In particular, do not use a numerical value such as 999 as this can be easily mistaken as non-missing. If you need to record why values in a particular variable are missing, put this information in a separate column. In the example below, a variable Missing_Age was used to record why Age was missing. It can be helpful to develop a numeric coding scheme describing reasons for missing values (see point 13 below). ID Sex Age Missing_Age PT PT002 2 Age illegible on patient form PT PT004 2 Age was not recorded 12. Writing numbers: Do not put commas into large numbers; write 2500, and not 2,500. And make sure you don't use the letters O and l when you should have 0 and 1. If you are recording the presence or absence of something, use 0 (zero) for absent and 1 for present. 13. Text variables: Text allows plenty of scope for misspelling and inconsistency. Don't use free text (e.g.: other: please specify) if you can avoid it. Develop a numerical coding scheme and classify the answers before entering the data. Where use of text is unavoidable, beware of capital letters: Depression and depression will appear as different things in your data. ~ 11 ~
12 14. Blank Lines/Empty Cells: A spreadsheet should never contain blank lines or empty cells unless the data is missing. Example 1 Bad Example: Cells were left blank where a date was supposed to be repeated several times - this should not be done under any circumstance. Good Example: No cells left blank, all completely filled in. ID Date Weight /05/ /05/ /05/ /05/ ID Date Weight /05/ /05/ /05/ /05/ /05/ /05/ /05/ /05/ /05/ /05/ ~ 12 ~
13 Example 2 Bad example: Day 1 Day 7 Beaumont Patient Pain Walking Distance Pain Walking Distance Connolly Empty cells: Appears as if the first recording of pain and walking distance was measured on day 1 and following this pain and walking distance was measured on day 7 again. Blank lines: between sites, Beaumont and Connolly (also data for sites should not be recorded in this manner) ID numbers: Not unique Good example: Patient Site Pain_Day1 WalkDist_Day1 Pain_Day7 WalkDist_Day7 1 Beaumont Beaumont Beaumont Connolly Connolly Connolly ~ 13 ~
14 15. No colour or highlighting: Do not use colour or alternatively highlight cells to indicate unusual values, different groups of people etc. For possible unusual values it would be better to have another column to indicate this. Similarly, for various groups of people, include a separate column to distinguish the various groups. 16. No calculations or graphs: Your primary data file should only contain data. You should not start calculating, for example, means of various variables, or insert graphs to look at trends. If you want to do this, make a copy of your final data file and do the calculations there. Finally, if in doubt about how to record your data in a spreadsheet, ask the DSC. You can always put in a couple of cases and attend one of the scheduled clinics to make sure it makes sense. Details of the scheduled clinics can be found on the DSC webpage: 5. Codebook / Data dictionary A codebook or data dictionary is a separate file (e.g. word file, excel spreadsheet) that explains what all the variables are. It contains information that helps users decipher data file content and structure and is extremely useful if data is being shared. A codebook should contain: The variable name as in the data file A small description or explanation of what the variable is if not self-explanatory Coding schemes Units Anything else you think might be useful (Filename(s), File location(s), brief description of study design and research questions) ~ 14 ~
15 Example Variable Name Descriptions Variable Type Code Units/Format Notes Age Age Continuous years Age should be >65 years only. Sex Sex Categorical 1=Female; 2=Male GPvisit Hospital Group Time Number of GP visits in the last 6 months. Number of hospital visits in last 12 months Whether the patient was in the intervention or control group Time to complete observations Categorical Categorical Categorical Continuous 1 = intervention; 2 = control minutes 6. Backups Always make back-ups of your data in multiple locations. Keep all versions of your data, in case you accidently delete something or the file is corrupt, or even worse, your computer breaks or is stolen. Before updating your data file each time you enter new data, make a copy and enter the data in the new version. ~ 15 ~
16 7. Checking your data data cleaning Once your data is entered (or part of your data entered) you should conduct a few checks to ensure that the data is correct. This is known as data cleaning. Data cleaning is the process of detecting and correcting (or removing) corrupt or inaccurate records (i.e. checking data entry). Identify incomplete, incorrect, inaccurate, irrelevant data and then replace, modify, or delete this data. As mentioned above, always store a copy of the original data. If you make any changes to the data file save it as a new file. Steps involved: 1. Define and determine error types and instances 2. Correct the errors 3. Document error instances and error types in a separate file. 4. If data entry is not complete, modify data entry procedures to reduce future errors. Examples of errors to check for: Extreme values, examples include an age of 130 the number of GP visits in a week = 56 the number of children =20 Inconsistencies, for example sex = male and hysterectomy = yes. Missing values - check that missing values are in fact missing and that they have not been omitted accidently. Duplicate Records - Check that records are not entered twice. How to check for errors It is relatively easy to visually check if you have a small data file. If you have a large data file there are two simple approaches to help identify invalid records in data set: frequency tables - to list all the unique values of a variable ~ 16 ~
17 summary / descriptive statistics listing means, ranges, minimum and maximum values Additionally, if you are familiar with excel you can use basic functions (e.g. conditional formatting, IF statements) to explore data and detect errors. Conditional formatting and IF functions can be used to highlight cells that do not conform to certain criteria. Example: This experiment was conducted on 10 patients aged years of age. Two treatments were investigated. ID Age Sex Treatment 1 46 male female female female male male female female mael male 2 Potential errors / inaccurate data: An ID of 44 when there are only 10 patients. Study is supposed to be conducted on patients aged years old but patient 5 is apparently 25. The variable Sex = mael for patient 9. There are only supposed to be two treatments but patient 9 received treatment 3. Where possible, check the errors and correct them. Thoroughly document any material changes made to the raw data file (e.g. removal of outliers) in a separate file to be shared with future collaborators. Always retain a copy of the original raw data for reference. ~ 17 ~
18 8. Data protection The Data Protection Act 1998 (DPA) protects individuals personal data and sets the standards for the processing of this information. It does this by requiring those processing personal data to comply with the eight data protection principles. 1. Obtain and process information fairly 2. Keep it only for one or more specified, explicit and lawful purpose 3. Process it only in ways compatible with these purposes 4. Keep it safe and secure 5. Keep it accurate, complete and up-to-date 6. Ensure that it is adequate, relevant and not excessive 7. Retain it for no longer than is necessary for the purpose or purposes 8. Give a copy of his/her personal data to that individual, on request All staff and students who are using personal data should comply with the provisions of the Act, including the eight principles above. Please see the following for more information on data protection at RCSI: Anonymisation and Pseudonymisation Before data obtained from research with individuals can be shared with other researchers, the DSC, or archived, you may need to anonymise the data so that individuals cannot be identified. Please note that there should be no reason to supply the DSC with data which includes information such as patient names, addresses etc. Any such data sent to the DSC will be returned to the researcher. Anonymisation This is the removal of information that could lead to an individual being identified. Data can be considered anonymised when it does not allow identification of the individuals to whom it relates, and it is not possible that any individual could be identified from the data, either by any further processing of that data or by processing it together with other information which is available or likely to be available. Identifiers can be direct, like the individual s name, or indirect, like their ~ 18 ~
19 date of birth, phone number and address. Hence, removing direct identifiers does not render data anonymous. Generally, information is fully anonymised if there are at least 3-5 individuals to whom the information could refer. Pseudonymisation Pseudonymisation should be distinguished from anonymisation. Pseudonymisation is a procedure by which the most identifying fields within a data record are replaced by one or more artificial identifiers, or pseudonyms. However, while pseudonymisation has many uses, unlike anonymisation, it only provides a limited protection for the identity of individuals and in many cases allows for identification using indirect means. 9. Useful resources RCSI Data Protection and Data Storage: EU General Data Protection Regulation: Data Protection - Anonymisation and Pseudonymisation: Data organization in spreadsheets: How to share data for collaboration: ~ 19 ~
20 Appendix A Saving your Excel spreadsheet as a CSV file 1. In Excel, click on File and choose Save As. ~ 20 ~
21 2. Under Save as type, choose CSV (Comma delimited). Click Save. 3. You may see a message "The selected file type does not support workbooks that contain multiple sheets." This message is to inform you that any additional sheets (e.g. Sheet2) that you may have information on will not be kept in this format (i.e. all information on the other sheets will be lost). Ensure that you don t have information on other sheets that you need and click OK to continue. ~ 21 ~
22 4. You may also see a message that your file "may contain features that are not compatible with CSV (Comma delimited). Do you want to keep the workbook in this format?" This message is to inform you that any formatting you may have (e.g. colours, lines, bold text), and / or any formulas will not be preserved in this format. Click Yes to continue. ~ 22 ~
Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University
Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University While your data tables or spreadsheets may look good to
More informationCreating a data file and entering data
4 Creating a data file and entering data There are a number of stages in the process of setting up a data file and analysing the data. The flow chart shown on the next page outlines the main steps that
More informationNational Child Measurement Programme 2017/18. IT System User Guide part 3. Pupil Data Management
National Child Measurement Programme 2017/18 IT System User Guide part 3 Pupil Data Management Published September 2017 Version 4.0 Introduction 3 Who Should Read this Guidance? 3 How will this Guidance
More informationWORKSHOP: Using the Health Survey for England, 2014
WORKSHOP: Using the Health Survey for England, 2014 There are three sections to this workshop, each with a separate worksheet. The worksheets are designed to be accessible to those who have no prior experience
More informationSystem Administrator s Handbook
System Administrator s Handbook www.lamplightdb.co.uk Contents The role of system administrators p.4 Database operators adding, setting permissions and deleting p.5 Lockouts and factor authentication
More informationTracking Database. COL live (COL only)
II. Tracking Database The Tracking Database is used to keep track of information about each participant enrolled in the COL/AS+ Project. Native Boys and Girls Club staff who are participating in the project
More informationBiology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 2 Working with data in Excel and exporting to JMP Introduction
Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 2 Working with data in Excel and exporting to JMP Introduction In this exercise, we will learn how to reorganize and reformat a data
More informationQDS V4.0. New Features Documentation. NOVA Research Company
QDS V4.0 New Features Documentation NOVA Research Company Design Studio Features... 3 Data Element: Ranking Response Type... 3 Adding a Ranking Item... 3 Ranking Variables... 4 Automatic Variable New Type:
More informationExcel Tables and Pivot Tables
A) Why use a table in the first place a. Easy to filter and sort if you only sort or filter by one item b. Automatically fills formulas down c. Can easily add a totals row d. Easy formatting with preformatted
More informationMicrosoft Excel 2010
www.jadehorizon.com Microsoft Excel 2010 Sorting and Filtering Sorting and Filtering Microsoft Excel 2010 Table of Contents Table of Contents INTRODUCTION... 3 CONVENTIONS... 3 TABLE DESIGN RULES... 5
More informationLIFE LONG LEARNING LEVEL INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS
AMERICAN BOARD OF UROLOGY LIFE LONG LEARNING LEVEL 2 2018 INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS Please read all instructions carefully before preparing your log. It is imperative that you carefully
More informationFor many people, learning any new computer software can be an anxietyproducing
1 Getting to Know Stata 12 For many people, learning any new computer software can be an anxietyproducing task. When that computer program involves statistics, the stress level generally increases exponentially.
More informationAMERICAN BOARD OF UROLOGY 2017 INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS
AMERICAN BOARD OF UROLOGY 2017 INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS Please read all instructions carefully before preparing your log. It is imperative that you carefully review the data contained
More informationLet s get started with the module Getting Data from Existing Sources.
Welcome to Data Academy. Data Academy is a series of online training modules to help Ryan White Grantees be more proficient in collecting, storing, and sharing their data. Let s get started with the module
More informationACER Online Assessment and Reporting System (OARS) User Guide
ACER Online Assessment and Reporting System (OARS) User Guide January 2015 Contents Quick guide... 3 Overview... 4 System requirements... 4 Account access... 4 Account set up... 5 Create student groups
More informationExcel Tips for Compensation Practitioners Weeks Data Validation and Protection
Excel Tips for Compensation Practitioners Weeks 29-38 Data Validation and Protection Week 29 Data Validation and Protection One of the essential roles we need to perform as compensation practitioners is
More informationIdentifying Updated Metadata and Images from a Content Provider
University of Iowa Libraries Staff Publications 4-8-2010 Identifying Updated Metadata and Images from a Content Provider Wendy Robertson University of Iowa 2010 Wendy C Robertson Comments Includes presenter's
More informationObjective: Class Activities
Objective: A Pivot Table is way to present information in a report format. The idea is that you can click drop down lists and change the data that is being displayed. Students will learn how to group data
More informationDO NOT SEND DUPLICATE COPIES OF YOUR LOG AND DO NOT SEND A PRINTED COPY.
AMERICAN BOARD OF UROLOGY 2018 LIFE LONG LEARNING (LLL) LEVEL 2 PEDIATRIC UROLOGY SUBSPECIALTY CERTIFICATION EXAMINATION PROCESS INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS Please read all instructions
More informationThe WellComm Report Wizard Guidance and Information
The WellComm Report Wizard Guidance and Information About Testwise Testwise is the powerful online testing platform developed by GL Assessment to host its digital tests. Many of GL Assessment s tests are
More informationcomma separated values .csv extension. "save as" CSV (Comma Delimited)
What is a CSV and how do I import it? A CSV is a comma separated values file which allows data to be saved in a table structured format. CSVs look like normal spreadsheet but with a.csv extension. Traditionally
More information2017 MOC PEDIATRIC PRACTICE LOG TEMPLATE:
AMERICAN BOARD OF UROLOGY 2017 MAINTENANCE OF CERTIFICATION (MOC) Level 4 PEDIATRIC UROLOGY SUBSPECIALTY CERTIFICATION EXAMINATION PROCESS INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS Please read all
More informationUNESCO, Division for Planning and Development of Education Systems, Section for Sector Policy Advice and ICT in Education (ED/PDE/PAD)
Guidelines for On- line Data E ntry and Downloading Impact of the Global Financial and Economic Crisis on Education in Selected Developing Countries (DFID RIVAF) UNESCO, Division for Planning and Development
More informationHandling Your Data in SPSS. Columns, and Labels, and Values... Oh My! The Structure of SPSS. You should think about SPSS as having three major parts.
Handling Your Data in SPSS Columns, and Labels, and Values... Oh My! You might think that simple intuition will guide you to a useful organization of your data. If you follow that path, you might find
More informationedofe Management Toolkit
edofe Management Toolkit A guide to effective edofe management for Directly Licensed Centres 1 2 Contents Section one: Setting up the correct infrastructure on edofe... 4 Creating a group... 4 Editing
More informationAdobe Captivate Monday, February 08, 2016
Slide 1 - Slide 1 MT+ How to export and import mobilities In this demo you will see how to export the template or the list of existing mobilities, an explanation of the format of the CSV file how to use
More informationehepqual- HCV Quality of Care Performance Measure Program
NEW YORK STATE DEPARTMENT OF HEALTH AIDS INSTITUTE ehepqual- HCV Quality of Care Performance Measure Program USERS GUIDE A GUIDE FOR PRIMARY CARE AND HEPATITIS C CARE PROVIDERS * * For use with ehepqual,
More informationUK Data Archive SN National Child Measurement Programme, National Child Measurement Programme 2012/13
UK Data Archive SN 7567 - National Child Measurement Programme, 2012-2013 National Child Measurement Programme 2012/13 Upload Tool: Guidance for users September 2012 National Child Measurement Programme
More informationSoftware Reference Sheet: Inserting and Organizing Data in a Spreadsheet
Inserting and formatting text Software Reference Sheet: Inserting and Organizing Data in a Spreadsheet Column headings are very important to include in your spreadsheet so that you can remember what the
More informationBRIDGEWATER SURGERIES. Privacy Notice
BRIDGEWATER SURGERIES Privacy Notice We understand how important it is to keep your personal information safe and secure and we take this very seriously. We have taken steps to make sure your personal
More informationRunning Minitab for the first time on your PC
Running Minitab for the first time on your PC Screen Appearance When you select the MINITAB option from the MINITAB 14 program group, or click on MINITAB 14 under RAS you will see the following screen.
More information(Updated 29 Oct 2016)
(Updated 29 Oct 2016) 1 Class Maker 2016 Program Description Creating classes for the new school year is a time consuming task that teachers are asked to complete each year. Many schools offer their students
More informationImporting Local Contacts from Thunderbird
1 Importing Local Contacts from Thunderbird Step 1, Export Contacts from Thunderbird In Thunderbird, select Address Book. In the Address Book, click on Personal Address Book and then select Export from
More informationData. Selecting Data. Sorting Data
1 of 1 Data Selecting Data To select a large range of cells: Click on the first cell in the area you want to select Scroll down to the last cell and hold down the Shift key while you click on it. This
More informationGuidance for building Study and CRF in OpenClinica
Guidance for building Study and CRF in OpenClinica 1. Use of Patient Identifying information Patient Identifying Data (PID) is any data within clinical data that could potentially be used to identify subjects,
More informationMYOB Exo PC Clock. User Guide
MYOB Exo PC Clock User Guide 2018.01 Table of Contents Introduction to MYOB Exo PC Clock... 1 Installation & Setup... 2 Server-based... 2 Standalone... 3 Using Exo PC Clock... 4 Clocking Times... 5 Updating
More informationStata: A Brief Introduction Biostatistics
Stata: A Brief Introduction Biostatistics 140.621 2005-2006 1. Statistical Packages There are many statistical packages (Stata, SPSS, SAS, Splus, etc.) Statistical packages can be used for Analysis Data
More informationPart A. EpiData Entry
Part A. EpiData Entry Part A: EpiData Entry Exercise 1 A data documentation sheet for a simple questionnaire Exercise 2 The QES-REC-CHK triplet Exercise 3 Derived fields and Check file commands unrelated
More informationOpening a Data File in SPSS. Defining Variables in SPSS
Opening a Data File in SPSS To open an existing SPSS file: 1. Click File Open Data. Go to the appropriate directory and find the name of the appropriate file. SPSS defaults to opening SPSS data files with
More informationExcel 2007 Fundamentals
Excel 2007 Fundamentals Introduction The aim of this document is to introduce some basic techniques for using Excel to enter data, perform calculations and produce simple charts based on that information.
More informationCreating and Managing Surveys
Creating and Managing Surveys May 2014 Survey Software Contents 1. INTRODUCTION 2 2. HOW TO ACCESS THE SURVEY SOFTWARE 3 The User Profile 3 3. GETTING STARTED ON A NEW SURVEY 5 4. FURTHER WORK ON SURVEY
More informationEU General Data Protection Regulation A Compliance Guide
Vanderbilt Entro EU General Data Protection Regulation A Compliance Guide Contents Abstract... 2 Overview... 2 What is personal data?... 2 What constitutes data processing?... 3 Am I a data controller
More informationThe first three steps in data entry, with examples in PASW/SPSS. Steve Simon P.Mean Consulting
The first three steps in data entry, with examples in PASW/SPSS Steve Simon P.Mean Consulting www.pmean.com 2. Abstract This training class will give you a general introduction to data management using
More informationCreating and Managing Surveys
S Computing Services Department Creating and Managing Surveys Select Survey Apr 2016 Page 0 of 27 U n i v e r s i t y o f L i v e r p o o l Table of Contents 1. Introduction... 2 2. How to Access the Survey
More informationChapter 3: The IF Function and Table Lookup
Chapter 3: The IF Function and Table Lookup Objectives This chapter focuses on the use of IF and LOOKUP functions, while continuing to introduce other functions as well. Here is a partial list of what
More informationTutorial: Create an Excel Dashboard. Overview. **** Download the Example Dashboard ****
Tutorial: Create an Excel Dashboard This page walks through building an Excel dashboard with a mobile tool called CommCare. Similar to how Excel helps people build spreadsheets, CommCare enables anyone
More informationOutline. Simple Commands. Definitions. Microsoft Excel for Data Management. Dr. K. A. Korb PSY 828 University of Jos 10/31/2017
Outline Microsoft Excel for Data Management Introduction to Microsoft Excel Data Processing Data Checking Dr. K. A. Korb Definitions Microsoft Excel: Spreadsheet with rows and columns of a grid to store,
More informationBaSICS OF excel By: Steven 10.1
BaSICS OF excel By: Steven 10.1 Workbook 1 workbook is made out of spreadsheet files. You can add it by going to (File > New Workbook). Cell Each & every rectangular box in a spreadsheet is referred as
More informationSage Hibernia Limited Copyright Statement
Sage Hibernia Limited Copyright Statement Sage Hibernia Limited, 2010. All rights reserved If this documentation includes advice or information relating to any matter other than using Sage software, such
More informationCancer Waiting Times. User Manual. Version 7.0 Published 4 August 2016
Cancer Waiting Times User Manual Version 7.0 Published 4 August 2016 Copyright 2016 Health and Social Care Information Centre. NHS Digital is the trading name of the Health and Social Care Information
More informationPICS Form & Survey Managers Version 14.xx Document Version: 2.04 Release Date: 13/01/16
PICS Form & Survey Managers Version 14.xx Document Version: 2.04 Release Date: 13/01/16 SECTION 1. INTRODUCTION CONTENTS SECTION 1. INTRODUCTION... 1 About PICS form & survey managers... 2 What do I need
More informationICT & MATHS. Excel 2003 in Mathematics Teaching
ICT & MATHS Excel 2003 in Mathematics Teaching Published by The National Centre for Technology in Education in association with the Project Maths Development Team. Permission granted to reproduce for educational
More informationFormulas, LookUp Tables and PivotTables Prepared for Aero Controlex
Basic Topics: Formulas, LookUp Tables and PivotTables Prepared for Aero Controlex Review ribbon terminology such as tabs, groups and commands Navigate a worksheet, workbook, and multiple workbooks Prepare
More informationExcel Introduction to Excel Databases & Data Tables
Creating an Excel Database Key field: Each record should have some field(s) that helps to uniquely identify them, put these fields at the start of your database. In an Excel database each column is a field
More informationDatabase Concepts Using Microsoft Access
lab Database Concepts Using Microsoft Access 9 Objectives: Upon successful completion of Lab 9, you will be able to Understand fundamental concepts including database, table, record, field, field name,
More informationIS357 Further Features Of Excel 2010
IS Further Features Of Excel 00 October 0 0 th October 0 Table of Contents FILES REQUIRED AND INTRODUCTION... ACCESSING THE IT TRAINING EXERCISE FILES FROM YOUR OWN PC/LAPTOP... OPENING AN EXISTING WORKBOOK...
More informationExcel. Database functions
Excel Database functions Objectives Week 3 By the end of this session you will be able to :- Move around a worksheet Recognise cell references Sort data alphabetically Insert rows and columns Delete rows
More informationJoint Oil Sands Monitoring: Environment Canada Cause and Effects Monitoring for Landbirds
P a g e 1 Joint Oil Sands Monitoring: Environment Canada Cause and Effects Monitoring for Landbirds Standard Operating Procedure (SOP) 5: Data Entry and Verification This Standard Operating Procedure describes
More informationFormulas Learn how to use Excel to do the math for you by typing formulas into cells.
Microsoft Excel 2007: Part III Creating Formulas Windows XP Microsoft Excel 2007 Microsoft Excel is an electronic spreadsheet program. Electronic spreadsheet applications allow you to type, edit, and print
More informationCPRD Aurum Frequently asked questions (FAQs)
CPRD Aurum Frequently asked questions (FAQs) Version 2.0 Date: 10 th April 2019 Authors: Helen Booth, Daniel Dedman, Achim Wolf (CPRD, UK) 1 Documentation Control Sheet During the course of the project
More informationExploring extreme weather with Excel - The basics
Exploring extreme weather with Excel - The basics These activities will help to develop your data skills using Excel and explore extreme weather in the UK. This activity introduces the basics of using
More informationUsing Microsoft Excel
Using Microsoft Excel Formatting a spreadsheet means changing the way it looks to make it neater and more attractive. Formatting changes can include modifying number styles, text size and colours. Many
More informationADVANCED INQUIRIES IN ALBEDO: PART 2 EXCEL DATA PROCESSING INSTRUCTIONS
ADVANCED INQUIRIES IN ALBEDO: PART 2 EXCEL DATA PROCESSING INSTRUCTIONS Once you have downloaded a MODIS subset, there are a few steps you must take before you begin analyzing the data. Directions for
More informationPubzilla Online Pub Database 18 th October 2011
Pubzilla Online Pub Database 18 th October 2011 Background Pubzilla is an online pub database that branches can use if they wish. It is centrally maintained and operated, so all a branch needs to use it
More informationWorkshop MedSciNet - Building an Electronic Data Capture System for a Clinical Trial
Note: for non-commercial purposes only Workshop MedSciNet - Building an Electronic Data Capture System for a Clinical Trial Presenters: Prof Marius Kublickas Laima Juodvirsiene Prof Magnus Westgren Friday,
More informationMr. Kongmany Chaleunvong. GFMER - WHO - UNFPA - LAO PDR Training Course in Reproductive Health Research Vientiane, 22 October 2009
Mr. Kongmany Chaleunvong GFMER - WHO - UNFPA - LAO PDR Training Course in Reproductive Health Research Vientiane, 22 October 2009 1 Object of the Course Introduction to SPSS The basics of managing data
More informationExcel Level 1
Excel 2016 - Level 1 Tell Me Assistant The Tell Me Assistant, which is new to all Office 2016 applications, allows users to search words, or phrases, about what they want to do in Excel. The Tell Me Assistant
More informationChecklist. ORB Education Quality Teaching Resources. ORB Education Visit for the full, editable versions.
Checklist 1. Data in our Lives 2. Representing Data 3. Working with Data 4. Introducing Spreadsheets 5. Basic Excel Skills 6. Google Sheets 7. Sorting and Filtering 8. Formulas 9. Plots and Trendlines
More informationPrepared By: Graeme Hilson. U3A Nunawading
0 Prepared By: Graeme Hilson U3A Nunawading - 2015 1 CONTENTS This Course Page 3 Reference Material Page 3 Introduction page 3 Microsoft Excel Page 3 What is a Spreadsheet Page 4 Excel Screen Page 4 Using
More informationThe Excel Project: Excel for Accountants, Business People... from the Beginning Duncan Williamson
The Excel Project: Excel for Accountants, Business People... from the Beginning Duncan Williamson Introduction In this book you will see that we use Excel 2007 as the focal point of much of the work we
More informationExcel Foundation (Step 2)
Excel 2007 Foundation (Step 2) Table of Contents Working with Names... 3 Default Names... 3 Naming Rules... 3 Creating a Name... 4 Defining Names... 4 Creating Multiple Names... 5 Selecting Names... 5
More informationA Simple Guide to Using SPSS (Statistical Package for the. Introduction. Steps for Analyzing Data. Social Sciences) for Windows
A Simple Guide to Using SPSS (Statistical Package for the Social Sciences) for Windows Introduction ٢ Steps for Analyzing Data Enter the data Select the procedure and options Select the variables Run the
More informationIntroduction to Stata Getting Data into Stata. 1. Enter Data: Create a New Data Set in Stata...
Introduction to Stata 2016-17 02. Getting Data into Stata 1. Enter Data: Create a New Data Set in Stata.... 2. Enter Data: How to Import an Excel Data Set.... 3. Import a Stata Data Set Directly from the
More informationThe Data Protection Act 1998 Clare Hall Data Protection Policy
The Data Protection Act 1998 Clare Hall Data Protection Policy Introduction This document is a guide to the main requirements of the new Data Protection Act (DPA) that came into force on 24th October 2001.
More informationExporting Data Importing Data
How To Guide Exporting Data Importing Data Wiztec Business Suite & WizHR 5 i Table Of Contents Table Of Contents Imports & Exports...1 Export Data...1 A. Exporting Data from One Screen...1 B. Exporting
More informationExcel Tips and FAQs - MS 2010
BIOL 211D Excel Tips and FAQs - MS 2010 Remember to save frequently! Part I. Managing and Summarizing Data NOTE IN EXCEL 2010, THERE ARE A NUMBER OF WAYS TO DO THE CORRECT THING! FAQ1: How do I sort my
More informationExcel Conditional Formatting (Mac)
[Type here] Excel Conditional Formatting (Mac) Using colour to make data analysis easier Excel conditional formatting automatically formats cells in your worksheet if specified criteria are met, giving
More informationSecurities Lending Reporting
Securities Lending Reporting User notes for submitting Borrowed, Loaned and Committed Securities information via ASX Online November 2013 User notes for submitting Borrowed and Loaned Stock information
More informationOpenClinica - Data Import & Export
OpenClinica - Data Import & Export Purpose of this document: To provide instruction and best practice guidelines for the import and export of study data into and out of OpenClinica. Version: OpenClinica
More informationUsing Excel to produce graphs - a quick introduction:
Research Skills -Using Excel to produce graphs: page 1: Using Excel to produce graphs - a quick introduction: This handout presupposes that you know how to start Excel and enter numbers into the cells
More informationSTEP Household Questionnaire. Guidelines for Data Processing
STEP Household Questionnaire Guidelines for Data Processing This Version: December 11, 2012 Table of Contents 1. Data Entry Process and Timing... 3 2. Data Files Structure... 4 3. Consistency Checks...
More informationINTRODUCTORY SPSS. Dr Feroz Mahomed Swalaha x2689
INTRODUCTORY SPSS Dr Feroz Mahomed Swalaha fswalaha@dut.ac.za x2689 1 Statistics (the systematic collection and display of numerical data) is the most abused area of numeracy. 97% of statistics are made
More informationExcel Tips. Contents. By Dick Evans
Excel Tips By Dick Evans Contents Pasting Data into an Excel Worksheet... 2 Divide by Zero Errors... 2 Creating a Dropdown List... 2 Using the Built In Dropdown List... 3 Entering Data with Forms... 4
More informationTeamMate Analytics 6.0 Release Notes October 2018
TeamMate Analytics 6.0 Release Notes October 2018 This is a major release with exciting new features and enhancements, including: Capability to Analyze More than 1 Million Rows are no longer constrained
More informationCreating an ArborCAD filter from Excel.
ArborCAD Help File Windows systems only Creating an ArborCAD filter from Excel. This help file assumes you have your tree survey data in an Excel sheet. The ArborCAD filter is a special file which holds
More informationCite: CTSA NIH Grant UL1- RR024982
PREREQUISITE FOR USE Review and approval of the project by the Institutional Review Board is required If colleting data for the purpose of human subject s research. Cite: CTSA NIH Grant UL1- RR024982 1
More informationGuide to setting up and using your NOW: Pensions payroll bureau microsite
Guide to setting up and using your NOW: Pensions payroll bureau microsite PM00050.0416/11 0 Step 1 Logging in for the first time You will have received a web link, via email, that will provide access to
More informationData Protection Policy
Data Protection Policy Data Protection Policy Version 3.00 May 2018 For more information, please contact: Technical Team T: 01903 228100 / 01903 550242 E: info@24x.com Page 1 The Data Protection Law...
More informationData Protection and Information Security. Presented by Emma Hawksworth Slater and Gordon
Data Protection and Information Security Webinar Presented by Emma Hawksworth Slater and Gordon 1 3 ways to participate Ask questions link below this presentation Answer the polls link below this presentation
More informationUSER MANUAL BULK UPLOAD CONTENTS FEBRUARY 2014 INTRODUCTION 2 1. HOW DO I ACCESS THE BULK UPLOAD FACILITY? 3 2. BULK UPLOAD CODES 4
USER MANUAL BULK UPLOAD CONTENTS INTRODUCTION 2. HOW DO I ACCESS THE BULK UPLOAD FACILITY? 3 2. BULK UPLOAD CODES 4 3. BULK UPLOAD TEMPLATE - EXCEL FORMAT 5 4. BULK UPLOAD TEMPLATE - CSV FORMAT 0 5. ACW
More informationExploring Microsoft Office Excel 2007
Exploring Microsoft Office Excel 2007 Chapter 5 Data to Information Robert Grauer, Keith Mulbery, Judy Scheeren Committed to Shaping the Next Generation of IT Experts. Copyright 2008 Pearson Prentice Hall.
More informationCreate your first workbook
Create your first workbook You've been asked to enter data in Excel, but you've never worked with Excel. Where do you begin? Or perhaps you have worked in Excel a time or two, but you still wonder how
More informationHow do we use Superlab conceptually.
How do we use Superlab conceptually. Superlab is constructed in 4 sections: 1. The compilation of the stimuli. This takes place outside of Superlab by the researcher producing pict file pictures in a paint
More informationCreating and Checking the PIRLS International Database
Chapter 8 Creating and Checking the PIRLS International Database Juliane Barth and Oliver Neuschmidt 8.1 Overview The PIRLS 2006 International Database is a unique resource for policy makers and analysts,
More informationHTC Data Use Tool - User s Manual
HTC Data Use Tool - User s Manual Module 1: Inputting data into the HTC Data Use Tool Global Strategic Information UCSF Global Health Sciences http://globalhealthsciences.ucsf.edu/pphg/gsi Contact us:
More informationCreating New Variables in JMP Datasets Using Formulas Exercises
Creating New Variables in JMP Datasets Using Formulas Exercises Exercise 3 Calculate the Difference of Two Columns 1. This Exercise will use the data table Cholesterol. This data table contains the following
More informationStatistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings
Statistical Good Practice Guidelines SSC home Using Excel for Statistics - Tips and Warnings On-line version 2 - March 2001 This is one in a series of guides for research and support staff involved in
More informationTroubleshooting and FAQs
Troubleshooting and FAQs 1. The dropdown menu for a field on the Incident Data worksheet does not contain all the option I want to enter. There is a high likelihood that the option you want to enter is
More informationMobility Tool Guide for Beneficiaries of Leonardo da Vinci programme
EUROPEAN COMMISSION Directorate-General for Education and Culture Lifelong Learning: policies and programme Coordination of the "Lifelong learning" programme Mobility Tool Guide for Beneficiaries of Leonardo
More informationSTREET MOBILITY PROJECT User Guide for Analysing the Health and Neighbourhood Mobility Survey
STREET MOBILITY PROJECT User Guide for Analysing the Health and Neighbourhood Mobility Survey March 2017 S T R E E T M O B I L I T Y P R O J E C T T O O L K I T : M E A S U R I N G T H E E F F E C T S
More information