Working with Administrative Databases: Tips and Tricks

Size: px
Start display at page:

Download "Working with Administrative Databases: Tips and Tricks"

Transcription

1 3 Working with Administrative Databases: Tips and Tricks Canadian Institute for Health Information Emerging Issues Team Simon Tavasoli

2 Administrative Databases > Administrative databases are often used to synthesize information regarding health care system or to investigate health research questions > The data may be derived from population registries, vital statistics or other records of life events, or from health claims and services data > Canadian Institute for Health Information (CIHI), collect /receives essential data and prepares analyses on Canada s health system and the health of Canadians > Currently CIHI holds more than 27 databases with millions of Record (e.g. National Ambulatory Care Registry contains millions of records each year) 3

3 Working with Administrative Databases: General Tips and Tricks > Each day hundreds of employees conduct analyses using SAS > Given the magnitude of work load on the CIHI server, using resources wisely is important There is always a trade-off > Efficiency can be measured in many ways Real Time CPU time Memory Input /Output Original Programmer time Maintenance Programmer time 3

4 System Options for measure of performance > Options STIMER; (Default ) NOTE: DATA statement used: real time 1.16 seconds cpu time 0.09 seconds > Options FULLSTIMER; NOTE: The SAS System used: real time 0.14 seconds user cpu time 0.01 seconds system cpu time 0.05 seconds Memory 1452k Page Faults 1 Page Reclaims 2349 Page Swaps 0 Voluntary Context Switches 53 Involuntary Context Switches 5 Block Input Operations 1 Block Output Operations 0 4

5 Optimizing performance * Optimize performance by reducing CPU time -Check the program using the _null_ or the OBS -Use WHERE vs. IF -Use DROP and KEEP statements -Issues with merging data -Avoid unnecessary DATA steps or sorting -Manipulation of data with IF/THEN/ELSE statements -Dealing with resource intensive calculations *Keep the libraries clean *Reduce the size of the tables using COMPRESS=YES 5

6 When checking your programs, use a null data set or limit the number of observations 6

7 Subsetting Datasets: WHERE vs. IF statements 7

8 Process only the variables that you need Need only two variables Social Sciences computing cooperative 8

9 Subsetting datasets 9

10 Subsetting datasets: KEEP Statement 10

11 Subsetting datasets: KEEP Statement 11

12 Subsetting datasets: KEEP Statement 12

13 Some other Shortcuts 13

14 Merging data 14

15 Merging data 15

16 When only one condition can be true for a given observation, write a series of IF-THEN/ELSE statements. Social Sciences computing cooperative 16

17 When only one condition can be true for a given observation, write a series of IF-THEN/ELSE statements. 17

18 When only one condition can be true for a given observation, write a series of IF-THEN/ELSE statements. 18

19 Perform resource-intensive calculations and comparisons only once Social Sciences computing cooperative 19

20 Assign many values in one statement Social Sciences computing cooperative 20

21 Dealing with Missing Values Put missing values last in expressions Check for missing values before using a variable in multiple statements. Social Sciences computing cooperative 21

22 Avoid unnecessary sorting 22

23 If several different subsets are needed, avoid rereading the data for each subset 23

24 Keep your SAS environment clean 24

25 COMPRESS= 25

Green Eggs And SAS. Presented To The Edmonton SAS User Group October 24, 2017 By John Fleming. SAS is a registered trademark of The SAS Institute

Green Eggs And SAS. Presented To The Edmonton SAS User Group October 24, 2017 By John Fleming. SAS is a registered trademark of The SAS Institute Green Eggs And SAS Presented To The Edmonton SAS User Group October 24, 2017 By John Fleming SAS is a registered trademark of The SAS Institute ESUG - October 24, 2017 1 How To Merge SAS Programming With

More information

A Practical Approach to Process Improvement Using Parallel Processing

A Practical Approach to Process Improvement Using Parallel Processing Paper PA06 A Practical Approach to Process Improvement Using Parallel Processing Viraj Kumbhakarna, JPMorgan Chase & Co., Columbus, OH, 43240 ABSTRACT In applications which process huge volumes of data

More information

Ten tips for efficient SAS code

Ten tips for efficient SAS code Ten tips for efficient SAS code Host Caroline Scottow Presenter Peter Hobart Managing the webinar In Listen Mode Control bar opened with the white arrow in the orange box Efficiency Overview Optimisation

More information

ERROR: The following columns were not found in the contributing table: vacation_allowed

ERROR: The following columns were not found in the contributing table: vacation_allowed Page 1 I DIDN T KNOW YOU COULD DO THAT! REVELATIONS FROM THE ADVANCED SAS CERTIFICATION EXAM Patricia Hettinger, Certified SAS Professional, Oakbrook Terrace, IL ABSTRACT Many people have extreme test

More information

Optimizing System Performance

Optimizing System Performance 243 CHAPTER 19 Optimizing System Performance Definitions 243 Collecting and Interpreting Performance Statistics 244 Using the FULLSTIMER and STIMER System Options 244 Interpreting FULLSTIMER and STIMER

More information

Paper CC16. William E Benjamin Jr, Owl Computer Consultancy LLC, Phoenix, AZ

Paper CC16. William E Benjamin Jr, Owl Computer Consultancy LLC, Phoenix, AZ Paper CC16 Smoke and Mirrors!!! Come See How the _INFILE_ Automatic Variable and SHAREBUFFERS Infile Option Can Speed Up Your Flat File Text-Processing Throughput Speed William E Benjamin Jr, Owl Computer

More information

The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets

The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets Jeffrey Brown, Lesley Curtis, and Rich Platt June 13, 2014 Previously The NIH Collaboratory:

More information

General Tips for Working with Large SAS datasets and Oracle tables

General Tips for Working with Large SAS datasets and Oracle tables General Tips for Working with Large SAS datasets and Oracle tables 1) Avoid duplicating Oracle tables as SAS datasets only keep the rows and columns needed for your analysis. Use keep/drop/where directly

More information

SAS Online Training: Course contents: Agenda:

SAS Online Training: Course contents: Agenda: SAS Online Training: Course contents: Agenda: (1) Base SAS (6) Clinical SAS Online Training with Real time Projects (2) Advance SAS (7) Financial SAS Training Real time Projects (3) SQL (8) CV preparation

More information

How to Monitor Your DAD and/or NACRS Data Submissions

How to Monitor Your DAD and/or NACRS Data Submissions Job Aid July 2017 How to Monitor Your DAD and/or NACRS Data Submissions Review the following products and services before the start of a new fiscal year to help you prepare for DAD and/or NACRS data submission.

More information

Lab #9: ANOVA and TUKEY tests

Lab #9: ANOVA and TUKEY tests Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for

More information

Data Quality Assessment Tool for health and social care. October 2018

Data Quality Assessment Tool for health and social care. October 2018 Data Quality Assessment Tool for health and social care October 2018 Introduction This interactive data quality assessment tool has been developed to meet the needs of a broad range of health and social

More information

PROC FORMAT: USE OF THE CNTLIN OPTION FOR EFFICIENT PROGRAMMING

PROC FORMAT: USE OF THE CNTLIN OPTION FOR EFFICIENT PROGRAMMING PROC FORMAT: USE OF THE CNTLIN OPTION FOR EFFICIENT PROGRAMMING Karuna Nerurkar and Andrea Robertson, GMIS Inc. ABSTRACT Proc Format can be a useful tool for improving programming efficiency. This paper

More information

Detecting Outliers in Column Profile Results in Informatica Analyst

Detecting Outliers in Column Profile Results in Informatica Analyst Detecting Outliers in Column Profile Results in Informatica Analyst 1993, 2016 Informatica LLC. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying,

More information

How to write ADaM specifications like a ninja.

How to write ADaM specifications like a ninja. Poster PP06 How to write ADaM specifications like a ninja. Caroline Francis, Independent SAS & Standards Consultant, Torrevieja, Spain ABSTRACT To produce analysis datasets from CDISC Study Data Tabulation

More information

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples Hadoop Introduction 1 Topics Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples 2 Big Data Analytics What is Big Data?

More information

SAS File Management. Improving Performance CHAPTER 37

SAS File Management. Improving Performance CHAPTER 37 519 CHAPTER 37 SAS File Management Improving Performance 519 Moving SAS Files Between Operating Environments 520 Converting SAS Files 520 Repairing Damaged Files 520 Recovering SAS Data Files 521 Recovering

More information

SAS Performance Tuning Strategies and Techniques

SAS Performance Tuning Strategies and Techniques SAS Performance Tuning Strategies and Techniques Kirk Paul Lafler, Software Intelligence Corporation, Spring Valley, CA ABSTRACT As SAS Software becomes increasingly more popular, guidelines for its efficient

More information

If You Need These OBS and These VARS, Then Drop IF, and Keep WHERE Jay Iyengar, Data Systems Consultants LLC

If You Need These OBS and These VARS, Then Drop IF, and Keep WHERE Jay Iyengar, Data Systems Consultants LLC Paper 2417-2018 If You Need These OBS and These VARS, Then Drop IF, and Keep WHERE Jay Iyengar, Data Systems Consultants LLC ABSTRACT Reading data effectively in the DATA step requires knowing the implications

More information

CITI PROGRAM NEW LEARNER ACCOUNT REGISTRATION

CITI PROGRAM NEW LEARNER ACCOUNT REGISTRATION CITI PROGRAM NEW LEARNER ACCOUNT REGISTRATION Go to www.citiprogram.org and click on the "Register" button located in the blue log in box to the right of the homepage. The next steps are numbered 1 7.

More information

Chapter 6: Modifying and Combining Data Sets

Chapter 6: Modifying and Combining Data Sets Chapter 6: Modifying and Combining Data Sets The SET statement is a powerful statement in the DATA step. Its main use is to read in a previously created SAS data set which can be modified and saved as

More information

Common Sense Tips and Clever Tricks for Programming with Extremely Large SAS Data Sets

Common Sense Tips and Clever Tricks for Programming with Extremely Large SAS Data Sets Common Sense Tips and Clever Tricks for Programming with Extremely Large SAS Data Sets Kathy Hardis Fraeman, United BioSource Corporation, Bethesda, MD ABSTRACT Working with extremely large SAS data sets

More information

FINNISH APPROACH TO CRITICAL INFRASTRUCTURE PROTECTION

FINNISH APPROACH TO CRITICAL INFRASTRUCTURE PROTECTION FINNISH APPROACH TO CRITICAL INFRASTRUCTURE PROTECTION Katri Liekkilä, M.M.Sc., M.Sc. (Econ) Special Adviser IMPROVER Operators workshop, Lisbon 2018 NATIONAL DOCUMENTS RELATED TO CIP SECURITY STRATEGY

More information

MDM 4UI: Navigating and Using the Statistics Canada Website

MDM 4UI: Navigating and Using the Statistics Canada Website MDM 4UI: Navigating and Using the Statistics Canada Website Method 1: Module search How to search the Statistics Canada website to find articles and data for projects The Statistics Canada website (www.statcan.gc.ca)

More information

Critical Information Infrastructure Protection Law

Critical Information Infrastructure Protection Law Critical Information Infrastructure Protection Law CCD COE Training 8 September 2009 Tallinn, Estonia Maeve Dion Center for Infrastructure Protection George Mason University School of Law Arlington, Virginia.

More information

The Pan-Canadian Real-world Health Data Network (PRHDN)

The Pan-Canadian Real-world Health Data Network (PRHDN) The Pan-Canadian Real-world Health Data Network (PRHDN) Building a SPOR National Data Platform April 2018 Consultation Deck Kim.mcgrail@ubc.ca Nominated Principal Investigator for the PRHDN Submission

More information

SAS Programming Efficiency: Tips, Examples, and PROC GINSIDE Optimization

SAS Programming Efficiency: Tips, Examples, and PROC GINSIDE Optimization SAS Programming Efficiency: Tips, Examples, and PROC GINSIDE Optimization Lingqun Liu, University of Michigan MISUG, Feb 2018 1 Outline This paper first explores the concepts of efficiency. Then reviews

More information

Performance Considerations

Performance Considerations 149 CHAPTER 6 Performance Considerations Hardware Considerations 149 Windows Features that Optimize Performance 150 Under Windows NT 150 Under Windows NT Server Enterprise Edition 4.0 151 Processing SAS

More information

CSE101: Design and Analysis of Algorithms. Ragesh Jaiswal, CSE, UCSD

CSE101: Design and Analysis of Algorithms. Ragesh Jaiswal, CSE, UCSD Recap. Growth rates: Arrange the following functions in ascending order of growth rate: n 2 log n n log n 2 log n n/ log n n n Introduction Algorithm: A step-by-step way of solving a problem. Design of

More information

Effective ways of handling various file types and importing techniques using SAS 9.4

Effective ways of handling various file types and importing techniques using SAS 9.4 Effective ways of handling various file types and importing techniques using SAS 9.4 Dadi, Divya Jhaver, Rahul 2016 SAS Analytics Day Introduction One major problem organizations face is huge data load

More information

Greenspace: A Macro to Improve a SAS Data Set Footprint

Greenspace: A Macro to Improve a SAS Data Set Footprint Paper AD-150 Greenspace: A Macro to Improve a SAS Data Set Footprint Brian Varney, Experis Business Intelligence and Analytics Practice ABSTRACT SAS programs can be very I/O intensive. SAS data sets with

More information

%DWFK$&&(66WR $'$%$6%$$ E\ 6WXDUW%LUFK IURP,QIRUPDWLRQ'HOLYHU\ 6\VWHPV6RXWK$IULFD

%DWFK$&&(66WR $'$%$6%$$ E\ 6WXDUW%LUFK IURP,QIRUPDWLRQ'HOLYHU\ 6\VWHPV6RXWK$IULFD %DWFK$&&(66WR $'$%$6%$$ E\ 6WXDUW%LUFK IURP,QIRUPDWLRQ'HOLYHU\ 6\VWHPV6RXWK$IULFD 1 ,QWURGXFWLRQ O Objectives and Benefits O Applicable Environment O Terms and Definitions O System Components Objectives

More information

Teammate Self-Service

Teammate Self-Service Teammate Self-Service Teammate Self-Service... 1 Overview of Teammate Self-Service... 2 Access to Teammate Self-Service... 3 Logging In to Teammate Self-Service... 3 Logging Out of Teammate Self-Service...

More information

Choosing the Right Technique to Merge Large Data Sets Efficiently Qingfeng Liang, Community Care Behavioral Health Organization, Pittsburgh, PA

Choosing the Right Technique to Merge Large Data Sets Efficiently Qingfeng Liang, Community Care Behavioral Health Organization, Pittsburgh, PA Choosing the Right Technique to Merge Large Data Sets Efficiently Qingfeng Liang, Community Care Behavioral Health Organization, Pittsburgh, PA ABSTRACT This paper outlines different SAS merging techniques

More information

Cleaning up your SAS log: Note Messages

Cleaning up your SAS log: Note Messages Paper 9541-2016 Cleaning up your SAS log: Note Messages ABSTRACT Jennifer Srivastava, Quintiles Transnational Corporation, Durham, NC As a SAS programmer, you probably spend some of your time reading and

More information

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V)

Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Based on Industry Cases, Live Exercises, & Industry Executed Projects Module (I) Analytics Essentials 81 hrs 1. Statistics

More information

Data security statement Volunteers

Data security statement Volunteers Data security statement Volunteers 1 Register controller 2 Contact information for matters pertaining to the handling of personal information 3 Personal data group 4 The purpose for processing personal

More information

SOS (Save Our Space) Matters of Size

SOS (Save Our Space) Matters of Size SOS (Save Our Space) Matters of Size By Matthew Pearce Amadeus Software Limited 2001 Abstract Disk space is one of the most critical issues when handling large amounts of data. Large data means greater

More information

Comparison of different ways using table lookups on huge tables

Comparison of different ways using table lookups on huge tables PhUSE 007 Paper CS0 Comparison of different ways using table lookups on huge tables Ralf Minkenberg, Boehringer Ingelheim Pharma GmbH & Co. KG, Ingelheim, Germany ABSTRACT In many application areas the

More information

SPSS TRAINING SPSS VIEWS

SPSS TRAINING SPSS VIEWS SPSS TRAINING SPSS VIEWS Dataset Data file Data View o Full data set, structured same as excel (variable = column name, row = record) Variable View o Provides details for each variable (column in Data

More information

In your school or local public library, log on to the library catalogue.

In your school or local public library, log on to the library catalogue. 1.3 Databases A database is an organized store of records. Databases may contain information about almost any subject incomes, shopping habits, demographics, features of cars, and so on. INVESTIGATE &

More information

Using PROC PLAN for Randomization Assignments

Using PROC PLAN for Randomization Assignments Using PROC PLAN for Randomization Assignments Miriam W. Rosenblatt Division of General Internal Medicine and Health Care Research, University. Hospitals of Cleveland Abstract This tutorial is an introduction

More information

Using PROC SQL to Calculate FIRSTOBS David C. Tabano, Kaiser Permanente, Denver, CO

Using PROC SQL to Calculate FIRSTOBS David C. Tabano, Kaiser Permanente, Denver, CO Using PROC SQL to Calculate FIRSTOBS David C. Tabano, Kaiser Permanente, Denver, CO ABSTRACT The power of SAS programming can at times be greatly improved using PROC SQL statements for formatting and manipulating

More information

ACHOO - THE FLU, SAS & YOU

ACHOO - THE FLU, SAS & YOU ACHOO - THE FLU, SAS & YOU CHARU SHANKAR, SAS INSTITUTE CANADA Health User Group Toronto 20 November 2015 AGENDA ACHOO - THE FLU, SAS & YOU 1. Some like it cold -Ways to fight the flu 2. Data Collection

More information

Blackout 2003 Reliability Recommendations

Blackout 2003 Reliability Recommendations Blackout 2003 Reliability Recommendations 2005 NPCC General Meeting The Cranwell Resort Lenox, MA September 29, 2005 Philip A. Fedora Director, Market Reliability Interface Northeast Power Coordinating

More information

COLUMN DATABASES A NDREW C ROTTY & ALEX G ALAKATOS

COLUMN DATABASES A NDREW C ROTTY & ALEX G ALAKATOS COLUMN DATABASES A NDREW C ROTTY & ALEX G ALAKATOS OUTLINE RDBMS SQL Row Store Column Store C-Store Vertica MonetDB Hardware Optimizations FACULTY MEMBER VERSION EXPERIMENT Question: How does time spent

More information

Flex Program Guide: Using MBQIP Excel Files May 2017

Flex Program Guide: Using MBQIP Excel Files May 2017 Flex Program Guide: Using MBQIP Excel Files May 2017 This project is supported by the Health Resources and Services Administration (HRSA) of the U.S. Department of Health and Human Services (HHS) under

More information

Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc.

Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc. PharmaSUG2011 - Paper DM03 Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc., TX ABSTRACT In the Clinical trials data analysis

More information

Data Representation. Types of data: Numbers Text Audio Images & Graphics Video

Data Representation. Types of data: Numbers Text Audio Images & Graphics Video Data Representation Data Representation Types of data: Numbers Text Audio Images & Graphics Video Analog vs Digital data How is data represented? What is a signal? Transmission of data Analog vs Digital

More information

The Building Blocks of SAS Datasets. (Set, Merge, and Update) Andrew T. Kuligowski FCCI Insurance Group

The Building Blocks of SAS Datasets. (Set, Merge, and Update) Andrew T. Kuligowski FCCI Insurance Group The Building Blocks of SAS Datasets S-M-U (Set, Merge, and Update) Andrew T. Kuligowski FCCI Insurance Group S-M-U What is S M U? 2 S-M-U What is S M U? Shmoo? 3 S-M-U What is S M U? Southern Methodist

More information

APPENDIX 3 Tuning Tips for Applications That Use SAS/SHARE Software

APPENDIX 3 Tuning Tips for Applications That Use SAS/SHARE Software 177 APPENDIX 3 Tuning Tips for Applications That Use SAS/SHARE Software Authors 178 Abstract 178 Overview 178 The SAS Data Library Model 179 How Data Flows When You Use SAS Files 179 SAS Data Files 179

More information

Paper DB2 table. For a simple read of a table, SQL and DATA step operate with similar efficiency.

Paper DB2 table. For a simple read of a table, SQL and DATA step operate with similar efficiency. Paper 76-28 Comparative Efficiency of SQL and Base Code When Reading from Database Tables and Existing Data Sets Steven Feder, Federal Reserve Board, Washington, D.C. ABSTRACT In this paper we compare

More information

The network marketing industry has grown by ninety percent during the last ten years as reported by the Direct Sellers Association.

The network marketing industry has grown by ninety percent during the last ten years as reported by the Direct Sellers Association. The network marketing industry has grown by ninety percent during the last ten years as reported by the Direct Sellers Association. The Bureau of Labor Statistics states: 95% of people, age 65 and over

More information

MRR (Multi Resolution Raster) Revolutionizing Raster

MRR (Multi Resolution Raster) Revolutionizing Raster MRR (Multi Resolution Raster) Revolutionizing Raster Praveen Gupta Praveen.Gupta@pb.com Pitney Bowes, Noida, India T +91 120 4026000 M +91 9810 659 350 Pitney Bowes, pitneybowes.com/in 5 th Floor, Tower

More information

50 WAYS TO MERGE YOUR DATA INSTALLMENT 1 Kristie Schuster, LabOne, Inc., Lenexa, Kansas Lori Sipe, LabOne, Inc., Lenexa, Kansas

50 WAYS TO MERGE YOUR DATA INSTALLMENT 1 Kristie Schuster, LabOne, Inc., Lenexa, Kansas Lori Sipe, LabOne, Inc., Lenexa, Kansas Paper 103-26 50 WAYS TO MERGE YOUR DATA INSTALLMENT 1 Kristie Schuster, LabOne, Inc., Lenexa, Kansas Lori Sipe, LabOne, Inc., Lenexa, Kansas ABSTRACT When you need to join together two datasets, how do

More information

Working with Composite Endpoints: Constructing Analysis Data Pushpa Saranadasa, Merck & Co., Inc., Upper Gwynedd, PA

Working with Composite Endpoints: Constructing Analysis Data Pushpa Saranadasa, Merck & Co., Inc., Upper Gwynedd, PA PharmaSug2016- Paper HA03 Working with Composite Endpoints: Constructing Analysis Data Pushpa Saranadasa, Merck & Co., Inc., Upper Gwynedd, PA ABSTRACT A composite endpoint in a Randomized Clinical Trial

More information

UCB CALSTAPH EXCEL AND EPIDEMIOLOGY

UCB CALSTAPH EXCEL AND EPIDEMIOLOGY UCB CALSTAPH EXCEL AND EPIDEMIOLOGY Gail Sondermeyer Cooksey, MPH Infectious Diseases Branch California Department of Public Health gail.cooksey@cdph.ca.gov Overview Introduction Functions Point and Click

More information

Multi-Threaded Reads in SAS/Access for Relational Databases Sarah Whittier, ISO New England, Holyoke, MA

Multi-Threaded Reads in SAS/Access for Relational Databases Sarah Whittier, ISO New England, Holyoke, MA Multi-Threaded Reads in SAS/Access for Relational Databases Sarah Whittier, ISO New England, Holyoke, MA ABSTRACT Multi-threading was implemented in SAS 9. This new feature affects the performance of certain

More information

Cheat sheet: Data Processing Optimization - for Pharma Analysts & Statisticians

Cheat sheet: Data Processing Optimization - for Pharma Analysts & Statisticians Cheat sheet: Data Processing Optimization - for Pharma Analysts & Statisticians ABSTRACT Karthik Chidambaram, Senior Program Director, Data Strategy, Genentech, CA This paper will provide tips and techniques

More information

Planning for disaster recovery in a health care setting

Planning for disaster recovery in a health care setting E-Guide Planning for disaster recovery in a health care setting For hospitals, timely access to patient data is critical for maintaining normal operations during a natural or man- made disaster. This Eguide

More information

TIPS AND TRICKS: IMPROVE EFFICIENCY TO YOUR SAS PROGRAMMING

TIPS AND TRICKS: IMPROVE EFFICIENCY TO YOUR SAS PROGRAMMING TIPS AND TRICKS: IMPROVE EFFICIENCY TO YOUR SAS PROGRAMMING Guillaume Colley, Lead Data Analyst, BCCFE Page 1 Contents Customized SAS Session Run system options as SAS starts Labels management Shortcut

More information

Lecture 8 Wireless Sensor Networks: Overview

Lecture 8 Wireless Sensor Networks: Overview Lecture 8 Wireless Sensor Networks: Overview Reading: Wireless Sensor Networks, in Ad Hoc Wireless Networks: Architectures and Protocols, Chapter 12, sections 12.1-12.2. I. Akyildiz, W. Su, Y. Sankarasubramaniam

More information

Programming Beyond the Basics. Find() the power of Hash - How, Why and When to use the SAS Hash Object John Blackwell

Programming Beyond the Basics. Find() the power of Hash - How, Why and When to use the SAS Hash Object John Blackwell Find() the power of Hash - How, Why and When to use the SAS Hash Object John Blackwell ABSTRACT The SAS hash object has come of age in SAS 9.2, giving the SAS programmer the ability to quickly do things

More information

UNCLASSIFIED. National and Cyber Security Branch. Presentation for Gridseccon. Quebec City, October 18-21

UNCLASSIFIED. National and Cyber Security Branch. Presentation for Gridseccon. Quebec City, October 18-21 National and Cyber Security Branch Presentation for Gridseccon Quebec City, October 18-21 1 Public Safety Canada Departmental Structure 2 National and Cyber Security Branch National and Cyber Security

More information

dtalink Faster probabilistic record linking and deduplication methods in Stata for large data files Keith Kranker

dtalink Faster probabilistic record linking and deduplication methods in Stata for large data files Keith Kranker dtalink Faster probabilistic record linking and deduplication methods in Stata for large data files Presentation at the 2018 Stata Conference Columbus, Ohio July 20, 2018 Keith Kranker Abstract Stata users

More information

Andrew H. Karp Sierra Information Services, Inc. San Francisco, California USA

Andrew H. Karp Sierra Information Services, Inc. San Francisco, California USA Indexing and Compressing SAS Data Sets: How, Why, and Why Not Andrew H. Karp Sierra Information Services, Inc. San Francisco, California USA Many users of SAS System software, especially those working

More information

EEE 435 Principles of Operating Systems

EEE 435 Principles of Operating Systems EEE 435 Principles of Operating Systems Modeling Page Replacement Algorithms (Modern Operating Systems 4.5) Quick Review How is WSClock different from Clock? What is the optimal algorithm Is it implementable?

More information

Anonymization Case Study 1: Randomizing Names and Addresses

Anonymization Case Study 1: Randomizing Names and Addresses Anonymization Case Study 1: Randomizing Names and Addresses The PrivacyAnalytics Tool is being developed as part of a joint research project between the Children s Hospital of Eastern Ontario Research

More information

Week 6, Week 7 and Week 8 Analyses of Variance

Week 6, Week 7 and Week 8 Analyses of Variance Week 6, Week 7 and Week 8 Analyses of Variance Robyn Crook - 2008 In the next few weeks we will look at analyses of variance. This is an information-heavy handout so take your time reading it, and don

More information

BORN Ontario s Data Quality Framework

BORN Ontario s Data Quality Framework BORN Ontario s Data Quality Framework At BORN Ontario we make Data Privacy and Data Quality our highest priority. We recognize that the quality of the data directly impacts use of the data. With addition

More information

CS 525: Advanced Database Organization 04: Indexing

CS 525: Advanced Database Organization 04: Indexing CS 5: Advanced Database Organization 04: Indexing Boris Glavic Part 04 Indexing & Hashing value record? value Slides: adapted from a course taught by Hector Garcia-Molina, Stanford InfoLab CS 5 Notes 4

More information

System Requirements. SAS Profitability Management 2.3. Deployment Options. Supported Operating Systems and Versions. Windows Server Operating Systems

System Requirements. SAS Profitability Management 2.3. Deployment Options. Supported Operating Systems and Versions. Windows Server Operating Systems SAS Profitability Management 2.3 This document provides the requirements for installing and running SAS Profitability Management. This document has been updated for the first maintenance release of SAS

More information

R commander an introduction

R commander an introduction R commander an introduction free, user-friendly, and powerful software Ho Kim SCHOOL OF PUBLIC HEALTH, SNU Useful sites R is a free software with powerful tools The Comprehensive R Archives Network http://cran.r-project.org/

More information

Mia Stephens JMP Academic Ambassador, SAS, NC

Mia Stephens JMP Academic Ambassador, SAS, NC Japan Discovery Summit 11/18/2016 Shaping up Big Data A data workout with JMP Michèle Boulanger Rollins College, FL Chair of ISO/Technical Committee on Applications of Statistics Mia Stephens JMP Academic

More information

Disk Subsystem Capacity Management, Based on Business Drivers, I/O Performance Metrics and MASF. Igor Trubin, Ph.D. and Linwood Merritt

Disk Subsystem Capacity Management, Based on Business Drivers, I/O Performance Metrics and MASF. Igor Trubin, Ph.D. and Linwood Merritt Disk Subsystem Capacity Management, Based on Business Drivers, I/O Performance Metrics and MASF Igor Trubin, Ph.D. and Linwood Merritt Capital One Services, Inc. igor.trubin@capitalone.com May 2004 Page

More information

Locate the patent portfolio of interest

Locate the patent portfolio of interest Derwent Innovation & Derwent Data Analyzer Blueprint for Success Identify problematic patents, abandoned technology, and other trends in a Patent Portfolio How can you quickly analyze a company s patent

More information

SAS Scalable Performance Data Server 4.3

SAS Scalable Performance Data Server 4.3 Scalability Solution for SAS Dynamic Cluster Tables A SAS White Paper Table of Contents Introduction...1 Cluster Tables... 1 Dynamic Cluster Table Loading Benefits... 2 Commands for Creating and Undoing

More information

IBM InfoSphere Data Replication s Change Data Capture (CDC) for DB2 LUW databases (Version ) Performance Evaluation and Analysis

IBM InfoSphere Data Replication s Change Data Capture (CDC) for DB2 LUW databases (Version ) Performance Evaluation and Analysis Page 1 IBM InfoSphere Data Replication s Change Data Capture (CDC) for DB2 LUW databases (Version 10.2.1) Performance Evaluation and Analysis 2014 Prasa Urithirakodeeswaran Page 2 Contents Introduction...

More information

Navigate to Financial Management > Vendors > Setup >Configuration > Custom Forms Setup.

Navigate to Financial Management > Vendors > Setup >Configuration > Custom Forms Setup. Vendors Custom Forms Setup Finance Custom Forms are user defined fields that can be created to keep track of information on vendors that is not currently tracked in Skyward. You are able to create your

More information

SIMULATING SECURE DATA EXTRACTION IN EXTRACTION TRANSFORMATION LOADING (ETL) PROCESSES

SIMULATING SECURE DATA EXTRACTION IN EXTRACTION TRANSFORMATION LOADING (ETL) PROCESSES http:// SIMULATING SECURE DATA EXTRACTION IN EXTRACTION TRANSFORMATION LOADING (ETL) PROCESSES Ashish Kumar Rastogi Department of Information Technology, Azad Group of Technology & Management. Lucknow

More information

Excel Training - Beginner March 14, 2018

Excel Training - Beginner March 14, 2018 Excel Training - Beginner March 14, 2018 Working File File was emailed to you this morning, please log in to your email, download and open the file. Once you have the file PLEASE CLOSE YOUR EMAIL. Open

More information

Sage Canadian SMB Survey on Mobile Devices March 2013

Sage Canadian SMB Survey on Mobile Devices March 2013 Sage Canadian SMB Survey on Mobile Devices March 2013 Summary Report Introduction Sage North America, a leading provider of business management software and services to more than 6 million small and midsized

More information

EBOOK 4 TIPS FOR STRENGTHENING THE SECURITY OF YOUR VPN ACCESS

EBOOK 4 TIPS FOR STRENGTHENING THE SECURITY OF YOUR VPN ACCESS EBOOK 4 TIPS FOR STRENGTHENING THE SECURITY OF YOUR VPN ACCESS HOW SECURE IS YOUR VPN ACCESS? Remote access gateways such as VPNs and firewalls provide critical anywhere-anytime connections to the networks

More information

PharmaSUG Paper TT11

PharmaSUG Paper TT11 PharmaSUG 2014 - Paper TT11 What is the Definition of Global On-Demand Reporting within the Pharmaceutical Industry? Eric Kammer, Novartis Pharmaceuticals Corporation, East Hanover, NJ ABSTRACT It is not

More information

PDF // TUTSPLUS WEB DESIGN DOCUMENT

PDF // TUTSPLUS WEB DESIGN DOCUMENT 16 November, 2017 PDF // TUTSPLUS WEB DESIGN DOCUMENT Document Filetype: PDF 396.96 KB 0 PDF // TUTSPLUS WEB DESIGN DOCUMENT Once rarely used in this dynamic medium, retro and vintage elements are now

More information

So Much Data, So Little Time: Splitting Datasets For More Efficient Run Times and Meeting FDA Submission Guidelines

So Much Data, So Little Time: Splitting Datasets For More Efficient Run Times and Meeting FDA Submission Guidelines Paper TT13 So Much Data, So Little Time: Splitting Datasets For More Efficient Run Times and Meeting FDA Submission Guidelines Anthony Harris, PPD, Wilmington, NC Robby Diseker, PPD, Wilmington, NC ABSTRACT

More information

Collective Mind. Early Warnings of Systematic Failures of Equipment. Dr. Artur Dubrawski. Dr. Norman Sondheimer. Auton Lab Carnegie Mellon University

Collective Mind. Early Warnings of Systematic Failures of Equipment. Dr. Artur Dubrawski. Dr. Norman Sondheimer. Auton Lab Carnegie Mellon University Collective Mind Early Warnings of Systematic Failures of Equipment Dr. Artur Dubrawski Auton Lab Carnegie Mellon University Dr. Norman Sondheimer University of Massachusetts Amherst 1 Collective Mind Unique

More information

An Introduction to Analysis (and Repository) Databases (ARDs)

An Introduction to Analysis (and Repository) Databases (ARDs) An Introduction to Analysis (and Repository) TM Databases (ARDs) Russell W. Helms, Ph.D. Rho, Inc. Chapel Hill, NC RHelms@RhoWorld.com www.rhoworld.com Presented to DIA-CDM: Philadelphia, PA, 1 April 2003

More information

Information Retrieval

Information Retrieval Introduction to Information Retrieval Lecture 4: Index Construction Plan Last lecture: Dictionary data structures Tolerant retrieval Wildcards This time: Spell correction Soundex Index construction Index

More information

Run Search and Export Results in Derwent Innovation

Run Search and Export Results in Derwent Innovation Derwent Data Analyzer Blueprint for Success Technology Report How can I quickly analyze a technology? Who are the main companies and inventors in this technology? Are there emerging trends, e.g. new target

More information

Memory Management! How the hardware and OS give application pgms:" The illusion of a large contiguous address space" Protection against each other"

Memory Management! How the hardware and OS give application pgms: The illusion of a large contiguous address space Protection against each other Memory Management! Goals of this Lecture! Help you learn about:" The memory hierarchy" Spatial and temporal locality of reference" Caching, at multiple levels" Virtual memory" and thereby " How the hardware

More information

Simple Rules to Remember When Working with Indexes

Simple Rules to Remember When Working with Indexes Simple Rules to Remember When Working with Indexes Kirk Paul Lafler, Software Intelligence Corporation, Spring Valley, CA Abstract SAS users are always interested in learning techniques related to improving

More information

A detailed comparison of EasyMorph vs Tableau Prep

A detailed comparison of EasyMorph vs Tableau Prep A detailed comparison of vs We at keep getting asked by our customers and partners: How is positioned versus?. Well, you asked, we answer! Short answer and are similar, but there are two important differences.

More information

An Introduction to the WERS-REPONSE Stata dataset. Version 1.0 (May 2016)

An Introduction to the WERS-REPONSE Stata dataset. Version 1.0 (May 2016) An Introduction to the WERS-REPONSE Stata dataset Version 1.0 (May 2016) 1. Introduction The WERS-REPONSE Stata dataset ( the WR dataset hereafter) was compiled as part of a research project to comparatively

More information

A SAS and Java Application for Reporting Clinical Trial Data. Kevin Kane MSc Infoworks (Data Handling) Limited

A SAS and Java Application for Reporting Clinical Trial Data. Kevin Kane MSc Infoworks (Data Handling) Limited A SAS and Java Application for Reporting Clinical Trial Data Kevin Kane MSc Infoworks (Data Handling) Limited Reporting Clinical Trials Is Resource Intensive! Reporting a clinical trial program for a new

More information

Introduction to SharePoint 2016 for Collaboration and Document Management

Introduction to SharePoint 2016 for Collaboration and Document Management Course 55193A: Introduction to SharePoint 2016 for Collaboration and Document Management - Course details Course Outline Module 1: SharePoint Overview This module provides an overview of SharePoint and

More information

How Managers and Executives Can Leverage SAS Enterprise Guide

How Managers and Executives Can Leverage SAS Enterprise Guide Paper 8820-2016 How Managers and Executives Can Leverage SAS Enterprise Guide ABSTRACT Steven First and Jennifer First-Kluge, Systems Seminar Consultants, Inc. SAS Enterprise Guide is an extremely valuable

More information

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS TO SAS NEED FOR SAS WHO USES SAS WHAT IS SAS? OVERVIEW OF BASE SAS SOFTWARE DATA MANAGEMENT FACILITY STRUCTURE OF SAS DATASET SAS PROGRAM PROGRAMMING LANGUAGE ELEMENTS OF THE SAS LANGUAGE RULES FOR SAS

More information

Software Testing and Maintenance 1

Software Testing and Maintenance 1 Combinatorial Testing!! Introduction!! Combinatorial Coverage Criteria!! Pairwise Test Generation!! Summary Software Testing and Maintenance 1 Motivation!! The behavior of a software application may be

More information

Definitive Healthcare Training Guide

Definitive Healthcare Training Guide 1 Definitive Healthcare Training Guide Thank you for subscribing to Definitive Healthcare s online database of intelligence on hospitals and healthcare providers. Definitive Healthcare updates information

More information