The Role of Data Profiling In Health Analytics

Similar documents
Healthcare in the Public Cloud DIY vs. Managed Services

Solving the Enterprise Data Dilemma

2 The IBM Data Governance Unified Process

Living with HIPAA: Compendium of Next steps from Rural Hospitals to Large Health Systems to Physician Practices

Interoperability. Doug Fridsma, MD PhD President & CEO, AMIA

Continuous auditing certification

BPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability.

Healthcare IT Modernization and the Adoption of Hybrid Cloud

Improving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN

Randy House Vice President of Health Informatics. Saint Luke s Health System. Lynsey McNeal Director Data of Governance. Saint Luke s Health System

SOLUTION BRIEF Virtual CISO

Data Governance Toolkit

THE ESSENCE OF DATA GOVERNANCE ARTICLE

Data governance and data quality: is it on your agenda or lurking in the shadows?

Business Architecture in Healthcare

Vulnerability Assessments and Penetration Testing

White Paper. How to Write an MSSP RFP

SECURITY THAT FOLLOWS YOUR FILES ANYWHERE

What is the Northern Ireland ehealth and Care strategy?

tyuiopasdfghjklzxcvbnmqwertyuiopas dfghjklzxcvbnmqwertyuiopasdfghjklzx cvbnmqwertyuiopasdfghjklzxcvbnmq

Best Practices & Lesson Learned from 100+ ITGRC Implementations

ConCert FAQ s Last revised December 2017

Oracle Buys Automated Applications Controls Leader LogicalApps

Best Practices in Enterprise Data Governance

INTRODUCTION TO DATA GOVERNANCE AND STEWARDSHIP

IBM InfoSphere Information Analyzer

Effective Risk Data Aggregation & Risk Reporting

IBM InfoSphere Information Server Version 8 Release 7. Reporting Guide SC

12 Minute Guide to Archival Search

Shaping the Cloud for the Healthcare Industry

Data Governance. Mark Plessinger / Julie Evans December /7/2017

STEP Data Governance: At a Glance

DATA SHEET RSA NETWITNESS PLATFORM PROFESSIONAL SERVICES ACCELERATE TIME-TO-VALUE & MAXIMIZE ROI

Agile Master Data Management TM : Data Governance in Action. A whitepaper by First San Francisco Partners

Operationalizing Cybersecurity in Healthcare IT Security & Risk Management Study Quantitative and Qualitative Research Program Results

Working with Health IT Systems is available under a Creative Commons Attribution-NonCommercial- ShareAlike 3.0 Unported license.

EuroRec Functional Statements Repository. EHR-QTN Workshop Vilnius, January 26, 2011 Dr. Jos Devlies, Belgium

Mortality, Mayhem and You: Risk Management in Digital Health

A SERVICE ORGANIZATION S GUIDE SOC 1, 2, & 3 REPORTS

Demystifying GRC. Abstract

Strategy & Planning: Data Governance & Data Quality

HIPAA Compliance is not a Cybersecurity Strategy

Data Protection. Practical Strategies for Getting it Right. Jamie Ross Data Security Day June 8, 2016

a publication of the health care compliance association MARCH 2018

How to Write an MSSP RFP. White Paper

Healthcare Security Success Story

VMware Cloud Operations Management Technology Consulting Services

WHO SHOULD ATTEND? ITIL Foundation is suitable for anyone working in IT services requiring more information about the ITIL best practice framework.

Cyber Risk Program Maturity Assessment UNDERSTAND AND MANAGE YOUR ORGANIZATION S CYBER RISK.

The Role of IT in HIPAA Security & Compliance

Making a Business Case for Electronic Document or Records Management

Advisor Workstation Quick Start Guide

(60 min) California State Updates

Match data set availability to data resource requirements, including gap analysis and remediation assistance.

IBM Software IBM InfoSphere Information Server for Data Quality

Accelerate Your Enterprise Private Cloud Initiative

The Importance of Data Profiling

GDPR: A QUICK OVERVIEW

Data Clairvoyance. A business approach to data. Real data practitioners, delivering real improvements to your enterprise data assets.

The Emerging Data Lake IT Strategy

Clearwater HIPAA Security Assessment Software. Demonstration

Go Cloud. VMware vcloud Datacenter Services by BIOS

Designing High-Performance Data Structures for MongoDB

Governance, Risk, and Compliance: A Practical Guide to Points of Entry

Find Hidden Budget Dollars with PDF Software. How making a software switch can improve your bottom line

Building Better Interfaces: HL7 Conformance Profiles

How to get the Enterprise to Understand the Value of Security

Use Cases for Argonaut Project -- DRAFT Page

Big data privacy in Australia

Comments submitted at: ange+framework

Sage Data Security Services Directory

Achieving regulatory compliance by improving data quality

How Secure Do You Feel About Your HIPAA Compliance Plan? Daniel F. Shay, Esq.

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

Matt Quinn.

Modern Database Architectures Demand Modern Data Security Measures

Protecting PHI in the Cloud. Session #47, February 20, 2017 Kurt J. Long, Founder & CEO, FairWarning, Inc.

Vendor: The Open Group. Exam Code: OG Exam Name: TOGAF 9 Part 1. Version: Demo

Chances & Challenges of ehealth

How to Become a DATA GOVERNANCE EXPERT

2016 Survey: A Pulse on Mobility in Healthcare

Search Engine Optimisation

From Integration to Interoperability: The Role of Public Health Systems in the Emerging World of Health Information Exchange

Engaging Executives and Boards in Cybersecurity Session 303, Feb 20, 2017 Sanjeev Sah, CISO, Texas Children s Hospital Jimmy Joseph, Senior Manager,

HOW WELL DO YOU KNOW YOUR IT NETWORK? BRIEFING DOCUMENT

Session 4.07 Accountability for Use or Disclosure of a Patient s Electronic Record

The HITECH Act. 5 things you can do Right Now to pave the road to compliance. 1. Secure PHI in motion.

VMware vcloud Air Network Service Providers Ensure Smooth Cloud Deployment

Data Governance: Are Governance Models Keeping Up?

A Distinctive View across the Continuum of Care with Oracle Healthcare Master Person Index ORACLE WHITE PAPER NOVEMBER 2015

Update from HIMSS National Privacy & Security. Lisa Gallagher, VP Technology Solutions November 14, 2013

Achieving effective risk management and continuous compliance with Deloitte and SAP

The Value of Data Modeling for the Data-Driven Enterprise

OVERVIEW BROCHURE GRC. When you have to be right

Improve the User Experience on Your Website

Incentives for IoT Security. White Paper. May Author: Dr. Cédric LEVY-BENCHETON, CEO

Navigating the Clouds Fortifying ITIL for Cloud Governance

ICAEW REPRESENTATION 68/16

Helping State Government Agencies Deliver a Better Constituent Experience through Better Communications

The New Healthcare Economy is rising up

Transcription:

WHITE PAPER 10101000101010101010101010010000101001 10101000101101101000100000101010010010 The Role of Data Profiling In Health Analytics 101101010001010101010101010100100001010 101101010001011011010001000001010100100 010001010101010101001010101001010010110 110011010001010101010101010100101001010 An Encore Point of View Randy L. Thomas October 2013 AN ENCORE POINT OF VIEW Using data captured in the course of providing healthcare has never been more important. The shift from fee-for-service to fee-for value demands that healthcare organizations understand the quality and cost of the care they provide to the communities they serve. To succeed in the new world of at-risk contracts and responsibility for not only clinical outcomes but the overall health status of defined populations, data-driven decisions in support of performance improvement has never been more critical. Healthcare organizations must harvest the data from across its portfolio of implemented applications and put it to use re-purpose it in pursuit of healthier people at optimal cost. This requires that organizations integrate and aggregate data from across the continuum, even including data from affiliated organizations. And the shift to electronic clinical quality measures (ecqm) included in at-risk contracts and required by regulation requires discrete, consistent and reliable data. 2013 Encore, A Quintiles Company

Key Points for Data Reusability Direction have an analytics roadmap Governance enterprise-wide accountability for data assets Integrity profile data to assess reliability Improvement workflow remediation to improve data reliability AN ENCORE POINT OF VIEW, cont. Health care organizations have been building data warehouses of various sorts (clinical, financial, enterprise) for a number of years. These initiatives typically start with high hopes for the reporting, measurement and analytics the collected data and associated technology will support. The reality, sadly, often does not meet expectations. Usually, as the first reports and dashboards start rolling out from the warehouse, the organization will hear cries of this can t be right and I don t believe the data. Why? What goes wrong? Is it the data model or reporting tools or ability to access the data? The answer to the last question is most likely no. No data model is perfect, every reporting tool has certain limitations and data accessibility must be balanced with HIPAA, other privacy and compliance requirements; but these issues aren t the primary reasons that expectations are unmet. Often the data populating the warehouse just isn t as expected. Data dictionaries, organizational standards, and pick lists for data entry fields may describe the intent of a particular data field but don t guarantee that the data captured in the source system actually reflects that intent. Front-end users are incredibly inventive at finding ways to circumvent edit checks and data field requirements to expedite completing a transaction (e.g., registering a patient) as the pressure to speed throughput increases. It is also unwise to assume that data complying with an HL7 transaction standard is 100% as expected. Yet ecqms and the analytics supporting performance improvement require consistent, reliable data. To ensure the data landing in a data warehouse or even in an analytic application is as expected, an organization needs to profile and evaluate that data prior to first moving it. WHAT IS DATA PROFILING Data profiling is the activity that examines the format and values of data and helps an organization understand exactly the state it s in. Data profiling describes data in a way in which the data s strengths and weaknesses become apparent.¹ It is the statistical analysis and assessment of the data values in source systems (e.g., EHR, ADT) for consistency, uniqueness and logic. Profiling evaluates the actual content, structure and quality of the data by exploring relationships that exist between value collections both within and across data sets (e.g., valid phone numbers or ICD-9 procedure codes). Data profiling will convey: Characteristics of the data compared to what is expected in a field Facts about the data (e.g., occurrence of null values) Fields that need further investigation Data profiling does have limitations, however. It will not convey: Accuracy of the data (e.g., procedure miscoded) Rules that apply to the data Efficiency of data capture process The Role of Data Profiling In Health Analytics : WHITE PAPER 2

There are two aspects of data profiling quantitative and qualitative. The first can be done with any number of data profiling tools or even spreadsheets but the second requires subject matter expertise specific to the type of data (e.g., a clinician to evaluate validity TWO ASPECTS OF DATA PROFILING There are two aspects of data profiling quantitative and qualitative. The first can be done with any number of data profiling tools or even spreadsheets but the second requires subject matter expertise specific to the type of data (e.g., a clinician to evaluate validity of clinical data). QUANTITATIVE DATA ASSESSMENT Quantitative assessment identifies the following: Type (e.g., numeric or text) Format (e.g., nn.n or mm/dd/yyyy) Frequency (e.g., count of null values, count of all 9s in a field) Reference Table Matches (e.g., discharge disposition codes) This quantitative evaluation results in a set of statistics about the data. Example: of clinical data). Figure 1. Quantitative Evaluation Results The data source has been processed by a data profiling tool, which has provided the above counts and percentages that summarize the following field content characteristics: NULL count of the number of records with a NULL value Missing count of the number of records with a missing value but not null (e.g., space instead of a number or letter or punctuation mark) Actual count of the number of records with an actual value (i.e., non-null and non-missing) Completeness percentage calculated as Actual divided by the total number of records Cardinality count of the number of distinct actual values (i.e., value occurs only once, not repeated; e.g., unique patient ID) Uniqueness percentage calculated as Cardinality divided by the total number of records Distinctness percentage calculated as Cardinality divided by Actual The Role of Data Profiling In Health Analytics : WHITE PAPER 3

The variety of both source systems and the data captured in these systems is quite broad in healthcare. QUALITATIVE DATA ASSESSMENT Qualitative aspects of data profiling involve a subject matter expert manually examining the data values for rational values. For example, a field in an EHR labeled temperature may contain data in the correct type and format (nn.n nnn.n) but the range of values may exceed what is logical from a clinical perspective such as a temperature exceeding 500. Or a field labeled vaccine dose might comply with reference table values and expected percentage of null values, but the dose amount may be incorrect (e.g., 0.5 ml vs. 5 ml). Qualitative analysis can also identify data fields that capture similar information in different ways (duplicative fields). For example, a field in an EHR labeled Tobacco Type may have a pick list for the type of tobacco utilized (e.g., cigarettes, oral, cigar, pipe). The same EHR may also contain individual fields that indicate a Yes/No response for field names Cigar use, Oral Tobacco Use, Pipe use. A subject-matter expert would identify these duplicate fields during a qualitative assessment. Other industries may not have the same need for the qualitative aspects of data profiling as healthcare. The variety of both source systems and the data captured in these systems is quite broad in healthcare. Other factors have contributed to the variability of actual data values compared to expected data values, particularly when it comes to clinical systems or any system when time is scarce and the pressures to encourage adoption or complete transactions is great. Often documentation and consistent training for the intent of a field is minimal, confusing or does not exist. This leaves the end user the role of interpreting the intent of the field resulting in inconsistency across a health system. Inconsistent change control practices may also lead to creation of duplicate or ill-defined fields. An example of how variation can occur in a seemingly simple and straightforward field is a telephone number. Figure 2. Telephone Number Data Standards In these examples, 9 represents any digit (i.e., number), A represents any upper case alpha (i.e., letter) character, and a represents any lower case alpha character. All may be valid formats and data types in different applications but to bring this information into a common repository, this variation needs to be understood and transformed into a common format to enable consistent re-use of the data.² The Role of Data Profiling In Health Analytics : WHITE PAPER 4

Linked with appropriate enterprise-wide data governance and ongoing data management processes, data profiling will set any organization on the fast track to driving actionable information from the data it collects. IMPROVING DATA RELIABILITY In the case of EHRs, system adoption frequently trumps standardization of data capture. Often, as long as the providers can find the data they need to enable clinical decision-making and plan interventions, standardizing where and how data is captured isn t paramount. But, when data is moved to an enterprise data warehouse to support analytics and measurement, inconsistencies are uncovered that render the information suspect. Data profiling can identify these issues prior to moving the data. Then decisions can be made about whether the data can be fixed as it is moved (e.g., multiple different code sets that mean the same thing mapped to a standard code set) or workflow can be modified at the front end to ensure consistent, accurate data capture moving forward. Decisions can also be made about how firmly to lock down a data field such as making it mandatory and prescribing a set of values (ensuring discrete data). But this rigidity of data capture needs to be used judiciously; users will find their way around what they perceive are barriers to completing their tasks (e.g., registering a patient, documenting a clinical finding) as quickly as possible. Often, the path to convince end users to change workflow and accept perceived limitations on valid data values is through their frustration at not being able to obtain the analysis and measurement using the data they collect. Only when they can tangibly see the reward will the pain be worth it! VALUABLE ENTERPRISE ASSET Healthcare organizations are now coming to the realization that data is a valuable enterprise asset. Time and money invested in implementing EHRs and other systems at the front line of patient care are great sources of data to support analytics and measurement for performance improvement and other initiatives. To ensure the data is accurate, reliable and consistent, data profiling can establish a realistic picture of the current state of the data and help guide efforts to improve its re-usability. Linked with appropriate enterprise-wide data governance and ongoing data management processes, data profiling will set any organization on the fast track to driving actionable information from the data it collects. The Role of Data Profiling In Health Analytics : WHITE PAPER 5

REFERENCES 1. Elliott King, Data Quality 101: The Ultimate Guide for Data Stewards, A MELISSA DATA ebook; Downloaded from http://www.melissadata.com/whitepaper/data -quality-stewards-ebook.asp 2. Data Profiling: The Foundation for Data Management; DataFlux; October, 2003. ABOUT ENCORE Encore, A Quintiles Company, is one of the most successful consulting firms in the health information technology (HIT) industry. Founded in 2009 and led by Encore CEO Dana Sellers and President Tom Niehaus, the company provides consulting services and solutions that assist its expanding client base with a wide range of HIT strategy, advisory, implementation, process-redesign, and optimization initiatives. Encore focuses on capturing the right data at the right time, establishing analytical capabilities that meet the evolving information and reporting needs of healthcare providers to document and improve clinical and operational performance. For more information about Encore, please visit www.encorehealthresources.com. The Role of Data Profiling In Health Analytics : WHITE PAPER 6