SOLUTION OVERVIEW DATA CATALOGS FOR DATA RATIONALIZATION

Similar documents
SOLUTION OVERVIEW: DATA CATALOGS FOR DATA RATIONALIZATION

Frequently Asked Questions

High Security SaaS Concept Software as a Service (SaaS) for Life Science

Date: October User guide. Integration through ONVIF driver. Partner Self-test. Prepared By: Devices & Integrations Team, Milestone Systems

INSTALLING CCRQINVOICE

Independent Adjudication for Customers. Royal Institution of Chartered Surveyors (RICS) Application Form

Introduction. by Surekha Parekh

Access the site directly by navigating to in your web browser.

Your New Service Request Process: Technical Support Reference Guide for Cisco Customer Journey Platform

Long Term Project WITS software modernization

Forcepoint UEBA Management of Personal Data

THE CASE FOR MOVING TO DISK-BASED ARCHIVES FOR ENTERPRISE IT

INFORMATION TECHNOLOGY SERVICES NIST COMPLIANCE AT FSU - CONTROLLED UNCLASSIFIED INFORMATION

CA CMDB Connector for z/os

CLOUD & DATACENTER MONITORING WITH SYSTEM CENTER OPERATIONS MANAGER. Course 10964B; Duration: 5 Days; Instructor-led

Customer Upgrade Checklist

NTP SOFTWARE SEAMLESS ARCHIVER FOR DELL EMC S ELASTIC CLOUD STORAGE (ECS)

McGill University School of Computer Science COMP-206. Software Systems. Due: September 29, 2008 on WEB CT at 23:55.

EU General Data Protection Regulation

TRANSPIRE Data Management plan Version 1.0 April

Privacy Policy. Information We Collect. Information You Choose to Give Us. Information We Get When You Use Our Services

General Data Protection Regulation (GDPR) for CEO s Quick overview & impact

UPGRADING TO DISCOVERY 2005

ERS IT Portfolio Report

CodeSlice. o Software Requirements. o Features. View CodeSlice Live Documentation

Top 10 Concerns: Legacy Archiving Solutions

HP Server Virtualization Solution Planning & Design

Overview of Data Furnisher Batch Processing

25years. MQ High Availability. Celebrating. David Ware IBM MQ Chief Architect. MQ Technical Conference v

Privacy Policy. What this policy covers. What information we collect about you

ComplyWorks Subscription User Guide. October 6, 2011

RISKMAN REFERENCE GUIDE TO USER MANAGEMENT (Non-Network Logins)

Module: Items in DSpace

Please contact technical support if you have questions about the directory that your organization uses for user management.

Introduction to Mindjet on-premise

OBSERVATIONS FROM CYBERSECURITY EXAMINATIONS

FLEXPOD A Scale-Out Converged System for the Next-Generation Data Center

ClassFlow Administrator User Guide

ONLINE GRANT APPLICATION INSTRUCTIONS

USPS Picture Permit indicia

PRIVACY AND E-COMMERCE POLICY STATEMENT

Constituent Page Upgrade Utility for Blackbaud CRM

Pages of the Template

Nominee: Green Mountain

BMC Remedyforce Integration with Remote Support

Succeed in ISO/IEC Audit Checks. Bob Cordisco Systems Engineer

Procurement Contract Portal. User Guide

Overview. Enhancement for Policy Configuration Module

Educational Satalite signal

IT Essentials (ITE v6.0) Chapter 8 Exam Answers 100% 2016

SSDNow vs. HDD and Use Cases/Scenarios. U.S.T.S. Tech. Comm

Implementing a Data Warehouse with Microsoft SQL Server

Licensing the Core Client Access License (CAL) Suite and Enterprise CAL Suite

EcoStruxure for Data Centers FAQ

Cisco EPN Manager Network Administration

Admin Report Kit for Exchange Server

Cisco EPN Manager Operations

Getting Started with DocuSign

Performance and Scalability Benchmark: Siebel CRM Release 7 on IBM eserver pseries and IBM DB2 UDB. An Oracle White Paper Updated September 2006

ITIL and ISO20000 Pick One or Use Both? Track: Business Services

John R. Robles CISA, CISM, CRISC

HP OpenView Performance Insight Report Pack for Quality Assurance

Vulnerability Protection A Buffer for Patching

Proper Document Usage and Document Distribution. TIP! How to Use the Guide. Managing the News Page

DELL EMC DATA PROTECTION ARCHITECTURE FOR VMWARE SOLUTION BRIEF

Information on using ChurchApp

Dynamic Storage (ECS)

TRAINING GUIDE. Overview of Lucity Spatial

Beyond Continuous Build: Build Grids. Darryl Bowler, CollabNet

Networks: Communicating and Sharing Resources. Chapter 7: Networks: Communicating and Sharing Resources

Reporting Requirements Specification

Ascii Art Capstone project in C

Oracle Database 11g Replay: The In-built Recorder for Real Application Testing

ALCATEL-LUCENT RAINBOW TM

A solution for automating desktop applications with Java skill set

TL 9000 Quality Management System. Measurements Handbook. SFQ Examples

Registering for FEMA assistance

Use of GIS & GPS in Trail and Land Management

Chapter 1 Introduction. What is a Design Pattern? Design Patterns in Smalltalk MVC

E-Lock Policy Manager White Paper

Exchange Archive Monitoring

Power365. Quick Start Guide

Performance of VSA in VMware vsphere 5

Frequently Asked Questions Read and follow all instructions for success!

Chapter 14. Basic Planning Methodology

BMC Remedyforce Integration with Bomgar Remote Support

Report Writing Guidelines Writing Support Services

ShapethefutureofCloud

S4S Support Services. Audit4 version 14+ Aug Copyright 2017 S4S Pty Ltd. S4S Pty Ltd. Phone: Web:

You may receive a total of two GSA graduate student grants in your entire academic career, regardless of what program you are currently enrolled in.

Hands-on Windows Azure Application Architecture & Development (3 days)

System Requirements. SAS Digital Marketing 6.5. Overview. Major Sections in this Document. Installation Requirements. Third-Party Support

Dear Student, Here is a sample of how the immunization process will work for Fall 2018:

Link-layer switches. Jurassic Park* LANs with backbone hubs are good. LANs with backbone hubs are bad. Hubs, bridges, and switches

Software Toolbox Extender.NET Component. Development Best Practices

Top 10 Questions About the Next-Generation Registration Directory Service (RDS)

Guidelines for Electronic Abstract Submission 29th International Nursing Research Congress July 2018 Melbourne, Australia

Mobile Usability Testing Notes: Participant 2

Product Documentation. New Features Guide. Version 8.7.5/XE6

SmartLink for Albridge Web Services

Transcription:

SOLUTION OVERVIEW DATA CATALOGS FOR DATA RATIONALIZATION

Intrductin Hw big f a prblem is data redundancy? If yu are like mst cmpanies, it is much bigger than yu wuld care t admit. Fr mst businesses data has becme like flies t flypaper. When yu cnsider that fr every prductin database, yu als tend t spawn a develpment, test, QA, staging, experimental and reprting database, it desn t take lng fr that single database t result in 6x mre in ttal vlume. That additinal vlume requires additinal database licenses, plus strage, as well as dat base administratrs t manage the entire thing. In fact, if we think abut the data prliferatin pyramid, the multiplicatin factr is almst 50x. And nce cst f strage began its steep descent in 2005, and clud prviders came alng and began t prvide lts f space at lw cst, rganizatins used that as an excuse t cntinue t hard data, even if they weren t actually using it. SOLUTION OVERVIEW DATA CATALOGS FOR DATA RATIONALIZATION WATERLINE DATA 2017 2

The Hidden Cst f Big Data In 2016, Veritas surveyed 10,000 rganizatins and 83% f IT decisin makers said their rganizatins were data harders. This is neither a surprise nr a bad thing because the cst f strage has becme relatively cheap. But while it is cheap, it still isn t free. Let s d the math! 83% f rganizatins are data harders In ne example, a cnsumer retail cmpany has ver 8 petabytes f data currently csting them abut $1 per gigabyte (this is a very cnservative number that des nt cnsider lts f factrs including database sftware csts, peple csts r strage management sftware), r $8 millin t stre. They als estimate that apprximately 30% f that data (2.4 petabytes) is redundant and can be eliminated, resulting in savings f $2.4 millin. S, what is stpping them? They knw at a high level they have lts f duplicate data, but at a granular level they dn t knw exactly what infrmatin is the riginal surce, what is replicated and what can be eliminated. They have n data map tracking all their data, its freshness r its prvenance. The result is that they hard everything, keeping redundant legacy develpment and QA data arund fr ld prjects that have lng since been cancelled. They maintain data marts that were created fr a specific ne-time analysis, but then never delete them, all the time cntinuing t pay the $1 per GB fr duplicate data that is n lnger in use. And these strage csts dn t even cver all the ancillary csts f redundant dark data. Much f that dark data culd als pse security, cmpliance and ther crprate risks that are hard t calculate. Imagine if sme f that data vilated the upcming EU Gverning Data Prtectin Regulatin (GDPR), als knwn as the right fr a cnsumer t be frgtten. That regulatin carries a fine up t 4% f wrldwide revenues. The implicatin is that redundant, dark-data can n lnger be left sitting arund. The physical cst and the ptential risk is becming t high. SOLUTION OVERVIEW DATA CATALOGS FOR DATA RATIONALIZATION WATERLINE DATA 2017 3

Using a Data Catalg t Ratinalize Data The reslutin t this issue is cnceptually simple. By implementing Waterline Smart Data Catalg, yur rganizatin can dcument the lcatin, quality and lineage f data lcated in relatinal surces, in the clud r in Hadp clusters. A smart data catalg autmatically tags the data fields acrss different data sets with cmmn business terms, dcuments its verall quality, existing data lineage and inferred missing lineage. During this prcess, it als identifies where yu have high levels f redundant data that can be ratinalized. the catalg identifies high levels f duplicate data fr ratinalizatin SOLUTION OVERVIEW DATA CATALOGS FOR DATA RATIONALIZATION WATERLINE DATA 2017 4

Hw Des It Wrk? The Waterline Data Catalg is built n the premise that there are three parts t the data catalging prcess. The first is t help data prfessinals discver, rganize and curate the data; the secnd is t allw gvernance prfessinals leverage tagged data fr cmpliance reprting and access cntrl and the third is t expse the newly rganized data fr business prfessinals t use. Discver Organize Curate Cmpliance Reprting Access Cntrl Search Rate & Cllabratin DATA PROFESSIONALS GOVERANCE PROFESSIONALS BUSINESS PROFESSIONALS Discver: Autmatically and incrementally fingerprint data and infer data lineage at scale by analyzing actual data values Organize: Uses machine learning t autmatically tag and match data fingerprints t glssary terms Curate: Human reviewers accept r reject tags, machine learning fine tunes the tagging prcess and imprves the matching algrithm Cmpliance: Map yur cmpliance plicies t yur data assets: acceptable use, legal hlds, expiry. Reprting: Simplify nging mandated reprting as the catalg uncvers dark data and catalgs it dynamically Access cntrl: Autmate data access cntrl via tag based security Search: Search fr data thrugh the Waterline GUI r thrugh integratin via 3rd party applicatins Rate & Cllabrate: Users cllabrate t create subjective crwdsurced ratings/reviews which cmbined with bjective prfiling metadata prvides users with a view int data quality and usefulness. SOLUTION OVERVIEW DATA CATALOGS FOR DATA RATIONALIZATION WATERLINE DATA 2017 5

Benefits fr Data Ratinalizatin Autmating the end t end data catalging prcess with Waterline Smart Data Catalg significantly reduces the excess cst f redundant data: Ratinalizes excess data, reducing the cst ft Redundant database sftware licensing & supprt Excess server & strage cst Excess data center perating expenses Reduces business risk due t: Data stred fr gvernment cmpliances rules and regulatins Database perfrmance degradatin Questinable data quality Cnclusin Data ratinalizatin presents a large cst savings pprtunity fr mst IT rganizatins. By autmating the underlying prcess fr inventrying, tagging and curating data and data lineage, redundant data can be identified, ratinalized and the savings can be easily harvested. The bttm line is that data redundancy is a much larger hidden cst than mst rganizatins realize. The flip side hwever is that this is a cst that rganizatins are already bearing. This means it is als a surce t mine fr savings and budget. By eliminating the excess strage, database and assciated csts f data redundancy, there is a significant vein f gld t mine in cst savings that can be used t fund ther data related prjects. waterlinedata.cm Sales Technical Supprt Crprate Headquarters sales@waterlinedata.cm Visit the Supprt Center 201 San Antni Circle Suite 260 help@waterlinedata.cm Muntain View CA 94040 SOLUTION OVERVIEW DATA CATALOGS FOR DATA RATIONALIZATION WATERLINE DATA 2017 (650) 946-2104 6