Metadata Architectures

Similar documents
W H I T E P A P E R : O P E N. V P N C L O U D. Implementing A Secure OpenVPN Cloud

Disk-Based Data Protection Architecture Comparisons

Tips for creating a VDI backup plan

Cisco Start. IT solutions designed to propel your business

KEY FINDINGS INTERACTIVE GUIDE. Uncovering Hidden Threats within Encrypted Traffic

Enterprise Geographic Information Servers. Dr David Maguire Director of Products Kevin Daugherty ESRI

IBM iseries Models 800 and 810 for small to medium enterprises

Enabling Branch Office Consolidation

Technical Overview. Elastic Path Commerce

RED HAT ENTERPRISE LINUX. STANDARDIZE & SAVE.

Focus On: Oracle Database 11g Release 2

The Business Case for a Web Content Management System. Published: July 2001

CA ERwin Data Profiler

The Value of Force.com as a GRC Platform

ORACLE DEPLOYMENT DECISION GUIDE: COMPARING YOUR DEPLOYMENT OPTIONS

White Paper BC/DR in the Cloud Era

Tableau Metadata Model

How to Leverage Containers to Bolster Security and Performance While Moving to Google Cloud

Data Quality in the MDM Ecosystem

W H I T E P A P E R U n l o c k i n g t h e P o w e r o f F l a s h w i t h t h e M C x - E n a b l e d N e x t - G e n e r a t i o n V N X

Analyzing the Economic Value of HPE ConvergedSystem 700 in Enterprise Environments. By Mark Bowker, Senior Analyst and Adam DeMattia, Research Analyst

PAGE - 16 PAGE - 1. Sometimes, the solution is just a benchmark away..

Virtualizing Open Text Fax Server with Realtime Fax over IP and Open Text Fax Gateway

Making the case for SD-WAN

Remodel. New server deployment time is reduced from weeks to minutes

Data center interconnect for the enterprise hybrid cloud

Peer Software and Scality - A Distributed File System Approach to Scale-out Storage

e BOOK Do you feel trapped by your database vendor? What you can do to take back control of your database (and its associated costs!

WHITE PAPER. How Virtualization Complements ShoreTel s Highly Reliable Distributed Architecture

802.11n in the Outdoor Environment

Qualitative ROI for MDA Projects. Ken Sayers - Chubb and Son, Inc. OMG UML Workshop San Francisco, CA October 21-24, 2002

NetAnalyst Test Management Software Automated, Centralized Network Testing. NetComplete Service Assurance Solutions Portfolio

Technology Insight Series

Overcoming IT Challenges in the Education Segment Leveraging Cloud and On-Premise Resources for Maximum Impact

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

Terminal Emulation Productivity Continues. Terminal Emulation. Productivity Continues. 1

Database Systems: Design, Implementation, and Management Tenth Edition. Chapter 1 Database Systems

I D C T E C H N O L O G Y S P O T L I G H T. V i r t u a l and Cloud D a t a Center Management

Configuration changes such as conversion from a single instance to RAC, ASM, etc.

Discover the all-flash storage company for the on-demand world

DocuPhase Enterprise Configuration Guide

Implementing ITIL v3 Service Lifecycle

Evolution For Enterprises In A Cloud World

IBM Real-time Compression and ProtecTIER Deduplication

Enterprise Data-warehouse (EDW) In Easy Steps

Pivot3 Acuity with Microsoft SQL Server Reference Architecture

Symantec Data Center Transformation

Smart Data Center From Hitachi Vantara: Transform to an Agile, Learning Data Center

Hybrid WAN Operations: Extend Network Monitoring Across SD-WAN and Legacy WAN Infrastructure

Hitachi Unified Compute Platform Pro for VMware vsphere

Verint Knowledge Management Solution Brief Overview of the Unique Capabilities and Benefits of Verint Knowledge Management

THE FOUR STEPS TO A TAPELESS BACKUP ENVIRONMENT: YOUR HOW-TO GUIDE FOR DATA MANAGEMENT SUCCESS INTRODUCTION

Service Delivery Platforms and the Evolving Role of OSS by Doug Bellinger

SAP security solutions Is your business protected?

Delivering High-Throughput SAN with Brocade Gen 6 Fibre Channel. Friedrich Hartmann SE Manager DACH & BeNeLux

Inside SD-WAN: WAN Virtualization Traffic Routing Options

Informatica Data Quality Product Family

Hybrid Cloud for Business Communications

Hitachi Virtual Storage Platform Family: Virtualize and Consolidate Storage Management

CENTRALIZED UPS AND DISTRIBUTED UPS: A COMPARISON DELTA POWER SOLUTIONS

Why Information Architecture is Vital for Effective Information Management. J. Kevin Parker, CIP, INFO CEO & Principal Architect at Kwestix

Introduction to iscsi

Big Data Integration BIG DATA 9/15/2017. Business Performance

Public, Private, or Hybrid Cloud

Model Driven Architecture and Rhapsody

Design Build Services - Service Description-v7

Network Diversity and Survivability:

Why Converged Infrastructure?

Optimizing and Managing File Storage in Windows Environments

A Practical Guide to Cost-Effective Disaster Recovery Planning

Data Driving the Smart Grid

HCI: Hyper-Converged Infrastructure

Introduction to K2View Fabric

Move Beyond Primitive Drawing Tools with SAP Sybase PowerDesigner Create and Manage Business Change in Your Enterprise Architecture

FIPS Validated i WLAN

FROM A RIGID ECOSYSTEM TO A LOGICAL AND FLEXIBLE ENTITY: THE SOFTWARE- DEFINED DATA CENTRE

WHITE PAPER: ENTERPRISE SOLUTIONS. Disk-Based Data Protection Achieving Faster Backups and Restores and Reducing Backup Windows

Crewstation-over-IP. Real-time mission-critical remote multi-desktops. Esterline Control & Communication Systems. Content

The Transition to Networked Storage

Executive Brief CHOOSING THE RIGHT MODEL FOR ENTERPRISE BACKUP & RECOVERY. The Executive Guide Q417-CON-10681

Veritas Storage Foundation for Oracle RAC from Symantec

Optimizing Pulse Secure Access Suite with Pulse Secure Virtual Application Delivery Controller solution

FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION

The Economic Benefits of a Cooperative Control Wireless LAN Architecture

Accelerate Your Enterprise Private Cloud Initiative

Virtualizing the SAP Infrastructure through Grid Technology. WHITE PAPER March 2007

Implementing Fax over IP in your Organization

BRINGING CLARITY TO THE CLOUD

Challenges of Analyzing Parametric CFD Results. White Paper Published: January

Oracle Exadata Statement of Direction NOVEMBER 2017

Backup-as-a-Service Powered by Veritas

Core Services for ediscovery Perfection

MODERNIZE INFRASTRUCTURE

Backup and Recovery: New Strategies Drive Disk-Based Solutions

The Case for Virtualizing Your Oracle Database Deployment

Iron Networks, Inc. Turnkey Converged Infrastructure-as-a-Service Platforms

Next Generation Backup: Better ways to deal with rapid data growth and aging tape infrastructures

Lower Total Cost of Ownership for a J2EE Infrastructure

MS Series: Ethernet Power Study

3 Ways Businesses Use Network Virtualization. A Faster Path to Improved Security, Automated IT, and App Continuity

Transcription:

Metadata Architectures

Evaluating Metadata Architectures Introduction The IT world has practiced metadata management in one form or another for more than twenty years. IT has developed three models for metadata management: Centralized metadata management Distributed metadata management Hybrid centralized/distributed metadata management If you understand the strengths and weaknesses of these models, you can better judge how you can use them to implement an effective metadata management strategy. This paper will provide an outline of the advantages and disadvantages inherent in each architectural approach. Page 2 of 10

Centralized Architecture The centralized architecture, also known as a passive repository, is the oldest approach to metadata management. Legacy metadata repository solutions often use this approach, which uses a centralized repository to which you copy metadata from other data sources (Figure 1). The earliest centralized metadata repository solutions were homegrown solutions in flat files that captured information within text documents and spreadsheets. These repositories are considered to be the first-generation metadata repositories. Over the past two decades, the amount of information these solutions stored has grown exponentially and these solutions have evolved to be more sophisticated. The metadata has moved out of flat-file based systems and into relational databases, the secondgeneration repository which in-turn became the first commercial metadata management products. Figure 1 - Centralized Architecture: The repository copies data from other sources. Strengths of Centralized Architecture A centralized architecture approach has the following strengths: Efficient access to information Users only need to access one area to find information by navigating the repository, performing searches, or running reports. Independence from integrated systems When a centralized repository integrates with third-party products, it copies data from those products into the metadata repository. Accessing the metadata is then independent of access to the original system, as the duplication of the data in the repository frees the system from any required access to the original metadata. Ability to capture additional metadata Page 3 of 10

Since a separate database stores the metadata, you can capture additional metadata. This additional metadata may be customized extensions to the source metadata, or it may be entirely separate information not previously created by your organization. For example, you may have a system that tracks workflows extensively and provides metadata about the flows. You may want to add information such as process definitions and metrics that the source system doesn t provide. Another possibility is to associate contact information with specific objects in the target system, where the source system doesn t do contact management. Enhanced performance A single database stores all metadata, even if it comes from a third-party product, so when a user accesses information, the system retrieves it from only one location, reducing query response time. Disadvantages of Centralized Architecture A centralized architecture approach has the following weaknesses: Cumbersome implementation and maintenance If a third-party system stores the only copy of the metadata, the centralized approach must copy it into the centralized database. This can be a time-consuming procedure that requires developers to create and maintain the data transfer routines, and to execute them regularly. Some vendors may offer pre-built software solutions for data transfer, but often you ll need to customize these solutions to work with the specifics of today s highly complex and diverse systems. Keeping the metadata synchronized requires a massive effort on the part of IT. One result is to limit the amount and detail of the captured metadata. Another result is the highly manual intervention in the data loading process. The centralized copy will always be out-of-date compared to the original metadata, especially in today s fast-changing enterprise systems. The need to create and maintain the complex copying routines dramatically reduces IT productivity, while the need to wait for data to become current dramatically reduces end user productivity. Both such reductions result in low-roi systems that often become shelfware. Reduced data quality duplication in the centralized repository also compromises the quality of the data. End users often do not know the currency or validity of the data they are accessing. Additionally, the redundancy in data may result in significant divergence between the original system and its representation in the repository, despite the best efforts of IT programmers. User confidence in the system then diminishes significantly, often resulting in lower adoption by end users. Page 4 of 10

Distributed Architecture A distributed architecture, also referred to as an active repository, was developed in response to the weaknesses of centralized architecture. This approach avoids the tedious process of maintaining copies of the source metadata within the metadata repository. Rather, the distributed system accesses metadata from third-party repositories in real time. As with the centralized architecture, this approach facilitates a single access point to the information. The distributed metadata solutions are considered to be third generation repositories (Figure 2). Figure 2: Distributed Architecture The metadata engine accesses metadata in real time. Reporting Modeling base Other Metadata Sources Strengths of Distributed Architecture A distributed architecture has the following benefits: Efficient access to information As with centralized architecture, distributed architectures access information through a single system, making access to metadata easy. The focus on distribution can make it much easier to integrate disparate systems throughout the enterprise. Increased ROI through productivity The distributed architecture doesn t copy metadata into a central repository and does not require any customized transfer routines and periodic synchronization. The elimination of the need to synchronize data eliminates any need for maintenance or manual intervention. This makes for much higher productivity for both IT and the end user. Superior data quality Since metadata is accessed in real time from source systems, end users always see the most current information available. This approach guarantees that the end user is looking at the correct data, as it currently exists in the source systems. Page 5 of 10

The analysis tools of the system, such as searching and impact analysis tools, always work with current data. This greatly improves confidence of end users and increases system adoption. Disadvantages of Distributed Architecture A distributed architecture approach has the following disadvantages: Dependent on integrated systems Distributed systems access metadata in real time. The availability of the source system thus becomes a limit on the availability of metadata through the distributed system. Since a production metadata solution typically accesses production third party repositories, the issue of availability is important but is seldom an issue, as production systems typically have availability goals of greater than 99.9%. Inability to capture additional metadata A distributed approach can only access metadata from third-party repositories in real time so it cannot capture additional metadata, as it does not maintain a centralized database. This inability to customize metadata often means that IT must implement custom requirements through other tools, leading to lower IT productivity and decreasing user acceptance of the system. Hybrid Architecture A hybrid approach leverages the strengths and mitigates the weaknesses of both distributed and centralized architectures. The hybrid approach facilitates metadata access in real time from third-party sources and provides the ability to capture additional metadata attributes not existing in the source repositories. Additionally, it lets users create original data within the repository. Page 6 of 10

Figure 3: Hybrid Architecture The repository accesses metadata in real time and facilitates the capture and storage of additional metadata and original information. Reporting Modeling base Other Metadata Sources Strengths of Hybrid Architecture A hybrid architecture has the following benefits: Efficient access to information As with the other approaches, hybrid systems offer a single point of access to metadata. Increased ROI through productivity The hybrid approach combines the ability to get metadata in real time with the ability to provide customized solutions using additional metadata linked to the original metadata. This reduces the need for IT customization work and gives end users the ability to maintain their own metadata. Ability to capture additional metadata The metadata repository can store additional information, so you can capture new metadata. This additional metadata can be customized extensions to the source metadata, or it can be entirely separate information. The hybrid system can also implement metadata versioning, enabling IT and end users to track changes to their systems and metadata. Superior data quality Since end users see their metadata in real time, they always see the most current information. The ability to add extended metadata and completely new metadata can add tremendous value to the end user s productivity This greatly improves end-user confidence and increases system acceptance. Page 7 of 10

Enhanced performance By providing both real-time metadata and custom metadata together, the system improves the overall system response time by integrating disparate applications into a single system. By distributing data access across multiple back-end systems, the hybrid system can perform as well as a centralized database while providing much of the advantages of the centralized approach. Disadvantages of Hybrid Architecture Accessing source metadata in real time presents one problem: if the source system is not available, neither is the metadata. Since a production metadata solution typically accesses production third-party repositories, the issue of availability is important but is seldom an issue, as production systems typically have availability goals greater than 99.9%. Hybrid systems can mitigate this weakness through storage of baseline metadata; the end user can see the baseline as of a certain time along with the extended and new metadata even while the source system is down. Conclusion Most of the time, a hybrid approach is the optimal solution due to its flexibility in leveraging the strengths of both the distributed and centralized architectures. Even if your organization s requirements fall within the scope of a distributed or centralized architecture, in time the necessity to accommodate flexibility may change. Therefore, planning ahead by adopting the hybrid approach in lieu of limiting your system to a centralized or distributed approach may be the most scalable deployment strategy to meet your organization s short and long term growth expectations. However, if your organization only anticipates the need to access metadata within existing systems and does not expect to have the need to capture additional information, then a distributed architecture may be the most appropriate solution. A centralized approach is only appropriate when there is a technical or political roadblock preventing the adoption of a hybrid or distributed system. For instance, a technical roadblock exists if it is not possible to access metadata from a source system through programming interfaces or database queries. This is often the case for legacy systems from the mainframe days as well as for in-house developed systems. A political roadblock may often arise when administrators of mission critical systems worry that integrating their system with the metadata solution may interfere with system uptime and integrity. This is typical in systems with large groups of end-users accessing complex data sets (ERP, CRM, payroll, etc.). Page 8 of 10

Organizations should be mindful of long-term IT strategies when examining distributed or centralized solutions. Business requirements are always evolving, often resulting in changes to their supporting technical infrastructures. Keeping your metadata strategy flexible can help in managing these changes. Figure 4 outlines the requirements of metadata solutions and highlights how each of the three architectures fulfills them. Requirement Figure 4: Metadata Architectures & Requirements Matrix Centralized Architecture Distributed Architecture Hybrid Architecture Expedited Implementation One Stop Shopping for Metadata Independent from Integrated Systems * Capture Additional Metadata Performance No Redundancy High Quality No Synchronization Issues Low Maintenance Cost Cost of Ownership High Low Low Return on Investment ROI Low High High *Dependent on implementation. Metadata may be copied into the repository to make it independent from integrated systems. For Additional Information Contact:, Inc. 604 Mission Street San Francisco, California 94105 Tel: +1 (415) 947-0400 Fax: +1 (415) 947-0401 Email: info@dag.com Web: www.dag.com Fourth Edition, Copyright 2016, Inc. Page 9 of 10

Page 10 of 10