Panzura White Paper Panzura Distributed Cloud File System

Similar documents
Panzura White Paper Panzura Distributed File Locking

Rio-2 Hybrid Backup Server

ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS. By George Crump

If you knew then...what you know now. The Why, What and Who of scale-out storage

Building Backup-to-Disk and Disaster Recovery Solutions with the ReadyDATA 5200

Dell Storage Point of View: Optimize your data everywhere

Hitachi Adaptable Modular Storage and Workgroup Modular Storage

Hybrid Cloud NAS for On-Premise and In-Cloud File Services with Panzura and Google Cloud Storage

Hitachi Adaptable Modular Storage and Hitachi Workgroup Modular Storage

SQL Server Consolidation with Server Virtualization on NetApp Storage

High performance and functionality

Virtualization with Protection for SMBs Using the ReadyDATA 5200

DELL EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE

HYCU and ExaGrid Hyper-converged Backup for Nutanix

Hedvig as backup target for Veeam

Microsoft Azure StorSimple Hybrid Cloud Storage. Manu Aery, Raju S

Moving From Reactive to Proactive Storage Management with an On-demand Cloud Solution

FOCUS ON THE FACTS: SOFTWARE-DEFINED STORAGE

Symantec NetBackup 7 for VMware

FOUR WAYS TO LOWER THE COST OF REPLICATION

Optimizing and Managing File Storage in Windows Environments

Protecting Mission-Critical Application Environments The Top 5 Challenges and Solutions for Backup and Recovery

PRESENTATION TITLE GOES HERE

Why Datrium DVX is Best for VDI

STORAGE CONSOLIDATION AND THE SUN ZFS STORAGE APPLIANCE

De-dupe: It s not a question of if, rather where and when! What to Look for and What to Avoid

NEC Express5800 R320f Fault Tolerant Servers & NEC ExpressCluster Software

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

50 TB. Traditional Storage + Data Protection Architecture. StorSimple Cloud-integrated Storage. Traditional CapEx: $375K Support: $75K per Year

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES

Protect enterprise data, achieve long-term data retention

The Best Storage for Virtualized Environments

Boost your data protection with NetApp + Veeam. Schahin Golshani Technical Partner Enablement Manager, MENA

MODERNISE WITH ALL-FLASH. Intel Inside. Powerful Data Centre Outside.

powered by Cloudian and Veritas

THE SUMMARY. CLUSTER SERIES - pg. 3. ULTRA SERIES - pg. 5. EXTREME SERIES - pg. 9

Using Self-Protecting Storage to Lower Backup TCO

SoftNAS Cloud Data Management Products for AWS Add Breakthrough NAS Performance, Protection, Flexibility

7 Ways Compellent Optimizes VMware Server Virtualization WHITE PAPER FEBRUARY 2009

Virtual Disaster Recovery

1 Quantum Corporation 1

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

The Data-Protection Playbook for All-flash Storage KEY CONSIDERATIONS FOR FLASH-OPTIMIZED DATA PROTECTION

DATA CENTRE SOLUTIONS

Technology Insight Series

Clouds, Convergence & Consolidation

Eight Tips for Better Archives. Eight Ways Cloudian Object Storage Benefits Archiving with Veritas Enterprise Vault

Construct a High Efficiency VM Disaster Recovery Solution. Best choice for protecting virtual environments

Dell DR4000 Replication Overview

Discover the all-flash storage company for the on-demand world

Merging Enterprise Applications with Docker* Container Technology

The Future of Business Depends on Software Defined Storage (SDS) How SSDs can fit into and accelerate an SDS strategy

THE FUTURE OF BUSINESS DEPENDS ON SOFTWARE DEFINED STORAGE (SDS)

Technical White Paper: IntelliFlash Architecture

REFERENCE ARCHITECTURE Quantum StorNext and Cloudian HyperStore

CDMI Support to Object Storage in Cloud K.M. Padmavathy Wipro Technologies

A Promise Kept: Understanding the Monetary and Technical Benefits of STaaS Implementation. Mark Kaufman, Iron Mountain

The Nasuni Security Model

VMware vsphere 4. The Best Platform for Building Cloud Infrastructures

Scale-Out Architectures for Secondary Storage

CONTENTS. 1. Introduction. 2. How To Store Data. 3. How To Access Data. 4. Manage Data Storage. 5. Benefits Of SAN. 6. Conclusion

Copyright 2010 EMC Corporation. Do not Copy - All Rights Reserved.

MODERNIZE INFRASTRUCTURE

A Practical Guide to Cost-Effective Disaster Recovery Planning

SECURE CLOUD BACKUP AND RECOVERY

VIRTUALIZATION WITH THE SUN ZFS STORAGE APPLIANCE

Real-time Protection for Microsoft Hyper-V

Nasuni for Oil & Gas. The Challenge: Managing the Global Flow of File Data to Improve Time to Market and Profitability

Hyperconvergence and Medical Imaging

ORACLE RMAN DESIGN BEST PRACTICES WITH PANZURA QUICKSILVER CLOUD STORAGE CONTROLLERS

VMware Mirage Getting Started Guide

Simplified. Software-Defined Storage INSIDE SSS

IBM Spectrum NAS. Easy-to-manage software-defined file storage for the enterprise. Overview. Highlights

VMware Virtual SAN Technology

"Software-defined storage Crossing the right bridge"

Storage Performance Validation for Panzura

Nasuni UniFS a True Global File System

Warsaw. 11 th September 2018

THE FOUR STEPS TO A TAPELESS BACKUP ENVIRONMENT: YOUR HOW-TO GUIDE FOR DATA MANAGEMENT SUCCESS INTRODUCTION

Introduction to iscsi

Fully Converged Cloud Storage

SwiftStack and python-swiftclient

All-Flash Storage Solution for SAP HANA:

Nimble Storage Adaptive Flash

Solution Brief: Commvault HyperScale Software

A Cloud WHERE PHYSICAL ARE TOGETHER AT LAST

SOLUTION BRIEF Fulfill the promise of the cloud

SAFEGUARD INFORMATION AND ENSURE AVAILABILITY WITH THE NETAPP BACKUP AND RECOVERY SOLUTION

Understanding Virtual System Data Protection

Controlling Costs and Driving Agility in the Datacenter

Market Report. Scale-out 2.0: Simple, Scalable, Services- Oriented Storage. Scale-out Storage Meets the Enterprise. June 2010.

MASERGY S MANAGED SD-WAN

How to Increase VMware Application Availability with Shared Storage

Using Cohesity with Amazon Web Services (AWS)

São Paulo. August,

Veeam Availability Solution for Cisco UCS: Designed for Virtualized Environments. Solution Overview Cisco Public

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director

VMWARE CLOUD FOUNDATION: INTEGRATED HYBRID CLOUD PLATFORM WHITE PAPER NOVEMBER 2017

Backup and archiving need not to create headaches new pain relievers are around

Active Archive and the State of the Industry

Transcription:

Panzura White Paper Panzura Distributed Cloud File System Panzura s game-changing Distributed Cloud File System finally brings the full power and benefits of cloud storage to enterprise customers, helping to break the unending on-site storage expansion cycle while eliminating islands of storage that inhibit cross-site user interaction and productivity and real-time data protection. Panzura makes deploying cloud storage and a global file system easy and transparent to users.

Executive Summary Today, enterprise IT executives struggle with storage growth, storage capacity balancing, and data mobility. Traditional technologies such as tape, SAN, and NAS were more than sufficient to meet the needs of the past but an increasingly distributed workforce generating massive amounts of data from multiple platforms forced enterprises to consider new storage paradigms, in particular cloud storage. But adopting the cloud as a storage tier can be terribly problematic. Integrating with existing IT environments, ensuring data security, and managing data across sites plus the cloud is not a trivial exercise. Panzura created a solution based on a distributed cloud file system and global namespace that makes adding and using cloud storage seamless and secure while enabling global file sharing, cloud-integrated NAS, and data protection, including archiving, disaster recovery, and backup. Panzura Freedom Products deploy quickly and easily without changes to existing infrastructure, all while securing data with snapshots and military-grade encryption. Panzura Freedom Solutions make cloud storage a seamless, viable storage tier for enterprises of all size. Ongoing Storage Challenges As unstructured data and metadata continue to grow upwards of 62% on average per year, enterprise IT managers face an unwinnable struggle to maintain or reduce costs while ensuring user needs (i.e., SLAs) are met. Traditional means of storing data, primarily local NAS, and protecting data, primarily tape or disk-to-disk solutions, do not scale quickly or economically enough to accommodate the growth in demand from massive expansion in data generated by individuals and organizations. In addition, constantly connected individuals demand that data be available anytime, from anywhere, without reduced performance. Limited budgets, limited technology alternatives, and strict user demand have put IT on an unsustainable trajectory that must be addressed without negatively impacting organizational performance. Data storage challenge. The primary technology for storing unstructured enterprise data today is high-performance, highcost network-attached storage (NAS). With this technology, the standard method for addressing data growth is to add additional expensive spindles and/or new disk systems. In addition to the high hardware costs, more disks mean more datacenter space, power and cooling requirements, and added offsite capacity for replication. And because capacity expansion takes time, IT managers must try to forecast storage needs and forward provision to accommodate these potentially incorrect forecasts. Unanticipated spikes in storage demand send IT managers scrambling for added capacity and overly aggressive forecasts result in investment in idle capacity. When multiple sites are taken into consideration, using standard NAS can often result in over-provisioning at some sites and under-provisioning at others, as well as in storage islands where data at one site is not visible to users at other sites. The only way for users at one site to view files created or edited at another site is to save copies of files stored off-site to their own location, resulting in potentially significant duplication of files and commensurate overspending on expensive storage, not to mention significant version control challenges. This problem is compounded when backup and archiving also occurs locally, since duplicate copies on islands of storage means greater storage needed for data protection. (Figure 1). Data Time Hardware cost Power, cooling, space Forecasting demand Cross-site capacity balancing 2

As an alternative to customer-owned NAS, some vendors offer what they call Cloud NAS. Almost all of these solutions suffer from limitations in performance, scale, or both, and none are yet enterprise-class. What about the nature of the data itself? On-site NAS today supports both structured and unstructured data storage. Highperformance applications like databases and Exchange that utilize structured data require block storage in order to provide the As unstructured data and metadata continue to grow upwards of 62% on average per year, enterprise IT managers response times and synchronous replication speeds needed to avoid face an unwinnable struggle applications timing out. This storage often uses high-performance drives or, with growing frequency, SSD storage. iscsi is a common to maintain or reduce costs while ensuring user needs (i.e., SLAs) are met. interface used for block storage to provide direct disk access to these applications. But applications using unstructured data (which represents 80% of data under management, on average) store it in file systems (which provide the structure), not as blocks, and usually use interfaces like SMB or unless the apps are rewritten for iscsi. For the most part, block storage interfaces like iscsi do not lend themselves to applications using unstructured data. In addition to compatibility, block storage interfaces suffer other shortcomings relative to file-based protocols. Because block-based applications are primarily limited to single-node storage targets, their ability to scale can be quite limited compared to file-based storage systems spanning multiple servers. And unlike with unstructured data, replication of structured data requires that the disks at the replication site be identical to those at the source site. Since applications using unstructured data are disk agnostic and can address multiple servers, they are particularly suited to solutions with or interfaces and that target scalable storage, particularly object storage. Thus for optimal storage performance and cost control, administrators devote special attention to tier storage according to the specific requirements of each class of user or application. (Figure 2) Block File Example Application Interface iscsi Standard CFS/ Application Types Databases, Exchange Home Directories Share of Data <20% >80% Scalability Limited Massive Replication Storage Type Identical Mixed OK Figure 2: Comparing aspects of block- and file-based storage Data Protection Challenge. Data protection comprises archiving, backup, and disaster recovery (DR). The primary purpose is to maintain access to data that is no longer regularly used or to be able to recover current data if it is lost. For archiving, the authoritative copy of the data is stored offsite and recalled when needed. With backup, the data remains in use, or at least kept in primary storage, and a copy is made and stored for retrieval if the original data is lost. The most common method for protecting data is using a software application to direct data to disk or tape. Magnetic tape has been used for data storage for over 60 years and the technology has not changed all that much during that time. It is still in use due primarily to inertia (the devil you know ) and its perception as being cheap on a $/GB basis. Using tape, however, is very cumbersome, time consuming, and prone to error, making it a much derided medium for data backups and archiving. 3

With steep reductions in the cost of disk and the development of deduplication over the last decade, disk-to-disk archive and backup has gained more and more share from tape. Disk targets range from removable (very slow) optical disk and commodity magnetic disk to specialized backup and archiving appliances. But all disk-to-disk backup still suffers from one or more of the following major drawbacks: high cost, limited functionality, vendor lock-in, limited scalability, and cumbersome deployment and management. Sometimes disk-to-disk-to-tape methodologies are also deployed. More recently, disk-to-disk-to-cloud data protection solutions have appeared, offering the scalability, availability, and economy of the cloud as a storage target. While theoretically, using the cloud is quite appealing, in practice, integrating cloud storage into an established IT infrastructure can be incredibly problematic due to issues like latency, communication protocols, and data security. DR can leverage both backup and archiving and centers around bringing operations back up when a site either partially or fully fails. DR involves rebuilding site functionality as quickly as possible so as to minimize the impact on overall IT operations. This rebuilding can occur either in the same location or off-site at another location. Traditional reliance on tape, with its complex off-site logistics and slow data search/access, has made rapid tape-based DR unachievable. Replication (mirroring or asynchronously storing backup data of one location at another) is a common but potentially expensive way to implement a DR strategy if it involves full hardware duplication, which doubles datacenter capital costs. DR planning is challenging, time consuming, and difficult to get right. For these reasons, DR is often put off or avoided as long as possible, with organizations adopting a hope for the best strategy. The Panzura Hybrid Cloud Storage System To address the storage tiering and data protection challenges outlined above, Panzura went back to the drawing board to examine how data is created, stored, and consumed within an enterprise-class organization. By developing a proprietary cloud-integrated file system to accommodate both global file access and cloud storage, Panzura was able to overcome the main limitations of today s storage systems when used either as tiered NAS or for data protection. The Panzura Freedom Family comprises three components: Panzura Distributed Cloud File System, Freedom Flash Cache appliance, and Panzura Freedom OS (PFOS). (Figure 3) Together, they provide a cloud-ready, high-performance and secure storage platform that enables tiered NAS, seamless global file sharing, active archiving, backup, and DR across all enterprise locations, facilitating consolidation and eliminating islands of storage. Key features of the Panzura solution are described in this section. Panzura Distributed Cloud File System Panzura Freedom Flash Cache appliance Panzura Freedom OS (PFOS) Cloud Global namespace Distributed File locking Access control VM, 1U, 2U /, Cloud API Expandable to >200TB Encryption Deduplication Pinning Figure 3: Panzura Global Cloud Storage System components File-Based Storage: As discussed above, storage optimized for block data will be very different from that optimized for file data, due to different requirements. Panzura developed a high-performance file-based global storage platform for the cloud to address the 80% of current data that is unstructured. By supporting and transfer protocols commonly used by most applications, Freedom Flash Cache appliances can plug into existing IT infrastructures without any changes while connecting to all major cloud storage platforms, simplifying deployment and minimizing impact on operations. All data is managed under a distributed cloud file system, simplifying user interaction and system administration while tying into enterprise applications and targeting both local disk and the cloud. 4

Cloud Storage: Object storage, the typical storage system used in the cloud, breaks data up and stores it as flexibly-sized containers or chunks that can be individually addressed and manipulated and stored in many locations, not tied to any particular disk. Each object usually has some associated metadata. Object storage can scale to billions of objects and exabytes of capacity while protecting data with greater effectiveness than RAID. In addition, drive failure allows for very rapid recovery vs. weeks for large capacity RAID systems. This combination of scale and robustness make object storage an ideal target for warehousing enterprise data. Panzura Freedom Systems interface directly with all major cloud object storage APIs (http://www.panzura.com/partners/cloud-storage-partners/), avoiding vendor lock-in, and leverage object-based cloud storage as a data warehouse to provide massive scale and availability at a very compelling cost structure. Global File System: The heart of any storage system for unstructured data is the file system. Key file systems that have shaped the market include VxFS (Veritas), NTFS (Microsoft), WAFL (NetApp), and ZFS (Sun). A successful file system must be highly scalable, high performing, flexible, and manageable. NetApp built much of its success around WAFL and its ONTAP OS. WAFL combined RAID and the disk device manager with the file system plus replication and snapshots (limited per volume). Its primary target is HDD. ZFS put all these elements plus encryption and deduplication in one stack, is massively scalable, and targets HDD and SSD natively but has no native cloud integration. The Panzura Distributed Cloud File System was engineered to closely manage how files are managed and stored to provide seamless, high-performance, and robust data management. It improves on WAFL and ZFS while integrating cloud storage as a native capability. (Figure 4) 1992 2004 2008 WAFL ZFS RAID Disk Device Manager File System Replication Snapshots + Encryption + Deduplication + HDD/SSD + Cloud + Advanced Deduplication Figure 4: Comparison of modern file systems Any user at any location can view and access files created by anyone, anywhere, at any time. The file system dynamically coordinates where files get stored, what gets sent to the cloud, who has edit and access rights, what files get locally cached for improved performance, and how data, metadata, and snapshots are managed. The structure of the file system allows for nearly unlimited scale with up to 10,000 user-managed snapshots per volume. Panzura s innovative use of metadata and snapshots for file system updates, combined with unique caching and pinning capabilities in the Freedom Flash Cache appliance, allows customers to view data and interact through an enterprise-wide file system that is continually updated in real time. Support for extended file system access control lists (ACLs) empowers administrators to set policies that determine what access and management functions per file will be available on a per user basis. Because the file system is global and shared across all Freedom Flash Cache appliances and because all data is also stored in the cloud, all data is always available to anyone, even if network connections are temporarily lost to one site for some reason. 1 (Figure 5) 1) Any data not yet uploaded to the cloud from a controller will not be available to other sites if network access to that controller is lost. 5

White Paper - Panzura Distributed Cloud File System Boston Hong Kong Frankfurt San Jose Figure 5: Distributed cloud file system with cloud as storage tier Global Namespace: Ideally, a global storage solution would present a consistent view of all files to a user regardless of where the files are stored in the system and regardless of from which location a user is accessing the file system. Panzura has merged a distributed cloud file system with a global namespace to present a seamless view of file across the network. The Panzura global namespace presents users with a single, unified and consistent view of all files across all locations and consists of the sum of all local Panzura file systems that are known on the network. The Panzura Distributed Cloud File System intelligence tracks all copies of all files, including different versions of the same file, so users always have an up-todate directory available to them. Because Panzura has interlaced its global namespace and distributed cloud file system, the resulting file intelligence makes file storage and tracking transparent to the user and ensures that the file is available whenever and wherever it is needed. Nodes can be added as necessary and all have rapid access to all files on the network, for truly seamless scale-out capability. (Figure 6) Sydney Hong Kong Sydney Frankfurt San Jose Figure 6: Adding a node seamlessly updates distributed cloud file system 6

Distributed File Locking: Panzura Freedom Systems are designed with unique distributed file locking that manages File A: Locked write access across the entire network, by User 1 preventing data collisions and version corruption. It is impossible for more than one person on the network to edit a given version of a file at any given time and cause a data collision. When one user is editing a Figure 7: Dynamic file locking preserves file integrity and prevents file, that file is read-only locked to everyone File A: else, and anyone accessing that version data collision of the file will receive a message that it is read-only locked by user X (they can always File A: save their edits under a different version of the file). This will continue until the first user has completed their file revisions. At that point, when another user requests write access, the file lock is transferred to that user s Freedom Flash Cache appliance and all other users are locked out of write access to that file until this next user completes the edits. (Figure 7) This robust, elegant, fast system allows global file access and rapid file sharing while ensuring version integrity, avoiding the need to store multiple copies of the same file at multiple locations and the associated unnecessary network and storage capacity consumption from these duplicate files. Global Deduplication: Unlike other A B E deduplication solutions, which were B E B F designed to offset inherent data duplication in localized, inefficient file systems, Panzura designed an interconnected, global file system that stops file-level duplication before data gets stored. Since only unique copies of files across all sites are preserved by the file system, data is deduplicated before it is ever stored. Capacity is A B C D E F optimized further by running advanced, inline block-level deduplication on A A D D F C C Figure 8: Data is deduplicated within any data that gets stored on the and across sites for maximum data network to remove blocks common reduction across different files. Unlike any other deduplication provider, Panzura embeds the deduplication reference table in metadata, which is instantly shared among all Freedom Flash Cache appliances. This inline deduplication method removes data redundancy across controllers, rather than just based on data seen by a single controller. Thus each controller in the network benefits from data seen by all other controllers, ensuring even greater capacity reduction, guaranteeing all data in the cloud is unique, and driving down cloud storage and network capacity (and cost) consumed by the enterprise. (Figure 8) User 1 User 1 User 2 User 2 7

User 1 White Paper - Panzura Distributed Cloud File System Military-Grade Encryption: One of the top concerns most frequently expressed by IT professionals about cloud storage is data security. Because data is being transmitted to and stored by a 3rd-party cloud storage provider outside the corporate firewall, some worry that their data will be exposed and at risk for theft. The perception is that keeping data inside the firewall is inherently safer. This concern must be overcome by any cloud storage solution before it can become mainstream within an enterprise. Panzura addresses data security concerns directly by applying military-grade encryption to all data stored in the cloud. Each controller applies AES-256-CBC encryption for all data at rest in the cloud. In addition, all data transmitted to or from the cloud is encrypted with SSL v3.1 to prevent access via interception. Encryption keys are managed by the enterprise, never stored in the cloud. This complete, robust two-tier encryption solution is in addition to the typical multi-layer security provided by mainstream cloud storage providers. (Figure 9) In some cases, customers find that the combined security of a Panzura+cloud solution is greater than they can reasonably achieve within their own infrastructure, making cloud storage safer than some private cloud deployments. AES-256-CBC Jack and Jill went up the Hill... File encrypted before transmission SSL v3.1 1@3Fa $%As% $#m:45^ &GFJbfg vc!@dsf F34%hf dggh&<t ghf$*fx WAN Figure 9: Military-grade encryption protects data in transit and at rest in the cloud. 8

Summary The cloud offers tremendous potential for enterprises to reduce storage costs, improve productivity, and reduce data availability risk. Tapping that potential fully and effectively can provide significant competitive advantage while reducing both business and technological risk. To date, enterprises attempting to fully integrate the cloud as a storage tier have been faced with building their own limited-capability solution by kludging together different technologies from various vendors, many of which were never designed to be used with cloud storage. This Frankenstein cloud storage implementation fails to realize the full benefits of cloud storage while consuming precious IT resources in implementation and management. Panzura s Freedom Family breaks this cycle with a fully cloud-integrated enterprise storage solution to handle NAS, active archiving, DR, and backup. By designing a distributed cloud file system and global namespace with cloud integration at a fundamental level, the Panzura solution brings the cloud as a seamless storage tier for the first time while enabling global file sharing and full access to all files in the system from any location at any time. This game-changing technology finally brings the full power and benefits of cloud storage to enterprise customers, helping to break the unending on-site storage expansion cycle while eliminating islands of storage that inhibit cross-site user interaction and productivity and real-time data protection. Panzura makes deploying cloud storage easy and transparent to users. Panzura, Inc. 695 Campbell Technology Pkwy #225, Campbell, CA, USA 855-PANZURA www.panzura.com Copyright 2017 Panzura, Inc. All rights reserved. Panzura is a registered trademark or trademark of Panzura, Inc. in the United States and/or other jurisdictions. All other marks and names mentioned herein may be trademarks of their respective companies. WP-Panzura-Distributed-Cloud-File-System-022117b