CERNBox: deployment experience and status. Data & Storage Services. Jakub T. Mościcki, Massimo Lamanna, CERN IT-DSS. TNC 2014
1 CERNBox: deployment experience and status
Jakub T. Mościcki, Massimo Lamanna, CERN IT-DSS (Data & Storage Services), TNC 2014
CERN IT Department, CH-1211 Geneva 23, Switzerland
2 Content
- Background & context
- Experience deploying ownCloud
- CERNBox: next steps
- Conclusions
3 Main data storage services from IT/DSS group
- AFS: general purpose filesystem
  - 2.5 billion files, 300 TB on disk
  - 3 million reads/writes per day (34 Hz)
  - 300 disk drives, RAID model
- EOS: low-latency physics storage
  - fast file namespace (20 kHz file stats)
  - scalable IO (22 GB/s)
  - ~50 PB on disk, 170 million files
  - 30K disk drives, RAIN model
- CASTOR: archival and bulk data
  - ~100 PB on tape
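The per-day figure and the rate in hertz quoted for AFS describe the same load in different units. A quick check, assuming the load is spread uniformly over a 24-hour day (an assumption; the slide gives no load profile):

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def ops_per_day_to_hz(ops_per_day: float) -> float:
    """Average operation rate in Hz, assuming a uniform load over the day."""
    return ops_per_day / SECONDS_PER_DAY


# AFS figure from the slide: 3 million reads/writes per day.
afs_rate_hz = ops_per_day_to_hz(3_000_000)
print(f"{afs_rate_hz:.1f} Hz")  # about 34.7 Hz, which the slide quotes as 34 Hz
```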
4 The origins of the CERNBox project
- Missing link?
  - 4500 distinct IPs in DNS from cern.ch to *.dropbox.com (daily...)
- What we are missing
  - easy access cloud storage for end users
  - files go automatically to the cloud and are available always, everywhere: broken laptop ≠ data lost
  - offline access to data: work on the plane and rsync when back online
  - keep files in sync across devices, access on mobile clients
  - (easy) sharing of files with colleagues: still surprisingly difficult
- Can we have this?
  - for documents (small files, often ppts, text, ...)
  - for science data (integrated into data processing workflows and existing infrastructure)
5 CERNBox beta service
- Address the classical Dropbox use case first: for documents
- Usability and ease of use over high performance
- Based on ownCloud
6 CERNBox beta deployment
- Setup
  - 100% RH6 on standard hardware
  - Guaranteed failover (redundant nodes)
- Diagram (source: owncloud.com)
  - MySQL server, 48 GB RAM
  - Apache, PHP 5.4 (SCL 1.0), mod_proxy_balancer, 64 cores, 64 GB RAM
  - NFS servers, async, RAID JBODs
  - Initial space: 20 TB
7 Usage of CERNBox
CERNBox Beta 2014 (March / April / May):
- users: 190 (*)
- files: 191K / 907K / 1.6M
- size: 480 GB / 1 TB / 1.5 TB
(*) users inherited from the initial prototype deployment
Size per user (pie chart): shares 84%, 15%, 1%; legend: <10GB, >10GB up to 100GB
Files per user (pie chart): shares 94%, 5%, 1%; legend: <5K, 5K-20K, up to 100K
8 User feedback
- very positive, but we see many idle accounts (~50%)
- is our feedback survey skewed by the 5% enthusiastic users?
User comments:
- I start using the cernbox since I'm a heavy user of Dropbox and I recently reached the limit of free disk space (5Gb). For work it will be great to have at least 50Gb of personal space
- I would like to have is a free client for Android, which should be much more stable.
- I find the service perfect to be able to get always the files/sources/documents I need independently of the place and connection.
- On my Macbook Air I noticed that the battery was draining much faster than usual. I checked on the activity monitor and CERNBOX was consuming 80-95% of the total energy.
- What I would like to do in the future is to combine my private data like my photos for example on my home owncloud server, and my work data on the Cern owncloud server.
- I'm very glad that CERN has launched the service using the OwnCloud platform. I hope that you will be supporting this service officially soon!!
9 File access patterns
- GET/PUT ratio: 2/1
- File type distribution: 1200 different file extensions!
  - 30%: .c .h .C
  - 30%: .jpg .png
  - 15%: no extension (UNIX world!)
  - 25%: other (.pdf, .txt, .ppt, .docx, .root, .py, .eps, .tex)
- ~100 URL shares, ~40 synced shares
10 Testing ownCloud in the lab
- Our lab tests are partially automated: github.com/opensmashbox
- Client issues
  - Change detection: full scans of the local/remote folders (takes a long time, drains battery)
  - Confusing client behaviour if the filesystem is modified while syncing
  - File conflict resolution: YES; directory conflict resolution: NO
  - Corner cases of illegitimate file deletions (symlinks, character cases)
- A safety net is required to prevent data loss (e.g. local trashbin)
- Centralized admin control needed (e.g. excluded file pattern list and other capabilities)
- File versioning and trashbin are a must
- Some conflict situations may only be resolved manually by the user
- → ownCloud client improvements are under heavy development
  - Stability? How quickly can we ramp up with fixes?
11 Towards large-scale data sync and share
- The currently deployed CERNBox beta works OK so far for the classical Dropbox use case: low-frequency document sync and share
- But can we bring this system to the next level? Our core-business workflows and large-scale workloads:
  - expose PBs of existing data from day 1
  - integration into the physics data processing ecosystem (central services: batch, interactive data analysis applications)
  - sync higher data volumes at higher rates
- Can we still keep the simplicity of cloud storage access?
12 Current architecture
Diagram labels: USER; sync client (webdav); web access (https); HTTPS LB; AS/OC (three nodes); DB; filesystem (POSIX); STORAGE; data flow; metadata flow
- DB: keeps track of sync state for every file in the system; SQL overheads (Hz metadata ops)
- STORAGE: files not exposed directly to the user
13 Revised architecture
Diagram labels: USER; sync client (webdav); web access (https); HTTPS LB (several); OC; fuse; STORAGE (EOS) with namespace, IO redirect and disk servers (1000s); data flow; metadata flow
- Data directly accessible by the user: http (public data), https (private data), http (internal)
- kHz metadata ops
- All sync state kept as metadata in the storage
- Files written with USER credentials
14 Immediate access to all our data!
15 Massive scaling at reduced cost?
- No need to keep track of all files and directories in the database: avoids explosive growth of your DB infrastructure
- Our file number estimate? With 10K users we have 2.5 billion files in AFS already! What is your number for 100K users?
- Before we start throwing hardware at the problem, consider the cost of running the service
  - Fixed: hardware purchase, service deployment, infrastructure
  - Scaling: hardware incidents, user support, backup, integrity checks, upgrades
  - Infrastructure: space, electricity and cooling in the data center
- For massive scaling we need to keep TCO under control
  - profit from existing large-scale operations and support of our storage services
  - exploit economies of scale
- We have regular discussions on scaling issues with several partners
  - CNRS: DB hardware scaling analysis for 100K users: http://xstra.u-strasbg.fr/lib/exe/fetch.php?media=doc:josy-cloud:josy-cloud projet-owncore.pdf
CERNBox - J.T.Moscicki, M.Lamanna - TNC2014 Dublin
16 Integrated storage ecosystem for scientific research
Diagram labels:
- USER, CERNBox 2.0: sync / share / offline access (webdav & https://)
- online file-system access: fuse
- Analysis cluster, Central Services: high-performance application access (xrootd://)
- batch access: xrdcopy
- LARGE-SCALE STORAGE
17 CERNBox 2.0
- Advanced prototype stage
- Adapted the existing webdav interface in EOS to be compatible with ownCloud sync clients
- Test environment (EOS PPS): standard hardware
  - namespace node with Xeon 2.2 GHz, 16 cores, 24 GB RAM
  - 50 disk servers: cheap JBODs (1000 disks), total 800 TB usable space
  - Storage layout: 2 replicas in RAIN mode → every file PUT = 2 copies of the file on two independent storage nodes (with adler32 checksums of content)
  - Event-based http(s) load balancer (nginx)
- Underlying storage scalability (EOS Prod)
  - Max observed IO: 28 GB/s from 160 clients reading, with a total of 1 kHz file reads
  - Max observed file stats: 10s of kHz
  - Thousands of connected clients
- Server should never be a bottleneck for CERNBox
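The storage layout above (two copies of every file, each with an adler32 checksum of its content) can be illustrated with a minimal sketch. This is not EOS code: the storage nodes are plain dictionaries and the function names `put_with_replicas` and `verify_replicas` are hypothetical, chosen only for illustration. The checksum itself uses Python's standard `zlib.adler32`.

```python
import zlib


def adler32_hex(data: bytes) -> str:
    """Adler-32 checksum of the content as 8 hex digits."""
    return f"{zlib.adler32(data) & 0xFFFFFFFF:08x}"


def put_with_replicas(name: str, data: bytes, nodes: list) -> str:
    """Write one copy of the file, plus its checksum, to every node.

    `nodes` is a list of dicts standing in for independent storage
    nodes (an assumption made for this sketch).
    """
    checksum = adler32_hex(data)
    for node in nodes:
        node[name] = (data, checksum)
    return checksum


def verify_replicas(name: str, nodes: list) -> bool:
    """True if every replica still matches its recorded checksum."""
    return all(
        adler32_hex(data) == checksum
        for data, checksum in (node[name] for node in nodes)
    )


# Two replicas on two independent (simulated) nodes.
node_a, node_b = {}, {}
put_with_replicas("notes.txt", b"physics data", [node_a, node_b])
print(verify_replicas("notes.txt", [node_a, node_b]))
```

If one replica is later corrupted, `verify_replicas` returns False for it, and the intact copy on the other node can be used to repair it.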
18 First performance numbers
- Sync lots of small files to a laptop on campus WiFi: 30 Hz download, 50 Hz upload
- Sync big files on a fixed 1 Gb network: upload 20 MB/s, download ~1 Gb/s
- ownCloud client 1.6rc2 + external redirects: your sync client is directly talking to 100s of fileservers
- go as fast as your network goes!
19 First performance numbers
User-perceived performance (client)
- Metadata operations (pycurl with SSL sessions)
  - PROPFIND with 1 entry: 90 Hz
  - PROPFIND with 1K entries: 8.5 kHz
  - PROPFIND with 10K entries: 10 kHz
- Table: small files (10KB), ops/s; columns: Download, Upload, Delete; rows: pycurl seq, pycurl P=10, pycurl P=50, owncloud sync client, owncloud sync client (1.6.2rc2) TH=10
20 Summary
- beta service for document sharing
  - provides useful functionality already now
  - appreciated by beta users
  - sync clients need further improvements
- heading towards a large sync and share layer for science research
  - all our data exposed from day 1
  - massive scalability, high performance
  - integrated into existing workflows: new capabilities!
  - small overhead on top of our existing operations and development: TCO control
  - and still as easy to use as Dropbox.com
21 Backup slides
22 Basic sync algorithm
Diagram labels: client (sqlite DB, inotify local changes); ownCloud server
- PROPFIND: get remote ETag
- PUT, GET, MOVE files
- propagate ETag on file change
Notes:
- ETag is a standard HTTP header for cache control
- ETag is a unique identifier generated by the server
- No file diffs over the wire
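The ETag comparison at the heart of this algorithm can be sketched as follows. This is a simplified illustration, not the ownCloud client's implementation: the table layout and the function names are assumptions, and the `remote_etags` dictionary stands in for the result of a PROPFIND request. The client keeps the ETag seen at the last sync in a local sqlite database and treats any path whose remote ETag differs as changed.

```python
import sqlite3


def open_state_db() -> sqlite3.Connection:
    """Create an in-memory sync-state database (hypothetical schema)."""
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE sync_state (path TEXT PRIMARY KEY, etag TEXT NOT NULL)"
    )
    return db


def remote_changes(db: sqlite3.Connection, remote_etags: dict) -> list:
    """Paths whose remote ETag differs from the one recorded at last sync."""
    changed = []
    for path, etag in sorted(remote_etags.items()):
        row = db.execute(
            "SELECT etag FROM sync_state WHERE path = ?", (path,)
        ).fetchone()
        if row is None or row[0] != etag:
            changed.append(path)
    return changed


def record_synced(db: sqlite3.Connection, path: str, etag: str) -> None:
    """Remember the ETag of a file after it has been transferred."""
    db.execute(
        "INSERT OR REPLACE INTO sync_state (path, etag) VALUES (?, ?)",
        (path, etag),
    )


db = open_state_db()
record_synced(db, "report.tex", "etag-1")
# The server now reports a new ETag for report.tex and a new file plot.png.
print(remote_changes(db, {"report.tex": "etag-2", "plot.png": "etag-7"}))
```

Because only ETags are compared, a changed file is always transferred whole, which matches the "no file diffs over the wire" note.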
23 CERNBOX - J.T.Moscicki - White Area Lecture, April
24 File versioning and trashbin
25 Sharing
- Named shares: shared files to known users are synced automatically
- Share with hashed link: anonymous access
- It is possible to share a folder for web-based upload
26 Choices for CERN Dropbox service
- buy service: private cloud on 3rd party storage (dropbox, wuala, ...)
  - we could also buy other services: gmail, ...
- private cloud on own storage with closed software
  - licensing costs? vendor lock-in?
  - bit torrent sync: mixed mode service
- private cloud with open source software
  - large choice but products immature
- develop our own solution
  - platform integration? can we design good interfaces?
- none of these solutions is ideal... for various reasons
27 Pilot limitations
- Move
  - On the origin client the move is propagated to the server
  - On the other clients it is propagated as COPY/DELETE (suboptimal)
- Symlinks are not supported
- Ignored files:, :? * < >
- We currently recommend one sync folder setup: ~/cernbox
- High per-file overhead
  - Expect 2-5 Hz PUT
  - Expect ~10 Hz GET
- Transfer rates
  - Expect 10-30MB download
  - Expect 5-10MB upload
  - Larger files: 400 MB file on my desktop: https/upload ~25 MB/s, https/download ~60 MB/s
- For wireless devices, laptops, phones: do we care about transfer rates?
28 Protocol
- WebDAV (extension to HTTP with XML body)
  - ownCloud server is RFC 2518 compliant (litmus check)
- Protocol is HTTP with XML body, so it is bloated
  - Basic metadata query for a file: ~0.5 KB
  - Compresses well: metadata for 1000 files ~16 KB
- Some good points
  - Integration with other web services
  - Desktop browsers: OSX/Finder, ...
  - Simply curl/wget to GET, PROPFIND, PUT, DELETE, MOVE
  - Fuse mount (davfs2)
- HTTP POST/GET: 4 GB file limit for upload currently (browser limitations)
- Sync client: PUT
  - Chunked upload (10 MB chunks)
  - Extension: OC-CHUNKED header attribute
- Sharing: REST API
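Two of the protocol points above lend themselves to a short sketch: the XML body of a PROPFIND request and the splitting of a large upload into 10 MB chunks. The PROPFIND body follows the standard WebDAV `DAV:` namespace. The chunking helper only computes byte ranges; it does not reproduce the ownCloud chunked-upload extension or its headers, and interpreting "10 MB" as 10 MiB is an assumption. The function names are illustrative.

```python
import xml.etree.ElementTree as ET

DAV_NS = "DAV:"
CHUNK_SIZE = 10 * 1024 * 1024  # "10 MB" read as 10 MiB (assumption)


def propfind_body(*props: str) -> bytes:
    """Build a WebDAV PROPFIND body asking for the named DAV: properties."""
    ET.register_namespace("d", DAV_NS)
    root = ET.Element(f"{{{DAV_NS}}}propfind")
    prop = ET.SubElement(root, f"{{{DAV_NS}}}prop")
    for name in props:
        ET.SubElement(prop, f"{{{DAV_NS}}}{name}")
    return ET.tostring(root, encoding="utf-8", xml_declaration=True)


def chunk_ranges(file_size: int, chunk_size: int = CHUNK_SIZE):
    """Yield (offset, length) pairs that cover a file of `file_size` bytes."""
    offset = 0
    while offset < file_size:
        length = min(chunk_size, file_size - offset)
        yield offset, length
        offset += length


# Ask the server for the ETag and size of a resource.
print(propfind_body("getetag", "getcontentlength").decode("utf-8"))

# A 25 MiB file is sent as two full chunks and one partial chunk.
for offset, length in chunk_ranges(25 * 1024 * 1024):
    print(offset, length)
```

The request body is small, but the multistatus response repeats the same XML structure for every entry, which is why the slide calls the protocol bloated yet well compressible.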
29 Desktop sync client
- Implementation: Qt, libneon
- Keeps sync state in an sqlite db on the client side
- Basic workflow
  - Check-for-updates
  - Propagate
  - Reconcile
- No file locking
- Change detection: trigger a tree scan
  - periodically
  - asynchronously (inotify events via QFileSystemWatcher)
- Conflict resolution
  - Local file is renamed: FILE_conflict txt
- No delta sync
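The reconcile step can be sketched as a small decision function. This is an assumed simplification, not the client's actual logic: it takes only two flags per path (changed locally, changed remotely since the last sync), ignores deletions and moves, and uses hypothetical names. A change on both sides is reported as a conflict, which per the slide is resolved by renaming the local file to a conflict copy.

```python
from enum import Enum


class Action(Enum):
    NONE = "none"
    UPLOAD = "upload"
    DOWNLOAD = "download"
    CONFLICT = "conflict"  # local file is renamed to a conflict copy


def reconcile(local_changed: bool, remote_changed: bool) -> Action:
    """Decide what to do with one file since the last successful sync."""
    if local_changed and remote_changed:
        return Action.CONFLICT
    if local_changed:
        return Action.UPLOAD
    if remote_changed:
        return Action.DOWNLOAD
    return Action.NONE


def plan(local_changes: set, remote_changes: set) -> dict:
    """Map every changed path to the action the propagate step should run."""
    return {
        path: reconcile(path in local_changes, path in remote_changes)
        for path in sorted(local_changes | remote_changes)
    }


# talk.ppt was edited on both sides, so it ends up as a conflict.
for path, action in plan({"a.txt", "talk.ppt"}, {"b.png", "talk.ppt"}).items():
    print(path, action.value)
```

Since there is no file locking and no delta sync, a conflict cannot be merged automatically; keeping both versions side by side leaves the final decision to the user.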
30 Client issues
- Efficient cross-platform client implementation is not THAT easy
- But it is improving: oc client 1.4, oc client 1.5
- https://github.com/owncloud/mirall/issues/
More informationInteroperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG
Interoperable Cloud Storage with the CDMI Standard Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member
More informationproduct overview CRASH
product overview CRASH e back up everyone, everywhere PROe Software Continuous backup for business Use CrashPlan PROe for continuous, cross-platform backup that works everywhere, enabling users to easily
More informationMarkLogic Server. MarkLogic Server on Microsoft Azure Guide. MarkLogic 9 January, 2018
MarkLogic Server on Microsoft Azure Guide 1 MarkLogic 9 January, 2018 Last Revised: 9.0-4, January, 2018 2018 MarkLogic Corporation. MarkLogic and the MarkLogic logo are trademarks or registered trademarks
More informationAppSense DataNow. Release Notes (Version 4.0) Components in this Release. These release notes include:
AppSense DataNow Release Notes (Version 4.0) These release notes include: Components in this Release Important Upgrade Information New Features Bugs Fixed Known Issues and Limitations Supported Operating
More informationOracle VM Workshop Applica>on Driven Virtualiza>on
Oracle VM Workshop Applica>on Driven Virtualiza>on Simon COTER Principal Product Manager Oracle VM & VirtualBox simon.coter@oracle.com hnps://blogs.oracle.com/scoter November 25th, 2015 Copyright 2014
More informationDistributed Filesystem
Distributed Filesystem 1 How do we get data to the workers? NAS Compute Nodes SAN 2 Distributing Code! Don t move data to workers move workers to the data! - Store data on the local disks of nodes in the
More informationCS 470 Spring Distributed Web and File Systems. Mike Lam, Professor. Content taken from the following:
CS 470 Spring 2017 Mike Lam, Professor Distributed Web and File Systems Content taken from the following: "Distributed Systems: Principles and Paradigms" by Andrew S. Tanenbaum and Maarten Van Steen (Chapters
More informationEMC Integrated Infrastructure for VMware. Business Continuity
EMC Integrated Infrastructure for VMware Business Continuity Enabled by EMC Celerra and VMware vcenter Site Recovery Manager Reference Architecture Copyright 2009 EMC Corporation. All rights reserved.
More informationAgenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache
Databases on AWS 2017 Amazon Web Services, Inc. and its affiliates. All rights served. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon Web Services,
More informationEmerging Technologies for HPC Storage
Emerging Technologies for HPC Storage Dr. Wolfgang Mertz CTO EMEA Unstructured Data Solutions June 2018 The very definition of HPC is expanding Blazing Fast Speed Accessibility and flexibility 2 Traditional
More informationAn overview of irods clients. Ton Smeele
An overview of irods clients Ton Smeele agenda irods client-server architecture Client libraries Out of the box clients Example customer-developed client irods used in a 2-tier model Client Server Client
More informationAppSense DataNow. Release Notes (Version 4.1) Components in this Release. These release notes include:
AppSense DataNow Release Notes (Version 4.1) These release notes include: Components in this Release Important Upgrade Information New Features Bugs Fixed Known Issues and Limitations Supported Operating
More informationThe Lion of storage systems
The Lion of storage systems Rakuten. Inc, Yosuke Hara Mar 21, 2013 1 The Lion of storage systems http://www.leofs.org LeoFS v0.14.0 was released! 2 Table of Contents 1. Motivation 2. Overview & Inside
More informationSync User Guide. Powered by Axient Anchor
Sync Powered by Axient Anchor TABLE OF CONTENTS End... Error! Bookmark not defined. Last Revised: Wednesday, October 10, 2018... Error! Bookmark not defined. Table of Contents... 2 Getting Started... 7
More informationThe Hadoop Distributed File System Konstantin Shvachko Hairong Kuang Sanjay Radia Robert Chansler
The Hadoop Distributed File System Konstantin Shvachko Hairong Kuang Sanjay Radia Robert Chansler MSST 10 Hadoop in Perspective Hadoop scales computation capacity, storage capacity, and I/O bandwidth by
More informationDeveloping Microsoft Azure Solutions (70-532) Syllabus
Developing Microsoft Azure Solutions (70-532) Syllabus Cloud Computing Introduction What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages
More informationElastic Compute Service. Quick Start for Windows
Overview Purpose of this document This document describes how to quickly create an instance running Windows, connect to an instance remotely, and deploy the environment. It is designed to walk you through
More informationWorldwide Production Distributed Data Management at the LHC. Brian Bockelman MSST 2010, 4 May 2010
Worldwide Production Distributed Data Management at the LHC Brian Bockelman MSST 2010, 4 May 2010 At the LHC http://op-webtools.web.cern.ch/opwebtools/vistar/vistars.php?usr=lhc1 Gratuitous detector pictures:
More informationSECURE, FLEXIBLE ON-PREMISE STORAGE WITH EMC SYNCPLICITY AND EMC ISILON
White Paper SECURE, FLEXIBLE ON-PREMISE STORAGE WITH EMC SYNCPLICITY AND EMC ISILON Abstract This white paper explains the benefits to the extended enterprise of the on-premise, online file sharing storage
More information