BESIII physical offline data analysis on virtualization platform

Size: px
Start display at page:

Download "BESIII physical offline data analysis on virtualization platform"

Transcription

1 BESIII physical offline data analysis on virtualization platform Qiulan Huang Computing Center, IHEP,CAS CHEP 2015

2 Outline Overview of HEP computing in IHEP What is virtualized computing cluster? Why virtualized computing cluster? What we have done? Schedule BESIII jobs to virtual computing cluster Current status Conclusion Qiulan Huang/CC/IHEP 2015/4/10-2

3 HEP computing in IHEP Support several experiments BEPCII & BESIII Cosmic Ray/Astrophysics in Tibet DayaBay CMS, ATLAS experiments on LHC Accelerator-driven Subcritical System (ADS) China Spallation Neutron Source (CSNS) Future experiments Jiangmen Underground Neutrino Observatory:~500TB*10years Lhaasso:2PB per year after 2017,accumulate 20PB+ in 10 years Qiulan Huang/CC/IHEP 2015/4/10-3

4 Computing status in IHEP CPU cores ~ ~ 60 queues, managed by Torque Problems Low resource utilization Poor resource sharing Qiulan Huang/CC/IHEP 2015/4/10-4

5 What is virtualized computing cluster? KVM+OpenStack+Torque /maui: Integrated Openstack and Torque /maui to provide computing service based on IHEPCloud Virtual cluster and physical cluster work together When a job queue is busy, the jobs can be allocated to a virtual queue VMs are created according to application requirement Distributed computing User interface Resource Management Unified deployment (puppet) Send results Submit jobs CloudAPI(create/stop VMs) CloudScheduler Query load Schedule to virtual Queue IHEPCloud (Virtual queue) Monitoring Qiulan Huang/CC/IHEP 2015/4/10-5

6 Why? Advantages Improve resource utilization Improve the efficiency of resource scheduling Simplify management Elasticity Resources heterogeneous is transparent to applications and users Energy saving To solve problems The overall computing resources utilization rate is relative low IHEP Computing cluster supports various experiments such as BES, Daya Bay, YBJ Computing resource is separated by each experiment which cannot be shared. At a certain period of time, some subsets are busy, some subsets are idle. It leads to take much time to queue. Qiulan Huang/CC/IHEP 2015/4/10-6

7 What we have done? Qiulan Huang/CC/IHEP 2015/4/10-7

8 BESIII Offline software optimization BESIII analysis is I/O heavy Creating event metadata and doing pre-selection by metadata according to event property to reduce IO throughput significantly Raw Data DST Data TAG Data Event construction Tag event Event index Event TAG Event TAG Detailed: see xiaofeng s talk, track2, 12:00 16/4 (BESIII Physics Data Storing and Processing on HBase and MapReduce) Qiulan Huang/CC/IHEP 2015/4/10-8

9 Optimized KVM performance Benchmark testing by default CPU performance penalty of KVM is about 10% and IO is about 12% Network performance penalty is about 3% CPU affinity The process is bound to a specific CPU but not allowed to dispatch to the other ones No need to migrate process between processors frequently to improve cache hit rate Extended Page Table Disabled EPT: modprobe kvm-intel enable_ept=0 VCPU 0 affini 亲和性 ty CPU 0 VCPU 1 亲和性 affini ty CPU 1 Qiulan Huang/CC/IHEP 2015/4/10-9

10 HS06 Rate Optimized CPU performance Optimized CPU benchmark testing Specifications Intel(R) Xeon(R) CPU X5650(2.67GHz),8 CPU cores,24gb 内存 OS:SLC release 5.5 (Boron) el5.cve ,64bit KVM-83 Tools HEP-SPEC06 Optimized CPU performance increased about 3% / 通用格式 CPU Benchmark / 通用格式 / 通用格式 / 通用格式 / 通用格式 UnPinned Pinned Default / 通用格式 / 通用格式 VM0 VM1 VM2 VM3 VM4 VM5 VM6 VM7 Qiulan Huang/CC/IHEP 2015/4/10-10

11 BESIII jobs running in VMs(1) BES simulation job (BOSS ) event number=1000 BesRndmGenSvc.RndmSeed=483366; #include "$BESSIMROOT/share/G4Svc_BesSim.txt" #include "$CALIBSVCROOT/share/calibConfig_sim.txt" RealizationSvc.RunIdList= {-9989}; #include "$ROOTIOROOT/share/jobOptions_Digi2Root.txt" DatabaseSvc.SqliteDbPath="/panfs/panfs.ihep.ac.cn/home/data/dengzy/pacman_bak/database"; RootCnvSvc.digiRootOutputFile= "/scratchfs/cc/shijy/rhopi-bws rtraw"; MessageSvc.OutputLevel= 5; ApplicationMgr.EvtMax=10000; Test environment VM: 2 cores,2gb memory The job requires 22% CPU ability Physics machine: 8cores,16GB memory Test results Jobs running time in VM is 1:45:05 while in physical machine is 1:42:04 which indicates the performance penalty is about 2.9% The penalty is optimistic Qiulan Huang/CC/IHEP 2015/4/10-11

12 BESIII jobs running in VMs(2) BES analysis job(boss ) ApplicationMgr.EvtMax = 1E9 "/besfs2/offline/data/663-1/jpsi/tmp2/120520/run_ _all_file006_sfo-1.dst", "/besfs2/offline/data/663-1/jpsi/tmp2/120520/run_ _all_file006_sfo-2.dst" }; // Set output level threshold (2=DEBUG, 3=INFO, 4=WARNING, 5=ERROR, 6=FATAL ) MessageSvc.OutputLevel = 6; // Number of events to be processed (default is 10) ApplicationMgr.EvtMax = 1E9; ApplicationMgr.HistogramPersistency = "ROOT"; Testing specification VM: 1 CPU cores,2gb memory Physical machine: 8CPU cores,16gb memory Testing results Job running time in VM is 8:04:47 while in physical machine is 7:48:32 It takes more 975s in VM than in physical machine which illustrated the performance penalty is about 3% Qiulan Huang/CC/IHEP 2015/4/10-12

13 IHEPCloud Lauched in May A private Iaas platform aiming to provide a self-service cloud platform for users and IHEP scientific computing Open for any user who has IHEP account (>1000 users, >70 active users) Qiulan Huang/CC/IHEP 2015/4/10-13

14 CloudScheduler Integrated virtual computing cluster into the traditional physical cluster to optimize the resource utilization. Take fine-grained resource allocation to schedule tasks instead of taking nodes. Design flexible allocating policy to provisioning VMs dynamically, considering job types, system load and cluster real-time status. Schedule jobs to IHEPCloud Qiulan Huang/CC/IHEP 2015/4/10-14

15 Architecture of Cloudscheduler PBS (VM Queue) Expends the original Torque PBS to support vm queue. PBS (VM Queue) VM central controller A matcher between the various modules. Polling Calculate Publish VM job controller Central controller Provide job query service schedule the jobs running in virtual queue (record the jobs in database) VM resource controller Policymaker: make vm allocation strategy. VM controller: start or stop vm CloudAPI: a packaged module based on the openstack api and with some extension. Job agent(deployed in VMs) Pull jobs to run, return job exit status and transfer the output files VM Resource Controller Job agent IHEPCloud Job agent VM Job Controller Qiulan Huang/CC/IHEP 2015/4/10-15

16 Workflow PBS VM central controller VM Job Controller VM Resource Controller Job Agent Request: ask running number for each queue reply: job running number of each queue check the vm queue and resource status Reply: The maximum job number for queue Request: queue name Reply: job run and queue number of queue, VM ip addr scheduled jobs Request: VM tpye, total VM num and VM ip list scheduled jobs Reply: the num of active vm start/stop vm active vm number Qiulan Huang/CC/IHEP 2015/4/10-16

17 Push+Pull mode Pull and push mode Pull: allocate the cpu/core for BESIII jobs with suitable resource When new job is coming, the PBS will request VM central controller to get vm resource. VM central controller prepares corresponding resource for the job VMs request matched jobs Push: Cluster internal keeps the original way of push, schedule job into the virtualized queue. Be transparent to users Qiulan Huang/CC/IHEP 2015/4/10-17

18 VM allocating policy Allocating VMs dynamically Provisioning VM number is determined by current virtual cluster status How to allocate VMs according to load of cluster especially considering the information from monitoring system Configurable VM allocating strategy interface. Linear addition and subtraction. And so on Qiulan Huang/CC/IHEP 2015/4/10-18

19 Current status Completed the testbed Submitted hundreds of test jobs Still has some problems Sometimes message communicated between modules lost Jobagent service go to offline when not connect to the server side Next steps: Fix bugs Implement more vm allocation strategies Applied to other experiments like JUNO,DAYABAY,YBJ and so on. Provide online service in this year. Qiulan Huang/CC/IHEP 2015/4/10-19

20 Summary Creating event metadata can reduce the IO throughput significantly CPU and network performance of KVM is optimistic, which can meet the BESIII experiment s requirement Virtual computing cluster is a good supplement for the existing physical cluster Qiulan Huang/CC/IHEP 2015/4/10-20

21 Any Question? Qiulan Huang/CC/IHEP 2015/4/10-21

The High-Level Dataset-based Data Transfer System in BESDIRAC

The High-Level Dataset-based Data Transfer System in BESDIRAC The High-Level Dataset-based Data Transfer System in BESDIRAC T Lin 1,2, X M Zhang 1, W D Li 1 and Z Y Deng 1 1 Institute of High Energy Physics, 19B Yuquan Road, Beijing 100049, People s Republic of China

More information

Clouds in High Energy Physics

Clouds in High Energy Physics Clouds in High Energy Physics Randall Sobie University of Victoria Randall Sobie IPP/Victoria 1 Overview Clouds are integral part of our HEP computing infrastructure Primarily Infrastructure-as-a-Service

More information

KVM 在 OpenStack 中的应用. Dexin(Mark) Wu

KVM 在 OpenStack 中的应用. Dexin(Mark) Wu KVM 在 OpenStack 中的应用 Dexin(Mark) Wu Agenda Overview CPU Memory Storage Network Architecture Overview nova-api REST API nova-scheduler nova-conductor nova-compute DB RPC Call libvirt driver libvirt Cinder

More information

A comparison of performance between KVM and Docker instances in OpenStack

A comparison of performance between KVM and Docker instances in OpenStack A comparison of performance between KVM and Docker instances in OpenStack Wataru Takase High Energy Accelerator Research Organiza on (KEK), Japan HEPiX Fall 2015 Workshop at BNL 1 KEK site will become

More information

The ATLAS EventIndex: Full chain deployment and first operation

The ATLAS EventIndex: Full chain deployment and first operation The ATLAS EventIndex: Full chain deployment and first operation Álvaro Fernández Casaní Instituto de Física Corpuscular () Universitat de València CSIC On behalf of the ATLAS Collaboration 1 Outline ATLAS

More information

BESIII Physical Analysis on Hadoop Platform

BESIII Physical Analysis on Hadoop Platform BESIII Physical Analysis on Hadoop Platform Jing HUO 12, Dongsong ZANG 12, Xiaofeng LEI 12, Qiang LI 12, Gongxing SUN 1 1 Institute of High Energy Physics, Beijing, China 2 University of Chinese Academy

More information

August Li Qiang, Huang Qiulan, Sun Gongxing IHEP-CC. Supported by the National Natural Science Fund

August Li Qiang, Huang Qiulan, Sun Gongxing IHEP-CC. Supported by the National Natural Science Fund August 15 2016 Li Qiang, Huang Qiulan, Sun Gongxing IHEP-CC Supported by the National Natural Science Fund The Current Computing System What is Hadoop? Why Hadoop? The New Computing System with Hadoop

More information

BOSS and LHC computing using CernVM and BOINC

BOSS and LHC computing using CernVM and BOINC BOSS and LHC computing using CernVM and BOINC otn-2010-0x openlab Summer Student Report BOSS and LHC computing using CernVM and BOINC Jie Wu (Supervisor: Ben Segal / IT) 1 December 2010 Version 1 Distribution::

More information

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez Scientific data processing at global scale The LHC Computing Grid Chengdu (China), July 5th 2011 Who I am 2 Computing science background Working in the field of computing for high-energy physics since

More information

High Performance Computing on MapReduce Programming Framework

High Performance Computing on MapReduce Programming Framework International Journal of Private Cloud Computing Environment and Management Vol. 2, No. 1, (2015), pp. 27-32 http://dx.doi.org/10.21742/ijpccem.2015.2.1.04 High Performance Computing on MapReduce Programming

More information

Part2: Let s pick one cloud IaaS middleware: OpenStack. Sergio Maffioletti

Part2: Let s pick one cloud IaaS middleware: OpenStack. Sergio Maffioletti S3IT: Service and Support for Science IT Cloud middleware Part2: Let s pick one cloud IaaS middleware: OpenStack Sergio Maffioletti S3IT: Service and Support for Science IT, University of Zurich http://www.s3it.uzh.ch/

More information

MOHA: Many-Task Computing Framework on Hadoop

MOHA: Many-Task Computing Framework on Hadoop Apache: Big Data North America 2017 @ Miami MOHA: Many-Task Computing Framework on Hadoop Soonwook Hwang Korea Institute of Science and Technology Information May 18, 2017 Table of Contents Introduction

More information

Clouds at other sites T2-type computing

Clouds at other sites T2-type computing Clouds at other sites T2-type computing Randall Sobie University of Victoria Randall Sobie IPP/Victoria 1 Overview Clouds are used in a variety of ways for Tier-2 type computing MC simulation, production

More information

Virtualization of the ATLAS Tier-2/3 environment on the HPC cluster NEMO

Virtualization of the ATLAS Tier-2/3 environment on the HPC cluster NEMO Virtualization of the ATLAS Tier-2/3 environment on the HPC cluster NEMO Ulrike Schnoor (CERN) Anton Gamel, Felix Bührer, Benjamin Rottler, Markus Schumacher (University of Freiburg) February 02, 2018

More information

Aeromancer: A Workflow Manager for Large- Scale MapReduce-Based Scientific Workflows

Aeromancer: A Workflow Manager for Large- Scale MapReduce-Based Scientific Workflows Aeromancer: A Workflow Manager for Large- Scale MapReduce-Based Scientific Workflows Presented by Sarunya Pumma Supervisors: Dr. Wu-chun Feng, Dr. Mark Gardner, and Dr. Hao Wang synergy.cs.vt.edu Outline

More information

Evolution of Cloud Computing in ATLAS

Evolution of Cloud Computing in ATLAS The Evolution of Cloud Computing in ATLAS Ryan Taylor on behalf of the ATLAS collaboration 1 Outline Cloud Usage and IaaS Resource Management Software Services to facilitate cloud use Sim@P1 Performance

More information

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science T. Maeno, K. De, A. Klimentov, P. Nilsson, D. Oleynik, S. Panitkin, A. Petrosyan, J. Schovancova, A. Vaniachine,

More information

LHCb experience running jobs in virtual machines

LHCb experience running jobs in virtual machines LHCb experience running jobs in virtual machines Andrew McNab, University of Manchester Federico Stagni & Cinzia Luzzi, CERN on behalf of the LHCb collaboration Overview Starting from DIRAC + Grid CernVM

More information

HEP Grid Activities in China

HEP Grid Activities in China HEP Grid Activities in China Sun Gongxing Institute of High Energy Physics, Chinese Academy of Sciences CANS Nov. 1-2, 2005, Shen Zhen, China History of IHEP Computing Center Found in 1974 Computing Platform

More information

Enabling Efficient and Scalable Zero-Trust Security

Enabling Efficient and Scalable Zero-Trust Security WHITE PAPER Enabling Efficient and Scalable Zero-Trust Security FOR CLOUD DATA CENTERS WITH AGILIO SMARTNICS THE NEED FOR ZERO-TRUST SECURITY The rapid evolution of cloud-based data centers to support

More information

Benchmarking and accounting for the (private) cloud

Benchmarking and accounting for the (private) cloud Journal of Physics: Conference Series PAPER OPEN ACCESS Benchmarking and accounting for the (private) cloud To cite this article: J Belleman and U Schwickerath 2015 J. Phys.: Conf. Ser. 664 022035 View

More information

CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase. Chen Zhang Hans De Sterck University of Waterloo

CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase. Chen Zhang Hans De Sterck University of Waterloo CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase Chen Zhang Hans De Sterck University of Waterloo Outline Introduction Motivation Related Work System Design Future Work Introduction

More information

Transparent Throughput Elas0city for IaaS Cloud Storage Using Guest- Side Block- Level Caching

Transparent Throughput Elas0city for IaaS Cloud Storage Using Guest- Side Block- Level Caching Transparent Throughput Elas0city for IaaS Cloud Storage Using Guest- Side Block- Level Caching Bogdan Nicolae (IBM Research, Ireland) Pierre Riteau (University of Chicago, USA) Kate Keahey (Argonne National

More information

escience in the Cloud: A MODIS Satellite Data Reprojection and Reduction Pipeline in the Windows

escience in the Cloud: A MODIS Satellite Data Reprojection and Reduction Pipeline in the Windows escience in the Cloud: A MODIS Satellite Data Reprojection and Reduction Pipeline in the Windows Jie Li1, Deb Agarwal2, Azure Marty Platform Humphrey1, Keith Jackson2, Catharine van Ingen3, Youngryel Ryu4

More information

What is KVM? KVM patch. Modern hypervisors must do many things that are already done by OSs Scheduler, Memory management, I/O stacks

What is KVM? KVM patch. Modern hypervisors must do many things that are already done by OSs Scheduler, Memory management, I/O stacks LINUX-KVM The need for KVM x86 originally virtualization unfriendly No hardware provisions Instructions behave differently depending on privilege context(popf) Performance suffered on trap-and-emulate

More information

Accelerate OpenStack* Together. * OpenStack is a registered trademark of the OpenStack Foundation

Accelerate OpenStack* Together. * OpenStack is a registered trademark of the OpenStack Foundation Accelerate OpenStack* Together * OpenStack is a registered trademark of the OpenStack Foundation Considerations to Build a Production OpenStack Cloud Ruchi Bhargava, Intel IT Shuquan Huang, Intel IT Kai

More information

Workload management at KEK/CRC -- status and plan

Workload management at KEK/CRC -- status and plan Workload management at KEK/CRC -- status and plan KEK/CRC Hiroyuki Matsunaga Most of the slides are prepared by Koichi Murakami and Go Iwai CPU in KEKCC Work server & Batch server Xeon 5670 (2.93 GHz /

More information

Elastic Efficient Execution of Varied Containers. Sharma Podila Nov 7th 2016, QCon San Francisco

Elastic Efficient Execution of Varied Containers. Sharma Podila Nov 7th 2016, QCon San Francisco Elastic Efficient Execution of Varied Containers Sharma Podila Nov 7th 2016, QCon San Francisco In other words... How do we efficiently run heterogeneous workloads on an elastic pool of heterogeneous resources,

More information

Scalability Testing of DNE2 in Lustre 2.7 and Metadata Performance using Virtual Machines Tom Crowe, Nathan Lavender, Stephen Simms

Scalability Testing of DNE2 in Lustre 2.7 and Metadata Performance using Virtual Machines Tom Crowe, Nathan Lavender, Stephen Simms Scalability Testing of DNE2 in Lustre 2.7 and Metadata Performance using Virtual Machines Tom Crowe, Nathan Lavender, Stephen Simms Research Technologies High Performance File Systems hpfs-admin@iu.edu

More information

Pricing Intra-Datacenter Networks with

Pricing Intra-Datacenter Networks with Pricing Intra-Datacenter Networks with Over-Committed Bandwidth Guarantee Jian Guo 1, Fangming Liu 1, Tao Wang 1, and John C.S. Lui 2 1 Cloud Datacenter & Green Computing/Communications Research Group

More information

An Experimental Cloud Resource Broker System for Virtual Application Control with VM Allocation Scheme

An Experimental Cloud Resource Broker System for Virtual Application Control with VM Allocation Scheme An Experimental Cloud Resource Broker System for Virtual Application Control with VM Allocation Scheme Seong-Hwan Kim 1, Dong-Ki Kang 1, Ye Ren 1, Yong-Sung Park 1, Kyung-No Joo 1, Chan-Hyun Youn 1, YongSuk

More information

Performance quality monitoring system (PQM) for the Daya Bay experiment

Performance quality monitoring system (PQM) for the Daya Bay experiment Performance quality monitoring system (PQM) for the Daya Bay experiment LIU Yingbiao Institute of High Energy Physics On behalf of the Daya Bay Collaboration ACAT2013, Beijing, May 16-21, 2013 2 The Daya

More information

HPC over Cloud. July 16 th, SCENT HPC Summer GIST. SCENT (Super Computing CENTer) GIST (Gwangju Institute of Science & Technology)

HPC over Cloud. July 16 th, SCENT HPC Summer GIST. SCENT (Super Computing CENTer) GIST (Gwangju Institute of Science & Technology) HPC over Cloud July 16 th, 2014 2014 HPC Summer School @ GIST (Super Computing CENTer) GIST (Gwangju Institute of Science & Technology) Dr. JongWon Kim jongwon@nm.gist.ac.kr Interplay between Theory, Simulation,

More information

Data Grid Infrastructure for YBJ-ARGO Cosmic-Ray Project

Data Grid Infrastructure for YBJ-ARGO Cosmic-Ray Project Data Grid Infrastructure for YBJ-ARGO Cosmic-Ray Project Gang CHEN, Hongmei ZHANG - IHEP CODATA 06 24 October 2006, Beijing FP6 2004 Infrastructures 6-SSA-026634 http://www.euchinagrid.cn Extensive Air

More information

Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus Developing Microsoft Azure Solutions (70-532) Syllabus Cloud Computing Introduction What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages

More information

PROOF-Condor integration for ATLAS

PROOF-Condor integration for ATLAS PROOF-Condor integration for ATLAS G. Ganis,, J. Iwaszkiewicz, F. Rademakers CERN / PH-SFT M. Livny, B. Mellado, Neng Xu,, Sau Lan Wu University Of Wisconsin Condor Week, Madison, 29 Apr 2 May 2008 Outline

More information

Towards Network Awareness in LHC Computing

Towards Network Awareness in LHC Computing Towards Network Awareness in LHC Computing CMS ALICE CERN Atlas LHCb LHC Run1: Discovery of a New Boson LHC Run2: Beyond the Standard Model Gateway to a New Era Artur Barczyk / Caltech Internet2 Technology

More information

Virtualizing a Batch. University Grid Center

Virtualizing a Batch. University Grid Center Virtualizing a Batch Queuing System at a University Grid Center Volker Büge (1,2), Yves Kemp (1), Günter Quast (1), Oliver Oberst (1), Marcel Kunze (2) (1) University of Karlsruhe (2) Forschungszentrum

More information

VM Life Cycle Management

VM Life Cycle Management VM life cycle management refers to the entire process of registering, deploying, updating, monitoring VMs, and getting them service chained as per your requirements. You can perform these tasks and more

More information

CS / Cloud Computing. Recitation 11 November 5 th and Nov 8 th, 2013

CS / Cloud Computing. Recitation 11 November 5 th and Nov 8 th, 2013 CS15-319 / 15-619 Cloud Computing Recitation 11 November 5 th and Nov 8 th, 2013 Announcements Encounter a general bug: Post on Piazza Encounter a grading bug: Post Privately on Piazza Don t ask if my

More information

How can you implement this through a script that a scheduling daemon runs daily on the application servers?

How can you implement this through a script that a scheduling daemon runs daily on the application servers? You ve been tasked with implementing an automated data backup solution for your application servers that run on Amazon EC2 with Amazon EBS volumes. You want to use a distributed data store for your backups

More information

Cloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe

Cloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe Cloud Programming Programming Environment Oct 29, 2015 Osamu Tatebe Cloud Computing Only required amount of CPU and storage can be used anytime from anywhere via network Availability, throughput, reliability

More information

SurFS Product Description

SurFS Product Description SurFS Product Description 1. ABSTRACT SurFS An innovative technology is evolving the distributed storage ecosystem. SurFS is designed for cloud storage with extreme performance at a price that is significantly

More information

AWS Reference Design Document

AWS Reference Design Document AWS Reference Design Document Contents Overview... 1 Amazon Web Services (AWS), Public Cloud and the New Security Challenges... 1 Security at the Speed of DevOps... 2 Securing East-West and North-South

More information

Distributed Computing Framework. A. Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille

Distributed Computing Framework. A. Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille Distributed Computing Framework A. Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille EGI Webinar, 7 June 2016 Plan DIRAC Project Origins Agent based Workload Management System Accessible computing resources Data

More information

Dynamic Extension of a Virtualized Cluster by using Cloud Resources

Dynamic Extension of a Virtualized Cluster by using Cloud Resources Dynamic Extension of a Virtualized Cluster by using Cloud Resources Oliver Oberst, Thomas Hauth, David Kernert, Stephan Riedel, Günter Quast Institut für Experimentelle Kernphysik, Karlsruhe Institute

More information

Execo tutorial Grid 5000 school, Grenoble, January 2016

Execo tutorial Grid 5000 school, Grenoble, January 2016 Execo tutorial Grid 5000 school, Grenoble, January 2016 Simon Delamare Matthieu Imbert Laurent Pouilloux INRIA/CNRS/LIP ENS-Lyon 03/02/2016 1/34 1 introduction 2 execo, core module 3 execo g5k, Grid 5000

More information

Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus Developing Microsoft Azure Solutions (70-532) Syllabus Cloud Computing Introduction What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages

More information

Oracle Enterprise Manager 12c IBM DB2 Database Plug-in

Oracle Enterprise Manager 12c IBM DB2 Database Plug-in Oracle Enterprise Manager 12c IBM DB2 Database Plug-in May 2015 Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and

More information

Work Queue + Python. A Framework For Scalable Scientific Ensemble Applications

Work Queue + Python. A Framework For Scalable Scientific Ensemble Applications Work Queue + Python A Framework For Scalable Scientific Ensemble Applications Peter Bui, Dinesh Rajan, Badi Abdul-Wahid, Jesus Izaguirre, Douglas Thain University of Notre Dame Distributed Computing Examples

More information

Balancing Fairness and Efficiency in Tiered Storage Systems with Bottleneck-Aware Allocation

Balancing Fairness and Efficiency in Tiered Storage Systems with Bottleneck-Aware Allocation Balancing Fairness and Efficiency in Tiered Storage Systems with Bottleneck-Aware Allocation Hui Wang, Peter Varman Rice University FAST 14, Feb 2014 Tiered Storage Tiered storage: HDs and SSDs q Advantages:

More information

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF Conference 2017 The Data Challenges of the LHC Reda Tafirout, TRIUMF Outline LHC Science goals, tools and data Worldwide LHC Computing Grid Collaboration & Scale Key challenges Networking ATLAS experiment

More information

Hillstone CloudEdge For Network Function Virtualization (NFV) Solutions

Hillstone CloudEdge For Network Function Virtualization (NFV) Solutions Hillstone CloudEdge For Network Function Virtualization (NFV) Solutions Introduction With the advancing technologies, business applications and operations naturally have become more dynamic in order to

More information

Deploying a Private OpenStack Cloud at Scale. Matt Fischer & Clayton O Neill

Deploying a Private OpenStack Cloud at Scale. Matt Fischer & Clayton O Neill Deploying a Private OpenStack Cloud at Scale Matt Fischer & Clayton O Neill Introduction Clayton O Neill clayton.oneill@twcable.com IRC: clayton Matt Fischer matt.fischer@twcable.com IRC: mfisch What Is

More information

Installation runbook for Hedvig + Cinder Driver

Installation runbook for Hedvig + Cinder Driver Installation runbook for Hedvig + Cinder Driver Partner Name: Product Name: Product Version: Hedvig Inc. Hedvig Distributed Storage Platform V-1.0.0 MOS Version: Kilo on Ubuntu 14.04 (2015.1.0-7.0) OpenStack

More information

Use of containerisation as an alternative to full virtualisation in grid environments.

Use of containerisation as an alternative to full virtualisation in grid environments. Journal of Physics: Conference Series PAPER OPEN ACCESS Use of containerisation as an alternative to full virtualisation in grid environments. Related content - Use of containerisation as an alternative

More information

Using DC/OS for Continuous Delivery

Using DC/OS for Continuous Delivery Using DC/OS for Continuous Delivery DevPulseCon 2017 Elizabeth K. Joseph, @pleia2 Mesosphere 1 Elizabeth K. Joseph, Developer Advocate, Mesosphere 15+ years working in open source communities 10+ years

More information

CFS-v: I/O Demand-driven VM Scheduler in KVM

CFS-v: I/O Demand-driven VM Scheduler in KVM CFS-v: Demand-driven VM Scheduler in KVM Hyotaek Shim and Sung-Min Lee (hyotaek.shim, sung.min.lee@samsung.com) Software R&D Center, Samsung Electronics 2014. 10. 16 Problem in Server Consolidation 2/16

More information

NCP Computing Infrastructure & T2-PK-NCP Site Update. Saqib Haleem National Centre for Physics (NCP), Pakistan

NCP Computing Infrastructure & T2-PK-NCP Site Update. Saqib Haleem National Centre for Physics (NCP), Pakistan NCP Computing Infrastructure & T2-PK-NCP Site Update Saqib Haleem National Centre for Physics (NCP), Pakistan Outline NCP Overview Computing Infrastructure at NCP WLCG T2 Site status Network status and

More information

Huawei FusionSphere 6.0 Technical White Paper on OpenStack Integrating FusionCompute HUAWEI TECHNOLOGIES CO., LTD. Issue 01.

Huawei FusionSphere 6.0 Technical White Paper on OpenStack Integrating FusionCompute HUAWEI TECHNOLOGIES CO., LTD. Issue 01. Technical White Paper on OpenStack Integrating Issue 01 Date 2016-04-30 HUAWEI TECHNOLOGIES CO., LTD. 2016. All rights reserved. No part of this document may be reproduced or transmitted in any form or

More information

Next-Generation Cloud Platform

Next-Generation Cloud Platform Next-Generation Cloud Platform Jangwoo Kim Jun 24, 2013 E-mail: jangwoo@postech.ac.kr High Performance Computing Lab Department of Computer Science & Engineering Pohang University of Science and Technology

More information

Was ist dran an einer spezialisierten Data Warehousing platform?

Was ist dran an einer spezialisierten Data Warehousing platform? Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Data warehousing, Exadata, specialized hardware proprietary hardware Introduction

More information

IN2P3-CC cloud computing (IAAS) status FJPPL Feb 9-11th 2016

IN2P3-CC cloud computing (IAAS) status FJPPL Feb 9-11th 2016 Centre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules IN2P3-CC cloud computing (IAAS) status FJPPL Feb 9-11th 2016 1 Outline Use cases R&D Internal core services Computing

More information

HEP replica management

HEP replica management Primary actor Goal in context Scope Level Stakeholders and interests Precondition Minimal guarantees Success guarantees Trigger Technology and data variations Priority Releases Response time Frequency

More information

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)? What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)? What is Amazon Machine Image (AMI)? Amazon Elastic Compute Cloud (EC2)?

More information

8. CONCLUSION AND FUTURE WORK. To address the formulated research issues, this thesis has achieved each of the objectives delineated in Chapter 1.

8. CONCLUSION AND FUTURE WORK. To address the formulated research issues, this thesis has achieved each of the objectives delineated in Chapter 1. 134 8. CONCLUSION AND FUTURE WORK 8.1 CONCLUSION Virtualization and internet availability has increased virtualized server cluster or cloud computing environment deployments. With technological advances,

More information

CouchDB-based system for data management in a Grid environment Implementation and Experience

CouchDB-based system for data management in a Grid environment Implementation and Experience CouchDB-based system for data management in a Grid environment Implementation and Experience Hassen Riahi IT/SDC, CERN Outline Context Problematic and strategy System architecture Integration and deployment

More information

Consolidating Complementary VMs with Spatial/Temporalawareness

Consolidating Complementary VMs with Spatial/Temporalawareness Consolidating Complementary VMs with Spatial/Temporalawareness in Cloud Datacenters Liuhua Chen and Haiying Shen Dept. of Electrical and Computer Engineering Clemson University, SC, USA 1 Outline Introduction

More information

Sky Computing on FutureGrid and Grid 5000 with Nimbus. Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France

Sky Computing on FutureGrid and Grid 5000 with Nimbus. Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France Sky Computing on FutureGrid and Grid 5000 with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France Outline Introduction to Sky Computing The Nimbus Project

More information

SNiPER: an offline software framework for non-collider physics experiments

SNiPER: an offline software framework for non-collider physics experiments SNiPER: an offline software framework for non-collider physics experiments J. H. Zou 1, X. T. Huang 2, W. D. Li 1, T. Lin 1, T. Li 2, K. Zhang 1, Z. Y. Deng 1, G. F. Cao 1 1 Institute of High Energy Physics,

More information

Automated Deployment of Private Cloud (EasyCloud)

Automated Deployment of Private Cloud (EasyCloud) Automated Deployment of Private Cloud (EasyCloud) Mohammed Kazim Musab Al-Zahrani Mohannad Mostafa Moath Al-Solea Hassan Al-Salam Advisor: Dr.Ahmed Khayyat 1 Table of Contents Introduction Requirements

More information

Deep Insights: High Availability VMs via a Simple Host-to-Guest Interface OpenStack Masakari Greg Waines (Wind River Systems)

Deep Insights: High Availability VMs via a Simple Host-to-Guest Interface OpenStack Masakari Greg Waines (Wind River Systems) Deep Insights: High Availability VMs via a Simple Host-to-Guest Interface OpenStack Masakari Greg Waines (Wind River Systems) WHEN IT MATTERS, IT RUNS ON WIND RIVER. 2017 WIND RIVER. ALL RIGHTS RESERVED.

More information

Apache HBase Andrew Purtell Committer, Apache HBase, Apache Software Foundation Big Data US Research And Development, Intel

Apache HBase Andrew Purtell Committer, Apache HBase, Apache Software Foundation Big Data US Research And Development, Intel Apache HBase 0.98 Andrew Purtell Committer, Apache HBase, Apache Software Foundation Big Data US Research And Development, Intel Who am I? Committer on the Apache HBase project Member of the Big Data Research

More information

Overview of ATLAS PanDA Workload Management

Overview of ATLAS PanDA Workload Management Overview of ATLAS PanDA Workload Management T. Maeno 1, K. De 2, T. Wenaus 1, P. Nilsson 2, G. A. Stewart 3, R. Walker 4, A. Stradling 2, J. Caballero 1, M. Potekhin 1, D. Smith 5, for The ATLAS Collaboration

More information

Users and utilization of CERIT-SC infrastructure

Users and utilization of CERIT-SC infrastructure Users and utilization of CERIT-SC infrastructure Equipment CERIT-SC is an integral part of the national e-infrastructure operated by CESNET, and it leverages many of its services (e.g. management of user

More information

Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus Developing Microsoft Azure Solutions (70-532) Syllabus Cloud Computing Introduction What is Cloud Computing Cloud Characteristics Cloud Computing Service Models Deployment Models in Cloud Computing Advantages

More information

CS 261 Fall Caching. Mike Lam, Professor. (get it??)

CS 261 Fall Caching. Mike Lam, Professor. (get it??) CS 261 Fall 2017 Mike Lam, Professor Caching (get it??) Topics Caching Cache policies and implementations Performance impact General strategies Caching A cache is a small, fast memory that acts as a buffer

More information

Virtualized Infrastructure Managers for edge computing: OpenVIM and OpenStack comparison IEEE BMSB2018, Valencia,

Virtualized Infrastructure Managers for edge computing: OpenVIM and OpenStack comparison IEEE BMSB2018, Valencia, Virtualized Infrastructure Managers for edge computing: OpenVIM and OpenStack comparison IEEE BMSB2018, Valencia, 2018-06-08 Teodora Sechkova contact@virtualopensystems.com www.virtualopensystems.com Authorship

More information

Agilio CX 2x40GbE with OVS-TC

Agilio CX 2x40GbE with OVS-TC PERFORMANCE REPORT Agilio CX 2x4GbE with OVS-TC OVS-TC WITH AN AGILIO CX SMARTNIC CAN IMPROVE A SIMPLE L2 FORWARDING USE CASE AT LEAST 2X. WHEN SCALED TO REAL LIFE USE CASES WITH COMPLEX RULES TUNNELING

More information

Red Hat CloudForms Hybrid Cloud Management (CL220)

Red Hat CloudForms Hybrid Cloud Management (CL220) Red Hat CloudForms Hybrid Cloud Management (CL220) DESCRIPTION: Course overview In this course, students use a hybrid environment, configure Red Hat CloudForms to work with Red Hat Virtualization and Red

More information

Making Nested Virtualization Real by Using Hardware Virtualization Features

Making Nested Virtualization Real by Using Hardware Virtualization Features Making Nested Virtualization Real by Using Hardware Virtualization Features May 28, 2013 Jun Nakajima Intel Corporation 1 Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL

More information

Agile CI/CD with Jenkins and/at ZeroStack. Kiran Bondalapati CTO, Co-Founder & Jenkins Admin ZeroStack, Inc. (

Agile CI/CD with Jenkins and/at ZeroStack. Kiran Bondalapati CTO, Co-Founder & Jenkins Admin ZeroStack, Inc. ( Agile CI/CD with Jenkins and/at ZeroStack Kiran Bondalapati CTO, Co-Founder & Jenkins Admin ZeroStack, Inc. (www.zerostack.com) Outline ZeroStack Hybrid Cloud Platform Jenkins and ZeroStack Jenkins at

More information

Baadal: the IITD computing cloud (Beta release)

Baadal: the IITD computing cloud (Beta release) Baadal: the IITD computing cloud (Beta release) The CSC has commissioned a new cloud computing environment for high performance computing based on 1. 32 blade servers each with 2x6 core Intel(R) Xeon(R)

More information

Title Text. Making OpenStack Work in an Existing Environment - Challenges and Solutions. Amrish Kapoor, Pushkar Acharya, Ken Hui, Roopak Parikh

Title Text. Making OpenStack Work in an Existing Environment - Challenges and Solutions. Amrish Kapoor, Pushkar Acharya, Ken Hui, Roopak Parikh Title Text Making OpenStack Work in an Existing Environment - Challenges and Solutions Amrish Kapoor, Pushkar Acharya, Ken Hui, Roopak Parikh About Platform9 Founded 2013, San Francisco Bay Area Founded

More information

Running Jobs in the Vacuum

Running Jobs in the Vacuum Running Jobs in the Vacuum A. McNab 1, F. Stagni 2 and M. Ubeda Garcia 2 1 School of Physics and Astronomy, University of Manchester, UK 2 CERN, Switzerland E-mail: andrew.mcnab@cern.ch fstagni@cern.ch

More information

BESIII Computing Model and most recent R&Ds

BESIII Computing Model and most recent R&Ds BESIII Computing Model and most recent R&Ds Marco Maggiora University of Turin and INFN Turin Workshop CCR INFN La Biodola, May 16 th - 20 th 2016 IHEP cluster New Resources Since Sep 2015 86 blade servers

More information

Distributed File Systems Issues. NFS (Network File System) AFS: Namespace. The Andrew File System (AFS) Operating Systems 11/19/2012 CSC 256/456 1

Distributed File Systems Issues. NFS (Network File System) AFS: Namespace. The Andrew File System (AFS) Operating Systems 11/19/2012 CSC 256/456 1 Distributed File Systems Issues NFS (Network File System) Naming and transparency (location transparency versus location independence) Host:local-name Attach remote directories (mount) Single global name

More information

Guillimin HPC Users Meeting. Bryan Caron

Guillimin HPC Users Meeting. Bryan Caron July 17, 2014 Bryan Caron bryan.caron@mcgill.ca McGill University / Calcul Québec / Compute Canada Montréal, QC Canada Outline Compute Canada News Upcoming Maintenance Downtime in August Storage System

More information

Increasing Cloud Power Efficiency through Consolidation Techniques

Increasing Cloud Power Efficiency through Consolidation Techniques Increasing Cloud Power Efficiency through Consolidation Techniques Antonio Corradi, Mario Fanelli, Luca Foschini Dipartimento di Elettronica, Informatica e Sistemistica (DEIS) University of Bologna, Italy

More information

Is remote GPU virtualization useful? Federico Silla Technical University of Valencia Spain

Is remote GPU virtualization useful? Federico Silla Technical University of Valencia Spain Is remote virtualization useful? Federico Silla Technical University of Valencia Spain st Outline What is remote virtualization? HPC Advisory Council Spain Conference 2015 2/57 We deal with s, obviously!

More information

AN 831: Intel FPGA SDK for OpenCL

AN 831: Intel FPGA SDK for OpenCL AN 831: Intel FPGA SDK for OpenCL Host Pipelined Multithread Subscribe Send Feedback Latest document on the web: PDF HTML Contents Contents 1 Intel FPGA SDK for OpenCL Host Pipelined Multithread...3 1.1

More information

SmartNIC Programming Models

SmartNIC Programming Models SmartNIC Programming Models Johann Tönsing 206--09 206 Open-NFP Agenda SmartNIC hardware Pre-programmed vs. custom (C and/or P4) firmware Programming models / offload models Switching on NIC, with SR-IOV

More information

Cloud Open Source Innovation on Software Defined Storage

Cloud Open Source Innovation on Software Defined Storage NorthEast ASIA OSS Promotion Forum Cloud Open Source Innovation on Software Defined Storage Hiroshi Miura Director of Japan OSS Promotion Forum OSS Cloud Evangelist, NTT DATA Corporation. Copyright 2014

More information

Building a Data-Friendly Platform for a Data- Driven Future

Building a Data-Friendly Platform for a Data- Driven Future Building a Data-Friendly Platform for a Data- Driven Future Benjamin Hindman - @benh 2016 Mesosphere, Inc. All Rights Reserved. INTRO $ whoami BENJAMIN HINDMAN Co-founder and Chief Architect of Mesosphere,

More information

Virtualization. A very short summary by Owen Synge

Virtualization. A very short summary by Owen Synge Virtualization A very short summary by Owen Synge Outline What is Virtulization? What's virtulization good for? What's virtualisation bad for? We had a workshop. What was presented? What did we do with

More information

The ATLAS EventIndex: an event catalogue for experiments collecting large amounts of data

The ATLAS EventIndex: an event catalogue for experiments collecting large amounts of data The ATLAS EventIndex: an event catalogue for experiments collecting large amounts of data D. Barberis 1*, J. Cranshaw 2, G. Dimitrov 3, A. Favareto 1, Á. Fernández Casaní 4, S. González de la Hoz 4, J.

More information

March 10 11, 2015 San Jose

March 10 11, 2015 San Jose March 10 11, 2015 San Jose Health monitoring & predictive analytics To lower the TCO in a datacenter Christian B. Madsen & Andrei Khurshudov Engineering Manager & Sr. Director Seagate Technology christian.b.madsen@seagate.com

More information

High Availability for Enterprise Clouds: Oracle Solaris Cluster and OpenStack

High Availability for Enterprise Clouds: Oracle Solaris Cluster and OpenStack High Availability for Enterprise Clouds: Oracle Solaris Cluster and OpenStack Eve Kleinknecht Principal Product Manager Thorsten Früauf Principal Software Engineer November 18, 2015 Safe Harbor Statement

More information

Shadowfax: Scaling in Heterogeneous Cluster Systems via GPGPU Assemblies

Shadowfax: Scaling in Heterogeneous Cluster Systems via GPGPU Assemblies Shadowfax: Scaling in Heterogeneous Cluster Systems via GPGPU Assemblies Alexander Merritt, Vishakha Gupta, Abhishek Verma, Ada Gavrilovska, Karsten Schwan {merritt.alex,abhishek.verma}@gatech.edu {vishakha,ada,schwan}@cc.gtaech.edu

More information

Achieve Low Latency NFV with Openstack*

Achieve Low Latency NFV with Openstack* Achieve Low Latency NFV with Openstack* Yunhong Jiang Yunhong.Jiang@intel.com *Other names and brands may be claimed as the property of others. Agenda NFV and network latency Why network latency on NFV

More information