Brief review of the HEPIX 2011 spring Darmstadt, 2-6 May

Similar documents
High-density Grid storage system optimization at ASGC. Shu-Ting Liao ASGC Operation team ISGC 2011

Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft. Presented by Manfred Alef Contributions of Jos van Wezel, Andreas Heiss

CC-IN2P3: A High Performance Data Center for Research

Physics Computing at CERN. Helge Meinhard CERN, IT Department OpenLab Student Lecture 21 July 2011

Ioan Raicu. Everyone else. More information at: Background? What do you want to get out of this course?

Parallel Computing at DESY Zeuthen. Introduction to Parallel Computing at DESY Zeuthen and the new cluster machines

The LHC Computing Grid

Physics Computing at CERN. Helge Meinhard CERN, IT Department OpenLab Student Lecture 27 July 2010

Batch Services at CERN: Status and Future Evolution

Ivane Javakhishvili Tbilisi State University High Energy Physics Institute HEPI TSU

Virtualization of the ATLAS Tier-2/3 environment on the HPC cluster NEMO

X Grid Engine. Where X stands for Oracle Univa Open Son of more to come...?!?

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN

HPC and IT Issues Session Agenda. Deployment of Simulation (Trends and Issues Impacting IT) Mapping HPC to Performance (Scaling, Technology Advances)

What s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

Distributed Computing Framework. A. Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille

Austrian Federated WLCG Tier-2

Considerations for a grid-based Physics Analysis Facility. Dietrich Liko

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

CERN and Scientific Computing

Cloud Computing. UCD IT Services Experience

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief

The INFN Tier1. 1. INFN-CNAF, Italy

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

ElastiCluster Automated provisioning of computational clusters in the cloud

How то Use HPC Resources Efficiently by a Message Oriented Framework.

COMPARING COST MODELS - DETAILS

Grid Computing Activities at KIT

VMs at a Tier-1 site. EGEE 09, Sander Klous, Nikhef

Data storage services at KEK/CRC -- status and plan

InfoBrief. Platform ROCKS Enterprise Edition Dell Cluster Software Offering. Key Points

BOSCO Architecture. Derek Weitzel University of Nebraska Lincoln

McAfee Virtual Network Security Platform 8.4 Revision A

UW-ATLAS Experiences with Condor

Batch Systems. Running calculations on HPC resources

Introducing the HTCondor-CE

Spanish Tier-2. Francisco Matorras (IFCA) Nicanor Colino (CIEMAT) F. Matorras N.Colino, Spain CMS T2,.6 March 2008"

Opportunities A Realistic Study of Costs Associated

COST EFFICIENCY VS ENERGY EFFICIENCY. Anna Lepak Universität Hamburg Seminar: Energy-Efficient Programming Wintersemester 2014/2015

Evaluating Hyperconverged Full Stack Solutions by, David Floyer

High-Energy Physics Data-Storage Challenges

Online data storage service strategy for the CERN computer Centre G. Cancio, D. Duellmann, M. Lamanna, A. Pace CERN, Geneva, Switzerland

A short introduction to the Worldwide LHC Computing Grid. Maarten Litmaath (CERN)

Power Systems for Your Business

Report on the HEPiX Virtualisation Working Group

Virtual Server Service

Grid Operation at Tokyo Tier-2 Centre for ATLAS

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute

Delegated Access for Hadoop Clusters in the Cloud

Developing Scientific Applications with the IBM Parallel Environment Developer Edition

Comet Virtualization Code & Design Sprint

WITH ACTIVEWATCH EXPERT BACKED, DETECTION AND THREAT RESPONSE BENEFITS HOW THREAT MANAGER WORKS SOLUTION OVERVIEW:

EGEE and Interoperation

Introduction 2 Load Testing

Second workshop for MARS administrators

12/04/ Dell Inc. All Rights Reserved. 1

McAfee Network Security Platform 9.2

Embedded Filesystems (Direct Client Access to Vice Partitions)

HPC Architectures. Types of resource currently in use

-Presented By : Rajeshwari Chatterjee Professor-Andrey Shevel Course: Computing Clusters Grid and Clouds ITMO University, St.

Accelerated Technology Refresh Project (ATRP) at VCH and PHC. Frequently Asked Questions

University at Buffalo Center for Computational Research

IT MANAGER PERMANENT SALARY SCALE: P07 (R ) Ref:AgriS042/2019 Information Technology Manager. Reporting to. Information Technology (IT)

COMPUTER SCIENCE DEPARTMENT IT INFRASTRUCTURE

150 million sensors deliver data. 40 million times per second

Batch system usage arm euthen F azo he Z J. B T

Outline. March 5, 2012 CIRMMT - McGill University 2

Introduction to High-Performance Computing (HPC)

5 Trends That Will Impact Your IT Planning in Layered Security. Executive Brief

The Center for High Performance Computing. Dell Breakfast Events 20 th June 2016 Happy Sithole

Managing Petabytes of data with irods. Jean-Yves Nief CC-IN2P3 France

Nutanix Two Schools Perspectives

epldt Web Builder Security March 2017

Storage and Storage Access

Magellan Project. Jeff Broughton NERSC Systems Department Head October 7, 2009

High Performance Computing Data Management. Philippe Trautmann BDM High Performance Computing Global Research

Practical Scientific Computing

NCP Computing Infrastructure & T2-PK-NCP Site Update. Saqib Haleem National Centre for Physics (NCP), Pakistan

ROLE DESCRIPTION IT SPECIALIST

irods usage at CC-IN2P3 Jean-Yves Nief

Oracle Real Application Clusters One Node

Layer Security White Paper

Computing / The DESY Grid Center

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan

Introduction to Amazon Web Services

Figure 1: cstcdie Grid Site architecture

Reduce costs and enhance user access with Lenovo Client Virtualization solutions

Design your network to aid forensics investigation

How is server administration different in the Cloud?

Managed Services Rely on us to manage your business services

Compact Muon Solenoid: Cyberinfrastructure Solutions. Ken Bloom UNL Cyberinfrastructure Workshop -- August 15, 2005

Geneva 10.0 System Requirements

Clouds at other sites T2-type computing

Oxford University Particle Physics Site Report

Cornell Red Cloud: Campus-based Hybrid Cloud. Steven Lee Cornell University Center for Advanced Computing

<Insert Picture Here> Managing Oracle Exadata Database Machine with Oracle Enterprise Manager 11g

The ATLAS EventIndex: an event catalogue for experiments collecting large amounts of data

Trusted Cloud protects your critical data by ensuring that no unauthorised code can run undetected on your critical server infrastructure.

CERN IT procurement update HEPiX Autumn/Fall 2018 at Port d Informació Científica

Transcription:

Brief review of the HEPIX 2011 spring Darmstadt, 2-6 May http://indico.cern.ch/conferencedisplay.py?confid=118192 Andrey Y Shevel 7 June 2011 Andrey Y Shevel 1

The presentation outlook HEPiX program Site reports Main topics from CERN GSI Batch systems Trends Commercialization HPC performance trends Security 7 June 2011 Andrey Y Shevel 2

Quick HEPiX program overview Site reports (15 minutes all reports) [16] Networking&Security [4] IT infrastructure [10] Computing [details] [6] Storage and file systems [8] Cloud, Grid and virtualization [8] Oracle [2 ] 7 June 2011 Andrey Y Shevel 3

Discussed problems Material life Electric consumption and Cooling Hardware & software Upgrade FTEs support Technical&Scientific Virtualization Computing Clouds Network Security Computing Grids IPv6 7 June 2011 Andrey Y Shevel 4

Site reports... GSI site report Fermilab Site Report - Spring 2011 HEPiX GridKa Site Report Nikhef site report CERN site report DESY Site Report RAL Site Report SLAC Site Report CC-IN2P3 Site-Report update IHEP (China) Site Report NDGF Site Report Petersburg Nuclear Physics Institute (PNPI) status report DLS site report BNL Site report ASGC site report PSI - Site report 7 June 2011 Andrey Y Shevel 5

Main info from CERN CERN: Record intensities, record luminosities Peaks of more than 6 GB/s to tape Frontier instead local databases Obligatory web-based security course (with test) 7 June 2011 Andrey Y Shevel 6

Virtualization Working group around CERN How to transfer the virtual images How to revoke virtual image How to trust virtual image How to create virtual image How to keep How to... 7 June 2011 Andrey Y Shevel 7

CERN Electronic Documents http://cds.cern.ch/ - CERN Document Server (80 contributors for 10 years) Hardware: 2x Dell Blade, Intel(R) Xeon(R) CPU E5410 @ 2.33GHz (4 cores), 16 GB RAM Http://indico.cern.ch - team: 2 LD staff, 1 fellow, 2 technical students, 1 internship student CVS, SVN and Twiki 2.2 FTE, Hardware: Server DELL Poweredge 1950 2.33GHz, 2 CPU/8 core 48 GB of memory 7 June 2011 Andrey Y Shevel 8

7 June 2011 Andrey Y Shevel 9

Computing Selecting a new batch system at CC-IN2P3 The candidates were: Condor, PBS-Pro, Torque/Maui, LSF, BQS, SGE, Loadleveler (IBM), SLURM (LLNL) Requirements Scalability, robustness, sharing quality, scheduler, AFS, worker nodes features, interface to the Grid, administration, information repository, accounting, parallel and interactive jobs, software support, procurement cost, maintenance cost Final choice was «SGE» (the second by criteria was «LSF») 7 June 2011 Andrey Y Shevel 10

Oracle and commercialization related issues Oracle corporation has bought Sun corporation Oracle stopped support for several software packages: Lustre and several others. In result people on HEPiX suggested to ask EU for money to keep Lustre free of charge for anybody (same procedure as Scientific Linux). Other commercializations: Similar commercialization we see with «IBM systems journal» and «IBM research journal», «Intel Technology Journal» (earlier they were available for free over Internet) 7 June 2011 Andrey Y Shevel 11

7 June 2011 Andrey Y Shevel 12

Security update SONY play station network: 77 million account were compromised (personal details, credit cards data). Other play sites almost the same. In academic area: SSH remain the primary intrusion vector Passwords+Keys: sniffed/copied and reused by attackers The vast majority of Linux incidents at CERN results from compromised account at other sites 7 June 112011 Andrey Y Shevel 13

Rootkit checkers Signature-based: rkhunter, chkrootkit, etc. Very efficient against known versions of rootkits Very easy to use Not so efficient if the rootkit is open source or maintained e.g. there are at least 10+ Phalanx updates snapshot and check: Samhain, Tripwire, Zeppoo, the99lb, etc. Good results, but often require significant work Little/no public rootkit checkers would detect DR rookits OSSECʼs rootcheck could detect a sample discovered last year (hidden process + wrong link count on the filesystem 7 June 2011 Andrey Y Shevel 14

Good ways to manage security risks at a reasonable cost? Strict access control of users (multifactor is a significant help) Tight security patching policy Periodic reinstallation of the nodes Prevent users to escalate as root (system hardening) Implement in-depth monitoring of the system Detailed log trails (remote syslog) Good incident response capability Ability to manage incidents as part of normal operations 7 June 2011 Andrey Y Shevel 15

Data loss might happen at any time Amazon's Cloud Crash Disaster Permanently Destroyed Many Customers' Data «Many days after the crash, Amazon still hasn't gotten its systems fully up and running again.» Read more: http://www.businessinsider.com/amazon-lost-data-2011-4 Need to have spare copies in independant sites. The same might happen with your own communicators, laptops, desktops, etc. at any time. 7 June 2011 Andrey Y Shevel 16

Thank you! Questions? 7 June 2011 Andrey Y Shevel 17