The Science DMZ: Evolution

Similar documents
Programmable Information Highway (with no Traffic Jams)

ESnet Update Winter 2008 Joint Techs Workshop

ESnet Update Summer 2008 Joint Techs Workshop

SubOptic 2007 May 15, 2007 Baltimore, MD

Engagement With Scientific Facilities

ESnet5 Deployment Lessons Learned

Implementation of the Pacific Research Platform over Pacific Wave

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

International Big Science Coming to Your Campus Soon (Sooner Than You Think )

Improving Network Infrastructure to Enable Large Scale Scientific Data Flows and Collaboration (Award # ) Klara Jelinkova Joseph Ghobrial

Discovery, Unconstrained by Geography

ESnet4: Networking for the Future of DOE Science

Network Support for Data Intensive Science

NLR Update: Backbone Upgrade Joint Techs July 2008

Internet2: Presentation to Astronomy Community at Haystack. T. Charles Yun April 2002

SLIDE 1 - COPYRIGHT 2015 ELEPHANT FLOWS IN THE ROOM: SCIENCEDMZ NATIONALLY DISTRIBUTED

ESnet s primary mission is to enable the largescale science that is the mission of the Office of Science (SC) and that depends on:

Enhancing Infrastructure: Success Stories

Challenges of Big Data Movement in support of the ESA Copernicus program and global research collaborations

ESnet Planning, Status, and Future Issues

Data Intensive Science Impact on Networks

Connectivity Services, Autobahn and New Services

SLATE. Services Layer at the Edge. First Meeting of the National Research Platform Montana State University August 7-8, 2017

Philippe Laurens, Michigan State University, for USATLAS. Atlas Great Lakes Tier 2 collocated at MSU and the University of Michigan

Canadian Networks for Particle Physics Research 2011 Report to the Standing Committee on Interregional Connectivity, ICFA Panel January 2012

New International Connectivities of SINET5

Virtual Circuits Landscape

ESnet Status Update. ESCC, July Networking for the Future of Science

A short introduction to the Worldwide LHC Computing Grid. Maarten Litmaath (CERN)

A Brief Overview of the Science DMZ

Network Architecture and Services to Support Large-Scale Science: An ESnet Perspective

US West-Coast Future Internet Infrastructure Pacific Wave Update Pacific Research Platform International Routing Research Collaboration

The Pacific Research Platform (PRP)

Production Petascale Climate Data Replication at NCI Lustre and our engagement with the Earth Systems Grid Federation (ESGF)

Globus Research Data Management: Campus Deployment and Configuration. Steve Tuecke Vas Vasiliadis

File Transfer: Basics and Best Practices. Joon Kim. Ph.D. PICSciE. Research Computing 09/07/2018

Achieving the Science DMZ

Brent Sweeny GRNOC at Indiana University APAN 32 (Delhi), 25 August 2011

Scaling Across the NRP Ecosystem From Campus to Regional to National - What Support Is There? 2NRP Workshop Bozeman, Montana Tuesday, August 7, 2018

Clemson HPC and Cloud Computing

Research Cyberinfrastructure Upgrade Proposal - CITI

Abilene: An Internet2 Backbone Network

Distributed e-infrastructures for data intensive science

The Grid: Processing the Data from the World s Largest Scientific Machine

Preparing for High-Luminosity LHC. Bob Jones CERN Bob.Jones <at> cern.ch

Enabling a SuperFacility with Software Defined Networking

ICN for Cloud Networking. Lotfi Benmohamed Advanced Network Technologies Division NIST Information Technology Laboratory

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

IRNC:RXP SDN / SDX Update

ESnet s (100G) SDN Testbed

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan

COMPUTE CANADA GLOBUS PORTAL

APAN Global Collaboration Linking the World with Light

International Climate Network Working Group (ICNWG) Meeting

HTC/HPC Russia-EC. V. Ilyin NRC Kurchatov Institite Moscow State University

The perfsonar Project at 10 Years: Status and Trajectory

The Science DMZ Design Pattern

CENIC2000. Internet2 and Global Development: Institutional Impact

America Connects to Europe (ACE) (Award # ) Year 7 Annual Report 1- Mar through 31- May Jennifer Schopf Principal Investigator

The ATLAS-Canada network

5 August 2010 Eric Boyd, Internet2 Deputy CTO

The LHC computing model and its evolution. Dr Bob Jones CERN

perfsonar Going Forward Eric Boyd, Internet2 Internet2 Technology Exchange September 27 th 2016

T0-T1-T2 networking. Vancouver, 31 August 2009 LHCOPN T0-T1-T2 Working Group

Developing Networking and Human Expertise in Support of International Science

Towards Network Awareness in LHC Computing

The Software Journey: from networks to visualization

The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists

Challenges and Evolution of the LHC Production Grid. April 13, 2011 Ian Fisk

Internet2 Advanced Network Services Today

Next Generation Networking and The HOPI Testbed

Climate Data Management using Globus

BUCKNELL S SCIENCE DMZ

ALICE Grid Activities in US

NTT Com Press Conference March 1, 2016 #enterprisecloud

LHC and LSST Use Cases

Grid Computing: dealing with GB/s dataflows

perfsonar Deployment on ESnet

Vasilis Maglaris. Chairman, NREN Policy Committee - GÉANT Consortium Coordinator, NOVI FIRE Project

By establishing IGTF, we are seeing

Storage and I/O requirements of the LHC experiments

CERN Network activities update

Design patterns for data-driven research acceleration

Clouds in High Energy Physics

Clouds at other sites T2-type computing

The Evolution of Exchange Points

GÉANT Services Supporting International Networking and Collaboration

Intercontinental Multi-Domain Monitoring for LHC with perfsonar

Managed Hosting Services

Presentation of the LHCONE Architecture document

Grid Computing a new tool for science

GÉANT Enabling Global R&E Collaboration. Thomas Fryer, DANTE AMERICAS Conference, CUDI Friday, 4 th October 2013

International Exchanges Current and Future

THOUGHTS ON SDN IN DATA INTENSIVE SCIENCE APPLICATIONS

Zhengyang Liu! Oct 25, Supported by NSF Grant OCI

DICE Diagnostic Service

Constant monitoring of multi-site network connectivity at the Tokyo Tier2 center

November 1 st 2010, Internet2 Fall Member Mee5ng Jason Zurawski Research Liaison

Network and Host Design to Facilitate High Performance Data Transfer

Transcription:

The Science DMZ: Evolution Eli Dart, ESnet CC-NIE PI Meeting Washington, DC May 1, 2014

Why Are We Doing This? It s good to build high-quality infrastructure As network engineers, we like building networks Anything worth doing is worth doing well But really what s a network worth? Networks have very little intrinsic value The value of a network is in what you can do with it In order to be valuable a network must be useful Who uses the Science DMZ? 5/5/14 2

One Experimental Data Flow Triples Network Utilization for Major HPC Center 5/5/14 3

Our Past: Network as Infrastructure SEAT 0 PNNL LBNL JGI 0 0 SUNN 0 SNLL LLNL Salt Lake 0 AMES 1 0 ANL 0 STAR 0 0 EQCH CLEV 0 0 0 PPPL GFDL PU Physics JLAB BNL LOSA SDSC 1 LASV ALBU 0 LANL SNLA 0 0 0 0 0 0 Geography U.S. Department is of Energy Office of Science only representational

Three Historical Inflection Points for Global Research Networks 1. Abundant capacity (88 λ x 0Gbps) 2. Programmability 3. Campus architectures newly optimized for data mobility. Science DMZ + NSF grants.

Our Future: Network as Instrument SEAT 0 PNNL LBNL JGI 0 0 SUNN 0 SNLL LLNL Salt Lake 0 AMES 1 0 ANL 0 STAR 0 0 EQCH CLEV 0 0 0 PPPL GFDL PU Physics JLAB BNL LOSA SDSC 1 LASV ALBU 0 LANL SNLA 0 0 0 0 Networks are at the heart of the telescope. - Roshene McCool, Signal Transport Engineer for SKA, at NORDUNET 2012 0 0 Geography U.S. Department is of Energy Office of Science only representational

Network-Centric View of Large Hadron Collider (@CERN) Q1: where does discovery occur? CERN T1 mile s kms France 350 565 Italy 570 920 UK 625 00 Netherlands 625 00 Germany 700 1185 Spain 850 1400 Nordic 1300 20 USA New York 3900 6300 USA - Chicago 4400 70 Canada BC 5200 8400 Taiwan 60 9850 Source: Bill Johnston The LHC Open Network Environment (LHCONE) The LHC Optical Private Network (LHCOPN) O(1-) meter O(-0) meters O(1) km 500-,000 km CERN Computer Center detector Level 1 and 2 triggers Level 3 trigger ~50 Gb/s (25Gb/s ATLAS, 25Gb/s CMS) 1 PB/s Q2: where does the instrument end? LHC Tier 0 Deep archive and send data to Tier 1 centers LHC Tier 1 Data Centers LHC Tier 2 Analysis Centers

Evolution of LHC Data Model In chronological order: 1. Copy as much data as feasible to analysis centers worldwide, with hierarchical distribution. 2. Relax the hierarchy and rely on caching. 3. Use federated data stores to fetch portions of relevant data sets from remote storage (anywhere), just before they re needed. Increasing faith in global science networks.

How Do We Build The Network Instrument? First, it all has to work Build it well, keep it clean Run perfsonar, take action based on data Next, people need to know it s there Engage with users, experiments, programs Find out who is doing what, and what they would like to do After that, scientists need to see value Can they do their work better? Are there things they could not do without the network? They probably need help maybe a lot of help Helping is good then we succeed together 5/5/14 9

Example Map Out A Security Policy What does the DMZ resource need to do? Single workflow involving a single remote resource Data ingest/export involving a single remote system Identify the remote system, understand the tools, write the filter Local resource dedicated to a collaboration Where are the other parts of the collaboration? Does the collaboration use specific tools (e.g. workflow engine)? Global data service Data service probably uses standard tools Data service ports open to entire Internet How tightly does it need to be filtered? Do a realistic risk assessment Don t forget the auxiliary services! DNS, NTP, SSH, OAuth, patch servers, outbound mail for status, etc. These can typically be more tightly controlled (they are typically local services) 5/5/14

Example Globus DTN, Single Workflow Lab1 DTN security filters Lab1 DTN GE DTN TCP ports 50000-500 DATA DTN Lab2 DTN GE Lab2 DTN security filters Lab1 Science DMZ TCP ports 443, 2811, 7512 TCP ports 443, 2811, 7512 Lab2 Science DMZ 0GE Orchestration Orchestration GE Lab1 Border Router Lab2 Border Router 0GE Amazon AWS GE ESnet Router 0GE ESnet 0GE ESnet Router Logical data path Logical control path Physical data path Physical control path Lab1 DTN security filters Lab2 DTN security filters 5/5/14 11

Example Globus DTN, Global Data Service Local DTN DATA TCP ports 50000-500 DATA DTN GE DTN security filters Science DMZ Orchestration TCP ports 443, 2811, 7512 0GE DTN Remote DTNs GE 0GE Site / Campus Border Router Amazon AWS DTN GE World Logical data path Physical data path Logical control path Physical control path 5/5/14 12

Example Requirements Analysis Often scientists know they need to do something, but don t know how to integrate the pieces Working together, networking people can help guide toward a solution Don t ask what network pieces they need ask what they are trying to do, then derive the requirements from the science ESnet has a formal requirements analysis process that incorporates these ideas (see Lauren s talk tomorrow) 5/5/14 13

Example Integration of Instruments Many scientific instruments come with embedded or attached computing systems Sequencers Electron microscopes Mass spec. machines Typically, no user serviceable parts inside So, how does this thing get integrated into a workflow? Have it mount the DTN over a back-to-back G Double copy the files Better yet, start working with the vendor on a better way 5/5/14 14

The Infrastructure View Is Not Enough It s necessary, but not sufficient. Infrastructure-only view regards networks as other, static, opaque The substrate must be solid, flexible, robust but that s not the whole story The differentiating value of a network, or a Science DMZ, does not come just from infrastructure, but also from: ü The people who run it ü The services it provides customized for science ü The audacity to try to accomplish new things 5/5/14 15

The Instrument View Inspires Us Innovative Capabilities Tailored for Science Partnerships and Outreach Operational Substrate that Scales Quickly, Cheaply, Flexibly What can a network instrument do? enable new discovery processes and workflows offer APIs for discovery, inspection, virtualization, and control ü for applications, middleware, and other networks decouple data acquisition, data storage, and computation (making geography irrelevant) 5/5/14 16

ESnet Vision: discovery unconstrained by geography. US R&E (DREN/Internet2/NLR) CANADA (CANARIE) ASIA-PACIFIC (ASGC/Kreonet2/ TWAREN) RUSSIA AND CHINA (GLORIAD) CANADA (CANARIE) LHCONE FRANCE (OpenTransit) CERN (USLHCNet) ASIA-PACIFIC (KAREN/KREONET2/ NUS-GP/ODN/ REANNZ/SINET/ TRANSPAC/TWAREN) SEATTLE PNNL RUSSIA AND CHINA (GLORIAD) ASIA-PACIFIC (BNP/HEPNET) AUSTRALIA (AARnet) LATIN AMERICA CLARA/CUDI SUNNYVALE ASIA-PACIFIC (ASCC/KAREN/ KREONET2/NUS-GP/ ODN/REANNZ/ SINET/TRANSPAC) LBNL SACRAMENTO SLAC BOISE US R&E (DREN/Internet2/ NASA) US R&E (NASA/NISN/ USDOI) DENVER US R&E (DREN/Internet2/ NISN/NLR) AMES CHICAGO KANSAS CITY FNAL ANL BOSTON BNL NEW YORK PPPL WASHINGTON DC JLAB US R&E (Internet2/ NLR) CERN CANADA (CANARIE) EUROPE (GÉANT/ NORDUNET) ASIA-PACIFIC (SINET) ORNL AUSTRALIA (AARnet) ALBUQUERQUE NASHVILLE EUROPE (GÉANT) ATLANTA LATIN AMERICA (AMPATH/CLARA) LATIN AMERICA (CLARA/CUDI) El PASO HOUSTON US R&E (DREN/Internet2/ NISN)

Links ESnet fasterdata knowledge base http://fasterdata.es.net/ Science DMZ paper http://www.es.net/assets/pubs_presos/sc13scidmz-final.pdf Science DMZ email list https://gab.es.net/mailman/listinfo/sciencedmz perfsonar http://fasterdata.es.net/performance-testing/perfsonar/ http://www.perfsonar.net/ Additional material http://fasterdata.es.net/science-dmz/ http://fasterdata.es.net/host-tuning/ 5/5/14 18

Thanks! Questions? Eli Dart dart@es.net http://www.es.net/ http://fasterdata.es.net/

Thanks! Questions? Eli Dart dart@es.net http://www.es.net/ http://fasterdata.es.net/