Data Challenges in Photon Science. Manuela Kuhn GridKa School 2016 Karlsruhe, 29th August 2016


Photon Science
> Exploration of tiny samples of nanomaterials
> Synchrotrons and free electron lasers generate extremely powerful and focused radiation
> The X-ray beams are so intense that they can reveal even the finest details
> Examples:
  Find the tiniest cracks and pores in a turbine blade or minute impurities in a semiconductor
  See the positions of individual atoms in a protein molecule
  With extremely short X-ray flashes it is even possible to observe ultrafast processes such as those that occur in a chemical reaction

Photon Science
> Visualizing a lost painting by Vincent van Gogh using X-ray fluorescence mapping

Synchrotron radiation
> Synchrotron radiation is the radiation emitted by a beam of electrons that is bent in a magnetic field
> The centripetal acceleration causes the electrons to radiate photons, which emerge tangentially to the curved path of the beam

Synchrotrons and Free Electron Lasers
> Both synchrotrons and FELs use a particle accelerator to generate the beam

Free Electron Lasers
> An FEL utilizes the synchrotron effect in order to create a linear photon beam
> The beam is similar to the one from lasers in its directionality and coherence
> The frequency range and power of the photon beam from an FEL can be controlled (this is not true for synchrotron radiation)

DESY
> Founded December 1959
> Accelerator Center
  Research, construction and operation
> Research topics
  Particle Physics (HEP)
    > e.g. Higgs@LHC, Gluon@PETRA
  Photon Science
    > X-ray crystallography, broad spectrum of application
  Astro Particle Physics
> 2 Sites
  Hamburg
  Zeuthen (Brandenburg), near Berlin
> ~2300 employees, 3000 guest scientists annually

Light Sources on the DESY campus
> PETRA III: With a circumference of 2.3 km, the biggest and most brilliant synchrotron light source in the world
> FLASH: The first free-electron laser worldwide to produce femtosecond pulses of soft X-rays
> European XFEL: Once finished, this facility will deliver hard X-ray pulses far shorter than those from any other X-ray source, and their peak brilliance will be up to eight orders of magnitude higher

Example - schematic
[Figure: beamline schematic; labels: sample, mirror, undulator, detector, storage & analysis]


Data Life Cycle
> Apply for an experiment
> Preparation
> Start of the experiment
> Data acquisition
> Activities during the experiment
> Stop of the experiment
> Data access after the experiment
> Data archival

Data Life Cycle
> Apply for an experiment
  Submit research proposals or experiment applications
  Complete all administrative steps required before and after the experiment
  Challenge: The photon science community is not really aware of the computing & storage problem: 100s of small groups, few computing experts, no ecosystem

Data Life Cycle
> Preparation
  Retrofitting of the experiment station
    > Integrate brought-in equipment into the facility environment
  Start / stop commissioning

Data Life Cycle
> Start of the experiment
  System setup:
    Give access to the storage space
    Configure access control for the users
    Configure exports and endpoints
  Challenges:
    Lazy account/credential handling
    Mix of OSs at experiment stations: Windows & Linux of all ages and conditions (HW too)
    Black box detector PCs
    Open access vs. safety (brought-in equipment even has root)

Detectors

Detector       OS/Access           File size/rate                    Bandwidth
Pilatus 300k   Linux (black box)   1.2 MB files @ 200 Hz             240 MB/s
Pilatus 6M     Linux (black box)   25 MB files @ 25 Hz               625 MB/s
                                   7 MB files @ 100 Hz               700 MB/s
PCO Edge       Windows             8 MB files @ 100 Hz               800 MB/s
PerkinElmer    Windows             16 MB + 700 Byte files @ 15 Hz    240 MB/s
Lambda         Linux               60 Gb/s @ 2000 Hz                 7.5 GB/s
Eiger          HTTP (black box)    30 Gb/s @ 2000 Hz                 3.8 GB/s

[Figures: Pilatus 6M, PCO Edge, Lambda]
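The bandwidth column follows from the other columns by simple arithmetic: file size times frame rate for the file-based detectors, and a division by 8 to convert Gb/s to GB/s for the streaming detectors. A minimal Python sketch of that check is given below; the function names are illustrative, decimal units are assumed, and the small 700-byte PerkinElmer companion files are ignored.

```python
def file_bandwidth_mb_s(file_size_mb, rate_hz):
    """Sustained bandwidth in MB/s for files of file_size_mb written rate_hz times per second."""
    return file_size_mb * rate_hz


def gbit_to_gbyte_s(gbit_per_s):
    """Convert a stream rate from Gb/s to GB/s (8 bits per byte)."""
    return gbit_per_s / 8


# File-based detectors: (file size in MB, frame rate in Hz), as listed in the table.
file_detectors = {
    "Pilatus 300k": (1.2, 200),
    "Pilatus 6M (25 Hz)": (25, 25),
    "Pilatus 6M (100 Hz)": (7, 100),
    "PCO Edge": (8, 100),
    "PerkinElmer": (16, 15),  # the 700-byte companion files are ignored here
}

# Streaming detectors: stream rate in Gb/s, as listed in the table.
stream_detectors = {"Lambda": 60, "Eiger": 30}

for name, (size_mb, rate_hz) in file_detectors.items():
    print(f"{name}: {file_bandwidth_mb_s(size_mb, rate_hz):.0f} MB/s")
for name, gbit in stream_detectors.items():
    print(f"{name}: {gbit_to_gbyte_s(gbit):.2f} GB/s")
```

For Eiger the conversion gives 3.75 GB/s, which the table rounds to 3.8 GB/s.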

Data Life Cycle
> Data Acquisition
  Challenges:
    Data rates of the detectors
      > Highly demanding detectors
      > Multiple detectors in parallel
    Data flow from experiment stations to storage infrastructure (network limitation)
    Data distribution (to storage system, online analysis, ...)
    Data reduction (e.g. XFEL expects 50 GB/s of data)
    Wide variety of data formats
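To make the network limitation and the data reduction challenge concrete, the following Python sketch adds up the rates of detectors running in parallel and estimates how many network links would be needed, and how much data accumulates at a sustained rate. The 10 Gb/s link speed and the 80 % usable fraction are assumptions chosen purely for illustration, not figures from the talk; the detector rates and the 50 GB/s value are the ones quoted on the slides.

```python
import math


def links_needed(rates_mb_s, link_gbit_s=10.0, usable_fraction=0.8):
    """Return (aggregate rate in MB/s, number of links required).

    link_gbit_s and usable_fraction are illustrative assumptions.
    """
    total_mb_s = sum(rates_mb_s)
    # Usable capacity of one link in MB/s (decimal units, 8 bits per byte).
    usable_mb_s = link_gbit_s * 1000 / 8 * usable_fraction
    return total_mb_s, math.ceil(total_mb_s / usable_mb_s)


def volume_tb(rate_gb_s, seconds):
    """Data volume in TB accumulated at a sustained rate (decimal units)."""
    return rate_gb_s * seconds / 1000


# Pilatus 6M at 100 Hz (700 MB/s) and PCO Edge (800 MB/s) in parallel.
total, links = links_needed([700, 800])
print(f"aggregate {total} MB/s needs {links} link(s)")

# One hour at the 50 GB/s expected for XFEL.
print(f"{volume_tb(50, 3600):.0f} TB per hour")
```

Under these assumptions two detectors already exceed a single link, and one hour at 50 GB/s would amount to 180 TB, which illustrates why data reduction is listed as a challenge.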

Data Life Cycle
> Activities during the experiment
  System monitoring
    > Equipment
    > Storage system
  Data monitoring and analysis
    > Live view
    > Hit rate detection
    > Dark+gain correction
    > 3D reconstruction
  Parallel data access from outside the facility
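As an illustration of two of the online analysis steps named above, the sketch below shows a generic dark+gain correction and a simple hit rate estimate in plain Python. It is not the method of any specific tool used at the facility: the convention of multiplying by a per-pixel gain factor after dark subtraction, the hit criterion (a minimum number of pixels above a threshold) and all names are assumptions made for the example.

```python
def correct_frame(raw, dark, gain):
    """Subtract the dark frame pixel by pixel, then scale by the per-pixel gain factor."""
    return [(r - d) * g for r, d, g in zip(raw, dark, gain)]


def is_hit(frame, threshold, min_pixels):
    """A frame counts as a hit if at least min_pixels pixels exceed threshold."""
    return sum(1 for value in frame if value > threshold) >= min_pixels


def hit_rate(frames, dark, gain, threshold, min_pixels):
    """Fraction of frames classified as hits after dark+gain correction."""
    if not frames:
        return 0.0
    hits = sum(
        is_hit(correct_frame(frame, dark, gain), threshold, min_pixels)
        for frame in frames
    )
    return hits / len(frames)


# Tiny 4-pixel "frames" for demonstration only.
dark = [1, 1, 1, 1]
gain = [2.0, 2.0, 2.0, 2.0]
frames = [
    [1, 1, 1, 1],  # only dark signal
    [6, 6, 1, 1],  # two bright pixels
]
print(hit_rate(frames, dark, gain, threshold=5, min_pixels=2))
```

In a real setup the frames would be detector images processed with an array library; flat lists are used here only to keep the example free of dependencies.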

Data Life Cycle
> Activities during the experiment
  Challenges
    Data distribution
    Access data without interfering with data taking
    Online analysis has to run on powerful infrastructure with fast access to the data
    Guarantee data safety even with external access during the experiment

Data Life Cycle
> Stop of the experiment
  Removal of exports and endpoints
  Data not accessible for next user group
  Extract brought-in equipment
    > Remove access to facility infrastructure

Data Access after the experiment
> Offline analysis on- and off-site
> Challenges
  Authenticated access to data
  Analysis infrastructure on-site
  Manage authentication
  Brought-in analysis software
  Fast data export

Data Life Cycle
> Data Archival
  Data copied into long term storage
  Data access after archival
  Challenges
    How long preserved?
    Technology
    Data export
    Access control

Example of the realization - ASAP3
> Beamline Filesystem (Beamline-FS)
  Optimized for the ingestion of data in high speed bursts
> Core Filesystem (Core-FS)
  Optimized for capacity and parallel concurrent access
> The ASAP3 project was developed for PETRA III within the LSDMA portfolio extension of the Helmholtz Association
> Covers all data challenges mentioned on the previous slides
[Diagram labels: Sandbox per Beamline; Analysis; Data Export and Management]

ASAP3 - The User's View of the System
> Apply for Beamtime
  Apply for experiment
  Metadata source for experiment
> Exp. Preparation
  Retrofitting of the experiment place
  Start/stop commissioning
> Start
  Automated system setup
  Create filesets
  Create default directories
  NFSv4 ACL setup
  Initialize Gamma-Portal
> Data Acquisition
  Write data to Beamline-FS
  Automatic migration to Core-FS (4 minutes)
  Machine based authentication
  Data distribution with HiDRA
> Online Activities
  System monitoring
  Data monitoring
    > OnDA
    > Live-Viewer
  Data access
    > Maxwell resources (HPC)
    > SMB/NFS access
  Requires DESY account
> Stop
  Remove fileset on Beamline-FS
  Data not visible for next group
  Fileset on Core-FS remains
> Offline Data Access
  Maxwell resources (HPC)
    > CPU and GPU
    > Native GPFS
    > SLURM (partially)
    > DESY account required
  Gamma Portal
    > DOOR account required
  NFS/SMB
    > DESY account required
> Data Archival
  7 days after stop of the experiment
  Data copied to dCache
  Copied to tape library for long term storage
  Accessible for user
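The two time-based rules on this slide, migration from Beamline-FS to Core-FS and archival 7 days after the stop of the experiment, can be sketched in a few lines of Python. This is only a model of the timing logic, not the ASAP3 implementation: it assumes that "(4 minutes)" means a file becomes eligible for migration once it is 4 minutes old, and the function names and file names are hypothetical.

```python
from datetime import datetime, timedelta

# Timing values taken from the slide; their exact semantics are an assumption.
MIGRATION_DELAY = timedelta(minutes=4)
ARCHIVE_DELAY = timedelta(days=7)


def files_due_for_migration(written_at, now):
    """Return the names of files whose age has reached MIGRATION_DELAY.

    written_at maps a file name to the time it was written to Beamline-FS.
    """
    return sorted(
        name for name, written in written_at.items()
        if now - written >= MIGRATION_DELAY
    )


def archive_date(stop_time):
    """Date on which data would be copied to the archive after the experiment stops."""
    return stop_time + ARCHIVE_DELAY


now = datetime(2016, 8, 29, 12, 0)
written_at = {
    "scan_0001.cbf": now - timedelta(minutes=5),  # hypothetical file names
    "scan_0002.cbf": now - timedelta(minutes=1),
}
print(files_due_for_migration(written_at, now))
print(archive_date(datetime(2016, 8, 29)))
```

Keeping both delays as named constants makes the policy visible in one place, which is convenient when the same rules have to be applied per beamline.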

Current Status and Outlook
> ASAP3 has been running successfully for one and a half years
> FLASH is currently joining
> Other DESY labs (detector development, microscopy, ...) join
> XFEL: similar architecture + components (ASAP3 as blueprint)
ASAP3 will become the only system for the DESY light sources and labs

> http://photon-science.desy.de/
> https://www.xfel.eu/
> http://www.helmholtz-lsdma.de/
> https://confluence.desy.de/display/asap3/asap3++data+storage+for+petra+iii
> Strutz et al (2015) ASAP3 - New Data Taking and Analysis Infrastructure for PETRA III. J. Phys.: Conf. Ser. (JPCS), Volume 664, doi:10.1088/1742-6596/664/00/001001
> Mariani et al (2016) OnDA: online data analysis and feedback for serial X-ray imaging. J. Appl. Cryst. 49, 1073-1080. doi:10.1107/s1600576716007469