HepData status. Mike Whalley - IPPP Durham UK. DASPOS/DPHEP7 Workshop 21-22nd March 2012 CERN

Similar documents
HepData status+history

CEDAR: HepData, JetWeb and Rivet

Summary of the LHC Computing Review

Kibana, Grafana and Zeppelin on Monitoring data

Computing / The DESY Grid Center

Challenges and Evolution of the LHC Production Grid. April 13, 2011 Ian Fisk

Early experience with the Run 2 ATLAS analysis model

HammerCloud: A Stress Testing System for Distributed Analysis

CERN Open Data and Data Analysis Knowledge Preservation

Worldwide Production Distributed Data Management at the LHC. Brian Bockelman MSST 2010, 4 May 2010

The CMS Computing Model

Data Analysis in ATLAS. Graeme Stewart with thanks to Attila Krasznahorkay and Johannes Elmsheuser

Rivet. July , CERN

WLCG Transfers Dashboard: a Unified Monitoring Tool for Heterogeneous Data Transfers.

Expressing Parallelism with ROOT

Distributed Data Management with Storage Resource Broker in the UK

SLHC-PP DELIVERABLE REPORT EU DELIVERABLE: Document identifier: SLHC-PP-D v1.1. End of Month 03 (June 2008) 30/06/2008

CEDAR: tools for event generator tuning

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

Computing for LHC in Germany

High-Energy Physics Data-Storage Challenges

September Development of favorite collections & visualizing user search queries in CERN Document Server (CDS)

Open data and scientific reproducibility

Evolution of Cloud Computing in ATLAS

INSPIRE and SPIRES Log File Analysis

DESY at the LHC. Klaus Mőnig. On behalf of the ATLAS, CMS and the Grid/Tier2 communities

SUPERGEN Wind Wind Energy Technology

Towards Network Awareness in LHC Computing

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow

CSCS CERN videoconference CFD applications

where the Web was born Experience of Adding New Architectures to the LCG Production Environment

Test & Analysis Project aka statistical testing

LCG Conditions Database Project

Monitoring the ALICE Grid with MonALISA

Particle Track Reconstruction with Deep Learning

First Experience with LCG. Board of Sponsors 3 rd April 2009

PoS(EGICF12-EMITC2)106

Belle & Belle II. Takanori Hara (KEK) 9 June, 2015 DPHEP Collaboration CERN

SharePoint 2013 End User Level II

Benchmarking the ATLAS software through the Kit Validation engine

Andrea Sciabà CERN, Switzerland

Case Study. CMS for Management of Monetization Training Resources

Umweltbundesamt. Masaryk University Laboratory on Geoinformatics and Cartography

Recasting LHC analyses with MADANALYSIS 5

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

CERN Tape Archive (CTA) :

The IAC s Publications Archive. Monique Gómez & Jorge A. Pérez Prieto Instituto de Astrofísica de Canarias Tenerife, Spain

Big Computing and the Mitchell Institute for Fundamental Physics and Astronomy. David Toback

MICC 2017 Librarians and Evidence- Based Medical Practice: An Ever Closer Union. Michael Fanning, Customer Success - Training Manager 7 th June 2017

CERN and Scientific Computing

Edinburgh (ECDF) Update

Streamlining CASTOR to manage the LHC data torrent

Data Transfers Between LHC Grid Sites Dorian Kcira

UW-ATLAS Experiences with Condor

SharePoint 2013 End User Level II

We invented the Web. 20 years later we got Drupal.

Robin Dale RLG

Flexible Design for Simple Digital Library Tools and Services

Versatile Link and GBT Chipset Production

Deliverable No: D8.5

CIS-CAT Pro Dashboard Documentation

Repository In A Box (RIB)

Helix Nebula The Science Cloud

Implementing EDS. A ten step summary, as experienced at Hofstra University Library

UNIVERSITY EVENTS CALENDAR TIP SHEET

Grid and Cloud Activities in KISTI

Development of DKB ETL module in case of data conversion

Open Data and Data Analysis Preservation Services for LHC Experiments

Rudy Gonzales, D. J. Hagberg

IPEX The next version

Development of Contents Management System Based on Light-Weight Ontology

Software Requirements Specification for the Names project prototype

A Configurable Radiation Tolerant Dual-Ported Static RAM macro, designed in a 0.25 µm CMOS technology for applications in the LHC environment.

Batch Services at CERN: Status and Future Evolution

YaPPI Yet another Particle Property Interface

The LHC Computing Grid

Big Data Tools as Applied to ATLAS Event Data

WM2015 Conference, March 15 19, 2015, Phoenix, Arizona, USA

Reliability Engineering Analysis of ATLAS Data Reprocessing Campaigns

Profile Data Mining with PerfExplorer. Sameer Shende Performance Reseaerch Lab, University of Oregon

Verification and Diagnostics Framework in ATLAS Trigger/DAQ

ATLAS Nightly Build System Upgrade

GNU EPrints 2 Overview

The grid for LHC Data Analysis

THOUGHTS ON SDN IN DATA INTENSIVE SCIENCE APPLICATIONS

Improving Generators Interface to Support LHEF V3 Format

EUDAT. Towards a pan-european Collaborative Data Infrastructure. Damien Lecarpentier CSC-IT Center for Science, Finland EUDAT User Forum, Barcelona

Preparing for High-Luminosity LHC. Bob Jones CERN Bob.Jones <at> cern.ch

RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE

BIBTEX-based Manuscript Writing Support System for Researchers *

CMS Computing Model with Focus on German Tier1 Activities

Troubleshooting Guide for Search Settings. Why Are Some of My Papers Not Found? How Does the System Choose My Initial Settings?

EVACUATE PROJECT WEBSITE

Trivial And Non-Trivial Data Analysis for Geant4

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

Volunteer Computing at CERN

Review. Fundamentals of Website Development. Web Extensions Server side & Where is your JOB? The Department of Computer Science 11/30/2015

Status of the TORCH time-of-flight detector

Client-Side Detection of SQL Injection Attack

MT at NetApp This is how we do it

Transcription:

HepData status Mike Whalley - IPPP Durham UK DASPOS/DPHEP7 Workshop 21-22nd March 2012 CERN

Brief Introduction to HepData Aim - to compile published cross section data and make them available in a computer database Small group, based at IPPP, Durham U. (UK) - DBmanager/physicist+non-physicist assistant STFC(UK) funded - just received funding to October 2016. > 30 years, began in collaboration with PDG - original DB management system BDMS 2009 moved to more modern and long-term maintainable computing system based on MySQL and Java code - CEDAR

Experiment Paper Dataset...Dataset...Dataset... xaxis.. yaxis.. Bins Points Errors Java-Hibernate Object-relational mapping library relational database <==> OO model Apache-Tapestry open-source component oriented Java web application framework MySQL Database Hibernate Java OO Model Tapestry User Aida (Rivet) plot etc...

HepData - standard record type Output formats: html plain text AIDA - for RIVET PyRoot YODA mpl jhepwork plot (simple & advanced)

SUSY/Exotics non-standard record types At the beginning of 2011 we were asked (by the ATLAS SUSY group) if HepData could handle data sets other than the standard (2-D) cross section type data. Things like: SLHA files Tables of Acceptances & Efficiencies 3-D tables of signal cross sections We agreed to do this by creating a resource area on the main HepData server which was linked to the specific HepData main record.

Paper Dataset...Dataset...Dataset... xaxis.. Bins yaxis.. Points Errors MySQL Database Hibernate Java OO Model Tapestry Addition of resource file system on HepData server - linked into the specific data record linked using InspireId to record Resource Area SLHA Acc. Eff. etc.. Aida etc... (Rivet)

Beyond the standard record type arxiv:1203.2489 EPJ C72(2012)1976

SLHA file Expected # signal events Acceptance * efficiency

Contents of HepData 7882 records (=papers) 191 LHC ATLAS:95 CMS: 47 ALICE: 34 LHCB: 11 TOTEM: 3 LHCf: 1 ATLAS: CMS: 38/57 STDM 25/63 STDM 39/44 SUSY 1/32 SUSY 10/56 EXOT 2/65 EXOT 5/7 HION 5/20 HION 1/28 TOPQ 1/21 TOPQ 1/8 BPHYS 13/17 BPHYS

HepData - finding the record HepData has basic search facilities - EXP, RE, OBS, FSP, REF etc... + Inspire type searches (eg. title:xxx) + individual records linked from Inspire... + eg.. ATLAS publications pages... note the link to Rivet + working with Elsevier to place banner flag on their web page when paper has HepData record.

Inspire & HepData There are Inspire<==>HepData links in the records Plus, now: ( thanks to Piotr Praczyk - Inspire Group) * HepData data within and displayable in Inspire * Inspire search terms in HepData (eg keyword:supersymmetry)

Inspire & HepData

HepData - entering data At present all data entry is done at Durham by either myself or my assistant. Data files are sent to us by the experimenters which we convert into the required input format. Authors validate before data is transferred to the public database Before the LHC we had to ask...but now we are getting several request/week to enter data from the LHC experimenters! In the long term this situation needs to be changed. Need to get external encoders or experiments themselves to help by inputting direct into data records with Durham providing an overarching management role controlling the actual entry into the public database.

HepData - entering the data D.I.Y. We have been developing a web entry form + simplified entry language 1 2 3 4 Extracts the bibliographic metadata from arxiv to get started

HepData Usage As a metric to measure the use of HepData we have continued to count the number of distinct internet nodes, excluding robots etc., accessing per month. This remained steady from ~500 up to 2009, the start of the LHC, then has increased three-fold to the present.

HepData Summary Continue to compile cross sections based on publications >30years (3+1) years funded by STFC from Oct 2012 2009 moved from BDMS to new maintainable computing system based on Java OO data model & MySQL database. Expanded to include resource area for LHC extra data for eg. LHC acceptances, efficiencies SLHA files Inspire <=> Hepdata collaboration greatly increased HepData data in Inspire + Inspire fields searchable from HepData Record discovery (beyond direct keyword searching) expanded. URL link to/from Inspire URL link from LHC experiments publication pages Banner page URL link from Elsevier publications (Phys.Lett..) Proposed data input direct from experiments needed for future trial web based system being used/assessed by outside users Use figures increase 3-fold since 2009 now ~1500 distinct internet nodes per month.