Deep Learning Photon Identification in a SuperGranular Calorimeter

Similar documents
Data Quality Monitoring at CMS with Machine Learning

Precision Timing in High Pile-Up and Time-Based Vertex Reconstruction

Machine Learning for (fast) simulation

The HEP.TrkX Project: Deep Learning for Particle Tracking

MIP Reconstruction Techniques and Minimum Spanning Tree Clustering

Conditional Generative Adversarial Networks for Particle Physics

Machine Learning Software ROOT/TMVA

CMS Conference Report

PoS(High-pT physics09)036

Fast pattern recognition with the ATLAS L1Track trigger for the HL-LHC

Machine Learning in Data Quality Monitoring

Computing, Software and Deep Learning in High Energy Physics. NCSR DEMOKRITOS Feb Jean-Roch Vlimant

A New Segment Building Algorithm for the Cathode Strip Chambers in the CMS Experiment

Direct photon measurements in ALICE. Alexis Mas for the ALICE collaboration

arxiv: v1 [hep-ex] 14 Oct 2018

PoS(EPS-HEP2017)523. The CMS trigger in Run 2. Mia Tosi CERN

Study of the Higgs boson coupling to the top quark and of the b jet identification with the ATLAS experiment at the Large Hadron Collider.

Electron and Photon Reconstruction and Identification with the ATLAS Detector

ATLAS, CMS and LHCb Trigger systems for flavour physics

A Deep Learning primer

Particle Track Reconstruction with Deep Learning

b-jet identification at High Level Trigger in CMS

Data Reconstruction in Modern Particle Physics

Design of the new ATLAS Inner Tracker (ITk) for the High Luminosity LHC

Clustering algorithms and autoencoders for anomaly detection

Tau ID systematics in the Z lh cross section measurement

π ± Charge Exchange Cross Section on Liquid Argon

Tutorial on Machine Learning Tools

The Pandora Software Development Kit for. - Particle Flow Calorimetry. Journal of Physics: Conference Series. Related content.

How Can We Deliver Advanced Statistical Tools to Physicists. Ilya Narsky, Caltech

Application of Neural Networks to b-quark jet detection in Z b b

Analogue, Digital and Semi-Digital Energy Reconstruction in the CALICE AHCAL

ALICE ANALYSIS PRESERVATION. Mihaela Gheata DASPOS/DPHEP7 workshop

Polytechnic University of Tirana

ATLAS Simulation Computing Performance and Pile-Up Simulation in ATLAS

TPC Detector Response Simulation and Track Reconstruction

Stephen J. Gowdy (CERN) 12 th September 2012 XLDB Conference FINDING THE HIGGS IN THE HAYSTACK(S)

ATLAS NOTE. December 4, ATLAS offline reconstruction timing improvements for run-2. The ATLAS Collaboration. Abstract

Identification of the correct hard-scatter vertex at the Large Hadron Collider

Automated reconstruction of LAr events at Warwick. J.J. Back, G.J. Barker, S.B. Boyd, A.J. Bennieston, B. Morgan, YR

Convolutional Layer Pooling Layer Fully Connected Layer Regularization

Track reconstruction with the CMS tracking detector

Tracking and flavour tagging selection in the ATLAS High Level Trigger

INTRODUCTION TO DEEP LEARNING

Machine Learning 13. week

Deep Face Recognition. Nathan Sun

ILC Software Overview and recent developments

Tracking and Vertexing performance in CMS

GLAST tracking reconstruction Status Report

YOLO9000: Better, Faster, Stronger

Machine Learning (CSE 446): Practical Issues

Advanced Video Content Analysis and Video Compression (5LSH0), Module 8B

FAMOS: A Dynamically Configurable System for Fast Simulation and Reconstruction for CMS

The GAP project: GPU applications for High Level Trigger and Medical Imaging

Neural Networks and Deep Learning

Topics for the TKR Software Review Tracy Usher, Leon Rochester

1. Introduction. Outline

LAr Event Reconstruction with the PANDORA Software Development Kit

The GeantV prototype on KNL. Federico Carminati, Andrei Gheata and Sofia Vallecorsa for the GeantV team

First LHCb measurement with data from the LHC Run 2

CERN openlab II. CERN openlab and. Sverre Jarp CERN openlab CTO 16 September 2008

LArTPC Reconstruction Challenges

Performance of Tracking, b-tagging and Jet/MET reconstruction at the CMS High Level Trigger

Deep neural networks II

Outlier detection using autoencoders

Hall D and IT. at Internal Review of IT in the 12 GeV Era. Mark M. Ito. May 20, Hall D. Hall D and IT. M. Ito. Introduction.

PoS(TIPP2014)204. Tracking at High Level Trigger in CMS. Mia TOSI Universitá degli Studi di Padova e INFN (IT)

Object Recognition II

Weka ( )

The ALICE electromagnetic calorimeter high level triggers

How to discover the Higgs Boson in an Oracle database. Maaike Limper

PoS(ACAT08)101. An Overview of the b-tagging Algorithms in the CMS Offline Software. Christophe Saout

A Topologic Approach to Particle Flow PandoraPFA

Deep Learning with Intel DAAL

Neural Networks. Single-layer neural network. CSE 446: Machine Learning Emily Fox University of Washington March 10, /10/2017

Robustness Studies of the CMS Tracker for the LHC Upgrade Phase I

Keras: Handwritten Digit Recognition using MNIST Dataset

LCDG4 at NIU Status and Plans

TPC Detector Response Simulation and Track Reconstruction

arxiv: v2 [hep-ex] 14 Nov 2018

Machine Problem 8 - Mean Field Inference on Boltzman Machine

Simulation and data reconstruction framework slic & lcsim. Norman Graf, Jeremy McCormick SLAC HPS Collaboration Meeting May 27, 2011

TPC Detector Response Simulation and Track Reconstruction

Data Mining: Concepts and Techniques. Chapter 9 Classification: Support Vector Machines. Support Vector Machines (SVMs)

Recasting LHC analyses with MADANALYSIS 5

Simulation and Physics Studies for SiD. Norman Graf (for the Simulation & Reconstruction Team)

Monte Carlo programs

ATLAS PILE-UP AND OVERLAY SIMULATION

ATLAS NOTE ATLAS-CONF July 20, Commissioning of the ATLAS high-performance b-tagging algorithms in the 7 TeV collision data

SGD: Stochastic Gradient Descent

Spring 2010 Research Report Judson Benton Locke. High-Statistics Geant4 Simulations

Using the In-Memory Columnar Store to Perform Real-Time Analysis of CERN Data. Maaike Limper Emil Pilecki Manuel Martín Márquez

The Boundary Graph Supervised Learning Algorithm for Regression and Classification

PoS(IHEP-LHC-2011)002

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

The Compact Muon Solenoid Experiment. Conference Report. Mailing address: CMS CERN, CH-1211 GENEVA 23, Switzerland

A Topologic Approach to Particle Flow PandoraPFA

Figure 2: download MuRun2010B.csv (This is data from the first LHC run, taken in 2010)

Integrated CMOS sensor technologies for the CLIC tracker

The Belle II Software From Detector Signals to Physics Results

Transcription:

Deep Learning Photon Identification in a SuperGranular Calorimeter Nikolaus Howe Maurizio Pierini Jean-Roch Vlimant @ Williams College @ CERN @ Caltech 1

Outline Introduction to the problem What is Machine Learning Our work so far Limitations and next steps 2

The Problem 3

Introduction - Problem Want to take advantage of high granularity of future HEP detectors......can contemporary Machine Learning (ML) techniques play a significant role? Photon shower CALICE ECAL @ Fermilab Define ideal problem: Photon vs. Pi0 showers Focus on current cutting edge: Super-granular detector geometry Deep Neural Networks 4

Introduction - Problem Want to take advantage of high granularity of future HEP detectors......can contemporary Machine Learning (ML) techniques play a significant role? Photon shower CALICE ECAL @ Fermilab Define ideal problem: Photon vs. Pi0 showers Focus on current cutting edge: Super-granular detector geometry Deep Neural Networks 5

Introduction - Detector Photon vs. Pi0 showers in LCD ECAL Easy to simulate samples State of the art of the next-generation detector The ultimate granular calorimeter in (x,y) as well as z (3D problem) Very similar to classic ML problem (MNIST dataset, coming up later!) Photon shower CALICE ECAL @ Fermilab 6

Introduction - Particle ID ML in use in past & current experiments! Neural Networks @ LEP Boosted Decision Trees @ BaBar/Belle and the LHC PID - supervised classification problem: Current procedure: Get samples from simulation or detector Manually identify discriminating physics features (signal vs. background) Train algorithm (cut-based selection/bdt/nn/etc) to maximize separation Extract features and apply algorithm to classify new data Length? Width? Shape? Energy? See for instance CMS photon ID in Run I 7

Introduction - Particle ID ML in use in past & current experiments! Neural Networks @ LEP Boosted Decision Trees @ BaBar/Belle and the LHC PID - supervised classification problem: New Approach: Get samples from simulation or detector No feature identification use only raw data Train DNN to look at clusters (imaging problem!) for PID Apply DNN to classify new data 8

Introduction - Particle ID ML in use in past & current experiments! Neural Networks @ LEP Boosted Decision Trees @ BaBar/Belle and the LHC What kind of particle am I? PID - supervised classification problem: New Approach: Get samples from simulation or detector No feature identification use only raw data Train DNN to look at clusters (imaging problem!) for PID Apply DNN to classify new data 9

Machine Learning? 10

Machine Learning - MNIST Machine Learning: teaching computers to classify data Classic Example: MNIST handwritten digit dataset 70,000 28x28 centered arrays of labeled, handwritten digits 11

Machine Learning - Procedure Test Data Collect tons of labeled data Divide into train and test sets Train Data testing Use train set to learn to separate classes based on attributes training (train for multiple epochs until loss is small) Use test set to check model 12

Machine Learning - Neural Networks - Value at each node is weighted sum of previous nodes Probability of signal Start with random weights Get output Backpropagate! Repeat until small error Input data here width depth 13

Machine Learning - Hardware High dimensionality leads to highly parallelizable problem Use GPUs for training Currently using two NVIDIA GTX TITANs @ Caltech......because CERN has no GPUs :( 14

Machine Learning - Model Comparison How to compare techniques? Complexity / understandability Time to train Performance Probability Binary Classification Histogram 15 Give visual representation of Sensitivity vs. Specificity (aka True Positive Rate vs True Negative Rate) ROC Curve True Negative Rate ROC curves to compare performance Prediction 15 True Positive Rate

Particle ID - // to MNIST Imaging problem 2D vs 3D clusters 0 vs 8 analogy with gamma vs pi0 16

Our work 17

Project - Tools - Hardware Software 18

Extract ix, iy, iz, and E for each event; save in long python list (@ CERN) Send to Use ROOT to generate 100GeV singleparticle events in Geant4 LCD calorimeter (@ CERN) Caltech! Project - Data Preparation Parse into NumPy arrays; make 20x20x25 centered event arrays (@ Caltech) Store ~40,000 event gamma and pi0 datasets (@ Caltech) 19

Project - Network Topologies - - Three main types: Fully connected - Simple Generally fast (~1s per epoch) Branched and Multilayer Convolutional Even better feature recognition Takes long time to train (~180s per epoch) Convolutional - Preserves Structure Slower (~60s per epoch) 20

Project - Network Topologies - Fully Connected ~94% efficiency @ 10% false positive hmm! Signal Efficiency (True Positive) Training Error by Epoch Loss 1 - Mistag (True Negative) ROC Curve Epoch 21

Project - Network Topologies - Convolutional ~96% efficiency @ 10% false positive 1 - Mistag (True Negative) ROC Curve Training Error by Epoch Loss hmm! Signal Efficiency (True Positive) Epoch 22

Project - Network Topologies - Convolutional II ~97% efficiency @ 10% false positive Training Error by Epoch Loss 1 - Mistag (True Negative) ROC Curve Signal Efficiency (True Positive) Epoch 23

Project - Network Topologies - Convolutional II ~97% efficiency @ 10% false positive 1 - Mistag (True Negative) ROC Curve ~70% efficiency @ 10% false positive...clearly a more complicated problem Signal Efficiency (True Positive) 24

Limitations and Moving On 25

Project - Current Limitations How long to train? Best model? Loss keeps decreasing well after 100 epochs Don t have enough GPUs for quick training! Looking to get time on CSCS GPU cluster Loss??? 26 Epoch

Project - Current Limitations How long to train? Best model? Loss keeps decreasing well after 100 epochs Don t have enough GPUs for quick training! Looking to get time on CSCS GPU cluster ROC Curve Loss Epoch 1 - Mistag (True Negative) Training Error by Epoch 1000 epochs! ~98% efficiency @ 10% false positive Signal Efficiency (True Positive) 27

Project - Next weeks Identify ideal network topology for binary classification Scan over hyperparameters Regression on energy - Future Beyond 1-1 classification Provide general resource Jet ID Gluon vs quark vs boosted Higgs vs boosted Z vs boosted top Benchmark for professionals Publish on Open data as a ML standard (the MNIST of Particle Physics) 28

Big thank you to BU, Augusto, Maurizio & Jean-Roch!...any questions? 29

Backup Slides 30

31

32

https://titans.hep.caltech.edu:8182/notebooks/niki/plot_models.ipynb 33

One photon hits the calorimeter surface Two photons from a high-energy π0 merging into one cluster 34

Jet-constituent energy fraction as a function of the angular distance from the jet axis Particle ID - Importance 1) 2) 3) Fast yes/no answer based on what a cluster look like a) Problem reduced to reading the detector and evaluating the function b) No complex reconstruction involved c) Faster than actual reconstruction good for granular-calorimeter triggers at future colliders? (e.g. FCC, High-Luminosity LHC, etc) Potentially, could reach similar performances as traditional techniques a) Energy reconstruction from regression (already done offline with BDTs @LHC) b) Particle identification from classification Could be extended to more complex problems (identify jets of particles, rather than single particles) E = 2.5 TeV E = 15 TeV E = 2.5 TeV E = 15 TeV E = 2.5 TeV E = 15 TeV 35

Introduction - ML for Particle ID Machine learning successfully used for Particle Identification in HEP Neural Networks @ LEP Boosted Decision Trees @ BaBar/Belle and the LHC Particle Identification is a classical supervised classification problem: A training sample for signal and background available from real collisions and/or simulation A set of features identified, having discriminating power between signal and background. RAW data already manipulated to select an optimal set of physicsmotivated variables. Good for the physicist, but potentially limiting for the algorithm. A selection algorith (cut-based selection/bdt/nn/etc) trained to maximize the separation As a next step in this development line, we try to apply Deep Neural Networks Go back to RAW data Train a convolutional Neural Network to LOOK AT the clusters and see the pions [THE IMAGING CALORIMETRY] An example of high-level feature (i.e., not RAW data) R9 = (energy 3x3 cluster)/(energy 5x5 cluster) 36 See for instance CMS photon ID in Run I

Moving on... 1) 2) 3) In this study: a) Regression on energy as well as pid Potentially: a) Beyond 1-to-1 classification: Jet ID i) Gluon vs quark vs boosted Higgs vs boosted Z vs boosted top In the future: a) More complex algorithms: provide a benchmark for professionals i) Publish dataset on Open data as a standard (the MNIST of Particle Physics) how exactly does this work? Show the event display they made for us 37

Jet-constituent energy fraction as a function of the angular distance from the jet axis Particle ID - Importance E = 2.5 TeV Yes/No answer based on what a cluster looks like - no complex reconstruction Faster than actual reconstruction in triggers at future colliders? (e.g. FCC, High-Luminosity LHC, etc) Similar performance to traditional techniques for particle ID? E = 15 TeV E = 2.5 TeV E = 15 TeV E = 2.5 TeV E = 15 TeV Energy reconstruction from regression (already done offline with BDTs @LHC) More complex problems? Jet ID? Ok, so what have we done so far? 38