A framework for image analysis in plant breeding. Paul van Schayck

Size: px
Start display at page:

Download "A framework for image analysis in plant breeding. Paul van Schayck"

Transcription

1 A framework for image analysis in plant breeding Paul van Schayck

2 Paul van Schayck Wageningen University MSc Biotechnology (molecular life science) Nanoscopy Data management

3 Image analysis in plant breeding Marker assisted breeding Markers: Molecular Biochemical Morphological Shape, color and quantitative Tool for breeding From melon to cucumber

4 Examples of image analysis used in plant breeding An ImageJ plugin for plant variety testing Polder, G., Blokker, G., Heijden, G.W.A.M. van der (2012) In: Proceedings of the ImageJ User and Developer Conference 2012, October 2012, Mondorf-les-Bains, Luxembourg. - : Centre de Recherche Public Henri Tudor, p

5 Existing practice of image analysis Use of popular open source tools

6 Current work flow of image analysis Control flow Images and results Windows Server Via remote desktop Pipeline Macro End users: Breeders Researchers Network disks Portable disks SMB Local disks Storage

7 Current challenges Increasing size of datasets Data storage decentralized Little structure in metadata: Identification Results Quality assurance User interface

8 Current challenges Difficult end-user experience

9 Aim of project Improve end-user experience Store metadata of results and images Improve data storage capabilities Improve performance scalability Improve quality assurance

10 Discovering OMERO

11 Pipeline of image analysis Pipeline common library OMERO.Web: OMERO. Processor: Control script (Python) Macro Plugin OMERO.Insight: High Performance Computing cluster Control flow Data flow (images and annotations) Validation image Results

12 Progress monitoring Making use of jobhandle.setmessage()

13 Infrastructure of the framework Big-data storage VMware node NFS Image data OMERO.Web: Client NFS slurm DRMAA Slurm HPC nodes HPC cluster

14 Acquisition of images into OMERO OMERO. Importer Identification metadata Barcode scanner Image data NAS storage

15 Quality Assurance Version control (SVN) Doc generation (Sphinx) Automated testing Functional and acceptance (Python nose) Nightly tests. Reporting to monitoring system (PRTG). Testing environment Monitoring system availability and regressions

16 Test case: Shape parameters of carrot Determine different shape parameters Crop selection

17 Step 1: Acquisition and calibration Batch-wise import Color and light control script Image data After Before NAS storage HPC Cluster

18 Step 2: Classification and segmentation Crucial step Time-consuming calculations control script HPC cluster

19 Step 3: Measurement of parameters Results in CSV format FileAnnotation control script Validation image Results HPC cluster

20 Conclusions of the carrot test case Further implementation: Validation Integration Web Client User Results

21 Conclusions Improved end-user experience Improved end-user experience

22 Conclusions Before Improved end user experience Centralized storage After Network disks Image data Portable disks Local disks Storage Centralized NAS storage Centralized storage

23 Conclusions Improved end user experience Centralized storage Storage of metadata Storage of metadata

24 Conclusions Improved end user experience Centralized storage Storage of metadata Performance scalability Image data HPC nodes NAS storage HPC cluster Performance scalability

25 Conclusions Improved end user experience Centralized storage Storage of metadata Performance scalability Complexity for developers Software dependencies At the cost of More steps required by developers Software dependencies

26 Conclusions Improved end user experience Centralized storage Storage of metadata Performance scalability Complexity for developers Software dependencies Quality assurance Quality assurance

27 Acknowledgements Raimond Ravelli Peter Peters Jan-Willem Borst

Michal Kuneš

Michal Kuneš The Open Microscopy Environment A DataBase for the storage and manipulation of image data Michal Kuneš xkunes@utia.cas.cz ZOI UTIA, ASCR, Friday seminar 13.12.2013 OMERO http://www.openmicroscopy.org/site/support/omero4/users/index.html

More information

The walkthrough is available at /

The walkthrough is available at   / The walkthrough is available at https://downloads.openmicroscopy.org/presentations/2018/gbi-sydney / Description We will demonstrate a number of features of the OMERO platform using an OMERO server based

More information

You need to use the URL provided by your institute s OMERO administrator to access the OMERO.web client.

You need to use the URL provided by your institute s OMERO administrator to access the OMERO.web client. 1 OMERO.web Client Using OMERO.web to view and work with image data via a web browser. You need to use the URL provided by your institute s OMERO administrator to access the OMERO.web client. Logging in,

More information

1 Download the latest version of ImageJ for your platform from the website:

1 Download the latest version of ImageJ for your platform from the website: Using ImageJ with OMERO 4.4 This covers and version 4.4.x and the initial section assumes you have never installed ImageJ or Fiji before. For OMERO versions 5.1.x and 5.0.x see the Using ImageJ with OMERO

More information

2. RELATED WORK 3.DISEASE IDENTIFICATION SYSTEM IN VEGETABLES

2. RELATED WORK 3.DISEASE IDENTIFICATION SYSTEM IN VEGETABLES Volume 119 No. 10 2018, 1863-1869 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu DISEASE SEGMENTATION IN VEGETABLES USING K-MEANS CLUSTERING B.RAMAPRIYA

More information

Jozef Cernak, Marek Kocan, Eva Cernakova (P. J. Safarik University in Kosice, Kosice, Slovak Republic)

Jozef Cernak, Marek Kocan, Eva Cernakova (P. J. Safarik University in Kosice, Kosice, Slovak Republic) ARC tools for revision and nightly functional tests Jozef Cernak, Marek Kocan, Eva Cernakova (P. J. Safarik University in Kosice, Kosice, Slovak Republic) Outline Testing strategy in ARC ARC-EMI testing

More information

TOPIM-2018 schedule. Session 1: Introduction to OMERO & CellProfiler

TOPIM-2018 schedule. Session 1: Introduction to OMERO & CellProfiler Presentations are available at https://downloads.openmicroscopy.org/presentations/2018/topim-crete/ TOPIM-2018 schedule Thursday 12th July: 19:40-20:15 (35 mins) OMERO workshop introduction Friday 13th

More information

Kobe Open Workshop 25/10/2018. Software versions used for the workshop: Workshop content. Resources:

Kobe Open Workshop 25/10/2018. Software versions used for the workshop: Workshop content. Resources: Kobe Open Workshop 25/10/2018 The presentation and a PDF version of the workshop are available at https://downloads.openmicroscopy.org/presentations/2018/oow-kobe/ Software versions used for the workshop:

More information

Image Processing with KNIME

Image Processing with KNIME Image Processing with KNIME Who we are?! Martin Horn Martin.horn@uni-konstanz.de (+49) 07531 88-5017 Z815 Active Segmentation Christian Dietz Christian.dietz@uni-konstanz.de (+49) 07531 88-3641 Z815 Active

More information

Visualization on BioHPC

Visualization on BioHPC Visualization on BioHPC [web] [email] portal.biohpc.swmed.edu biohpc-help@utsouthwestern.edu 1 Updated for 2015-09-16 Outline What is Visualization - Scientific Visualization - Work flow for Visualization

More information

DURING THIS TWO-DAY CLASS, PARTICIPANTS WILL:

DURING THIS TWO-DAY CLASS, PARTICIPANTS WILL: Nuix Administration Nuix Administration ADVANCE TWO-DAY INSTRUCTOR-LED COURSE Nuix Administration is a two-day training course designed to assist Nuix administrators in properly configuring and troubleshooting

More information

CMPUT 695 Fall 2004 Assignment 2 Xelopes

CMPUT 695 Fall 2004 Assignment 2 Xelopes CMPUT 695 Fall 2004 Assignment 2 Xelopes Paul Nalos, Ben Chu November 5, 2004 1 Introduction We evaluated Xelopes, a data mining library produced by prudsys 1. Xelopes is available for Java, C++, and CORBA

More information

Cloud Computing 3. CSCI 4850/5850 High-Performance Computing Spring 2018

Cloud Computing 3. CSCI 4850/5850 High-Performance Computing Spring 2018 Cloud Computing 3 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning

More information

HMS Image Management Core

HMS Image Management Core HMS A NEW SERVICE FOR HMS RESEARCHERS Jay Copeland HMS Harvard Medical School The Promise of Data Reuse and Sharing Genomics 2 The Current Norm With Image Sharing 3 Challenges: Many File and Metadata Formats

More information

Prof. Konstantinos Krampis Office: Rm. 467F Belfer Research Building Phone: (212) Fax: (212)

Prof. Konstantinos Krampis Office: Rm. 467F Belfer Research Building Phone: (212) Fax: (212) Director: Prof. Konstantinos Krampis agbiotec@gmail.com Office: Rm. 467F Belfer Research Building Phone: (212) 396-6930 Fax: (212) 650 3565 Facility Consultant:Carlos Lijeron 1/8 carlos@carotech.com Office:

More information

Global Non-Rigid Alignment. Benedict J. Brown Katholieke Universiteit Leuven

Global Non-Rigid Alignment. Benedict J. Brown Katholieke Universiteit Leuven Global Non-Rigid Alignment Benedict J. Brown Katholieke Universiteit Leuven 3-D Scanning Pipeline Acquisition Scanners acquire data from a single viewpoint 3-D Scanning Pipeline Acquisition Alignment 3-D

More information

An Introduction to Apache Spark

An Introduction to Apache Spark An Introduction to Apache Spark 1 History Developed in 2009 at UC Berkeley AMPLab. Open sourced in 2010. Spark becomes one of the largest big-data projects with more 400 contributors in 50+ organizations

More information

Next Generation Storage for The Software-Defned World

Next Generation Storage for The Software-Defned World ` Next Generation Storage for The Software-Defned World John Hofer Solution Architect Red Hat, Inc. BUSINESS PAINS DEMAND NEW MODELS CLOUD ARCHITECTURES PROPRIETARY/TRADITIONAL ARCHITECTURES High up-front

More information

Designing an institutional research data management infrastructure for the life sciences

Designing an institutional research data management infrastructure for the life sciences Designing an institutional research data management infrastructure for the life sciences Paul van Schayck PhD student, data steward Maastricht University Medical Center + p.vanschayck@maastrichtuniversity.nl

More information

Metadata Models for Experimental Science Data Management

Metadata Models for Experimental Science Data Management Metadata Models for Experimental Science Data Management Brian Matthews Facilities Programme Manager Scientific Computing Department, STFC Co-Chair RDA Photon and Neutron Science Interest Group Task lead,

More information

Choosing an Interface

Choosing an Interface White Paper SUSE Enterprise Storage SUSE Enterprise Storage is a versatile, self-healing, and fault-tolerant storage alternative. The Ceph storage system embedded in SUSE Enterprise Storage is an object-based

More information

Introduction. Loading Images

Introduction. Loading Images Introduction CellProfiler is a free Open Source software for automated image analysis. Versions for Mac, Windows and Linux are available and can be downloaded at: http://www.cellprofiler.org/. CellProfiler

More information

CloudMan cloud clusters for everyone

CloudMan cloud clusters for everyone CloudMan cloud clusters for everyone Enis Afgan usecloudman.org This is accessibility! But only sometimes So, there are alternatives BUT WHAT IF YOU WANT YOUR OWN, QUICKLY The big picture A. Users in different

More information

Continuous Integration and Deployment (CI/CD)

Continuous Integration and Deployment (CI/CD) WHITEPAPER OCT 2015 Table of contents Chapter 1. Introduction... 3 Chapter 2. Continuous Integration... 4 Chapter 3. Continuous Deployment... 6 2 Chapter 1: Introduction Apcera Support Team October 2015

More information

Processing Genomics Data: High Performance Computing meets Big Data. Jan Fostier

Processing Genomics Data: High Performance Computing meets Big Data. Jan Fostier Processing Genomics Data: High Performance Computing meets Big Data Jan Fostier Traditional HPC way of doing things Communication network (Infiniband) Lots of communication c c c c c Lots of computations

More information

CSinParallel Workshop. OnRamp: An Interactive Learning Portal for Parallel Computing Environments

CSinParallel Workshop. OnRamp: An Interactive Learning Portal for Parallel Computing Environments CSinParallel Workshop : An Interactive Learning for Parallel Computing Environments Samantha Foley ssfoley@cs.uwlax.edu http://cs.uwlax.edu/~ssfoley Josh Hursey jjhursey@cs.uwlax.edu http://cs.uwlax.edu/~jjhursey/

More information

Multiple Usage of KNIME in a Screening Laboratory Environment

Multiple Usage of KNIME in a Screening Laboratory Environment Multiple Usage of KNIME in a Screening Laboratory Environment KNIME UGM Zürich, 02.02.2012 Marc Bickle HT-TDS, MPI-CBG Outline Presentation of TDS Our problem: large complex datasets KNIME as data mining

More information

CO 2 DAS Data Ingestion System Design Version 1.0 Charles J Antonelli 1 April 2011

CO 2 DAS Data Ingestion System Design Version 1.0 Charles J Antonelli 1 April 2011 CO 2 DAS Data Ingestion System Design Version 1.0 Charles J Antonelli 1 April 2011 Introduction The CO2DAS Data Ingestion System (DIS) is responsible for discovering and remembering sources of data, and

More information

70-414: Implementing an Advanced Server Infrastructure Course 01 - Creating the Virtualization Infrastructure

70-414: Implementing an Advanced Server Infrastructure Course 01 - Creating the Virtualization Infrastructure 70-414: Implementing an Advanced Server Infrastructure Course 01 - Creating the Virtualization Infrastructure Slide 1 Creating the Virtualization Infrastructure Slide 2 Introducing Microsoft System Center

More information

Alaris S2000 Series Scanner Firmware Release Notes

Alaris S2000 Series Scanner Firmware Release Notes Firmware Version 181201 Summary Purpose of Release: This is the updated release of the firmware for the S2050 and S2070 Date: December 6, 2018 S2000 Driver Installation Disk Version: 206 Use s2000_181201_installerexe

More information

Description. Speaker Patrizia Monteduro (International Consultant, FAO) TRAINING GEONETWORK OPENSOURCE Islamabad, Pakistan, Jan 29-31, 2014

Description. Speaker Patrizia Monteduro (International Consultant, FAO) TRAINING GEONETWORK OPENSOURCE Islamabad, Pakistan, Jan 29-31, 2014 BUILDING PROVINCIAL CAPACITY IN PAKISTAN FOR CROP ESTIMATION, FORECASTING, AND REPORTING BASED ON THE INTEGRATED USE OF REMOTELY SENSED DATA (GCP/PAK/125/USA) Description Speaker Patrizia Monteduro (International

More information

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context 1 Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes

More information

Fault-Tolerant Parallel Analysis of Millisecond-scale Molecular Dynamics Trajectories. Tiankai Tu D. E. Shaw Research

Fault-Tolerant Parallel Analysis of Millisecond-scale Molecular Dynamics Trajectories. Tiankai Tu D. E. Shaw Research Fault-Tolerant Parallel Analysis of Millisecond-scale Molecular Dynamics Trajectories Tiankai Tu D. E. Shaw Research Anton: A Special-Purpose Parallel Machine for MD Simulations 2 Routine Data Analysis

More information

Based on Big Data: Hype or Hallelujah? by Elena Baralis

Based on Big Data: Hype or Hallelujah? by Elena Baralis Based on Big Data: Hype or Hallelujah? by Elena Baralis http://dbdmg.polito.it/wordpress/wp-content/uploads/2010/12/bigdata_2015_2x.pdf 1 3 February 2010 Google detected flu outbreak two weeks ahead of

More information

AutoTx. (and more ) Smart automatic background data transfer for Windows. Niko Ehrenfeuchter - Imaging Core Facility - University of Basel

AutoTx. (and more ) Smart automatic background data transfer for Windows. Niko Ehrenfeuchter - Imaging Core Facility - University of Basel AutoTx (and more ) Smart automatic background data transfer for Windows AutoTx Smart automatic background data transfer for Windows Windows Service based on.net 4.5 Targets an ActiveDirectory environment

More information

Groovy in Jenkins. Ioannis K. Moutsatsos. Repurposing Jenkins for Life Sciences Data Pipelining

Groovy in Jenkins. Ioannis K. Moutsatsos. Repurposing Jenkins for Life Sciences Data Pipelining Groovy in Jenkins Ioannis K. Moutsatsos Repurposing Jenkins for Life Sciences Data Pipelining Who Am I? Research scientist at local pharmaceutical company Software engineer Open Source advocate and contributor

More information

Virtualization with VMware ESX and VirtualCenter SMB to Enterprise

Virtualization with VMware ESX and VirtualCenter SMB to Enterprise Virtualization with VMware ESX and VirtualCenter SMB to Enterprise This class is an intense, four-day introduction to virtualization using VMware s immensely popular Virtual Infrastructure suite including

More information

Automatic Dependency Management for Scientific Applications on Clusters. Ben Tovar*, Nicholas Hazekamp, Nathaniel Kremer-Herman, Douglas Thain

Automatic Dependency Management for Scientific Applications on Clusters. Ben Tovar*, Nicholas Hazekamp, Nathaniel Kremer-Herman, Douglas Thain Automatic Dependency Management for Scientific Applications on Clusters Ben Tovar*, Nicholas Hazekamp, Nathaniel Kremer-Herman, Douglas Thain Where users are Scientist says: "This demo task runs on my

More information

FuncX: A Function Serving Platform for HPC. Ryan Chard 28 Jan 2019

FuncX: A Function Serving Platform for HPC. Ryan Chard 28 Jan 2019 FuncX: A Function Serving Platform for HPC Ryan Chard 28 Jan 2019 Outline - Motivation FuncX: FaaS for HPC Implementation status Preliminary applications - Machine learning inference Automating analysis

More information

Data Analysis in ATLAS. Graeme Stewart with thanks to Attila Krasznahorkay and Johannes Elmsheuser

Data Analysis in ATLAS. Graeme Stewart with thanks to Attila Krasznahorkay and Johannes Elmsheuser Data Analysis in ATLAS Graeme Stewart with thanks to Attila Krasznahorkay and Johannes Elmsheuser 1 ATLAS Data Flow into Analysis RAW detector data and simulated RDO data are reconstructed into our xaod

More information

Building an Exotic HPC Ecosystem at The University of Tulsa

Building an Exotic HPC Ecosystem at The University of Tulsa Building an Exotic HPC Ecosystem at The University of Tulsa John Hale Peter Hawrylak Andrew Kongs Changing Our CS Culture Platforms Desktop, workstations, mobile Programming Java, python Serial Changing

More information

Handling and Processing Big Data for Biomedical Discovery with MATLAB

Handling and Processing Big Data for Biomedical Discovery with MATLAB Handling and Processing Big Data for Biomedical Discovery with MATLAB Raphaël Thierry, PhD Image processing and analysis Software development Facility for Advanced Microscopy and Imaging 23 th June 2016

More information

ReFrame: A Regression Testing Framework Enabling Continuous Integration of Large HPC Systems

ReFrame: A Regression Testing Framework Enabling Continuous Integration of Large HPC Systems ReFrame: A Regression Testing Framework Enabling Continuous Integration of Large HPC Systems HPC Advisory Council 2018 Victor Holanda, Vasileios Karakasis, CSCS Apr. 11, 2018 ReFrame in a nutshell Regression

More information

DiskSavvy Disk Space Analyzer. DiskSavvy DISK SPACE ANALYZER. User Manual. Version Dec Flexense Ltd.

DiskSavvy Disk Space Analyzer. DiskSavvy DISK SPACE ANALYZER. User Manual. Version Dec Flexense Ltd. DiskSavvy DISK SPACE ANALYZER User Manual Version 10.3 Dec 2017 www.disksavvy.com info@flexense.com 1 1 Product Overview...3 2 Product Versions...7 3 Using Desktop Versions...8 3.1 Product Installation

More information

Data Protection Guide

Data Protection Guide SnapCenter Software 2.0 Data Protection Guide For Windows File Systems January 2017 215-11356_A0 doccomments@netapp.com Table of Contents 3 Contents Deciding whether to read this information... 5 SnapCenter

More information

VxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS

VxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS VxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS VMware customers trust their infrastructure to vsan #1 Leading SDS Vendor >10,000 >100 83% vsan Customers Countries Deployed Critical Apps

More information

Task-based distributed processing for radio-interferometric imaging with CASA

Task-based distributed processing for radio-interferometric imaging with CASA H2020-Astronomy ESFRI and Research Infrastructure Cluster (Grant Agreement number: 653477). Task-based distributed processing for radio-interferometric imaging with CASA BOJAN NIKOLIC 2 nd ASTERICS-OBELICS

More information

Managing CAE Simulation Workloads in Cluster Environments

Managing CAE Simulation Workloads in Cluster Environments Managing CAE Simulation Workloads in Cluster Environments Michael Humphrey V.P. Enterprise Computing Altair Engineering humphrey@altair.com June 2003 Copyright 2003 Altair Engineering, Inc. All rights

More information

Dealing with Large Datasets. or, So I have 40TB of data.. Jonathan Dursi, SciNet/CITA, University of Toronto

Dealing with Large Datasets. or, So I have 40TB of data.. Jonathan Dursi, SciNet/CITA, University of Toronto Dealing with Large Datasets or, So I have 40TB of data.. Jonathan Dursi, SciNet/CITA, University of Toronto Data is getting bigger Increase in computing power makes simulations larger/more frequent Increase

More information

KNIME What s new?! Bernd Wiswedel KNIME.com AG, Zurich, Switzerland

KNIME What s new?! Bernd Wiswedel KNIME.com AG, Zurich, Switzerland KNIME What s new?! Bernd Wiswedel KNIME.com AG, Zurich, Switzerland Data Access ASCII (File/CSV Reader, ) Excel Web Services Remote Files (http, ftp, ) Other domain standards (e.g. Sdf) Databases Data

More information

NVIDIA DATA LOADING LIBRARY (DALI)

NVIDIA DATA LOADING LIBRARY (DALI) NVIDIA DATA LOADING LIBRARY (DALI) RN-09096-001 _v01 September 2018 Release Notes TABLE OF CONTENTS Chapter Chapter Chapter Chapter Chapter 1. 2. 3. 4. 5. DALI DALI DALI DALI DALI Overview...1 Release

More information

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan Storage Virtualization Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan Storage Virtualization In computer science, storage virtualization uses virtualization to enable better functionality

More information

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype?

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype? Big data hype? Big Data: Hype or Hallelujah? Data Base and Data Mining Group of 2 Google Flu trends On the Internet February 2010 detected flu outbreak two weeks ahead of CDC data Nowcasting http://www.internetlivestats.com/

More information

Dac-Man: Data Change Management for Scientific Datasets on HPC systems

Dac-Man: Data Change Management for Scientific Datasets on HPC systems Dac-Man: Data Change Management for Scientific Datasets on HPC systems Devarshi Ghoshal Lavanya Ramakrishnan Deborah Agarwal Lawrence Berkeley National Laboratory Email: dghoshal@lbl.gov Motivation Data

More information

StarWind Virtual SAN Automating Management with SMI-S in System Center Virtual Machine Manager 2016

StarWind Virtual SAN Automating Management with SMI-S in System Center Virtual Machine Manager 2016 One Stop Virtualization Shop StarWind Virtual SAN Automating Management with SMI-S in System Center Virtual Machine Manager 2016 FEBRUARY 2018 TECHNICAL PAPER Trademarks StarWind, StarWind Software and

More information

THE EMC ISILON STORY. Big Data In The Enterprise. Deya Bassiouni Isilon Regional Sales Manager Emerging Africa, Egypt & Lebanon.

THE EMC ISILON STORY. Big Data In The Enterprise. Deya Bassiouni Isilon Regional Sales Manager Emerging Africa, Egypt & Lebanon. THE EMC ISILON STORY Big Data In The Enterprise Deya Bassiouni Isilon Regional Sales Manager Emerging Africa, Egypt & Lebanon August, 2012 1 Big Data In The Enterprise Isilon Overview Isilon Technology

More information

Manual. HTPheno. A Novel Image Analysis Pipeline for High-Throughput Plant Phenotyping

Manual. HTPheno. A Novel Image Analysis Pipeline for High-Throughput Plant Phenotyping Manual HTPheno A Novel Image Analysis Pipeline for High-Throughput Plant Phenotyping July 14, 2010 Contents 1 Basic step-by-step setup 1 1.1 Define the image object classes................. 2 1.2 Define

More information

DiskBoss DATA MANAGEMENT

DiskBoss DATA MANAGEMENT DiskBoss DATA MANAGEMENT Disk Change Monitor Version 9.3 May 2018 www.diskboss.com info@flexense.com 1 1 Product Overview DiskBoss is an automated, policy-based data management solution allowing one to

More information

Belnet and BrENIAC. High Performance Network for High Performance Computing. Jan Ooghe, Herman Moons KU Leuven BNC Oktober

Belnet and BrENIAC. High Performance Network for High Performance Computing. Jan Ooghe, Herman Moons KU Leuven BNC Oktober Belnet and BrENIAC High Performance Network for High Performance Computing Jan Ooghe, Herman Moons KU Leuven BNC 2016 25 Oktober Supercomputing? A supercomputer is a computer that is considered, or was

More information

Chuck Cartledge, PhD. 12 October 2018

Chuck Cartledge, PhD. 12 October 2018 Big Data: Data Analysis Boot Camp Introduction and Overview Chuck Cartledge, PhD 12 October 2018 1/14 Table of contents (1 of 1) 1 Introduction The global view 2 Overview The world from 50,000 feet. Text

More information

Cloud Bursting Jim Thompson Avere Systems

Cloud Bursting Jim Thompson Avere Systems Cloud Bursting Jim Thompson Avere Systems Why Use Someone Else s Cloud? Significantly reduce infrastructure management costs both in money and time Maintain operational flexibility during scale-out jobs

More information

The CEDA Archive: Data, Services and Infrastructure

The CEDA Archive: Data, Services and Infrastructure The CEDA Archive: Data, Services and Infrastructure Kevin Marsh Centre for Environmental Data Archival (CEDA) www.ceda.ac.uk with thanks to V. Bennett, P. Kershaw, S. Donegan and the rest of the CEDA Team

More information

How Scalable is your SMB?

How Scalable is your SMB? How Scalable is your SMB? Mark Rabinovich Visuality Systems Ltd. What is this all about? Visuality Systems Ltd. provides SMB solutions from 1998. NQE (Embedded) is an implementation of SMB client/server

More information

A COMMON SOFTWARE FRAMEWORK FOR FEL DATA ACQUISITION AND EXPERIMENT MANAGEMENT AT FERMI

A COMMON SOFTWARE FRAMEWORK FOR FEL DATA ACQUISITION AND EXPERIMENT MANAGEMENT AT FERMI A COMMON SOFTWARE FRAMEWORK FOR FEL DATA ACQUISITION AND EXPERIMENT MANAGEMENT AT FERMI R. Borghes, V. Chenda, A. Curri, G. Kourousias, M. Lonza, G. Passos, M. Prica, R. Pugliese 1 FERMI overview FERMI

More information

Windows Server 2012 Top Ten

Windows Server 2012 Top Ten Who am I? 11 Time Microsoft MVP Author of the Windows FAQ Senior Contributing Editor for Windows IT Pro magazine Latest book available Microsoft Virtualization Secrets Speaker at Tech Ed, Windows Connections

More information

BUSINESS DATA LAKE FADI FAKHOURI, SR. SYSTEMS ENGINEER, ISILON SPECIALIST. Copyright 2016 EMC Corporation. All rights reserved.

BUSINESS DATA LAKE FADI FAKHOURI, SR. SYSTEMS ENGINEER, ISILON SPECIALIST. Copyright 2016 EMC Corporation. All rights reserved. BUSINESS DATA LAKE FADI FAKHOURI, SR. SYSTEMS ENGINEER, ISILON SPECIALIST 1 UNSTRUCTURED DATA GROWTH 75% 78% 80% 2015 71 EB 2016 106 EB 2017 133 EB Total Capacity Shipped, Worldwide % of Unstructured Data

More information

2/26/2017. Originally developed at the University of California - Berkeley's AMPLab

2/26/2017. Originally developed at the University of California - Berkeley's AMPLab Apache is a fast and general engine for large-scale data processing aims at achieving the following goals in the Big data context Generality: diverse workloads, operators, job sizes Low latency: sub-second

More information

Characterizing Private Clouds: A Large-Scale Empirical Analysis of Enterprise Clusters

Characterizing Private Clouds: A Large-Scale Empirical Analysis of Enterprise Clusters Characterizing Private Clouds: A Large-Scale Empirical Analysis of Enterprise Clusters Ignacio Cano, Srinivas Aiyar, Arvind Krishnamurthy University of Washington Nutanix Inc. ACM Symposium on Cloud Computing

More information

HortiCube: A Platform for Transparent, Trusted Data Sharing in the Food Supply Chain

HortiCube: A Platform for Transparent, Trusted Data Sharing in the Food Supply Chain Available online at www.centmapress.org Proceedings in System Dynamics and Innovation in Food Networks 2016 HortiCube: A Platform for Transparent, Trusted Data Sharing in the Food Supply Chain Jack Verhoosel

More information

On-the-Fly Data Race Detection in MPI One-Sided Communication

On-the-Fly Data Race Detection in MPI One-Sided Communication On-the-Fly Data Race Detection in MPI One-Sided Communication Presentation Master Thesis Simon Schwitanski (schwitanski@itc.rwth-aachen.de) Joachim Protze (protze@itc.rwth-aachen.de) Prof. Dr. Matthias

More information

Harp-DAAL for High Performance Big Data Computing

Harp-DAAL for High Performance Big Data Computing Harp-DAAL for High Performance Big Data Computing Large-scale data analytics is revolutionizing many business and scientific domains. Easy-touse scalable parallel techniques are necessary to process big

More information

Atrium Webinar- What's new in ADDM Version 10

Atrium Webinar- What's new in ADDM Version 10 Atrium Webinar- What's new in ADDM Version 10 This document provides question and answers discussed during following webinar session: Atrium Webinar- What's new in ADDM Version 10 on May 8th, 2014 Q: Hi,

More information

Kerberos & HPC Batch systems. Matthieu Hautreux (CEA/DAM/DIF)

Kerberos & HPC Batch systems. Matthieu Hautreux (CEA/DAM/DIF) Kerberos & HPC Batch systems Matthieu Hautreux (CEA/DAM/DIF) matthieu.hautreux@cea.fr Outline Kerberos authentication HPC site environment Kerberos & HPC systems AUKS From HPC site to HPC Grid environment

More information

Community edition(open-source) Enterprise edition

Community edition(open-source) Enterprise edition Suseela Bhaskaruni Rapid Miner is an environment for machine learning and data mining experiments. Widely used for both research and real-world data mining tasks. Software versions: Community edition(open-source)

More information

S i m p l i f y i n g A d m i n i s t r a t i o n a n d M a n a g e m e n t P r o c e s s e s i n t h e P o l i s h N a t i o n a l C l u s t e r

S i m p l i f y i n g A d m i n i s t r a t i o n a n d M a n a g e m e n t P r o c e s s e s i n t h e P o l i s h N a t i o n a l C l u s t e r S i m p l i f y i n g A d m i n i s t r a t i o n a n d M a n a g e m e n t P r o c e s s e s i n t h e P o l i s h N a t i o n a l C l u s t e r Miroslaw Kupczyk, Norbert Meyer, Pawel Wolniewicz e-mail:

More information

Parallel & Scalable Machine Learning Introduction to Machine Learning Algorithms

Parallel & Scalable Machine Learning Introduction to Machine Learning Algorithms Parallel & Scalable Machine Learning Introduction to Machine Learning Algorithms Dr. Ing. Morris Riedel Adjunct Associated Professor School of Engineering and Natural Sciences, University of Iceland Research

More information

Recommender System for Personalization in. Daniel Mican Nicolae Tomai

Recommender System for Personalization in. Daniel Mican Nicolae Tomai Association-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications Daniel Mican Nicolae Tomai Introduction The ability of a web application to offer personalised content

More information

VIDAEXPERT: DATA ANALYSIS Here is the Statistics button.

VIDAEXPERT: DATA ANALYSIS Here is the Statistics button. Here is the Statistics button. After creating dataset you can analyze it in different ways. First, you can calculate statistics. Open Statistics dialog, Common tabsheet, click Calculate. Min, Max: minimal

More information

ImageJ2 Directions & Goals

ImageJ2 Directions & Goals ImageJ2 Directions & Goals Strengths of ImageJ Cross-platform and public domain Macro language makes batch processing easy Wealth of existing image processing plugins Large science-oriented community of

More information

EMC DATA PROTECTION, FAILOVER AND FAILBACK, AND RESOURCE REPURPOSING IN A PHYSICAL SECURITY ENVIRONMENT

EMC DATA PROTECTION, FAILOVER AND FAILBACK, AND RESOURCE REPURPOSING IN A PHYSICAL SECURITY ENVIRONMENT White Paper EMC DATA PROTECTION, FAILOVER AND FAILBACK, AND RESOURCE REPURPOSING IN A PHYSICAL SECURITY ENVIRONMENT Genetec Omnicast, EMC VPLEX, Symmetrix VMAX, CLARiiON Provide seamless local or metropolitan

More information

Insight: Measurement Tool. User Guide

Insight: Measurement Tool. User Guide OMERO Beta v2.2: Measurement Tool User Guide - 1 - October 2007 Insight: Measurement Tool User Guide Open Microscopy Environment: http://www.openmicroscopy.org OMERO Beta v2.2: Measurement Tool User Guide

More information

INTRODUCTION TO NEXTFLOW

INTRODUCTION TO NEXTFLOW INTRODUCTION TO NEXTFLOW Paolo Di Tommaso, CRG NETTAB workshop - Roma October 25th, 2016 @PaoloDiTommaso Research software engineer Comparative Bioinformatics, Notredame Lab Center for Genomic Regulation

More information

escience in the Cloud Dan Fay Director Earth, Energy and Environment

escience in the Cloud Dan Fay Director Earth, Energy and Environment escience in the Cloud Dan Fay Director Earth, Energy and Environment dan.fay@microsoft.com New ways to analyze and communicate data EOS Article: Mountain Hydrology, Snow Color, and the Fourth Paradigm

More information

Introduction to BioHPC New User Training

Introduction to BioHPC New User Training Introduction to BioHPC New User Training [web] [email] portal.biohpc.swmed.edu biohpc-help@utsouthwestern.edu 1 Updated for 2019-02-06 Overview Today we re going to cover: What is BioHPC? How do I access

More information

Hyper-V Innovations for the SMB. IT Pro Camp, Northwest Florida State College, Niceville, FL, October 5, 2013

Hyper-V Innovations for the SMB. IT Pro Camp, Northwest Florida State College, Niceville, FL, October 5, 2013 Hyper-V Innovations for the SMB IT Pro Camp, Northwest Florida State College, Niceville, FL, October 5, 2013 Ben Jones Systems Engineer 7 years of experience Microsoft Certified Solutions Expert: Server

More information

Parallel learning of content recommendations using map- reduce

Parallel learning of content recommendations using map- reduce Parallel learning of content recommendations using map- reduce Michael Percy Stanford University Abstract In this paper, machine learning within the map- reduce paradigm for ranking

More information

ISSN: International Journal of Science, Engineering and Technology Research (IJSETR) Volume 6, Issue 1, January 2017

ISSN: International Journal of Science, Engineering and Technology Research (IJSETR) Volume 6, Issue 1, January 2017 AUTOMATIC WEED DETECTION AND SMART HERBICIDE SPRAY ROBOT FOR CORN FIELDS G.Sowmya (1), J.Srikanth (2) MTech(VLSI&ES), Mallareddy Institute Of Technology (1) Assistant Professor at Mallareddy Institute

More information

Veritas Storage Foundation in a VMware ESX Environment

Veritas Storage Foundation in a VMware ESX Environment Veritas Storage Foundation in a VMware ESX Environment Linux and Solaris x64 platforms January 2011 TABLE OF CONTENTS Introduction... 3 Executive Summary... 4 Overview... 5 Virtual Machine File System...

More information

LOCI, Fiji & ImageJ2

LOCI, Fiji & ImageJ2 LOCI, Fiji & ImageJ2 Philosophy of optical research In vivo developmental biology Capture everything possible about a sample Multiple dimensions Emission spectra Lifetime Cell polarization Greater resolution

More information

SnapCenter Software 4.0 Concepts Guide

SnapCenter Software 4.0 Concepts Guide SnapCenter Software 4.0 Concepts Guide May 2018 215-12925_D0 doccomments@netapp.com Table of Contents 3 Contents Deciding whether to use the Concepts Guide... 7 SnapCenter overview... 8 SnapCenter architecture...

More information

Software Defined Storage. Reality or BS?

Software Defined Storage. Reality or BS? Software Defined Storage Reality or BS? Defining Software Defined Storage We must resist software defined washing My Criteria: Must run on x86 server hardware No custom ASICs NVRAM allowed Can be sold

More information

Feed the Future Innovation Lab for Peanut (Peanut Innovation Lab) Data Management Plan Version:

Feed the Future Innovation Lab for Peanut (Peanut Innovation Lab) Data Management Plan Version: Feed the Future Innovation Lab for Peanut (Peanut Innovation Lab) Data Management Plan Version: 20180316 Peanut Innovation Lab Management Entity The University of Georgia, Athens, Georgia Feed the Future

More information

Virtualization with VMware ESX and VirtualCenter SMB to Enterprise

Virtualization with VMware ESX and VirtualCenter SMB to Enterprise Virtualization with VMware ESX and VirtualCenter SMB to Enterprise This class is an intense, five-day introduction to virtualization using VMware s immensely popular Virtual Infrastructure suite including

More information

Collaborative data-driven science. Collaborative data-driven science. Mike Rippin

Collaborative data-driven science. Collaborative data-driven science. Mike Rippin Collaborative data-driven science Mike Rippin Background and History of SciServer Major Objectives Current System SciServer Compute Now SciServer Compute Future Q&A 2 Collaborative data-driven science

More information

REFERENCE ARCHITECTURE Quantum StorNext and Cloudian HyperStore

REFERENCE ARCHITECTURE Quantum StorNext and Cloudian HyperStore REFERENCE ARCHITECTURE Quantum StorNext and Cloudian HyperStore CLOUDIAN + QUANTUM REFERENCE ARCHITECTURE 1 Table of Contents Introduction to Quantum StorNext 3 Introduction to Cloudian HyperStore 3 Audience

More information

The Materials Data Facility

The Materials Data Facility The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials

More information

Chuck Cartledge, PhD. 19 January 2018

Chuck Cartledge, PhD. 19 January 2018 Big Data: Data Analysis Boot Camp Introduction and Overview Chuck Cartledge, PhD 19 January 2018 1/11 Table of contents (1 of 1) 1 Introduction The global view 2 Overview The world from 50,000 feet. 3

More information

Glas robotprojecten + (PicknPack, Phenobot, Trimbot & Sweeper) Erik Pekkeriet

Glas robotprojecten + (PicknPack, Phenobot, Trimbot & Sweeper) Erik Pekkeriet 1 Glas 4.0 4 robotprojecten + (PicknPack, Phenobot, Trimbot & Sweeper) Erik Pekkeriet PicknPack3 4 Globale strategie Yield & quality Stress Gezonde planten Oogst prognose Groei snelheid(homogeneity) Rijheid

More information

An introduction to checkpointing. for scientifc applications

An introduction to checkpointing. for scientifc applications damien.francois@uclouvain.be UCL/CISM An introduction to checkpointing for scientifc applications November 2016 CISM/CÉCI training session What is checkpointing? Without checkpointing: $./count 1 2 3^C

More information

The VGG Face Finder (VFF) Engine

The VGG Face Finder (VFF) Engine The VGG Face Finder (VFF) Engine Performs a visual search over a dataset of images with faces Automatically finds images matching your query within the dataset Input can be a text string or an image It

More information