HMS Image Management Core

1 HMS A NEW SERVICE FOR HMS RESEARCHERS Jay Copeland HMS Harvard Medical School

2 The Promise of Data Reuse and Sharing Genomics

3 The Current Norm With Image Sharing

4 Challenges: Many File and Metadata Formats

5 Challenges: Large Numbers of Files

6 Challenges: Moving Image Datasets. IBM 305 RAMAC, the first production hard disk drive. Weighing over one ton, it could store 5 MB of data. Introduced in

7 A New Service: HMS. A new service offered by HMS IT. 2-year pilot project. Funded by the TnT (Tools and Technology) Committee and HMS IT, building on prior support from HiTS/LSP. Part of RITS (Research Services). First offering:

8 Import; Store and Protect; Organize; View; Analyze; Publish; Export; Collaborate (source: openmicroscopy.org)

9 Open Science Built On Open Source: Secure; APIs for Java, C++, Python, MATLAB, Web; Data Integrity; Extensible; Efficiency; Convenience; Enables Collaboration and Sharing; Effectively Leverages Existing HMS Resources (source: openmicroscopy.org)

10 Web Application Example: HMS LINCS DB. One dataset: 25,920 images (720 conditions, 4 channels, 9 replicates). All images in the dataset are accessible.

11 Web Application Example: HMS LINCS DB. One dataset: 25,920 images (720 conditions, 4 channels, 9 replicates). All images in the dataset are accessible. Experimental conditions are stored with each image using a powerful Bulk Annotations utility.
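
The dataset size quoted on the two slides above follows directly from the experimental design: 720 conditions x 4 channels x 9 replicates. A minimal Python sketch, assuming hypothetical integer indices for conditions, channels, and replicates (the slides do not show the actual LINCS naming scheme or the OMERO Bulk Annotations format), enumerates one metadata record per image and confirms the total:

```python
from itertools import product

# Counts taken from the slide; the record layout below is hypothetical.
N_CONDITIONS = 720
N_CHANNELS = 4
N_REPLICATES = 9


def enumerate_images(n_conditions=N_CONDITIONS,
                     n_channels=N_CHANNELS,
                     n_replicates=N_REPLICATES):
    """Return one metadata record (a dict) per image in the design."""
    return [
        {"condition": cond, "channel": chan, "replicate": rep}
        for cond, chan, rep in product(
            range(1, n_conditions + 1),
            range(1, n_channels + 1),
            range(1, n_replicates + 1),
        )
    ]


images = enumerate_images()
print(len(images))  # 25920
```

Keeping one record per image mirrors the idea on the slide that experimental conditions travel with each image rather than living in a separate spreadsheet.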

12 Using OMERO to Improve Workflows. Typical pattern: multiplicity of files, often duplicates; analysis creates additional files separate from the original data; replicating and documenting the process is difficult; analysis code is not shared. Ad hoc storage:

13 High Performance Workflow Architecture Using OMERO. [Diagram labels: Acquire Data; Image & Data Analysis Core; Design Analysis; Run Analysis; Users; Import Data To OMERO (In-Place); OMERO; Distributed Parallel Filesystem; Orchestra Cluster; Read Image Data; OMERO Database; Read Metadata; Attach Results] (source: Douglas Russell)

14 Proof of Concept: Image Analysis by IDAC of 23K Image Dataset Produced by LSP Using Orchestra Cluster and OMERO. Plate-level analysis results attached as files. 20 x 384 well plates x 3 replicate fields = 23,040 images.

15 Proof of Concept: Image Analysis by IDAC of 23K Image Dataset Produced by LSP Using Orchestra Cluster and OMERO. Original image showing nuclear stain. Original images and analysis results stored together for quality control, review, sharing, collaboration, and publishing.

16 Proof of Concept: Image Analysis by IDAC of 23K Image Dataset Produced by LSP Using Orchestra Cluster and OMERO. Original image showing nuclear stain, with binary mask overlay showing segmentation. Original images and analysis results stored together for quality control, review, sharing, collaboration, and publishing.
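
The image count for the proof-of-concept dataset can be reproduced from the plate layout. The sketch below is illustrative only: it assumes the standard 384-well geometry (16 rows A to P by 24 columns) and reads "3 replicate fields" as three imaged fields per well; the well-naming format and function names are hypothetical and not taken from the slides.

```python
import string

# Standard 384-well plate geometry: 16 rows (A-P) x 24 columns (1-24).
ROWS = string.ascii_uppercase[:16]
COLUMNS = range(1, 25)


def well_names():
    """Return well labels such as 'A01' ... 'P24' (label format is hypothetical)."""
    return [f"{row}{col:02d}" for row in ROWS for col in COLUMNS]


def count_images(n_plates=20, n_fields=3):
    """Total images = plates x wells per plate x fields per well."""
    return n_plates * len(well_names()) * n_fields


print(len(well_names()))  # 384
print(count_images())     # 23040
```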

17 Acknowledgements: Chris Botka, HMS IT; Caroline Shamu, ICCB-L; Peter Sorger, HiTS/LSP; Mason Miranda, RITS; Douglas Russell, HMS IT/LSP; Glencoe Software; Tiao Xie, IDAC; Mario Niepel, LSP. Funding: HMS TnT Committee; NIH Grant U54 HL for initial development of LINCS-OMERO; HMS IT; HiTS/LSP.

18 For additional information: HMS : (Available December 2015) Open Microscopy Environment: Questions and inquiries: jay_copeland@hms.harvard.edu


BUILDING A NEW DIGITAL LIBRARY FOR THE NATIONAL LIBRARY OF AUSTRALIA BUILDING A NEW DIGITAL LIBRARY FOR THE NATIONAL LIBRARY OF AUSTRALIA Strategic Directions 2012-2014 However, the growth of our digital collections is outpacing our capacity to manage, preserve and deliver

More information

Creating a Corporate Taxonomy. Internet Librarian November 2001 Betsy Farr Cogliano

Creating a Corporate Taxonomy. Internet Librarian November 2001 Betsy Farr Cogliano Creating a Corporate Taxonomy Internet Librarian 2001 7 November 2001 Betsy Farr Cogliano 2001 The MITRE Corporation Revised October 2001 2 Background MITRE is a not-for-profit corporation operating three

More information

Striped Data Server for Scalable Parallel Data Analysis

Striped Data Server for Scalable Parallel Data Analysis Journal of Physics: Conference Series PAPER OPEN ACCESS Striped Data Server for Scalable Parallel Data Analysis To cite this article: Jin Chang et al 2018 J. Phys.: Conf. Ser. 1085 042035 View the article

More information

irods at TACC: Secure Infrastructure for Open Science Chris Jordan

irods at TACC: Secure Infrastructure for Open Science Chris Jordan irods at TACC: Secure Infrastructure for Open Science Chris Jordan What is TACC? Texas Advanced Computing Center Cyberinfrastructure Resources for Open Science University of Texas System 9 Academic, 6

More information

DICOM Research Applications - life at the fringe of reality

DICOM Research Applications - life at the fringe of reality SPIE Medical Imaging 2009 DICOM Research Applications - life at the fringe of reality David Clunie RadPharm, Inc. Overview Range of research applications Clinical versus research context Commonalities

More information

Icahn School of Medicine at Mount Sinai LINCS Center for Drug Toxicity Signatures

Icahn School of Medicine at Mount Sinai LINCS Center for Drug Toxicity Signatures Icahn School of Medicine at Mount Sinai LINCS Center for Drug Toxicity Signatures Standard Operating Procedure: Identification of Differentially Expressed Genes DToxS SOP Index: CO-4.1 Last Revision: March

More information

Galaxy. Daniel Blankenberg The Galaxy Team

Galaxy. Daniel Blankenberg The Galaxy Team Galaxy Daniel Blankenberg The Galaxy Team http://galaxyproject.org Overview What is Galaxy? What you can do in Galaxy analysis interface, tools and datasources data libraries workflows visualization sharing

More information

Rich Web Application Development Solution. Simplifying & Accelerating WebSphere Portal Development & Deployment

Rich Web Application Development Solution. Simplifying & Accelerating WebSphere Portal Development & Deployment Rich Web Application Development Solution Simplifying & Accelerating WebSphere Portal Development & Deployment Rich Web Application Development 2 Richer= Application aspect is more application features

More information

Informatica 9.0 PowerCenter Installation Quick Start Guide

Informatica 9.0 PowerCenter Installation Quick Start Guide Informatica 9.0 PowerCenter Installation Quick Start Guide This quick start includes the following topics: Step 1. Complete the Pre-Installation Tasks, 1 Step 2. Install Informatica Services, 3 Step 3.

More information

Technology Development Studio (TDS) MPI-CBG, Dresden, Germany. Marc Bickle HT-TDS, MPI-CBG

Technology Development Studio (TDS) MPI-CBG, Dresden, Germany. Marc Bickle HT-TDS, MPI-CBG Technology Development Studio (TDS) MPI-CBG, Dresden, Germany Marc Bickle HT-TDS, MPI-CBG TDS: Core Screening Facility Core Screening facility of the MPI-CBG specialized in high content imaging (es 2003)

More information

Information technology Multimedia service platform technologies. Part 3: Conformance and reference software

Information technology Multimedia service platform technologies. Part 3: Conformance and reference software INTERNATIONAL STANDARD ISO/IEC 23006-3 Third edition 2016-12-01 Information technology Multimedia service platform technologies Part 3: Conformance and reference software Technologies de l information

More information

REDCap Importing and Exporting (302)

REDCap Importing and Exporting (302) REDCap Importing and Exporting (302) Learning objectives Report building Exporting data from REDCap Importing data into REDCap Backup options API Basics ITHS Focus Speeding science to clinical practice

More information

Pathology Image Informatics Platform (PathIIP) Year 1 Update

Pathology Image Informatics Platform (PathIIP) Year 1 Update Pathology Image Informatics Platform (PathIIP) Year 1 Update PIIP Project sites Specific Aims Aim 1: Development of an improved plugin framework for the existing Sedeen viewer; Aim 2: Incorporate and evaluate

More information

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license Inge Van Nieuwerburgh OpenAIRE NOAD Belgium Tools&Services OpenAIRE EUDAT can be reused under the CC BY license Open Access Infrastructure for Research in Europe www.openaire.eu Research Data Services,

More information