GRID Stream Database Managent for Scientific Applications
|
|
- Ira Martin
- 5 years ago
- Views:
Transcription
1 GRID Stream Database Managent for Scientific Applications Milena Ivanova (Koparanova) and Tore Risch IT Department, Uppsala University, Sweden
2 Outline Motivation Stream Data Management Computational GRIDs GSDM Distributed Architecture Project status Next steps GSDM Demo
3 Motivation: LOFAR/LOIS Application Data Fabric with nodes Data Processing Plant User post processing on-line hybrid data processing cluster Control Center(s)
4 Motivation: LOFAR/LOIS Application Geographically distributed set of electromagnetic sensors and emitters Data products: Raw data streams (beams) Very high data volume and rate Complex numerical data Continuous queries: filtering, reduction, combining of data streams User-defined computational functions Problem: High performance processing of distributed continuous queries over many data streams. Utilizing GRID infrastructure for achieving highperformance
5 How stream data are different? Streams peculiarities: Infinite Representation of substreams of limited size: windows Continuous Queries (CQs) Immediate processing of elements, followed by archiving or deletion Order preserving and non blocking processing Stream specific operations, e.g., windows join, moving average DBMS techniques for streams: approximate query answering, adaptability, data reduction, multi-query optimization
6 GRID Computing Heterogeneous sets of clusters of computers For applications with much need for CPU cycles Toolkits (e.g. GLOBUS, CONDOR-G) provide transparent access to GRID clusters and parallelization of jobs Batch processing (upload compute download) High data volume bottleneck
7 GSDM as a Computational GRIDs Application High data flow rate and large data volumes require high performance Parallelism and distribution Varying computer resource needs because of varying number of queries and streams, varying stream rates Dynamic resource allocation Computational GRID infrastructure
8 GSDM: GRID Stream Database Approach Manager for Scientific Data Distributed and parallel Stream-oriented Main-memory Object-Relational DBMS Utilization of GRID infrastructure for achieving high-performance
9 GSDM Scenario Query Coordinator 1 Application CQ1 Metadata Manager Application CQ2 Working Node 5 Query Coordinator 2 Working Node 1 Working Node 3 Working Node 2 Working Node 4 GRID Data Beam1 Data Beam 2 Data Beam 3 Legend: denotes data flow
10 GSDM Distributed Architecture Four types of nodes Metadata manager Coordinator server Working node Application
11 Metadata and Coordinator Server Functions: store metadata about the system decompose, install and activate CQs assign queries to working nodes start and kill working nodes coordinate processing at working nodes
12 System Metadata Type GSDM name(gsdm) -> Charstring working_node(gsdm) -> Boolean coordinator(gsdm) -> Boolean cur_load(gsdm) -> Real Type Query qid(query)->charstring query_string(query)-> Charstring producers(query) -> Bag of GSDM consumers(query) -> Bag of GSDM installed(query,gsdm) -> Boolean active (Query,GSDM)-> Boolean
13 Working Node Function: to process continuous queries over streams List of active CQ CQ Execution loop Stream Consumer receives pushed data and incoming commands scheduler picks next CQ ready for execution executes the CQ over current stream windows Continuous Query Producer sends streamed result to the consumer (application or working node)
14 GSDM Working Node Architecture Continuous Query Manager Application Query Executor Stream Consumer Cont Q Producer Query Executor Cont Q Producer Continuous Query Manager Plug-ins GSDM Working Node Stream Consumer Stream Consumer Stream Consumer Beam 1 Beam 2 GSDM Stream Data sources
15 GSDM Command Line User Interface Commands sent from application Executed at coordinator monitor(charstring qstring) -> Charstring qid; run(charstring qid) -> Boolean; stop(charstring qid) -> Boolean; status(charstring qid) -> <Charstring qstring, Charstring prodname>;
16 GSDM Communication Primitives User Node Monitor Query Qid Run Qid Status Qid Start node Register WN Metadata and Coordinator Install query qid, qstring Installed qid Activate qid Activated qid Working Node Deactivate qid Deactivated qid Uninstall qid Uninstalled qid Kill node
17 Current status GSDM prototype architecture Initial GSDM prototype implementation Formulation of continuous queries with user-defined computational functions from space physics UDP receiver of radio on the Internet Data beam simulator
18 Related Work in Stream DBMS Relatively low stream rates, centralized architectures Relational representations and relational operations modified for stream processing: selection, join, aggregations Our system High rates of LOIS/LOFAR streams Distributed architecture User-defined functions over non-relational data representations
19 GSDM requirements for GRID Interactive job: users can install and stop CQs interactively Accessibility & security issues Data delivery directly to the working nodes, e.g. inside of clusters High-performance communication between GSDM servers running on different GRID resources
20 Next steps Evaluation of initial prototype implementation and development Optimization of distributed stream database queries Utilize GRID infrastructure functionality for resource brokering and management New GRID services for streaming data
21 GSDM Demo Scenario CQ Application (UI) Name Server Metadata and Coordinator Server CQ Results Working Node 1 Application GSDM
22 Demo scenario Database schema start_udp_consumer(char s) -> integer; stop_udp_consumer() -> integer; change_udp_scale(real i) -> integer; get_udp_packet() -> < vector of complex, vector of complex, vector of complex >; Example CQ create function visualized_fft()->boolean as select repaint(v,6.0) from vector of real v, vector of complex x, vector of complex y, vector of complex z where v= vect_log_magnitude(fft(windowed_fft(x))) and get_udp_packet()= <x,y,z>;
Massive Scale-out of Expensive Continuous Queries
Massive Scale-out of Expensive Continuous Queries Erik Zeitler and Tore Risch Department of Information Technology Uppsala University Presented by Haikal Pribadi Massive Scale-out of Expensive Continuous
More informationKenneth A. Hawick P. D. Coddington H. A. James
Student: Vidar Tulinius Email: vidarot@brandeis.edu Distributed frameworks and parallel algorithms for processing large-scale geographic data Kenneth A. Hawick P. D. Coddington H. A. James Overview Increasing
More informationArchitecture Proposal
Nordic Testbed for Wide Area Computing and Data Handling NORDUGRID-TECH-1 19/02/2002 Architecture Proposal M.Ellert, A.Konstantinov, B.Kónya, O.Smirnova, A.Wäänänen Introduction The document describes
More informationGrid Computing Systems: A Survey and Taxonomy
Grid Computing Systems: A Survey and Taxonomy Material for this lecture from: A Survey and Taxonomy of Resource Management Systems for Grid Computing Systems, K. Krauter, R. Buyya, M. Maheswaran, CS Technical
More informationARC-XWCH bridge: Running ARC jobs on the XtremWeb-CH volunteer
ARC-XWCH bridge: Running ARC jobs on the XtremWeb-CH volunteer computing platform Internal report Marko Niinimaki, Mohamed BenBelgacem, Nabil Abdennadher HEPIA, January 2010 1. Background and motivation
More informationGrid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms
Grid Computing 1 Resource sharing Elements of Grid Computing - Computers, data, storage, sensors, networks, - Sharing always conditional: issues of trust, policy, negotiation, payment, Coordinated problem
More informationAdaptive Cluster Computing using JavaSpaces
Adaptive Cluster Computing using JavaSpaces Jyoti Batheja and Manish Parashar The Applied Software Systems Lab. ECE Department, Rutgers University Outline Background Introduction Related Work Summary of
More informationStreaming Data Integration: Challenges and Opportunities. Nesime Tatbul
Streaming Data Integration: Challenges and Opportunities Nesime Tatbul Talk Outline Integrated data stream processing An example project: MaxStream Architecture Query model Conclusions ICDE NTII Workshop,
More informationIoT Mashups with the WoTKit
IoT Mashups with the WoTKit Mike Blackstock, Rodger Lea Media and Graphics Interdisciplinary Centre University of British Columbia Vancouver, Canada Motivation IoT mashups are simple, personal, situational,
More informationHigh-performance GRID Database Manager for Scientific Data
Proc. 4th Distributed Data and Structures, WDAS 03, Paris, France, pp 99-106, Carleton Scientific, ISBN 1-894145-13-5, 2002 High-performance GRID Database Manager for Scientific Data TORE RISCH Uppsala
More informationNUSGRID a computational grid at NUS
NUSGRID a computational grid at NUS Grace Foo (SVU/Academic Computing, Computer Centre) SVU is leading an initiative to set up a campus wide computational grid prototype at NUS. The initiative arose out
More informationAdvanced School in High Performance and GRID Computing November Introduction to Grid computing.
1967-14 Advanced School in High Performance and GRID Computing 3-14 November 2008 Introduction to Grid computing. TAFFONI Giuliano Osservatorio Astronomico di Trieste/INAF Via G.B. Tiepolo 11 34131 Trieste
More informationMOHA: Many-Task Computing Framework on Hadoop
Apache: Big Data North America 2017 @ Miami MOHA: Many-Task Computing Framework on Hadoop Soonwook Hwang Korea Institute of Science and Technology Information May 18, 2017 Table of Contents Introduction
More informationHigh Performance Computing Course Notes Grid Computing I
High Performance Computing Course Notes 2008-2009 2009 Grid Computing I Resource Demands Even as computer power, data storage, and communication continue to improve exponentially, resource capacities are
More informationIntroduction to Grid Computing
Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able
More informationChapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.
Chapter 4:- Introduction to Grid and its Evolution Prepared By:- Assistant Professor SVBIT. Overview Background: What is the Grid? Related technologies Grid applications Communities Grid Tools Case Studies
More informationGrid-Based Data Mining and the KNOWLEDGE GRID Framework
Grid-Based Data Mining and the KNOWLEDGE GRID Framework DOMENICO TALIA (joint work with M. Cannataro, A. Congiusta, P. Trunfio) DEIS University of Calabria ITALY talia@deis.unical.it Minneapolis, September
More informationGlobus GTK and Grid Services
Globus GTK and Grid Services Michael Rokitka SUNY@Buffalo CSE510B 9/2007 OGSA The Open Grid Services Architecture What are some key requirements of Grid computing? Interoperability: Critical due to nature
More informationWeb Services for Visualization
Web Services for Visualization Gordon Erlebacher (Florida State University) Collaborators: S. Pallickara, G. Fox (Indiana U.) Dave Yuen (U. Minnesota) State of affairs Size of datasets is growing exponentially
More informationA SEMANTIC MATCHMAKER SERVICE ON THE GRID
DERI DIGITAL ENTERPRISE RESEARCH INSTITUTE A SEMANTIC MATCHMAKER SERVICE ON THE GRID Andreas Harth Yu He Hongsuda Tangmunarunkit Stefan Decker Carl Kesselman DERI TECHNICAL REPORT 2004-05-18 MAY 2004 DERI
More informationMonitoring the Usage of the ZEUS Analysis Grid
Monitoring the Usage of the ZEUS Analysis Grid Stefanos Leontsinis September 9, 2006 Summer Student Programme 2006 DESY Hamburg Supervisor Dr. Hartmut Stadie National Technical
More informationGrid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007
Grid Programming: Concepts and Challenges Michael Rokitka SUNY@Buffalo CSE510B 10/2007 Issues Due to Heterogeneous Hardware level Environment Different architectures, chipsets, execution speeds Software
More informationglite Grid Services Overview
The EPIKH Project (Exchange Programme to advance e-infrastructure Know-How) glite Grid Services Overview Antonio Calanducci INFN Catania Joint GISELA/EPIKH School for Grid Site Administrators Valparaiso,
More informationHigh Throughput WAN Data Transfer with Hadoop-based Storage
High Throughput WAN Data Transfer with Hadoop-based Storage A Amin 2, B Bockelman 4, J Letts 1, T Levshina 3, T Martin 1, H Pi 1, I Sfiligoi 1, M Thomas 2, F Wuerthwein 1 1 University of California, San
More informationGanga The Job Submission Tool. WeiLong Ueng
Ganga The Job Submission Tool WeiLong Ueng wlueng@twgrid.org Objectives This tutorial gives users to understand Why require Ganga in Grid environment What advantages of Ganga The Architecture of Ganga
More information30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy
Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Why the Grid? Science is becoming increasingly digital and needs to deal with increasing amounts of
More informationHigh-performance GRID Database Manager for Scientific Data
Presented at Workshop on Distributed Data & Structures - WDAS-2002, University Paris 9 Dauphine, March 20-23, 2002 High-performance GRID Database Manager for Scientific Data Tore Risch, Milena Koparanova,
More informationGT-OGSA Grid Service Infrastructure
Introduction to GT3 Background The Grid Problem The Globus Approach OGSA & OGSI Globus Toolkit GT3 Architecture and Functionality: The Latest Refinement of the Globus Toolkit Core Base s User-Defined s
More informationWas ist dran an einer spezialisierten Data Warehousing platform?
Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Data warehousing, Exadata, specialized hardware proprietary hardware Introduction
More informationScalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments *
Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments * Joesph JaJa joseph@ Mike Smorul toaster@ Fritz McCall fmccall@ Yang Wang wpwy@ Institute
More informationLow Latency Data Grids in Finance
Low Latency Data Grids in Finance Jags Ramnarayan Chief Architect GemStone Systems jags.ramnarayan@gemstone.com Copyright 2006, GemStone Systems Inc. All Rights Reserved. Background on GemStone Systems
More informationQuality of Service Aspects and Metrics in Grid Computing
Quality of Service Aspects and Metrics in Grid Computing Daniel A. Menascé Dept. of Computer Science George Mason University Fairfax, VA menasce@cs.gmu.edu Emiliano Casalicchio Dipt. InformaticaSistemi
More informationGrid Computing. Lectured by: Dr. Pham Tran Vu Faculty of Computer and Engineering HCMC University of Technology
Grid Computing Lectured by: Dr. Pham Tran Vu Email: ptvu@cse.hcmut.edu.vn 1 Grid Architecture 2 Outline Layer Architecture Open Grid Service Architecture 3 Grid Characteristics Large-scale Need for dynamic
More informationLecture 8. Database Management and Queries
Lecture 8 Database Management and Queries Lecture 8: Outline I. Database Components II. Database Structures A. Conceptual, Logical, and Physical Components III. Non-Relational Databases A. Flat File B.
More informationLetsMT!: Cloud-Based Platform for Building User Tailored Machine Translation Engines
LetsMT!: Cloud-Based Platform for Building User Tailored Machine Translation Engines Andrejs Vasiļjevs Raivis Skadiņš Jörg Tiedemann Tilde Tilde Uppsala University Vienibas gatve 75a, Riga Vienibas gatve
More informationAMGA metadata catalogue system
AMGA metadata catalogue system Hurng-Chun Lee ACGrid School, Hanoi, Vietnam www.eu-egee.org EGEE and glite are registered trademarks Outline AMGA overview AMGA Background and Motivation for AMGA Interface,
More informationR & G Chapter 13. Implementation of single Relational Operations Choices depend on indexes, memory, stats, Joins Blocked nested loops:
Relational Query Optimization R & G Chapter 13 Review Implementation of single Relational Operations Choices depend on indexes, memory, stats, Joins Blocked nested loops: simple, exploits extra memory
More informationUDP Packet Monitoring with Stanford Data Stream Manager
UDP Packet Monitoring with Stanford Data Stream Manager Nadeem Akhtar #1, Faridul Haque Siddiqui #2 # Department of Computer Engineering, Aligarh Muslim University Aligarh, India 1 nadeemalakhtar@gmail.com
More informationCisco Tetration Analytics
Cisco Tetration Analytics Enhanced security and operations with real time analytics Christopher Say (CCIE RS SP) Consulting System Engineer csaychoh@cisco.com Challenges in operating a hybrid data center
More informationCustomizable Parallel Execution of Scientific Stream Queries
Customizable Parallel Execution of Scientific Stream Queries Milena Ivanova Tore Risch Department of Information Technology Uppsala University, Sweden {milena.ivanova, tore.risch}@it.uu.se Abstract Scientific
More informationCloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase. Chen Zhang Hans De Sterck University of Waterloo
CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase Chen Zhang Hans De Sterck University of Waterloo Outline Introduction Motivation Related Work System Design Future Work Introduction
More informationA Resource Discovery Algorithm in Mobile Grid Computing based on IP-paging Scheme
A Resource Discovery Algorithm in Mobile Grid Computing based on IP-paging Scheme Yue Zhang, Yunxia Pei To cite this version: Yue Zhang, Yunxia Pei. A Resource Discovery Algorithm in Mobile Grid Computing
More informationS-Store: Streaming Meets Transaction Processing
S-Store: Streaming Meets Transaction Processing H-Store is an experimental database management system (DBMS) designed for online transaction processing applications Manasa Vallamkondu Motivation Reducing
More informationICAT Job Portal. a generic job submission system built on a scientific data catalog. IWSG 2013 ETH, Zurich, Switzerland 3-5 June 2013
ICAT Job Portal a generic job submission system built on a scientific data catalog IWSG 2013 ETH, Zurich, Switzerland 3-5 June 2013 Steve Fisher, Kevin Phipps and Dan Rolfe Rutherford Appleton Laboratory
More informationDesign of Distributed Data Mining Applications on the KNOWLEDGE GRID
Design of Distributed Data Mining Applications on the KNOWLEDGE GRID Mario Cannataro ICAR-CNR cannataro@acm.org Domenico Talia DEIS University of Calabria talia@deis.unical.it Paolo Trunfio DEIS University
More informationCloudView : Describe and Maintain Resource Views in Cloud
The 2 nd IEEE International Conference on Cloud Computing Technology and Science CloudView : Describe and Maintain Resource Views in Cloud Dehui Zhou, Liang Zhong, Tianyu Wo and Junbin, Kang Beihang University
More informationA Resource Discovery Algorithm in Mobile Grid Computing Based on IP-Paging Scheme
A Resource Discovery Algorithm in Mobile Grid Computing Based on IP-Paging Scheme Yue Zhang 1 and Yunxia Pei 2 1 Department of Math and Computer Science Center of Network, Henan Police College, Zhengzhou,
More informationFunctional Requirements for Grid Oriented Optical Networks
Functional Requirements for Grid Oriented Optical s Luca Valcarenghi Internal Workshop 4 on Photonic s and Technologies Scuola Superiore Sant Anna Pisa June 3-4, 2003 1 Motivations Grid networking connection
More informationData Stream Mining. Tore Risch Dept. of information technology Uppsala University Sweden
Data Stream Mining Tore Risch Dept. of information technology Uppsala University Sweden 2016-02-25 Enormous data growth Read landmark article in Economist 2010-02-27: http://www.economist.com/node/15557443/
More informationIOTIVITY AND EMBEDDED LINUX SUPPORT. Kishen Maloor Intel Open Source Technology Center
IOTIVITY AND EMBEDDED LINUX SUPPORT Kishen Maloor Intel Open Source Technology Center Outline Open Interconnect Consortium and IoTivity Software development challenges in embedded Yocto Project and how
More informationCHAPTER 3 GRID MONITORING AND RESOURCE SELECTION
31 CHAPTER 3 GRID MONITORING AND RESOURCE SELECTION This chapter introduces the Grid monitoring with resource metrics and network metrics. This chapter also discusses various network monitoring tools and
More informationGRIDS INTRODUCTION TO GRID INFRASTRUCTURES. Fabrizio Gagliardi
GRIDS INTRODUCTION TO GRID INFRASTRUCTURES Fabrizio Gagliardi Dr. Fabrizio Gagliardi is the leader of the EU DataGrid project and designated director of the proposed EGEE (Enabling Grids for E-science
More informationBIS Database Management Systems.
BIS 512 - Database Management Systems http://www.mis.boun.edu.tr/durahim/ Ahmet Onur Durahim Learning Objectives Database systems concepts Designing and implementing a database application Life of a Query
More informationMIS Database Systems.
MIS 335 - Database Systems http://www.mis.boun.edu.tr/durahim/ Ahmet Onur Durahim Learning Objectives Database systems concepts Designing and implementing a database application Life of a Query in a Database
More informationPrinciples of Data Management. Lecture #9 (Query Processing Overview)
Principles of Data Management Lecture #9 (Query Processing Overview) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Today s Notable News v Midterm
More informationThe ATLAS EventIndex: Full chain deployment and first operation
The ATLAS EventIndex: Full chain deployment and first operation Álvaro Fernández Casaní Instituto de Física Corpuscular () Universitat de València CSIC On behalf of the ATLAS Collaboration 1 Outline ATLAS
More informationMonitor your infrastructure with the Elastic Beats. Monica Sarbu
Monitor your infrastructure with the Elastic Beats Monica Sarbu Monica Sarbu Team lead, Beats team Email: monica@elastic.co Twitter: 2 Monitor your servers Apache logs 3 Monitor your servers Apache logs
More informationDSE 203 DAY 1: REVIEW OF DBMS CONCEPTS
DSE 203 DAY 1: REVIEW OF DBMS CONCEPTS Data Models A specification that precisely defines The structure of the data The fundamental operations on the data The logical language to specify queries on the
More informationBlended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a)
Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Cloudera s Developer Training for Apache Spark and Hadoop delivers the key concepts and expertise need to develop high-performance
More informationExadata Database Machine Administration Workshop NEW
Exadata Database Machine Administration Workshop NEW What you will learn: This course introduces students to Oracle Exadata Database Machine. Students learn about the various Exadata Database Machine features
More informationIEPSAS-Kosice: experiences in running LCG site
IEPSAS-Kosice: experiences in running LCG site Marian Babik 1, Dusan Bruncko 2, Tomas Daranyi 1, Ladislav Hluchy 1 and Pavol Strizenec 2 1 Department of Parallel and Distributed Computing, Institute of
More informationThe Grid: Processing the Data from the World s Largest Scientific Machine
The Grid: Processing the Data from the World s Largest Scientific Machine 10th Topical Seminar On Innovative Particle and Radiation Detectors Siena, 1-5 October 2006 Patricia Méndez Lorenzo (IT-PSS/ED),
More informationData Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013
Data Replication: Automated move and copy of data PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013 Claudio Cacciari c.cacciari@cineca.it Outline The issue
More informationWrapping a B-Tree Storage Manager in an Object Relational Mediator System
Uppsala Master s Theses in Computing Science No. 288 Examensarbete DVP Wrapping a B-Tree Storage Manager in an Object Relational Mediator System Maryam Ladjvardi Information Technology Computing Science
More informationData Storage Infrastructure at Facebook
Data Storage Infrastructure at Facebook Spring 2018 Cleveland State University CIS 601 Presentation Yi Dong Instructor: Dr. Chung Outline Strategy of data storage, processing, and log collection Data flow
More informationThe NASA/GSFC Advanced Data Grid: A Prototype for Future Earth Science Ground System Architectures
The NASA/GSFC Advanced Data Grid: A Prototype for Future Earth Science Ground System Architectures Samuel D. Gasster, Craig A. Lee, Brooks Davis, Matt Clark, Mike AuYeung, John R. Wilson Computer Systems
More informationDISTRIBUTED DATABASES
DISTRIBUTED DATABASES INTRODUCTION: Database technology has taken us from a paradigm of data processing in which each application defined and maintained its own data, i.e. one in which data is defined
More informationGrid Architectural Models
Grid Architectural Models Computational Grids - A computational Grid aggregates the processing power from a distributed collection of systems - This type of Grid is primarily composed of low powered computers
More informationFuncX: A Function Serving Platform for HPC. Ryan Chard 28 Jan 2019
FuncX: A Function Serving Platform for HPC Ryan Chard 28 Jan 2019 Outline - Motivation FuncX: FaaS for HPC Implementation status Preliminary applications - Machine learning inference Automating analysis
More informationIQ for DNA. Interactive Query for Dynamic Network Analytics. Haoyu Song. HUAWEI TECHNOLOGIES Co., Ltd.
IQ for DNA Interactive Query for Dynamic Network Analytics Haoyu Song www.huawei.com Motivation Service Provider s pain point Lack of real-time and full visibility of networks, so the network monitoring
More informationLayered Architecture
The Globus Toolkit : Introdution Dr Simon See Sun APSTC 09 June 2003 Jie Song, Grid Computing Specialist, Sun APSTC 2 Globus Toolkit TM An open source software toolkit addressing key technical problems
More informationWP3 Final Activity Report
WP3 Final Activity Report Nicholas Loulloudes WP3 Representative On behalf of the g-eclipse Consortium Outline Work Package 3 Final Status Achievements Work Package 3 Goals and Benefits WP3.1 Grid Infrastructure
More informationg-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.
g-eclipse A Framework for Accessing Grid Infrastructures Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.cy) EGEE Training the Trainers May 6 th, 2009 Outline Grid Reality The Problem g-eclipse
More informationApache Storm. Hortonworks Inc Page 1
Apache Storm Page 1 What is Storm? Real time stream processing framework Scalable Up to 1 million tuples per second per node Fault Tolerant Tasks reassigned on failure Guaranteed Processing At least once
More informationDATABASTEKNIK - 1DL116
1 DATABASTEKNIK - 1DL116 Fall 2003 An introductury course on database systems http://user.it.uu.se/~udbl/dbt-ht2003/ Kjell Orsborn Uppsala Database Laboratory Department of Information Technology, Uppsala
More informationNew Features Guide Sybase ETL 4.9
New Features Guide Sybase ETL 4.9 Document ID: DC00787-01-0490-01 Last revised: September 2009 This guide describes the new features in Sybase ETL 4.9. Topic Page Using ETL with Sybase Replication Server
More informationA Software Developing Environment for Earth System Modeling. Depei Qian Beihang University CScADS Workshop, Snowbird, Utah June 27, 2012
A Software Developing Environment for Earth System Modeling Depei Qian Beihang University CScADS Workshop, Snowbird, Utah June 27, 2012 1 Outline Motivation Purpose and Significance Research Contents Technology
More informationSpatio-Temporal Gridded Data Processing on the Semantic Web
Spatio-Temporal Gridded Data Processing on the Semantic Web Andrej Andrejev, Dimitar Misev*, Peter Baumann*, Tore Risch Department of Information Technology, Uppsala University * Computer Science & Electrical
More informationParallel Patterns for Window-based Stateful Operators on Data Streams: an Algorithmic Skeleton Approach
Parallel Patterns for Window-based Stateful Operators on Data Streams: an Algorithmic Skeleton Approach Tiziano De Matteis, Gabriele Mencagli University of Pisa Italy INTRODUCTION The recent years have
More informationGrid Compute Resources and Job Management
Grid Compute Resources and Job Management How do we access the grid? Command line with tools that you'll use Specialised applications Ex: Write a program to process images that sends data to run on the
More informationIntegration of Cloud and Grid Middleware at DGRZR
D- of International Symposium on Computing 2010 Stefan Freitag Robotics Research Institute Dortmund University of Technology March 12, 2010 Overview D- 1 D- Resource Center Ruhr 2 Clouds in the German
More informationSystems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2012/13
Systems Infrastructure for Data Science Web Science Group Uni Freiburg WS 2012/13 Data Stream Processing Topics Model Issues System Issues Distributed Processing Web-Scale Streaming 3 System Issues Architecture
More informationCopyright 2012, Oracle and/or its affiliates. All rights reserved.
1 Big Data Connectors: High Performance Integration for Hadoop and Oracle Database Melli Annamalai Sue Mavris Rob Abbott 2 Program Agenda Big Data Connectors: Brief Overview Connecting Hadoop with Oracle
More informationN. Marusov, I. Semenov
GRID TECHNOLOGY FOR CONTROLLED FUSION: CONCEPTION OF THE UNIFIED CYBERSPACE AND ITER DATA MANAGEMENT N. Marusov, I. Semenov Project Center ITER (ITER Russian Domestic Agency N.Marusov@ITERRF.RU) Challenges
More informationCubeStore. On the Design and Implementation of the Multidimensional CubeStore Storage Manager. Wolfgang Sporer Wolfgang Lehner
1 CubeStore On the Design and Implementation of the Multidimensional CubeStore Storage Manager Wolfgang Sporer Wolfgang Lehner wgsporer@cip.informatik.uni-erlangen.de Wolfgang.Lehner@informatik.uni-erlangen.de
More informationManaging Scientific Computations in Grid Systems
Managing Scientific Computations in Grid Systems Salman Toor Division of Scientific Computing Department of Information Technology Uppsala University November 11, 2008 Motivation Integration architecture
More informationInPress Systems. Tools for WebNative Suite. InPress Systems AB Jörgen Fransson
InPress Systems Tools for WebNative Suite InPress Systems AB Jörgen Fransson InPress Systems Company Info 1996 Xinet Authorized Reseller for Sweden Sweden early adopter of webbased solutions, first WebNative
More informationOutline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems
Distributed Systems Outline Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems What Is A Distributed System? A collection of independent computers that appears
More informationMonitor your containers with the Elastic Stack. Monica Sarbu
Monitor your containers with the Elastic Stack Monica Sarbu Monica Sarbu Team lead, Beats team monica@elastic.co 3 Monitor your containers with the Elastic Stack Elastic Stack 5 Beats are lightweight shippers
More informationMinimizing Server Throughput for Low-Delay Live Streaming in Content Delivery Networks. F. Zhou, S. Ahmad, E. Buyukkaya, R. Hamzaoui and G.
Minimizing Server Throughput for Low-Delay Live Streaming in Content Delivery Networks F. Zhou, S. Ahmad, E. Buyukkaya, R. Hamzaoui and G. Simon Live Stream Delivery Content Provider CDN encoders ingest
More informationThe Opal Toolkit. Wrapping Scientific Applications as Web Services
The Opal Toolkit Wrapping Scientific Applications as Web Services Outline!Introduction!Motivation!Opal!Summary 1 What is Opal?! Opal is a toolkit for wrapping scientific applications as Web services on
More informationGEOSPATIAL ENGINEERING COMPETENCIES. Geographic Information Science
GEOSPATIAL ENGINEERING COMPETENCIES Geographic Information Science The character and structure of spatial information, its methods of capture, organisation, classification, qualification, analysis, management,
More informationVisual Component Composition Environments
Visual Component Composition Environments Based on Comparing Visual Component Composition Environments written by Alejandra Cechich and Manuel Prieto in 00. Presented by Yuxi Bai on March 8, 006 Agenda
More informationMSG: An Overview of a Messaging System for the Grid
MSG: An Overview of a Messaging System for the Grid Daniel Rodrigues Presentation Summary Current Issues Messaging System Testing Test Summary Throughput Message Lag Flow Control Next Steps Current Issues
More informationThe Integration of Grid Technology with OGC Web Services (OWS) in NWGISS for NASA EOS Data
The Integration of Grid Technology with OGC Web Services (OWS) in NWGISS for NASA EOS Data Liping Di, Aijun Chen, Wenli Yang and Peisheng Zhao achen6@gmu.edu; achen@laits.gmu.edu Lab for Advanced Information
More informationTowards a Unified Monitoring and Performance Analysis System for the Grid
Towards a Unified Monitoring and Performance Analysis System for the Grid Hong-Linh Truong, Thomas Fahringer Institute for Software Science, University of Vienna, Austria {truong,tf}@par.univie.ac.at http://www.par.univie.ac.at/project/scalea
More informationEnabling the Autonomic Data Center with a Smart Bare-Metal Server Platform
Enabling the Autonomic Data Center with a Smart Bare-Metal Server Platform Arzhan Kinzhalin, Rodolfo Kohn, Ricardo Morin, David Lombard 6 th International Conference on Autonomic Computing Barcelona, Spain
More informationData Stream Management and Complex Event Processing in Esper. INF5100, Autumn 2010 Jarle Søberg
Data Stream Management and Complex Event Processing in Esper INF5100, Autumn 2010 Jarle Søberg Outline Overview of Esper DSMS and CEP concepts in Esper Examples taken from the documentation A lot of possibilities
More informationThe Problem of Grid Scheduling
Grid Scheduling The Problem of Grid Scheduling Decentralised ownership No one controls the grid Heterogeneous composition Difficult to guarantee execution environments Dynamic availability of resources
More informationDistributed Information Processing
Distributed Information Processing 1 st Lecture Eom, Hyeonsang ( 엄현상 ) Department of Computer Science & Engineering Seoul National University Copyrights 2017 Eom, Hyeonsang All Rights Reserved Outline
More information