The Virtual Observatory

Size: px
Start display at page:

Download "The Virtual Observatory"

Transcription

1 The Virtual Observatory Peter Fox HAO/ESSL/NCAR November 28, 2005

2 Outline Virtual Observatories - history and definition(s), I*Ys Some examples - within disciplines When is a VO, not a VO? Beyond disciplines, the emerging need What is missing, i.e. the enabling technology Challenges and interoperability Examples: VSTO and SESDI What s ahead

3 Encyclopedia - we ve made it! Virtual observatory is a collection of integrated astronomical data archives and software tools that utilize computer networks to create an environment in which research can be conducted. Several countries have initiated national virtual observatory programs that will combine existing databases from ground-based and orbiting observatories and make them easily accessible to researchers. As a result, data from all the world's major observatories will be available to all users and to the public. This is significant not only because of the immense volume of astronomical data but also because the data on stars and galaxies has been compiled from observations in a variety of wavelengths: optical, radio, infrared, gamma ray, X-ray and more. Each wavelength can provide different information about a celestial event or object, but also requires a special expertise to interpret. In a virtual observatory environment, all of this data is integrated so that it can be synthesized and used in a given study.

4 Yet more definitions AVO: A virtual observatory (VO) is a collection of interoperating data archives and software tools which utilize the internet to form a scientific research environment in which astronomical research programs can be conducted. In much the same way as a real observatory consists of telescopes, each with a collection of unique astronomical instruments, the VO consists of a collection of data centres each with unique collections of astronomical data, software systems and processing capabilities. From the Grid: virtual observatory - astronomical / solar / solar terrestrial data repositories made accessible through grid and web services. Workshop: A Virtual Observatory (VO) is a suite of software applications on a set of computers that allows users to uniformly find, access, and use resources (data, software, document, and image products and services using these) from a collection of distributed product repositories and service providers. A VO is a service that unites services and/or multiple repositories. VxOs

5 Virtual Observatories Conceptual examples: In-situ: Virtual measurements Related measurements Remote sensing: Virtual, integrative measurements Data integration Systems or frameworks? Brokers, or data providers, or service providers?» VOTables, VOQueries, etc. a sytnax for exchange Holding metadata? Who imposes the catalog, or vocabulary?

6 Virtual Observatory Concepts Take the user there and make it possible for data providers to get their data to a user in need May hold data, but mostly not Once there, enable them to browse, sample, collect, produce, partly analyze, etc. a variety of resources: data, ancillary data, documentation, plots, etc. Enable them to come away with something, a) useful, b) they could not, may not have been able to, find by other, i.e. direct, means If you are going to be in the middle then Need to encapsulate things you normally may not, e.g. directory organization of data files in a selected dataset, formats, names, Provide some acceptable, understandable levels of service(s) Build in a set of supporting services that the user and provider are not exposed to Be prepared to learn from what they do while using the VO and adapt to new and changing needs There s more, of course.

7 VOs and data providers Not a VO: When you hand off a user to another site Only one dataset When you do not deliver, or do not arrange for delivery of the data When your curation role is not evident DP: Acquire data and produce data products (static or dynamic). Preserve data in useable forms. Distribute data, and provide easy machine (API) and Internet browser access. Support a communication mechanism should support a standards-based messaging system (e.g., ftp, http, SOAP, XML) Produce, document, and make easily available metadata for product finding and detailed data granule content description. Ideally, maintain a catalogue of detailed data availability information. Assure the validity and quality of the data. Document the validation process. Provide quality information (flags). Maintain careful versioning including the processing history of a product. Maintain an awareness of standards (such as community accepted data models), and adhere to them as needed. Provide software required to read and interpret the data; ideally the routines used by the PI science team should be available to all.

8 What should a VO do? Make standard scientific research much more efficient. Even the PI teams should want to use them. Must improve on existing services (Mission and PI sites, etc.). VOs will not replace these, but will use them in new ways. Enable new, global problems to be solved. Rapidly gain integrated views from the solar origin to the terrestrial effects of an event. Find data related to any particular observation. (Ultimately) answer higher-order queries such as Show me the data from cases where a large CME observed by SOHO was also observed in situ.

9 What the NASA community wants Provide coordinated discovery and access to data and service resources for a specific scientific discipline Identify relevant data sources and appropriate repositories. Allow queries that yield data granules or pointers to them. Provide a user interface to access resources both through an API (or equivalent machine access) and a web browser application. Handle a wide range of provider types, as needed. Understand the data needs of its focus area: Recruit potential new providers. Provide support and "cookbooks" for easy incorporation of providers. Help to assure high data quality and completeness of the product set. Resolve issues of multiple versions of datasets.

10 More from NASA Provide documentation for metadata: Set standards for metadata and query items Assist providers, and review metadata. Maintain a global knowledge of data availability. Possibly maintain collection catalog metadata. Provide an API or other means for the VxO to appear to others as a single provider. Potentially provide value-added services (can be done by providers or elsewhere): Data Subsetting: Averaging of data Filtering Data Merging Format Conversion Provide access to event lists and ancillary data. Collect statistical information and community comments to assess success.

11 VSO and the small box The Virtual Solar-Terrestrial Observatory

12

13

14

15

16

17 CEDAR The Virtual Solar-Terrestrial Observatory

18

19

20

21

22 The Earth System Grid DATA storage SECURITY services LBNL METADATA services ANALYSIS & VIZ services TRANSPORT services MONITORING services gridftp server/client HRM FRAMEWORK services MySQL DISK RLS SLAMON daemon NCAR TOMCAT AXIS GRAM ANL Auth metadata GSI CAS server GSI CAS client NCL opendapg client LAS server NERSC HPSS gridftp server/client HRM opendapg server ORNL LLNL NCAR MSS DISK TOMCAT CDAT opendapg client gridftp server/client HRM MySQL RLS THREDDS catalogs Xindice SLAMON daemon gridftp server/client HRM DISK opendapg server GSI CAS client ISI MyProxy client MyProxy server ORNL HPSS DISK MySQL RLS MCS MySQL Xindice OGSA-DAIS MySQL RLS GSI CAS client GSI GSI The Virtual Solar-Terrestrial Observatory

23

24

25

26 Emerging needs Interdisciplinary science and engineering (not just between adjacent fields) Interdisciplinary data assimilation, integration Web service workflow orchestration (beyond syntax) Vortals as well as portals (specific to general) Agency (NASA) and community efforts (egy, IHY, IPY, IYPE)

27 ACOS at the MLSO Near real-time data from Hawaii from a variety of solar instruments, as a valuable source for space weather, solar variability and basic solar physics

28 CISM Goal: To create a physics-based numerical simulation model that describes the space environment from the Sun to the Earth. THE USES OF SPACE WEATHER MODELING A scientific tool for increased understanding of the complex space environment. A specification and forecast tool for space weather prediction. An educational tool for teaching about the space environment.

29 CEDARWEB Community data archive, documents, and support.

30 User requirements CEDAR Search must return data (i.e. no null searches) Search across instruments, models Know about special time periods, campaigns, etc. Allow selections based on (appropriate) geophysical conditions, e.g. Kp index Usual format returned and in correct units Must be able to easily re-create the search, access Visual browsing MLSO Same as CEDAR!! + sampling interval choice, e.g. minutely, daily, average, best of the day, synoptic

31 Challenges and interoperability Semantic misunderstanding E.g. sunspot number and variations in solar radiation: over 90% of researchers outside the sub-field of solar radiation think: sunspot number is a measure of solar radiation In reality: a sunspot number is a measure of the number of sunspots appearing on the visible solar surface, a sunspot is an indicator of the location of strong solar magnetic fields, strong magnetic fields are collectively known as solar activity, sunspots are observed to produce a localized decrease in the solar radiation output, etc. How to explain this to a computer? Interfaces are built by computer scientists with syntax that often works within a discipline but rarely across them

32 Concept and user needs Goal - find the right balance of data/model holdings, portals and client software that a researchers can use without effort or interference as if all the materials were available on his/her local computer. The Virtual Solar-Terrestrial Observatory (VSTO) is a: distributed, scalable education and research environment for searching, integrating, and analyzing observational, experimental and model databases in the fields of solar, solar-terrestrial and space physics VSTO comprises a: System-like framework which provides virtual access to specific data, model, tool and material archives containing items from a variety of space- and ground-based instruments and experiments, as well as individual and community modeling and software efforts bridging research and educational use

33 User needs In discussions with data providers and users, the needs are clear: ``Fast access to `portable' data, in a way that works with the tools we have; information must be easy to access, retrieve and work with.' Few clicks, get what I want, whose tools? MY tools Too often users (and data providers) have to deal with the organizational structure of the data sets which varies significantly --- data may be stored at one site in a small number of large files while similar data may be stored at another site in a large number of relatively smaller files. There is an equally large problem with the range of metadata descriptions for the data. Users often only want subsets of the data and struggle with getting it efficiently. One user expresses it as: ``(Please) solve the interface problem.' Encapsulate more

34 What s new in the VSTO? Datasets alone are not sufficient to build a virtual observatory: VSTO integrates tools, models, and data VSTO addresses the interface problem, effectively and scalably VSTO addresses the interdisciplinary metadata and ontology problem - bridging terminology and use of data across disciplines VSTO leverages the development of schema that adequately describe the syntax (name of a variable, its type, dimensions, etc. or the procedure name and argument list, etc.), semantics (what the variable physically is, its units, etc.) and pragmatics (or what the procedure does and returns, etc.) of the datasets and tools. VSTO provides a basis for a framework for building and distributing advanced data assimilation tools

35 Virtual Observatory: Need better glue Basic problem: schema are categorized rather than developed from an object model/class hierarchy -> significantly limits non-human use. However, they all form the basis to organize catalog interfaces for all types of data, images, etc. This limits data systems utilizing frameworks and prevents frameworks from truly interoperating (SOAP, WSDL only a start) Directories, e.g. NASA GCMD, CEDAR catalog, FITS (flat) keyword/ value pairs, are being turned into ontologies (SWEET, VSTO) Markup languages, e.g. ESML, SPDML, ESG/ncML are excellent bases

36 Methodologies Use-cases User requirements Semantics - what does this mean Data integration Ontologies Rapid prototyping

37 HAO and SCD from NCAR, McGuinness Assoicates: Peter Fox, Don Middleton, Stan Solomon, Deborah McGuinness, Jose Garcia, Patrick West, Luca Cinquini, James Benedict, Tony Darnell and soon Application domains - CEDAR, CISM, ACOS Realms (ontologies): Covers middle atmosphere to the Sun + SPDML Mesh with Earth Realm (SWEET) Mesh with GEON VSTO SWEET +SPDML ACOS CISM Use-cases and user requirements CEDAR

38 VSTO Use-case 1 UC1: Plot the observed/measured Neutral Temperature (Parameter) looking in the vertical direction for Millstone Hill Fabry-Perot interferometer (Instrument) from January 2000 to August 2000 (Temporal Domain) as a time series. Precondition: portal application is authorized to access the backend data extraction and plotting service 1. User accesses the portal application 2. User goes through a series of views to select (in order) the desired observatory, instrument, record-type (kind of data), parameter, start and stop dates, and the plot type is inferred. At each step, the user selection determines the range of available options in the subsequent steps. NB, an alternate path is selection of start and stop dates, then instrument, etc. 3. The application validates the user request: verifying the logical correctness of the request, i.e. that Millstone Hill is an observatory that operates a type of instrument that measures neutral temperature (i.e. check that Millstone Hill <isa> observatory and check that the range of the measures property on the Millstone Hill Fabry Perot Interferometer subsumes neutral temperature). Also, the application must verify that no necessary information is missing from the request. 4. The application processes the user request to locate the physical storage of the data, returning for example a URL-like expression: find Millstone Hill FPI data of the correct type (operating mode; defined by CEDAR KINDAT since the instrument has two operating modes) in the given time range (Millstone Hill FPI <haskindofdata> 1701 <intersects> TemporalDomain [January 2000, August 2000] ) 5. The application plots the data in the specified plot type (a time series). This step involves extracting the data from records of one or more files, creating an aggregate array of data with independent variable time (of day or day+time depending on time range selected) and passing this to a procedure to create the resulting image.

39

40

41

42

43 Demo The Virtual Solar-Terrestrial Observatory

44 Have you heard these questions? What do you mean by that? What did you mean by that? What does this mean? How did you get this, please explain? Does this also mean? Doesn t this contradict? <Insert your own here> Leads to: Inference Reasoning Explanation

45 Paradigm shift for NASA? From: Instrument based To: Measurement based Requires: bridging the discipline data divide Overall vision for SESDI: To integrate information technology in support of advancing measurement-based processing systems for NASA by integrating existing diverse science discipline and mission-specific data sources. Volcano SWEET Climate SESDI

46 Semantic connectors SWEET Process-oriented semantic content represented in SWSL Articulation axioms The SESDI re-useable component interfaces. The stub on each end of the connector is based on the GEON Ontology-Data registration technology and contains articulated axioms derived from the knowledge gained in the unit-level data registration. Includes integrity checks, domain and range, etc.

47 What s ahead? Virtual Observatories provide both framework and data system elements, users are already confusing VOs and data providers Many VO s are noting the need for better glue, scalability, expandability, etc. Success (to date) in utilizing formal methods for interface specification and development using ontologies Success in breaking all of the free tools! Commercial tools are under consideration Challenges exist for reasoning and interface with scientific datatypes, e.g. complex spatial and temporal concepts For VSTO (and SESDI) - more use-cases, populate the interfaces and test for scalability and interoperation in production settings

The Virtual Solar-Terrestrial Observatory ++

The Virtual Solar-Terrestrial Observatory ++ The Virtual Solar-Terrestrial Observatory ++ Peter Fox* Deborah McGuinness$#, Luca Cinquini%, Patrick West*, Jose Garcia*, Tony Darnell*, James Benedict$, Don Middleton%, Stan Solomon* *HAO/ESSL/NCAR $

More information

The Earth System Grid: A Visualisation Solution. Gary Strand

The Earth System Grid: A Visualisation Solution. Gary Strand The Earth System Grid: A Visualisation Solution Gary Strand Introduction Acknowledgments PI s Ian Foster (ANL) Don Middleton (NCAR) Dean Williams (LLNL) ESG Development Team Veronika Nefedova (ANL) Ann

More information

Semantically-Enabled Large-Scale Science Data Repositories

Semantically-Enabled Large-Scale Science Data Repositories Semantically-Enabled Large-Scale Science Data Repositories Peter Fox 1, Deborah McGuinness 2,3, Don Middleton 4, Luca Cinquini 4, J. Anthony Darnell 1, Jose Garcia 1,PatrickWest 1, James Benedict 3, and

More information

Semantically-Enabled Virtual Observatories

Semantically-Enabled Virtual Observatories Semantically-Enabled Virtual Observatories Deborah McGuinness 12, Peter Fox 3, Luca Cinquini 4, Patrick West 3, James Benedict 2, J. Anthony Darnell 3, Jose Garcia 3, and Don Middleton 4 1 McGuinness Associates,

More information

The NCAR Community Data Portal

The NCAR Community Data Portal The NCAR Community Data Portal http://cdp.ucar.edu/ QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this

More information

Ontologies in Practice - a Solar Terrestrial Example

Ontologies in Practice - a Solar Terrestrial Example Ontologies in Practice - a Solar Terrestrial Example Peter Fox1 (pfox@ucar.edu) and Deborah McGuinness23 (dlm@cs.stanford.edu) 1 HAO/ESSL/NCAR 2 Stanford Knowledge Systems, AI Lab 3 McGuinness Associates

More information

Interoperability ~ An Introduction

Interoperability ~ An Introduction Interoperability ~ An Introduction Cyndy Chandler Biological and Chemical Oceanography Data Management Office (BCO-DMO) Woods Hole Oceanographic Institution 26 July 2008 MMI OOS Interoperability Planning

More information

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography Christopher Crosby, San Diego Supercomputer Center J Ramon Arrowsmith, Arizona State University Chaitan

More information

The Virtual Observatory and the IVOA

The Virtual Observatory and the IVOA The Virtual Observatory and the IVOA The Virtual Observatory Emergence of the Virtual Observatory concept by 2000 Concerns about the data avalanche, with in mind in particular very large surveys such as

More information

Data Management Components for a Research Data Archive

Data Management Components for a Research Data Archive Data Management Components for a Research Data Archive Steven Worley and Bob Dattore Scientific Computing Division Computational and Information Systems Laboratory National Center for Atmospheric Research

More information

Next Generation Semantic Data Environments (or Linked Data, Semantics, and Standards in Scientific Applications)

Next Generation Semantic Data Environments (or Linked Data, Semantics, and Standards in Scientific Applications) Next Generation Semantic Data Environments (or Linked Data, Semantics, and Standards in Scientific Applications) Deborah L. McGuinness Tetherless World Senior Constellation Chair Professor of Computer

More information

Using VIVO at the Laboratory for Atmospheric and Space Physics (LASP)

Using VIVO at the Laboratory for Atmospheric and Space Physics (LASP) Using VIVO at the Laboratory for Atmospheric and Space Physics (LASP) Anne Wilson and the LASP Web Development Team VIVO 2016 Panel on Using VIVO for Scientific Applications Friday, 08.19.16 Laboratory

More information

NASA and International Open Standards and the Future of Space Weather Studies

NASA and International Open Standards and the Future of Space Weather Studies NASA and International Open Standards and the Future of Space Weather Studies D. Aaron Roberts NASA Goddard Space Flight Center ISWI2017, 4 August 2107 Virtual Observatory A virtual observatory is

More information

Index Introduction Setting up an account Searching and accessing Download Advanced features

Index Introduction Setting up an account Searching and accessing Download Advanced features ESGF Earth System Grid Federation Tutorial Index Introduction Setting up an account Searching and accessing Download Advanced features Index Introduction IT Challenges of Climate Change Research ESGF Introduction

More information

Introduction to Grid Computing

Introduction to Grid Computing Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able

More information

METADATA INTERCHANGE IN SERVICE BASED ARCHITECTURE

METADATA INTERCHANGE IN SERVICE BASED ARCHITECTURE UDC:681.324 Review paper METADATA INTERCHANGE IN SERVICE BASED ARCHITECTURE Alma Butkovi Tomac Nagravision Kudelski group, Cheseaux / Lausanne alma.butkovictomac@nagra.com Dražen Tomac Cambridge Technology

More information

Linking datasets with user commentary, annotations and publications: the CHARMe project

Linking datasets with user commentary, annotations and publications: the CHARMe project Linking datasets with user commentary, annotations and publications: the CHARMe project Jon Blower j.d.blower@reading.ac.uk University of Reading On behalf of all CHARMe partners! http://www.charme.org.uk

More information

The Virtual Solar-Terrestrial Observatory: A Deployed Semantic Web Application Case Study for Scientific Research

The Virtual Solar-Terrestrial Observatory: A Deployed Semantic Web Application Case Study for Scientific Research The Virtual Solar-Terrestrial Observatory: A Deployed Semantic Web Application Case Study for Scientific Research Deborah McGuinness 1,2, Peter Fox 3, Luca Cinquini 4, Patrick West 3, Jose Garcia 3, James

More information

The IPAC Research Archives. Steve Groom IPAC / Caltech

The IPAC Research Archives. Steve Groom IPAC / Caltech The IPAC Research Archives Steve Groom IPAC / Caltech IPAC overview The Infrared Processing and Analysis Center (IPAC) at Caltech is dedicated to science operations, data archives, and community support

More information

The Community Data Portal and the WMO WIS

The Community Data Portal and the WMO WIS he Community Data Portal and the WIS Presented by: Michael Burek Progress, metadata architecture, and collaboration in the future Acknowledgments NCAR - Luca Cinquini, Rob Markel, Nathan Willhelmi, Don

More information

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:

More information

Ontologies and The Earth System Grid

Ontologies and The Earth System Grid Ontologies and The Earth System Grid Line Pouchard (ORNL) PI s: Ian Foster (ANL); Don Middleton (NCAR); and Dean Williams (LLNL) http://www.earthsystemgrid.org The NIEeS Workshop Cambridge, UK Overview:

More information

Design and Implementation of the Japanese Virtual Observatory (JVO) system Yuji SHIRASAKI National Astronomical Observatory of Japan

Design and Implementation of the Japanese Virtual Observatory (JVO) system Yuji SHIRASAKI National Astronomical Observatory of Japan Design and Implementation of the Japanese Virtual Observatory (JVO) system Yuji SHIRASAKI National Astronomical Observatory of Japan 1 Introduction What can you do on Japanese Virtual Observatory (JVO)?

More information

Agent-Enabling Transformation of E-Commerce Portals with Web Services

Agent-Enabling Transformation of E-Commerce Portals with Web Services Agent-Enabling Transformation of E-Commerce Portals with Web Services Dr. David B. Ulmer CTO Sotheby s New York, NY 10021, USA Dr. Lixin Tao Professor Pace University Pleasantville, NY 10570, USA Abstract:

More information

Building on Existing Communities: the Virtual Astronomical Observatory (and NIST)

Building on Existing Communities: the Virtual Astronomical Observatory (and NIST) Building on Existing Communities: the Virtual Astronomical Observatory (and NIST) Robert Hanisch Space Telescope Science Institute Director, Virtual Astronomical Observatory Data in astronomy 2 ~70 major

More information

Building the NNEW Weather Ontology

Building the NNEW Weather Ontology Building the NNEW Weather Ontology Kelly Moran Kajal Claypool 5 May 2010 1 Outline Introduction Ontology Development Methods & Tools NNEW Weather Ontology Design Application: Semantic Search Summary 2

More information

SERVO - ACES Abstract

SERVO - ACES Abstract 1 of 6 12/27/2004 2:33 PM 2 of 6 12/27/2004 2:33 PM Implementing GIS Grid Services for the International Solid Earth Research Virtual Observatory Galip Aydin (1), Marlon Pierce (1), Geoffrey Fox (1), Mehmet

More information

PTA Control System Architecture

PTA Control System Architecture PTA Control System Architecture Robert Ackermann January 30, 2002 1 Introduction This document establishes the PTA Control System architecture. It contains a high-level description of the proposed hardware

More information

The Grid Architecture

The Grid Architecture U.S. Department of Energy Office of Science The Grid Architecture William E. Johnston Distributed Systems Department Computational Research Division Lawrence Berkeley National Laboratory dsd.lbl.gov What

More information

Version 3 Updated: 10 March Distributed Oceanographic Match-up Service (DOMS) User Interface Design

Version 3 Updated: 10 March Distributed Oceanographic Match-up Service (DOMS) User Interface Design Distributed Oceanographic Match-up Service (DOMS) User Interface Design Shawn R. Smith 1, Jocelyn Elya 1, Adam Stallard 1, Thomas Huang 2, Vardis Tsontos 2, Benjamin Holt 2, Steven Worley 3, Zaihua Ji

More information

Agent Framework For Intelligent Data Processing

Agent Framework For Intelligent Data Processing Agent Framework For Intelligent Data Processing Rahul Ramachandran, Sara Graves, Sunil Movva and Xiang Li Information Technology and Systems Center University of Alabama in Huntsville rramachandran@itsc.uah.edu

More information

Flexible Framework for Mining Meteorological Data

Flexible Framework for Mining Meteorological Data Flexible Framework for Mining Meteorological Data Rahul Ramachandran *, John Rushing, Helen Conover, Sara Graves and Ken Keiser Information Technology and Systems Center University of Alabama in Huntsville

More information

Levels of Service Authors: R. Duerr, A. Leon, D. Miller, D.J Scott Date 7/31/2009

Levels of Service Authors: R. Duerr, A. Leon, D. Miller, D.J Scott Date 7/31/2009 Levels of Service Authors: R. Duerr, A. Leon, D. Miller, D.J Scott Date 7/31/2009 CHANGE LOG Revision Date Description Author 1.0 7/31/2009 Original draft Duerr, Leon, Miller, Scott 2.0 3/19/2010 Updated

More information

Toward the Development of a Comprehensive Data & Information Management System for THORPEX

Toward the Development of a Comprehensive Data & Information Management System for THORPEX Toward the Development of a Comprehensive Data & Information Management System for THORPEX Mohan Ramamurthy, Unidata Steve Williams, JOSS Jose Meitin, JOSS Karyn Sawyer, JOSS UCAR Office of Programs Boulder,

More information

Realizing the Army Net-Centric Data Strategy (ANCDS) in a Service Oriented Architecture (SOA)

Realizing the Army Net-Centric Data Strategy (ANCDS) in a Service Oriented Architecture (SOA) Realizing the Army Net-Centric Data Strategy (ANCDS) in a Service Oriented Architecture (SOA) A presentation to GMU/AFCEA symposium "Critical Issues in C4I" Michelle Dirner, James Blalock, Eric Yuan National

More information

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison

Uniform Resource Locator Wide Area Network World Climate Research Programme Coupled Model Intercomparison Glossary API Application Programming Interface AR5 IPCC Assessment Report 4 ASCII American Standard Code for Information Interchange BUFR Binary Universal Form for the Representation of meteorological

More information

Reducing Consumer Uncertainty

Reducing Consumer Uncertainty Spatial Analytics Reducing Consumer Uncertainty Towards an Ontology for Geospatial User-centric Metadata Introduction Cooperative Research Centre for Spatial Information (CRCSI) in Australia Communicate

More information

Teiid Designer User Guide 7.5.0

Teiid Designer User Guide 7.5.0 Teiid Designer User Guide 1 7.5.0 1. Introduction... 1 1.1. What is Teiid Designer?... 1 1.2. Why Use Teiid Designer?... 2 1.3. Metadata Overview... 2 1.3.1. What is Metadata... 2 1.3.2. Editing Metadata

More information

The EC Presenting a multi-terabyte dataset MWF via ER the web

The EC Presenting a multi-terabyte dataset MWF via ER the web The EC Presenting a multi-terabyte dataset MWF via ER the web Data Management at the BADC Ag Stephens BADC Data Scientist 11 November 2003 Presentation outline An introduction to the BADC. The project

More information

Building for the Future

Building for the Future Building for the Future The National Digital Newspaper Program Deborah Thomas US Library of Congress DigCCurr 2007 Chapel Hill, NC April 19, 2007 1 What is NDNP? Provide access to historic newspapers Select

More information

Cataloguing GI Functions provided by Non Web Services Software Resources Within IGN

Cataloguing GI Functions provided by Non Web Services Software Resources Within IGN Cataloguing GI Functions provided by Non Web Services Software Resources Within IGN Yann Abd-el-Kader, Bénédicte Bucher Laboratoire COGIT Institut Géographique National 2 av Pasteur 94 165 Saint Mandé

More information

Data publication and discovery with Globus

Data publication and discovery with Globus Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,

More information

Ontology and Application for Reusable Search Interface Design

Ontology and Application for Reusable Search Interface Design Ontology and Application for Reusable Search Interface Design Plans for Advanced Semantic Technologies Final Project Eric Rozell, Tetherless World Constellation Outline Project Overview Paper Contents

More information

Implementing a Data Publishing Service via DSpace. Jon W. Dunn, Randall Floyd, Garett Montanez, Kurt Seiffert

Implementing a Data Publishing Service via DSpace. Jon W. Dunn, Randall Floyd, Garett Montanez, Kurt Seiffert Implementing a Data Publishing Service via DSpace Jon W. Dunn, Randall Floyd, Garett Montanez, Kurt Seiffert May 20, 2009 Outline IUScholarWorks Massive Data Storage Service Example of the data publishing

More information

The Integrated Data Viewer A Tool for Scientific Analysis and Visualization

The Integrated Data Viewer A Tool for Scientific Analysis and Visualization The Integrated Data Viewer A Tool for Scientific Analysis and Visualization Don Murray Unidata Program Center Overview What is the Integrated Data Viewer (IDV)? IDV features Web enabled features Client/Server

More information

The Earth System Grid: Supporting the Next Generation of Climate Modeling Research

The Earth System Grid: Supporting the Next Generation of Climate Modeling Research The Earth System Grid: Supporting the Next Generation of Climate Modeling Research DAVID BERNHOLDT, SHISHIR BHARATHI, DAVID BROWN, KASIDIT CHANCHIO, MEILI CHEN, ANN CHERVENAK, LUCA CINQUINI, BOB DRACH,

More information

Experiences From The Fermi Data Archive. Dr. Thomas Stephens Wyle IS/Fermi Science Support Center

Experiences From The Fermi Data Archive. Dr. Thomas Stephens Wyle IS/Fermi Science Support Center Experiences From The Fermi Data Archive Dr. Thomas Stephens Wyle IS/Fermi Science Support Center A Brief Outline Fermi Mission Architecture Science Support Center Data Systems Experiences GWODWS Oct 27,

More information

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA Rees Williams on behalf of A.N.Belikov, D.Boxhoorn, B. Dröge, J.McFarland, A.Tsyganov, E.A. Valentijn University of Groningen, Groningen,

More information

Design patterns for data-driven research acceleration

Design patterns for data-driven research acceleration Design patterns for data-driven research acceleration Rachana Ananthakrishnan, Kyle Chard, and Ian Foster The University of Chicago and Argonne National Laboratory Contact: rachana@globus.org Introduction

More information

EGEE and Interoperation

EGEE and Interoperation EGEE and Interoperation Laurence Field CERN-IT-GD ISGC 2008 www.eu-egee.org EGEE and glite are registered trademarks Overview The grid problem definition GLite and EGEE The interoperability problem The

More information

Archive II. The archive. 26/May/15

Archive II. The archive. 26/May/15 Archive II The archive 26/May/15 What is an archive? Is a service that provides long-term storage and access of data. Long-term usually means ~5years or more. Archive is strictly not the same as a backup.

More information

The Semantic Planetary Data System

The Semantic Planetary Data System The Semantic Planetary Data System J. Steven Hughes 1, Daniel J. Crichton 1, Sean Kelly 1, and Chris Mattmann 1 1 Jet Propulsion Laboratory 4800 Oak Grove Drive Pasadena, CA 91109 USA {steve.hughes, dan.crichton,

More information

The Herschel Data Processing System: History, Status and latest Developments

The Herschel Data Processing System: History, Status and latest Developments The Herschel Data Processing System: History, Status and latest Developments Stephan Ott Herschel Science Data Processing Development Manager Herschel Science Data Processing Coordinator Herschel Science

More information

Using ESML in a Semantic Web Approach for Improved Earth Science Data Usability

Using ESML in a Semantic Web Approach for Improved Earth Science Data Usability Using in a Semantic Web Approach for Improved Earth Science Data Usability Rahul Ramachandran, Helen Conover, Sunil Movva and Sara Graves Information Technology and Systems Center University of Alabama

More information

Metadata Models for Experimental Science Data Management

Metadata Models for Experimental Science Data Management Metadata Models for Experimental Science Data Management Brian Matthews Facilities Programme Manager Scientific Computing Department, STFC Co-Chair RDA Photon and Neutron Science Interest Group Task lead,

More information

A VO-friendly, Community-based Authorization Framework

A VO-friendly, Community-based Authorization Framework A VO-friendly, Community-based Authorization Framework Part 1: Use Cases, Requirements, and Approach Ray Plante and Bruce Loftis NCSA Version 0.1 (February 11, 2005) Abstract The era of massive surveys

More information

Building a Global Data Federation for Climate Change Science The Earth System Grid (ESG) and International Partners

Building a Global Data Federation for Climate Change Science The Earth System Grid (ESG) and International Partners Building a Global Data Federation for Climate Change Science The Earth System Grid (ESG) and International Partners 24th Forum ORAP Cite Scientifique; Lille, France March 26, 2009 Don Middleton National

More information

Indiana University Research Technology and the Research Data Alliance

Indiana University Research Technology and the Research Data Alliance Indiana University Research Technology and the Research Data Alliance Rob Quick Manager High Throughput Computing Operations Officer - OSG and SWAMP Board Member - RDA Organizational Assembly RDA Mission

More information

SEXTANT 1. Purpose of the Application

SEXTANT 1. Purpose of the Application SEXTANT 1. Purpose of the Application Sextant has been used in the domains of Earth Observation and Environment by presenting its browsing and visualization capabilities using a number of link geospatial

More information

For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS

For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS 1 1. USE CASES For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS Business need: Users need to be able to

More information

Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments *

Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments * Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments * Joesph JaJa joseph@ Mike Smorul toaster@ Fritz McCall fmccall@ Yang Wang wpwy@ Institute

More information

Engagement With Scientific Facilities

Engagement With Scientific Facilities Engagement With Scientific Facilities Eli Dart, Network Engineer ESnet Science Engagement Lawrence Berkeley National Laboratory Global Science Engagement Panel Internet2 Technology Exchange San Francisco,

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

Semantic agents for location-aware service provisioning in mobile networks

Semantic agents for location-aware service provisioning in mobile networks Semantic agents for location-aware service provisioning in mobile networks Alisa Devlić University of Zagreb visiting doctoral student at Wireless@KTH September 9 th 2005. 1 Agenda Research motivation

More information

The C3S Climate Data Store and its upcoming use by CAMS

The C3S Climate Data Store and its upcoming use by CAMS Atmosphere The C3S Climate Data Store and its upcoming use by CAMS Miha Razinger, ECMWF thanks to Angel Alos, Baudouin Raoult, Cedric Bergeron and the CDS contractors Atmosphere What are C3S and CDS? The

More information

NATIONAL OCEANIC AND ATMOSPHERIC ADMINISTRATION S SCIENTIFIC DATA STEWARDSHIP PROGRAM

NATIONAL OCEANIC AND ATMOSPHERIC ADMINISTRATION S SCIENTIFIC DATA STEWARDSHIP PROGRAM J2.3A NATIONAL OCEANIC AND ATMOSPHERIC ADMINISTRATION S SCIENTIFIC DATA STEWARDSHIP PROGRAM John J. Bates * NOAA National Climatic Center, Asheville, North Carolina Richard G. Reynolds Office of Systems

More information

Growing Variety and Volume of Remote Sensing and In Situ Data

Growing Variety and Volume of Remote Sensing and In Situ Data The Potential Role of the World Data Centers in the Global Earth Observing System of Systems and the International Polar Year: CIESIN Experience to Date Dr. Robert S. Chen Director and Senior Research

More information

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center Robert Cook, DAAC Scientist Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN cookrb@ornl.gov

More information

VIRTUAL OBSERVATORY TECHNOLOGIES

VIRTUAL OBSERVATORY TECHNOLOGIES VIRTUAL OBSERVATORY TECHNOLOGIES / The Johns Hopkins University Moore s Law, Big Data! 2 Outline 3 SQL for Big Data Computing where the bytes are Database and GPU integration CUDA from SQL Data intensive

More information

The Heliophysics Data Policy: Status and Plans. Aaron Roberts NASA GSFC 27 February 2012 Heliophysics Subcommittee Meeting

The Heliophysics Data Policy: Status and Plans. Aaron Roberts NASA GSFC 27 February 2012 Heliophysics Subcommittee Meeting The Heliophysics Data Policy: Status and Plans Aaron Roberts NASA GSFC 27 February 2012 Heliophysics Subcommittee Meeting The Data Policy states various functions for the HP Data Environment Produce and

More information

Introduction to GT3. Introduction to GT3. What is a Grid? A Story of Evolution. The Globus Project

Introduction to GT3. Introduction to GT3. What is a Grid? A Story of Evolution. The Globus Project Introduction to GT3 The Globus Project Argonne National Laboratory USC Information Sciences Institute Copyright (C) 2003 University of Chicago and The University of Southern California. All Rights Reserved.

More information

The CEDA Archive: Data, Services and Infrastructure

The CEDA Archive: Data, Services and Infrastructure The CEDA Archive: Data, Services and Infrastructure Kevin Marsh Centre for Environmental Data Archival (CEDA) www.ceda.ac.uk with thanks to V. Bennett, P. Kershaw, S. Donegan and the rest of the CEDA Team

More information

The CISM Education Plan (updated August 2006)

The CISM Education Plan (updated August 2006) The CISM Education Mission The CISM Education Plan (updated August 2006) The CISM Education Mission is to recruit and train the next generation of space physicists and imbue them with an understanding

More information

Information retrieval concepts Search and browsing on unstructured data sources Digital libraries applications

Information retrieval concepts Search and browsing on unstructured data sources Digital libraries applications Digital Libraries Agenda Digital Libraries Information retrieval concepts Search and browsing on unstructured data sources Digital libraries applications What is Library Collection of books, documents,

More information

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation INSPIRE 2010, KRAKOW Dr. Arif Shaon, Dr. Andrew Woolf (e-science, Science and Technology Facilities Council, UK) 3

More information

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT. Chapter 4:- Introduction to Grid and its Evolution Prepared By:- Assistant Professor SVBIT. Overview Background: What is the Grid? Related technologies Grid applications Communities Grid Tools Case Studies

More information

The State of Arctic Data the IPY experience

The State of Arctic Data the IPY experience The State of Arctic Data the IPY experience Mark A. Parsons,Taco de Bruin, Scott Tomlinson, Øystein Godøy, Helen Campbell, Julie Leclert, Ellsworth LeDrew, David Carlson, and the IPY data community. 22

More information

Fusion Registry 9 SDMX Data and Metadata Management System

Fusion Registry 9 SDMX Data and Metadata Management System Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Damien Lecarpentier CSC-IT Center for Science, Finland CESSDA workshop Tampere, 5 October 2012 EUDAT Towards a pan-european Collaborative

More information

Implementation of Geospatial Product Virtualization in Grid Environment

Implementation of Geospatial Product Virtualization in Grid Environment Implementation of Geospatial Product Virtualization in Grid Environment Liping Di, Aijun Chen, Yuqi Bai, and Yaxing Wei Dr. Aijun Chen () George Mason University (GMU) Page 1 Outline Introduction Open

More information

OPeNDAP: Accessing HYCOM (and other data) remotely

OPeNDAP: Accessing HYCOM (and other data) remotely OPeNDAP: Accessing HYCOM (and other data) remotely Presented at The HYCOM NOPP GODAE Meeting By Peter Cornillon OPeNDAP Inc., Narragansett, RI 02882 7 December 2005 8/25/05 HYCOM NOPP GODAE 1 Acknowledgements

More information

Knowledge-based Grids

Knowledge-based Grids Knowledge-based Grids Reagan Moore San Diego Supercomputer Center (http://www.npaci.edu/dice/) Data Intensive Computing Environment Chaitan Baru Walter Crescenzi Amarnath Gupta Bertram Ludaescher Richard

More information

Kepler Scientific Workflow and Climate Modeling

Kepler Scientific Workflow and Climate Modeling Kepler Scientific Workflow and Climate Modeling Ufuk Turuncoglu Istanbul Technical University Informatics Institute Cecelia DeLuca Sylvia Murphy NOAA/ESRL Computational Science and Engineering Dept. NESII

More information

TIGGE and the EU Funded BRIDGE project

TIGGE and the EU Funded BRIDGE project TIGGE and the EU Funded BRIDGE project Baudouin Raoult Head of Data and Services Section ECMWF Slide 1 Slide 1 The TIGGE core dataset THORPEX Interactive Grand Global Ensemble Global ensemble forecasts

More information

B2FIND and Metadata Quality

B2FIND and Metadata Quality B2FIND and Metadata Quality 3 rd EUDAT Conference 25 September 2014 Heinrich Widmann and B2FIND team 1 Outline B2FIND the EUDAT Metadata Service Semantic Mapping of Metadata Quality of Metadata Summary

More information

Representing LEAD Experiments in a FEDORA digital repository

Representing LEAD Experiments in a FEDORA digital repository Representing LEAD Experiments in a FEDORA digital repository You-Wei Cheah, Beth Plale Indiana University Bloomington, IN {yocheah, plale}@cs.indiana.edu IU-CS TR666 ABSTRACT In this paper, we discuss

More information

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007 Grid Programming: Concepts and Challenges Michael Rokitka SUNY@Buffalo CSE510B 10/2007 Issues Due to Heterogeneous Hardware level Environment Different architectures, chipsets, execution speeds Software

More information

How to use Water Data to Produce Knowledge: Data Sharing with the CUAHSI Water Data Center

How to use Water Data to Produce Knowledge: Data Sharing with the CUAHSI Water Data Center How to use Water Data to Produce Knowledge: Data Sharing with the CUAHSI Water Data Center Jon Pollak The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) August 20,

More information

Data Centres in the Virtual Observatory Age

Data Centres in the Virtual Observatory Age Data Centres in the Virtual Observatory Age David Schade Canadian Astronomy Data Centre A few things I ve learned in the past two days There exist serious efforts at Long-Term Data Preservation Alliance

More information

Dagstuhl Seminar on Service-Oriented Computing Session Summary Cross Cutting Concerns. Heiko Ludwig, Charles Petrie

Dagstuhl Seminar on Service-Oriented Computing Session Summary Cross Cutting Concerns. Heiko Ludwig, Charles Petrie Dagstuhl Seminar on Service-Oriented Computing Session Summary Cross Cutting Concerns Heiko Ludwig, Charles Petrie Participants of the Core Group Monika Kazcmarek, University of Poznan Michael Klein, Universität

More information

ESPAS: How it fits and benefits

ESPAS: How it fits and benefits ESPAS: How it fits and benefits Dieter Bilitza Department of Physics and Astronomy, George Mason University, Fairfax, Virginia, USA and Heliospheric Laboratory, NASA Goddard Space Flight Center, Greenbelt,

More information

Abstract. Introduction. The Virtual Astronomy Multimedia Project

Abstract. Introduction. The Virtual Astronomy Multimedia Project The Virtual Astronomy Multimedia Project The Virtual Astronomy Multimedia Project Adrienne Gauthier 1, Lars Lindberg Christensen 2, Robert Hurt 3 & Ryan Wyatt 4 1 Steward Observatory, University of Arizona(agauthier@as.arizona.edu)

More information

Transitioning to Symyx

Transitioning to Symyx Whitepaper Transitioning to Symyx Notebook by Accelrys from Third-Party Electronic Lab Notebooks Ordinarily in a market with strong growth, vendors do not focus on competitive displacement of competitor

More information

New Datasets, Functionality and Future Development. Ashwanth Srinivasan, (FSU) Steve Hankin (NOAA/PMEL) Major contributors: Jon Callahan (Mazama(

New Datasets, Functionality and Future Development. Ashwanth Srinivasan, (FSU) Steve Hankin (NOAA/PMEL) Major contributors: Jon Callahan (Mazama( HYCOM Data Service New Datasets, Functionality and Future Development Ashwanth Srinivasan, (FSU) Steve Hankin (NOAA/PMEL) Major contributors: Jon Callahan (Mazama( Consulting) Roland Schweitzer (Weathertop

More information

Database Engineering. Percona Live, Amsterdam, September, 2015

Database Engineering. Percona Live, Amsterdam, September, 2015 Database Engineering Percona Live, Amsterdam, 2015 September, 2015 engineering, not administration 2 yesterday s DBA gatekeeper master builder superhero siloed specialized 3 engineering quantitative interdisciplinary

More information

Semantic Web Services for Ocean Knowledge Management

Semantic Web Services for Ocean Knowledge Management Semantic Web Services for Ocean Knowledge Management Syed SR. Abidi, Ali Daniyal, Ashraf Abusharek, Samina R. Abidi Abstract We present a web-services based e-research platform to support scientific research

More information

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA Sara Nieto on behalf of B.Altieri, G.Buenadicha, J. Salgado, P. de Teodoro European Space Astronomy Center, European Space Agency, Spain O.R.

More information

Semantic Web. Semantic Web Services. Morteza Amini. Sharif University of Technology Spring 90-91

Semantic Web. Semantic Web Services. Morteza Amini. Sharif University of Technology Spring 90-91 بسمه تعالی Semantic Web Semantic Web Services Morteza Amini Sharif University of Technology Spring 90-91 Outline Semantic Web Services Basics Challenges in Web Services Semantics in Web Services Web Service

More information

Globus GTK and Grid Services

Globus GTK and Grid Services Globus GTK and Grid Services Michael Rokitka SUNY@Buffalo CSE510B 9/2007 OGSA The Open Grid Services Architecture What are some key requirements of Grid computing? Interoperability: Critical due to nature

More information

The Common Framework for Earth Observation Data. US Group on Earth Observations Data Management Working Group

The Common Framework for Earth Observation Data. US Group on Earth Observations Data Management Working Group The Common Framework for Earth Observation Data US Group on Earth Observations Data Management Working Group Agenda USGEO and BEDI background Concise summary of recommended CFEOD standards today Full document

More information