Technology for the Virtual Observatory. The Virtual Observatory. Toward a new astronomy. Toward a new astronomy

Similar documents
VIRTUAL OBSERVATORY TECHNOLOGIES

Exploiting Virtual Observatory and Information Technology: Techniques for Astronomy

Usage of the Astro Runtime

Design and Implementation of the Japanese Virtual Observatory (JVO) system Yuji SHIRASAKI National Astronomical Observatory of Japan

The Astro Runtime. for data access. Noel Winstanley Jodrell Bank, AstroGrid. with the part of Noel played by John Taylor, IfA Edinburgh/AstroGrid

Virtual Observatory publication of interferometry simulations

Building on Existing Communities: the Virtual Astronomical Observatory (and NIST)

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

Extending the SDSS Batch Query System to the National Virtual Observatory Grid

Europlanet IDIS: Adapting existing VO building blocks to Planetary Sciences

SIA V2 Analysis, Scope and Concepts. Alliance. Author(s): D. Tody (ed.), F. Bonnarel, M. Dolensky, J. Salgado (others TBA in next version)

National Optical Astronomy Observatory Science Portal. Software Requirements Document for Release 1.0 Version 1.1 (02/08/06)

Knowledge-based Grids

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

The NOAO Data Lab Design, Capabilities and Community Development. Michael Fitzpatrick for the Data Lab Team

The Virtual Observatory and the IVOA

The Portal Aspect of the LSST Science Platform. Gregory Dubois-Felsmann Caltech/IPAC. LSST2017 August 16, 2017

Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and

Designing the Future Data Management Environment for [Radio] Astronomy. JJ Kavelaars Canadian Astronomy Data Centre

Introduction to Grid Computing

VAPE Virtual observatory Aided Publishing for Education

New VO-Related Features in TOPCAT

The IPAC Research Archives. Steve Groom IPAC / Caltech

A VOSpace deployment: interoperability and integration in big infrastructure

Virtual Observatory Tools. Khadija EL Bouchefry

Quality assurance in the ingestion of data into the CDS VizieR catalogue and data services

Structured Query Language for Virtual Observatory

IVOA Astronomical Data Query Language Version 0.6

IVOA and European VO Efforts Status and Plans

Astrophysics and the Grid: Experience with EGEE

Grid Middleware and Globus Toolkit Architecture

Prototype. Towards an AVO Interoperability. Mark Allen Françoise Genova CDS & the AVO Work Area 2 team:

Chapter 18: Web-based Tools NED VO Services

Introduction to GT3. Introduction to GT3. What is a Grid? A Story of Evolution. The Globus Project

P Structured Query Language for Virtual Observatory

Metadata Catalogue Issues. Daan Broeder Max-Planck Institute for Psycholinguistics

The Role of Repositories and Journals in the Astronomy Research Lifecycle

Common Interface for Astronomical Data Service

A VO-friendly, Community-based Authorization Framework

Resource Metadata for the Virtual Observatory Version 1.12

Exploring the Handling of Light Curves in VO tools. Petr Škoda. Jiří Nádvorník, David Andrešič

Spectra & images in VizieR. Gilles Landais (CDS) Laurent Michel (OAS) Pierre Ocvirk (CDS)

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

The NCAR Community Data Portal

Progress Report. Ian Evans On behalf of the Chandra Source Catalog Project Team. Chandra Users Committee Meeting October 25, 2010

AstroDAS: Sharing Assertions across Astronomy Catalogues through Distributed Annotation

SMART CONNECTOR TECHNOLOGY FOR FEDERATED SEARCH

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

Knowledge Discovery Services and Tools on Grids

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

The IVOA in 2008: Assessment and Future Roadmap

The Materials Data Facility

Euclid Archive Science Archive System

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

arxiv: v1 [astro-ph.im] 14 Oct 2016

Design The way components fit together

Spitzer Heritage Archive

Digital Library Interoperability. Europeana

MuseKnowledge Hybrid Search

A Repository of Metadata Crosswalks. Jean Godby, Devon Smith, Eric Childress, Jeffrey A. Young OCLC Online Computer Library Center Office of Research

DSpace Fedora. Eprints Greenstone. Handle System

Research on the Interoperability Architecture of the Digital Library Grid

METADATA INTERCHANGE IN SERVICE BASED ARCHITECTURE

ESA Science Archives Architecture Evolution. Iñaki Ortiz de Landaluce Science Archives Team 13 th Sept 2013

Research Data Repository Interoperability Primer

Simple Image Access Protocol. Version 2.0 (WD-SIAP ) IVOA Working Draft 2009 November 4. International. Virtual. Observatory.


The Social Grid. Leveraging the Power of the Web and Focusing on Development Simplicity

Large Scale Data Management of Astronomical Surveys with AstroSpark

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

Opal: Simple Web Services Wrappers for Scientific Applications

Agent-Enabling Transformation of E-Commerce Portals with Web Services

Design The way components fit together

Batch is back: CasJobs, serving multi-tb data on the Web

Service Interface Design RSVZ / INASTI 12 July 2006

DATA MANAGEMENT SYSTEMS FOR SCIENTIFIC APPLICATIONS

Web Services for the Virtual Observatory

Andrea Sciabà CERN, Switzerland

IVOA Cape Town Interop Closing Remarks

Case Study: CyberSKA - A Collaborative Platform for Data Intensive Radio Astronomy

Index Introduction Setting up an account Searching and accessing Download Advanced features

Data Centres in the Virtual Observatory Age

Mitigating Risk of Data Loss in Preservation Environments

DIGITAL STEWARDSHIP SUPPLEMENTARY INFORMATION FORM

Data Processing for SUBARU Telescope using GRID

ADQL/s Syntax (Proposal)

TAP services integration at IA2 data center

Theoretical Models in the Virtual Observatory

Distributed Archive System for the Cherenkov Telescope Array

IVOA Spectral Energy Distribution (SED) Data Model

(9A05803) WEB SERVICES (ELECTIVE - III)

TAP Service Discovery in TOPCAT

Harnessing Grid Resources to Enable the Dynamic Analysis of Large Astronomy Datasets

Database Developers Forum APEX

Java Development and Grid Computing with the Globus Toolkit Version 3

Grid computing with IVOA standards and VOTech components

Tutorial "Gaia in the CDS services" Gaia data Heidelberg June 19, 2018 Sébastien Derriere (adapted from Thomas Boch)

chinaxiv: v1

B2SAFE metadata management

Outline. Infrastructure and operations architecture. Operations. Services Monitoring and management tools

Transcription:

Technology for the Virtual Observatory BRAVO Lecture Series, INPE, Brazil July 23-26, 2007 1. Virtual Observatory Summary 2. Service Architecture and XML 3. Building and Using Services 4. Advanced Services Toward a new astronomy Past: Observations of small, carefully selected samples The Virtual Observatory Roy Williams California Institute of Technology Co-Director, US Virtual Observatory Technical Lead, Int l Virtual Observatory Toward a new astronomy Future: Multi-wavelength data for millions of objects, allowing us to: Discover significant patterns from the analysis of statistically rich and unbiased image/catalog databases Understand complex astrophysical systems via confrontation between data and sophisticated numerical simulation 1

Why the VO? International collaboration Astronomy is becoming Data-Centric! 1000 100 http://ivoa.net 10 1970 1975 1980 1985 1990 1995 2000 CCDs 1 0.1 Glass Total area of 3m+ telescopes in the world in m 2, total number of CCD pixels in Megapixels, as a function of time. Growth over 25 years is a factor of 30 in glass, 3000 in pixels. IVOA now has 16 member projects Fall 2007 Interop: Cambridge UK, Sept 2008 Spring 2008: Trieste IT, May 2008 Fall 2008: Baltimore US, Sept 2008 Adopted a standards process based on W3C Forum for discussion and sharing of experience Interoperability Give me all tables containing the V magnitude in the Johnson system. 144 different names for Johnson V. VizieR: Contains more than 4000 astronomical catalogues consisting of one or several tables. Interoperability challenges Data formats and delivery Data service request/response Metadata standards Publishing and discovery () Content descriptors and Units Database query language Web services Authentication and authorization 2

Discover Compute Publish Collaborate Portals, User Interfaces, Tools VOPlot DIS SkyQuery Aladin Mirage Topcat convot OASIS interfaces to data Layer Data Services Compute Services Compute Services HTTP Services SOAP Services Grid Services stateless, registered & self-describing & persistent, authenticated Semantics (UCD) crossmatch visualization OAI ADS FITS, GIF, VOTable OpenSkyQuery SIAP, SSAP image data mining source detection Virtual Data Digital Library Workflow (pipelines) Other registries XML, DC, METS Authentication & Authorization Existing Data Centers My Space storage services Grid Middleware SRB, Globus, OGSA Databases, Persistency, Replication SOAP, GridFTP Disks, Tapes, CPUs, Fiber Bulk Access Getting Data Data Standards Getting Data: Formats, Services VOTable Rich metadata XML Space-Time Coordinates Location and regions Celestial frames, surface of Mars, etc etc Sophisticated and comprehensive Unified Content Descriptors Standard names for astronomical data types Mined from thousands of published catalogs Johnson V magnitude: phot.mag;em.opt.v 3

Example Service Web Services Client interface name= M51 x= 10 y= 10 sky_survey=dss2_red mime_type=download_gif Request: Keyword/value http://archive.eso.org/dss/dss/image Server Principle: Click or Code Protocol: simple REST/GET or sophisticated SOAP Standards Basic service profile formal description (WSDL), input parameters, output formats, capabilities VOResource (who and what is it, for the ) Security (single sign-on, authentication and authorization) Asynchronous (batch) services Distributed data storage (VOSpace) Coordinated with IVOA, Open Grid Forum Response: image/gif Data Services Data Services Cone Search First standard NVO service: radius+position list of objects encoded as VOTable Simple Image Access Protocol cone search for images images are referenced by URL Simple Spectrum Access Protocol spectra have subtleties protocol more complicated Astronomical Data Query Language For database queries Core SQL functions plus astronomy-specific extensions Sky region, Xmatch SkyNode Exposes relational databases Accepts ADQL query Full SkyNodes support positional cross-match function OpenSkyQuery portal show database structure query tools Inventory services Footprint services 4

Protocol List Formats VOTable FITS Cone Search Protocol Simple Image Access (SIA) Protocol Simple Spectral Access (SSA) Protocol SkyNode Astronomical Data Query Language (ADQL) Uniform Column Descriptors (UCDs) Space-Time Coordinates (STC) IVOA Identifiers VO Resource Metadata Access Protocols OAI VOEvent NESSSI VOSpace PLASTIC Common Execution Architecture VOTable XML format for tabular information Catalog query results List of images or spectra Lists of resources Includes UCDs for columns FITS Standard astronomical binary data format Images, spectra, Semantic content IVOA identifiers Scheme for providing unique identifiers for any thing that needs to be distinguished. Services, requests, ivo://authority/authority_controlled_string#sub-resource Used in registries and other protocols resource metadata Standardized format for the description of a service, data collection, facility. XML format based on the Dublin-core metadata descriptions for published resources. Used inside registries. Uniform Column/Content Descriptors (UCDs) Standardized names for table columns with relatively high degree of specificity. pos.eq.ra or phot.flux;em.ir Used in VOTables and by Cone search and SIA Space/Time Coordinates Way of specifying points or regions of space-time. Can be simple or very complex Used in ADQL and VOEvent Discovering Resources Collection of resources that users can find included Cone Search, SIA, SSA and SkyNodes. User access via registry access protocol -registry communication via OAI Some publishing registries are meant to be seen only by other registries access How users query the registry In standards process Open Archive Interface (OAI) Library community standard -registry communication to ensure that all registries are up to date. VOEvent VO protocol for publishing and receiving notice of transient/urgent events Describes events using STC 5

Accessing and Controlling Computer Resources NESSSI NVO Extensible, Scalable, Secure Service Infrastructure Provide access to protected resources, especially Grid computational resources. Make it easy to do simple things. First implementations just coming on-line VOSpace Access to storage resources in a standardized way. PLASTIC Platform for Astronomical Task InterCommunication Common Execution Environment (CEA) Script/pipeline environment with security and such incorporated (Astrogrid) VOEvent Protocol for notification of transient events Gamma-ray burst, Supernova, Asteroid, M-dwarf flare, etc Enables immediate, automated robotic followup XML format: Who, What, How, Wherewhen, Why Citation of prevous events Push protocols for immediate delivery Google Sky publish -- find -- bind Metadata Descriptions of data collections data delivery services organizations, etc. Based on Dublin Core with astronomy-specific extensions Represented as XML schema; extensible Contents stored in Resource Registries exchange metadata records through the Open Archives Initiative Protocol (OAI-PMH) 6

Distributed NVO Portal Find source catalogs, image archives, and other astronomical resources registered with the NVO Caltech CfA NCSA STScI/JHU NOAO HEASARC Astrogrid CDS ESO JapanVO A is a distributed database of Virtual Observatory resources: access services for catalog, image, and spectral data; descriptions of organizations and data collections. Coordinated registry implementations that share information by harvesting each other's resources. Ongoing harvesting March 07 (CfA, ESO, NOAO soon) Searches for resources can be done by keyword, or advanced queries can be expressed in the SQL language. The registry is open for humans through web forms, or machines through SOAP web services. Search of services 7

Datascope another mashup DataScope DataScope Federates multiple cone, SIAP services Caltech Publish OAI NCSA Publish OAI 2 JHU/StSci OAI Query 4 Goddard DataScope 1 3 8

SkyQuery database queries and crossmatch SkyQuery: Interface At the heart of SkyQuery is spatial matching find the same objects listed in different catalogs. The SkyQuery interface window is divided in 2 parts: Astronomical archives double every year. Increasingly, data are stored in on-line archives. Instrumental surveys covering a large fraction of the sky are a major part of this revolution. SkyQuery relies on surveys stored at different geographic locations. The list of SkyNodes can be queried by marking checkboxes A SQL-like text box is used to submit queries SkyQuery: consult a SkyNode s parameters. Step 1 SkyQuery: consult a SkyNode s parameters The contents of each SkyNode can be browsed. Select a survey and its list of tables or functions. Click Search. From the entries returned, enter any column/function of interest in Argument. Click Search. To build a query, we need to know what keywords to use in table e.g. PhotoPrimary, so select Columns(TableName). Now run Steps 1 and 2 for another SkyNode(s). Columns from PhotoPrimary Now we are ready to run the query 9

Run an ADQL (SQL-like) query Open SkyQuery.net Running a query is simple. Write it in the SQL-like query box or Click and customize any of 4 sample queries, #1, #2 See also tutorial http://www.skyquery.net/ tutorial.html To submit query, click Submit. Cross correlation results appear! (ra,dec,radius) Execution Crossmatch result 10

Compute Services Compute Services Asynchronous May not get immediate answer just get a place to check back Security Expensive resources, big requests, sequestered data Strong or Weak or None Scalable Graduated path to powerful computation and big data Compute Services Compute Service Architecture Web-based Portals User interacts with web portal Portal runs code on Grid Service requests forwarded to grid resources node Scripted service call User writes code to submit and monitor jobs User manages certificate client certificate JSON-RPC Cert policy service container node node node workbench storage http 11

Graduated security Mosaic Service No certificate gets 15 CPU-minutes from community account Just switch on Javascript or connect to portal Weak certificate gets 1 CPU-hour from community account In exchange for registering name/email Can do scripted access Strong (gridmapped) certificate gets infinity from own account Get this one from TeraGrid HQ "nesssi_strong_cert_max_time" : 216000, "nesssi_weak_cert_max_time" : 3600, "nesssi_anon_max_time" : 900, "nesssi_anon_user" : nvo", "nesssi_weak_user" : "nvo", nesssiserver=nesssi.client('https://envoy.c acr.caltech.edu:8443/clarens/',debug=0) mosaic_loc = "-ra 49.1 -dec 60.1 -rawidth 0.5 -decwidth 0.5 -filt f -bgcorr 0" session = nesssiserver.dpossmosaic.mosaic(mosaic_loc) print "Your session ID is %s." % session msg = dbsvr.dpossmosaic.monitor(session) print msg Repeat the monitoring Questions? 12