Requirements for a Future Astronomical Data Analysis Environment

Similar documents
Prototypes of a Computational Grid for the Planck Satellite

The Virtual Observatory and the IVOA

The Interaction of the ISO-SWS Pipeline Software and the ISO-SWS Interactive Analysis System

Radio astronomy data reduction at the Institute of Radio Astronomy

ESO Science Archive Interfaces

DOI for Astronomical Data Centers: ESO. Hainaut, Bordelon, Grothkopf, Fourniol, Micol, Retzlaff, Sterzik, Stoehr [ESO] Enke, Riebe [AIP]

IVOA and European VO Efforts Status and Plans

Virtual Observatory and Grid-related projects: national and international status

1. Publishable Summary

Designing a System Engineering Environment in a structured way

A VO-friendly, Community-based Authorization Framework

Data Processing for SUBARU Telescope using GRID

A European Vision and Plan for a Common Grid Infrastructure

SPACE SITUATIONAL AWARENESS

Estonian research infrastructure policy. Toivo Räim Vilnius,

ESO Reflex (FinReflex)

PID System for eresearch

EU policy and the way forward for smart meters and smart grids

Synthesizing Communication Middleware from Explicit Connectors in Component Based Distributed Architectures

Direct 100G connectivity with optoelectronic POLYmer InP integration for data center SYStems

SatNEx: Satellite Communications Network of Excellence. SatNEx. Satellite Communications Network of Excellence

Mapping of the CVD models in Europe

6.871 Expert System: WDS Web Design Assistant System

Mobile Charging. Emphasis on Business and Subscriber Aspects

GRID 2008 International Conference on Distributed computing and Grid technologies in science and education 30 June 04 July, 2008, Dubna, Russia

Fausto Giunchiglia and Mattia Fumagalli

This document is a preview generated by EVS

Astrophysics and the Grid: Experience with EGEE

Lessons learnt from building the International Virtual Observatory Alliance. Françoise Genova

Italy - Information Day: 2012 FP7 Space WP and 5th Call. Peter Breger Space Research and Development Unit

Moving e-infrastructure into a new era the FP7 challenge

The Virtual Observatory in Australia Connecting to International Initiatives. Peter Lamb. CSIRO Mathematical & Information Sciences

EUROPEAN INTERFEROMETRY INITIATIVE

This document is a preview generated by EVS

BHL-EUROPE: Biodiversity Heritage Library for Europe. Jana Hoffmann, Henning Scholz

Building an Assurance Foundation for 21 st Century Information Systems and Networks

A GENERIC SOFTWARE INTERFACE SIMULATOR FOR ALMA COMMON SOFTWARE

Case Study. Deploying IP-Based Videoconferencing in a Large U.S. Government Agency

Basic Requirements for Research Infrastructures in Europe

AN EXPERT SYSTEM FOR MANAGING EARLY AGE CONCRETE CRACK PREDICTION

International Audit and Certification of Digital Repositories

EU funded research is keeping up trust in digital society

The Network Infrastructure for Radio Astronomy in Italy

Migrating Complex Oracle Forms Applications to APEX: Utopia or Reality?

esignature Infrastructure Marketing Model

Data Governance in Mass upload processes Case KONE. Finnish Winshuttle User Group , Helsinki

Towards Process-based Composition of Activities for Collecting Data in Supply Chains

Comments on the document ENTR/02/21-IDA/MIDDLEWARE-XML: Comparative assessment of Open Documents Formats Market Overview as submitted by Valoris

BROADMAP. Mapping Interoperable EU PPDR Broadband Communication Applications and Technology

Designing Survivable Services from Independent Components with Basic Functionality

Cybersecurity. Quality. security LED-Modul. basis. Comments by the electrical industry on the EU Cybersecurity Act. manufacturer s declaration

Key Elements of Global Data Infrastructures

Mobile Connect Pro. Activate Your Account. 200MB Data every month for 3 years included. Mobile Broadband

Structured Query Language for Virtual Observatory

2 The BEinGRID Project

ehaction Joint Action to Support the ehealth Network

Energy Management of Microgrids in a Smart City Environment

Research on Technologies in Smart Substation

Evaluating the Use of Domain-Specific Modeling in Practice

The EuroHPC strategic initiative

Mission-Critical Customer Service. 10 Best Practices for Success

Scientist s User Interface (SUI)

GRIDS INTRODUCTION TO GRID INFRASTRUCTURES. Fabrizio Gagliardi

Developing the ICSU World Data System (WDS)

Collaborative Editing and Linking of Astronomy Vocabularies Using Semantic Mediawiki

A CUSTOMER S PERSPECTIVE ON MUNICIPAL IT-SERVICES

North Carolina Visit and Assessment Tom Clarke Vice President for Research and Technology National Center for State Courts

CONFIRMED RETURN TO GROWTH FOR THE CONSTRUCTION INDUSTRY: INTERMAT 2018 INTRODUCES ITS NEW SPECIAL EVENTS TO MEET THE CHALLENGES OF THE FUTURE

Batch Services at CERN: Status and Future Evolution

SYS-Config v2.0. The clever way to configure your industrial control components. Request your free copy today

Portable Running of Programs for Automation

MORFEO and the MyMobileWeb project

Safety of Nuclear Installations

BroadMap Brief Intro Final Stakeholders Workshop

Smart Finance for Smart Buildings

The Impact of the coming Standard IEC61850 on the Life-cycle of Open Communication Systems in Substations

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

Quality and usability: A new framework

Geospatial Intelligence Centres Empowered by Airbus

An Architecture to Share Metadata among Geographically Distributed Archives

OPEN SOURCE TESTING IN SAUDI ARABIA 1

Bullet Proof. Dimension Technology Solutions, Inc. Methodologies and Practice Concerning. Monitoring, Backups and Recovery

Introduction to Bibliometrics and Tools for Organizing References. Uta Grothkopf ESO Library

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

Data Processing and Data Exchange

Report on integration of E-VLBI System

TI nine months old. Trusted Introducer Status report 1 June TI nine months old Slide 1

EUREKA European Network in international R&D Cooperation

WOJCIECH FROELICH CTO ARGOS MULTILINGUAL

REPORT FROM THE COMMISSION TO THE EUROPEAN PARLIAMENT AND THE COUNCIL

3D visualization of astronomy data using immersive displays

D 7.1 PROJECT WEBSITE

Design optimisation of industrial robots using the Modelica multi-physics modeling language

Spitzer Heritage Archive

HOPE Project AAL Smart Home for Elderly People

CASA Joe McMullin J. McMullin EVLA Advisory Committee Meeting September 6-7, 2007

Visualization & the CASA Viewer

Mediation Patterns for Message Exchange Protocols

Denmark - the new world class data centre hub

Page 1. Free Full HD documentary on Blu-ray or TV! The movie and book of the International Astronomical Union celebrating 400 years of the telescope

Transcription:

ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XIV ASP Conference Series, Vol. 347, 2005 P. L. Shopbell, M. C. Britton, and R. Ebert, eds. Requirements for a Future Astronomical Data Analysis Environment Preben Grosbøl and Klaus Banse European Southern Observatory, Germany Doug Tody, William Cotton, and Tim J. Cornwell National Radio Astronomical Observatory, USA Daniel Ponz European Space Agency, Spain Janne Ignatius CSC, Finnish IT Center for Science, Finland Peter Linde Lund Observatory, Sweden Thijs van der Hulst Kapteyn Astronomical Institute, Netherlands Vadim Burwitz Max-Planck-Institut für extraterrestrische Physik, Germany David Giaretta Starlink Project, United Kingdom Fabio Pasian and Bianca Garilli Istituto Nazionale di Astrofisica, Italy William Pence National Aeronautics and Space Administration, USA Dick Shaw National Optical Astronomical Observatory, USA Abstract. Most of the systems currently used to analyze astronomical data were designed and implemented more than a decade ago. Although they still are very useful for analysis, one often would like a better interface to newer concepts like archives, Virtual Observatories and GRID. Further, incompatibil- 124

Future Astronomical Data Analysis Environment 125 ities between most of the current systems with respect to control language and semantics make it cumbersome to mix applications from different origins. An OPTICON Network, funded by the Sixth Framework Programme of the European Commission, started this year to discuss high-level needs for an astronomical data analysis environment which could provide a flexible access to both legacy applications and new astronomical resources. The main objective of the Network is to establish widely accepted requirements and basic design recommendations for such an environment. The hope is that this effort will help other projects, which consider to implement such systems, in collaborating and achieving a common environment. 1. Why a New Common Software Environment? The major issues facing astronomers who want to analyze their data may be summarized as follows: there exist many different systems but: designed several decades ago, largely incompatible with respect to scripting language used complicated to share data between them (due to keyword semantics) current systems provide only rather limited interfaces to Web services, Virtual Observatories, archives and databases it is difficult to fully exploit available computer resources e.g., GRID Due to old (often monolithic) designs, the current generation of systems will be difficult to upgrade to remedy the problems listed above. A new environment seems to be the better option. It would have to fulfill the following criteria: usage of important legacy applications from current systems easy and attractive for users to develop new tasks increase the ability to collaborate and of sharing of software support easy access to resources such as archives, Virtual Observatories, and GRID computing open, stable and well controlled interface specifications define minimum implementation but allow full featured version specify open, free base system but also provide interface to commercial software 2. High Level requirements The success of a system/environment for data analysis is closely linked to the following attributes or points: stability: Users have to be sure that their investment in learning and using an environment also pays back in the long run. well tested system: Although the system must be subject to its own regression tests, only actual users will be able to find subtle issues in application procedures. availability of new state-of-the-art applications: This is only achievable if it s easy and attractive for users to develop new tasks in it.

126 Grosbøl et al. support: Even with good documentation, high stability and excellent testing, the surrounding software world is not fixed demanding a constant although limited support. collaboration: Modern science is often done in international teams. Features to make such collaborations as easy as possible must be provided. up-to-date features: Error propagation and hypothesis testing are examples of features which a modern environment must support. Also easy access to compare results with models is important. ease of installation: The environment must be trivial to install on most commonly used systems to ensure it can be used by normal astronomers. compatibility: Future data analysis will rely on access to Virtual Observatory data and Grid computer resources which must be fully supported. 3. What defines a new Environment? The concept of what an environment is differs significantly depending on whom you talk to. In this context, an environment must have the following properties: execute tasks in a transparent way so that users can select easily the appropriate computing resources e.g., desktop, GRID or supercomputer provide a standard for parsing information between tasks offer a high level scripting language for flow control access data transparently no matter where they are located such as in databases or through Virtual Observatories make a set of standard services available such as display of data, define a standard interface to services provided A UNIX system with a standard shell would satisfy many of these criteria although arguably not in an optimal way. The issue of defining a new environment may well be to find the minimum acceptable one. 4. From Dream to Reality High level requirements, beautiful concepts and solid architectural designs are always good to have but not worth much if they are not realized in an actual implementation. The OPTICON Network is currently only funded to establish the high level definitions but several organizations and projects have concrete plans for implementations of such an environment for data analysis, for example: ALMA: There is an explicit need for providing an off-line data analysis environment for ALMA data. ESO: The entry of Finland into ESO has made it possible to start a pilot project with the aim of providing better tools for the analysis of ESO data to its community. The OPTICON Network offers a forum where general requirements and design concepts can be discussed as it includes people associate to these and other similar projects. There is a strong feeling within the Network that a global environment with identical interfaces must be established although implementation

Future Astronomical Data Analysis Environment 127 User interactwithdata analyzedata loganalysis selectdata executetask viewdata accessdata Figure 1. UML diagram of typical Use Case details may differ. However, in the end only the funding organizations can make this wish come through. 5. OPTICON Network on Future Astronomical Software Environments OPTICON (Optical Infrared Coordination Network for Astronomy) includes a working group to discuss Future Astronomical Software Environments (FASE) for data analysis. The high level objectives for the FASE working group are: identify areas for which a common FASE is desirable and feasible establish high level requirements for a FASE draft interface and design recommendations for these areas learn from past experience The network has currently 15 members from European institutes and 6 associated members from USA. 6. Top level User Scenario Several user scenarios were considered including data analysis and development of tasks by both teams and individuals. A typical case for an astronomical user is shown as an UML diagram in Fig. 1. The Use Cases are summarized as: analyzedata: provides a high level scripting language for controlling the flow of the data analysis that is execution of specific tasks selectdata: lets the user view possible data sets and select appropriate ones

128 Grosbøl et al. interactwithdata: offers the ability to interact with the data by graphics or other means loganalysis: logs all actions and results providing the user with a comprehensive view over what has been done executetask: executes individual tasks either locally or in a Grid context viewdata: displays data to the user accessdata: provides transparent access to data either locally or at remote sites through the Virtual Observatory interface. Several of these Use Cases (e.g., accessdata, selectdata) will be based on the concepts and tools provided by Virtual Observatories while others (e.g., executetask) will use parts of Grid technology. 7. How can you give your input? It is important that potential users of the environment participate as much as possible in laying down its requirements, both scientific and technical. Although one would not be able to satisfy all wishes, it is essential to have a detailed view of what the community would like to see and work with. The two main channels for this are discussions on: the TWiki: <http://archive.eso.org/opticon/twiki/bin/view/main> the Majordomo e-mail list <fase@eso.org> Acknowledgments. This work is supported by OPTICON which is funded by the European Commission under Contract no. RII3-CT-2004-001566.