Managing Ecological and Biodiversity Data Using Ecoinformatics: Taiwan Experience. Chau Chin Lin Taiwan Forestry Research Institute

Similar documents
Putting the Archives to Work: Workflow and Metadata-driven Analysis in LTER Science

International Multidisciplinary Metadata Workshop 18 January Rebecca Koskela Arctic Region Supercomputing Center

Florida Coastal Everglades LTER Program

Site# Date H20 Temperature Conductance Turbidity KRS Sep KRS Aug KRS Aug

DataONE Enabling Cyberinfrastructure for the Biological, Environmental and Earth Sciences

Metadata Zoo Dataset Metadata Rebecca Koskela Execu4ve Director, DataONE

DataONE: Open Persistent Access to Earth Observational Data

Wade Sheldon. Georgia Coastal Ecosystems LTER University of Georgia CUAHSI Virtual Workshop Field Data Management Solutions

Using XML-encoded Metadata as a Basis for Advanced Information Systems for Ecological Research

EUDAT- Towards a Global Collaborative Data Infrastructure

Global Research Infrastructures for Biodiversity and Ecosystems Research

Introduction to Grid Computing

Solving informatics challenges to advance plant ecology: a vision for the next 100 years

Data Symposium 2012 SeWHIP & CTSI John W. Cobb, Ph.D. Milwaukee, WI March 1, 2012

Engaging and Connecting Faculty:

Generating EML from a Relational Database Management System (RDBMS)

Digital repositories as research infrastructure: a UK perspective

Sessions 3/4: Member Node Breakouts. John Cobb Matt Jones Laura Moyers 7 July 2013 DataONE Users Group

SEAD Data Services. Jim Best Practices in Data Infrastructure Workshop. Cooperative agreement #OCI

Data publication and discovery with Globus

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Introduction to SDIs (Spatial Data Infrastructure)

LifeWatch/EnvEurope User Forum Use Case Ecology

Relation between Geospatial information projects related to GBIF

Using Web Services and Scientific Workflow for Species Distribution Prediction Modeling 1

A High-Level Distributed Execution Framework for Scientific Workflows

BHL-EUROPE: Biodiversity Heritage Library for Europe. Jana Hoffmann, Henning Scholz

Report to the IMC EML Data Package Checks and the PASTA Quality Engine July 2012

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Advancing the fourth paradigm of research: Assimilating repositories into active research phases

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Annotation in EML 2.2. knb. EML Dev Committee 2018

GLOBAL INFRASTRUCTURES FOR SUPPORTING BIODIVERSITY RESEARCH

Networking European Digital Repositories

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

THE NATIONAL DATA SERVICE(S) & NDS CONSORTIUM A Call to Action for Accelerating Discovery Through Data Services we can Build Ed Seidel

Jeffery S. Horsburgh. Utah Water Research Laboratory Utah State University

DataONE Cyberinfrastructure. Ma# Jones Dave Vieglais Bruce Wilson

Workflow Exchange and Archival: The KSW File and the Kepler Object Manager. Shawn Bowers (For Chad Berkley & Matt Jones)

An Ecoillllfowmatks Appllkatiollll fow JFowest Dyll1lamks Pilot Data Managemellllt and! hawing

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

Georgia Coastal Ecosystems LTER Information Management

Supporting Data Stewardship Throughout the Data Life Cycle in the Solid Earth Sciences

EUDAT. Towards a pan-european Collaborative Data Infrastructure

Networking European Digital Repositories

ClinVar. Jennifer Lee, PhD, NCBI/NLM/NIH ClinVar

The EUDAT Collaborative Data Infrastructure

A data repository website on marine ornamental fin fishes and shell fishes from Indian waters

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

The New Bioinformatics: Integrating Ecological Data from the Gene to the Biosphere

DataONE. Promoting Data Stewardship Through Best Practices

Web Services for Integrated Management: a Case Study

Title: Interactive data entry and validation tool: A collaboration between librarians and researchers

GEOSS Data Management Principles: Importance and Implementation

Wade Sheldon. Georgia Coastal Ecosystems LTER University of Georgia

Data Curation Practices at the Oak Ridge National Laboratory Distributed Active Archive Center

EUDAT. Towards a pan-european Collaborative Data Infrastructure

NRF Open Access Statement

Global Data Sharing The Research Data Alliance

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

A USER S GUIDE TO REGISTERING AND MAINTAINING DATA SERVICES IN HIS CENTRAL 2.0

Networking European Digital Repositories

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

CREATING DIGITAL REPOSITORIES PRESENTED BY CHAMA MPUNDU MFULA CHIEF LIBRARIAN NATIONAL ASSEMBLY OF ZAMBIA

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

EarthCube and Cyberinfrastructure for the Earth Sciences: Lessons and Perspective from OpenTopography

Introduction to Data Management for Ocean Science Research

Grant Name: Development of Inter-Agency Rare Species Data Sharing and Exchange for Statewide Wildlife Conservation Planning.

DATA SHARING FOR BETTER SCIENCE

EUROPEAN COMISSION INFORMATION SOCIETY AND MEDIA DIRECTORATE-GENERAL. Information and Communication Technologies. Collaborative Project

Developing a national disease registry: the German approach to a rare disease registry

Windsor Essex Environmental Metadata System (WEEMS)

Giovanni Lamanna LAPP - Laboratoire d'annecy-le-vieux de Physique des Particules, Université de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France

Arctic Data Center: Call for Synthesis Working Group Proposals Due May 23, 2018

Indiana University Research Technology and the Research Data Alliance

FAIR-aligned Scientific Repositories: Essential Infrastructure for Open and FAIR Data

EUDAT. Towards a pan-european Collaborative Data Infrastructure. Damien Lecarpentier CSC-IT Center for Science, Finland EUDAT User Forum, Barcelona

Data Curation Profile Botany / Plant Taxonomy

Improving Data Discovery in Metadata Repositories through Semantic Search

EUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd

Building a Global Data Federation for Climate Change Science The Earth System Grid (ESG) and International Partners

Data Curation Profile Plant Genetics / Corn Breeding

A Data Management Plan Template for Ecological Restoration and Monitoring

Distributed Repository for Biomedical Applications

INSPIRE & Environment Data in the EU

NERD workshop. Luca ALMAnaCH - Inria Paris. Berlin, 18/09/2017

Quality Assured (QA) data

Dryad Curation Manual, Summer 2009

C3S Data Portal: Setting the scene

Metadata Management System (MMS)

Scientific Data Management for the ATP 3. Edward J. Wolfrum, Eric Knoshaug, Lieve Laurens, Valerie Harmon, John A. McGowen

SERVO - ACES Abstract

Title Vega: A Flexible Data Model for Environmental Time Series Data

Ecosystems Research & Environmental Assessment Biologist Department Division/Region Community Location Environment Wildlife Igloolik Nunavut

Presented by Dr Joanne Evans, Centre for Organisational and Social informatics Faculty of IT, Monash University Designing for interoperability

Research Elsevier

EUDAT. Towards a pan-european Collaborative Data Infrastructure

The Role of Repositories and Journals in the Astronomy Research Lifecycle

Transcription:

Managing Ecological and Biodiversity Data Using Ecoinformatics: Taiwan Experience Chau Chin Lin Taiwan Forestry Research Institute

Persons to Thank First for The Following Presentation Dr. Hen-biau King (TFRI Director 2003-2007) Dr. Bill Chang (US NSF)

Ecology:Information of Biocomplexity Biotic Abiotic Temporal Spatial

Biodiversity:Information of Life Class: Insecta Order: Lepidoptera Family: Pyralidae Genus: Ostrinia Hübner, 1825 Taxonomic Names Synonym: Pyralis nubilalis Hübner, 1796 Sequence Data Locus: AAL35331 Definition: acyl-coa Z/E11 desaturase 1 mvpyattadg hpekdecfed... Species: Ostrinia nubilalis (Hübner, 1796) Vernacular (EN): European Corn-borer Vernacular (DE): Maiszünsler Vernacular (ES): Piral del maíz Vernacular (FR): Pyrale du maïs Family: Gramineae Taxonomic Descriptions Diagnosis: Wingspan 26-30mm; sexually dimorphic;male: forewings ochreous to dark brown; female: forewings pale yellow; Digital Literature and Web Resources Foodplant: Zea mais L. 1753 Biotic Interactions Spatial /Temporal Observations Collection: DGH Lepidoptera Record id: DGHEUR_003217 Country: France Coordinates: 03.047 E 48.730 N Date: 28 June 2003 Collector: Donald Hobern Individuals: 3 Richness: Pheromones of Ostrinia http://www.nysaes.cornell.edu/fst/faculty/acree /pheronet/phlist/ostrinia.html Abiotic Average Rainfall Location: 48.82 N 2.29 E Jan Feb Mar Apr... 182.3 120.6 158.1 204.9...

All Based on Data Why Data Management Is Important in Ecological Research? http://siliconangle.com/blog/2012/

Data Informs Impacts of Biodiversity Loss on Ocean Ecosystem Services Annual Cumulative Worm et al., Science 2006

Data Enhances Understanding of The Real World Understanding this disease requires knowledge of epidemiology, genetics, and transmission modes, along with their ecological contexts. Integrating ecologically pertinent data into the chain of information from the gene to the biosphere will significantly enhance our understanding of the natural world. Whitfield J. 2003 Ape populations decimated by hunting and Ebola virus. Nature 422:551

However,

Data Collection Is A Hard Work information Data/Raw data/dataset Observations/experiments the real world

Traditional Way of Research Doesn t Care About Data Analysis and modeling Raw Data Data Collection Problem Planning

Information Content Data Entropy Occurs Without Managing Time of publication Specific details General details Accident Retirement or career change Death Time (Michener et al. 1997)

What Data We Have Collected Slide from Dr. John Porter

For Example: Forest Dynamics Plot Data

Forest Dynamics Plots in Taiwan 16 Plots Around the Island

For Example : Biodiversity Data

For Example : Carbon Flux Towers

How Did We Do? Used data Collection Original Observations Analysis and modeling Selection and extraction Secondary Observations Planning Problem Definition (Research Objectives) Planning

What Techniques We Need? A framework that enables scientists to generate new knowledge through innovative tools and approaches For management, archiving, curation, discovering, retrieval, integrating, analyzing, and visualization of biodiversity and ecological data It is called Ecoinformatics

Search and Adapt The Existing Tools < EML> Ecological Metadata Language, EML Morpho metadata and data management software Metacat distributed data system registries: KNB, UCNRS, OBFS, NCEAS, PISCO, LTER EcoGrid and Tool Kit integrating distinct data systems and networks Kepler grid-enabled scientific workflows

Assembling Tools As An Information Management System

EML Driven IMS Senor Network ecogrid QA/QC Information Management Information Synthesis

Dealing with Data Flow Change Slide from US LTER

Dealing with Data Collecting Change Interpret a number 10 x daily Interpret a pattern 1,000 x daily

Dealing with Data Deluge

Providing Good Quality Data Available Online

Capacity Building and Training Helped from US LTER

International Collaborations 2006

Help Each Other within EAP! U.S. LTER Taiwan TFRI Malaysia (FRIM) Kasetsart University Thailand

Apart from software products there have also been a series of publications in both Asian and Western journals, including TREE, Bioscience and Ecological Informatics

Management, Archiving (Creating Metadata) Metadata?

Standard for Ecology/Biodiversity: EML

EML Modules

Metadata/Data Depository System

Data Curation Network SEV SEV? AND OBFS TFRI Harvester CAP Replication ECNU Key Metacat Catalog LNO Morpho clients Web clients Site metadata system XML output filter PISCO

Forming A Decentralized National System Internet User-2 User-3 Forestry User-1 Agriculture Authentication National GIS National Park Database Server National Science Council

Joining Data Observation Network for Earth DataONE DataONE DataONE is a data repository for sharing and preserving data is capable of providing researchers to access globally distributed, networked data from a single point of discovery. is a collaboration among many partner organizations, and is funded by the US-NSF. [Through the knowledge and infrastructure integrates information] National Center for Ecological Analysis and Synthesis (NCEAS), U.S.A; ; http://www.dataone.org/what-dataone.

Data Integration Data integration refers to linking research & monitoring data to the modeling community & vice versa. Data integration also refers to archiving data from monitoring, research, & modeling efforts, as well as making the data easily available for others to access & use. http://www.clear.lsu.edu/data_integration/

Toward An Automation of Data Process Workflow archive Data Site 1 Data Site 2 Data Depository Metadata Shared Data Registry Compute grid Service Broker (UDDI) Web Service WSDL Algorithm Simulation Model Get Data Query Data Grid to find data Return URL Query Service broker to find services Return URL & call functions Get Component Archive output data to Depository Archive workflow

Scientific Workflow Approach to Analysis ASCII C RESULTS: Tables Maps Graphs

Application-A Case

Ogawan,Japan Luquillo,Puerdo Rico Lienhuachih,Taiwan Pasho,Malaysia

Metadata Upload EML Document Metadata Catalog EcoGrid Scientific Workflow Morpho EML + Raw data Download Raw data CTFS Data Model Data Retrieval (SQL) Other Data Models (LDAP) WebServer (Apache+PHP)

Action Items for Individual Ecologists Organize, document, and preserve data for posterity Share data Collaborate with networks of colleagues to bring together heterogeneous datasets to address larger scale questions Address data management issues with students and peers

Data Sharing 1.Data policy What are fair policies for providing access to data? 2.Agreements Specification What controls, embargoes, usage constraints, or other limitations are needed to assure fairness of access and use? 3.Policy Administration What data publication models are appropriate?

Experience Learned: Many hands truly do make "light work!" Kaohsiung, Taiwan 2007

THANK YOU FOR YOUR ATTENTION!! chin@tfri.gov.tw