Solving informatics challenges to advance plant ecology: a vision for the next 100 years

Size: px
Start display at page:

Download "Solving informatics challenges to advance plant ecology: a vision for the next 100 years"

Transcription

1 Solving informatics challenges to advance plant ecology: a vision for the next 100 years Brian McGill ESA Baltimore August 14, 2015

2 5 10 First point

3 Where I m coming from 1990s work in computer consulting world doing datawarehousing (building databases for business analytics) 2000s analyze BBS and other large datasets 2010s part of numerous ecoinformatics projects BIEN Botanical Information and Ecology Network 3 iterations 20 million records ETE Evolution of Terrestrial Ecosystems E&O Environment and Organisms Consultation with lots of others

4 The trend Papers in ecology with 'database*'

5 Why the rise of ecoinformatics? push The capacity of hard disks double every 23 months Data growing >exponentially Soberón & Peterson (2004)

6 Why the rise of ecoinformatics? pull Conservation & policy Large scales Long time series Inventory mentality Basic ecology Macroecology Ecosystems ecology Global change

7 The data Biotic measurements Abiotic (environmental measurements Time Time Site Site

8 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

9 McGill s 9 easy steps of ecoinformatics The real order a 1b Collect Scrub Join Store Update Analyze Manage Staff Fund

10 McGill s 9 easy steps of ecoinformatics Scr Amount of Work Gartner Group 70% of datwarehousing is in data preparation ub Jo Collect in Store Update Analyze Manage Staff Fund

11 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

12 Collecting data sources Some are open and on the web Rest is a complex social process Use evolution of sociality 1. Compel/punish (journal, funder requirements) 2. Trust/reputation (small homogeneous group) 3. Joint fate (group selection) Write out an agreement in advance! Most ecoinformatics projects evolve towards openness

13 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

14 Gartner Group 70% of datwarehousing is in data preparation

15 Values 100 cm of rainfall yesterday 0 for NA 1.00 vs 10.0 (transcription errors) Instrument errors Data filling? Space Time 4 dimensions Geocoding (Convention center Baltimore N, W) Geoscrubbing 42,100 for North America 100, 42 for North America 0,0 State centers Best tools, but amazing how often 6/14/2015 vs 2015/6/14 Taxonomy Misspellings Synonymy

16 Synonyms and errors Soberon & Peterson 2004

17 Taxonomic Scrubbing in BIEN 2.5M records 600,000 species in New World! 600,000 names 300,000 standardized names after synonymy and misspelling (fuzzy matching) TNRS service Boyle et al 2013

18 Geoscrubbing Synonymy MEX MX Mexico MEXICO México MÃ xico México Mexi ISO alpha 3 ISO alpha 2 official geonames.org (and gadm.org) name capitalization insensitive geonames.org alternate name recognizable misencoding of México translatable HTML character code not matched 439 country names 62 (14%) unrecognizably misspelled 377 recognized 193 recognized countries (49% synonyms) 43% canonical names

19 GeoScrubbing in BIEN ~ 1/3 of records had no lat/lon (or 0/0) >1/2 had + longitude About 1% had other obvious lat/lon errors 15% not in the right country 25% not in the right state/province 67%*85%*75%=41% correct!

20 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

21 Joining data Cleaning & synonyms Pinus strobus in USFIA vs Pinus strobus L. in MOBOT Semantic joining USFIA has # stems per 0.04 ha plot MOBOT has a specimen card/occurrence Record connecting The easy part databases do this well

22 How to assemble data Three approaches: Semantic web automated assembly over network Standards agree on database format, individuals contribute Bottom up (standards committees) Top down Only works for relatively uniform data Datawarehousing specific people pull it together and maintain Cost $ Effectiveness

23 Two laws of scrubbing & joining Gartner s law #1 expect it to be 70% of your work McGill s law #2 expect ~50% of the data to be wrong

24 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

25 OLTP vs OLAP OLTP=OnLine Transaction Processing The preeminent need in business (since 1960s) Frequent entry of new facts, frequent recall of individual facts, infrequent sweeping analyses Bank account transactions, phone call recording & billing, order entry, accounting OLAP=OnLine Analytical Processing Growing need in business (since 1980s, coined in 1995) Analysis of information stored in OLTP systems Increasing recognition that OLAP systems are not just a query function on OLTP Datawarehousing, multidimensional databases

26 OLAP context OLTP Database OLAP Database Periodic, Incremental Updates

27 Fear 3 rd normal form All undergraduate computer scientists are told to normalize their database schema They tell all ecologists to do this Ecologists always nod their heads knowingly Except experienced database people know normalization is a trade off gradient and it is really good for OLTP and really bad for OLAP

28 OLAP Schema (Star/Snowflake) =Dimensional modelling Time Order Family Diet Species Species Genus Family Authority Body size Functional group Fact Table Abundance Date recorded Plot Latitude Longitude Elevation Landcover Site MAT MAP Biome

29 Katge et al 2011 (MEE) TRY database

30 Multidimensional Databases the conceptual idea? Biotic measurements Abiotic (environmental measurements Time Time Site Site

31 Spatial databases Spatial indexing on lat/lon SQL extensions Distance, area/length, point in polygon PostGIS SELECT binomial, body_size FROM species WHERE ST_Intersects(geom, ST_GeomFromText('POINT(45-100)',4326)); SELECT state.code, species.binomial FROM states JOIN species ON ST_INTERSECTS(states.geom, species.range);

32 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

33 Updating the database Key decision Never (one time) Snapshot (fixed points in time) Continuous Amount of work Benefit More work than you think Expect a 50% reduction each time you update

34 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

35 The fun part Lamanna et al 2014

36 Serving data to other scientists Pretty mapped screens are nice But I want a subset of the data! Options Downloadable dumps Query & download RESTFUL API R API

37 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

38 Over the wall Scientists

39 Iterative prototyping Initial Analysis & Design User Feedback Software Development

40 This is now 1 2 hours/week of your life

41 Software engineering Use cases Design reviews Version control Separate develop and test Automated testing Bug tracking software Steering committees

42 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

43 Students or not students Pros of having students do it Valuable career skill Need to train next generation Cons Often years of work without papers or science Often much time spent on training Computer scientists do cutting edge computer science, not boring data scrubbing & databases Consultants are an underconsidered alternative

44 Student training Not everybody is good at it. Not everybody wants to do it. Not everybody should do it. But the ones who do should be rewarded (or at least not penalized). Paraphrasing Stephen Jackson Ecologists computers or techies ecology? Both but make sure the ecology is in there Every ecology student needs training as a consumer of databases (simple SQL)

45 McGill s 9 easy steps of ecoinformatics Collect Scrub Join Store Update Analyze Manage Staff Fund

46 Funding Good luck! More seriously More grants need to build in data management costs (should NSF give free 15% supplements?) NSF needs to figure way to fund maintaining infrastructure

47 Top community priorities 1. Better scrubbing tools Taxonomy, space, time are pretty standard 2. Developing capacity/training 3. Better spatio temporal tools 4. Imagery/raster tools 5. Broader exposure to more database types 6. Don t reinvent the wheel software best practices exist! 7. Keep the ecology in the drivers seat

The purchaser of the ebook is licensed to copy ebook for use with immediate family members only.

The purchaser of the ebook is licensed to copy ebook for use with immediate family members only. Copyright 2017 Leah Nieman www.leahnieman.com All rights reserved. No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form by any means, electronic, mechanical,

More information

Building Websites People Can Actually Use

Building Websites People Can Actually Use Building Websites People Can Actually Use Your Presenter: Joel Baglien VP Consulting Services, High Monkey Consulting MARCH 13, 2013 Introduction Welcome & thanks to Kentico for hosting the Webinar Please

More information

Data-Intensive Distributed Computing

Data-Intensive Distributed Computing Data-Intensive Distributed Computing CS 451/651 431/631 (Winter 2018) Part 5: Analyzing Relational Data (1/3) February 8, 2018 Jimmy Lin David R. Cheriton School of Computer Science University of Waterloo

More information

Curriculum Catalog

Curriculum Catalog 2017-2018 Curriculum Catalog Career and Technical Education Series 2017 Glynlyon, Inc. Table of Contents FUNDAMENTALS OF DIGITAL MEDIA COURSE OVERVIEW...1 UNIT 1: INTRODUCTION TO DIGITAL AND ONLINE MEDIA

More information

Package BIEN. R topics documented: May 9, 2018

Package BIEN. R topics documented: May 9, 2018 Package BIEN May 9, 2018 Title Tools for Accessing the Botanical Information and Ecology Network Database Version 1.2.3 Provides Tools for Accessing the Botanical Information and Ecology Network Database.

More information

Accelerate your SAS analytics to take the gold

Accelerate your SAS analytics to take the gold Accelerate your SAS analytics to take the gold A White Paper by Fuzzy Logix Whatever the nature of your business s analytics environment we are sure you are under increasing pressure to deliver more: more

More information

1/12/2018. APPA Institute Dallas, TX Feb DATA INTEGRATION PURPOSE OF TODAY S PRESENTATION

1/12/2018. APPA Institute Dallas, TX Feb DATA INTEGRATION PURPOSE OF TODAY S PRESENTATION DATA INTEGRATION APPA Institute for Facilities Management January 23, 2018 Portland, OR PURPOSE OF TODAY S PRESENTATION To provide a broad understanding of: Data as a utility How various units of Facilities

More information

Syllabus. Syllabus. Motivation Decision Support. Syllabus

Syllabus. Syllabus. Motivation Decision Support. Syllabus Presentation: Sophia Discussion: Tianyu Metadata Requirements and Conclusion 3 4 Decision Support Decision Making: Everyday, Everywhere Decision Support System: a class of computerized information systems

More information

12/6/2012. Getting Started with Metadata. Presenter. Making Metadata Work. Overall Topics. Data Collection. Topics

12/6/2012. Getting Started with Metadata. Presenter. Making Metadata Work. Overall Topics. Data Collection. Topics Making Metadata Work Using metadata to document your science Presenter Viv Hutchison US Geological Survey NBII program Metadata Coordinator Location: USGS Western Fisheries Research Center Seattle, WA

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 07 Terminologies Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Database

More information

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A Data Warehousing and Decision Support [R&G] Chapter 23, Part A CS 432 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support business

More information

BHL-EUROPE: Biodiversity Heritage Library for Europe. Jana Hoffmann, Henning Scholz

BHL-EUROPE: Biodiversity Heritage Library for Europe. Jana Hoffmann, Henning Scholz Nimis P. L., Vignes Lebbe R. (eds.) Tools for Identifying Biodiversity: Progress and Problems pp. 43-48. ISBN 978-88-8303-295-0. EUT, 2010. BHL-EUROPE: Biodiversity Heritage Library for Europe Jana Hoffmann,

More information

Data Mining: Exploring Data. Lecture Notes for Chapter 3

Data Mining: Exploring Data. Lecture Notes for Chapter 3 Data Mining: Exploring Data Lecture Notes for Chapter 3 1 What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include

More information

Test Driven Development. René Barto SES Agile Development - Test Driven Development

Test Driven Development. René Barto SES Agile Development - Test Driven Development Test Driven Development René Barto SES Agile Development - Test Driven Development 27-09-2006 Contents About Myself About SES Agile Development A Typical Developer s Day Test Driven Development Questions

More information

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS LECTURE: 05 (A) DATA WAREHOUSING (DW) By: Dr. Tendani J. Lavhengwa lavhengwatj@tut.ac.za 1 My personal quote:

More information

OLAP Introduction and Overview

OLAP Introduction and Overview 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata

More information

European Environmental Data: development, trends and availability

European Environmental Data: development, trends and availability European Environmental Data: development, trends and availability Stefan Jensen, head of group EUROSAI working group on environmental auditing, Cyprus, 24.10.2012 Outline EEA regulation and SEIS as umbrella

More information

WELCOME. to the 1 st online DG CONNECT NIPS Study workshop. July 25, 2013

WELCOME. to the 1 st online DG CONNECT NIPS Study workshop. July 25, 2013 WELCOME to the 1 st online DG CONNECT NIPS Study workshop July 25, 2013 2 DG CONNECT NIPS Study online workshop Agenda topics Timing Facilitator Introduction and practicalities of the workshop 5 min Dan

More information

Making Metadata Work. Using metadata to document your science. August 1 st, 2010

Making Metadata Work. Using metadata to document your science. August 1 st, 2010 Making Metadata Work Using metadata to document your science August 1 st, 2010 Presenter Viv Hutchison US Geological Survey NBII program Metadata Coordinator Location: USGS Western Fisheries Research Center

More information

What is Data Warehouse like

What is Data Warehouse like What is Data Warehouse like in the Big Data Era? Sales (Asia) Data Warehouse Sales (US) ETL ETL Collects and organizes historical data from multiple sources Inventory Advertising ETL ETL So far Ø Star

More information

CONFERENCE OF EUROPEAN STATISTICIANS ACTIVITIES ON CLIMATE CHANGE-RELATED STATISTICS

CONFERENCE OF EUROPEAN STATISTICIANS ACTIVITIES ON CLIMATE CHANGE-RELATED STATISTICS Statistical Commission Forty-seventh session 8-11 March 2016 Item 3(k) of the provisional agenda Climate change statistics Background document Available in English only CONFERENCE OF EUROPEAN STATISTICIANS

More information

Welcome to the topic of SAP HANA modeling views.

Welcome to the topic of SAP HANA modeling views. Welcome to the topic of SAP HANA modeling views. 1 At the end of this topic, you will be able to describe the three types of SAP HANA modeling views and use the SAP HANA Studio to work with views in the

More information

Recently Updated Dumps from PassLeader with VCE and PDF (Question 1 - Question 15)

Recently Updated Dumps from PassLeader with VCE and PDF (Question 1 - Question 15) Recently Updated 70-467 Dumps from PassLeader with VCE and PDF (Question 1 - Question 15) Valid 70-467 Dumps shared by PassLeader for Helping Passing 70-467 Exam! PassLeader now offer the newest 70-467

More information

What is interaction design? What is Interaction Design? Example of bad and good design. Goals of interaction design

What is interaction design? What is Interaction Design? Example of bad and good design. Goals of interaction design What is interaction design? What is Interaction Design? Designing interactive products to support people in their everyday and working lives Sharp, Rogers and Preece (2002) The design of spaces for human

More information

1/5/2019. APPA Institute Dallas, TX Feb DATA INTEGRATION PURPOSE OF TODAY S PRESENTATION

1/5/2019. APPA Institute Dallas, TX Feb DATA INTEGRATION PURPOSE OF TODAY S PRESENTATION DATA INTEGRATION APPA Institute for Facilities Management Ft. Worth, TX January 14, 2019 PURPOSE OF TODAY S PRESENTATION To provide a broad understanding of: Data as a utility How various units of Facilities

More information

Data Warehousing and Decision Support

Data Warehousing and Decision Support Data Warehousing and Decision Support Chapter 23, Part A Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1 Introduction Increasingly, organizations are analyzing current and historical

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

Managing Ecological and Biodiversity Data Using Ecoinformatics: Taiwan Experience. Chau Chin Lin Taiwan Forestry Research Institute

Managing Ecological and Biodiversity Data Using Ecoinformatics: Taiwan Experience. Chau Chin Lin Taiwan Forestry Research Institute Managing Ecological and Biodiversity Data Using Ecoinformatics: Taiwan Experience Chau Chin Lin Taiwan Forestry Research Institute Persons to Thank First for The Following Presentation Dr. Hen-biau King

More information

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar What is data exploration? A preliminary exploration of the data to better understand its characteristics.

More information

Data Warehousing and Decision Support

Data Warehousing and Decision Support Data Warehousing and Decision Support [R&G] Chapter 23, Part A CS 4320 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support business

More information

Data Mining: Exploring Data. Lecture Notes for Data Exploration Chapter. Introduction to Data Mining

Data Mining: Exploring Data. Lecture Notes for Data Exploration Chapter. Introduction to Data Mining Data Mining: Exploring Data Lecture Notes for Data Exploration Chapter Introduction to Data Mining by Tan, Steinbach, Karpatne, Kumar 02/03/2018 Introduction to Data Mining 1 What is data exploration?

More information

ArcGIS Enterprise: An Introduction. Philip Heede

ArcGIS Enterprise: An Introduction. Philip Heede Enterprise: An Introduction Philip Heede Online Enterprise Hosted by Esri (SaaS) - Upgraded automatically (by Esri) - Esri controls SLA Core Web GIS functionality (Apps, visualization, smart mapping, analysis

More information

Data Warehousing. Overview

Data Warehousing. Overview Data Warehousing Overview Basic Definitions Normalization Entity Relationship Diagrams (ERDs) Normal Forms Many to Many relationships Warehouse Considerations Dimension Tables Fact Tables Star Schema Snowflake

More information

Guide Users along Information Pathways and Surf through the Data

Guide Users along Information Pathways and Surf through the Data Guide Users along Information Pathways and Surf through the Data Stephen Overton, Overton Technologies, LLC, Raleigh, NC ABSTRACT Business information can be consumed many ways using the SAS Enterprise

More information

Introduction to Data Management for Ocean Science Research

Introduction to Data Management for Ocean Science Research Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological and Chemical Oceanography Data Management Office 12 November 2009 Ocean Acidification Short Course Woods Hole, MA USA

More information

DataONE: Open Persistent Access to Earth Observational Data

DataONE: Open Persistent Access to Earth Observational Data Open Persistent Access to al Robert J. Sandusky, UIC University of Illinois at Chicago The Net Partners Update: ONE and the Conservancy December 14, 2009 Outline NSF s Net Program ONE Introduction Motivating

More information

GRETL FOR TODDLERS!! CONTENTS. 1. Access to the econometric software A new data set: An existent data set: 3

GRETL FOR TODDLERS!! CONTENTS. 1. Access to the econometric software A new data set: An existent data set: 3 GRETL FOR TODDLERS!! JAVIER FERNÁNDEZ-MACHO CONTENTS 1. Access to the econometric software 3 2. Loading and saving data: the File menu 3 2.1. A new data set: 3 2.2. An existent data set: 3 2.3. Importing

More information

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores Announcements Shumo office hours change See website for details HW2 due next Thurs

More information

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing. About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This

More information

DATA MANAGEMENT MODEL

DATA MANAGEMENT MODEL DATA MANAGEMENT MODEL InfoCoSM Arto Vuorela Vesa Roivas Jean-Pierre Houix 26-29.6.2006 INTRODUCTION This report is an intermediary report summarizing the main objectives to be reached by the end of July

More information

COMPUTATIONAL INTELLIGENCE SEW (INTRODUCTION TO MACHINE LEARNING) SS18. Lecture 6: k-nn Cross-validation Regularization

COMPUTATIONAL INTELLIGENCE SEW (INTRODUCTION TO MACHINE LEARNING) SS18. Lecture 6: k-nn Cross-validation Regularization COMPUTATIONAL INTELLIGENCE SEW (INTRODUCTION TO MACHINE LEARNING) SS18 Lecture 6: k-nn Cross-validation Regularization LEARNING METHODS Lazy vs eager learning Eager learning generalizes training data before

More information

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures) CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm

More information

ETL TESTING TRAINING

ETL TESTING TRAINING ETL TESTING TRAINING Retrieving Data using the SQL SELECT Statement Capabilities of the SELECT statement Arithmetic expressions and NULL values in the SELECT statement Column aliases Use of concatenation

More information

PNAMP Metadata Builder Prototype Development Summary Report December 17, 2012

PNAMP Metadata Builder Prototype Development Summary Report December 17, 2012 PNAMP Metadata Builder Prototype Development Summary Report December 17, 2012 Overview Metadata documentation is not a commonly embraced activity throughout the region. But without metadata, anyone using

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 02 Introduction to Data Warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology

More information

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system A Data Warehouse Implementation Using the Star Schema For an outpatient hospital information system GurvinderKaurJosan Master of Computer Application,YMT College of Management Kharghar, Navi Mumbai ---------------------------------------------------------------------***----------------------------------------------------------------

More information

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List) CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List) Microsoft Solution Latest Sl Area Refresh No. Course ID Run ID Course Name Mapping Date 1 AZURE202x 2 Microsoft

More information

Banking Technology Trends

Banking Technology Trends Banking Technology Trends Tuesday, June 18, 2013 3:30 PM 4:45 PM Presented by: Joe Trafton SVP, Chief Strategic Officer COCC 135 Darling Drive Avon, CT 06001 slide 2 slide 3 Bank Return on Equity 16% 14%

More information

360SCIENCE. use case. Customer Data Matching for Business Intelligence and Finance M&A

360SCIENCE. use case. Customer Data Matching for Business Intelligence and Finance M&A 360SCIENCE use case Customer Data Matching for Business Intelligence and Finance M&A Amir Javidan, SVP Operations PayPal - TIO Networks Overview TIO Networks (a PayPal company) is a financial services

More information

Handout 12 Data Warehousing and Analytics.

Handout 12 Data Warehousing and Analytics. Handout 12 CS-605 Spring 17 Page 1 of 6 Handout 12 Data Warehousing and Analytics. Operational (aka transactional) system a system that is used to run a business in real time, based on current data; also

More information

User interface design. Ian Sommerville 2004 Software Engineering, 7th edition. Chapter 16 Slide 1

User interface design. Ian Sommerville 2004 Software Engineering, 7th edition. Chapter 16 Slide 1 User interface design Ian Sommerville 2004 Software Engineering, 7th edition. Chapter 16 Slide 1 The user interface Should be designed to match: Skills, experience and expectations of its anticipated users.

More information

Eleven+ Views of Semantic Search

Eleven+ Views of Semantic Search Eleven+ Views of Semantic Search Denise A. D. Bedford, Ph.d. Goodyear Professor of Knowledge Management Information Architecture and Knowledge Management Kent State University Presentation Focus Long-Term

More information

Data warehouse architecture consists of the following interconnected layers:

Data warehouse architecture consists of the following interconnected layers: Architecture, in the Data warehousing world, is the concept and design of the data base and technologies that are used to load the data. A good architecture will enable scalability, high performance and

More information

Big data for big river science: data intensive tools, techniques, and projects at the USGS/Columbia Environmental Research Center

Big data for big river science: data intensive tools, techniques, and projects at the USGS/Columbia Environmental Research Center Big data for big river science: data intensive tools, techniques, and projects at the USGS/Columbia Environmental Research Center Ed Bulliner U.S. Geological Survey, Columbia Environmental Research Center

More information

Q1) Describe business intelligence system development phases? (6 marks)

Q1) Describe business intelligence system development phases? (6 marks) BUISINESS ANALYTICS AND INTELLIGENCE SOLVED QUESTIONS Q1) Describe business intelligence system development phases? (6 marks) The 4 phases of BI system development are as follow: Analysis phase Design

More information

The DataBridge: A Social Network for Long Tail Science Data!

The DataBridge: A Social Network for Long Tail Science Data! The DataBridge: A Social Network for Long Tail Science Data Howard Lander howard@renci.org Renaissance Computing Institute The University of North Carolina at Chapel Hill Outline of This Talk The DataBridge

More information

From Data Challenge to Data Opportunity

From Data Challenge to Data Opportunity From Data Challenge to Data Opportunity Jason Hunter, CTO Asia-Pacific, MarkLogic A Big Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub

More information

Human-Computer Interaction: An Overview. CS2190 Spring 2010

Human-Computer Interaction: An Overview. CS2190 Spring 2010 Human-Computer Interaction: An Overview CS2190 Spring 2010 There must be a problem because What is HCI? Human-Computer interface Where people meet or come together with machines or computer-based systems

More information

Generating EML from a Relational Database Management System (RDBMS)

Generating EML from a Relational Database Management System (RDBMS) Page 13 of 34 Generating EML from a Relational Database Management System (RDBMS) - Zhiqiang Yang and Don Henshaw (AND) The Ecological Metadata Language (EML) is an open metadata specification and provides

More information

Online User Clinic: Database Cleanup in The Raiser s Edge. Wednesday February 18, 2009

Online User Clinic: Database Cleanup in The Raiser s Edge. Wednesday February 18, 2009 Online User Clinic: Database Cleanup in The Raiser s Edge Wednesday February 18, 2009 Hosted by: Martin Burke Solutions Engineer Agenda Welcome and Introductions Blackbaud News and Updates Support Resources

More information

CSC 408F/CSC2105F Lecture Notes

CSC 408F/CSC2105F Lecture Notes CSC 408F/CSC2105F Lecture Notes These lecture notes are provided for the personal use of students taking CSC 408H/CSC 2105H in the Fall term 2004/2005 at the University of Toronto. Copying for purposes

More information

DATA MINING TRANSACTION

DATA MINING TRANSACTION DATA MINING Data Mining is the process of extracting patterns from data. Data mining is seen as an increasingly important tool by modern business to transform data into an informational advantage. It is

More information

Development of a Protected Areas Database for Jamaica

Development of a Protected Areas Database for Jamaica Progress Report Technical and Financial Development of a Protected Areas Database for Jamaica Grants for Digital Data Protected Areas Thematic Network Prepared by: Projects Planning & Monitoring Branch

More information

Outline. Quick Introduction to Database Systems. Data Manipulation Tasks. What do they all have in common? CSE142 Wi03 G-1

Outline. Quick Introduction to Database Systems. Data Manipulation Tasks. What do they all have in common? CSE142 Wi03 G-1 Outline Quick Introduction to Database Systems Why do we need a different kind of system? What is a database system? Separating the what the how: The relational data model Querying the databases: SQL May

More information

IUNI Web of Science Data Enclave 102

IUNI Web of Science Data Enclave 102 Enclave 102 Katy Börner and Robert Light Cyberinfrastructure for Network Science Center School of Informatics and Computing and IUNI Indiana University, USA Val Pentchev, Matt Hutchinson, and Benjamin

More information

RethinkDB. Niharika Vithala, Deepan Sekar, Aidan Pace, and Chang Xu

RethinkDB. Niharika Vithala, Deepan Sekar, Aidan Pace, and Chang Xu RethinkDB Niharika Vithala, Deepan Sekar, Aidan Pace, and Chang Xu Content Introduction System Features Data Model ReQL Applications Introduction Niharika Vithala What is a NoSQL Database Databases that

More information

Revenue Management as a Customer Service. Ed Hackney

Revenue Management as a Customer Service. Ed Hackney Revenue Management as a Customer Service Ed Hackney About Ed 14 years with SUEZ and United Water Revenue Management Operational Technology Engineering IT 2 Internet Startups 2 Big 4 consultancies BS Rutgers

More information

Audience BI professionals BI developers

Audience BI professionals BI developers Applied Microsoft BI The Microsoft Data Platform empowers BI pros to implement organizational BI solutions delivering a single version of the truth across the enterprise. A typical organizational solution

More information

Infrastructure Matters

Infrastructure Matters Infrastructure Matters DATA PROTECTION DISASTER RECOVERY CLOUD SECURITY DATA ANALYTICS VIRTUALISATION Intel Xeon Processors. Your infrastructure matters more than you might think. Why? Because if you re

More information

Data Standards and Protocols for Biological Collections Data

Data Standards and Protocols for Biological Collections Data Data Standards and Protocols for Biological Collections Data Meeting? Renato De Giovanni CRIA Reference Center on Environmental Information renato @ cria. org. br ICCC12, September 2010 Outline 1. Introduction

More information

Building A Billion Spatio-Temporal Object Search and Visualization Platform

Building A Billion Spatio-Temporal Object Search and Visualization Platform 2017 2 nd International Symposium on Spatiotemporal Computing Harvard University Building A Billion Spatio-Temporal Object Search and Visualization Platform Devika Kakkar, Benjamin Lewis Goal Develop a

More information

Data Mining: Exploring Data

Data Mining: Exploring Data Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar But we start with a brief discussion of the Friedman article and the relationship between Data

More information

OLAP Cubes 101: An Introduction to Business Intelligence Cubes

OLAP Cubes 101: An Introduction to Business Intelligence Cubes OLAP Cubes 101 Page 3 Clear the Tables Page 4 Embracing Cubism Page 5 Everybody Loves Cubes Page 6 Cubes in Action Page 7 Cube Terminology Page 9 Correcting Mis-cube-ceptions Page 10 OLAP Cubes 101: It

More information

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SHRI ANGALAMMAN COLLEGE OF ENGINEERING & TECHNOLOGY (An ISO 9001:2008 Certified Institution) SIRUGANOOR,TRICHY-621105. DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING Year / Semester: IV/VII CS1011-DATA

More information

Smart Islands, Intelligent Nations. Carlyle Roberts, Bahamas Telecommunications Company August 1 st, 2016

Smart Islands, Intelligent Nations. Carlyle Roberts, Bahamas Telecommunications Company August 1 st, 2016 Smart Islands, Intelligent Nations Carlyle Roberts, Bahamas Telecommunications Company August 1 st, 2016 Technology: Constantly Evolving Evolutionary Path Technology: Creating a new world Evolutionary

More information

Building a Data Warehouse step by step

Building a Data Warehouse step by step Informatica Economică, nr. 2 (42)/2007 83 Building a Data Warehouse step by step Manole VELICANU, Academy of Economic Studies, Bucharest Gheorghe MATEI, Romanian Commercial Bank Data warehouses have been

More information

Data Mining and Warehousing

Data Mining and Warehousing Data Mining and Warehousing Sangeetha K V I st MCA Adhiyamaan College of Engineering, Hosur-635109. E-mail:veerasangee1989@gmail.com Rajeshwari P I st MCA Adhiyamaan College of Engineering, Hosur-635109.

More information

Taking a First Look at Excel s Reporting Tools

Taking a First Look at Excel s Reporting Tools CHAPTER 1 Taking a First Look at Excel s Reporting Tools This chapter provides you with an overview of Excel s reporting features. It shows you the principal types of Excel reports and how you can use

More information

INTRODUCTION TO VISUALIZATION A OVERVIEW

INTRODUCTION TO VISUALIZATION A OVERVIEW Cyberinfrastructure Technology Integration (CITI) Advanced Visualization Division INTRODUCTION TO VISUALIZATION A OVERVIEW Vetria L. Byrd, PhD REU Coordinator June 03, 2014 REU SITE Research Experience

More information

Parcel QA/QC: Video Script. 1. Introduction 1

Parcel QA/QC: Video Script. 1. Introduction 1 1. Introduction 1 Hi! It s been a while since you heard from us. I am Ara Erickson with the University of Washington. We are taking this opportunity to do a quick introduction and explain a few things

More information

Oracle and Tangosol Acquisition Announcement

Oracle and Tangosol Acquisition Announcement Oracle and Tangosol Acquisition Announcement March 23, 2007 The following is intended to outline our general product direction. It is intended for information purposes only, and may

More information

Data Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich

Data Modeling and Databases Ch 14: Data Replication. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Data Modeling and Databases Ch 14: Data Replication Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich Database Replication What is database replication The advantages of

More information

Chapter 8: GPS Clustering and Analytics

Chapter 8: GPS Clustering and Analytics Chapter 8: GPS Clustering and Analytics Location information is crucial for analyzing sensor data and health inferences from mobile and wearable devices. For example, let us say you monitored your stress

More information

Case Study: CyberSKA - A Collaborative Platform for Data Intensive Radio Astronomy

Case Study: CyberSKA - A Collaborative Platform for Data Intensive Radio Astronomy Case Study: CyberSKA - A Collaborative Platform for Data Intensive Radio Astronomy Outline Motivation / Overview Participants / Industry Partners Documentation Architecture Current Status and Services

More information

The Billion Object Platform (BOP): a system to lower barriers to support big, streaming, spatio-temporal data sources

The Billion Object Platform (BOP): a system to lower barriers to support big, streaming, spatio-temporal data sources FOSS4G 2017 Boston The Billion Object Platform (BOP): a system to lower barriers to support big, streaming, spatio-temporal data sources Devika Kakkar and Ben Lewis Harvard Center for Geographic Analysis

More information

The What, Why, Who and How of Where: Building a Portal for Geospatial Data. Alan Darnell Director, Scholars Portal

The What, Why, Who and How of Where: Building a Portal for Geospatial Data. Alan Darnell Director, Scholars Portal The What, Why, Who and How of Where: Building a Portal for Geospatial Data Alan Darnell Director, Scholars Portal What? Scholars GeoPortal Beta release Fall 2011 Production release March 2012 OLITA Award

More information

DATA-SHARING PLAN FOR MOORE FOUNDATION Coral resilience investigated in the field and via a sea anemone model system

DATA-SHARING PLAN FOR MOORE FOUNDATION Coral resilience investigated in the field and via a sea anemone model system DATA-SHARING PLAN FOR MOORE FOUNDATION Coral resilience investigated in the field and via a sea anemone model system GENERAL PHILOSOPHY (Arthur Grossman, Steve Palumbi, and John Pringle) The three Principal

More information

Interoperability ~ An Introduction

Interoperability ~ An Introduction Interoperability ~ An Introduction Cyndy Chandler Biological and Chemical Oceanography Data Management Office (BCO-DMO) Woods Hole Oceanographic Institution 26 July 2008 MMI OOS Interoperability Planning

More information

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your

More information

Data Immersion : Providing Integrated Data to Infinity Scientists. Kevin Gilpin Principal Engineer Infinity Pharmaceuticals October 19, 2004

Data Immersion : Providing Integrated Data to Infinity Scientists. Kevin Gilpin Principal Engineer Infinity Pharmaceuticals October 19, 2004 Data Immersion : Providing Integrated Data to Infinity Scientists Kevin Gilpin Principal Engineer Infinity Pharmaceuticals October 19, 2004 Informatics at Infinity Understand the nature of the science

More information

Introduction to User Stories. CSCI 5828: Foundations of Software Engineering Lecture 05 09/09/2014

Introduction to User Stories. CSCI 5828: Foundations of Software Engineering Lecture 05 09/09/2014 Introduction to User Stories CSCI 5828: Foundations of Software Engineering Lecture 05 09/09/2014 1 Goals Present an introduction to the topic of user stories concepts and terminology benefits and limitations

More information

SQL Server Analysis Services

SQL Server Analysis Services DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, SQL Server 2005 Analysis Services SQL Server 2005 Analysis Services - 1 Analysis Services Database and

More information

Link to Marine Vegetation Atlas website. Link to Nearshore Habitat Program. Overview of the Marine Vegetation Atlas. Contact DNR about the Atlas

Link to Marine Vegetation Atlas website. Link to Nearshore Habitat Program. Overview of the Marine Vegetation Atlas. Contact DNR about the Atlas Washington Marine Vegetation Atlas Help The purpose of the Marine Vegetation Atlas is to provide spatially referenced information and data about vegetation that grows in nearshore areas (seagrass, kelp,

More information

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) INTRODUCTION A dimension is an attribute within a multidimensional model consisting of a list of values (called members). A fact is defined by a combination

More information

TOP DEVELOPERS MINDSET. All About the 5 Things You Don t Know.

TOP DEVELOPERS MINDSET. All About the 5 Things You Don t Know. MINDSET TOP DEVELOPERS All About the 5 Things You Don t Know 1 INTRODUCTION Coding and programming are becoming more and more popular as technology advances and computer-based devices become more widespread.

More information

Evolution of Database Systems

Evolution of Database Systems Evolution of Database Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Intelligent Decision Support Systems Master studies, second

More information

Kanban One-Day Workshop

Kanban One-Day Workshop Kanban One-Day Workshop Copyright Net Objectives, Inc. All Rights Reserved 2 Copyright Net Objectives, Inc. All Rights Reserved 3 Lean for Executives Product Portfolio Management Business Product Owner

More information

Creating our Public Works Story

Creating our Public Works Story Creating our Public Works Story The Evolution of an Interactive Map Application for Rancho Cordova Public Works April 26 th, 2017 Brad Findlay, GIS Analyst Interwest Consulting Group Overview Introduction

More information

The next generation of Google APIs

The next generation of Google APIs The next generation of Google APIs Ade Oshineye www.oshineye.com/+ Let s talk about the future This is not a vendor pitch This. Is. Not. A. Vendor. Pitch. I work on the Google+ Project www.oshineye.com/+

More information

SQL Server 2005 Analysis Services

SQL Server 2005 Analysis Services atabase and ata Mining Group of atabase and ata Mining Group of atabase and ata Mining Group of atabase and ata Mining Group of atabase and ata Mining Group of atabase and ata Mining Group of SQL Server

More information