HANDLING PUBLICLY GENERATED AIR QUALITY DATA PETE TENEBRUSO & MIKE MATSKO MARCH 8 TH, 2017

Size: px
Start display at page:

Download "HANDLING PUBLICLY GENERATED AIR QUALITY DATA PETE TENEBRUSO & MIKE MATSKO MARCH 8 TH, 2017"

Transcription

1 HANDLING PUBLICLY GENERATED AIR QUALITY DATA PETE TENEBRUSO & MIKE MATSKO MARCH 8 TH, 2017

2 EXAMPLES OF DEP DATA AND CROWDSOURCING Storm Readiness Beach Assessments Park Closings Emergency Management Social Media Watershed Ambassadors ARMS

3 AIR INFORMATION MANAGEMENT SYSTEM (ARMS) There are 286 monitors at these 40 air quality stations tracking 35 distinct parameters (continuous and non-continuous) NOX, NO, SO2, PM2.5, NO2, CO, O3, wind speed, wind direction, temp are monitored There approximately 150,000,000 continuous air quality minute data points collected every year.

4 ARMS SITES & MONITORS

5 Back Up Wireless Polling System NJDEP Air and Radiation Monitoring System Regular Air Wireless Sites Verizon Air Card Envista Comm Center and Database Digi Wireless Router Verizon Wireless Network OIT ExtraNet GSN WWW Hosted by Envitech Updated via FTP from ARMS Comm Center Air FRM Wireless Sites Digi Wireless Router Public Access Layer Secure Access Layer Air TEOM Wireless Sites CREST Wireless Sites Digi Wireless Router Digi Wireless Router Phone Line Modem for Backup Verizon Wireless Access Points Core layer Standby Database Standby Comm. Center & FTP MOXA NPort Modem Bank for Backup Primary Database Clustering Master Comm. Center & FTP Clustering MOXA NPort Modem Bank for Backup CREST Leased Line Sites Leased Line Modem Leased Line Modem FRM/TEOM Modem Oracle Observer (Manage Fast Start Failover) Slave Comm. Center & FTP Air FRM Phone Line Sites Air TEOM Phone Line Sites Phone Line Modem Phone Line Modem Leased Lines & Phone lines Network Green Devices represent future projects Standby NJ Air/Rad Monitoring 401 East State Street DEP TLS Primary NJ Air/Rad Monitoring Troop C Prepared by Harry Chen, 12/1/2010

6 CROWDSOURCING a specific sourcing model in which individuals or organizations use contributions from Internet users to obtain needed services or ideas Amazon Mechanical Turk Kickstarter Wikipedia

7 BACKGROUND Massive data deluge in recent years 80% of the worlds data is unstructured (images, videos, raw text, etc.) Algorithms to fully comprehend unstructured data have not been developed yet Many experts believe we are at least several decades away from this goal

8 CONSIDERATIONS OF USING INFO FROM 'CROWDS' Can disseminate both valid and invalid information Crowds often have no immediate way to discern truth from falsehood Crowds are prone to add opinion to data; which sometimes sticks more than the credible data themselves. Separating opinion and credible data through expert interpretation and curation, both centralized and decentralized, is important Very few organizational or procedural channels specifying how to aggregate and incorporate information in decision making Better information is needed not necessarily more monitors.

9 INTEGRATING EXPERTS, CROWDS, & ALGORITHMS.

10

11 CROWD SOURCING CONCERNS How to solicit users What they can contribute How to combine their contributions How to manage quality, open versus close worlds, query semantics, query execution, optimization, and user interfaces

12 BENEFITS OF MACHINE LEARNING Feature extraction i.e. interpreting text to infer time, location, people, etc. referred to in it; Classification - classify, group or tag information based on some explicit or unknown criteria; Clustering - Machines can process vast amounts of data and present correlations and proximities that escape the human eye and brain, sometimes discovering non-obvious correlations between variables With large amounts of data available, it is not even necessary to have a deep understanding of the relationships within the data themselves: machines can on their own distil the noise from the relevant correlations through successive optimization.

13 MACHINE LEARNING SHORTCOMINGS Algorithms are more specific than sensitive, meaning that important signals may be missed (false negatives) A combination of algorithms is important to draw different types of events and event features from undifferentiated data understanding which algorithms, through experience, is essential Algorithms need to be thoroughly validated and tested and reassessed Algorithms need data to train and feedback to learn. Out of the box value is difficult Human factor lazy over time experience with accepted algorithms, where over-dependency and improper cross-checks of an algorithm's results may result in missed or misinterpreted signals; Low social acceptance of systems that do not function in a way that is predictable or describable Past misuse of machine learning has led users to fear and distrust algorithm w/o some human interaction. Should an algorithm declare a health emergency or should it help present data to an expert or authority with 'suggestions' and 'red flags', and then the authority can declare a health emergency

14 STANDARDIZATION NEEDED FOR INTEROPERABILITY Interoperability challenges with data formats, service interfaces, semantics and measurement uniformity Broad usage of open sensor standards is needed The Sensor Web Enablement Initiative (SWE) by the OGC (Open Geospatial Consortium) seeks to provide open standards and protocols for enhanced operability within and between multiple platforms and vendors. They aim to make sensors discoverable, query-able, and controllable over the Internet. Currently, the SWE family consists of seven standards: Sensor Model Language (SensorML) XML Schemas to defining geometric, dynamic and observational properties of a sensor. Accommodates sensor discovery, processing and analysis of the retrieved data, as well as the geo-location of observed values. Observations & Measurements (O&M) Transducer Model Language (TML) Generally speaking, TML can be understood as O&M's pendant or streaming data by providing a method and message format describing how to interpret raw transducer data. Sensor Observation Service (SOS) This component provides a service to retrieve measurement results from a sensor or a sensor network.

15 STANDARDIZATION CONTINUED Sensor Planning Service (SPS) This component provides a standardized interface for collection assets and aims at automating complex information flows in large networks.. Sensor Alert Service (SAS) Interfaces enabling sensors to advertise and publish alerts, including according metadata. Web Notification Service (WNS) Enables 1 & 2 way message exchanges, with other services. This process is especially expedient when several services are required to comply with a client's request, or when an according response is only possible under considerable delays.

16 SENSOR OBSERVATION SERVICE (SOS)

17 NEED A GOOD PLAN What are you trying to do - what s the value of this data What s the approach? Selecting location and placement Collecting Quality control Sensor maintenance Data review Data validation Issues (interference and drift) Analyze, interpret, communicate results QA QC

18 SENSOR CONSIDERATIONS Low cost Varying reliability, quality, and accuracy Questionable maintenance and calibration Pollutants measured (ozone, PM, volatiles) Location and Placement - Fixed/mobile, in/outside, below/above ground IOT Security of devices

19 DATA MANAGEMENT CONSIDERATIONS Several Existing repositories - would not want to replicate DEP had experience in managing large sets of data but not at this potential scale Large cost of managing data Infrastructure/Tools/etc. Leverage existing Real time and historical APIs Separation of local, state, and nationwide data Integration and analysis with existing state data

Semantically enhancing SensorML with controlled vocabularies in the marine domain

Semantically enhancing SensorML with controlled vocabularies in the marine domain Semantically enhancing SensorML with controlled vocabularies in the marine domain KOKKINAKI ALEXANDRA, BUCK JUSTIN, DARROCH LOUISE, JIRKA SIMON AND THE MARINE PROFILES FOR OGC SENSOR WEB ENABLEMENT STANDARDS

More information

Observation trends: Expectations from European Comission regarding data exchange and interoperability

Observation trends: Expectations from European Comission regarding data exchange and interoperability Observation trends: Expectations from European Comission regarding data exchange and interoperability Marcin Wichorowski (IO PAN), Sławomir Sagan (IO PAN), Declan Dunne (UCC MaREI), John Barton (UCC-TYNDALL)

More information

Land Administration and Management: Big Data, Fast Data, Semantics, Graph Databases, Security, Collaboration, Open Source, Shareable Information

Land Administration and Management: Big Data, Fast Data, Semantics, Graph Databases, Security, Collaboration, Open Source, Shareable Information Land Administration and Management: Big Data, Fast Data, Semantics, Graph Databases, Security, Collaboration, Open Source, Shareable Information Platform Steven Hagan, Vice President, Engineering 1 Copyright

More information

Sensor Data Management

Sensor Data Management Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 8-14-2007 Sensor Data Management Cory Andrew Henson Wright State University

More information

Context-aware Services for UMTS-Networks*

Context-aware Services for UMTS-Networks* Context-aware Services for UMTS-Networks* * This project is partly financed by the government of Bavaria. Thomas Buchholz LMU München 1 Outline I. Properties of current context-aware architectures II.

More information

EXPERT SERVICES FOR IoT CYBERSECURITY AND RISK MANAGEMENT. An Insight Cyber White Paper. Copyright Insight Cyber All rights reserved.

EXPERT SERVICES FOR IoT CYBERSECURITY AND RISK MANAGEMENT. An Insight Cyber White Paper. Copyright Insight Cyber All rights reserved. EXPERT SERVICES FOR IoT CYBERSECURITY AND RISK MANAGEMENT An Insight Cyber White Paper Copyright Insight Cyber 2018. All rights reserved. The Need for Expert Monitoring Digitization and external connectivity

More information

SmartData Fabric distributed virtual data, graph data and master data management, analytics and security. Solutions and Key Features Revision 2.

SmartData Fabric distributed virtual data, graph data and master data management, analytics and security. Solutions and Key Features Revision 2. s and Key Features Revision 2.5 Page 1 of 7 www.whamtech.com (972) 991-5700 info@whamtech.com March 2018 ID SOL1 Automated Data Discovery and Classification (ADDC) Key Feature ID KF01 KF02 KF03 Key Feature

More information

The Storage Networking Industry Association (SNIA) Data Preservation and Metadata Projects. Bob Rogers, Application Matrix

The Storage Networking Industry Association (SNIA) Data Preservation and Metadata Projects. Bob Rogers, Application Matrix The Storage Networking Industry Association (SNIA) Data Preservation and Metadata Projects Bob Rogers, Application Matrix Overview The Self Contained Information Retention Format Rationale & Objectives

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

Sensor Web when sensor networks meet the World-Wide Web

Sensor Web when sensor networks meet the World-Wide Web Sensor Web when sensor networks meet the World-Wide Web Dr. Steve Liang Assistant Professor Department of Geomatics Engineering Schulich School of Engineering University of Calgary steve.liang@ucalgary.ca

More information

USERS CONFERENCE Copyright 2016 OSIsoft, LLC

USERS CONFERENCE Copyright 2016 OSIsoft, LLC Bridge IT and OT with a process data warehouse Presented by Matt Ziegler, OSIsoft Complexity Problem Complexity Drives the Need for Integrators Disparate assets or interacting one-by-one Monitoring Real-time

More information

Dell Boomi Cloud MDM Overview

Dell Boomi Cloud MDM Overview Dell Boomi Cloud MDM Overview Dell Boomi s Multi-Purpose PaaS Boomi as the Multi-Purpose PaaS for enterprise data management Move: AtomSphere Integration Manage: Master Data Management (MDM) Govern: API

More information

BSC Smart Cities Initiative

BSC Smart Cities Initiative www.bsc.es BSC Smart Cities Initiative José Mª Cela CASE Director josem.cela@bsc.es CITY DATA ACCESS 2 City Data Access 1. Standardize data access (City Semantics) Define a software layer to keep independent

More information

Writing a Data Management Plan A guide for the perplexed

Writing a Data Management Plan A guide for the perplexed March 29, 2012 Writing a Data Management Plan A guide for the perplexed Agenda Rationale and Motivations for Data Management Plans Data and data structures Metadata and provenance Provisions for privacy,

More information

THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I)

THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I) ENVIROfying the Future Internet THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I) INSPIRE Conference Firenze,

More information

M-2-M-2-People. How Mobility Enables Visibility. Daniel Munyan Director, M2M Center of Excellence Computer Sciences Corporation 6/1/2012 1

M-2-M-2-People. How Mobility Enables Visibility. Daniel Munyan Director, M2M Center of Excellence Computer Sciences Corporation 6/1/2012 1 M-2-M-2-People How Mobility Enables Visibility Daniel Munyan Director, M2M Center of Excellence 6/1/2012 1 The Timeless Challenge of Logistics Achieving Enterprise Visibility / Situation Awareness Tracking

More information

The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse

The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse Stephen Hunt OSD CAPE Joint Data Support (SAIC) Stephen.Hunt.ctr@osd.mil The DoD Office of Security Review has cleared this report

More information

Fluentd + MongoDB + Spark = Awesome Sauce

Fluentd + MongoDB + Spark = Awesome Sauce Fluentd + MongoDB + Spark = Awesome Sauce Nishant Sahay, Sr. Architect, Wipro Limited Bhavani Ananth, Tech Manager, Wipro Limited Your company logo here Wipro Open Source Practice: Vision & Mission Vision

More information

Reducing Consumer Uncertainty

Reducing Consumer Uncertainty Spatial Analytics Reducing Consumer Uncertainty Towards an Ontology for Geospatial User-centric Metadata Introduction Cooperative Research Centre for Spatial Information (CRCSI) in Australia Communicate

More information

ASSET AND OPERATIONS MANAGEMENT INTEGRATED SOLUTIONS FOR EFFECTIVE LOW COST MONITORING. Colin Davies. Carbon Based Environmental Pty Ltd

ASSET AND OPERATIONS MANAGEMENT INTEGRATED SOLUTIONS FOR EFFECTIVE LOW COST MONITORING. Colin Davies. Carbon Based Environmental Pty Ltd ASSET AND OPERATIONS MANAGEMENT INTEGRATED SOLUTIONS FOR EFFECTIVE LOW COST MONITORING Paper Presented by: Colin Davies Author: Colin Davies, Managing Director, Carbon Based Environmental Pty Ltd 6 th

More information

Semantic Web Mining and its application in Human Resource Management

Semantic Web Mining and its application in Human Resource Management International Journal of Computer Science & Management Studies, Vol. 11, Issue 02, August 2011 60 Semantic Web Mining and its application in Human Resource Management Ridhika Malik 1, Kunjana Vasudev 2

More information

Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies. Student: Alexandra Moraru Mentor: Prof. Dr.

Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies. Student: Alexandra Moraru Mentor: Prof. Dr. Enrichment of Sensor Descriptions and Measurements Using Semantic Technologies Student: Alexandra Moraru Mentor: Prof. Dr. Dunja Mladenić Environmental Monitoring automation Traffic Monitoring integration

More information

Convergence and Collaboration: Transforming Business Process and Workflows

Convergence and Collaboration: Transforming Business Process and Workflows Convergence and Collaboration: Transforming Business Process and Workflows Steven Hagan, Vice President, Server Technologies 1 Copyright 2011, Oracle and/or its affiliates. All rights Convergence & Collaboration:

More information

DATA COLLECTION. Slides by WESLEY WILLETT 13 FEB 2014

DATA COLLECTION. Slides by WESLEY WILLETT 13 FEB 2014 DATA COLLECTION Slides by WESLEY WILLETT INFO VISUAL 340 ANALYTICS D 13 FEB 2014 WHERE DOES DATA COME FROM? We tend to think of data as a thing in a database somewhere WHY DO YOU NEED DATA? (HINT: Usually,

More information

On User-centric QoE Prediction for VoIP & Video Streaming based on Machine-Learning

On User-centric QoE Prediction for VoIP & Video Streaming based on Machine-Learning UNIVERSITY OF CRETE On User-centric QoE Prediction for VoIP & Video Streaming based on Machine-Learning Michalis Katsarakis, Maria Plakia, Paul Charonyktakis & Maria Papadopouli University of Crete Foundation

More information

More than a Lifetime of

More than a Lifetime of More than a Lifetime of Data and Information Unifying Live and Archival Storage Larry Stabile Iron Mountain Digital Time Capsules 1000 years Amarillo, Texas, 1968 5000 years NY World s Fair, 1939 Pyramids

More information

Data Model Considerations for Radar Systems

Data Model Considerations for Radar Systems WHITEPAPER Data Model Considerations for Radar Systems Executive Summary The market demands that today s radar systems be designed to keep up with a rapidly changing threat environment, adapt to new technologies,

More information

Monitoring the Environment with Sensor Web Services

Monitoring the Environment with Sensor Web Services EnviroInfo 2009 (Berlin) Environmental Informatics and Industrial Environmental Protection: Concepts, Methods and Tools Monitoring the Environment with Sensor Web Services Simon Jirka 1, Dr. Albert Remke

More information

Novel System Architectures for Semantic Based Sensor Networks Integraion

Novel System Architectures for Semantic Based Sensor Networks Integraion Novel System Architectures for Semantic Based Sensor Networks Integraion Z O R A N B A B O V I C, Z B A B O V I C @ E T F. R S V E L J K O M I L U T N O V I C, V M @ E T F. R S T H E S C H O O L O F T

More information

Elysium Technologies Private Limited::IEEE Final year Project

Elysium Technologies Private Limited::IEEE Final year Project Elysium Technologies Private Limited::IEEE Final year Project - o n t e n t s Data mining Transactions Rule Representation, Interchange, and Reasoning in Distributed, Heterogeneous Environments Defeasible

More information

EXTRA Examples of OGC standards in support of health applications

EXTRA Examples of OGC standards in support of health applications EXTRA Examples of OGC standards in support of health applications Some prior / existing initiatives using OGC Standards EU INSPIRE (health and safety working group) GEOSS AIP, EO2Heaven project: EO2HEAVEN

More information

Informatica Enterprise Information Catalog

Informatica Enterprise Information Catalog Data Sheet Informatica Enterprise Information Catalog Benefits Automatically catalog and classify all types of data across the enterprise using an AI-powered catalog Identify domains and entities with

More information

<Insert Picture Here> Click to edit Master title style

<Insert Picture Here> Click to edit Master title style Click to edit Master title style Introducing the Oracle Service What Is Oracle Service? Provides visibility into services, service providers and related resources across the enterprise

More information

The Semantic Sensor Network Ontology A Generic Language to Describe Sensor Assets

The Semantic Sensor Network Ontology A Generic Language to Describe Sensor Assets Ben Ridge Road Weather Station, South Esk River Catchment, Tasmania The Semantic Sensor Network Ontology A Generic Language to Describe Sensor Assets Holger Neuhaus Michael Compton Commonwealth Scientific

More information

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation INSPIRE 2010, KRAKOW Dr. Arif Shaon, Dr. Andrew Woolf (e-science, Science and Technology Facilities Council, UK) 3

More information

Using Linked Data and taxonomies to create a quick-start smart thesaurus

Using Linked Data and taxonomies to create a quick-start smart thesaurus 7) MARJORIE HLAVA Using Linked Data and taxonomies to create a quick-start smart thesaurus 1. About the Case Organization The two current applications of this approach are a large scientific publisher

More information

Latent Space Model for Road Networks to Predict Time-Varying Traffic. Presented by: Rob Fitzgerald Spring 2017

Latent Space Model for Road Networks to Predict Time-Varying Traffic. Presented by: Rob Fitzgerald Spring 2017 Latent Space Model for Road Networks to Predict Time-Varying Traffic Presented by: Rob Fitzgerald Spring 2017 Definition of Latent https://en.oxforddictionaries.com/definition/latent Latent Space Model?

More information

Technical Brief: Domain Risk Score Proactively uncover threats using DNS and data science

Technical Brief: Domain Risk Score Proactively uncover threats using DNS and data science Technical Brief: Domain Risk Score Proactively uncover threats using DNS and data science 310 Million + Current Domain Names 11 Billion+ Historical Domain Profiles 5 Million+ New Domain Profiles Daily

More information

MINING OPERATIONAL DATA FOR IMPROVING GSM NETWORK PERFORMANCE

MINING OPERATIONAL DATA FOR IMPROVING GSM NETWORK PERFORMANCE MINING OPERATIONAL DATA FOR IMPROVING GSM NETWORK PERFORMANCE Antonio Leong, Simon Fong Department of Electrical and Electronic Engineering University of Macau, Macau Edison Lai Radio Planning Networks

More information

Data Sheet. Monitoring Automation for Web-Scale Networks MONITORING AUTOMATION FOR WEB-SCALE NETWORKS -

Data Sheet. Monitoring Automation for Web-Scale Networks MONITORING AUTOMATION FOR WEB-SCALE NETWORKS - Data Sheet Monitoring Automation for Web-Scale Networks CLOUD-BASED MONITORING AUTOMATION FOR WEB-SCALE NETWORKS NetSpyGlass (NSG) is cloud-based, network monitoring automation for network operators seeking

More information

Analytics and Visualization

Analytics and Visualization GU I DE NO. 4 Analytics and Visualization AWS IoT Analytics Mini-User Guide Introduction As IoT applications scale, so does the data generated from these various IoT devices. This data is raw, unstructured,

More information

Pre-Requisites: CS2510. NU Core Designations: AD

Pre-Requisites: CS2510. NU Core Designations: AD DS4100: Data Collection, Integration and Analysis Teaches how to collect data from multiple sources and integrate them into consistent data sets. Explains how to use semi-automated and automated classification

More information

Data Preprocessing. Slides by: Shree Jaswal

Data Preprocessing. Slides by: Shree Jaswal Data Preprocessing Slides by: Shree Jaswal Topics to be covered Why Preprocessing? Data Cleaning; Data Integration; Data Reduction: Attribute subset selection, Histograms, Clustering and Sampling; Data

More information

Next Steps in Data Mining. Sistemas de Apoio à Decisão Cláudia Antunes

Next Steps in Data Mining. Sistemas de Apoio à Decisão Cláudia Antunes Next Steps in Data Mining Sistemas de Apoio à Decisão Cláudia Antunes Temporal Data Mining Cláudia Antunes Data Mining Knowledge Discovery is the nontrivial extraction of implicit, previously unknown,

More information

Semantic web based Sensor Planning Services (SPS) for Sensor Web Enablement (SWE)

Semantic web based Sensor Planning Services (SPS) for Sensor Web Enablement (SWE) Semantic web based Sensor Planning Services (SPS) for Sensor Web Enablement (SWE) P.Udayakumar 1, M.Indhumathi 2 1 Teaching Fellow,Department of Computer Technology, MIT Campus, Anna University Chennai,

More information

2. An implementation-ready data model needn't necessarily contain enforceable rules to guarantee the integrity of the data.

2. An implementation-ready data model needn't necessarily contain enforceable rules to guarantee the integrity of the data. Test bank for Database Systems Design Implementation and Management 11th Edition by Carlos Coronel,Steven Morris Link full download test bank: http://testbankcollection.com/download/test-bank-for-database-systemsdesign-implementation-and-management-11th-edition-by-coronelmorris/

More information

Question Bank. 4) It is the source of information later delivered to data marts.

Question Bank. 4) It is the source of information later delivered to data marts. Question Bank Year: 2016-2017 Subject Dept: CS Semester: First Subject Name: Data Mining. Q1) What is data warehouse? ANS. A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile

More information

Call for Participation in AIP-6

Call for Participation in AIP-6 Call for Participation in AIP-6 GEOSS Architecture Implementation Pilot (AIP) Issue Date of CFP: 9 February 2013 Due Date for CFP Responses: 15 March 2013 Introduction GEOSS Architecture Implementation

More information

Labelling & Classification using emerging protocols

Labelling & Classification using emerging protocols Labelling & Classification using emerging protocols "wheels you don't have to reinvent & bandwagons you can jump on" Stephen McGibbon Lotus Development Assumptions The business rationale and benefits of

More information

The Emerging Data Lake IT Strategy

The Emerging Data Lake IT Strategy The Emerging Data Lake IT Strategy An Evolving Approach for Dealing with Big Data & Changing Environments bit.ly/datalake SPEAKERS: Thomas Kelly, Practice Director Cognizant Technology Solutions Sean Martin,

More information

Grid Computing Systems: A Survey and Taxonomy

Grid Computing Systems: A Survey and Taxonomy Grid Computing Systems: A Survey and Taxonomy Material for this lecture from: A Survey and Taxonomy of Resource Management Systems for Grid Computing Systems, K. Krauter, R. Buyya, M. Maheswaran, CS Technical

More information

Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Perceptions

Citizen Sensing: Opportunities and Challenges in Mining Social Signals and Perceptions Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 7-19-2011 Citizen Sensing: Opportunities and Challenges in Mining Social

More information

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring

More information

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

MAPR DATA GOVERNANCE WITHOUT COMPROMISE MAPR TECHNOLOGIES, INC. WHITE PAPER JANUARY 2018 MAPR DATA GOVERNANCE TABLE OF CONTENTS EXECUTIVE SUMMARY 3 BACKGROUND 4 MAPR DATA GOVERNANCE 5 CONCLUSION 7 EXECUTIVE SUMMARY The MapR DataOps Governance

More information

Meltem Özturan misprivate.boun.edu.tr/ozturan/mis515

Meltem Özturan misprivate.boun.edu.tr/ozturan/mis515 Meltem Özturan misprivate.boun.edu.tr/ozturan/mis515 1 2 1 Selecting the Best Alternative Major Activities in the Analysis Phase Gather information Define system requirements Prototype for feasibility

More information

Benefits of Automating Data Warehousing

Benefits of Automating Data Warehousing Benefits of Automating Data Warehousing Introduction Data warehousing can be defined as: A copy of data specifically structured for querying and reporting. In most cases, the data is transactional data

More information

9/27/15 MOBILE COMPUTING. CSE 40814/60814 Fall System Structure. explicit output. explicit input

9/27/15 MOBILE COMPUTING. CSE 40814/60814 Fall System Structure. explicit output. explicit input MOBILE COMPUTING CSE 40814/60814 Fall 2015 System Structure explicit input explicit output 1 Context as Implicit Input explicit input explicit output Context: state of the user state of the physical environment

More information

A data-driven framework for archiving and exploring social media data

A data-driven framework for archiving and exploring social media data A data-driven framework for archiving and exploring social media data Qunying Huang and Chen Xu Yongqi An, 20599957 Oct 18, 2016 Introduction Social media applications are widely deployed in various platforms

More information

Pedigree Management and Assessment Framework (PMAF) Demonstration

Pedigree Management and Assessment Framework (PMAF) Demonstration Pedigree Management and Assessment Framework (PMAF) Demonstration Kenneth A. McVearry ATC-NY, Cornell Business & Technology Park, 33 Thornwood Drive, Suite 500, Ithaca, NY 14850 kmcvearry@atcorp.com Abstract.

More information

Hitachi Visualization Suite

Hitachi Visualization Suite The Problem and Solution Your single pane of glass for the Smart City Over the past few years, the majority of public and private organizations have implemented a diverse set of systems to protect their

More information

Enabling Data Governance Leveraging Critical Data Elements

Enabling Data Governance Leveraging Critical Data Elements Adaptive Presentation at DAMA-NYC October 19 th, 2017 Enabling Data Governance Leveraging Critical Data Elements Jeff Goins, President, Jeff.goins@adaptive.com James Cerrato, Chief, Product Evangelist,

More information

745: Advanced Database Systems

745: Advanced Database Systems 745: Advanced Database Systems Yanlei Diao University of Massachusetts Amherst Outline Overview of course topics Course requirements Database Management Systems 1. Online Analytical Processing (OLAP) vs.

More information

International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.7, No.3, May Dr.Zakea Il-Agure and Mr.Hicham Noureddine Itani

International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.7, No.3, May Dr.Zakea Il-Agure and Mr.Hicham Noureddine Itani LINK MINING PROCESS Dr.Zakea Il-Agure and Mr.Hicham Noureddine Itani Higher Colleges of Technology, United Arab Emirates ABSTRACT Many data mining and knowledge discovery methodologies and process models

More information

Data Driving the Smart Grid

Data Driving the Smart Grid Data Driving the Smart Grid Network Model Management for Distribution Operations Eric J. Charette, P.E. Annual GITA Conference - Pacific Northwest Chapter April 24th - 25th, 2017 Future of Flight Aviation

More information

Web Services for Geospatial Mobile AR

Web Services for Geospatial Mobile AR Web Services for Geospatial Mobile AR Introduction Christine Perey PEREY Research & Consulting cperey@perey.com Many popular mobile applications already use the smartphone s built-in sensors and receivers

More information

IoT Mashups with the WoTKit

IoT Mashups with the WoTKit IoT Mashups with the WoTKit Mike Blackstock, Rodger Lea Media and Graphics Interdisciplinary Centre University of British Columbia Vancouver, Canada Motivation IoT mashups are simple, personal, situational,

More information

Enterprise Data Catalog for Microsoft Azure Tutorial

Enterprise Data Catalog for Microsoft Azure Tutorial Enterprise Data Catalog for Microsoft Azure Tutorial VERSION 10.2 JANUARY 2018 Page 1 of 45 Contents Tutorial Objectives... 4 Enterprise Data Catalog Overview... 5 Overview... 5 Objectives... 5 Enterprise

More information

Warfare and business applications

Warfare and business applications Strategic Planning, R. Knox Research Note 10 April 2003 XML Best Practices: The United States Military The U.S. Department of Defense was early to recognize the value of XML to enable interoperability,

More information

Dynamic Semantics for the Internet of Things. Payam Barnaghi Institute for Communication Systems (ICS) University of Surrey Guildford, United Kingdom

Dynamic Semantics for the Internet of Things. Payam Barnaghi Institute for Communication Systems (ICS) University of Surrey Guildford, United Kingdom Dynamic Semantics for the Internet of Things Payam Barnaghi Institute for Communication Systems (ICS) University of Surrey Guildford, United Kingdom 1 Things, Devices, Data, and lots of it image courtesy:

More information

How Insurers are Realising the Promise of Big Data

How Insurers are Realising the Promise of Big Data How Insurers are Realising the Promise of Big Data Jason Hunter, CTO Asia-Pacific, MarkLogic A Big Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies

More information

CHAPTER 2: DATA MODELS

CHAPTER 2: DATA MODELS Database Systems Design Implementation and Management 12th Edition Coronel TEST BANK Full download at: https://testbankreal.com/download/database-systems-design-implementation-andmanagement-12th-edition-coronel-test-bank/

More information

MarkLogic. A Modern Data Platform To Support Your Critical Path COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

MarkLogic. A Modern Data Platform To Support Your Critical Path COPYRIGHT 2016 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. MarkLogic A Modern Data Platform To Support Your Critical Path SLIDE: 2 Inception Pre- Post- Distribution Archive Taxonomies Semantics Technical Descriptive Customers Usage SLIDE: 4 Inception Pre- Post-

More information

Driving Interoperability with CMIS

Driving Interoperability with CMIS A guide to understanding the impact of the draft Content Management Interoperability Specification (CMIS) on content management repositories This white paper also includes developer resources for creating

More information

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS) 2 ITARC 2010 Stockholm 100420 Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS) 3 Contents Trends in information / data Critical factors... growing importance Needs

More information

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS)

ITARC Stockholm Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS) 2 ITARC 2010 Stockholm 100420 Olle Olsson World Wide Web Consortium (W3C) Swedish Institute of Computer Science (SICS) 3 Contents Trends in information / data Critical factors... growing importance Needs

More information

Challenges of Positive Train Control Interoperability

Challenges of Positive Train Control Interoperability Challenges of Positive Train Control Interoperability Clark Palmer, Chief Technology Officer Meteorcomm, LLC Address: 1201 SW 7 th Street, Renton, WA 98057 Phone: 253 236 0115 E-Mail:cpalmer@meteorcomm.com

More information

A Study of Mountain Environment Monitoring Based Sensor Web in Wireless Sensor Networks

A Study of Mountain Environment Monitoring Based Sensor Web in Wireless Sensor Networks , pp.96-100 http://dx.doi.org/10.14257/astl.2014.60.24 A Study of Mountain Environment Monitoring Based Sensor Web in Wireless Sensor Networks Yeon-Jun An 1, Do-Hyeun Kim 2 1,2 Dept. of Computing Engineering

More information

Yeseong Kim. System Energy Efficiency Lab. seelab.ucsd.edu

Yeseong Kim. System Energy Efficiency Lab. seelab.ucsd.edu Yeseong Kim System Energy Efficiency Lab seelab.ucsd.edu 1 Things, Data, Action and Software Data is collected by sensor devices. Motion, Pressure, Temperature, Light sensors Cameras, Microphones, GPS

More information

RiskSense Attack Surface Validation for IoT Systems

RiskSense Attack Surface Validation for IoT Systems RiskSense Attack Surface Validation for IoT Systems 2018 RiskSense, Inc. Surfacing Double Exposure Risks Changing Times and Assessment Focus Our view of security assessments has changed. There is diminishing

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

Smartcrawler: A Two-Stage Crawler for Efficiently Harvesting Deep-Web Interfaces

Smartcrawler: A Two-Stage Crawler for Efficiently Harvesting Deep-Web Interfaces Smartcrawler: A Two-Stage Crawler for Efficiently Harvesting Deep-Web Interfaces Nikhil S. Mane, Deepak V. Jadhav M. E Student, Department of Computer Engineering, ZCOER, Narhe, Pune, India Professor,

More information

MarkLogic Technology Briefing

MarkLogic Technology Briefing MarkLogic Technology Briefing Edd Patterson CTO/VP Systems Engineering, Americas Slide 1 Agenda Introductions About MarkLogic MarkLogic Server Deep Dive Slide 2 MarkLogic Overview Company Highlights Headquartered

More information

Survey on Community Question Answering Systems

Survey on Community Question Answering Systems World Journal of Technology, Engineering and Research, Volume 3, Issue 1 (2018) 114-119 Contents available at WJTER World Journal of Technology, Engineering and Research Journal Homepage: www.wjter.com

More information

<Insert Picture Here> Enterprise Data Management using Grid Technology

<Insert Picture Here> Enterprise Data Management using Grid Technology Enterprise Data using Grid Technology Kriangsak Tiawsirisup Sales Consulting Manager Oracle Corporation (Thailand) 3 Related Data Centre Trends. Service Oriented Architecture Flexibility

More information

INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING

INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING CS 7265 BIG DATA ANALYTICS INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING * Some contents are adapted from Dr. Hung Huang and Dr. Chengkai Li at UT Arlington Mingon Kang, PhD Computer Science,

More information

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan Metadata for Data Discovery: The NERC Data Catalogue Service Steve Donegan Introduction NERC, Science and Data Centres NERC Discovery Metadata The Data Catalogue Service NERC Data Services Case study:

More information

Data formats for exchanging classifications UNSD

Data formats for exchanging classifications UNSD ESA/STAT/AC.234/22 11 May 2011 UNITED NATIONS DEPARTMENT OF ECONOMIC AND SOCIAL AFFAIRS STATISTICS DIVISION Meeting of the Expert Group on International Economic and Social Classifications New York, 18-20

More information

Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1

Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1 Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1 Page 1 of 11 www.whamtech.com (972) 991-5700 info@whamtech.com August 2018 Page 2 of 11 www.whamtech.com (972) 991-5700 info@whamtech.com

More information

GEOSS Data Management Principles: Importance and Implementation

GEOSS Data Management Principles: Importance and Implementation GEOSS Data Management Principles: Importance and Implementation Alex de Sherbinin / Associate Director / CIESIN, Columbia University Gregory Giuliani / Lecturer / University of Geneva Joan Maso / Researcher

More information

Introduction to Data Science

Introduction to Data Science UNIT I INTRODUCTION TO DATA SCIENCE Syllabus Introduction of Data Science Basic Data Analytics using R R Graphical User Interfaces Data Import and Export Attribute and Data Types Descriptive Statistics

More information

Curriculum Guide. ThingWorx

Curriculum Guide. ThingWorx Curriculum Guide ThingWorx Live Classroom Curriculum Guide Introduction to ThingWorx 8 ThingWorx 8 User Interface Development ThingWorx 8 Platform Administration ThingWorx 7.3 Fundamentals Applying Machine

More information

The GeoPortal Cookbook Tutorial

The GeoPortal Cookbook Tutorial The GeoPortal Cookbook Tutorial Wim Hugo SAEON/ SAEOS SCOPE OF DISCUSSION Background and Additional Resources Context and Concepts The Main Components of a GeoPortal Architecture Implementation Options

More information

Oracle 10g GeoSpatial Technologies. Eve Kleiman Asia/Pacific Spatial Product Manager Oracle Corporation

Oracle 10g GeoSpatial Technologies. Eve Kleiman Asia/Pacific Spatial Product Manager Oracle Corporation Oracle 10g GeoSpatial Technologies Eve Kleiman Asia/Pacific Spatial Product Manager Oracle Corporation Eve.Kleiman@oracle.com Agenda Market and Technology Trends Oracle GeoSpatial Technology Stack What

More information

The Information Platform of the Future. MarkLogic and Smartlogic

The Information Platform of the Future. MarkLogic and Smartlogic The Information Platform of the Future MarkLogic and Smartlogic The problem - AAARRRGHHHH Discoverability? I d settle for plain findability don t even have that. My data lake is really a cesspool I need

More information

70-532: Developing Microsoft Azure Solutions

70-532: Developing Microsoft Azure Solutions 70-532: Developing Microsoft Azure Solutions Exam Design Target Audience Candidates of this exam are experienced in designing, programming, implementing, automating, and monitoring Microsoft Azure solutions.

More information

Top 20 Data Quality Solutions for Data Science

Top 20 Data Quality Solutions for Data Science Top 20 Data Quality Solutions for Data Science Data Science & Business Analytics Meetup Boulder, CO 2014-12-03 Ken Farmer DQ Problems for Data Science Loom Large & Frequently 4000000 Strikingly visible

More information

USC Viterbi School of Engineering

USC Viterbi School of Engineering Introduction to Computational Thinking and Data Science USC Viterbi School of Engineering http://www.datascience4all.org Term: Fall 2016 Time: Tues- Thur 10am- 11:50am Location: Allan Hancock Foundation

More information

Chapter 3 A New Framework for Multicast Mobility in WiFi Networks

Chapter 3 A New Framework for Multicast Mobility in WiFi Networks Chapter 3 A New Framework for Multicast Mobility in WiFi Networks 3.1 Introduction This chapter presents the designed framework that was produced during this research. The chapter describes about network

More information

The Value of Metadata

The Value of Metadata The Value of Metadata Why is metadata important to your agency / organization? Metadata has tremendous value to Individuals within your organization, as well as to individuals outside of your organization

More information

Chapter 6 VIDEO CASES

Chapter 6 VIDEO CASES Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:

More information