Big Data Integration for Data Enthusiasts. Jayant Madhavan Structured Data Research Google Inc.

Size: px
Start display at page:

Download "Big Data Integration for Data Enthusiasts. Jayant Madhavan Structured Data Research Google Inc."

Transcription

1 for Data Enthusiasts Jayant Madhavan Structured Data Research Google Inc.

2 Big Data Challenge Running computations over ginormous datasets Petabytes, Exabytes, maybe more! Only one aspect of the challenge!

3 Big Data Challenge What about the data enthusiasts? Data experts, but without technical expertise Journalists, Social Scientists, NGOs, High School Teachers and Students, etc. Motivated by advocacy, i.e., doing good through data awareness Tools that help data enthusiasts work effectively Analysis: explore, clean, analyze and summarize Storytelling: visualize and publish Integration: share, find, and combine

4 Google Fusion Tables Data Management / Integration in the cloud Sharing, Collaborating, Exploring, Visualizing and Publishing Ease of use with focus on data enthusiasts Launched in Labs June Now part of Google Apps. Many million uploaded tables Embedded maps popular with journalists SQL API used widely to access reference tables, e.g., CA DMV Search to find public tables and tables extracted from the Web Bigdata Storytelling through Interactive Maps [Madhavan+, DeBull 2012] Dated system overviews [SoCC 2010, SIGMOD 2010]

5 Upload, share, explore, visualize, publish

6 Upload, share, explore, visualize, publish

7 Upload, share, explore, visualize, publish

8 Upload, share, explore, visualize, publish

9 Upload, share, explore, visualize, publish

10 Upload, share, explore, visualize, publish

11 Thousands of points -- not a simple mashup anymore

12 Table Facts: Wikileaks Iraq War Diaries: 52,000 incidents

13 Table Facts: Texas Counties 2010 Census: 254 counties with vertices Colored based on various demographics

14 Integration by Map Layers Table Facts: Earthquakes since 1973 (USGS): 174,000 incidents of magnitude 4.5+ displayed as a heatmap Nuclear Power Stations (IAEA): 248 locations with with active nuclear energy generation

15 Table Facts: English poverty rates: 32,000 wards with a total of 1.8 million vertices Colors indicate poverty levels 2011 Rioting: 2100 incidents Colors indicate addresses of Rioting and Rioters Best UK Internet Journalist Knight-Batten Award for Innovations in Journalism

16 Interactive Visualizations on Large Datasets Interactive responses necessitate Fast server-side retrieval Fast visualization rendering Low network delays Low server response time In-memory column database Specialized spatial index Details in [Lee, Sarma, Gonzalez, Lam, Madhavan, Roy, In Prep] Low rendering time à Small server response sizes Constrained sampling to limit points according to level of detail Details [Sarma, Lee, Gonzalez, Halevy, Madhavan, SIGMOD 2012]

17 The F in Fusion Tables Support for Merged tables Virtual tables that are the join of two or more underlying data tables Full first class citizens sharing, exploration, visualization and publishing Potentially critical to data enthusiasts Data from multiple sources that complete a story Requirements: Joins that just work, i.e., approx matching, entity-based matching, etc.

18 Use Case 1: Merging Complementary Datasets Firearm Deaths by Country Population by Country

19 Use Case 2: Merging with Reference Datasets Cost by Country Geometry boundaries per County

20 Use Case 2: Merging with Reference Datasets

21 Merges and their consequences: Community Benefits Fosters an eco-system for data reuse High quality reference tables used by many, but managed by a dedicated few Extent of data reuse serves as a crowdsourced quality signal Per-table permissions lead to sophisticated sharing models

22 Merges and their consequences: Performance Benefits Efficient in-memory optimizations Multiple visualizations share in-memory indices reducing their combined footprint Splitting tables into fixed and changing subtables leads to higher update rates

23 Table Facts: Terrestrial Eco Regions: regions and 4 million vertices Colored based on choice of ground water, cleared forests, human appropriations, etc.

24 The Merge Search Problem Problem: Given a table and a keyword, find table(s) that can be merged with the table Input table + looking for: population à Tables that include population for all/most of the countries in the input table

25 Detour: Table Search Problem: Find tables on the web that match keyword search queries firearm deaths by state à Table search available at: research.google.com/tables

26

27 Table Search Problem: Find tables on the web that match keyword search queries Challenges Extraction: Not everything within a <table> is a data table Identifying data tables from navigation and formatting ones Ranking: Not as simple as restricting Google.com results Content outside table might be necessary, yet misleading

28

29 Table Search Class-property queries are data-seeking and plentiful and they can be improved over web search Detecting subject columns and their corresponding semantic classes can improve search quality of results Detecting header columns and their corresponding properties can improve results [Venetis+, VLDB 2011] [Cafarella+, VLDB 2008]

30 Back to Merge Search Matching join columns Coverage: Entity overlap Matching keyword queries Use token-based matching, synonymy, etc. Subtle, yet critical, difference from web search Recall more important than precision Traditional IR optimizations can come in the way

31

32

33 Future: can we make data research easier for data enthusiasts? Can we automatically suggest Datasets that complement theirs E.g., tables with other attributes relevant to firearms

34 Future: can we make data research easier for data enthusiasts? Can we automatically suggest Datasets that complete theirs E.g., tables with the same data, but countries not in the table Datasets that contradict / support theirs E.g., tables with the same information for earlier years Visualizations that highlight trends in their data E.g., charts that demonstrate a correlation in data

35 Structured Google Sreeram Balakrishnan Johnny Chen Alon Halevy Felix Halim Boulos Harb Karen Jacqmin-Adams Hector Gonzalez Nitin Gupta Heidi Lam Anno Langen Hongrae Lee Rod McChesney Afshin Rostamizadeh Rebecca Shapley Warren Shen Steven Whang Kenneth Wilder Fei Wu Cong Yu and others

36 Structured Google Google Fusion Tables Data management for data enthusiasts Analysis, storytelling, and integration made easy Table Search Finding data tables on the Web Making data tables useful for other search activities

Recent Advances in Structured Data and the Web

Recent Advances in Structured Data and the Web Recent Advances in Structured Data and the Web Alon Halevy Google April 10, 2013 Joint work with: Jayant Madhavan, Cong Yu, Fei Wu, Hongrae Lee, Nitin Gupta, Warren Shen Anish Das Sarma, Boulos Harb, Zack

More information

Big Data Storytelling through Interactive Maps

Big Data Storytelling through Interactive Maps Big Data Storytelling through Interactive Maps Jayant Madhavan, Sreeram Balakrishnan, Kathryn Brisbin, Hector Gonzalez, Nitin Gupta, Alon Halevy, Karen Jacqmin-Adams, Heidi Lam, Anno Langen, Hongrae Lee,

More information

Structured Data on the Web

Structured Data on the Web Structured Data on the Web Alon Halevy Google Australasian Computer Science Week January, 2010 Structured Data & The Web Andree Hudson, 4 th of July Hard to find structured data via search engines

More information

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu Presented by: Dimitri Galmanovich Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu 1 When looking for Unstructured data 2 Millions of such queries every day

More information

Scholarly Big Data: Leverage for Science

Scholarly Big Data: Leverage for Science Scholarly Big Data: Leverage for Science C. Lee Giles The Pennsylvania State University University Park, PA, USA giles@ist.psu.edu http://clgiles.ist.psu.edu Funded in part by NSF, Allen Institute for

More information

Ontology Augmentation Through Matching with Web Tables

Ontology Augmentation Through Matching with Web Tables Ontology Augmentation Through Matching with Web Tables Oliver Lehmberg 1 and Oktie Hassanzadeh 2 1 University of Mannheim, B6 26, 68159 Mannheim, Germany 2 IBM Research, Yorktown Heights, New York, U.S.A.

More information

Advanced Computer Graphics CS 525M: Crowds replace Experts: Building Better Location-based Services using Mobile Social Network Interactions

Advanced Computer Graphics CS 525M: Crowds replace Experts: Building Better Location-based Services using Mobile Social Network Interactions Advanced Computer Graphics CS 525M: Crowds replace Experts: Building Better Location-based Services using Mobile Social Network Interactions XIAOCHEN HUANG Computer Science Dept. Worcester Polytechnic

More information

FOUNDATIONS OF A CROSS-DISCIPLINARY PEDAGOGY FOR BIG DATA *

FOUNDATIONS OF A CROSS-DISCIPLINARY PEDAGOGY FOR BIG DATA * FOUNDATIONS OF A CROSS-DISCIPLINARY PEDAGOGY FOR BIG DATA * Joshua Eckroth Stetson University DeLand, Florida 386-740-2519 jeckroth@stetson.edu ABSTRACT The increasing awareness of big data is transforming

More information

Qlik Sense Desktop. Data, Discovery, Collaboration in minutes. Qlik Sense Desktop. Qlik Associative Model. Get Started for Free

Qlik Sense Desktop. Data, Discovery, Collaboration in minutes. Qlik Sense Desktop. Qlik Associative Model. Get Started for Free Qlik Sense Desktop Data, Discovery, Collaboration in minutes With Qlik Sense Desktop making business decisions becomes faster, easier, and more collaborative than ever. Qlik Sense Desktop puts rapid analytics

More information

In this exercise you will display the Geo-tagged Wikipedia Articles Fusion Table in Google Maps.

In this exercise you will display the Geo-tagged Wikipedia Articles Fusion Table in Google Maps. Introduction to the Google Maps API v3 Adding a Fusion Table to Google Maps Fusion Tables, still in the experimental stages of development, is a Google product that allows you to upload and share data

More information

Analysing crime data in Maps for Office and ArcGIS Online

Analysing crime data in Maps for Office and ArcGIS Online Analysing crime data in Maps for Office and ArcGIS Online For non-commercial use only by schools and universities Esri UK GIS for School Programme www.esriuk.com/schools Introduction ArcGIS Online is a

More information

The Definitive Guide to Preparing Your Data for Tableau

The Definitive Guide to Preparing Your Data for Tableau The Definitive Guide to Preparing Your Data for Tableau Speed Your Time to Visualization If you re like most data analysts today, creating rich visualizations of your data is a critical step in the analytic

More information

Advances in GIS help create Smarter Communities

Advances in GIS help create Smarter Communities Advances in GIS help create Smarter Communities POP(ovich) Quiz Who is a Desktop User? Who is an ArcGIS Online User? Who is a ArcGIS Server Admin? Who is a Programmer? Who works with or for a government

More information

Organizing and Managing Grassroots Enterprise Mashup Environments. Doctorial Thesis, 24 th June, Volker Hoyer

Organizing and Managing Grassroots Enterprise Mashup Environments. Doctorial Thesis, 24 th June, Volker Hoyer Organizing and Managing Grassroots Enterprise Mashup Environments Doctorial Thesis, 24 th June, 2010 Volker Hoyer Motivation and Research Questions Research Design Results Conclusion Motivation and Research

More information

Cisco Collaborative Knowledge

Cisco Collaborative Knowledge Cisco Collaborative Knowledge Product Overview. Your workforce needs knowledge, speed and flexibility to solve real-world business challenges in today s fast moving digital economy. Cisco Collaborative

More information

Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang

Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang Department of Computer Science, University of Sheffield Outline Define semantic table interpretation State-of-the-art and motivation

More information

ANNUAL REPORT Visit us at project.eu Supported by. Mission

ANNUAL REPORT Visit us at   project.eu Supported by. Mission Mission ANNUAL REPORT 2011 The Web has proved to be an unprecedented success for facilitating the publication, use and exchange of information, at planetary scale, on virtually every topic, and representing

More information

Charting the Progress of Smart City Development in Shanghai

Charting the Progress of Smart City Development in Shanghai Charting the Progress of Smart City Development in Shanghai Xueguo Wen Executive Vice President of Shanghai Academy 2017 TM Forum 1 C ONTENTS Current situation Experience and outlook Strategic cooperation

More information

Using American FactFinder

Using American FactFinder Using American FactFinder John DeWitt Lisa Neidert Project Manager Data Services Social Science Data Analysis Network Population Studies Center What is American FactFinder? http://factfinder.census.gov

More information

Visualizing semantic table annotations with TableMiner+

Visualizing semantic table annotations with TableMiner+ Visualizing semantic table annotations with TableMiner+ MAZUMDAR, Suvodeep and ZHANG, Ziqi Available from Sheffield Hallam University Research Archive (SHURA) at:

More information

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018 Cloud Computing 2 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning

More information

Understanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study

Understanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study Understanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study Oktie Hassanzadeh, Michael J. Ward, Mariano Rodriguez-Muro, and Kavitha Srinivas IBM T.J. Watson Research

More information

Teacher Step 1: How to create a Google Classroom

Teacher Step 1: How to create a Google Classroom Navigate to classroom.google.com Teacher Step 1: How to create a Google Classroom Login with your OCSD account, you will soon have a single sign on path within Classlink but for now manually type in yourusername@ocsd.okaloosa.k12.fl.us

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW PAPER ON IMPLEMENTATION OF DOCUMENT ANNOTATION USING CONTENT AND QUERYING

More information

The Emerging Web of Linked Data

The Emerging Web of Linked Data 4th Berlin Semantic Web Meetup 26. February 2010 The Emerging Web of Linked Data Prof. Dr. Christian Bizer Freie Universität Berlin Outline 1. From a Web of Documents to a Web of Data Web APIs and Linked

More information

MS-55045: Microsoft End to End Business Intelligence Boot Camp

MS-55045: Microsoft End to End Business Intelligence Boot Camp MS-55045: Microsoft End to End Business Intelligence Boot Camp Description This five-day instructor-led course is a complete high-level tour of the Microsoft Business Intelligence stack. It introduces

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

Professor Yashar Ganjali Department of Computer Science University of Toronto

Professor Yashar Ganjali Department of Computer Science University of Toronto Professor Yashar Ganjali Department of Computer Science University of Toronto yganjali@cs.toronto.edu http://www.cs.toronto.edu/~yganjali Some slides courtesy of J. Rexford (Princeton), N. Foster (Cornell)

More information

Intelligent Enterprise meets Science of Where. Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018

Intelligent Enterprise meets Science of Where. Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018 Intelligent Enterprise meets Science of Where Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018 Value The Esri & SAP journey Customer Impact Innovation Track Record Customer

More information

Public Data and Visualizations: How are Many Eyes and Tableau Public Used for Collaborative Analytics?

Public Data and Visualizations: How are Many Eyes and Tableau Public Used for Collaborative Analytics? Public Data and Visualizations: How are Many Eyes and Tableau Public Used for Collaborative Analytics? Kristi Morton 1, Magdalena Balazinska 1, Dan Grossman 1, Robert Kosara 2, and Jock Mackinlay 2 1 University

More information

OPTIMIZING MAPREDUCE FUNCTIONALITY IN BIGDATA USING CACHE MANAGER

OPTIMIZING MAPREDUCE FUNCTIONALITY IN BIGDATA USING CACHE MANAGER OPTIMIZING MAPREDUCE FUNCTIONALITY IN BIGDATA USING CACHE MANAGER Devi. L and S. Gowri Faculty of Computing, Sathyabama University, India E-Mail: devikanth.btech@gmail.com ABSTRACT The MapReduce framework

More information

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology

Making research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology Making research data repositories visible and discoverable Robert Ulrich Karlsruhe Institute of Technology Outline Background Mission Schema, Icons, Quality and Workflow Interface Growth Cooperations Experiences

More information

Creating a Recommender System. An Elasticsearch & Apache Spark approach

Creating a Recommender System. An Elasticsearch & Apache Spark approach Creating a Recommender System An Elasticsearch & Apache Spark approach My Profile SKILLS Álvaro Santos Andrés Big Data & Analytics Solution Architect in Ericsson with more than 12 years of experience focused

More information

Creating Transparency, Openness and Trust: Modern Approach to Redistricting

Creating Transparency, Openness and Trust: Modern Approach to Redistricting Creating Transparency, Openness and Trust: Modern Approach to Redistricting Richard Leadbeater, Esri Jerry Howe, Utah Legislature Larry Boden, Esri NCSL Legislative Summit Sunday, Aug. 6 11:15 am-12:15

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A REVIEW- IMPLEMENTING TRAINING AND PLACEMENT SYSTEM USING MONGODB AND REDIS ABHA

More information

US Geo-Explorer User s Guide. Web:

US Geo-Explorer User s Guide. Web: US Geo-Explorer User s Guide Web: http://usgeoexplorer.org Updated on October 26, 2016 TABLE OF CONTENTS Introduction... 3 1. System Interface... 5 2. Administrative Unit... 7 2.1 Region Selection... 7

More information

Ranking Web Pages by Associating Keywords with Locations

Ranking Web Pages by Associating Keywords with Locations Ranking Web Pages by Associating Keywords with Locations Peiquan Jin, Xiaoxiang Zhang, Qingqing Zhang, Sheng Lin, and Lihua Yue University of Science and Technology of China, 230027, Hefei, China jpq@ustc.edu.cn

More information

Microsoft Power BI for O365

Microsoft Power BI for O365 Microsoft Power BI for O365 Next hour.. o o o o o o o o Power BI for O365 Data Discovery Data Analysis Data Visualization & Power Maps Natural Language Search (Q&A) Power BI Site Data Management Self Service

More information

ArcGIS Hub: Open data best practices. Graham Hudgins, esri product engineer

ArcGIS Hub: Open data best practices. Graham Hudgins, esri product engineer ArcGIS Hub: Open data best practices Graham Hudgins, esri product engineer Agenda Overview of open data in the ArcGIS Hub Example sites - Hubs Around the World Storymap Making a good site map Preparing

More information

McAfee Security Management Center

McAfee Security Management Center Data Sheet McAfee Security Management Center Unified management for next-generation devices Key advantages: Single pane of glass across the management lifecycle for McAfee next generation devices. Scalability

More information

Tag Based Image Search by Social Re-ranking

Tag Based Image Search by Social Re-ranking Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule

More information

Building Geospatial Mashups to Visualize Information for Crisis Management. Shubham Gupta and Craig A. Knoblock University of Southern California

Building Geospatial Mashups to Visualize Information for Crisis Management. Shubham Gupta and Craig A. Knoblock University of Southern California Building Geospatial Mashups to Visualize Information for Crisis Management Shubham Gupta and Craig A. Knoblock University of Southern California 1 WHAT IS A GEOSPATIAL MASHUP? Integrated View of data combined

More information

Big Ideas Math Digital Platform. Student Orientation

Big Ideas Math Digital Platform. Student Orientation Big Ideas Math Digital Platform Student Orientation Big Ideas Math Big Ideas Math is the name of the new series we are using at Edwardsville High School for the following courses: Algebra 1 Geometry Algebra

More information

Texas Connector Training Manual. For TDCJ Users [2016]

Texas Connector Training Manual. For TDCJ Users [2016] Texas Connector Training Manual For TDCJ Users [2016] CONTENTS OneStar Foundation and Texas Connector Overview... 3 The Data... 3 How To Log In... 4 TDCJ Quick Report Generator... 4 The Map... 8 Support

More information

Open Data Integration. Renée J. Miller

Open Data Integration. Renée J. Miller Open Data Integration Renée J. Miller miller@northeastern.edu !2 Open Data Principles Timely & Comprehensive Accessible and Usable Complete - All public data is made available. Public data is data that

More information

alliance FROM DISPATCH THROUGH DISPOSITION Tyler Alliance Leads the Way with Integrated Criminal Justice and Public Safety Solutions

alliance FROM DISPATCH THROUGH DISPOSITION Tyler Alliance Leads the Way with Integrated Criminal Justice and Public Safety Solutions alliance FROM DISPATCH THROUGH DISPOSITION Tyler Alliance Leads the Way with Integrated Criminal Justice and Public Safety Solutions FIRE/EMS Fire and emergency service teams access information faster

More information

Kaspersky Security Network

Kaspersky Security Network The Kaspersky Security Network (KSN) is a complex distributed infrastructure dedicated to intelligently processing cybersecurity-related data streams from millions of voluntary participants around the

More information

Collect Relevant Demographics

Collect Relevant Demographics I N T R O D U C T I O N Collect Relevant Demographics Here we will collect demographics at the local level! At this point you should have identified and mapped a prioritized brownfield site. We will now

More information

Texas Connector Training Manual [2016]

Texas Connector Training Manual [2016] Texas Connector Training Manual [2016] CONTENTS OneStar Foundation and Texas Connector Overview... 2 The Data... 2 How To Create An Account... 3 How To Log In... 3 How To Create An Account Visual Guide

More information

Creating Connection With Hive. Version: 16.0

Creating Connection With Hive. Version: 16.0 Creating Connection With Hive Version: 16.0 Copyright 2015 Intellicus Technologies This document and its content is copyrighted material of Intellicus Technologies. The content may not be copied or derived

More information

Building Self-Service BI Solutions with Power Query. Written By: Devin

Building Self-Service BI Solutions with Power Query. Written By: Devin Building Self-Service BI Solutions with Power Query Written By: Devin Knight DKnight@PragmaticWorks.com @Knight_Devin CONTENTS PAGE 3 PAGE 4 PAGE 5 PAGE 6 PAGE 7 PAGE 8 PAGE 9 PAGE 11 PAGE 17 PAGE 20 PAGE

More information

SAS Web Report Studio 3.1

SAS Web Report Studio 3.1 SAS Web Report Studio 3.1 User s Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS Web Report Studio 3.1: User s Guide. Cary, NC: SAS

More information

Chapter 9. Attribute joins

Chapter 9. Attribute joins Chapter 9 Spatial Joins 9-1 Copyright McGraw-Hill Education. Permission required for reproduction or display. Attribute joins Recall that Attribute joins: involve combining two attribute tables together

More information

Microsoft End to End Business Intelligence Boot Camp

Microsoft End to End Business Intelligence Boot Camp Microsoft End to End Business Intelligence Boot Camp 55045; 5 Days, Instructor-led Course Description This course is a complete high-level tour of the Microsoft Business Intelligence stack. It introduces

More information

55049: PowerPivot, Power View and SharePoint 2013 Business Intelligence Center for Analysts

55049: PowerPivot, Power View and SharePoint 2013 Business Intelligence Center for Analysts Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

Big Data Security Internal Threat Detection. The Critical Role of Machine Learning.

Big Data Security Internal Threat Detection. The Critical Role of Machine Learning. Big Data Security Internal Threat Detection The Critical Role of Machine Learning Objectives 1.Discuss internal user risk management challenges in Big Data Environment 2.Discuss why machine learning is

More information

Identification and Classification of A/E/C Web Sites and Pages

Identification and Classification of A/E/C Web Sites and Pages Construction Informatics Digital Library http://itc.scix.net/ paper w78-2002-34.content Theme: Title: Author(s): Institution(s): E-mail(s): Abstract: Keywords: Identification and Classification of A/E/C

More information

Survey on Community Question Answering Systems

Survey on Community Question Answering Systems World Journal of Technology, Engineering and Research, Volume 3, Issue 1 (2018) 114-119 Contents available at WJTER World Journal of Technology, Engineering and Research Journal Homepage: www.wjter.com

More information

Chapter 7. A Quick Tour of ArcGIS Pro

Chapter 7. A Quick Tour of ArcGIS Pro Chapter 7 A Quick Tour of ArcGIS Pro Skills you will learn: This tutorial is intended to get you going using ArcGIS Pro, a new desktop application that is part of ArcGIS Desktop. A separate tutorial gives

More information

An Introduction to Big Data Formats

An Introduction to Big Data Formats Introduction to Big Data Formats 1 An Introduction to Big Data Formats Understanding Avro, Parquet, and ORC WHITE PAPER Introduction to Big Data Formats 2 TABLE OF TABLE OF CONTENTS CONTENTS INTRODUCTION

More information

SRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009

SRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009 SRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009 The Emerging Web of Linked Data Chris Bizer, Freie Universität Berlin Outline 1. From a Web of Documents to a Web of Data

More information

GOOGLE FUSION TABLES AS A GEOGRAPHIC INFORMATION SYSTEM LEARNING TOOLS

GOOGLE FUSION TABLES AS A GEOGRAPHIC INFORMATION SYSTEM LEARNING TOOLS 6-05 Google Fusion Tables As A Geographic Information System Learning Tools GOOGLE FUSION TABLES AS A GEOGRAPHIC INFORMATION SYSTEM LEARNING TOOLS Adri Gabriel Sooai Department of Computer Science, Engineering

More information

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM CHAPTER THREE INFORMATION RETRIEVAL SYSTEM 3.1 INTRODUCTION Search engine is one of the most effective and prominent method to find information online. It has become an essential part of life for almost

More information

Joint Entity Resolution

Joint Entity Resolution Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute

More information

How App Ratings and Reviews Impact Rank on Google Play and the App Store

How App Ratings and Reviews Impact Rank on Google Play and the App Store APP STORE OPTIMIZATION MASTERCLASS How App Ratings and Reviews Impact Rank on Google Play and the App Store BIG APPS GET BIG RATINGS 13,927 AVERAGE NUMBER OF RATINGS FOR TOP-RATED IOS APPS 196,833 AVERAGE

More information

Building a Europe of Knowledge. Towards the Seventh Framework Programme

Building a Europe of Knowledge. Towards the Seventh Framework Programme Building a Europe of Knowledge Towards the Seventh Framework Programme 2007-2013 FP7 /1 EUROPEAN COMMISSION - Research DG - November 2005 EU research: the story so far 1952: ECSC treaty; first projects

More information

How Managers and Executives Can Leverage SAS Enterprise Guide

How Managers and Executives Can Leverage SAS Enterprise Guide Paper 8820-2016 How Managers and Executives Can Leverage SAS Enterprise Guide ABSTRACT Steven First and Jennifer First-Kluge, Systems Seminar Consultants, Inc. SAS Enterprise Guide is an extremely valuable

More information

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Empowering People with Knowledge the Next Frontier for Web Search Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Important Trends for Web Search Organizing all information Addressing user

More information

Putting it all together: Creating a Big Data Analytic Workflow with Spotfire

Putting it all together: Creating a Big Data Analytic Workflow with Spotfire Putting it all together: Creating a Big Data Analytic Workflow with Spotfire Authors: David Katz and Mike Alperin, TIBCO Data Science Team In a previous blog, we showed how ultra-fast visualization of

More information

Discovery services: next generation of searching scholarly information

Discovery services: next generation of searching scholarly information Discovery services: next generation of searching scholarly information Article (Unspecified) Keene, Chris (2011) Discovery services: next generation of searching scholarly information. Serials, 24 (2).

More information

Supporting Fuzzy Keyword Search in Databases

Supporting Fuzzy Keyword Search in Databases I J C T A, 9(24), 2016, pp. 385-391 International Science Press Supporting Fuzzy Keyword Search in Databases Jayavarthini C.* and Priya S. ABSTRACT An efficient keyword search system computes answers as

More information

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University Geographical Information Systems Institute Center for Geographic Analysis, Harvard University LAB EXERCISE 5: Queries, Joins: Spatial and Non-spatial 1.0 Getting Census data 1. Go to the American Factfinder

More information

arxiv: v3 [cs.dl] 23 Sep 2017

arxiv: v3 [cs.dl] 23 Sep 2017 A Complete Year of User Retrieval Sessions in a Social Sciences Academic Search Engine Philipp Mayr, Ameni Kacem arxiv:1706.00816v3 [cs.dl] 23 Sep 2017 GESIS Leibniz Institute for the Social Sciences,

More information

Smallworld Core Spatial Technology 4 Spatial data is more than maps tracking the topology of complex network models

Smallworld Core Spatial Technology 4 Spatial data is more than maps tracking the topology of complex network models Smallworld Core Spatial Technology 4 Spatial data is more than maps tracking the topology of complex network models 2004 General Electric Company. All Rights Reserved GER-4230 (10/04) Abstract Spatial

More information

White Paper: Next generation disaster data infrastructure CODATA LODGD Task Group 2017

White Paper: Next generation disaster data infrastructure CODATA LODGD Task Group 2017 White Paper: Next generation disaster data infrastructure CODATA LODGD Task Group 2017 Call for Authors This call for authors seeks contributions from academics and scientists who are in the fields of

More information

Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index

Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index B. SATYA MOUNIKA 1, J. VENKATA KRISHNA 2 1 M-Tech Dept. of CSE SreeVahini Institute of Science and Technology TiruvuruAndhra

More information

Comparative Analysis of Range Aggregate Queries In Big Data Environment

Comparative Analysis of Range Aggregate Queries In Big Data Environment Comparative Analysis of Range Aggregate Queries In Big Data Environment Ranjanee S PG Scholar, Dept. of Computer Science and Engineering, Institute of Road and Transport Technology, Erode, TamilNadu, India.

More information

What is SEO? Search Engine Optimization 101

What is SEO? Search Engine Optimization 101 What is SEO? Search Engine Optimization 101 What is Search Engine Optimization (SEO)? Paid Search Listings SEO is the practice of improving and promoting a website to increase the number of Organic visitors

More information

SAS BI Dashboard 3.1. User s Guide Second Edition

SAS BI Dashboard 3.1. User s Guide Second Edition SAS BI Dashboard 3.1 User s Guide Second Edition The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2007. SAS BI Dashboard 3.1: User s Guide, Second Edition. Cary, NC:

More information

2011 NASCIO RECOGNITION AWARDS NOMINATION CATEGORY: FAST TRACK SOLUTIONS CALIFORNIA MOBILE DEVELOPMENT PROGRAM STATE OF CALIFORNIA

2011 NASCIO RECOGNITION AWARDS NOMINATION CATEGORY: FAST TRACK SOLUTIONS CALIFORNIA MOBILE DEVELOPMENT PROGRAM STATE OF CALIFORNIA 2011 NASCIO RECOGNITION AWARDS NOMINATION CATEGORY: FAST TRACK SOLUTIONS CALIFORNIA MOBILE DEVELOPMENT PROGRAM STATE OF CALIFORNIA OFFICE OF TECHNOLOGY SERVICES CALIFORNIA TECHNOLOGY AGENCY 1 B. EXECUTIVE

More information

TAKING NETWORK TESTING TO THE NEXT LEVEL

TAKING NETWORK TESTING TO THE NEXT LEVEL TAKING NETWORK TESTING TO THE NEXT LEVEL WELCOME TO THE NEXT LEVEL OF NETWORK TESTING. Do you understand the performance and customer experience of your mobile network? P3 does. Our holistic approach is

More information

Testing System Qualities

Testing System Qualities Testing System Qualities Rebecca Wirfs-Brock Joseph Yoder Copyright 2012 Rebecca Wirfs-Brock, Joseph Yoder, Wirfs-Brock Associates and The Refactory, Inc. Introducing Rebecca President, Wirfs-Brock Associates

More information

PANDA: A Platform for Academic Knowledge Discovery and Acquisition

PANDA: A Platform for Academic Knowledge Discovery and Acquisition PANDA: A Platform for Academic Knowledge Discovery and Acquisition Zhaoan Dong 1 ; Jiaheng Lu 2,1 ; Tok Wang Ling 3 1.Renmin University of China 2.University of Helsinki 3.National University of Singapore

More information

Getting Started with GeoQuery

Getting Started with GeoQuery Getting Started with GeoQuery A quick-start guide to the download and use of spatial data for international development geo.aiddata.org GeoQuery Quick Start Handbook v. 1.01, December 2017 WWW.GEOQUERY.ORG

More information

Informatica Enterprise Information Catalog

Informatica Enterprise Information Catalog Data Sheet Informatica Enterprise Information Catalog Benefits Automatically catalog and classify all types of data across the enterprise using an AI-powered catalog Identify domains and entities with

More information

These are all examples of relatively simple databases. All of the information is textual or referential.

These are all examples of relatively simple databases. All of the information is textual or referential. 1.1. Introduction Databases are pervasive in modern society. So many of our actions and attributes are logged and stored in organised information repositories, or Databases. 1.1.01. Databases Where do

More information

Finding Topic-centric Identified Experts based on Full Text Analysis

Finding Topic-centric Identified Experts based on Full Text Analysis Finding Topic-centric Identified Experts based on Full Text Analysis Hanmin Jung, Mikyoung Lee, In-Su Kang, Seung-Woo Lee, Won-Kyung Sung Information Service Research Lab., KISTI, Korea jhm@kisti.re.kr

More information

VMworld 2015 Track Names and Descriptions

VMworld 2015 Track Names and Descriptions VMworld 2015 Track Names and Descriptions Software- Defined Data Center Software- Defined Data Center General Pioneered by VMware and recognized as groundbreaking by the industry and analysts, the VMware

More information

Combining Government and Linked Open Data in Emergency Management

Combining Government and Linked Open Data in Emergency Management Combining Government and Linked Open Data in Emergency Management Axel Schulz 1,2 and Heiko Paulheim 3 1 SAP Research 2 Technische Universität Darmstadt Telecooperation Group axel.schulz@sap.com 3 Technische

More information

The Mission of the Abu Dhabi Smart Solutions and Services Authority. Leading ADSSSA. By Michael J. Keegan

The Mission of the Abu Dhabi Smart Solutions and Services Authority. Leading ADSSSA. By Michael J. Keegan Perspective on Digital Transformation in Government with Her Excellency Dr. Rauda Al Saadi, Director General, Abu Dhabi Smart Solutions and Services Authority By Michael J. Keegan Today s digital economy

More information

A SURVEY ON SCHEDULING IN HADOOP FOR BIGDATA PROCESSING

A SURVEY ON SCHEDULING IN HADOOP FOR BIGDATA PROCESSING Journal homepage: www.mjret.in ISSN:2348-6953 A SURVEY ON SCHEDULING IN HADOOP FOR BIGDATA PROCESSING Bhavsar Nikhil, Bhavsar Riddhikesh,Patil Balu,Tad Mukesh Department of Computer Engineering JSPM s

More information

Proposed System. Start. Search parameter definition. User search criteria (input) usefulness score > 0.5. Retrieve results

Proposed System. Start. Search parameter definition. User search criteria (input) usefulness score > 0.5. Retrieve results , Impact Factor- 5.343 Hybrid Approach For Efficient Diversification on Cloud Stored Large Dataset Geetanjali Mohite 1, Prof. Gauri Rao 2 1 Student, Department of Computer Engineering, B.V.D.U.C.O.E, Pune,

More information

(Big Data Integration) : :

(Big Data Integration) : : (Big Data Integration) : : 3 # $%&'! ()* +$,- 2/30 ()* + # $%&' = 3 : $ 2 : 17 ;' $ # < 2 6 ' $%&',# +'= > 0 - '? @0 A 1 3/30 3?. - B 6 @* @(C : E6 - > ()* (C :(C E6 1' +'= - ''3-6 F :* 2G '> H-! +'-?

More information

Building a Data Strategy for a Digital World

Building a Data Strategy for a Digital World Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service

More information

SAP Analytics Cloud Best Practices for BI Platform Live Universes

SAP Analytics Cloud Best Practices for BI Platform Live Universes NOTE: Delete the yellow stickers when finished. See the SAP Image Library for other available images. Once the custom image is inserted, click Format Send Backward Send to Back, so the motion band is on

More information

Towards Open Innovation with Open Data Service Platform

Towards Open Innovation with Open Data Service Platform Towards Open Innovation with Open Data Service Platform Marut Buranarach Data Science and Analytics Research Group National Electronics and Computer Technology Center (NECTEC), Thailand The 44 th Congress

More information

Extracting and Querying Probabilistic Information From Text in BayesStore-IE

Extracting and Querying Probabilistic Information From Text in BayesStore-IE Extracting and Querying Probabilistic Information From Text in BayesStore-IE Daisy Zhe Wang, Michael J. Franklin, Minos Garofalakis 2, Joseph M. Hellerstein University of California, Berkeley Technical

More information

Research and Design of Education and Teaching Resource Management System based on ASP.NET Technology

Research and Design of Education and Teaching Resource Management System based on ASP.NET Technology 2018 3rd International Conference on Education & Education Research (EDUER 2018) Research and Design of Education and Teaching Resource Management System based on ASP.NET Technology Jin Xin Science and

More information

Power BI Developer Bootcamp

Power BI Developer Bootcamp Power BI Developer Bootcamp Mastering the Power BI Development Platform Course Code Audience Format Length Course Description Student Prerequisites PBD365 Professional Developers In-person and Remote 4

More information

This guide covers 3 functions you can perform with DataPlace: o Mapping, o Creating Tables, and o Creating Rankings. Registering with DataPlace

This guide covers 3 functions you can perform with DataPlace: o Mapping, o Creating Tables, and o Creating Rankings. Registering with DataPlace Guide for Using DataPlace DataPlace is one-stop source for housing and demographic data about communities, the region, and the nation. The site assembles a variety of data sets from multiple sources, and

More information