Recent Advances in Structured Data and the Web

Size: px
Start display at page:

Download "Recent Advances in Structured Data and the Web"

Transcription

1 Recent Advances in Structured Data and the Web Alon Halevy Google April 10, 2013 Joint work with: Jayant Madhavan, Cong Yu, Fei Wu, Hongrae Lee, Nitin Gupta, Warren Shen Anish Das Sarma, Boulos Harb, Zack Ives, Afshin Rostamizadeh, Sree Balakrishnan, Anno Langen, Steven Whang, Heidi Lam, Rod McChesney, Ken Wilder, Karen Jacqmin-Adams, and others

2 Structured Data in Search Results

3

4

5 Recognizing Set-Oriented Queries

6 Data in Movies!

7

8

9 But Will it Stick?

10 City planning Sustainability: water, coffee, Crisis response Shirts Advancing public discourse (e.g., gun control) Data philanthropy corporations encouraged to contribute data to the good of society.

11 How Can We Help? We need to empower a new generation of factivists : Data enthusiasts who know nothing about DBMS They need to find good data, do meaningful data integration, and tell compelling stories We need to extract the semantics of structured data from text Natural Language + DB is poised for breakthrough.

12 Fusion Tables google.com/fusiontables Goal: an easy-to-use database system that is integrated with the Web. Key: support common workflows Easy upload (CSV, KML, spreadsheets) Sharing (even outside your company) Visualizations front and center Easy publishing Goal 2: Fusion in the data cloud -- discover others data and combine with yours.

13

14 ICDE Acceptance Rate

15 Coffee Producing Countries

16 Coffee Consumption Per Capita

17

18 Table Facts: English poverty rates: 32,000 wards with a total of 1.8 million vertices Colors indicate poverty levels 2011 Rioting: 2100 incidents Colors indicate addresses of Rioting and Rioters Best UK Internet Journalist Knight-Batten Award for Innovations in Journalism Big Data for Regular People

19

20

21

22 Crowd Sourcing

23 Public toilets in Australia

24 Other Visualizations

25 From Visualization to Story Telling

26

27

28 See SIGMOD 2012 paper for details on scaling map visualizations Table Facts: Texas Counties 2010 Census: 254 counties with vertices Colored based on various demographics Big Data Integration

29 Data Integration as Search

30

31 Join with Population Data: What is a City?

32 Data Integration by the Masses: Challenges Guess as much as possible: suggest visualizations and data sets to join with Tools for telling good stories about the data Find join candidates that make sense Identifying reliable data Distinguishing between the source data and its derivatives

33 Finding Good Data on the Web The possible sources of data What are we trying to accomplish? Main classes of challenges The deeper challenge: semantics in text Also the opportunity: Natural Language is here.

34 HTML Tables

35 HTML Lists See Elmeleegy et al., VLDB 2009

36 The Deep Web Tree Search Amish quilts Parking tickets in India Horses

37 Repeating Structures

38 Other Sources of Data Spreadsheets CSV files Tables embedded in PDF XML, RDF Visualizations Online databases (Fusion Tables, Tableau, ) Each source has its particularities, but most problems are common to all.

39 What Will We Do With the Data? research.google.com/tables (give feedback to Cong Yu, in person)

40 Give Answers from Tables

41 It Better Be Right!

42 Answer with a Visualization

43 Nuclear Facilities in Earthquake Prone Areas

44 Long Term Goal: A Data-Guided Decision Engine Support decision making: Healthcare debate Should I install solar in my house? Which charity should I contribute to? Show relevant data Expose facets of the decision and enable drilldown Show opposing views Manually curated examples of decision engines: Justfacts.com, followthemoney.com, decide.com

45 Finding The Needle in the Haystack Categories of challenges: Non-tabular data displayed in HTML tables Good data, optimized for presentation on a page Good data, optimized for presentation on a site Table quality is often subjective! The ultimate challenge: recovering the table semantics

46 Non-Tabular Data in HTML

47 Data Optimized for Page Layout

48

49 Tabular Data Optimized for Site Layout See [Ling et al, IJCAI 2013] for stitching tables within a site.

50 Vertical Tables

51 Semantics are in Text

52 Semantics Can Be Brittle

53 Text and Structured Data Structured data in textual context: Spreadsheets PDF files Attribute descriptions in databases Comments field (e.g., customer care data) Everywhere on the Web

54 The Challenge Analyze natural language text as it pertains to structured data. Different from (open) information extraction that builds databases entirely from text. Good news: natural language parsing technology is now scalable.

55 First Step: Annotating Columns [Venetis et al., VLDB 2011]

56 Next Step: Understanding Relationships Between Columns

57 A Dictionary of Attributes Countries have population, coffee production Urban population is a subset of population Synonyms: for mountains, height = elevation Time dependence: population (by year) Units Coffee production: 60Kg bags Mountain elevations: meters, feet GDP: measured per capita Patterns of mentions in text The coffee production of X increased by Y percent in Z.

58 Dependency Parsing is Ready

59 Tower of Babel: Internet Style Coffee production el salvador 2013 Knowledge Graph El Salvador exports coffee 2013 Tables Text In 2013, the coffee production of El Salvador dropped by 20% due to the coffee rust disease.

60 Conclusions This was a talk about Big Data: Millions of people creating data sets Billions of people seeing the data being impacted Get out there and find your favorite application. Dreams do come true: At least as it pertains to structured data on the Web!

61 References Fusion Tables: SIGMOD 2010, 2012 WebTables: VLDB 2008, 2009, 2011

62

Big Data Integration for Data Enthusiasts. Jayant Madhavan Structured Data Research Google Inc.

Big Data Integration for Data Enthusiasts. Jayant Madhavan Structured Data Research Google Inc. for Data Enthusiasts Jayant Madhavan Structured Data Research Google Inc. Big Data Challenge Running computations over ginormous datasets Petabytes, Exabytes, maybe more! Only one aspect of the challenge!

More information

Structured Data on the Web

Structured Data on the Web Structured Data on the Web Alon Halevy Google Australasian Computer Science Week January, 2010 Structured Data & The Web Andree Hudson, 4 th of July Hard to find structured data via search engines

More information

Ontology Augmentation Through Matching with Web Tables

Ontology Augmentation Through Matching with Web Tables Ontology Augmentation Through Matching with Web Tables Oliver Lehmberg 1 and Oktie Hassanzadeh 2 1 University of Mannheim, B6 26, 68159 Mannheim, Germany 2 IBM Research, Yorktown Heights, New York, U.S.A.

More information

Understanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study

Understanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study Understanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study Oktie Hassanzadeh, Michael J. Ward, Mariano Rodriguez-Muro, and Kavitha Srinivas IBM T.J. Watson Research

More information

Collect Relevant Demographics

Collect Relevant Demographics I N T R O D U C T I O N Collect Relevant Demographics Here we will collect demographics at the local level! At this point you should have identified and mapped a prioritized brownfield site. We will now

More information

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu Presented by: Dimitri Galmanovich Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu 1 When looking for Unstructured data 2 Millions of such queries every day

More information

Visualizing semantic table annotations with TableMiner+

Visualizing semantic table annotations with TableMiner+ Visualizing semantic table annotations with TableMiner+ MAZUMDAR, Suvodeep and ZHANG, Ziqi Available from Sheffield Hallam University Research Archive (SHURA) at:

More information

ProFoUnd: Program-analysis based Form Understanding

ProFoUnd: Program-analysis based Form Understanding ProFoUnd: Program-analysis based Form Understanding (joint work with M. Benedikt, T. Furche, A. Savvides) PIERRE SENELLART IC2 Group Seminar, 16 May 2012 The Deep Web Definition (Deep Web, Hidden Web,

More information

Challenges of providing broadband connectivity to masses in emerging India Market by. Narinder Chhibber Secretary General PTC India Foundation

Challenges of providing broadband connectivity to masses in emerging India Market by. Narinder Chhibber Secretary General PTC India Foundation Challenges of providing broadband connectivity to masses in emerging India Market by Narinder Chhibber Secretary General PTC India Foundation Agenda Main challenges for developing countries. India s present

More information

Self-Service Data Preparation for Qlik. Cookbook Series Self-Service Data Preparation for Qlik

Self-Service Data Preparation for Qlik. Cookbook Series Self-Service Data Preparation for Qlik Self-Service Data Preparation for Qlik What is Data Preparation for Qlik? The key to deriving the full potential of solutions like QlikView and Qlik Sense lies in data preparation. Data Preparation is

More information

Building Geospatial Mashups to Visualize Information for Crisis Management. Shubham Gupta and Craig A. Knoblock University of Southern California

Building Geospatial Mashups to Visualize Information for Crisis Management. Shubham Gupta and Craig A. Knoblock University of Southern California Building Geospatial Mashups to Visualize Information for Crisis Management Shubham Gupta and Craig A. Knoblock University of Southern California 1 WHAT IS A GEOSPATIAL MASHUP? Integrated View of data combined

More information

Ten Years of WebTables

Ten Years of WebTables Ten Years of WebTables Michael Cafarella Alon Halevy University of Michigan Megagon Labs michjc@umich.edu alon@megagon.ai Hongrae Lee Jayant Madhavan Cong Yu Google, Inc. {hrlee, jayant, congyu}@google.com

More information

Towards Open Innovation with Open Data Service Platform

Towards Open Innovation with Open Data Service Platform Towards Open Innovation with Open Data Service Platform Marut Buranarach Data Science and Analytics Research Group National Electronics and Computer Technology Center (NECTEC), Thailand The 44 th Congress

More information

The Data Journalist Chapter 7 tutorial Geocoding in ArcGIS Desktop

The Data Journalist Chapter 7 tutorial Geocoding in ArcGIS Desktop The Data Journalist Chapter 7 tutorial Geocoding in ArcGIS Desktop Summary: In many cases, online geocoding services are all you will need to convert addresses and other location data into geographic data.

More information

Smart Cities & The 4th Industrial Revolution

Smart Cities & The 4th Industrial Revolution Smart Cities & The 4th Industrial Revolution August 2nd, 2018 Tom Snyder ncriot.org @ncriot Capture IoT opportunities for our community locally, regionally, and nationally RIoT Ecosystem 6,000 Participants

More information

Hawaii Energy and Environmental Technologies (HEET) Initiative

Hawaii Energy and Environmental Technologies (HEET) Initiative Hawaii Energy and Environmental Technologies (HEET) Initiative Office of Naval Research Grant Award Number N0014-11-1-0391 Task 8. ENERGY-NEUTRAL ENERGY TEST PLATFORMS 8.3 Advanced Database Research, Development

More information

Chapter 7. Buffering in QGIS Desktop

Chapter 7. Buffering in QGIS Desktop Chapter 7 Buffering in QGIS Desktop Summary: How far away is that? How many are too close? These are some of the most compelling mapping questions journalists can ask. A buffer is one of the most useful

More information

Open Data Integration. Renée J. Miller

Open Data Integration. Renée J. Miller Open Data Integration Renée J. Miller miller@northeastern.edu !2 Open Data Principles Timely & Comprehensive Accessible and Usable Complete - All public data is made available. Public data is data that

More information

Creating and Running a Report

Creating and Running a Report Creating and Running a Report Reports are similar to queries in that they retrieve data from one or more tables and display the records. Unlike queries, however, reports add formatting to the output including

More information

Building a Europe of Knowledge. Towards the Seventh Framework Programme

Building a Europe of Knowledge. Towards the Seventh Framework Programme Building a Europe of Knowledge Towards the Seventh Framework Programme 2007-2013 FP7 /1 EUROPEAN COMMISSION - Research DG - November 2005 EU research: the story so far 1952: ECSC treaty; first projects

More information

Automatic Metadata Extraction for Archival Description and Access

Automatic Metadata Extraction for Archival Description and Access Automatic Metadata Extraction for Archival Description and Access WILLIAM UNDERWOOD Georgia Tech Research Institute Abstract: The objective of the research reported is this paper is to develop techniques

More information

Keywords Data alignment, Data annotation, Web database, Search Result Record

Keywords Data alignment, Data annotation, Web database, Search Result Record Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web

More information

Mapping discarded needles in Fusion Tables Tutorial

Mapping discarded needles in Fusion Tables Tutorial Mapping discarded needles in Fusion Tables Tutorial In the previous tutorials, we ve learned how to turn pivot tables into heat maps and place colour code geographic points using markers that Fusion Tables

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

Intelligent Enterprise meets Science of Where. Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018

Intelligent Enterprise meets Science of Where. Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018 Intelligent Enterprise meets Science of Where Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018 Value The Esri & SAP journey Customer Impact Innovation Track Record Customer

More information

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Empowering People with Knowledge the Next Frontier for Web Search Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Important Trends for Web Search Organizing all information Addressing user

More information

August 2012 Daejeon, South Korea

August 2012 Daejeon, South Korea Building a Web of Linked Entities (Part I: Overview) Pablo N. Mendes Free University of Berlin August 2012 Daejeon, South Korea Outline Part I A Web of Linked Entities Challenges Progress towards solutions

More information

SAVI Advanced The Basics

SAVI Advanced The Basics SAVI Advanced The Basics Help Topics The Basics Getting Started System Requirements Getting Started Using SAVI Advanced Exploring the Data in SAVI Advanced Viewing Change Over Time Identifying Target Geographies

More information

Knowledge Representation in Social Context. CS227 Spring 2011

Knowledge Representation in Social Context. CS227 Spring 2011 7. Knowledge Representation in Social Context CS227 Spring 2011 Outline Vision for Social Machines From Web to Semantic Web Two Use Cases Summary The Beginning g of Social Machines Image credit: http://www.lifehack.org

More information

Effective News Content Sharing Within Station Groups in a Daily News Cycle

Effective News Content Sharing Within Station Groups in a Daily News Cycle Effective News Content Sharing Within Station BY MIKE PALMER Sharing news content between television stations within an ownership group brings benefits at both the group and local level, increasing competitive

More information

Mapping with Google Fusion Tables

Mapping with Google Fusion Tables Mapping with Google Fusion Tables You will learn how to transform location information stored in an Excel spreadsheet to a format that can be mapped directly with Google Fusion Tables. Format of Data within

More information

CompetitiveAccess Training Guide

CompetitiveAccess Training Guide www.strenuus.com CONTENTS Introduction Introduction to Competitive Access...3 Using Competitive Access Using Competitive Access...4 Accessibility Specification...5 Census File...6 Network Selection...8

More information

What is Text Mining? Sophia Ananiadou National Centre for Text Mining University of Manchester

What is Text Mining? Sophia Ananiadou National Centre for Text Mining   University of Manchester National Centre for Text Mining www.nactem.ac.uk University of Manchester Outline Aims of text mining Text Mining steps Text Mining uses Applications 2 Aims Extract and discover knowledge hidden in text

More information

Facebook Page Insights

Facebook Page Insights Facebook Product Guide for Facebook Page owners Businesses will be better in a connected world. That s why we connect 845M people and their friends to the things they care about, using social technologies

More information

Internet of Things Towards a more collaborative model

Internet of Things Towards a more collaborative model i Internet of Things Towards a more collaborative model Brahim GHRIBI Head of Government Relations MEA NOKIA 1 Nokia 2017 Past has been about connecting people, the future is about connecting things Improving

More information

Introduction to Data Science

Introduction to Data Science UNIT I INTRODUCTION TO DATA SCIENCE Syllabus Introduction of Data Science Basic Data Analytics using R R Graphical User Interfaces Data Import and Export Attribute and Data Types Descriptive Statistics

More information

Stephanie Stuckey Chief Resilience Officer

Stephanie Stuckey Chief Resilience Officer Stephanie Stuckey Chief Resilience Officer Helping cities around the world become more resilient to the physical, social and economic challenges that are a growing part of the 21st century. 2 100RC MEMBER

More information

Synthesizing Union Tables from the Web

Synthesizing Union Tables from the Web Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Synthesizing Union Tables from the Web Xiao Ling xiaoling@cs.washington.edu University of Washington Alon Halevy

More information

Recognizing Quantity Names for Tabular Data

Recognizing Quantity Names for Tabular Data Recognizing Quantity Names for Tabular Data Yang Yi Zhiyu Chen Jeff Heflin Brian D. Davison {yay218 zhc415 heflin davison}@cse.lehigh.edu Dept. of Computer Science and Engineering, Lehigh University Bethlehem,

More information

International Journal of Advance Engineering and Research Development. Performance Enhancement of Search System

International Journal of Advance Engineering and Research Development. Performance Enhancement of Search System Scientific Journal of Impact Factor(SJIF): 3.134 International Journal of Advance Engineering and Research Development Volume 2,Issue 7, July -2015 Performance Enhancement of Search System Ms. Uma P Nalawade

More information

Automatically Constructing a Directory of Molecular Biology Databases

Automatically Constructing a Directory of Molecular Biology Databases Automatically Constructing a Directory of Molecular Biology Databases Luciano Barbosa Sumit Tandon Juliana Freire School of Computing University of Utah {lbarbosa, sumitt, juliana}@cs.utah.edu Online Databases

More information

Taxonomy Tools: Collaboration, Creation & Integration. Dow Jones & Company

Taxonomy Tools: Collaboration, Creation & Integration. Dow Jones & Company Taxonomy Tools: Collaboration, Creation & Integration Dave Clarke Global Taxonomy Director dave.clarke@dowjones.com Dow Jones & Company Introduction Software Tools for Taxonomy 1. Collaboration 2. Creation

More information

PEOPLE PEOPLE. Dynamic profiles of all your people, with info captured from anywhere. Includes followups & targeting.

PEOPLE PEOPLE. Dynamic profiles of all your people, with info captured from anywhere. Includes followups & targeting. FEATURES PEOPLE WEBSITE COMMUNICATIONS FINANCES The world's first Community Organizing System PEOPLE WEBSITE Multiple page types & user profiles. Build custom responsive designs using NationBuilder Theme

More information

Tutorial for mapping Toronto s 2013 fire hydrant parking ticket data

Tutorial for mapping Toronto s 2013 fire hydrant parking ticket data Tutorial for mapping Toronto s 2013 fire hydrant parking ticket data Toronto, like cities including Vancouver, Edmonton, Ottawa and Halifax, has an open data site with lots of great data to download. The

More information

Annotating Multiple Web Databases Using Svm

Annotating Multiple Web Databases Using Svm Annotating Multiple Web Databases Using Svm M.Yazhmozhi 1, M. Lavanya 2, Dr. N. Rajkumar 3 PG Scholar, Department of Software Engineering, Sri Ramakrishna Engineering College, Coimbatore, India 1, 3 Head

More information

(Big Data Integration) : :

(Big Data Integration) : : (Big Data Integration) : : 3 # $%&'! ()* +$,- 2/30 ()* + # $%&' = 3 : $ 2 : 17 ;' $ # < 2 6 ' $%&',# +'= > 0 - '? @0 A 1 3/30 3?. - B 6 @* @(C : E6 - > ()* (C :(C E6 1' +'= - ''3-6 F :* 2G '> H-! +'-?

More information

Provide Real-Time Data To Financial Applications

Provide Real-Time Data To Financial Applications Provide Real-Time Data To Financial Applications DATA SHEET Introduction Companies typically build numerous internal applications and complex APIs for enterprise data access. These APIs are often engineered

More information

Microsoft Power BI for O365

Microsoft Power BI for O365 Microsoft Power BI for O365 Next hour.. o o o o o o o o Power BI for O365 Data Discovery Data Analysis Data Visualization & Power Maps Natural Language Search (Q&A) Power BI Site Data Management Self Service

More information

Tutorial for downloading the city of Ottawa s monthly service request submissions data

Tutorial for downloading the city of Ottawa s monthly service request submissions data Tutorial for downloading the city of Ottawa s monthly service request submissions data Which ward has the gets the most noise complaints in the summer? Where are the city s graffiti hotspots? Which residents

More information

Overview of SAS/GIS Software

Overview of SAS/GIS Software 3 CHAPTER 1 Overview of SAS/GIS Software Introduction to Geographic Information Systems 3 Features of SAS Software 4 Data in SAS/GIS Applications 5 Spatial Data 5 Spatial Data Layers 6 Spatial Data Coverages

More information

Activity-Based Human Mobility Patterns Inferred from Mobile Phone Data: A Case Study of Singapore

Activity-Based Human Mobility Patterns Inferred from Mobile Phone Data: A Case Study of Singapore Activity-Based Human Mobility Patterns Inferred from Mobile Phone Data: A Case Study of Singapore By: Shan Jiang, Joseph Ferreira, Jr., and Marta C. Gonzalez Published in: 2017 Presented by: Masijia Qiu

More information

User Manual.

User Manual. User Manual www.districtmetrics.com District Metrics (DM) helps you to unravel the potential household consumers in India at a disaggregated level. The service brings you, in a single platform, demographics,

More information

Facebook Page Insights

Facebook Page Insights Facebook Product Guide for Facebook Page owners Businesses will be better in a connected world. That s why we connect 800M people and their friends to the things they care about, using social technologies

More information

www.powerretrieve.solutions Copyright 2015 1 What is PowerRetrieve? Turning your documents into data Imagine a world where using a few words, dates or letter combinations you could search through the content

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

April 13, Honorable John Hickenlooper Governor, State of Colorado 136 State Capitol Building 200 East Colfax Avenue Denver, CO 80203

April 13, Honorable John Hickenlooper Governor, State of Colorado 136 State Capitol Building 200 East Colfax Avenue Denver, CO 80203 April 13, 2017 Honorable John Hickenlooper Governor, State of Colorado 136 State Capitol Building 200 East Colfax Avenue Denver, CO 80203 RE: Support for House Bill 1193 Small Wireless Infrastructure Dear

More information

Energy Efficiency Harnessing a Local Resource

Energy Efficiency Harnessing a Local Resource Energy Efficiency Harnessing a Local Resource Jenny Edwards Director, Innovation Exchange Center for Energy and Environment Metro CERT Annual Resource & Networking Event October 18, 2016 Community-based

More information

An Introduction to Data Interoperability

An Introduction to Data Interoperability Esri International User Conference San Diego, California 2012 Technical Workshops July 24/25 An Introduction to Data Interoperability Bruce Harold - Esri Dale Lutz Safe Software Background Why Data Interoperability?

More information

Pfizerpedia Patents The Who, what when and why of patents. David Walsh, Andrew Berridge, RDMi, Pfizer, Sandwich

Pfizerpedia Patents The Who, what when and why of patents. David Walsh, Andrew Berridge, RDMi, Pfizer, Sandwich Pfizerpedia Patents The Who, what when and why of patents David Walsh, Andrew Berridge, RDMi, Pfizer, Sandwich We are Pfizer - We like Patents! Lipitor patent The best selling US prescription medicine

More information

Fusing Corporate Thesaurus Management with Linked Data using PoolParty

Fusing Corporate Thesaurus Management with Linked Data using PoolParty Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl PoolParty at a glance Developed by punkt. netservices Current release: PoolParty 2.8 Main focus on three application

More information

Grants Module 101. A Quick Reference Guide for Inquiry into Grants Contracts and Grants April 2016

Grants Module 101. A Quick Reference Guide for Inquiry into Grants Contracts and Grants April 2016 Grants Module 101 A Quick Reference Guide for Inquiry into Grants Contracts and Grants April 2016 Table of Contents Adding Pages to Favorites Pages 2-3 Viewing Information on the Award Profile Pages 4-8

More information

Data publication and discovery with Globus

Data publication and discovery with Globus Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,

More information

ScholarOne Manuscripts. Production Center User Guide

ScholarOne Manuscripts. Production Center User Guide ScholarOne Manuscripts Production Center User Guide 1-May-2018 Clarivate Analytics ScholarOne Manuscripts Production Center User Guide Page i TABLE OF CONTENTS INTRODUCTION... 2 USE GET HELP NOW & FAQS...

More information

Web-Scale Extraction of Structured Data

Web-Scale Extraction of Structured Data Web-Scale Extraction of Structured Data Michael J. Cafarella University of Washington mjc@cs.washington.edu Jayant Madhavan Google Inc. jayant@google.com Alon Halevy Google Inc. halevy@google.com ABSTRACT

More information

Mapping with Google Fusion Tables

Mapping with Google Fusion Tables Mapping with Google Fusion Tables You will learn how to transform location information stored in an Excel spreadsheet to a format that can be mapped directly with the Google Fusion Tables. Format of Data

More information

Extracting Algorithms by Indexing and Mining Large Data Sets

Extracting Algorithms by Indexing and Mining Large Data Sets Extracting Algorithms by Indexing and Mining Large Data Sets Vinod Jadhav 1, Dr.Rekha Rathore 2 P.G. Student, Department of Computer Engineering, RKDF SOE Indore, University of RGPV, Bhopal, India Associate

More information

Data Visualisation with Google Fusion Tables

Data Visualisation with Google Fusion Tables Data Visualisation with Google Fusion Tables Workshop Exercises Dr Luc Small 12 April 2017 1.5 Data Visualisation with Google Fusion Tables Page 1 of 33 1 Introduction Google Fusion Tables is shaping up

More information

DBpedia Data Processing and Integration Tasks in UnifiedViews

DBpedia Data Processing and Integration Tasks in UnifiedViews 1 DBpedia Data Processing and Integration Tasks in Tomas Knap Semantic Web Company Markus Freudenberg Leipzig University Kay Müller Leipzig University 2 Introduction Agenda, Team 3 Agenda Team & Goal An

More information

Demo: Linked Open Statistical Data for the Scottish Government

Demo: Linked Open Statistical Data for the Scottish Government Demo: Linked Open Statistical Data for the Scottish Government Bill Roberts 1 1 Swirrl IT Limited http://swirrl.com Abstract. This paper describes the approach taken by the Scottish Government, supported

More information

Resilience Planning in San Francisco

Resilience Planning in San Francisco Resilience Planning in San Francisco 2 Developed policies and infrastructure to mitigate and recover from disasters Expanded definition of resilience: Responding to disasters Systemic crises like economic

More information

What is a graph database?

What is a graph database? What is a graph database? A graph database is a data store that has been optimized for highly connected data. Storing connected data in a flat tabular format is time and resource intensive, usually requiring

More information

Using pivot tables in Excel (live exercise with data)

Using pivot tables in Excel (live exercise with data) Using pivot tables in Excel (live exercise with data) In chapter four, we used B.C. s political donations data to learn how to build pivot tables, which group elements in your data and summarize the information

More information

Advances in Data Management - Web Data Integration A.Poulovassilis

Advances in Data Management - Web Data Integration A.Poulovassilis Advances in Data Management - Web Data Integration A.Poulovassilis 1 1 Integrating Deep Web Data Traditionally, the web has made available vast amounts of information in unstructured form (i.e. text).

More information

Buffering in ArcMap tutorial

Buffering in ArcMap tutorial Buffering in ArcMap tutorial 1) Open ArcMap 2) Bring in the Wards_2010 and Parks shape files from the city of Ottawa s open data portal. (NOTE: for the purposes of this tutorial, you could also bring in

More information

QGIS LAB SERIES GST 102: Spatial Analysis Lab 2: Introduction to Geospatial Analysis

QGIS LAB SERIES GST 102: Spatial Analysis Lab 2: Introduction to Geospatial Analysis QGIS LAB SERIES GST 102: Spatial Analysis Lab 2: Introduction to Geospatial Analysis Objective Understand Attribute Table Joins and Data Classification Document Version: 2014-06-16 (Beta) Contents Introduction...2

More information

Using Pivot Tables in Excel (Live Exercise with Data)

Using Pivot Tables in Excel (Live Exercise with Data) Chapter 4 Using Pivot Tables in Excel (Live Exercise with Data) In chapter four, we used B.C. s political donations data to learn how to build pivot tables, which group elements in your data and summarize

More information

A Case Study about IFEZ(Song-do) Smart-City. Ryan LEE

A Case Study about IFEZ(Song-do) Smart-City. Ryan LEE A Case Study about IFEZ(Song-do) Smart-City Ryan LEE Career History Incheon U-City Corporation Incheon, Korea - Director, IFEZ Smart-City Integrated Operation Center * Managing IFEZ Smart City Integrated

More information

7 BEST PRACTICES TO BECOME A TABLEAU NINJA FOR OBIEE

7 BEST PRACTICES TO BECOME A TABLEAU NINJA FOR OBIEE 7 BEST PRACTICES TO BECOME A TABLEAU NINJA FOR OBIEE Whitepaper Abstract By connecting directly from Tableau to OBIEE, BI Connector has empowered business users with Self-Service capabilities like never

More information

Semantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications

Semantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications Semantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications WWW2004 (New York, May 22, 2004) Semantic Web Track, Developers Day Boanerges

More information

Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang

Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang Department of Computer Science, University of Sheffield Outline Define semantic table interpretation State-of-the-art and motivation

More information

Outline. Part I. Introduction Part II. ML for DI. Part III. DI for ML Part IV. Conclusions and research direction

Outline. Part I. Introduction Part II. ML for DI. Part III. DI for ML Part IV. Conclusions and research direction Outline Part I. Introduction Part II. ML for DI ML for entity linkage ML for data extraction ML for data fusion ML for schema alignment Part III. DI for ML Part IV. Conclusions and research direction Data

More information

GOOGLE ANALYTICS HELP PRESENTATION. We Welcome You to. Google Analytics Implementation Guidelines

GOOGLE ANALYTICS HELP PRESENTATION. We Welcome You to. Google Analytics Implementation Guidelines GOOGLE ANALYTICS HELP PRESENTATION We Welcome You to Google Analytics Implementation Guidelines 05/23/2008 Ashi Avalon - Google Analytics Implementation Presentation Page 1 of 28 1) What Is Google Analytics?

More information

Adding an event to club web site.

Adding an event to club web site. Adding an event to club web site. Use this procedure to add new ride events or edit existing ones. The procedure details what is needed for a ride, one of the more complex events. For other things like

More information

Teleuse on a Shoestring

Teleuse on a Shoestring Teleuse on a Shoestring LIRNEasia Media Workshop Singapore. 28 February 2007 Agenda Research methodology & key findings: Harsha de Silva The SMS story and others Lorraine Carlos Salazar The next billion

More information

Query-Time JOIN for Active Intelligence Engine (AIE)

Query-Time JOIN for Active Intelligence Engine (AIE) Query-Time JOIN for Active Intelligence Engine (AIE) Ad hoc JOINing of Structured Data and Unstructured Content: An Attivio-Patented Breakthrough in Information- Centered Business Agility An Attivio Technology

More information

Geodatabases. Dr. Zhang SPRING 2016 GISC /03/2016

Geodatabases. Dr. Zhang SPRING 2016 GISC /03/2016 Geodatabases Dr. Zhang SPRING 2016 GISC 1401 10/03/2016 Using and making maps Navigating GIS maps Map design Working with spatial data Spatial data infrastructure Interactive maps Map Animations Map layouts

More information

Thinking cities. Khalil Laaboudi. Smart & Sustainable Cities. Global Marketing

Thinking cities. Khalil Laaboudi. Smart & Sustainable Cities. Global Marketing Thinking cities Khalil Laaboudi Smart & Sustainable Cities Global Marketing Thinking Cities -- Agenda I. Ericsson in the World and in Hong Kong II. III. IV. Networked Society and other global trends New

More information

Informatica Enterprise Information Catalog

Informatica Enterprise Information Catalog Data Sheet Informatica Enterprise Information Catalog Benefits Automatically catalog and classify all types of data across the enterprise using an AI-powered catalog Identify domains and entities with

More information

Building the Future. New ICT Enables Smart Cities. Yannis Liapis Oct 25 th,

Building the Future. New ICT Enables Smart Cities. Yannis Liapis Oct 25 th, Building the Future New ICT Enables Smart Cities Yannis Liapis Oct 25 th, 2017 1 Why Do We Need Smart City? Rapid population growth, urbanization acceleration Major issues Global population growth will

More information

iknow Use Cases Michael Br ands Senior Product Manager

iknow Use Cases Michael Br ands Senior Product Manager iknow Use Cases Michael Br ands Senior Product Manager Agenda iknow in the InterSystems offering Breakthrough Characteristics Steps in Deployment New Features Use Cases The InterSystems Technology Unlock

More information

DOWNLOAD OR READ : USING MICROSOFT ACCESS TO CREATE REPORTS FROM SASIXP PART II GRADE REPORTING PDF EBOOK EPUB MOBI

DOWNLOAD OR READ : USING MICROSOFT ACCESS TO CREATE REPORTS FROM SASIXP PART II GRADE REPORTING PDF EBOOK EPUB MOBI DOWNLOAD OR READ : USING MICROSOFT ACCESS TO CREATE REPORTS FROM SASIXP PART II GRADE REPORTING PDF EBOOK EPUB MOBI Page 1 Page 2 using microsoft access to create reports from sasixp part ii grade reporting

More information

Supporting Fuzzy Keyword Search in Databases

Supporting Fuzzy Keyword Search in Databases I J C T A, 9(24), 2016, pp. 385-391 International Science Press Supporting Fuzzy Keyword Search in Databases Jayavarthini C.* and Priya S. ABSTRACT An efficient keyword search system computes answers as

More information

WEB-BASED COLLECTION MANAGEMENT FOR ARCHIVES

WEB-BASED COLLECTION MANAGEMENT FOR ARCHIVES WEB-BASED COLLECTION MANAGEMENT FOR ARCHIVES Comprehensive Collections Management Systems You Can Access Anytime, Anywhere AXIELL COLLECTIONS FOR ARCHIVES Axiell Collections is a webbased CMS designed

More information

Data Integration for the Relational Web

Data Integration for the Relational Web Data Integration for the Relational Web Report for the course hy562 Michalis Katsarakis katsarakis@csd.uoc.gr 26 June 2012 Contents Contents 1. Contribution Summary (Περίληψη Συνεισφοράς) 2. Range of Result

More information

ScholarOne Manuscripts. COGNOS Reports User Guide

ScholarOne Manuscripts. COGNOS Reports User Guide ScholarOne Manuscripts COGNOS Reports User Guide 1-May-2018 Clarivate Analytics ScholarOne Manuscripts COGNOS Reports User Guide Page i TABLE OF CONTENTS USE GET HELP NOW & FAQS... 1 SYSTEM REQUIREMENTS...

More information

Lecture Telecooperation. D. Fensel Leopold-Franzens- Universität Innsbruck

Lecture Telecooperation. D. Fensel Leopold-Franzens- Universität Innsbruck Lecture Telecooperation D. Fensel Leopold-Franzens- Universität Innsbruck First Lecture: Introduction: Semantic Web & Ontology Introduction Semantic Web and Ontology Part I Introduction into the subject

More information

Keyword search in relational databases. By SO Tsz Yan Amanda & HON Ka Lam Ethan

Keyword search in relational databases. By SO Tsz Yan Amanda & HON Ka Lam Ethan Keyword search in relational databases By SO Tsz Yan Amanda & HON Ka Lam Ethan 1 Introduction Ubiquitous relational databases Need to know SQL and database structure Hard to define an object 2 Query representation

More information

Import Portfolio Feature Starter Guide. Revision 1

Import Portfolio Feature Starter Guide. Revision 1 Import Portfolio Feature Starter Guide Revision 1 Sustainable IQ, Inc. 4-4-2016 Contents Introduction... 2 Reference Documents... 2 Staging Version of DDx URL... 3 The Basics: How to Use... 4 Feature Workflow

More information

Big Data Storytelling through Interactive Maps

Big Data Storytelling through Interactive Maps Big Data Storytelling through Interactive Maps Jayant Madhavan, Sreeram Balakrishnan, Kathryn Brisbin, Hector Gonzalez, Nitin Gupta, Alon Halevy, Karen Jacqmin-Adams, Heidi Lam, Anno Langen, Hongrae Lee,

More information

Data Governance for the Connected Enterprise

Data Governance for the Connected Enterprise Data Governance for the Connected Enterprise Irene Polikoff and Jack Spivak, TopQuadrant Inc. November 3, 2016 Copyright 2016 TopQuadrant Inc. Slide 1 Data Governance for the Connected Enterprise Today

More information