Big Data Integration for Data Enthusiasts. Jayant Madhavan Structured Data Research Google Inc.
|
|
- Lenard Melton
- 5 years ago
- Views:
Transcription
1 for Data Enthusiasts Jayant Madhavan Structured Data Research Google Inc.
2 Big Data Challenge Running computations over ginormous datasets Petabytes, Exabytes, maybe more! Only one aspect of the challenge!
3 Big Data Challenge What about the data enthusiasts? Data experts, but without technical expertise Journalists, Social Scientists, NGOs, High School Teachers and Students, etc. Motivated by advocacy, i.e., doing good through data awareness Tools that help data enthusiasts work effectively Analysis: explore, clean, analyze and summarize Storytelling: visualize and publish Integration: share, find, and combine
4 Google Fusion Tables Data Management / Integration in the cloud Sharing, Collaborating, Exploring, Visualizing and Publishing Ease of use with focus on data enthusiasts Launched in Labs June Now part of Google Apps. Many million uploaded tables Embedded maps popular with journalists SQL API used widely to access reference tables, e.g., CA DMV Search to find public tables and tables extracted from the Web Bigdata Storytelling through Interactive Maps [Madhavan+, DeBull 2012] Dated system overviews [SoCC 2010, SIGMOD 2010]
5 Upload, share, explore, visualize, publish
6 Upload, share, explore, visualize, publish
7 Upload, share, explore, visualize, publish
8 Upload, share, explore, visualize, publish
9 Upload, share, explore, visualize, publish
10 Upload, share, explore, visualize, publish
11 Thousands of points -- not a simple mashup anymore
12 Table Facts: Wikileaks Iraq War Diaries: 52,000 incidents
13 Table Facts: Texas Counties 2010 Census: 254 counties with vertices Colored based on various demographics
14 Integration by Map Layers Table Facts: Earthquakes since 1973 (USGS): 174,000 incidents of magnitude 4.5+ displayed as a heatmap Nuclear Power Stations (IAEA): 248 locations with with active nuclear energy generation
15 Table Facts: English poverty rates: 32,000 wards with a total of 1.8 million vertices Colors indicate poverty levels 2011 Rioting: 2100 incidents Colors indicate addresses of Rioting and Rioters Best UK Internet Journalist Knight-Batten Award for Innovations in Journalism
16 Interactive Visualizations on Large Datasets Interactive responses necessitate Fast server-side retrieval Fast visualization rendering Low network delays Low server response time In-memory column database Specialized spatial index Details in [Lee, Sarma, Gonzalez, Lam, Madhavan, Roy, In Prep] Low rendering time à Small server response sizes Constrained sampling to limit points according to level of detail Details [Sarma, Lee, Gonzalez, Halevy, Madhavan, SIGMOD 2012]
17 The F in Fusion Tables Support for Merged tables Virtual tables that are the join of two or more underlying data tables Full first class citizens sharing, exploration, visualization and publishing Potentially critical to data enthusiasts Data from multiple sources that complete a story Requirements: Joins that just work, i.e., approx matching, entity-based matching, etc.
18 Use Case 1: Merging Complementary Datasets Firearm Deaths by Country Population by Country
19 Use Case 2: Merging with Reference Datasets Cost by Country Geometry boundaries per County
20 Use Case 2: Merging with Reference Datasets
21 Merges and their consequences: Community Benefits Fosters an eco-system for data reuse High quality reference tables used by many, but managed by a dedicated few Extent of data reuse serves as a crowdsourced quality signal Per-table permissions lead to sophisticated sharing models
22 Merges and their consequences: Performance Benefits Efficient in-memory optimizations Multiple visualizations share in-memory indices reducing their combined footprint Splitting tables into fixed and changing subtables leads to higher update rates
23 Table Facts: Terrestrial Eco Regions: regions and 4 million vertices Colored based on choice of ground water, cleared forests, human appropriations, etc.
24 The Merge Search Problem Problem: Given a table and a keyword, find table(s) that can be merged with the table Input table + looking for: population à Tables that include population for all/most of the countries in the input table
25 Detour: Table Search Problem: Find tables on the web that match keyword search queries firearm deaths by state à Table search available at: research.google.com/tables
26
27 Table Search Problem: Find tables on the web that match keyword search queries Challenges Extraction: Not everything within a <table> is a data table Identifying data tables from navigation and formatting ones Ranking: Not as simple as restricting Google.com results Content outside table might be necessary, yet misleading
28
29 Table Search Class-property queries are data-seeking and plentiful and they can be improved over web search Detecting subject columns and their corresponding semantic classes can improve search quality of results Detecting header columns and their corresponding properties can improve results [Venetis+, VLDB 2011] [Cafarella+, VLDB 2008]
30 Back to Merge Search Matching join columns Coverage: Entity overlap Matching keyword queries Use token-based matching, synonymy, etc. Subtle, yet critical, difference from web search Recall more important than precision Traditional IR optimizations can come in the way
31
32
33 Future: can we make data research easier for data enthusiasts? Can we automatically suggest Datasets that complement theirs E.g., tables with other attributes relevant to firearms
34 Future: can we make data research easier for data enthusiasts? Can we automatically suggest Datasets that complete theirs E.g., tables with the same data, but countries not in the table Datasets that contradict / support theirs E.g., tables with the same information for earlier years Visualizations that highlight trends in their data E.g., charts that demonstrate a correlation in data
35 Structured Google Sreeram Balakrishnan Johnny Chen Alon Halevy Felix Halim Boulos Harb Karen Jacqmin-Adams Hector Gonzalez Nitin Gupta Heidi Lam Anno Langen Hongrae Lee Rod McChesney Afshin Rostamizadeh Rebecca Shapley Warren Shen Steven Whang Kenneth Wilder Fei Wu Cong Yu and others
36 Structured Google Google Fusion Tables Data management for data enthusiasts Analysis, storytelling, and integration made easy Table Search Finding data tables on the Web Making data tables useful for other search activities
Recent Advances in Structured Data and the Web
Recent Advances in Structured Data and the Web Alon Halevy Google April 10, 2013 Joint work with: Jayant Madhavan, Cong Yu, Fei Wu, Hongrae Lee, Nitin Gupta, Warren Shen Anish Das Sarma, Boulos Harb, Zack
More informationBig Data Storytelling through Interactive Maps
Big Data Storytelling through Interactive Maps Jayant Madhavan, Sreeram Balakrishnan, Kathryn Brisbin, Hector Gonzalez, Nitin Gupta, Alon Halevy, Karen Jacqmin-Adams, Heidi Lam, Anno Langen, Hongrae Lee,
More informationStructured Data on the Web
Structured Data on the Web Alon Halevy Google Australasian Computer Science Week January, 2010 Structured Data & The Web Andree Hudson, 4 th of July Hard to find structured data via search engines
More informationPresented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu
Presented by: Dimitri Galmanovich Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu 1 When looking for Unstructured data 2 Millions of such queries every day
More informationScholarly Big Data: Leverage for Science
Scholarly Big Data: Leverage for Science C. Lee Giles The Pennsylvania State University University Park, PA, USA giles@ist.psu.edu http://clgiles.ist.psu.edu Funded in part by NSF, Allen Institute for
More informationOntology Augmentation Through Matching with Web Tables
Ontology Augmentation Through Matching with Web Tables Oliver Lehmberg 1 and Oktie Hassanzadeh 2 1 University of Mannheim, B6 26, 68159 Mannheim, Germany 2 IBM Research, Yorktown Heights, New York, U.S.A.
More informationAdvanced Computer Graphics CS 525M: Crowds replace Experts: Building Better Location-based Services using Mobile Social Network Interactions
Advanced Computer Graphics CS 525M: Crowds replace Experts: Building Better Location-based Services using Mobile Social Network Interactions XIAOCHEN HUANG Computer Science Dept. Worcester Polytechnic
More informationFOUNDATIONS OF A CROSS-DISCIPLINARY PEDAGOGY FOR BIG DATA *
FOUNDATIONS OF A CROSS-DISCIPLINARY PEDAGOGY FOR BIG DATA * Joshua Eckroth Stetson University DeLand, Florida 386-740-2519 jeckroth@stetson.edu ABSTRACT The increasing awareness of big data is transforming
More informationQlik Sense Desktop. Data, Discovery, Collaboration in minutes. Qlik Sense Desktop. Qlik Associative Model. Get Started for Free
Qlik Sense Desktop Data, Discovery, Collaboration in minutes With Qlik Sense Desktop making business decisions becomes faster, easier, and more collaborative than ever. Qlik Sense Desktop puts rapid analytics
More informationIn this exercise you will display the Geo-tagged Wikipedia Articles Fusion Table in Google Maps.
Introduction to the Google Maps API v3 Adding a Fusion Table to Google Maps Fusion Tables, still in the experimental stages of development, is a Google product that allows you to upload and share data
More informationAnalysing crime data in Maps for Office and ArcGIS Online
Analysing crime data in Maps for Office and ArcGIS Online For non-commercial use only by schools and universities Esri UK GIS for School Programme www.esriuk.com/schools Introduction ArcGIS Online is a
More informationThe Definitive Guide to Preparing Your Data for Tableau
The Definitive Guide to Preparing Your Data for Tableau Speed Your Time to Visualization If you re like most data analysts today, creating rich visualizations of your data is a critical step in the analytic
More informationAdvances in GIS help create Smarter Communities
Advances in GIS help create Smarter Communities POP(ovich) Quiz Who is a Desktop User? Who is an ArcGIS Online User? Who is a ArcGIS Server Admin? Who is a Programmer? Who works with or for a government
More informationOrganizing and Managing Grassroots Enterprise Mashup Environments. Doctorial Thesis, 24 th June, Volker Hoyer
Organizing and Managing Grassroots Enterprise Mashup Environments Doctorial Thesis, 24 th June, 2010 Volker Hoyer Motivation and Research Questions Research Design Results Conclusion Motivation and Research
More informationCisco Collaborative Knowledge
Cisco Collaborative Knowledge Product Overview. Your workforce needs knowledge, speed and flexibility to solve real-world business challenges in today s fast moving digital economy. Cisco Collaborative
More informationTowards Efficient and Effective Semantic Table Interpretation Ziqi Zhang
Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang Department of Computer Science, University of Sheffield Outline Define semantic table interpretation State-of-the-art and motivation
More informationANNUAL REPORT Visit us at project.eu Supported by. Mission
Mission ANNUAL REPORT 2011 The Web has proved to be an unprecedented success for facilitating the publication, use and exchange of information, at planetary scale, on virtually every topic, and representing
More informationCharting the Progress of Smart City Development in Shanghai
Charting the Progress of Smart City Development in Shanghai Xueguo Wen Executive Vice President of Shanghai Academy 2017 TM Forum 1 C ONTENTS Current situation Experience and outlook Strategic cooperation
More informationUsing American FactFinder
Using American FactFinder John DeWitt Lisa Neidert Project Manager Data Services Social Science Data Analysis Network Population Studies Center What is American FactFinder? http://factfinder.census.gov
More informationVisualizing semantic table annotations with TableMiner+
Visualizing semantic table annotations with TableMiner+ MAZUMDAR, Suvodeep and ZHANG, Ziqi Available from Sheffield Hallam University Research Archive (SHURA) at:
More informationCloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018
Cloud Computing 2 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning
More informationUnderstanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study
Understanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study Oktie Hassanzadeh, Michael J. Ward, Mariano Rodriguez-Muro, and Kavitha Srinivas IBM T.J. Watson Research
More informationTeacher Step 1: How to create a Google Classroom
Navigate to classroom.google.com Teacher Step 1: How to create a Google Classroom Login with your OCSD account, you will soon have a single sign on path within Classlink but for now manually type in yourusername@ocsd.okaloosa.k12.fl.us
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW PAPER ON IMPLEMENTATION OF DOCUMENT ANNOTATION USING CONTENT AND QUERYING
More informationThe Emerging Web of Linked Data
4th Berlin Semantic Web Meetup 26. February 2010 The Emerging Web of Linked Data Prof. Dr. Christian Bizer Freie Universität Berlin Outline 1. From a Web of Documents to a Web of Data Web APIs and Linked
More informationMS-55045: Microsoft End to End Business Intelligence Boot Camp
MS-55045: Microsoft End to End Business Intelligence Boot Camp Description This five-day instructor-led course is a complete high-level tour of the Microsoft Business Intelligence stack. It introduces
More informationPowering Knowledge Discovery. Insights from big data with Linguamatics I2E
Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural
More informationProfessor Yashar Ganjali Department of Computer Science University of Toronto
Professor Yashar Ganjali Department of Computer Science University of Toronto yganjali@cs.toronto.edu http://www.cs.toronto.edu/~yganjali Some slides courtesy of J. Rexford (Princeton), N. Foster (Cornell)
More informationIntelligent Enterprise meets Science of Where. Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018
Intelligent Enterprise meets Science of Where Anand Raisinghani Head Platform & Data Management SAP India 10 September, 2018 Value The Esri & SAP journey Customer Impact Innovation Track Record Customer
More informationPublic Data and Visualizations: How are Many Eyes and Tableau Public Used for Collaborative Analytics?
Public Data and Visualizations: How are Many Eyes and Tableau Public Used for Collaborative Analytics? Kristi Morton 1, Magdalena Balazinska 1, Dan Grossman 1, Robert Kosara 2, and Jock Mackinlay 2 1 University
More informationOPTIMIZING MAPREDUCE FUNCTIONALITY IN BIGDATA USING CACHE MANAGER
OPTIMIZING MAPREDUCE FUNCTIONALITY IN BIGDATA USING CACHE MANAGER Devi. L and S. Gowri Faculty of Computing, Sathyabama University, India E-Mail: devikanth.btech@gmail.com ABSTRACT The MapReduce framework
More informationMaking research data repositories visible and discoverable. Robert Ulrich Karlsruhe Institute of Technology
Making research data repositories visible and discoverable Robert Ulrich Karlsruhe Institute of Technology Outline Background Mission Schema, Icons, Quality and Workflow Interface Growth Cooperations Experiences
More informationCreating a Recommender System. An Elasticsearch & Apache Spark approach
Creating a Recommender System An Elasticsearch & Apache Spark approach My Profile SKILLS Álvaro Santos Andrés Big Data & Analytics Solution Architect in Ericsson with more than 12 years of experience focused
More informationCreating Transparency, Openness and Trust: Modern Approach to Redistricting
Creating Transparency, Openness and Trust: Modern Approach to Redistricting Richard Leadbeater, Esri Jerry Howe, Utah Legislature Larry Boden, Esri NCSL Legislative Summit Sunday, Aug. 6 11:15 am-12:15
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A REVIEW- IMPLEMENTING TRAINING AND PLACEMENT SYSTEM USING MONGODB AND REDIS ABHA
More informationUS Geo-Explorer User s Guide. Web:
US Geo-Explorer User s Guide Web: http://usgeoexplorer.org Updated on October 26, 2016 TABLE OF CONTENTS Introduction... 3 1. System Interface... 5 2. Administrative Unit... 7 2.1 Region Selection... 7
More informationRanking Web Pages by Associating Keywords with Locations
Ranking Web Pages by Associating Keywords with Locations Peiquan Jin, Xiaoxiang Zhang, Qingqing Zhang, Sheng Lin, and Lihua Yue University of Science and Technology of China, 230027, Hefei, China jpq@ustc.edu.cn
More informationMicrosoft Power BI for O365
Microsoft Power BI for O365 Next hour.. o o o o o o o o Power BI for O365 Data Discovery Data Analysis Data Visualization & Power Maps Natural Language Search (Q&A) Power BI Site Data Management Self Service
More informationArcGIS Hub: Open data best practices. Graham Hudgins, esri product engineer
ArcGIS Hub: Open data best practices Graham Hudgins, esri product engineer Agenda Overview of open data in the ArcGIS Hub Example sites - Hubs Around the World Storymap Making a good site map Preparing
More informationMcAfee Security Management Center
Data Sheet McAfee Security Management Center Unified management for next-generation devices Key advantages: Single pane of glass across the management lifecycle for McAfee next generation devices. Scalability
More informationTag Based Image Search by Social Re-ranking
Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule
More informationBuilding Geospatial Mashups to Visualize Information for Crisis Management. Shubham Gupta and Craig A. Knoblock University of Southern California
Building Geospatial Mashups to Visualize Information for Crisis Management Shubham Gupta and Craig A. Knoblock University of Southern California 1 WHAT IS A GEOSPATIAL MASHUP? Integrated View of data combined
More informationBig Ideas Math Digital Platform. Student Orientation
Big Ideas Math Digital Platform Student Orientation Big Ideas Math Big Ideas Math is the name of the new series we are using at Edwardsville High School for the following courses: Algebra 1 Geometry Algebra
More informationTexas Connector Training Manual. For TDCJ Users [2016]
Texas Connector Training Manual For TDCJ Users [2016] CONTENTS OneStar Foundation and Texas Connector Overview... 3 The Data... 3 How To Log In... 4 TDCJ Quick Report Generator... 4 The Map... 8 Support
More informationOpen Data Integration. Renée J. Miller
Open Data Integration Renée J. Miller miller@northeastern.edu !2 Open Data Principles Timely & Comprehensive Accessible and Usable Complete - All public data is made available. Public data is data that
More informationalliance FROM DISPATCH THROUGH DISPOSITION Tyler Alliance Leads the Way with Integrated Criminal Justice and Public Safety Solutions
alliance FROM DISPATCH THROUGH DISPOSITION Tyler Alliance Leads the Way with Integrated Criminal Justice and Public Safety Solutions FIRE/EMS Fire and emergency service teams access information faster
More informationKaspersky Security Network
The Kaspersky Security Network (KSN) is a complex distributed infrastructure dedicated to intelligently processing cybersecurity-related data streams from millions of voluntary participants around the
More informationCollect Relevant Demographics
I N T R O D U C T I O N Collect Relevant Demographics Here we will collect demographics at the local level! At this point you should have identified and mapped a prioritized brownfield site. We will now
More informationTexas Connector Training Manual [2016]
Texas Connector Training Manual [2016] CONTENTS OneStar Foundation and Texas Connector Overview... 2 The Data... 2 How To Create An Account... 3 How To Log In... 3 How To Create An Account Visual Guide
More informationCreating Connection With Hive. Version: 16.0
Creating Connection With Hive Version: 16.0 Copyright 2015 Intellicus Technologies This document and its content is copyrighted material of Intellicus Technologies. The content may not be copied or derived
More informationBuilding Self-Service BI Solutions with Power Query. Written By: Devin
Building Self-Service BI Solutions with Power Query Written By: Devin Knight DKnight@PragmaticWorks.com @Knight_Devin CONTENTS PAGE 3 PAGE 4 PAGE 5 PAGE 6 PAGE 7 PAGE 8 PAGE 9 PAGE 11 PAGE 17 PAGE 20 PAGE
More informationSAS Web Report Studio 3.1
SAS Web Report Studio 3.1 User s Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS Web Report Studio 3.1: User s Guide. Cary, NC: SAS
More informationChapter 9. Attribute joins
Chapter 9 Spatial Joins 9-1 Copyright McGraw-Hill Education. Permission required for reproduction or display. Attribute joins Recall that Attribute joins: involve combining two attribute tables together
More informationMicrosoft End to End Business Intelligence Boot Camp
Microsoft End to End Business Intelligence Boot Camp 55045; 5 Days, Instructor-led Course Description This course is a complete high-level tour of the Microsoft Business Intelligence stack. It introduces
More information55049: PowerPivot, Power View and SharePoint 2013 Business Intelligence Center for Analysts
Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com
More informationBig Data Security Internal Threat Detection. The Critical Role of Machine Learning.
Big Data Security Internal Threat Detection The Critical Role of Machine Learning Objectives 1.Discuss internal user risk management challenges in Big Data Environment 2.Discuss why machine learning is
More informationIdentification and Classification of A/E/C Web Sites and Pages
Construction Informatics Digital Library http://itc.scix.net/ paper w78-2002-34.content Theme: Title: Author(s): Institution(s): E-mail(s): Abstract: Keywords: Identification and Classification of A/E/C
More informationSurvey on Community Question Answering Systems
World Journal of Technology, Engineering and Research, Volume 3, Issue 1 (2018) 114-119 Contents available at WJTER World Journal of Technology, Engineering and Research Journal Homepage: www.wjter.com
More informationChapter 7. A Quick Tour of ArcGIS Pro
Chapter 7 A Quick Tour of ArcGIS Pro Skills you will learn: This tutorial is intended to get you going using ArcGIS Pro, a new desktop application that is part of ArcGIS Desktop. A separate tutorial gives
More informationAn Introduction to Big Data Formats
Introduction to Big Data Formats 1 An Introduction to Big Data Formats Understanding Avro, Parquet, and ORC WHITE PAPER Introduction to Big Data Formats 2 TABLE OF TABLE OF CONTENTS CONTENTS INTRODUCTION
More informationSRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009
SRI International, Artificial Intelligence Center Menlo Park, USA, 24 July 2009 The Emerging Web of Linked Data Chris Bizer, Freie Universität Berlin Outline 1. From a Web of Documents to a Web of Data
More informationGOOGLE FUSION TABLES AS A GEOGRAPHIC INFORMATION SYSTEM LEARNING TOOLS
6-05 Google Fusion Tables As A Geographic Information System Learning Tools GOOGLE FUSION TABLES AS A GEOGRAPHIC INFORMATION SYSTEM LEARNING TOOLS Adri Gabriel Sooai Department of Computer Science, Engineering
More informationCHAPTER THREE INFORMATION RETRIEVAL SYSTEM
CHAPTER THREE INFORMATION RETRIEVAL SYSTEM 3.1 INTRODUCTION Search engine is one of the most effective and prominent method to find information online. It has become an essential part of life for almost
More informationJoint Entity Resolution
Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute
More informationHow App Ratings and Reviews Impact Rank on Google Play and the App Store
APP STORE OPTIMIZATION MASTERCLASS How App Ratings and Reviews Impact Rank on Google Play and the App Store BIG APPS GET BIG RATINGS 13,927 AVERAGE NUMBER OF RATINGS FOR TOP-RATED IOS APPS 196,833 AVERAGE
More informationBuilding a Europe of Knowledge. Towards the Seventh Framework Programme
Building a Europe of Knowledge Towards the Seventh Framework Programme 2007-2013 FP7 /1 EUROPEAN COMMISSION - Research DG - November 2005 EU research: the story so far 1952: ECSC treaty; first projects
More informationHow Managers and Executives Can Leverage SAS Enterprise Guide
Paper 8820-2016 How Managers and Executives Can Leverage SAS Enterprise Guide ABSTRACT Steven First and Jennifer First-Kluge, Systems Seminar Consultants, Inc. SAS Enterprise Guide is an extremely valuable
More informationEmpowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia
Empowering People with Knowledge the Next Frontier for Web Search Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Important Trends for Web Search Organizing all information Addressing user
More informationPutting it all together: Creating a Big Data Analytic Workflow with Spotfire
Putting it all together: Creating a Big Data Analytic Workflow with Spotfire Authors: David Katz and Mike Alperin, TIBCO Data Science Team In a previous blog, we showed how ultra-fast visualization of
More informationDiscovery services: next generation of searching scholarly information
Discovery services: next generation of searching scholarly information Article (Unspecified) Keene, Chris (2011) Discovery services: next generation of searching scholarly information. Serials, 24 (2).
More informationSupporting Fuzzy Keyword Search in Databases
I J C T A, 9(24), 2016, pp. 385-391 International Science Press Supporting Fuzzy Keyword Search in Databases Jayavarthini C.* and Priya S. ABSTRACT An efficient keyword search system computes answers as
More informationGeographical Information Systems Institute. Center for Geographic Analysis, Harvard University
Geographical Information Systems Institute Center for Geographic Analysis, Harvard University LAB EXERCISE 5: Queries, Joins: Spatial and Non-spatial 1.0 Getting Census data 1. Go to the American Factfinder
More informationarxiv: v3 [cs.dl] 23 Sep 2017
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Search Engine Philipp Mayr, Ameni Kacem arxiv:1706.00816v3 [cs.dl] 23 Sep 2017 GESIS Leibniz Institute for the Social Sciences,
More informationSmallworld Core Spatial Technology 4 Spatial data is more than maps tracking the topology of complex network models
Smallworld Core Spatial Technology 4 Spatial data is more than maps tracking the topology of complex network models 2004 General Electric Company. All Rights Reserved GER-4230 (10/04) Abstract Spatial
More informationWhite Paper: Next generation disaster data infrastructure CODATA LODGD Task Group 2017
White Paper: Next generation disaster data infrastructure CODATA LODGD Task Group 2017 Call for Authors This call for authors seeks contributions from academics and scientists who are in the fields of
More informationSearching of Nearest Neighbor Based on Keywords using Spatial Inverted Index
Searching of Nearest Neighbor Based on Keywords using Spatial Inverted Index B. SATYA MOUNIKA 1, J. VENKATA KRISHNA 2 1 M-Tech Dept. of CSE SreeVahini Institute of Science and Technology TiruvuruAndhra
More informationComparative Analysis of Range Aggregate Queries In Big Data Environment
Comparative Analysis of Range Aggregate Queries In Big Data Environment Ranjanee S PG Scholar, Dept. of Computer Science and Engineering, Institute of Road and Transport Technology, Erode, TamilNadu, India.
More informationWhat is SEO? Search Engine Optimization 101
What is SEO? Search Engine Optimization 101 What is Search Engine Optimization (SEO)? Paid Search Listings SEO is the practice of improving and promoting a website to increase the number of Organic visitors
More informationSAS BI Dashboard 3.1. User s Guide Second Edition
SAS BI Dashboard 3.1 User s Guide Second Edition The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2007. SAS BI Dashboard 3.1: User s Guide, Second Edition. Cary, NC:
More information2011 NASCIO RECOGNITION AWARDS NOMINATION CATEGORY: FAST TRACK SOLUTIONS CALIFORNIA MOBILE DEVELOPMENT PROGRAM STATE OF CALIFORNIA
2011 NASCIO RECOGNITION AWARDS NOMINATION CATEGORY: FAST TRACK SOLUTIONS CALIFORNIA MOBILE DEVELOPMENT PROGRAM STATE OF CALIFORNIA OFFICE OF TECHNOLOGY SERVICES CALIFORNIA TECHNOLOGY AGENCY 1 B. EXECUTIVE
More informationTAKING NETWORK TESTING TO THE NEXT LEVEL
TAKING NETWORK TESTING TO THE NEXT LEVEL WELCOME TO THE NEXT LEVEL OF NETWORK TESTING. Do you understand the performance and customer experience of your mobile network? P3 does. Our holistic approach is
More informationTesting System Qualities
Testing System Qualities Rebecca Wirfs-Brock Joseph Yoder Copyright 2012 Rebecca Wirfs-Brock, Joseph Yoder, Wirfs-Brock Associates and The Refactory, Inc. Introducing Rebecca President, Wirfs-Brock Associates
More informationPANDA: A Platform for Academic Knowledge Discovery and Acquisition
PANDA: A Platform for Academic Knowledge Discovery and Acquisition Zhaoan Dong 1 ; Jiaheng Lu 2,1 ; Tok Wang Ling 3 1.Renmin University of China 2.University of Helsinki 3.National University of Singapore
More informationGetting Started with GeoQuery
Getting Started with GeoQuery A quick-start guide to the download and use of spatial data for international development geo.aiddata.org GeoQuery Quick Start Handbook v. 1.01, December 2017 WWW.GEOQUERY.ORG
More informationInformatica Enterprise Information Catalog
Data Sheet Informatica Enterprise Information Catalog Benefits Automatically catalog and classify all types of data across the enterprise using an AI-powered catalog Identify domains and entities with
More informationThese are all examples of relatively simple databases. All of the information is textual or referential.
1.1. Introduction Databases are pervasive in modern society. So many of our actions and attributes are logged and stored in organised information repositories, or Databases. 1.1.01. Databases Where do
More informationFinding Topic-centric Identified Experts based on Full Text Analysis
Finding Topic-centric Identified Experts based on Full Text Analysis Hanmin Jung, Mikyoung Lee, In-Su Kang, Seung-Woo Lee, Won-Kyung Sung Information Service Research Lab., KISTI, Korea jhm@kisti.re.kr
More informationVMworld 2015 Track Names and Descriptions
VMworld 2015 Track Names and Descriptions Software- Defined Data Center Software- Defined Data Center General Pioneered by VMware and recognized as groundbreaking by the industry and analysts, the VMware
More informationCombining Government and Linked Open Data in Emergency Management
Combining Government and Linked Open Data in Emergency Management Axel Schulz 1,2 and Heiko Paulheim 3 1 SAP Research 2 Technische Universität Darmstadt Telecooperation Group axel.schulz@sap.com 3 Technische
More informationThe Mission of the Abu Dhabi Smart Solutions and Services Authority. Leading ADSSSA. By Michael J. Keegan
Perspective on Digital Transformation in Government with Her Excellency Dr. Rauda Al Saadi, Director General, Abu Dhabi Smart Solutions and Services Authority By Michael J. Keegan Today s digital economy
More informationA SURVEY ON SCHEDULING IN HADOOP FOR BIGDATA PROCESSING
Journal homepage: www.mjret.in ISSN:2348-6953 A SURVEY ON SCHEDULING IN HADOOP FOR BIGDATA PROCESSING Bhavsar Nikhil, Bhavsar Riddhikesh,Patil Balu,Tad Mukesh Department of Computer Engineering JSPM s
More informationProposed System. Start. Search parameter definition. User search criteria (input) usefulness score > 0.5. Retrieve results
, Impact Factor- 5.343 Hybrid Approach For Efficient Diversification on Cloud Stored Large Dataset Geetanjali Mohite 1, Prof. Gauri Rao 2 1 Student, Department of Computer Engineering, B.V.D.U.C.O.E, Pune,
More information(Big Data Integration) : :
(Big Data Integration) : : 3 # $%&'! ()* +$,- 2/30 ()* + # $%&' = 3 : $ 2 : 17 ;' $ # < 2 6 ' $%&',# +'= > 0 - '? @0 A 1 3/30 3?. - B 6 @* @(C : E6 - > ()* (C :(C E6 1' +'= - ''3-6 F :* 2G '> H-! +'-?
More informationBuilding a Data Strategy for a Digital World
Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service
More informationSAP Analytics Cloud Best Practices for BI Platform Live Universes
NOTE: Delete the yellow stickers when finished. See the SAP Image Library for other available images. Once the custom image is inserted, click Format Send Backward Send to Back, so the motion band is on
More informationTowards Open Innovation with Open Data Service Platform
Towards Open Innovation with Open Data Service Platform Marut Buranarach Data Science and Analytics Research Group National Electronics and Computer Technology Center (NECTEC), Thailand The 44 th Congress
More informationExtracting and Querying Probabilistic Information From Text in BayesStore-IE
Extracting and Querying Probabilistic Information From Text in BayesStore-IE Daisy Zhe Wang, Michael J. Franklin, Minos Garofalakis 2, Joseph M. Hellerstein University of California, Berkeley Technical
More informationResearch and Design of Education and Teaching Resource Management System based on ASP.NET Technology
2018 3rd International Conference on Education & Education Research (EDUER 2018) Research and Design of Education and Teaching Resource Management System based on ASP.NET Technology Jin Xin Science and
More informationPower BI Developer Bootcamp
Power BI Developer Bootcamp Mastering the Power BI Development Platform Course Code Audience Format Length Course Description Student Prerequisites PBD365 Professional Developers In-person and Remote 4
More informationThis guide covers 3 functions you can perform with DataPlace: o Mapping, o Creating Tables, and o Creating Rankings. Registering with DataPlace
Guide for Using DataPlace DataPlace is one-stop source for housing and demographic data about communities, the region, and the nation. The site assembles a variety of data sets from multiple sources, and
More information