Contextual Search using Cognitive Discovery Capabilities

Size: px
Start display at page:

Download "Contextual Search using Cognitive Discovery Capabilities"

Transcription

1 Contextual Search using Cognitive Discovery Capabilities In this exercise, you will work with a sample application that uses the Watson Discovery service API s for cognitive search use cases. Discovery service queries are used to extract and detect concepts, keywords, sentiment, entities such as people and companies, relationships as well as trends. Contextual search is a powerful way to gather personalized results based on information from multiple structured and unstructured data repositories. Context is used to help refine and target search results and relevance. Without context, users often must sift through a bunch of irrelevant results before finding what they want. Contextual intelligence helps to assign confidence rankings to search results and streamline the process of finding relevant, current data. The Discovery Service enables developers to build an automated data pipeline to ingest your unstructured data, where the discovery service uses Natural Language Understanding and other cognitive services to enrich understanding of the data. This process consists of automatically tagging NLP meta data, cleansing and normalizing for improved data quality. Once ingested and enriched, queries can be performed. In this exercise, we ll focus on using Discovery service queries with examples for contextual search with news datasources. Exercise: 1. Go to the Discovery News application in Bluemix at:

2 Enter a company name for a search term, for instance Amazon. The app uses Discovery service APIs and queries to perform a contextual search in news articles previously ingested and enriched thru the data pipeline and NLP process. It returns the following information: - most frequently occurring topics (concepts), companies, and people (entities) - the news articles search results with links to each article content, along document content sentiment score - positive/negative sentiment rating of the company for content from 10 randomly selected news sources/sites - sentiment trend along a timeline based on mentions of the company along with other companies that it is most frequently mentioned with, in the source articles Examine the contextual search results in each of the four sections: - Top Entities - Top Stories - Sentiment Analysis - Co-mentions & Trends

3 2. To get ready to examine and understand the Discovery service query that was used to perform this contextual search, lets review a few key concepts: Examples of keyword and entity queries: Queries can be structured for additional options including concepts, sentiment, filtering and aggregations that can provide deeper insights and identify patterns, clusters and trends. The Discovery service provides a query tool that uses a simple query language for multiple query types including boolean, filter, and aggregation queries to discover patterns, trends, and answers.

4 Aggregations are collections of occurrences of the keywords, concepts and entities from the search results. Aggregations can be nested to extract information and get insights about other keywords, concepts and entities that may be connected or related in the source content. 3. Examine the Discovery service query that was used to perform the news articles contextual search: - In the Top Entities block, click the View Query button. The content of the query consists of the actions and results described in-line in italics below: "return": "title,enrichedtitle.text,url,host,blekko.chrondate", "query": "\"amazon\",language:english", This is a simple keyword query for the company name

5 "aggregations": [ "nested(enrichedtitle.entities).filter(enrichedtitle.entities.ty pe:company).term(enrichedtitle.entities.text)", This aggregation collects the enriched data for companies the query specifies selection of entities of type company, to get company names mentioned in the news articles "nested(enrichedtitle.entities).filter(enrichedtitle.entities.ty pe:person).term(enrichedtitle.entities.text)", This aggregation collects the enriched data entities of type people to get names of people mentioned in the news articles "term(enrichedtitle.concepts.text)", This aggregation collects the enriched data for concepts, to get the names of topics mentioned in the news articles 4. Click on Response to view the response data returned from the query The response data consists of most frequently occurring company names, names of people and topics, with number of occurrences, in sorted order.

6 Click on the GoBack button. 5. In the Top Stories section, click on the View Query button, notice that the same query is used to retrieve data for the most frequently appearing stories based on the enriched title and extracted concepts. Click on the Response Data button response data includes the document title, URL, enriched title, host website and sentiment score. Note that in some cases the enriched title may be different than the document title, either to add more context or to remove irrelevant information such as URL strings in titles. 6. In the Sentiment Analysis section, click on the View Query button. Examine the query and notice the stanza in the query that extracts content sentiment: "term(blekko.basedomain).term(docsentiment.type)", This aggregation collects the enriched data for content sentiment of the news articles This next aggregation collects the enriched data for content sentiment of the news articles for min/max sentiment trend along a timeline of each mention of the company plus co-mentioned companies: "term(docsentiment.type)", "min(docsentiment.score)", "max(docsentiment.score)", "filter(enrichedtitle.entities.type::company).term(enrichedtitle.entities.text).timeslice(blekko.chrondate,1day).term(docsentime nt.type)" ], "filter": "blekko.hostrank>20,blekko.chrondate> ,blekko.chrondate < " Click on the Response Data button response data includes two sections: The first section provides the count of all documents queried having a positive sentiment score, negative sentiment score and neutral sentiment score.

7 The second section provides the positive/negative/neutral sentiment document count for each of the 10 randomly selected news sites the content was obtained from.

8 7. In the Co-mentions & Trends section, click on the Response Data button response data includes two sections: The first section provides the count of all documents in which the company name occurs having a positive sentiment score, negative sentiment score and neutral sentiment score, along with individual document sentiment scoring detail data for each. The second section provides for each of the top co-mentioned companies in the documents, the number of matches and the sentiment score document counts of all the documents with the co-mentioned company followed by the individual document sentiment scoring detail data. "key": "Google", "matching_results": 3599, "aggregations": [ "type": "timeslice", "field": "blekko.chrondate", "interval": "1d", "results": [ "key_as_string": " ", "key": , "matching_results": 8, "aggregations": [ "type": "term", "field": "docsentiment.type", "results": [ "key": "negative", "matching_results": 6 }, "key": "positive", "matching_results": 2 }

9 Learn more about Watson Discovery Service View these education modules on the Watson Discovery service to learn more: CHG8YUvWx WWLP Watson Discovery Service key use cases: Additional use cases are described for financial research, supply chain, customer behavior insights, field engineer advisor and surgical knowledgebase in the Architecture Center for Cognitive Discovery, which also provides detailed information on how to work with the Watson Discovery service to create your document store, create queries, and implement or integrate in your application. Using the Watson Discovery Service getting started documentation and query guide Step by step tutorial on using the Watson Discovery service in Bluemix and building custom queries with the Discovery query tool More information regarding contextual search using cognitive discovery capabilities Complete source code for the application used in this exercise is available at GitHub in the project repository

An Oracle White Paper October Oracle Social Cloud Platform Text Analytics

An Oracle White Paper October Oracle Social Cloud Platform Text Analytics An Oracle White Paper October 2012 Oracle Social Cloud Platform Text Analytics Executive Overview Oracle s social cloud text analytics platform is able to process unstructured text-based conversations

More information

C. The system is equally reliable for classifying any one of the eight logo types 78% of the time.

C. The system is equally reliable for classifying any one of the eight logo types 78% of the time. Volume: 63 Questions Question No: 1 A system with a set of classifiers is trained to recognize eight different company logos from images. It is 78% accurate. Without further information, which statement

More information

Create Swift mobile apps with IBM Watson services IBM Corporation

Create Swift mobile apps with IBM Watson services IBM Corporation Create Swift mobile apps with IBM Watson services Create a Watson sentiment analysis app with Swift Learning objectives In this section, you ll learn how to write a mobile app in Swift for ios and add

More information

USER GUIDE DASHBOARD OVERVIEW A STEP BY STEP GUIDE

USER GUIDE DASHBOARD OVERVIEW A STEP BY STEP GUIDE USER GUIDE DASHBOARD OVERVIEW A STEP BY STEP GUIDE DASHBOARD LAYOUT Understanding the layout of your dashboard. This user guide discusses the layout and navigation of the dashboard after the setup process

More information

Information Retrieval

Information Retrieval Information Retrieval CSC 375, Fall 2016 An information retrieval system will tend not to be used whenever it is more painful and troublesome for a customer to have information than for him not to have

More information

Sharp Social. Natural Language Understanding

Sharp Social. Natural Language Understanding Sharp Social Natural Language Understanding Step 1 Go to the URL https://console.ng.bluemix.net/ and press enter. A new window appears of IBM Bluemix which asks you to sign up and create a Bluemix account.

More information

Enhancing applications with Cognitive APIs IBM Corporation

Enhancing applications with Cognitive APIs IBM Corporation Enhancing applications with Cognitive APIs After you complete this section, you should understand: The Watson Developer Cloud offerings and APIs The benefits of commonly used Cognitive services 2 Watson

More information

Chapter 27 Introduction to Information Retrieval and Web Search

Chapter 27 Introduction to Information Retrieval and Web Search Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval

More information

Build a Company List

Build a Company List Build a Company List Build a Company List Let s look at how to build a list of companies associated with renewable energy using the Conceptual Search How would you perform this search? Consider who you

More information

DMI Exam PDDM Professional Diploma in Digital Marketing Version: 7.0 [ Total Questions: 199 ]

DMI Exam PDDM Professional Diploma in Digital Marketing Version: 7.0 [ Total Questions: 199 ] s@lm@n DMI Exam PDDM Professional Diploma in Digital Marketing Version: 7.0 [ Total Questions: 199 ] https://certkill.com Topic break down Topic No. of Questions Topic 1: Search Marketing (SEO) 21 Topic

More information

UNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai.

UNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai. UNIT-V WEB MINING 1 Mining the World-Wide Web 2 What is Web Mining? Discovering useful information from the World-Wide Web and its usage patterns. 3 Web search engines Index-based: search the Web, index

More information

Maximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009

Maximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009 Maximizing the Value of STM Content through Semantic Enrichment Frank Stumpf December 1, 2009 What is Semantics and Semantic Processing? Content Knowledge Framework Technology Framework Search Text Images

More information

IBM Netcool Operations Insight Version 1 Release 4. Integration Guide IBM SC

IBM Netcool Operations Insight Version 1 Release 4. Integration Guide IBM SC IBM Netcool Operations Insight Version 1 Release 4 Integration Guide IBM SC27-8601-00 Note Before using this information and the product it supports, read the information in Notices on page 249. This edition

More information

How Primo Works VE. 1.1 Welcome. Notes: Published by Articulate Storyline Welcome to how Primo works.

How Primo Works VE. 1.1 Welcome. Notes: Published by Articulate Storyline   Welcome to how Primo works. How Primo Works VE 1.1 Welcome Welcome to how Primo works. 1.2 Objectives By the end of this session, you will know - What discovery, delivery, and optimization are - How the library s collections and

More information

Question No : 1 Web spiders carry out a key function within search. What is it? Choose one of the following:

Question No : 1 Web spiders carry out a key function within search. What is it? Choose one of the following: Volume: 199 Questions Question No : 1 Web spiders carry out a key function within search. What is it? Choose one of the following: A. Indexing the site B. Ranking the site C. Parsing the site D. Translating

More information

Visualization and text mining of patent and non-patent data

Visualization and text mining of patent and non-patent data of patent and non-patent data Anton Heijs Information Solutions Delft, The Netherlands http://www.treparel.com/ ICIC conference, Nice, France, 2008 Outline Introduction Applications on patent and non-patent

More information

TISA Methodology Threat Intelligence Scoring and Analysis

TISA Methodology Threat Intelligence Scoring and Analysis TISA Methodology Threat Intelligence Scoring and Analysis Contents Introduction 2 Defining the Problem 2 The Use of Machine Learning for Intelligence Analysis 3 TISA Text Analysis and Feature Extraction

More information

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Empowering People with Knowledge the Next Frontier for Web Search Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Important Trends for Web Search Organizing all information Addressing user

More information

Search Engine Architecture II

Search Engine Architecture II Search Engine Architecture II Primary Goals of Search Engines Effectiveness (quality): to retrieve the most relevant set of documents for a query Process text and store text statistics to improve relevance

More information

RESEARCH ANALYTICS From Web of Science to InCites. September 20 th, 2010 Marta Plebani

RESEARCH ANALYTICS From Web of Science to InCites. September 20 th, 2010 Marta Plebani RESEARCH ANALYTICS From Web of Science to InCites September 20 th, 2010 Marta Plebani marta.plebani@thomsonreuters.com Web Of Science: main purposes Find high-impact articles and conference proceedings.

More information

Welcome to Analytics. Welcome to Applause! Table of Contents:

Welcome to Analytics. Welcome to Applause! Table of Contents: Welcome to Applause! Your success is our priority and we want to make sure Applause Analytics (ALX) provides you with actionable insight into what your users are thinking and saying about their experiences

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Welcome to Your Data

Welcome to Your Data Welcome to Your Data PLEASE STAND BY WEBINAR IS ABOUT TO BEGIN : YI PI Yippy makes the difference between searchers who find what they are looking for and those who have to slog lists of results with no

More information

IBM Watson Application Developer Workshop. Watson Knowledge Studio: Building a Machine-learning Annotator with Watson Knowledge Studio.

IBM Watson Application Developer Workshop. Watson Knowledge Studio: Building a Machine-learning Annotator with Watson Knowledge Studio. IBM Watson Application Developer Workshop Lab02 Watson Knowledge Studio: Building a Machine-learning Annotator with Watson Knowledge Studio January 2017 Duration: 60 minutes Prepared by Víctor L. Fandiño

More information

Embedding Intelligence through Cognitive Services

Embedding Intelligence through Cognitive Services Embedding Intelligence through Cognitive Services Dr. Latika Kharb 1, Sarabjit Kaur 2 Associate Professor 1, Student 2, Jagan Institute of Management Studies (JIMS), Delhi, India. Abstract: Cognitive Services

More information

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany Information Systems & University of Koblenz Landau, Germany Semantic Search examples: Swoogle and Watson Steffen Staad credit: Tim Finin (swoogle), Mathieu d Aquin (watson) and their groups 2009-07-17

More information

DATA MINING - 1DL105, 1DL111

DATA MINING - 1DL105, 1DL111 1 DATA MINING - 1DL105, 1DL111 Fall 2007 An introductory class in data mining http://user.it.uu.se/~udbl/dut-ht2007/ alt. http://www.it.uu.se/edu/course/homepage/infoutv/ht07 Kjell Orsborn Uppsala Database

More information

ThreatConnect Learning Exercises

ThreatConnect Learning Exercises ThreatConnect Learning Exercises The following exercises will teach you some of the important features within the ThreatConnect platform. You will learn various ways of adding intelligence data into ThreatConnect,

More information

Scuole di dottorato in Bioscienze e biotecnologie e Scienze biomediche sperimentali WEB OF SCIENCE

Scuole di dottorato in Bioscienze e biotecnologie e Scienze biomediche sperimentali WEB OF SCIENCE WEB OF SCIENCE WEB OF SCIENCE COVERAGE: multidisciplinary TIME RANGE: 1985- DOCUMENT TYPES: articles, proceedings papers, books WEB OF SCIENCE: SEARCH you can add one or more search field you can limit

More information

Google Tools and your Library - the Possibilities are Exponential

Google Tools and your Library - the Possibilities are Exponential Google Tools and your Library - the Possibilities are Exponential Google CSE Google Scholar Google Books @ My Library Google Alerts Google Gadgets & igoogle Google Custom Search engine Custom Search Engines

More information

Competitive Intelligence and Web Mining:

Competitive Intelligence and Web Mining: Competitive Intelligence and Web Mining: Domain Specific Web Spiders American University in Cairo (AUC) CSCE 590: Seminar1 Report Dr. Ahmed Rafea 2 P age Khalid Magdy Salama 3 P age Table of Contents Introduction

More information

Parmenides. Semi-automatic. Ontology. construction and maintenance. Ontology. Document convertor/basic processing. Linguistic. Background knowledge

Parmenides. Semi-automatic. Ontology. construction and maintenance. Ontology. Document convertor/basic processing. Linguistic. Background knowledge Discover hidden information from your texts! Information overload is a well known issue in the knowledge industry. At the same time most of this information becomes available in natural language which

More information

COMPARISON WHITEPAPER. Snowplow Insights VS SaaS load-your-data warehouse providers. We do data collection right.

COMPARISON WHITEPAPER. Snowplow Insights VS SaaS load-your-data warehouse providers. We do data collection right. COMPARISON WHITEPAPER Snowplow Insights VS SaaS load-your-data warehouse providers We do data collection right. Background We were the first company to launch a platform that enabled companies to track

More information

Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search

Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search 1 / 33 Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search Bernd Wittefeld Supervisor Markus Löckelt 20. July 2012 2 / 33 Teaser - Google Web History http://www.google.com/history

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

Ugly PBNer Standard Operating Procedures

Ugly PBNer Standard Operating Procedures Ugly PBNer Standard Operating Procedures Product Registration Product Registration (Step 1) Click on Tools From The Menu Bar Click Software Registration Product Registration (Step 2) Enter Your Name, Email,

More information

Oracle Endeca Information Discovery

Oracle Endeca Information Discovery Oracle Endeca Information Discovery Glossary Version 2.4.0 November 2012 Copyright and disclaimer Copyright 2003, 2013, Oracle and/or its affiliates. All rights reserved. Oracle and Java are registered

More information

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your

More information

Chrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO

Chrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO Chrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO INDEX Proposal Recap Implementation Evaluation Future Works Proposal Recap Keyword Visualizer (chrome

More information

Prototyping Data Intensive Apps: TrendingTopics.org

Prototyping Data Intensive Apps: TrendingTopics.org Prototyping Data Intensive Apps: TrendingTopics.org Pete Skomoroch Research Scientist at LinkedIn Consultant at Data Wrangling @peteskomoroch 09/29/09 1 Talk Outline TrendingTopics Overview Wikipedia Page

More information

Reputation Management Guide

Reputation Management Guide Reputation Management Guide What is Reputation Management? Reputation Management is a tool that measures your online visibility and alerts you every time your business is reviewed or mentioned online.

More information

Finding Stories in Data

Finding Stories in Data Finding Stories in Data http://training.theodi.org/malaysia David Tarrant @davetaz Improve understanding of how to source, analyse and visualise data to discover insight and tell stories. Aim Aims Identify

More information

Analytics and Visualization

Analytics and Visualization GU I DE NO. 4 Analytics and Visualization AWS IoT Analytics Mini-User Guide Introduction As IoT applications scale, so does the data generated from these various IoT devices. This data is raw, unstructured,

More information

Media Mining Client. Quick User Guide. Version

Media Mining Client. Quick User Guide. Version Media Mining Client Quick User Guide Version 2016-3 Table of Contents How to get started Main interface page 3 Story View page 4 Multilingual options page 5 Visual Features Globe page 6 Relationship Graph

More information

Front-End Web Developer Nanodegree Syllabus

Front-End Web Developer Nanodegree Syllabus Front-End Web Developer Nanodegree Syllabus Build Stunning User Experiences Before You Start You've taken the first step toward becoming a web developer by choosing the Front End Web Developer Nanodegree

More information

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality?

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality? Oliver Engels & Tillmann Eitelberg Big Data! Big Quality? Like to visit Germany? PASS Camp 2017 Main Camp 5.12 7.12.2017 (4.12 Kick Off Evening) Lufthansa Training & Conference Center, Seeheim SQL Konferenz

More information

DATA MINING II - 1DL460. Spring 2014"

DATA MINING II - 1DL460. Spring 2014 DATA MINING II - 1DL460 Spring 2014" A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt14 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,

More information

Troubleshoot DNA Center Using Data Platform

Troubleshoot DNA Center Using Data Platform About Data Platform, on page 1 Troubleshoot DNA Center Using the Analytics Ops Center, on page 2 View or Update Collector Configuration Information, on page 3 Configure Data Retention Settings, on page

More information

IBM Advantage: IBM Watson Compare and Comply Element Classification

IBM Advantage: IBM Watson Compare and Comply Element Classification IBM Advantage: IBM Watson Compare and Comply Element Classification Executive overview... 1 Introducing Watson Compare and Comply... 2 Definitions... 3 Element Classification insights... 4 Sample use cases...

More information

Chapter 2. Architecture of a Search Engine

Chapter 2. Architecture of a Search Engine Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them

More information

Overview of Web Mining Techniques and its Application towards Web

Overview of Web Mining Techniques and its Application towards Web Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous

More information

Package rzeit2. January 7, 2019

Package rzeit2. January 7, 2019 Type Package Title Client for the ZEIT ONLINE Content API Version 0.2.3 Package rzeit2 January 7, 2019 Interface to gather newspaper articles from 'DIE ZEIT' and 'ZEIT ONLINE', based on a multilevel query

More information

white paper 4 Steps to Better Keyword Grouping Strategies for More Effective & Profitable Keyword Segmentation

white paper 4 Steps to Better Keyword Grouping Strategies for More Effective & Profitable Keyword Segmentation white paper 4 Steps to Better Keyword Grouping Strategies for More Effective & Profitable Keyword Segmentation 2009, WordStream, Inc. All rights reserved. WordStream technologies are protected by pending

More information

Data Mining Concepts & Tasks

Data Mining Concepts & Tasks Data Mining Concepts & Tasks Duen Horng (Polo) Chau Georgia Tech CSE6242 / CX4242 Jan 16, 2014 Partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko, Christos Faloutsos Last Time

More information

Evaluating the Usefulness of Sentiment Information for Focused Crawlers

Evaluating the Usefulness of Sentiment Information for Focused Crawlers Evaluating the Usefulness of Sentiment Information for Focused Crawlers Tianjun Fu 1, Ahmed Abbasi 2, Daniel Zeng 1, Hsinchun Chen 1 University of Arizona 1, University of Wisconsin-Milwaukee 2 futj@email.arizona.edu,

More information

Data Mining Concepts & Tasks

Data Mining Concepts & Tasks Data Mining Concepts & Tasks Duen Horng (Polo) Chau Georgia Tech CSE6242 / CX4242 Sept 9, 2014 Partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko, Christos Faloutsos Last Time

More information

Blurring the Line Between Developer and Data Scientist

Blurring the Line Between Developer and Data Scientist Blurring the Line Between Developer and Data Scientist Notebooks with PixieDust va barbosa va@us.ibm.com Developer Advocacy IBM Watson Data Platform WHY ARE YOU HERE? More companies making bet-the-business

More information

Ranked Retrieval. Evaluation in IR. One option is to average the precision scores at discrete. points on the ROC curve But which points?

Ranked Retrieval. Evaluation in IR. One option is to average the precision scores at discrete. points on the ROC curve But which points? Ranked Retrieval One option is to average the precision scores at discrete Precision 100% 0% More junk 100% Everything points on the ROC curve But which points? Recall We want to evaluate the system, not

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

THE URBAN COWGIRL PRESENTS KEYWORD RESEARCH

THE URBAN COWGIRL PRESENTS KEYWORD RESEARCH THE URBAN COWGIRL PRESENTS KEYWORD RESEARCH The most valuable keywords you have are the ones you mine from your pay-per-click performance reports. Scaling keywords that have proven to convert to orders

More information

Relevance Feature Discovery for Text Mining

Relevance Feature Discovery for Text Mining Relevance Feature Discovery for Text Mining Laliteshwari 1,Clarish 2,Mrs.A.G.Jessy Nirmal 3 Student, Dept of Computer Science and Engineering, Agni College Of Technology, India 1,2 Asst Professor, Dept

More information

Metadata Standards & Applications. 7. Approaches to Models of Metadata Creation, Storage, and Retrieval

Metadata Standards & Applications. 7. Approaches to Models of Metadata Creation, Storage, and Retrieval Metadata Standards & Applications 7. Approaches to Models of Metadata Creation, Storage, and Retrieval Goals for Session Understand the differences between traditional vs. digital library Metadata creation

More information

Microsoft FAST Search Server 2010 for SharePoint for Application Developers Course 10806A; 3 Days, Instructor-led

Microsoft FAST Search Server 2010 for SharePoint for Application Developers Course 10806A; 3 Days, Instructor-led Microsoft FAST Search Server 2010 for SharePoint for Application Developers Course 10806A; 3 Days, Instructor-led Course Description This course is designed to highlight the differentiating features of

More information

Taming Text. How to Find, Organize, and Manipulate It MANNING GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS. Shelter Island

Taming Text. How to Find, Organize, and Manipulate It MANNING GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS. Shelter Island Taming Text How to Find, Organize, and Manipulate It GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS 11 MANNING Shelter Island contents foreword xiii preface xiv acknowledgments xvii about this book

More information

From Digitally Disrupted to Digital Disrupter. Alex Andrenacci Managing Director Accenture Technology

From Digitally Disrupted to Digital Disrupter. Alex Andrenacci Managing Director Accenture Technology From Digitally Disrupted to Digital Disrupter Alex Andrenacci Managing Director Accenture Technology alex.andrenacci@accenture.com Every Business is a Digital Business: The Evolution Accenture Technology

More information

This is an author-deposited version published in : Eprints ID : 12964

This is an author-deposited version published in :   Eprints ID : 12964 Open Archive TOULOUSE Archive Ouverte (OATAO) OATAO is an open access repository that collects the work of Toulouse researchers and makes it freely available over the web where possible. This is an author-deposited

More information

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. 6 What is Web Mining? p. 6 Summary of Chapters p. 8 How

More information

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,

More information

Datameer for Data Preparation:

Datameer for Data Preparation: Datameer for Data Preparation: Explore, Profile, Blend, Cleanse, Enrich, Share, Operationalize DATAMEER FOR DATA PREPARATION: EXPLORE, PROFILE, BLEND, CLEANSE, ENRICH, SHARE, OPERATIONALIZE Datameer Datameer

More information

DIGITAL MARKETING Your revolution starts here

DIGITAL MARKETING Your revolution starts here DIGITAL MARKETING Your revolution starts here Course Highlights Online Marketing Introduction to Online Search. Understanding How Search Engines Work. Understanding Google Page Rank. Introduction to Search

More information

Automated Online News Classification with Personalization

Automated Online News Classification with Personalization Automated Online News Classification with Personalization Chee-Hong Chan Aixin Sun Ee-Peng Lim Center for Advanced Information Systems, Nanyang Technological University Nanyang Avenue, Singapore, 639798

More information

PART I A Technical Guide to Oracle Endeca Information Discovery

PART I A Technical Guide to Oracle Endeca Information Discovery Contents at a Glance PART I A Technical Guide to Oracle Endeca Information Discovery 1 Oracle Endeca Information Discovery Architecture... 3 2 Powering Endeca Server... 25 3 Designing Visualization with

More information

Natural Language Processing with PoolParty

Natural Language Processing with PoolParty Natural Language Processing with PoolParty Table of Content Introduction to PoolParty 2 Resolving Language Problems 4 Key Features 5 Entity Extraction and Term Extraction 5 Shadow Concepts 6 Word Sense

More information

seobility First steps towards successful SEO

seobility First steps towards successful SEO ? seobility BEGINNER S Guide First steps towards successful SEO 1 / 32 Seobility BEGINNER s Guide l First steps towards successful SEO TABLE OF CONTENTS Welcome 3 Adding a new project 4 General project

More information

Introduction to Text Mining. Hongning Wang

Introduction to Text Mining. Hongning Wang Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:

More information

User guide for GEM-TREND

User guide for GEM-TREND User guide for GEM-TREND 1. Requirements for Using GEM-TREND GEM-TREND is implemented as a java applet which can be run in most common browsers and has been test with Internet Explorer 7.0, Internet Explorer

More information

Flightplan: Getting from Enterprise Search to Cognitive Intelligence

Flightplan: Getting from Enterprise Search to Cognitive Intelligence Flightplan: Getting from Enterprise Search to Cognitive Intelligence 1 Flightplan: Getting from Enterprise Search to Cognitive Intelligence Introduction You walk into your office and say What are the sales

More information

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered.

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered. Content Enrichment An essential strategic capability for every publisher Enriched content. Delivered. An essential strategic capability for every publisher Overview Content is at the centre of everything

More information

Executive Alerts User Guide. A walkthrough of all the main features

Executive Alerts User Guide. A walkthrough of all the main features Executive Alerts User Guide A walkthrough of all the main features Where can I receive alerts? You can receive Executive Alerts in three main methods. 1 Executive Alerts Mobile App Download the newest

More information

DEC Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES

DEC Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES DEC. 1-5 Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES Monday Overview of Databases A web search engine is a large database containing information about Web pages that have been registered

More information

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Information Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science Information Retrieval CS 6900 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Information Retrieval Information Retrieval (IR) is finding material of an unstructured

More information

RPI INSIDE DEEPQA INTRODUCTION QUESTION ANALYSIS 11/26/2013. Watson is. IBM Watson. Inside Watson RPI WATSON RPI WATSON ??? ??? ???

RPI INSIDE DEEPQA INTRODUCTION QUESTION ANALYSIS 11/26/2013. Watson is. IBM Watson. Inside Watson RPI WATSON RPI WATSON ??? ??? ??? @ INSIDE DEEPQA Managing complex unstructured data with UIMA Simon Ellis INTRODUCTION 22 nd November, 2013 WAT SON TECHNOLOGIES AND OPEN ARCHIT ECT URE QUEST ION ANSWERING PROFESSOR JIM HENDLER S IMON

More information

How To Construct A Keyword Strategy?

How To Construct A Keyword Strategy? Introduction The moment you think about marketing these days the first thing that pops up in your mind is to go online. Why is there a heck about marketing your business online? Why is it so drastically

More information

Google My Business The Free Listing

Google My Business The Free Listing Google My Business The Free Listing Entrata compiled year-over-year data from 400+ apartment communities across the United States in this study to to help apartment marketers better understand how to measure

More information

Mining Web Data. Lijun Zhang

Mining Web Data. Lijun Zhang Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems

More information

Part I: Data Mining Foundations

Part I: Data Mining Foundations Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?

More information

6 TOOLS FOR A COMPLETE MARKETING WORKFLOW

6 TOOLS FOR A COMPLETE MARKETING WORKFLOW 6 S FOR A COMPLETE MARKETING WORKFLOW 01 6 S FOR A COMPLETE MARKETING WORKFLOW FROM ALEXA DIFFICULTY DIFFICULTY MATRIX OVERLAP 6 S FOR A COMPLETE MARKETING WORKFLOW 02 INTRODUCTION Marketers use countless

More information

USER GUIDE DESIGN A STEP BY STEP GUIDE

USER GUIDE DESIGN A STEP BY STEP GUIDE USER GUIDE DESIGN A STEP BY STEP GUIDE UNDERSTANDING THE NEW DESIGN TAB Users with Design privileges choose how your data will display within your dashboard visually. Under DASHBOARD DESIGN, you can change

More information

SAS Event Stream Processing

SAS Event Stream Processing FACT SHEET SAS Event Stream Processing Act on data while it s in motion to keep a real-time pulse on your business What does SAS Event Stream Processing do? SAS Event Stream Processing analyzes and understands

More information

On-Page SEO is the foundation with which backlinks and other off-page SEO strategies reach their highest potential.

On-Page SEO is the foundation with which backlinks and other off-page SEO strategies reach their highest potential. HumanProofDesigns Presents: Perfect On-Page SEO On-Page SEO is the foundation with which backlinks and other off-page SEO strategies reach their highest potential. The better your on-page, the less links

More information

Ajloun National University

Ajloun National University Study Plan Guide for the Bachelor Degree in Computer Information System First Year hr. 101101 Arabic Language Skills (1) 101099-01110 Introduction to Information Technology - - 01111 Programming Language

More information

Make the most of your access to ScienceDirect

Make the most of your access to ScienceDirect 1 Make the most of your access to ScienceDirect Present Future 2 ScienceDirect Training Deck We re here to help you make the most of your access to ScienceDirect. ScienceDirect offers researchers the latest

More information

Enterprise Data Catalog for Microsoft Azure Tutorial

Enterprise Data Catalog for Microsoft Azure Tutorial Enterprise Data Catalog for Microsoft Azure Tutorial VERSION 10.2 JANUARY 2018 Page 1 of 45 Contents Tutorial Objectives... 4 Enterprise Data Catalog Overview... 5 Overview... 5 Objectives... 5 Enterprise

More information

ANALYTICS DATA To Make Better Content Marketing Decisions

ANALYTICS DATA To Make Better Content Marketing Decisions HOW TO APPLY ANALYTICS DATA To Make Better Content Marketing Decisions AS A CONTENT MARKETER you should be well-versed in analytics, no matter what your specific roles and responsibilities are in working

More information

Big Data Computing for GIS Data Discovery

Big Data Computing for GIS Data Discovery Big Data Computing for GIS Data Discovery Solutions for Today Options for Tomorrow Vic Baker 1,2, Jennifer Bauer 1, Kelly Rose 1,Devin Justman 1,3 1 National Energy Technology Laboratory, 2 MATRIC, 3 AECOM

More information

Social Business Intelligence in Action

Social Business Intelligence in Action Social Business Intelligence in ction Matteo Francia, nrico Gallinucci, Matteo Golfarelli, Stefano Rizzi DISI University of Bologna, Italy Introduction Several Social-Media Monitoring tools are available

More information

SOURCERER: MINING AND SEARCHING INTERNET- SCALE SOFTWARE REPOSITORIES

SOURCERER: MINING AND SEARCHING INTERNET- SCALE SOFTWARE REPOSITORIES SOURCERER: MINING AND SEARCHING INTERNET- SCALE SOFTWARE REPOSITORIES Introduction to Information Retrieval CS 150 Donald J. Patterson This content based on the paper located here: http://dx.doi.org/10.1007/s10618-008-0118-x

More information

The Evolution of Search:

The Evolution of Search: : Moving from Information Retrieval to Dialogue-driven Advisory Systems A White Paper by White Paper : Moving from Information Retrieval to Dialogue-driven Advisory Systems Overview Online searching has

More information

Microsoft FAST Search Server 2010 for SharePoint Evaluation Guide

Microsoft FAST Search Server 2010 for SharePoint Evaluation Guide Microsoft FAST Search Server 2010 for SharePoint Evaluation Guide 1 www.microsoft.com/sharepoint The information contained in this document represents the current view of Microsoft Corporation on the issues

More information

Text Mining. Representation of Text Documents

Text Mining. Representation of Text Documents Data Mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data,

More information