Castanet: Using WordNet to Build Facet Hierarchies. Emilia Stoica and Marti Hearst School of Information, Berkeley

Size: px
Start display at page:

Download "Castanet: Using WordNet to Build Facet Hierarchies. Emilia Stoica and Marti Hearst School of Information, Berkeley"

Transcription

1 Castanet: Using WordNet to Build Facet Hierarchies Emilia Stoica and Marti Hearst School of Information, Berkeley

2 Motivation Want to assign labels from multiple hierarchies

3 Motivation Hot and Sweet Chicken: 1 pepper, 2 apricots, 1 pound chicken breast, 1 Tbsp gingerroot Meat Chicken Fruit Apricot Vegetables pepper Flavor gingerroot

4 Castanet Carves out a structure from the hypernym (IS-A) relations within WordNet Produces surprisingly good results for a wide range of subjects e.g., arts, medicine, recipes, math, news, bibliographical records

5 WordNet Challenges A word may have more than one sense - Fine granularity of word sense distinctions e.g., newspaper (#1) - daily publication on folded sheets newspaper (#3) - physical object - Ambiguity for the same sense tuna #1 cactus #2 fish food fish bony fish

6 WordNet Challenges (cont.) The hypernym path may be quite long (e.g., sense #3 of tuna has 14 nodes) Sparse coverage of proper names and noun phrases (not addressed)

7 Algorithm Goals Build a set of facet hierarchies Balance depth and breadth Avoid skinny paths Don t go too deep or too broad Choose understandable labels Disambiguate words Currently a word can take on only one sense

8 Our Approach Documents Select terms Build core tree Augment core tree WordNet Compress Tree Remove top level categories Divide into facets

9 1. Select Terms Select well-distributed terms from the collection Eliminate stopwords Documents Select terms Build core tree Augm. core tree Retain only those terms with a distribution higher than a threshold WordNet Comp. tree Remove top level categ. (default: top 10%)

10 2. Build Core Tree Build a backbone Create paths from unambiguous terms only Bias the structure towards appropriate senses of words Documents Select terms WordNet Build core tree Comp. tree Augm. core tree Remove top level categ. entity substance,matter nutriment dessert frozen dessert ice cream sundae sundae entity substance,matter nutriment dessert frozen dessert sherbet,sorbet sherbet Get hypernym path if term: - has only one sense, or - matches a pre-selected WordNet domain Adding a new term increases a count at each node on its path by # of docs with the term.

11 2. Build Core Tree (cont.) Merge hypernym paths to build a tree entity substance,matter nutriment dessert frozen dessert entity substance,matter nutriment dessert frozen dessert entity substance,matter nutriment dessert frozen dessert ice cream sundae sherbet,sorbet ice cream sundae sherbet,sorbet sundae sherbet sundae sherbet

12 3. Augment Core Tree Attach to Core tree the terms with more than one sense Documents Select terms Build core tree Augm. core tree Favor the more common path over other alternatives WordNet Comp. tree Remove top level categ.

13 Augment Core Tree (cont.) Date (p1) Date (p2) entity abstraction substance,matter measure, quantity food, nutrient fundamental quality nutriment time period food calendar day (18) edible fruit (78) date Choose this path since it has more items assigned date

14 Optional Step: Domains To disambiguate, use Domains Wordnet has 212 Domains medicine, mathematics, biology, chemistry, linguistics, soccer, etc. A better collection has been developed by Magnini 2000 Assigns a domain to every noun synset Automatically scan the collection to see which domains apply The user selects which of the suggested domains to use or may add own Paths for terms that match the selected domains are added to the core tree

15 Using Domains dip glosses: Sense 1: A depression in an otherwise level surface Sense 2: The angle that a magnet needle makes with horizon Sense 3: Tasty mixture into which bite-size foods are dipped dip hypernyms Sense 1 Sense 2 Sense 3 solid shape, form food => concave shape => space => ingredient, fixings => depression => angle => flavorer Given domain food, choose sense 3

16 4. Compress Tree Rule 1: Eliminate a parent with fewer than k children unless it is the root or its distribution is larger than 0.1*max dist Documents Select terms WordNet Build core tree Comp. tree abstraction Augm. core tree Remove top level categ. ice cream sundae sundae dessert frozen dessert parfait sherbet,sorbet sherbet dessert frozen dessert sundae parfait sherbet

17 4. Compress Tree (cont.) Rule 2: Eliminate a child whose name appears within the parent s name Documents Select terms WordNet Build core tree Comp. tree abstraction Augm. core tree Remove top level categ. dessert frozen dessert dessert sundae parfait sherbet sundae parfait sherbet

18 5. Divide into Facets Divide into facets

19 5. Divide into Facets (Remove top levels) entity substance,matter food,nutriment Divide into facets food stuff,food product ingredient,fixings flavorer flavorer herb sweetening herb sweetening parsley oregano sugar syrup parsley oregano sugar syrup Rule 1: Eliminate very general categories (e.g., entity, abstraction). If no paths are longer than threshold t, then done. Else: Rule 2: Undo first step. Then eliminate all top levels until the maximum length of any path in the resulting hierarchy is t.

20 Example: Recipes (3500 docs)

21 Castanet Output (shown in Flamenco)

22 Castanet Output

23 Castanet Output

24 Castanet Output

25 Castanet Output

26

27 Castanet Evaluation This is a tool for information architects, so people of this type did the evaluation We compared output on Recipes Biomedical journal titles We compared to two state-of-the-art algorithms LDA (Blei et al. 04) Subsumption (Sanderson & Croft 99)

28 Subsumption Output

29 Subsumption Output

30 Subsumption Output

31 Subsumption Output

32 LDA Output

33 LDA Output

34 LDA Output

35 Evaluation Method Information architects assessed the category systems For each of 2 systems output: Examined and commented on top-level Examined and commented on two sub-levels Then comment on overall properties Meaningful? Systematic? Likely to use in your work?

36 Evaluation (cont.) Sample questions for top level categories: - Would you add/remove/rename any category? - Did this category match your expectations? Sample questions for a specific category: - Would you add/move/remove any sub-categories? - Would you promote any sub-category to top level? General questions: - Would you use Castanet? - Would you use LDA? - Would you use Subsumption? - Would you use list of most frequent terms?

37 Evaluation Results Results on recipes collection for Would you use this system in your work? # Yes in some cases or yes, definitely : Castanet: 29/34 LDA: 0/18 Subsumption: 6/16 Baseline: 25/34 Average response to questions about quality (4 = strongly agree )

38 Evaluation Results Average responses for top-level categories 4= no changes, 1 = change many Average responses for 2 subcategories

39 Needed Improvements Take spelling variations and morphological variants into account Use verbs and adjectives, not just nouns Normalize noun phrases Allow terms to have more than one sense Improve algorithm for assigning documents to categories.

40 Opportunities for Tagging New opportunity: Tagging, folksonomies (flickr, de.lici.ous) People are created facets in a decentralized manner They are assigning multiple facets to items This is done on a massive scale This leads naturally to meaningful associations

41 Conclusions Flexible application of hierarchical faceted metadata is a proven approach for navigating large information collections. Midway in complexity between simple hierarchies and deep knowledge representation. Currently in use on e-commerce sites; spreading to other domains Systems are needed to help create faceted metadata structures Our WordNet-based algorithm, while not perfect, seems like it will be a useful tool for Information Architects.

42 Conclusions Castanet builds a set of faceted hierarchies by finding IS-A relations between terms using WordNet. The method has been tested on various domains: medicine, recipes, math, news, arts, bibliographical records Usability study shows: Castanet is preferred to other state-of-the art solutions. Information architects want to use the tool in their work.

43 Learn More Funding This work supported in part by NSF (IIS ) For more information: Stoica, E., Hearst, M., and Richardson, M., Automating Creation of Hierarchical Faceted Metadata Structures, NAACL/HLT 2007 See

Faceted Metadata in Search Interfaces

Faceted Metadata in Search Interfaces Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS-9984741. Focus: Search and Navigation of Large Collections Image Collections E-Government

More information

Faceted Metadata in Search Interfaces

Faceted Metadata in Search Interfaces Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS-9984741. Focus: Search and Navigation of Large Collections Image Collections E-Government

More information

NLP Support for Faceted Navigation in Scholarly Collections

NLP Support for Faceted Navigation in Scholarly Collections NLP Support for Faceted Navigation in Scholarly Collections Marti A. Hearst School of Information, UC Berkeley 102 South Hall, Berkeley, CA 94720 hearst@ischool.berkeley.edu Emilia Stoica Ask.com 555 12th

More information

Text Mining. Munawar, PhD. Text Mining - Munawar, PhD

Text Mining. Munawar, PhD. Text Mining - Munawar, PhD 10 Text Mining Munawar, PhD Definition Text mining also is known as Text Data Mining (TDM) and Knowledge Discovery in Textual Database (KDT).[1] A process of identifying novel information from a collection

More information

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 93-94

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 93-94 ه عا ی Semantic Web Ontology Engineering and Evaluation Morteza Amini Sharif University of Technology Fall 93-94 Outline Ontology Engineering Class and Class Hierarchy Ontology Evaluation 2 Outline Ontology

More information

Ontology Development. Qing He

Ontology Development. Qing He A tutorial report for SENG 609.22 Agent Based Software Engineering Course Instructor: Dr. Behrouz H. Far Ontology Development Qing He 1 Why develop an ontology? In recent years the development of ontologies

More information

Component tree: an efficient representation of grayscale connected components

Component tree: an efficient representation of grayscale connected components Component tree: an efficient representation of grayscale connected components Gaëtan Lehmann 1 December 31, 2007 1 INRA, UMR 1198; ENVA; CNRS, FRE 2857, Biologie du Développement et Reproduction, Jouy

More information

Duty/Task Crosswalk to ACF Standards

Duty/Task Crosswalk to ACF Standards Techniques Duty/Task Crosswalk to Stards The Techniques Duty/Task Crosswalk is referenced to three American Federation () Required Knowledge Skill Competencies: 1. : American Federation Accrediting Commission

More information

GernEdiT: A Graphical Tool for GermaNet Development

GernEdiT: A Graphical Tool for GermaNet Development GernEdiT: A Graphical Tool for GermaNet Development Verena Henrich University of Tübingen Tübingen, Germany. verena.henrich@unituebingen.de Erhard Hinrichs University of Tübingen Tübingen, Germany. erhard.hinrichs@unituebingen.de

More information

For convenience in typing examples, we can shorten the wordnet name to wn.

For convenience in typing examples, we can shorten the wordnet name to wn. NLP Lab Session Week 14, December 4, 2013 More Semantics: WordNet similarity in NLTK and LDA Mallet demo More on Final Projects: weka memory and loading Spam documents Getting Started For the final projects,

More information

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 95-96

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 95-96 ه عا ی Semantic Web Ontology Engineering and Evaluation Morteza Amini Sharif University of Technology Fall 95-96 Outline Ontology Engineering Class and Class Hierarchy Ontology Evaluation 2 Outline Ontology

More information

> Semantic Web Use Cases and Case Studies

> Semantic Web Use Cases and Case Studies > Semantic Web Use Cases and Case Studies Case Study: The Semantic Web for the Agricultural Domain, Semantic Navigation of Food, Nutrition and Agriculture Journal Gauri Salokhe, Margherita Sini, and Johannes

More information

R&H TOPPINGS, SYRUPS, & BASES

R&H TOPPINGS, SYRUPS, & BASES Premium R&H TOPPINGS, SYRUPS, & BASES READY-TO-USE Solutions For Your Operation R&H SAUCES & TOPPINGS DELICIOUS READY-TO-USE SAUCES & TOPPINGS THAT WILL SATISFY Create extraordinary desserts with rich

More information

COMP90042 LECTURE 3 LEXICAL SEMANTICS COPYRIGHT 2018, THE UNIVERSITY OF MELBOURNE

COMP90042 LECTURE 3 LEXICAL SEMANTICS COPYRIGHT 2018, THE UNIVERSITY OF MELBOURNE COMP90042 LECTURE 3 LEXICAL SEMANTICS SENTIMENT ANALYSIS REVISITED 2 Bag of words, knn classifier. Training data: This is a good movie.! This is a great movie.! This is a terrible film. " This is a wonderful

More information

Ontology Development. Farid Naimi

Ontology Development. Farid Naimi Ontology Development Farid Naimi Overview Why develop an ontology? What is in an ontology? Ontology Development Defining classes and a class hierarchy Naming considerations Conclusion Why develop an ontology?

More information

0.1 Knowledge Organization Systems for Semantic Web

0.1 Knowledge Organization Systems for Semantic Web 0.1 Knowledge Organization Systems for Semantic Web 0.1 Knowledge Organization Systems for Semantic Web 0.1.1 Knowledge Organization Systems Why do we need to organize knowledge? Indexing Retrieval Organization

More information

GDSN Major Release 3.1 Training Course for High Level Changes

GDSN Major Release 3.1 Training Course for High Level Changes Welcome to the Global Data Synchronisation Network Major Release 3.1 Training Course for. This 30 minutes e-learning targets any GS1 Member Organization, Data Pool or Trading partner that needs to implement

More information

Physical Level of Databases: B+-Trees

Physical Level of Databases: B+-Trees Physical Level of Databases: B+-Trees Adnan YAZICI Computer Engineering Department METU (Fall 2005) 1 B + -Tree Index Files l Disadvantage of indexed-sequential files: performance degrades as file grows,

More information

Ontology Research Group Overview

Ontology Research Group Overview Ontology Research Group Overview ORG Dr. Valerie Cross Sriram Ramakrishnan Ramanathan Somasundaram En Yu Yi Sun Miami University OCWIC 2007 February 17, Deer Creek Resort OCWIC 2007 1 Outline Motivation

More information

Ontology Engineering for the Semantic Web and Beyond

Ontology Engineering for the Semantic Web and Beyond Ontology Engineering for the Semantic Web and Beyond Natalya F. Noy Stanford University noy@smi.stanford.edu A large part of this tutorial is based on Ontology Development 101: A Guide to Creating Your

More information

Graph-based Entity Linking using Shortest Path

Graph-based Entity Linking using Shortest Path Graph-based Entity Linking using Shortest Path Yongsun Shim 1, Sungkwon Yang 1, Hyunwhan Joe 1, Hong-Gee Kim 1 1 Biomedical Knowledge Engineering Laboratory, Seoul National University, Seoul, Korea {yongsun0926,

More information

MASSACHUSETTS VOCATIONAL TECHNICAL TEACHER TESTING PROGRAM SCOPE OF TEST CODE #11 - CULINARY ARTS WRITTEN EXAM QUESTIONS TIME ALLOWED: 3 HOURS

MASSACHUSETTS VOCATIONAL TECHNICAL TEACHER TESTING PROGRAM SCOPE OF TEST CODE #11 - CULINARY ARTS WRITTEN EXAM QUESTIONS TIME ALLOWED: 3 HOURS MASSACHUSETTS VOCATIONAL TECHNICAL TEACHER TESTING PROGRAM SCOPE OF TEST CODE #11 - CULINARY ARTS WRITTEN EXAM - 100 QUESTIONS TIME ALLOWED: 3 HOURS PERCENT OF TEST: 15 % Health and Safety Sanitation Food

More information

Automatic Construction of WordNets by Using Machine Translation and Language Modeling

Automatic Construction of WordNets by Using Machine Translation and Language Modeling Automatic Construction of WordNets by Using Machine Translation and Language Modeling Martin Saveski, Igor Trajkovski Information Society Language Technologies Ljubljana 2010 1 Outline WordNet Motivation

More information

TEXT PREPROCESSING FOR TEXT MINING USING SIDE INFORMATION

TEXT PREPROCESSING FOR TEXT MINING USING SIDE INFORMATION TEXT PREPROCESSING FOR TEXT MINING USING SIDE INFORMATION Ms. Nikita P.Katariya 1, Prof. M. S. Chaudhari 2 1 Dept. of Computer Science & Engg, P.B.C.E., Nagpur, India, nikitakatariya@yahoo.com 2 Dept.

More information

Faceted Metadata for Information Architecture and Search. CHI Course - April 24, 2006 Session I

Faceted Metadata for Information Architecture and Search. CHI Course - April 24, 2006 Session I Faceted Metadata for Information Architecture and Search CHI Course - April 24, 2006 Session I Marti Hearst, School of Information, UC Berkeley Preston Smalley & Corey Chandler, ebay User Experience &

More information

System Analysis and Design. Data Flow Diagram. System Analysis and Design

System Analysis and Design. Data Flow Diagram. System Analysis and Design Data Flow Diagram 1 Data Flow diagram The dataflow diagram is a modeling tool that allows us to picture a system as a network of functional processes, connected to one another by pipelines and holding

More information

DATABASE SEARCHING. Instructional guide

DATABASE SEARCHING. Instructional guide University of KwaZulu-Natal Library, Pietermaritzburg DATABASE SEARCHING Instructional guide Databases contain references to journal articles, chapters in books and in some cases, theses and dissertations.

More information

A Data Modeling Process. Determining System Requirements. Planning the Project. Specifying Relationships. Specifying Entities

A Data Modeling Process. Determining System Requirements. Planning the Project. Specifying Relationships. Specifying Entities Chapter 3 Entity-Relationship Data Modeling: Process and Examples Fundamentals, Design, and Implementation, 9/e A Data Modeling Process Steps in the data modeling process Plan project Determine requirements

More information

Math Dr. Miller - Constructing in Sketchpad (tm) - Due via by Friday, Mar. 18, 2016

Math Dr. Miller - Constructing in Sketchpad (tm) - Due via  by Friday, Mar. 18, 2016 Math 304 - Dr. Miller - Constructing in Sketchpad (tm) - Due via email by Friday, Mar. 18, 2016 As with our second GSP activity for this course, you will email the assignment at the end of this tutorial

More information

CHAPTER 2: DATA MODELS

CHAPTER 2: DATA MODELS Database Systems Design Implementation and Management 12th Edition Coronel TEST BANK Full download at: https://testbankreal.com/download/database-systems-design-implementation-andmanagement-12th-edition-coronel-test-bank/

More information

STRUCTURES AND STRATEGIES FOR STATE SPACE SEARCH

STRUCTURES AND STRATEGIES FOR STATE SPACE SEARCH Slide 3.1 3 STRUCTURES AND STRATEGIES FOR STATE SPACE SEARCH 3.0 Introduction 3.1 Graph Theory 3.2 Strategies for State Space Search 3.3 Using the State Space to Represent Reasoning with the Predicate

More information

Semantics Isn t Easy Thoughts on the Way Forward

Semantics Isn t Easy Thoughts on the Way Forward Semantics Isn t Easy Thoughts on the Way Forward NANCY IDE, VASSAR COLLEGE REBECCA PASSONNEAU, COLUMBIA UNIVERSITY COLLIN BAKER, ICSI/UC BERKELEY CHRISTIANE FELLBAUM, PRINCETON UNIVERSITY New York University

More information

A LEXICAL APPROACH FOR TAXONOMY MAPPING

A LEXICAL APPROACH FOR TAXONOMY MAPPING Journal of Web Engineering, Vol. 0, No. 0 (2015) 000 000 c Rinton Press A LEXICAL APPROACH FOR TAXONOMY MAPPING LENNART NEDERSTIGT, DAMIR VANDIC, and FLAVIUS FRASINCAR Econometric Institute, Erasmus University

More information

Using the Web in Your Teaching

Using the Web in Your Teaching Using the Web in Your Teaching November 16, 2001 Dirk Morrison Extension Division, University of Saskatchewan Workshop Outline What will we cover? Why use the Web for teaching and learning? Planning to

More information

International Journal of Advance Engineering and Research Development SENSE BASED INDEXING OF HIDDEN WEB USING ONTOLOGY

International Journal of Advance Engineering and Research Development SENSE BASED INDEXING OF HIDDEN WEB USING ONTOLOGY Scientific Journal of Impact Factor (SJIF): 5.71 International Journal of Advance Engineering and Research Development Volume 5, Issue 04, April -2018 e-issn (O): 2348-4470 p-issn (P): 2348-6406 SENSE

More information

BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network

BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network Roberto Navigli, Simone Paolo Ponzetto What is BabelNet a very large, wide-coverage multilingual

More information

2. An implementation-ready data model needn't necessarily contain enforceable rules to guarantee the integrity of the data.

2. An implementation-ready data model needn't necessarily contain enforceable rules to guarantee the integrity of the data. Test bank for Database Systems Design Implementation and Management 11th Edition by Carlos Coronel,Steven Morris Link full download test bank: http://testbankcollection.com/download/test-bank-for-database-systemsdesign-implementation-and-management-11th-edition-by-coronelmorris/

More information

Introduction to Information Retrieval. Lecture Outline

Introduction to Information Retrieval. Lecture Outline Introduction to Information Retrieval Lecture 1 CS 410/510 Information Retrieval on the Internet Lecture Outline IR systems Overview IR systems vs. DBMS Types, facets of interest User tasks Document representations

More information

Making Sense Out of the Web

Making Sense Out of the Web Making Sense Out of the Web Rada Mihalcea University of North Texas Department of Computer Science rada@cs.unt.edu Abstract. In the past few years, we have witnessed a tremendous growth of the World Wide

More information

Metadata for Digital Collections: A How-to-Do-It Manual. Introduction to Resource Description and Dublin Core

Metadata for Digital Collections: A How-to-Do-It Manual. Introduction to Resource Description and Dublin Core Chapter 2 Supplement Introduction to Resource Description and Dublin Core Questions for Review, Study, or Discussion Resource Description Fundamentals 1. What is resource description? What are some other

More information

This document is a preliminary proposal to encode two characters into Unicode.

This document is a preliminary proposal to encode two characters into Unicode. A preliminary proposal to encode two base characters William J G Overington 19 October 2015 1. Introduction This document is a preliminary proposal to encode two characters into Unicode. The two characters

More information

This is a one-week excerpt from the Starfall Kindergarten Mathematics Teacher s Guide. If you have questions or comments, please contact us.

This is a one-week excerpt from the Starfall Kindergarten Mathematics Teacher s Guide. If you have questions or comments, please contact us. UNIT 6 WEEK 13 This is a one-week excerpt from the Starfall Kindergarten Mathematics Teacher s Guide. If you have questions or comments, please contact us. Email: helpdesk@starfall.com Phone: 1-888-857-8990

More information

Search Engine Optimization

Search Engine Optimization Search Engine Optimization Are you Visible? Presented by Andrew Arenson www.e-ms.com www.verticalrail.com Disciplines of Search Marketing SEO WSO SEM LMO SMO What Is Search Optimization? Key Optimization

More information

Thus, it is reasonable to compare binary search trees and binary heaps as is shown in Table 1.

Thus, it is reasonable to compare binary search trees and binary heaps as is shown in Table 1. 7.2 Binary Min-Heaps A heap is a tree-based structure, but it doesn t use the binary-search differentiation between the left and right sub-trees to create a linear ordering. Instead, a binary heap only

More information

Putting ontologies to work in NLP

Putting ontologies to work in NLP Putting ontologies to work in NLP The lemon model and its future John P. McCrae National University of Ireland, Galway Introduction In natural language processing we are doing three main things Understanding

More information

IBE101: Introduction to Information Architecture. Hans Fredrik Nordhaug 2008

IBE101: Introduction to Information Architecture. Hans Fredrik Nordhaug 2008 IBE101: Introduction to Information Architecture Hans Fredrik Nordhaug 2008 Objectives Defining IA Practicing IA User Needs and Behaviors The anatomy of IA Organizations Systems Labelling Systems Navigation

More information

Mining Opinion Attributes From Texts using Multiple Kernel Learning

Mining Opinion Attributes From Texts using Multiple Kernel Learning Mining Opinion Attributes From Texts using Multiple Kernel Learning Aleksander Wawer axw@ipipan.waw.pl December 11, 2011 Institute of Computer Science Polish Academy of Science Agenda Text Corpus Ontology

More information

(C) 2010 Pearson Education, Inc. All rights reserved. Dr. Marenglen Biba

(C) 2010 Pearson Education, Inc. All rights reserved. Dr. Marenglen Biba Dr. Marenglen Biba In Chapters 12 13, you design and implement an object-oriented automated teller machine (ATM) software system. Concise, carefully paced, complete design and implementation experience.

More information

CHAPTER 10 GRAPHS AND TREES. Alessandro Artale UniBZ - artale/

CHAPTER 10 GRAPHS AND TREES. Alessandro Artale UniBZ -  artale/ CHAPTER 10 GRAPHS AND TREES Alessandro Artale UniBZ - http://www.inf.unibz.it/ artale/ SECTION 10.5 Trees Copyright Cengage Learning. All rights reserved. Trees In mathematics, a tree is a connected graph

More information

Metadata. Week 4 LBSC 671 Creating Information Infrastructures

Metadata. Week 4 LBSC 671 Creating Information Infrastructures Metadata Week 4 LBSC 671 Creating Information Infrastructures Muddiest Points Memory madness Hard drives, DVD s, solid state disks, tape, Digitization Images, audio, video, compression, file names, Where

More information

Nominal Data. May not have a numerical representation Distance measures might not make sense. PR and ANN

Nominal Data. May not have a numerical representation Distance measures might not make sense. PR and ANN NonMetric Data Nominal Data So far we consider patterns to be represented by feature vectors of real or integer values Easy to come up with a distance (similarity) measure by using a variety of mathematical

More information

CHAPTER 2: DATA MODELS

CHAPTER 2: DATA MODELS CHAPTER 2: DATA MODELS 1. A data model is usually graphical. PTS: 1 DIF: Difficulty: Easy REF: p.36 2. An implementation-ready data model needn't necessarily contain enforceable rules to guarantee the

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY

WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.4, April 2009 349 WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY Mohammed M. Sakre Mohammed M. Kouta Ali M. N. Allam Al Shorouk

More information

MENUS & MENU CYCLES MENU SOFTWARE MENU ENHANCEMENT NUTRITION MODULE

MENUS & MENU CYCLES MENU SOFTWARE MENU ENHANCEMENT NUTRITION MODULE MENUS & MENU CYCLES MENU SOFTWARE MENU ENHANCEMENT NUTRITION MODULE USER S GUIDE Texas Department of Agriculture Revised 11/14/2017 2 OTHER RESOURCES In addition to the User s Guide:, you may use the following

More information

Random Walks for Knowledge-Based Word Sense Disambiguation. Qiuyu Li

Random Walks for Knowledge-Based Word Sense Disambiguation. Qiuyu Li Random Walks for Knowledge-Based Word Sense Disambiguation Qiuyu Li Word Sense Disambiguation 1 Supervised - using labeled training sets (features and proper sense label) 2 Unsupervised - only use unlabeled

More information

Punjabi WordNet Relations and Categorization of Synsets

Punjabi WordNet Relations and Categorization of Synsets Punjabi WordNet Relations and Categorization of Synsets Rupinderdeep Kaur Computer Science Engineering Department, Thapar University, rupinderdeep@thapar.edu Suman Preet Department of Linguistics and Punjabi

More information

Access Control Lists. Don Porter CSE 506

Access Control Lists. Don Porter CSE 506 Access Control Lists Don Porter CSE 506 Background (1) ò If everything in Unix is a file ò Everything in Windows is an object ò Why not files? ò Not all OS abstractions make sense as a file ò Examples:

More information

Photoshop for Geeks Photoshop Scripting for Beginners Photoshop World Orlando, 2008 Jeffrey Tranberry Adobe Systems, QE Product Lead

Photoshop for Geeks Photoshop Scripting for Beginners Photoshop World Orlando, 2008 Jeffrey Tranberry Adobe Systems, QE Product Lead Photoshop Scripting for Beginners Photoshop World Orlando, 2008 Jeffrey Tranberry Adobe Systems, QE Product Lead What you will get out of this class What is Scripting? What you will get out of this class

More information

Tags, Categories and Keywords

Tags, Categories and Keywords Tags, Categories and Keywords Document Management Tip Sheet As more and more content gets added to your repository, it will become harder to find what you need. Documents may become buried in multi-level

More information

MENU MANAGER HANDBOOK

MENU MANAGER HANDBOOK MENU MANAGER HANDBOOK Wayne Toczek, CEO Innovations Services 102 Parson Street Norwalk, Ohio 44857 419-541-7288 http://www.innovaservices.info/ 11/14/2012 TABLE OF CONTENTS A. INSTALLING YOUR MENU MANAGER

More information

Finding Images in an OPAC: Analysis of User Queries & Subject Search for Images

Finding Images in an OPAC: Analysis of User Queries & Subject Search for Images Finding Images in an OPAC: Analysis of User Queries & Subject Search for Images Young Choi & Ingrid Hsieh Yee School of Library and Information Science Catholic University of America Bridging the Spectrum

More information

Chapter 6: Information Retrieval and Web Search. An introduction

Chapter 6: Information Retrieval and Web Search. An introduction Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods

More information

Information Retrieval and Web Search

Information Retrieval and Web Search Information Retrieval and Web Search Relevance Feedback. Query Expansion Instructor: Rada Mihalcea Intelligent Information Retrieval 1. Relevance feedback - Direct feedback - Pseudo feedback 2. Query expansion

More information

Question Answering Approach Using a WordNet-based Answer Type Taxonomy

Question Answering Approach Using a WordNet-based Answer Type Taxonomy Question Answering Approach Using a WordNet-based Answer Type Taxonomy Seung-Hoon Na, In-Su Kang, Sang-Yool Lee, Jong-Hyeok Lee Department of Computer Science and Engineering, Electrical and Computer Engineering

More information

Tree Structures. A hierarchical data structure whose point of entry is the root node

Tree Structures. A hierarchical data structure whose point of entry is the root node Binary Trees 1 Tree Structures A tree is A hierarchical data structure whose point of entry is the root node This structure can be partitioned into disjoint subsets These subsets are themselves trees and

More information

Oracle Hyperion Financial Management Instructor-led Live Online Training Program

Oracle Hyperion Financial Management Instructor-led Live Online Training Program 1. Introduction to Financial Management About Oracle's Enterprise Performance Management Suite Financial Management Solution Financial Consolidation, Reporting, Analysis and Product Components Financial

More information

Enzo-P / Cello. Scalable Adaptive Mesh Refinement for Astrophysics and Cosmology. San Diego Supercomputer Center. Department of Physics and Astronomy

Enzo-P / Cello. Scalable Adaptive Mesh Refinement for Astrophysics and Cosmology. San Diego Supercomputer Center. Department of Physics and Astronomy Enzo-P / Cello Scalable Adaptive Mesh Refinement for Astrophysics and Cosmology James Bordner 1 Michael L. Norman 1 Brian O Shea 2 1 University of California, San Diego San Diego Supercomputer Center 2

More information

CS 310 B-trees, Page 1. Motives. Large-scale databases are stored in disks/hard drives.

CS 310 B-trees, Page 1. Motives. Large-scale databases are stored in disks/hard drives. CS 310 B-trees, Page 1 Motives Large-scale databases are stored in disks/hard drives. Disks are quite different from main memory. Data in a disk are accessed through a read-write head. To read a piece

More information

B-Trees. Version of October 2, B-Trees Version of October 2, / 22

B-Trees. Version of October 2, B-Trees Version of October 2, / 22 B-Trees Version of October 2, 2014 B-Trees Version of October 2, 2014 1 / 22 Motivation An AVL tree can be an excellent data structure for implementing dictionary search, insertion and deletion Each operation

More information

Optimal Query. Assume that the relevant set of documents C r. 1 N C r d j. d j. Where N is the total number of documents.

Optimal Query. Assume that the relevant set of documents C r. 1 N C r d j. d j. Where N is the total number of documents. Optimal Query Assume that the relevant set of documents C r are known. Then the best query is: q opt 1 C r d j C r d j 1 N C r d j C r d j Where N is the total number of documents. Note that even this

More information

MedLingMap: A growing resource mapping the Bio-Medical NLP field

MedLingMap: A growing resource mapping the Bio-Medical NLP field MedLingMap: A growing resource mapping the Bio-Medical NLP field Marie Meteer, Bensiin Borukhov, Michael Crivaro, Michael Shafir, Attapol Thamrongrattanarit {mmeteer, bborukhov, mcrivaro, mshafir, tet}@brandeis.edu

More information

Enhanced retrieval using semantic technologies:

Enhanced retrieval using semantic technologies: Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008

More information

Trees and Intro to Counting

Trees and Intro to Counting Trees and Intro to Counting CSE21 Winter 2017, Day 15 (B00), Day 10/11 (A00) February 15, 2017 http://vlsicad.ucsd.edu/courses/cse21-w17 Another Special Type of Graph: Trees (Rooted) Trees: definitions

More information

Ontology-based Navigation of Bibliographic Metadata: Example from the Food, Nutrition and Agriculture Journal

Ontology-based Navigation of Bibliographic Metadata: Example from the Food, Nutrition and Agriculture Journal Ontology-based Navigation of Bibliographic Metadata: Example from the Food, Nutrition and Agriculture Journal Margherita Sini 1, Gauri Salokhe 1, Christopher Pardy 1, Janice Albert 1, Johannes Keizer 1,

More information

Instructions for Using the DAILY COOK'S SUBSTITUTE MENU

Instructions for Using the DAILY COOK'S SUBSTITUTE MENU Instructions for Using the DAILY COOK'S SUBSTITUTE MENU Revised September, 2006 Several additional menu items have been developed for the three meals. These Daily Cook's Substitute Menus allow you to quickly

More information

"Leveraging FIBO with Semantic Analysis to Perform On-Boarding, KYC and CDD" Bryan Bell & Elisa Kendall

Leveraging FIBO with Semantic Analysis to Perform On-Boarding, KYC and CDD Bryan Bell & Elisa Kendall Ontology Summit 2016 Track B 12 April 2017 "Leveraging FIBO with Semantic Analysis to Perform On-Boarding, KYC and CDD" Bryan Bell & Elisa Kendall linkedin.com/company/expert-system twitter.com/expert_system

More information

McDonald s Australia Beverages Allergen - Ingredients - Nutrition Information

McDonald s Australia Beverages Allergen - Ingredients - Nutrition Information McDonald s Australia Beverages Allergen - Ingredients - Nutrition Information Information correct as at 16 May 2017 At McDonald's we believe in the nutritional principals of balance, variety and moderation

More information

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany Information Systems University of Koblenz Landau, Germany Ontology 101 Design principles Ontology design principles Based on paper by Natasha Noy & Deborah McGuinness Ontology Development 101: A Guide

More information

CHAPTER 3 INFORMATION RETRIEVAL BASED ON QUERY EXPANSION AND LATENT SEMANTIC INDEXING

CHAPTER 3 INFORMATION RETRIEVAL BASED ON QUERY EXPANSION AND LATENT SEMANTIC INDEXING 43 CHAPTER 3 INFORMATION RETRIEVAL BASED ON QUERY EXPANSION AND LATENT SEMANTIC INDEXING 3.1 INTRODUCTION This chapter emphasizes the Information Retrieval based on Query Expansion (QE) and Latent Semantic

More information

Commerce 1E03 Business Environment & Organization

Commerce 1E03 Business Environment & Organization Commerce 1E03 Business Environment & Organization Ines Perkovic Business Librarian Innis Library KTH-108 Winter 2018 McMaster University Libraries library.mcmaster.ca 905.525.9140 x22081 library@mcmaster.ca

More information

Activity 1.1.1: A Mysterious Death

Activity 1.1.1: A Mysterious Death Activity 1.1.1: A Mysterious Death Part II: Processing a Crime Scene Concept Map Although every crime scene is unique, five basic tasks need to be completed in order to properly process a crime scene.

More information

TxAIR Training Manual

TxAIR Training Manual Introduction and Overview TxAIR, the Texas Achievement Items Repository, is a TEKS-aligned system that provides science and mathematics resources for grades 3 8, Integrated Physics and Chemistry, Biology,

More information

Trees. Eric McCreath

Trees. Eric McCreath Trees Eric McCreath 2 Overview In this lecture we will explore: general trees, binary trees, binary search trees, and AVL and B-Trees. 3 Trees Trees are recursive data structures. They are useful for:

More information

NATURAL LANGUAGE PROCESSING

NATURAL LANGUAGE PROCESSING NATURAL LANGUAGE PROCESSING LESSON 9 : SEMANTIC SIMILARITY OUTLINE Semantic Relations Semantic Similarity Levels Sense Level Word Level Text Level WordNet-based Similarity Methods Hybrid Methods Similarity

More information

Action Planning based on Open Knowledge Graphs and LOD

Action Planning based on Open Knowledge Graphs and LOD Action Planning based on Open Knowledge Graphs and LOD Seiji Koide 1, Fumihiro Kato 1, Hideaki Takeda 12, Yuta Ochiai 3, and Kenki Ueda 3 1 National Institute of Informatics, 2-1-2 Hitotsubashi, Chiyoda-ku,

More information

CMU Lecture 18: Deep learning and Vision: Convolutional neural networks. Teacher: Gianni A. Di Caro

CMU Lecture 18: Deep learning and Vision: Convolutional neural networks. Teacher: Gianni A. Di Caro CMU 15-781 Lecture 18: Deep learning and Vision: Convolutional neural networks Teacher: Gianni A. Di Caro DEEP, SHALLOW, CONNECTED, SPARSE? Fully connected multi-layer feed-forward perceptrons: More powerful

More information

Your Total Training Resource. Microsoft OneNote. To Schedule / Need Additional Information

Your Total Training Resource. Microsoft OneNote. To Schedule / Need Additional Information Microsoft OneNote It's your very own digital notebook, so you can keep notes, ideas, web pages, photos, even audio and video all in one place. Whether you're at home, in the office, or on the move, you

More information

Using Hash based Bucket Algorithm to Select Online Ontologies for Ontology Engineering through Reuse

Using Hash based Bucket Algorithm to Select Online Ontologies for Ontology Engineering through Reuse Using Hash based Bucket Algorithm to Select Online Ontologies for Ontology Engineering through Reuse Nadia Imdadi Dept. of Computer Science Jamia Millia Islamia a Central University, New Delhi India Dr.

More information

CSE 214 Computer Science II Introduction to Tree

CSE 214 Computer Science II Introduction to Tree CSE 214 Computer Science II Introduction to Tree Fall 2017 Stony Brook University Instructor: Shebuti Rayana shebuti.rayana@stonybrook.edu http://www3.cs.stonybrook.edu/~cse214/sec02/ Tree Tree is a non-linear

More information

Challenge. Case Study. The fabric of space and time has collapsed. What s the big deal? Miami University of Ohio

Challenge. Case Study. The fabric of space and time has collapsed. What s the big deal? Miami University of Ohio Case Study Use Case: Recruiting Segment: Recruiting Products: Rosette Challenge CareerBuilder, the global leader in human capital solutions, operates the largest job board in the U.S. and has an extensive

More information

A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet

A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet A Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet Joerg-Uwe Kietz, Alexander Maedche, Raphael Volz Swisslife Information Systems Research Lab, Zuerich, Switzerland fkietz, volzg@swisslife.ch

More information

AUTOMATED DISCOVERY OF TELIC RELATIONS FOR WORDNET MARCO DE BONI SURESH MANANDHAR

AUTOMATED DISCOVERY OF TELIC RELATIONS FOR WORDNET MARCO DE BONI SURESH MANANDHAR AUTOMATED DISCOVERY OF TELIC RELATIONS FOR WORDNET MARCO DE BONI SURESH MANANDHAR Introduction Background Terms Glosses Every word and synset in WordNet contains a short (about one sentence long) description

More information

Data Exchange. Hyper-Text Markup Language. Contents: HTML Sample. HTML Motivation. Cascading Style Sheets (CSS) Problems w/html

Data Exchange. Hyper-Text Markup Language. Contents: HTML Sample. HTML Motivation. Cascading Style Sheets (CSS) Problems w/html Data Exchange Contents: Mariano Cilia / cilia@informatik.tu-darmstadt.de Origins (HTML) Schema DOM, SAX Semantic Data Exchange Integration Problems MIX Model 1 Hyper-Text Markup Language HTML Hypertext:

More information

Data Analysis 1. Chapter 2.1 V3.1. Napier University Dr Gordon Russell

Data Analysis 1. Chapter 2.1 V3.1. Napier University Dr Gordon Russell Data Analysis 1 Chapter 2.1 V3.1 Copyright @ Napier University Dr Gordon Russell Entity Relationship Modelling Overview Database Analysis Life Cycle Components of an Entity Relationship Diagram What is

More information

Results of NBJLM for OAEI 2010

Results of NBJLM for OAEI 2010 Results of NBJLM for OAEI 2010 Song Wang 1,2, Gang Wang 1 and Xiaoguang Liu 1 1 College of Information Technical Science, Nankai University Nankai-Baidu Joint Lab, Weijin Road 94, Tianjin, China 2 Military

More information

LexiRes: A Tool for Exploring and Restructuring EuroWordNet for Information Retrieval

LexiRes: A Tool for Exploring and Restructuring EuroWordNet for Information Retrieval LexiRes: A Tool for Exploring and Restructuring EuroWordNet for Information Retrieval Ernesto William De Luca and Andreas Nürnberger 1 Abstract. The problem of word sense disambiguation in lexical resources

More information

Chapter 27 Introduction to Information Retrieval and Web Search

Chapter 27 Introduction to Information Retrieval and Web Search Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval

More information

Entity Relationship Modelling

Entity Relationship Modelling Entity Relationship Modelling Overview Database Analysis Life Cycle Components of an Entity Relationship Diagram What is a relationship? Entities, attributes, and relationships in a system The degree of

More information

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,

More information