Knowledge Harvesting for Business Intelligence

1 Knowledge Harvesting for Business Intelligence Nesrine Ben Mustapha and Marie-Aude Aufaure 1

3 Context: Intelligence in Business Intelligence (BI)? Intelligence: application of information, skills, experience and reasoning to solve business problems. Information acquisition from a wide variety of sources. Harvesting knowledge for decision making.

4 Talk Overview. Context of BI: need for semantics. Correlated dimensions related to semantic BI: Web evolution; semantic evolution; new trend of search paradigm; progress in ontology engineering. Ontology-based knowledge harvesting for Business Intelligence.

5 Context Data, Information and Knowledge in BI Process Data What data is to be collected and managed and in what context? 5

6 Context Data of BI Related Stock News COMPANIES in Same or Related INDUSTRY Business Data Competition COMPANIES in INDUSTRY with Competing PRODUCTS Technology Products Important to INDUSTRY or COMPANY Industry News Regulations Impacting INDUSTRY or Filed By COMPANY 6

7 Context Data, Information and Knowledge in BI Data What data is to be collected and managed and in what context? Information Knowledge Data warehousing, online analytical processing, data quality, data profiling, business rule analysis, and data mining 7

8 Context Data, Information and Knowledge in BI Data What data is to be collected and managed and in what context? Information Knowledge Business plans Explicit, embedded or tacit Knowledge? Data warehousing, online analytical processing, data quality, data profiling, business rule analysis, and data mining 8

9 Context: Business Intelligence: closed or open environment. BI environment and competition. Knowledge Exploiters. Knowledge learners: may be focused on certain areas and not able to integrate different streams of knowledge. Knowledge Explorers: maintain a good balance between internal and external learning. Knowledge Innovators: combine internal and external learning.

10 Motivating Use cases Business Case study in Knowledgeweb Network Main Goal: Transfer of ontology-based technologies from the field of academia to industry Service Industry: Recruitment B2C market place for tourism Media and comms Multimedia Content analysis News aggregation service Technology providers Product lifecycle management Health: Data warehousing in healthcare Hospital information systems 10

11 Motivating Use cases: Recruitment. Current practices. Job request: creation of electronic profiles (contact, qualifications, work experience, skills, etc.) for the job seeker; searching the database of vacancies according to the described profile; the application is automatically forwarded to the firm. Job offers (state job center): creation of an electronic description of the position; publishing the description to the database of vacancies; requesting a list of suitable candidates (only job seekers registered in that system); automatic sending of e-mails to the selected candidates. Examples: German Federal Employment Office, Monster, Jobpilot.

12 Motivating Use cases: Recruitment. Challenges and new requirements: efficiently fill open job vacancies with suitably qualified candidates; automatic matching between job offers and job seekers. Semantic solution: semantic support based on expressing relationships between job characteristics and candidate qualifications (representing, searching, sharing on the web); semantic matching; knowledge for decision making.

13 Motivating Use cases: Semantic solution and key business benefit. Solution [figure labels]: job offers, semantic querying, matchmaker, job seekers' profiles, metadata crawler, knowledge base, enriching semantics (ontologies).

15 Motivating Use cases B2C market place for tourism Current systems Only information pages and no tourism offers No personalized services 15 19/07/2012

16 Motivating Use cases B2C market place for tourism Challenges and new requirements Offer on-line personalized tourism packages 16

17 Motivating Use cases: B2C market place for tourism. Challenges and new requirements: offer on-line personalized tourism packages; geo-localization; dynamic exploitation of content, service providers and personalized data. Semantic solution: semantic data integration; natural language processing; personal data representation and exploitation; semantic web services. Business partners: tourism content providers, tour operators, regional tourism councils. In France, the tourism market was valued at 32 billion euros, of which river tourism represents a turnover greater than 250 M euros.

19 Motivating Use cases Multimedia Content analysis Current systems Difficult to develop and maintain large multimedia databases Difficult to organize, find and distribute multimedia content Solutions Multimedia data can be annotated in terms of knowledge extracted from it Machine processable data models supporting Semantic search Navigation Reasoning functionalities 19

20 Motivating Use cases Multimedia Content analysis AceMedia Project 20

21 Motivating Use cases Multimedia Content analysis 21

23 Motivating Use cases: News aggregation service. Businesses are interested in following news in specific categories, including economics, sciences and IT, or news on specific companies. Doing this by hand is a tedious task! [Figure labels: Knowledge Exploiters, BI Environment, Competition]

24 Motivating Use cases: News aggregation service. Traditional systems: feed services based on RSS or Atom.

25 Motivating Use cases News aggregation service 25

26 Motivating Use cases: News aggregation service. Traditional systems: feed services based on RSS or Atom use a very basic model (e.g. title, author, link to full story) that is not suitable for any intelligent searching or organizing. Portals such as Google News: a large body of information that can be processed.

27 Motivating Use cases News aggregation service Current Semantic approaches The news aggregation service from neofonie GmbH 27

28 Motivating Use cases News aggregation service 28

29 Motivating Use cases: News aggregation service. Current semantic approaches: the news aggregation service from neofonie GmbH. Manual creation, by a source expert, of an XSLT template for each news source. Automatic processing of that news source through a thematic clustering algorithm (NLP) and classification with category mappings. Extraction of semantics from the source documents.

31 Motivating Use cases: Data warehousing in healthcare. A large health insurance company uses a Cognos data warehousing solution to administer its data. Business data are stored on various PCs and do not share the same data formats. Results: manual search over data sources; need to introduce a common terminology for healthcare data; problem of updating data. Semantic solution: a common terminology of the healthcare domain for ontology-based integration.

32 Motivating Use cases Data warehousing in healthcare Data Mart ontology Builder Knowledge Explorers BI Tools Data Mart Data Mart Core Inference Engine 32

34 Motivating Use cases: Hospital information systems. Hospitals have dispersed data sources: administrative information about patients, diagnoses and treatment history for each division. Stored data come in different formats (databases, texts). Need for efficient access. Semantic solution: ontology engineering from unstructured data; data wrapper (from database to ontology); query mediation and semantic matching solution; middleware for database integration.

35 Correlated dimensions related to semantic BI Semantic Information Retrieval Ontology Engineering Semantic Representation Web 35

36 Evolution of Web: Semantic Web Towards Open Linked Data 36

37 Evolution of the web [Figure: timeline. Eras: Desktop PC era (1980), Web 1.0 (1990), Semantic Web (Web 2.0) (2000), Web 3.0 (2010), Web 4.0 (2020). Labels: file systems, file servers, databases, groupware, web sites, directory portals, weblogs, social networking, Linked Open Data, YAGO-NAGA]

38 Evolution of the web Semantic web Web 1.0 Web 2.0 Web Web

39 Evolution of the web: from Wikipedia to YAGO, DBpedia and Open Linked Data (OLD)

40 Evolution of the web: Existing large knowledge bases. Wikipedia: extraction of structured information (infoboxes) => DBpedia; NLP-based extraction => YAGO. Linked Open Data: WordNet, GeoNames. Event, time and spatial data extraction => YAGO 2.

41 Evolution of the web: Existing large knowledge bases [Figure: number of facts per knowledge base for KnowItAll, SUMO, WordNet, OpenCyc, Cyc and YAGO; legible axis values include 30,000, 2,000,000 and 6,000,000]. Ontologies should not be judged purely by the number of facts! This is just an informational overview.

42 Evolution of the web Existing large knowledge bases 42 18/07/2012

43 Evolution of the web Existing large knowledge bases Open Linked Data 43 18/07/2012

44 Correlated dimensions related to semantic BI Information Retrieval Web Ontology Engineering Semantic Representation 44

45 Evolution of Semantics: From Dictionaries To Ontologies 45

46 Evolution of Semantics Semantic Web (Web2.0)(2000) Web Web 3.0 (2010) Web 1.0 (1990) Linked Open Data (LOD) YAGO-NAGA 46

47 Evolution of Semantics: Levels of Knowledge Representation 47

48 Evolution of Semantics: Levels of Knowledge Representation Menu Object Person Topic Document Student Researcher Semantics Doctoral Student PhD Student F-Logic Ontology A Taxonomy 48

49 Evolution of Semantics: Levels of Knowledge Representation. Thesaurus: graph with primitives and 2 fixed relationships (similar, synonym). [Figure labels: Menu, Object, Person, Topic, Document, Student, Researcher, Semantics, Doctoral Student, PhD Student, F-Logic, Ontology]

50 Evolution of Semantics: Levels of Knowledge Representation. Topic Map. [Figure labels: Menu, Object, Person, Topic, Document, Student, Researcher, Semantics, Doctoral Student, PhD Student, F-Logic, Ontology; relations: knows, writes, described_in, synonym, similar; attributes: Tel, Affiliation]

51 Evolution of Semantics: Levels of Knowledge Representation. Ontology with rules. [Figure: concepts Object, Person, Topic, Document, Student, Researcher, Doctoral Student and PhD Student linked by is_a; relations knows, writes, described_in, subtopicOf and similar; attributes Tel and Affiliation; instance York Sure (instance_of PhD Student, affiliation AIFB); topics Semantics, F-Logic, Ontology.] Rules: T described_in D => D is_about T; P writes D and D is_about T => P knows T.

52 Evolution of Semantics: Impact on business domain After 2005 Before Information Systems Engineering (Infological view) Enterprise Engineering (Ontological view on enterprise) Data Systems Engineering (Datalogical view) 52

53 Evolution of Semantics: Ontology definition Origin: A branch of philosophy that investigates and explains the nature and essential properties and relations of all beings, or the principles and causes of being. Computer Science (Artificial Intelligence): An ontology is a formal and explicit specification of a shared conceptualization (Gruber, 93) among a community of people (and agents) of a common area of interest. 53

54 Evolution of Semantics: Ontology: communication principle. Meaning triangle [Ogden, Richards, 1923]: a form (e.g. "Jaguar") evokes a concept; the concept refers to a referent; the form stands for the referent.

55 Evolution of Semantics: Ontology structure. Concept: class of objects referred to by a set of terms (synonyms), e.g. Customer, Product, Marketing strategy, Person, Invoice. Taxonomic (hyponymy) relations: is-a relation, e.g. a customer is a person. Meronymy relations: part-of relation, e.g. an invoice includes marketing strategies.

56 Evolution of Semantics: Ontology structure. Non-taxonomic relations: associative properties between concepts, e.g. a customer subscribes to products; an invoice belongs to a customer; an invoice has products. Axioms: define the meaning of concepts; set restrictions on attribute values; verify the validity of specified knowledge.
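The concept and relation examples on the two slides above can be written down as subject/relation/object triples. The following is only a minimal sketch: the relation names (is_a, subscribes_to, belongs_to, has) are chosen for illustration, and the concept PremiumCustomer is hypothetical, added to show a two-step is-a chain.

```python
# Triples taken from the slide examples; "PremiumCustomer" is a hypothetical
# extra concept used only to illustrate a transitive is_a chain.
TRIPLES = {
    ("Customer", "is_a", "Person"),
    ("PremiumCustomer", "is_a", "Customer"),
    ("Customer", "subscribes_to", "Product"),
    ("Invoice", "belongs_to", "Customer"),
    ("Invoice", "has", "Product"),
}


def superclasses(concept, triples=TRIPLES):
    """Return every concept reachable from `concept` through is_a links."""
    found = set()
    frontier = [concept]
    while frontier:
        current = frontier.pop()
        for subject, relation, obj in triples:
            if subject == current and relation == "is_a" and obj not in found:
                found.add(obj)
                frontier.append(obj)
    return found


def related(concept, triples=TRIPLES):
    """Return the non-taxonomic (relation, object) pairs stated for `concept`."""
    return {(r, o) for s, r, o in triples if s == concept and r != "is_a"}
```

Taxonomic links are followed transitively, while non-taxonomic relations are only looked up as stated.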

57 Evolution of Semantics: Ontology structure. Rules: used to infer additional statements. Example: the discount for a customer buying a product is 7.5 percent if the customer is premium and the product is luxury. <Implies> <head> <Atom> <Rel>discount</Rel> <Var>customer</Var> <Var>product</Var> <Ind>7.5 percent</Ind> </Atom> </head> <body> <And> <Atom> <Rel>premium</Rel> <Var>customer</Var> </Atom> <Atom> <Rel>luxury</Rel> <Var>product</Var> </Atom> </And> </body> </Implies>
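Read procedurally, the rule above says: if both body atoms (premium customer, luxury product) hold, infer the discount statement. A rough Python sketch of that reading follows; the Customer and Product classes and the discount function are illustrative names, not part of any rule engine, and a real system would evaluate the rule declaratively.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Customer:
    name: str
    premium: bool  # corresponds to the atom premium(customer)


@dataclass(frozen=True)
class Product:
    name: str
    luxury: bool  # corresponds to the atom luxury(product)


def discount(customer, product):
    """Return the discount in percent if the rule body holds, else None."""
    if customer.premium and product.luxury:
        return 7.5
    return None
```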

58 Evolution of Semantics: Ontology Types. Four typologies in the literature: formalization degree (formal, informal and semi-formal); granularity degree; level of completeness; domain of knowledge. Types: top-level ontology (generic ontology); lexical ontology (WordNet); domain ontology; ontology of tasks; application ontology.

59 Evolution of Semantics: Ontologies: Some Examples. General purpose ontologies: WordNet, EuroWordNet. Upper level ontologies: DOLCE, Upper-Cyc Ontology, IEEE Standard Upper Ontology. Domain and application-specific ontologies: RDF Site Summary (RSS), UMLS, AIFB Web Page Ontology, Web-KB Ontology, Dublin Core, Gene Ontology, MeSH Ontology.

60 Evolution of Semantics: Enterprise Ontology Enterprise Ontology: a formal and explicit specification of a shared conceptualization among a community of people (managers, developers, employees and users) 60

61 Evolution of Semantics: Enterprise Ontology: CoProE Ontology. CoProE Ontology: a newsevents ontology [Lösch, 2009] and the United Nations Standard Products and Services Code (UNSPSC) classification of products and segments of industries. Project: "Towards e-leadership: Higher profitability through innovative management and leadership systems". Reference: Lösch, U., Nikitina, N.: The newsevents Ontology: An Ontology for Describing Business Events. In: 8th International Semantic Web Conference, 1st Workshop on Ontology Design Patterns, Washington DC, USA (2009).

62 Evolution of Semantics: Enterprise Ontology: CoProE Ontology 62

63 Evolution of Semantics: Marketing ontology 63

64 Correlated dimensions related to semantic BI Information Retrieval Web Ontology Engineering 64

65 New trends of search paradigm 65

66 Semantic web search classification in Web 2.0. Ontology search engines: (a) ontology meta-search: OntoSearch [Y. Zhang et al., 2004], Swangler [T. Finin et al., 2005]; (b) crawler-based ontology search: Ontokhoj [C. Patel et al., 2003], Swoogle [T. Finin et al., 2005]. Semantic search engines: (a) contextual semantic search: QuizRDF [J. Davies et al., 2005], Corese [O. Corby], Infofox [B. Sigrist et al., 2003], SHOE [B. Aleman-Meza et al., 2003], DOSE [D. Bonino 2003], SERSE [V. Tamma, 2004]; (b) evolutive semantic search: W3C Semantic Search [R. Guha et al., 2005]; (c) semantic association discovery: SemDis [C. Rocha et al., 2004].

67 Ontology Search Engines. Ontology meta-search (OntoSearch, Swangler): use of conventional search engines; provide specific types of files (RSS, RDF, OWL); search by name or with options like filetype.

68 Ontology Search Engines OntoSearch 68

69 Ontology Search Engines OntoSearch 69

71 Ontology Search Engines. Crawler-based ontology search (Ontokhoj, Swoogle): specific crawler for semantic web documents.

72 Ontology Search Engines 72

74 Semantic search Engine 1. Contextual semantic search Annotation service Indexation service Web Crawler Context detector and query annotator Ontologies Document filtering service Concept Matching service 74

75 Semantic search Engine 1. Contextual semantic search 75

76 Semantic search Engine 1. Contextual semantic search 76

78 Semantic search Engine 2. Evolutive semantic search 78

80 Semantic search Engine 3. Association Discovery 80

81 Enterprise Search Intelligent Content = What You Asked for + What you need to know! Content which does contain the words the user asked for Content which does not contain the words the user asked for, but is about what he asked for. Content the user did not think to ask for, but which he needs to know. Extractor Agents + Value-added Metadata + Semantic Associations 81

82 Correlated dimensions related to semantic BI Ontology Engineering Web Semantic Search Syntactic Search Document retrieval Bag of words 82

83 Progress in Ontology Engineering Research 83

84 Ontology engineering vs. learning. Ontology engineering (expert-driven, small-scale): knowledge and concept identification; preliminary informal representation; formalization (RDF, OWL, etc.); evaluation and maintenance. Ontology learning (data-driven, large-scale): source selection; data exploration; concept and relation learning; evaluation and updating.

85 Ontology Learning 85

86 Ontology Learning History 86

87 Manual Ontology engineering Manual ontology engineering: Ontology building «from scratch» [Fernandez et al., 99] OTK Methodology: Knowledge Meta Process [Y. Sure and R. Studer, 2002] 87

88 Manual Ontology engineering. Manual ontology engineering: ontology building «from scratch» [Fernandez et al., 99]; OTK Methodology: Knowledge Meta Process [Y. Sure and R. Studer, 2002]; Methontology [Fernández-Lopez et al., 1997]. But manual engineering of ontologies is a very time-consuming task! => Need for automatic ways to reduce the engineering burden!

89 Domain Ontology Learning Knowledge Discovery ONTOLOGY LEARNING Ontology Engineering Ontology learning is defined as an approach of ontology building from knowledge sources using a set of machine learning techniques and knowledge acquisition methods. 89

90 Domain Ontology Learning: main steps (from terms up to axioms). Terms: disease, illness, hospital. (Multilingual) synonyms: {disease, illness, Krankheit}. Concepts: DISEASE := <Int, Ext, Lex>. Taxonomy: is_a(doctor, person). Relations: cure(dom: doctor, range: disease). Axioms & rules: $\forall x, y\,(\mathit{sufferfrom}(x, y) \rightarrow \mathit{ill}(x))$. Introduced in: Philipp Cimiano, PhD Thesis, University of Karlsruhe, forthcoming.

91 Domain Ontology Learning 91

92 Domain Ontology learning from Texts Domain MDX 92

93 Domain Ontology learning from Texts: Linguistic analysis for domain term extraction (example terms: MDX, MDX language, Multidimensional Expressions (MDX)). Shallow linguistic parsing and linguistic patterns. Constituency grammar: based on the position of the words in a sentence and how they can be grouped => lexico-syntactic patterns. Dependency grammar: provides binary grammatical links between words in a sentence => dependency relationships.

94 Domain Ontology learning from Texts: Term Extraction. Text: "Multidimensional Expressions (MDX) is a query language for OLAP databases". Linguistic analysis: constituency grammar; dependency grammar.

95 Domain Ontology learning from Texts: Domain term extraction (example terms: MDX, MDX language, Multidimensional Expressions (MDX)). Linguistic analysis: shallow linguistic parsing and linguistic patterns; constituency grammar; dependency grammar. Statistical analysis: co-occurrence analysis; selection measures (IR), i.e. methods that compute a score representing the relationship between two terms and retain the pairs whose score exceeds a given threshold.

96 Domain Ontology learning from Texts: Term Extraction. Statistical analysis measures. TF-IDF [Robertson 1976]: Term_frequency(t, d) * Inverse_document_frequency(t, D). Entropy [Brini 2005]: $\sum_{t \in D_i} P(t) \log(P(t))$. Relevance to the domain. Consensus to the domain. Pointwise Mutual Information (PMI).
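As an illustration of the TF-IDF measure named on this slide, here is a minimal sketch. It assumes one common variant (relative term frequency times the natural log of N/df); the slide does not say which weighting the authors used, and the function name tf_idf is illustrative.

```python
import math
from collections import Counter


def tf_idf(term, doc, corpus):
    """TF-IDF of `term` in `doc`; `doc` and each corpus entry are token lists.

    Assumed variant: tf = count / len(doc), idf = ln(N / df).
    """
    if not doc:
        return 0.0
    tf = Counter(doc)[term] / len(doc)
    df = sum(1 for d in corpus if term in d)  # documents containing the term
    if df == 0:
        return 0.0
    return tf * math.log(len(corpus) / df)
```

A term that occurs in every document gets idf = ln(1) = 0, so it is never selected as a domain term under this variant.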

97 Domain Ontology learning from Texts: Term Extraction. Statistical analysis measures: TF-IDF [Robertson 1976]; entropy [Brini 2005]; relevance to the domain [Velardi, 2001]; consensus to the domain; Pointwise Mutual Information (PMI). Relevance to the domain, illustrated for the term t = MDX over the domains Economy, Business analysis, Finance and Accounting: $RD(t, domain_i) = \frac{P(t, domain_i)}{\sum_{j=1..n} P(t, domain_j)}$, with $P(t, domain_i) = \frac{freq(t \text{ in } domain_i)}{\sum_{j=1..n} p(t, domain_j)}$.

98 Domain Ontology learning from Texts: Term Extraction. Statistical analysis measures: TF-IDF [Robertson 1976]; entropy [Brini 2005]; relevance to the domain [Velardi, 2001]; consensus to the domain; Pointwise Mutual Information (PMI). Consensus to the domain: analysis of the distribution of the term t over a set of documents $d_j$ belonging to the domain $D_i$: $CD(t, D_i) = \sum_{d_j \in D_i} P(t, d_j) \log_2\left(\frac{1}{P(t, d_j)}\right)$, with $P(t, d_j) = \frac{freq(t \text{ in } d_j)}{\sum_{k=1..n} p(t \text{ in } d_k)}$.
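The two domain measures (relevance and consensus) can be sketched as follows. This is one possible reading, not the authors' implementation. Assumptions: the probability of a term in a domain is estimated as its relative frequency among that domain's tokens; relevance is normalized by the sum over all domains; and for consensus, P(t, d_j) is the share of the term's occurrences that fall in document d_j.

```python
import math


def domain_relevance(term, domains, target):
    """Relevance of `term` to `target`; `domains` maps a name to a token list.

    Assumption: P(t, domain) is the relative frequency of t in that domain.
    """
    def p(name):
        tokens = domains[name]
        return tokens.count(term) / len(tokens) if tokens else 0.0

    total = sum(p(name) for name in domains)
    return p(target) / total if total else 0.0


def domain_consensus(term, documents):
    """Entropy of the distribution of `term` over the documents of one domain."""
    freqs = [doc.count(term) for doc in documents]
    total = sum(freqs)
    if total == 0:
        return 0.0
    consensus = 0.0
    for freq in freqs:
        if freq:
            p_j = freq / total  # share of the occurrences found in this document
            consensus += p_j * math.log2(1 / p_j)
    return consensus
```

A term spread evenly over many documents of the domain gets a high consensus, while a term concentrated in a single document gets 0.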

99 Domain Ontology learning from Texts: Term Extraction. Statistical analysis measures: TF-IDF [Robertson 1976]; entropy [Brini 2005]; relevance to the domain [Velardi, 2001]; consensus to the domain; Pointwise Mutual Information (PMI). For PMI, the probability is estimated from the co-occurrence of the term t with the domain concepts: $PMI(t, concept) = \log\left(\frac{p(t/concept)}{p(concept)}\right)$.
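A small sketch of PMI estimated from co-occurrence counts follows. It uses the standard symmetric definition log2(p(t, c) / (p(t) p(c))), which may differ from the notation on the slide. It treats each text window as a set of tokens; the window representation and the function name pmi are assumptions.

```python
import math


def pmi(term, concept, windows):
    """Pointwise mutual information of two tokens over a list of token sets."""
    n = len(windows)
    if n == 0:
        raise ValueError("at least one window is required")
    p_t = sum(term in w for w in windows) / n
    p_c = sum(concept in w for w in windows) / n
    p_tc = sum(term in w and concept in w for w in windows) / n
    if p_tc == 0:
        return float("-inf")  # the two tokens never co-occur
    return math.log2(p_tc / (p_t * p_c))
```

A positive value means the term and the concept co-occur more often than they would if they were independent.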

100 Domain Ontology learning from Texts Concept Discovery Hybrid methods Lexical resources: Wordnet Domain Term Extraction Concept Learning MDX MDX language, MDX, Multidimensional Expressions (MDX) Query language standard 100

101 Domain Ontology learning from Texts: Concept Discovery. Hybrid methods. Lexical resources: WordNet. Domain term extraction (MDX language, MDX, Multidimensional Expressions (MDX)) => concept learning (Query language standard). Topic signature construction (Agirre, 2000): classification of concepts based on contextual properties of words. Each concept is represented by a vector of co-occurring words and their frequency. The meaning of a word is strongly correlated with the contexts in which it appears.
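The topic-signature idea (a concept represented by a vector of co-occurring words and their frequencies) can be sketched as below. The window size, the tokenized input and the use of cosine similarity to compare signatures are illustrative choices, not details given on the slide.

```python
import math
from collections import Counter


def topic_signature(target, sentences, window=2):
    """Count the words seen within `window` tokens of `target`."""
    signature = Counter()
    for tokens in sentences:
        for i, token in enumerate(tokens):
            if token == target:
                left = tokens[max(0, i - window):i]
                right = tokens[i + 1:i + 1 + window]
                signature.update(left + right)
    return signature


def cosine(a, b):
    """Cosine similarity between two frequency vectors stored as Counters."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0
```

Two terms with similar signatures (for example "mdx" and "sql", if both are followed by "is a query language") would be candidates for the same concept.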

102 Domain Ontology learning from Texts Taxonomy Construction Linguistic analysis Domain Term Extraction MDX MDX language, MDX, Multidimensional Expressions (MDX) Concept Learning Query language standard Statistical analysis Taxonomy construction 102

103 Domain Ontology learning from Texts: Taxonomy Construction. Linguistic analysis: lexico-syntactic patterns (Hearst, 1998). (1) NP such as NP, NP, ... and NP. Example: "a multidimensional database query language such as MDX". (2) Such NP as NP, NP, ... or NP. Example: "such query language as MDX". (3) NP, NP, ... and other NP. Example: "screen real estate to financial charts, indices and other news graphics". (4) NP, especially NP, NP, ... and NP. Example: "Accounting, especially financial accounting, gives mainly past information in that the events are recorded".
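As a rough illustration of the first pattern ("NP such as NP, NP, ... and NP"), the sketch below extracts (hyponym, hypernym) pairs with regular expressions. It is a deliberate simplification: there is no noun-phrase chunking, so the hypernym is just the single word before "such as" and each hyponym is whatever text sits between separators. A real system would run the pattern over parsed noun phrases.

```python
import re

# Simplification: one word stands in for the hypernym noun phrase.
SUCH_AS = re.compile(r"(\w+) such as ([^.;]+)")
# Splits "A, B and C" or "A, B, or C" into its items.
SEPARATORS = re.compile(r"\s*,\s*(?:and\s+|or\s+)?|\s+(?:and|or)\s+")


def hearst_such_as(sentence):
    """Return (hyponym, hypernym) pairs matched by the 'such as' pattern."""
    pairs = []
    for match in SUCH_AS.finditer(sentence):
        hypernym = match.group(1)
        for hyponym in SEPARATORS.split(match.group(2).strip()):
            if hyponym:
                pairs.append((hyponym, hypernym))
    return pairs
```

On the slide's own example this yields the pair (MDX, language). Recovering the full hypernym "query language" requires noun-phrase detection.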

104 Domain Ontology learning from Texts Taxonomy Construction Linguistic analysis Lexico-syntactic patterns (Hearst, 1998) Statistical analysis Hierarchical grouping of concepts Grouping based on probability-based measures 104

105 Domain Ontology learning from Texts Domain Term Extraction MDX MDX language, MDX, Multidimensional Expressions (MDX) Concept Learning Query language standard Taxonomy construction Query language Multidimensional query language standard 105

106 Domain Ontology learning from Texts Relation Discovery Domain Term Extraction MDX MDX language, MDX, Multidimensional Expressions (MDX) Concept Learning Taxonomy construction Query language standard Query language Multidimensional query language standard Relation discovery 106

107 Domain Ontology learning from Texts: Relation Discovery. Linguistic analysis: syntactic frames [Faure and Nedellec, 1998], e.g. <to be used> (<subject: MDX>) (<for: OLAP database analysis>); <to be used> (<subject: MDX>) (<for: OLAP database querying>); conceptual clustering. Statistical analysis. Resulting relation: Multidimensional query language is used for OLAP database management.

108 Domain Ontology learning from Texts: Relation Discovery. Linguistic analysis: syntactic frames [Faure and Nedellec, 1998]. Statistical analysis: word space over the words w1, w2, w3, ..., wn; DOODLE II [Sugiura et al., 2003]; association rules; co-occurrence analysis; verb arguments analysis.

109 Domain Ontology learning from Texts: overview. Domain (MDX) => term extraction (MDX language, MDX, Multidimensional Expressions (MDX)) => concept learning (Query language standard) => taxonomy construction (Query language; Multidimensional query language standard) => relation discovery (Query language is used for OLAP database querying) => domain ontology. Evaluation: manual (expert-based), gold standards, WordNet.

110 Domain Ontology Learning The Web as a learning source. 110

111 Domain Ontology Learning: from web pages (texts). Collection of web pages related to a specific domain. Application of the same techniques as textual approaches. The Web is the largest electronic textual corpus. Shortcomings: natural-language resources with a visual-oriented representation; noise (commercial bias); lack of semantic structure; unreliable source. Advantages: size and heterogeneity; massive public IR tools.

112 Domain Ontology Learning 112

113 Ontology Learning from web: from web page structure. Noun phrases in the headings, tables or lists of a document can be used to deduce relations between concepts and instances. Supervised approaches. Difficulty: interpreting the HTML structure.

114 Domain Ontology Learning 114

115 Ontology Learning from web Aggregation of on-line ontologies 115

116 Domain Ontology Learning 116

117 Ontology Learning from web Wikipedia mining YAGO Ontology Wu and Weld,

118 Ontology Learning from web Wikipedia mining terminological ontology not scalable Extraction of schema from infoboxes (Wu and Weld, 2008) 118

119 Domain Ontology Learning 119

120 Ontology Learning from web Ontology learning by Googling 120

121 Unsupervised Web-based semantic relatedness for ontology learning. The correlation function [Turney, 2001]: $C_k(a, b) = \frac{p(ab)^k}{p(a)\,p(b)}$. Symmetric Conditional Probability (SCP) [Ferreira da Silva and Lopes, 1999]: $SCP(a, b) = C_2(a, b) = \frac{p(a, b)^2}{p(a)\,p(b)}$. Pointwise mutual information (PMI) [Church et al., 1991]: $PMI(a, b) = \log_2 \frac{p(ab)}{p(a)\,p(b)}$.

122 Unsupervised Web-based semantic relatedness for ontology learning. The correlation function [Turney, 2001]: $C_k(a, b) = \frac{p(ab)^k}{p(a)\,p(b)}$. If the words a and b are independent, $p(a \text{ and } b) = p(a)\,p(b)$; otherwise $p(a \text{ and } b) > p(a)\,p(b)$. For concept selection: $c(problem, choice) = \frac{p(problem, choice)}{p(choice)}$ (2). The Web is used as a scalable corpus: $P(problem, choice) = \frac{hits(problem \text{ AND } choice)}{\text{Total\_pages\_Web}}$, hence $c(problem, choice) = \frac{hits(problem \text{ AND } choice)}{hits(choice)}$ (3).
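The measures on the last two slides can be computed directly from search-engine hit counts once probabilities are estimated as hits divided by the total number of indexed pages. The sketch below assumes the counts are already available as integers; how they are obtained (which engine, which query syntax) is left out, and all function names are illustrative.

```python
import math


def correlation(hits_ab, hits_a, hits_b, total_pages, k=1):
    """C_k(a, b) = p(ab)^k / (p(a) p(b)), with p estimated as hits / total."""
    p_ab = hits_ab / total_pages
    p_a = hits_a / total_pages
    p_b = hits_b / total_pages
    return p_ab ** k / (p_a * p_b)


def scp(hits_ab, hits_a, hits_b, total_pages):
    """Symmetric Conditional Probability, i.e. C_2(a, b)."""
    return correlation(hits_ab, hits_a, hits_b, total_pages, k=2)


def pmi_from_hits(hits_ab, hits_a, hits_b, total_pages):
    """Pointwise mutual information, i.e. log2 of C_1(a, b)."""
    return math.log2(correlation(hits_ab, hits_a, hits_b, total_pages, k=1))


def conditional_score(hits_ab, hits_b):
    """c(a, b) = hits(a AND b) / hits(b); the total page count cancels out."""
    return hits_ab / hits_b
```

The last function shows why formula (3) is convenient on the Web: the unknown total number of pages is not needed.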

123 Ontology Learning from web: Ontology learning by Googling. Web-based semantic similarity for ontology learning. Hyponym learning: $score(candidate) = \frac{hit(\text{hearst\_pattern}(keyword, candidate))}{hit(candidate)}$.

124 Ontology Learning from web: Ontology learning by Googling. Non-taxonomic relation learning. Extraction of verb phrases from snippets: $score(\text{verb\_phrase}, concept) = \frac{hit(\text{"verb\_phrase concept"})}{hit(\text{verb\_phrase})}$. Retrieval and selection of related concepts: $score(candidate, concept) = \frac{hit(\text{"concept" AND "candidate"})}{hit(concept)}$.
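The hit-ratio scores from the two "learning by Googling" slides all have the same shape: hits for a combined query divided by hits for a base query. The sketch below makes that explicit. The hit counts are stored in a plain dictionary standing in for a search engine, the query strings are illustrative, and the numbers in the usage example are made up.

```python
def hit_ratio(hits, query, base):
    """hits(query) / hits(base); 0.0 when the base query has no hits."""
    denominator = hits.get(base, 0)
    return hits.get(query, 0) / denominator if denominator else 0.0


def hyponym_score(hits, keyword, candidate, pattern="{kw} such as {cand}"):
    """Score a hyponym candidate with one Hearst-style pattern query."""
    query = '"' + pattern.format(kw=keyword, cand=candidate) + '"'
    return hit_ratio(hits, query, candidate)


def related_concept_score(hits, concept, candidate):
    """score(candidate, concept) = hit("concept" AND "candidate") / hit(concept)."""
    query = f'"{concept}" AND "{candidate}"'
    return hit_ratio(hits, query, concept)


# Made-up counts, for illustration only.
EXAMPLE_HITS = {
    '"query languages such as MDX"': 30,
    "MDX": 600,
    '"OLAP" AND "MDX"': 200,
    "OLAP": 1000,
}
```

With the made-up counts, hyponym_score(EXAMPLE_HITS, "query languages", "MDX") is 30 / 600 and related_concept_score(EXAMPLE_HITS, "OLAP", "MDX") is 200 / 1000.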

125 Unsupervised Web-based semantic relatedness for ontology learning: problems. $c(concept, candidate) = \frac{hits(concept \text{ AND } candidate)}{hits(candidate)}$. Language ambiguity (polysemy): $hits(word) = \sum_{i=1}^{n} hits(sense_i)$. These measures therefore need to be contextualized (by introducing the context of words).

126 Unsupervised Web-based semantic relatedness for ontology learning: problems. Statistical assessment alone is not sufficient to decide whether a given candidate concept or relation is relevant enough to be added to the ontology. Linguistic patterns are used to discover relations between concepts: $Score_{pattern}(candidate) = \frac{\max_{i=1..n} hits(pattern_i(\text{"concept"}, \text{"candidate"}))}{hits(\text{"candidate"})}$.
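The pattern-based score above keeps the best-supported pattern for a concept/candidate pair and normalizes it by the candidate's own hit count. A minimal sketch follows, again with a dictionary standing in for the search engine; the three pattern templates are illustrative Hearst-style choices and the counts in the test data would be made up.

```python
# Illustrative pattern templates; the slide does not list the patterns used.
PATTERNS = (
    "{concept} such as {candidate}",
    "{candidate} and other {concept}",
    "{concept}, especially {candidate}",
)


def pattern_score(hits, concept, candidate, patterns=PATTERNS):
    """max_i hits(pattern_i(concept, candidate)) / hits(candidate)."""
    denominator = hits.get(f'"{candidate}"', 0)
    if not denominator:
        return 0.0
    best = max(
        hits.get('"' + p.format(concept=concept, candidate=candidate) + '"', 0)
        for p in patterns
    )
    return best / denominator
```

Taking the maximum rather than the sum avoids rewarding candidates simply because they happen to match several loosely related patterns.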

127 Synthesis 127

128 Ontology Learning for Business Intelligence

129 Ontology Learning for Business Intelligence Engineering of Semantic Business Intelligence Unified Knowledge representation Semantic integration of multiple sources Semantic search over heterogeneous data Semantic Technologies NLP techniques for question-answering Semantic annotations Ontology-based Information Extraction 129

130 Ontology Learning for Business Intelligence: Semantic Search

131 Main objectives PARLANCE: Probabilistic Adaptive Real-Time Learning And Natural Conversational Engine Goal: design and build mobile applications that approach human performance in conversational interaction, specifically in terms of the interactional skills needed to do so All of these skills will be learned or adapted using real data 131

132 Main objectives Develop dialogue systems in 3 languages that are: Incremental: dialogue act units Personalized: adapt to different users with different goals in different contexts Dynamic and evolving: possible to incorporate new concepts Interactive hyper-local: take into account the location and surroundings of the user 132

133 Our focus Modular incremental ontology enrichment Using existing knowledge bases Ontology learning from Web Query-driven knowledge base enrichment Dynamic, evolving rich user models Static information Contextual information Social information collaborative filtering 133

134 [Figure: Linked Data extraction (SW concepts and instances with attribute = value pairs) and Web search feed ontology construction (ontology schema), followed by basic population and incremental population of the ontology through entity discovery and pattern learning over the Web]

135 Web Search, Web Corpus Barack Obama + United States 135

136 Ontology Learning for Business Intelligence Ontology-based Information Extraction 136

137 Ontologies for Collecting and Analyzing Business Intelligence DAVID (Data Analysis and Visualization aid) system Kakkonen, T., Mufti, T.: Developing and Applying a Company, Product and Business Event Ontology for Text Mining. Proceedings of the 11th International Conference on Knowledge Management and Knowledge Technologies, Graz, Austria,

138 Ontology Learning for Business Intelligence Ontologies for Collecting and Analyzing Business Intelligence 138

139 Ontology Learning for Business Intelligence Annotation of Dashboard 139

140 Annotating BI Visualization Dashboards: Needs & Challenges. Micheline Elias and Anastasia Bezerianos ACM SIGCHI Conference on Human Factors in Computing Systems, CHI 2012.

141 Ontological Need: Enriching Ontologies from Annotations? 141

142 Conclusion 142

143 Conclusion Bridging the gap between academic research approaches and real business use cases Scalability of knowledge Modular ontologies Modular reasoning Open Linked data and Privacy issues Toward Open Linked Business Processes?! 143

144 Questions?? 144


Alberto Messina, Maurizio Montagnuolo A Generalised Cross-Modal Clustering Method Applied to Multimedia News Semantic Indexing and Retrieval Alberto Messina, Maurizio Montagnuolo RAI Centre for Research and Technological Innovation Madrid,

More information

Discovering Semantic Similarity between Words Using Web Document and Context Aware Semantic Association Ranking

Discovering Semantic Similarity between Words Using Web Document and Context Aware Semantic Association Ranking Discovering Semantic Similarity between Words Using Web Document and Context Aware Semantic Association Ranking P.Ilakiya Abstract The growth of information in the web is too large, so search engine come

More information

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web What you have learned so far Interoperability Introduction to the Semantic Web Tutorial at ISWC 2010 Jérôme Euzenat Data can be expressed in RDF Linked through URIs Modelled with OWL ontologies & Retrieved

More information

Chapter 2. Architecture of a Search Engine

Chapter 2. Architecture of a Search Engine Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them

More information

ONTOPARK: ONTOLOGY BASED PAGE RANKING FRAMEWORK USING RESOURCE DESCRIPTION FRAMEWORK

ONTOPARK: ONTOLOGY BASED PAGE RANKING FRAMEWORK USING RESOURCE DESCRIPTION FRAMEWORK Journal of Computer Science 10 (9): 1776-1781, 2014 ISSN: 1549-3636 2014 doi:10.3844/jcssp.2014.1776.1781 Published Online 10 (9) 2014 (http://www.thescipub.com/jcs.toc) ONTOPARK: ONTOLOGY BASED PAGE RANKING

More information

Making Sense Out of the Web

Making Sense Out of the Web Making Sense Out of the Web Rada Mihalcea University of North Texas Department of Computer Science rada@cs.unt.edu Abstract. In the past few years, we have witnessed a tremendous growth of the World Wide

More information

The University of Jordan. Accreditation & Quality Assurance Center. Curriculum for Doctorate Degree

The University of Jordan. Accreditation & Quality Assurance Center. Curriculum for Doctorate Degree Accreditation & Quality Assurance Center Curriculum for Doctorate Degree 1. Faculty King Abdullah II School for Information Technology 2. Department Computer Science الدكتوراة في علم الحاسوب (Arabic).3

More information

WikiOnto: A System For Semi-automatic Extraction And Modeling Of Ontologies Using Wikipedia XML Corpus

WikiOnto: A System For Semi-automatic Extraction And Modeling Of Ontologies Using Wikipedia XML Corpus 2009 IEEE International Conference on Semantic Computing WikiOnto: A System For Semi-automatic Extraction And Modeling Of Ontologies Using Wikipedia XML Corpus Lalindra De Silva University of Colombo School

More information

Semantic Web and Natural Language Processing

Semantic Web and Natural Language Processing Semantic Web and Natural Language Processing Wiltrud Kessler Institut für Maschinelle Sprachverarbeitung Universität Stuttgart Semantic Web Winter 2014/2015 This work is licensed under a Creative Commons

More information

Acquiring Experience with Ontology and Vocabularies

Acquiring Experience with Ontology and Vocabularies Acquiring Experience with Ontology and Vocabularies Walt Melo Risa Mayan Jean Stanford The author's affiliation with The MITRE Corporation is provided for identification purposes only, and is not intended

More information

Automatically Annotating Text with Linked Open Data

Automatically Annotating Text with Linked Open Data Automatically Annotating Text with Linked Open Data Delia Rusu, Blaž Fortuna, Dunja Mladenić Jožef Stefan Institute Motivation: Annotating Text with LOD Open Cyc DBpedia WordNet Overview Related work Algorithms

More information

Extensible Dynamic Form Approach for Supplier Discovery

Extensible Dynamic Form Approach for Supplier Discovery Extensible Dynamic Form Approach for Supplier Discovery Yan Kang, Jaewook Kim, and Yun Peng Department of Computer Science and Electrical Engineering University of Maryland, Baltimore County {kangyan1,

More information