Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang
|
|
- Gavin Watkins
- 5 years ago
- Views:
Transcription
1 Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang Department of Computer Science, University of Sheffield
2 Outline Define semantic table interpretation State-of-the-art and motivation The method TableMiner Evaluation
3 Semantic Table Interpretation Input Ontology Relational table Goals/Tasks Label columns by concepts Link cells to named entities Connect columns by relations Thing Artist Work Location Actor/ Actress Name Film Country 1 Tom Hanks Philadelphia USA 2 Jamie Foxx Ray USA 3 Kate Winslet The Reader UK 99 Charlize Theron Rel:performIn Film Rel:performIn < > Monster Table of Best Actor/Actress Country South Africa Ent:USA Ent:UK
4 Semantic Table Interpretation Input Ontology Relational table Goals/Tasks Label columns by concepts Link cells to named entities Connect columns by relations Column classification/ header disambiguation Cell disambiguation Relation interpretation
5 Motivation and State-of-the-art 154 mil. relational tables on the Web and growing [Cafarella2008] Classic Information Extraction methods do not work [Limaye2010, Lu2013] They cannot model the complex interdependence among table components
6 Motivation and State-of-the-art SoA semantic table interpretation methods, e.g. [Limaye2010, Venetis2011, Mulwad2013] Limitation 1 Inference is exhaustive, but unnecessary Name Film Country 1 Tom Hanks Philadelphia USA 2 Jamie Foxx Ray USA 3 Kate Winslet The Reader UK 99 Charlize Theron < > Monster Table of Best Actor/Actress South Africa Goal: Assign a concept to this column Hint: Content in the column gives useful clues How much do we need for inference (99 rows in this example)? - Human: SOME (learn by example) - SoA: ALL
7 Motivation and State-of-the-art SoA semantic table interpretation methods, e.g. [Limaye2010, Venetis2011, Mulwad2013] Limitation 2 Contextual features for inference SoA: features only from within the table Context outside the table also makes hint for interpretation. E.g., the words in the paragraph are often found in descriptions of actors Table of Best Actor/Actress
8 TableMiner
9 TableMiner Two tasks: Column classification Cell disambiguation Non-exhaustive inference in a bootstrapping pattern phase 1 inference with partial content phase 2 propagation and update Contextual features both inside and outside tables
10 TableMiner Phase 1 I-Inf Incremental inference with stopping (I-Inf) T j a column; C j candidate concepts for the column; E i,j candidate entities for a cell
11 TableMiner Phase 1 I-Inf Incremental inference with stopping (I-Inf) T j a column; C j candidate concepts for the column; E i,j candidate entities for a cell Itr.1 E i,j = {<e 1,s 1 >, <e 2,s 2 >, }. (until stop)
12 TableMiner Phase 1 I-Inf Incremental inference with stopping (I-Inf) T j a column; C j candidate concepts for the column; E i,j candidate entities for a cell C j = {<c 1,s 1 >, <c 2,s 2 >} concepts = {<c 1,s 1 >, <c 2,s 2 >, } Itr.1 E i,j = {<e 1,s 1 >, <e 2,s 2 >, }. (until stop)
13 TableMiner Phase 1 I-Inf Incremental inference with stopping (I-Inf) T j a column; C j candidate concepts for the column; E i,j candidate entities for a cell H(C j ) H(prevC j ) <t? C j = {<c 1,s 1 >, <c 2,s 2 >} Yes stop No next itr. concepts = {<c 1,s 1 >, <c 2,s 2 >, } Itr.1 E i,j = {<e 1,s 1 >, <e 2,s 2 >, }. (until stop)
14 TableMiner Phase 1 I-Inf Incremental inference with stopping (I-Inf) T j a column; C j candidate concepts for the column; E i,j candidate entities for a cell H(C j ) H(prevC j ) <t? C j = {<c 1,s 1 >, <c 2,s 2 >, <c 3,s 3 >} Yes stop No next itr. concepts = {<c 1,s 1 >, <c 3,s 3 >, } Itr.2 E i,j = {<e 1,s 1 >, <e 2,s 2 >, }. (until stop)
15 TableMiner Phase 1 I-Inf Incremental inference with stopping (I-Inf) T j a column; C j candidate concepts for the column; E i,j candidate entities for a cell H(C j ) H(prevC j ) <t? C j = {<c 1,s 1 >, <c 2,s 2 >, <c 3,s 3 >,. <c 11,s 11 >} Yes stop No next itr. concepts = {<c 11,s 11 >} Itr.3 E i,j =. {<e 1,s 1 >, <e 2,s 2 >, } (until stop)
16 TableMiner Phase 1 I-Inf To compute scores of candidate named entities (e.g. <e 1,s 1 >) and concepts (e.g., <c 1,s 1 >) Candidate NE Build a feature vector of a candidate using the ontology Build a feature vector of the cell/column header using its context Compute vector similarity Candidate concept: same principle, but also depends on score of contributing NEs
17 TableMiner Phase 2 Propagate, Update When I-Inf stops Select the highest scoring candidate concept c + to label the column Propagate: use c + as constrain to disambiguate remaining cells candidate NEs not belonging to c + are discarded Update: Re-compute c + after all cells are disambiguated If the new c + is different, revise disambiguation across the entire column with it as new constraint Repeat until no change Use as constraint to disambiguate cells C j = {<c 1,s 1 >, <c 2,s 2 >, <c 3,s 3 >,. <c 11,s 11 >} c + Rank and select
18 Evaluation
19 TableMiner Evaluation Data Freebase as reference ontology/background knowledge Limaye Web tables from Limaye2010 originally annotated with Wikipedia Cells are automatically mapped to Freebase some are unmapped Columns are manually annotated IMDB 7,354 cast tables of films mapped to Freebase
20 TableMiner Evaluation Baselines (both uses exhaustive inference) B first - cell disambiguation: choose the top ranked NE candidate in the Freebase search result - column classification: each disambiguated cell casts a vote to the set of concepts the NEs belong to, and the majority wins B sim - cell disambiguation: string similarity + feature vector similarity (in-table context only) - column classification: the majority vote method as above + string similarity
21 TableMiner Evaluation Results Cell disambiguation Manual validation of 932 cell annotations in Limaye112 not covered by the above results (i.e., unmapped cells) If only consider those cells where at least one system predicts correctly
22 TableMiner Evaluation Results Column classification best only a column is labelled correctly only if the concept is suitable for the data in the column and is specific enough best or ok a column is labelled correctly if the concept is suitable for the data in the column, though not very specific (E.g., Film Actors may be the best, while Artist or Person is OK, but Engineer is incorrect)
23 TableMiner Evaluation Results Efficiency TableMiner is efficient because Column classification: processes partial content from a column (avg. 57% Limaye112, 43% IMDB) Cell disambiguation: constrained by column classification, resulting in smaller NE candidate space (avg. 32% reduction Limaye32, 24% IMDB) Fewer candidates => less time spent on retrieval and feature space creation (typically >90% of CPU in the pipeline, Limaye2010)
24 TableMiner Conclusion TableMiner take-home messages How can it be more effective? Use both context within and outside tables as features for inference Message 1 How can it be more efficient? Perform inference with partial data and follow the bootstrapping pattern of learning Message 2
25 References [Cafarella2008] Cafarella, M.J., Halevy, A., Wang, D.Z., Wu, E., Zhang, Y. 2008: Webtables: exploring the power of tables on the web. Proceedings of VLDB Endowment 1(1), [Limaye2010] Limaye, G., Sarawagi, S., Chakrabarti, S. 2010: Annotating and searching web tables using entities, types and relationships. Proceedings of the VLDB Endowment 3(1-2), [Lu2013] Lu, C., Bing, L., Lam, W., Chan, K., Gu, Y. 2013: Web entity detection for semi-structured text data records with unlabeled data. International Journal of Computational Linguistics and Applications [Mulwad2013] Mulwad, V., Finin, T., Joshi, A. 2013: Semantic message passing for generating linked data from tables. In: International Semantic Web Conference (1). pp Lecture Notes in Computer Science, Springer [Venetis2011] Venetis, P., Halevy, A., Madhavan, J., Pas ca, M., Shen,W.,Wu, F., Miao, G.,Wu, C. 2011: Recovering semantics of tables on the web. Proceedings of VLDB Endowment 4(9),
26 Thank you
Visualizing semantic table annotations with TableMiner+
Visualizing semantic table annotations with TableMiner+ MAZUMDAR, Suvodeep and ZHANG, Ziqi Available from Sheffield Hallam University Research Archive (SHURA) at:
More informationEffective and Efficient Semantic Table Interpretation using TableMiner +
Semantic Web tbd (2016) 1 39 1 IOS Press Effective and Efficient Semantic Table Interpretation using TableMiner + Editor(s): Pascal Hitzler, Wright State University, USA; Isabel Cruz, University of Illinois
More informationEntity Linking in Web Tables with Multiple Linked Knowledge Bases
Entity Linking in Web Tables with Multiple Linked Knowledge Bases Tianxing Wu, Shengjia Yan, Zhixin Piao, Liang Xu, Ruiming Wang, Guilin Qi School of Computer Science and Engineering, Southeast University,
More informationarxiv: v1 [cs.cl] 4 Nov 2018
ColNet: Embedding the Semantics of Web Tables for Column Type Prediction Jiaoyan Chen 1, Ernesto Jiménez-Ruiz 3,4, Ian Horrocks 1,4, Charles Sutton 2,4 1 Department of Computer Science, University of Oxford,
More informationDC Proposal: Graphical Models and Probabilistic Reasoning for Generating Linked Data from Tables
DC Proposal: Graphical Models and Probabilistic Reasoning for Generating Linked Data from Tables Varish Mulwad Computer Science and Electrical Engineering University of Maryland, Baltimore County varish1@cs.umbc.edu
More informationAutomatically Generating Government Linked Data from Tables
Automatically Generating Government Linked Data from Tables Varish Mulwad, Tim Finin and Anupam Joshi Computer Science and Electrical Engineering University of Maryland, Baltimore County Baltimore, Maryland
More information(Big Data Integration) : :
(Big Data Integration) : : 3 # $%&'! ()* +$,- 2/30 ()* + # $%&' = 3 : $ 2 : 17 ;' $ # < 2 6 ' $%&',# +'= > 0 - '? @0 A 1 3/30 3?. - B 6 @* @(C : E6 - > ()* (C :(C E6 1' +'= - ''3-6 F :* 2G '> H-! +'-?
More informationLeveraging Linked Data to Infer Semantic Relations within Structured Sources
Leveraging Linked Data to Infer Semantic Relations within Structured Sources Mohsen Taheriyan 1, Craig A. Knoblock 1, Pedro Szekely 1, José Luis Ambite 1, and Yinyi Chen 2 1 University of Southern California
More informationPresented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu
Presented by: Dimitri Galmanovich Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu 1 When looking for Unstructured data 2 Millions of such queries every day
More informationNUS-I2R: Learning a Combined System for Entity Linking
NUS-I2R: Learning a Combined System for Entity Linking Wei Zhang Yan Chuan Sim Jian Su Chew Lim Tan School of Computing National University of Singapore {z-wei, tancl} @comp.nus.edu.sg Institute for Infocomm
More informationMatching Web Tables To DBpedia - A Feature Utility Study
Matching Web Tables To DBpedia - A Feature Utility Study Dominique Ritze, Christian Bizer Data and Web Science Group, University of Mannheim, B6, 26 68159 Mannheim, Germany {dominique,chris}@informatik.uni-mannheim.de
More informationUnderstanding the Query: THCIB and THUIS at NTCIR-10 Intent Task. Junjun Wang 2013/4/22
Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task Junjun Wang 2013/4/22 Outline Introduction Related Word System Overview Subtopic Candidate Mining Subtopic Ranking Results and Discussion
More informationMatching Web Tables with Knowledge Base Entities: From Entity Lookups to Entity Embeddings
Matching Web Tables with Knowledge Base Entities: From Entity Lookups to Entity Embeddings Vasilis Efthymiou 1, Oktie Hassanzadeh 2, Mariano Rodriguez-Muro 2, and Vassilis Christophides 3 1 ICS-FORTH &
More informationA Domain Independent Framework for Extracting Linked Semantic Data from Tables
Preprint of: Varish Mulwad, Tim Finin and Anupam Joshi, A Domain Independent Framework for Extracting Linked Semantic Data from Tables, in Search Computing - Broadening Web Search, Stefano Ceri and Marco
More informationLeveraging Knowledge Graphs for Web-Scale Unsupervised Semantic Parsing. Interspeech 2013
Leveraging Knowledge Graphs for Web-Scale Unsupervised Semantic Parsing LARRY HECK, DILEK HAKKANI-TÜR, GOKHAN TUR Focus of This Paper SLU and Entity Extraction (Slot Filling) Spoken Language Understanding
More informationLearning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search
1 / 33 Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search Bernd Wittefeld Supervisor Markus Löckelt 20. July 2012 2 / 33 Teaser - Google Web History http://www.google.com/history
More informationTriplifying Wikipedia s Tables
Triplifying Wikipedia s Tables Emir Muñoz, Aidan Hogan, and Alessandra Mileo Digital Enterprise Research Institute, National University of Ireland, Galway {emir.munoz, aidan.hogan, alessandra.mileo}@deri.org
More informationSynthesizing Union Tables from the Web
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Synthesizing Union Tables from the Web Xiao Ling xiaoling@cs.washington.edu University of Washington Alon Halevy
More informationRecovering Semantics of Tables on the Web
Recovering Semantics of Tables on the Web Petros Venetis Alon Halevy Jayant Madhavan Marius Paşca Stanford University Google Inc. Google Inc. Google Inc. venetis@cs.stanford.edu halevy@google.com jayant@google.com
More informationLeveraging Linked Data to Discover Semantic Relations within Data Sources
Leveraging Linked Data to Discover Semantic Relations within Data Sources Mohsen Taheriyan, Craig A. Knoblock, Pedro Szekely, and José Luis Ambite University of Southern California Information Sciences
More informationOntology Augmentation Through Matching with Web Tables
Ontology Augmentation Through Matching with Web Tables Oliver Lehmberg 1 and Oktie Hassanzadeh 2 1 University of Mannheim, B6 26, 68159 Mannheim, Germany 2 IBM Research, Yorktown Heights, New York, U.S.A.
More informationWeb Scale Information Extraction ECML/PKDD 2013
Overview Wrapper Induction Web Scale Information Extraction TUTORIAL @ ECML/PKDD 2013 A.L. Gentile Z. Zhang Department of Computer Science, The University of Sheffield, UK 27 th September 2013 A.L. Gentile,
More informationarxiv: v1 [cs.cl] 3 Aug 2015
Compositional Semantic Parsing on Semi-Structured Tables Panupong Pasupat Computer Science Department Stanford University ppasupat@cs.stanford.edu Percy Liang Computer Science Department Stanford University
More informationEvaluating Approaches for Supervised Semantic Labeling
Evaluating Approaches for Supervised Semantic Labeling Nataliia Rümmele Siemens Germany nataliia.ruemmele@ siemens.com Yuriy Tyshetskiy Data61, CSIRO Australia yuriy.tyshetskiy@ data61.csiro.au Alex Collins
More informationTAIPAN: Automatic Property Mapping for Tabular Data
TAIPAN: Automatic Property Mapping for Tabular Data Ivan Ermilov and Axel-Cyrille Ngonga Ngomo University of Leipzig, Institute of Computer Science, Leipzig, Germany iermilov,ngonga@informatik.uni-leipzig.de
More informationImproving Open Data Usability through Semantics
Improving Open Data Usability through Semantics PhD research proposal Sebastian Neumaier Vienna University of Economics and Business, Vienna, Austria sebastian.neumaier@wu.ac.at Abstract. With the success
More informationBig Data Integration for Data Enthusiasts. Jayant Madhavan Structured Data Research Google Inc.
for Data Enthusiasts Jayant Madhavan Structured Data Research Google Inc. Big Data Challenge Running computations over ginormous datasets Petabytes, Exabytes, maybe more! Only one aspect of the challenge!
More informationExtending Keyword Search to Metadata in Relational Database
DEWS2008 C6-1 Extending Keyword Search to Metadata in Relational Database Jiajun GU Hiroyuki KITAGAWA Graduate School of Systems and Information Engineering Center for Computational Sciences University
More informationMulti-Stage Rocchio Classification for Large-scale Multilabeled
Multi-Stage Rocchio Classification for Large-scale Multilabeled Text data Dong-Hyun Lee Nangman Computing, 117D Garden five Tools, Munjeong-dong Songpa-gu, Seoul, Korea dhlee347@gmail.com Abstract. Large-scale
More informationLarge-Scale Syntactic Processing: Parsing the Web. JHU 2009 Summer Research Workshop
Large-Scale Syntactic Processing: JHU 2009 Summer Research Workshop Intro CCG parser Tasks 2 The Team Stephen Clark (Cambridge, UK) Ann Copestake (Cambridge, UK) James Curran (Sydney, Australia) Byung-Gyu
More informationA Scalable Approach to Learn Semantic Models of Structured Sources
2014 IEEE International Conference on Semantic Computing A Scalable Approach to Learn Semantic Models of Structured Sources Mohsen Taheriyan, Craig A. Knoblock, Pedro Szekely, José Luis Ambite Information
More informationSemantic Annotation for Semantic Social Networks. Using Community Resources
Semantic Annotation for Semantic Social Networks Using Community Resources Lawrence Reeve and Hyoil Han College of Information Science and Technology Drexel University, Philadelphia, PA 19108 lhr24@drexel.edu
More informationJianyong Wang Department of Computer Science and Technology Tsinghua University
Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW PAPER ON IMPLEMENTATION OF DOCUMENT ANNOTATION USING CONTENT AND QUERYING
More informationCreating Large-scale Training and Test Corpora for Extracting Structured Data from the Web
Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Robert Meusel and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {robert,heiko}@informatik.uni-mannheim.de
More informationRe-contextualization and contextual Entity exploration. Sebastian Holzki
Re-contextualization and contextual Entity exploration Sebastian Holzki Sebastian Holzki June 7, 2016 1 Authors: Joonseok Lee, Ariel Fuxman, Bo Zhao, and Yuanhua Lv - PAPER PRESENTATION - LEVERAGING KNOWLEDGE
More informationOntology Based Prediction of Difficult Keyword Queries
Ontology Based Prediction of Difficult Keyword Queries Lubna.C*, Kasim K Pursuing M.Tech (CSE)*, Associate Professor (CSE) MEA Engineering College, Perinthalmanna Kerala, India lubna9990@gmail.com, kasim_mlp@gmail.com
More informationA Hybrid Neural Model for Type Classification of Entity Mentions
A Hybrid Neural Model for Type Classification of Entity Mentions Motivation Types group entities to categories Entity types are important for various NLP tasks Our task: predict an entity mention s type
More informationLinking Entities in Chinese Queries to Knowledge Graph
Linking Entities in Chinese Queries to Knowledge Graph Jun Li 1, Jinxian Pan 2, Chen Ye 1, Yong Huang 1, Danlu Wen 1, and Zhichun Wang 1(B) 1 Beijing Normal University, Beijing, China zcwang@bnu.edu.cn
More informationExam Marco Kuhlmann. This exam consists of three parts:
TDDE09, 729A27 Natural Language Processing (2017) Exam 2017-03-13 Marco Kuhlmann This exam consists of three parts: 1. Part A consists of 5 items, each worth 3 points. These items test your understanding
More informationPTE : Predictive Text Embedding through Large-scale Heterogeneous Text Networks
PTE : Predictive Text Embedding through Large-scale Heterogeneous Text Networks Pramod Srinivasan CS591txt - Text Mining Seminar University of Illinois, Urbana-Champaign April 8, 2016 Pramod Srinivasan
More informationChinese Microblog Entity Linking System Combining Wikipedia and Search Engine Retrieval Results
Chinese Microblog Entity Linking System Combining Wikipedia and Search Engine Retrieval Results Zeyu Meng, Dong Yu, and Endong Xun Inter. R&D center for Chinese Education, Beijing Language and Culture
More informationSERIMI Results for OAEI 2011
SERIMI Results for OAEI 2011 Samur Araujo 1, Arjen de Vries 1, and Daniel Schwabe 2 1 Delft University of Technology, PO Box 5031, 2600 GA Delft, the Netherlands {S.F.CardosodeAraujo, A.P.deVries}@tudelft.nl
More informationUnderstanding Tables on the Web
Understanding Tables on the Web ABSTRACT The Web contains a wealth of information, and a key challenge is to make this information machine processable. Because natural language understanding at web scale
More informationAnnotating and Searching Web Tables Using Entities, Types and Relationships
Annotating and Searching Web Tables Using Entities, Types and Relationships Girija Limaye IIT Bombay, India girija@cse.iitb.ac.in Sunita Sarawagi IIT Bombay, India sunita@iitb.ac.in Soumen Chakrabarti
More informationProcessing Structural Constraints
SYNONYMS None Processing Structural Constraints Andrew Trotman Department of Computer Science University of Otago Dunedin New Zealand DEFINITION When searching unstructured plain-text the user is limited
More informationCMU System for Entity Discovery and Linking at TAC-KBP 2015
CMU System for Entity Discovery and Linking at TAC-KBP 2015 Nicolas Fauceglia, Yiu-Chang Lin, Xuezhe Ma, and Eduard Hovy Language Technologies Institute Carnegie Mellon University 5000 Forbes Ave, Pittsburgh,
More informationSelf-tuning ongoing terminology extraction retrained on terminology validation decisions
Self-tuning ongoing terminology extraction retrained on terminology validation decisions Alfredo Maldonado and David Lewis ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin
More informationLeveraging Set Relations in Exact Set Similarity Join
Leveraging Set Relations in Exact Set Similarity Join Xubo Wang, Lu Qin, Xuemin Lin, Ying Zhang, and Lijun Chang University of New South Wales, Australia University of Technology Sydney, Australia {xwang,lxue,ljchang}@cse.unsw.edu.au,
More informationDoctoral Thesis Proposal Learning Semantics of WikiTables
Doctoral Thesis Proposal Learning Semantics of WikiTables Chandra Sekhar Bhagavatula Department of Electrical Engineering and Computer Science Northwestern University csbhagav@u.northwestern.edu December
More informationReducing Over-generation Errors for Automatic Keyphrase Extraction using Integer Linear Programming
Reducing Over-generation Errors for Automatic Keyphrase Extraction using Integer Linear Programming Florian Boudin LINA - UMR CNRS 6241, Université de Nantes, France Keyphrase 2015 1 / 22 Errors made by
More informationStructured Data on the Web
Structured Data on the Web Alon Halevy Google Australasian Computer Science Week January, 2010 Structured Data & The Web Andree Hudson, 4 th of July Hard to find structured data via search engines
More informationLimitations of XPath & XQuery in an Environment with Diverse Schemes
Exploiting Structure, Annotation, and Ontological Knowledge for Automatic Classification of XML-Data Martin Theobald, Ralf Schenkel, and Gerhard Weikum Saarland University Saarbrücken, Germany 23.06.2003
More informationSPARK: Top-k Keyword Query in Relational Database
SPARK: Top-k Keyword Query in Relational Database Wei Wang University of New South Wales Australia 20/03/2007 1 Outline Demo & Introduction Ranking Query Evaluation Conclusions 20/03/2007 2 Demo 20/03/2007
More informationClosest Keywords Search on Spatial Databases
Closest Keywords Search on Spatial Databases 1 A. YOJANA, 2 Dr. A. SHARADA 1 M. Tech Student, Department of CSE, G.Narayanamma Institute of Technology & Science, Telangana, India. 2 Associate Professor,
More informationCMU System for Entity Discovery and Linking at TAC-KBP 2016
CMU System for Entity Discovery and Linking at TAC-KBP 2016 Xuezhe Ma, Nicolas Fauceglia, Yiu-chang Lin, and Eduard Hovy Language Technologies Institute Carnegie Mellon University 5000 Forbes Ave, Pittsburgh,
More informationOutline. Part I. Introduction Part II. ML for DI. Part III. DI for ML Part IV. Conclusions and research direction
Outline Part I. Introduction Part II. ML for DI ML for entity linkage ML for data extraction ML for data fusion ML for schema alignment Part III. DI for ML Part IV. Conclusions and research direction Data
More informationTag Based Image Search by Social Re-ranking
Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule
More informationTowards Semantic Data Mining
Towards Semantic Data Mining Haishan Liu Department of Computer and Information Science, University of Oregon, Eugene, OR, 97401, USA ahoyleo@cs.uoregon.edu Abstract. Incorporating domain knowledge is
More informationKnowledge Based Systems Text Analysis
Knowledge Based Systems Text Analysis Dr. Shubhangi D.C 1, Ravikiran Mitte 2 1 H.O.D, 2 P.G.Student 1,2 Department of Computer Science and Engineering, PG Center Regional Office VTU, Kalaburagi, Karnataka
More informationEntiTables: Smart Assistance for Entity-Focused Tables
EntiTables: Smart Assistance for Entity-Focused Tables ABSTRACT Shuo Zhang University of Stavanger shuo.zhang@uis.no Tables are among the most powerful and practical tools for organizing and working with
More informationQuery Disambiguation from Web Search Logs
Vol.133 (Information Technology and Computer Science 2016), pp.90-94 http://dx.doi.org/10.14257/astl.2016. Query Disambiguation from Web Search Logs Christian Højgaard 1, Joachim Sejr 2, and Yun-Gyung
More informationShrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent
More informationWebTables: Exploring the Power of Tables on the Web
WebTables: Exploring the Power of Tables on the Web Michael J. Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, Yang Zhang Presented by: Ganesh Viswanathan September 29 th, 2011 CIS 6930 Data Science:
More informationString Vector based KNN for Text Categorization
458 String Vector based KNN for Text Categorization Taeho Jo Department of Computer and Information Communication Engineering Hongik University Sejong, South Korea tjo018@hongik.ac.kr Abstract This research
More informationEfficient Index Based Query Keyword Search in the Spatial Database
Advances in Computational Sciences and Technology ISSN 0973-6107 Volume 10, Number 5 (2017) pp. 1517-1529 Research India Publications http://www.ripublication.com Efficient Index Based Query Keyword Search
More informationAnnotating Spatio-Temporal Information in Documents
Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de
More informationAutomatically Synthesizing SQL Queries from Input-Output Examples
Automatically Synthesizing SQL Queries from Input-Output Examples Sai Zhang University of Washington Joint work with: Yuyin Sun Goal: making it easier for non-expert users to write correct SQL queries
More informationIJCSC Volume 5 Number 1 March-Sep 2014 pp ISSN
Movie Related Information Retrieval Using Ontology Based Semantic Search Tarjni Vyas, Hetali Tank, Kinjal Shah Nirma University, Ahmedabad tarjni.vyas@nirmauni.ac.in, tank92@gmail.com, shahkinjal92@gmail.com
More informationTable Identification and Information extraction in Spreadsheets
Table Identification and Information extraction in Spreadsheets Elvis Koci 1,2, Maik Thiele 1, Oscar Romero 2, and Wolfgang Lehner 1 1 Technische Universität Dresden, Germany 2 Universitat Politècnica
More informationEntity and Knowledge Base-oriented Information Retrieval
Entity and Knowledge Base-oriented Information Retrieval Presenter: Liuqing Li liuqing@vt.edu Digital Library Research Laboratory Virginia Polytechnic Institute and State University Blacksburg, VA 24061
More informationUnderstanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study
Understanding a Large Corpus of Web Tables Through Matching with Knowledge Bases An Empirical Study Oktie Hassanzadeh, Michael J. Ward, Mariano Rodriguez-Muro, and Kavitha Srinivas IBM T.J. Watson Research
More informationSIMON: A Multi-strategy Classification Approach Resolving Ontology Heterogeneity The P2P Meets the Semantic Web *
SIMON: A Multi-strategy Classification Approach Resolving Ontology Heterogeneity The P2P Meets the Semantic Web * Leyun Pan, Liang Zhang, and Fanyuan Ma Department of Computer Science and Engineering Shanghai
More informationCIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets
CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets Arjumand Younus 1,2, Colm O Riordan 1, and Gabriella Pasi 2 1 Computational Intelligence Research Group,
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationCMU System for Entity Discovery and Linking at TAC-KBP 2017
CMU System for Entity Discovery and Linking at TAC-KBP 2017 Xuezhe Ma, Nicolas Fauceglia, Yiu-chang Lin, and Eduard Hovy Language Technologies Institute Carnegie Mellon University 5000 Forbes Ave, Pittsburgh,
More informationA New Technique to Optimize User s Browsing Session using Data Mining
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
More informationRishiraj Saha Roy and Niloy Ganguly IIT Kharagpur India. Monojit Choudhury and Srivatsan Laxman Microsoft Research India India
Rishiraj Saha Roy and Niloy Ganguly IIT Kharagpur India Monojit Choudhury and Srivatsan Laxman Microsoft Research India India ACM SIGIR 2012, Portland August 15, 2012 Dividing a query into individual semantic
More informationPapers for comprehensive viva-voce
Papers for comprehensive viva-voce Priya Radhakrishnan Advisor : Dr. Vasudeva Varma Search and Information Extraction Lab, International Institute of Information Technology, Gachibowli, Hyderabad, India
More informationZákladní informace. Motivace
Základní informace Jméno projektu Zkratka Vedoucí Konzultanti Anotace Open Data Linker and Classifier Odalic Tomas Knap Ziqi Zhang The goal of the project
More informationAlberto Messina, Maurizio Montagnuolo
A Generalised Cross-Modal Clustering Method Applied to Multimedia News Semantic Indexing and Retrieval Alberto Messina, Maurizio Montagnuolo RAI Centre for Research and Technological Innovation Madrid,
More informationLecture 4: Unsupervised Word-sense Disambiguation
ootstrapping Lecture 4: Unsupervised Word-sense Disambiguation Lexical Semantics and Discourse Processing MPhil in dvanced Computer Science Simone Teufel Natural Language and Information Processing (NLIP)
More informationDiscovering Names in Linked Data Datasets
Discovering Names in Linked Data Datasets Bianca Pereira 1, João C. P. da Silva 2, and Adriana S. Vivacqua 1,2 1 Programa de Pós-Graduação em Informática, 2 Departamento de Ciência da Computação Instituto
More informationS-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking
S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking Yi Yang * and Ming-Wei Chang # * Georgia Institute of Technology, Atlanta # Microsoft Research, Redmond Traditional
More informationWeb Information Retrieval using WordNet
Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT
More informationNLP Techniques in Knowledge Graph. Shiqi Zhao
NLP Techniques in Knowledge Graph Shiqi Zhao Outline Baidu Knowledge Graph Knowledge Mining Semantic Computation Zhixin for Baidu PC Search Knowledge graph Named entities Normal entities Exact answers
More informationEffective Latent Space Graph-based Re-ranking Model with Global Consistency
Effective Latent Space Graph-based Re-ranking Model with Global Consistency Feb. 12, 2009 1 Outline Introduction Related work Methodology Graph-based re-ranking model Learning a latent space graph A case
More informationA Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition
A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition Ana Zelaia, Olatz Arregi and Basilio Sierra Computer Science Faculty University of the Basque Country ana.zelaia@ehu.es
More informationA graph-based method to improve WordNet Domains
A graph-based method to improve WordNet Domains Aitor González, German Rigau IXA group UPV/EHU, Donostia, Spain agonzalez278@ikasle.ehu.com german.rigau@ehu.com Mauro Castillo UTEM, Santiago de Chile,
More informationA Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition
A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition Ana Zelaia, Olatz Arregi and Basilio Sierra Computer Science Faculty University of the Basque Country ana.zelaia@ehu.es
More informationTable2Vec: Neural Word and Entity Embeddings for Table Population and Retrieval
Faculty of Science and Technology Department of Electrical Engineering and Computer Science Table2Vec: Neural Word and Entity Embeddings for Table Population and Retrieval Master s Thesis in Computer Science
More informationUncovering the Relational Web
Uncovering the Relational Web Michael J. Cafarella University of Washington mjc@cs.washington.edu Eugene Wu MIT eugene@csail.mit.edu Alon Halevy Google, Inc. halevy@google.com Yang Zhang MIT zhang@csail.mit.edu
More informationFast Inbound Top- K Query for Random Walk with Restart
Fast Inbound Top- K Query for Random Walk with Restart Chao Zhang, Shan Jiang, Yucheng Chen, Yidan Sun, Jiawei Han University of Illinois at Urbana Champaign czhang82@illinois.edu 1 Outline Background
More informationDisambiguating Entities Referred by Web Endpoints using Tree Ensembles
Disambiguating Entities Referred by Web Endpoints using Tree Ensembles Gitansh Khirbat Jianzhong Qi Rui Zhang Department of Computing and Information Systems The University of Melbourne Australia gkhirbat@student.unimelb.edu.au
More informationTowards Summarizing the Web of Entities
Towards Summarizing the Web of Entities contributors: August 15, 2012 Thomas Hofmann Director of Engineering Search Ads Quality Zurich, Google Switzerland thofmann@google.com Enrique Alfonseca Yasemin
More informationWEIGHTING QUERY TERMS USING WORDNET ONTOLOGY
IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.4, April 2009 349 WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY Mohammed M. Sakre Mohammed M. Kouta Ali M. N. Allam Al Shorouk
More informationEntity Discovery and Annotation in Tables
Entity Discovery and Annotation in Tables Gianluca Quercini Université Paris-Sud XI Laboratoire de Recherche en Informatique (LRI) gianluca.quercini@lri.fr Chantal Reynaud Université Paris-Sud XI Laboratoire
More informationUsing PageRank in Feature Selection
Using PageRank in Feature Selection Dino Ienco, Rosa Meo, and Marco Botta Dipartimento di Informatica, Università di Torino, Italy {ienco,meo,botta}@di.unito.it Abstract. Feature selection is an important
More informationMulti-label classification using rule-based classifier systems
Multi-label classification using rule-based classifier systems Shabnam Nazmi (PhD candidate) Department of electrical and computer engineering North Carolina A&T state university Advisor: Dr. A. Homaifar
More informationTop-k Keyword Search Over Graphs Based On Backward Search
Top-k Keyword Search Over Graphs Based On Backward Search Jia-Hui Zeng, Jiu-Ming Huang, Shu-Qiang Yang 1College of Computer National University of Defense Technology, Changsha, China 2College of Computer
More information