Computer-Aided Interaction. Lecture: Information Retrieval 2.
Lecture: Information Retrieval 2
Florian Metze, Usability Department, winter semester (WS) 2008/
Schedule: Thursdays, 10:15-11:45; TEL20, Auditorium

Topics:
- Introduction, Q&U Lab
- Statistics, classification
- Fundamentals and ASR
- ASR applications and systems
- Future ASR
- Fundamentals and rule-based translation
- Statistical translation (10:15-11:45)
- Speech translation systems (12:15-13:45)
- (Spoken) dialog systems (10:15-11:45)
- Multimodal interfaces (12:15-13:45)
- Fusion/Fission: Audio, Video, Keyboard, Touch, (10:15-11:45)
- Applications & review (12:15-13:45)
- Information Retrieval, document search (10:15-11:45)
- Information Retrieval 2, expert search (12:15-13:45)

VL CGI FMe 13 - IR2.ppt X 1

Human Computer Interfaces: Example Information Retrieval
- Introduction
- Conceptual model
- Relationship of IR and HCI and HCC
- Latent Semantic Indexing
- The ESP Game
- Assessing the retrieval
- Future Directions

HCI: Information Retrieval Model
- Content-centered retrieval as matching document representations to query representations
- A powerful paradigm that has driven IR R&D for half a century
- Evaluation metric is the effectiveness of the match (e.g., recall and precision)
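
The evaluation metrics named on the slide, recall and precision, can be illustrated with a minimal sketch. The function name `precision_recall` and the document IDs are hypothetical and only serve the example; the definitions are the standard set-based ones (precision = relevant retrieved / retrieved, recall = relevant retrieved / relevant).

```python
def precision_recall(retrieved, relevant):
    """Set-based precision and recall for a single query.

    retrieved: document IDs returned by the system (hypothetical IDs)
    relevant:  document IDs judged relevant for the query
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)  # relevant documents that were retrieved
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Four documents returned, three relevant overall, two of them found.
precision, recall = precision_recall(["d1", "d2", "d3", "d4"], ["d2", "d4", "d7"])
print(precision, recall)
```

Both values measure only the match between query and document representations; the user-oriented measures discussed later (time, accuracy, satisfaction) are not captured here.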

HCI-IR: Content Trend
- Content features (queries too)
  - Not only text: statistics, images, music, code, streams, bio-chemical
  - Multimedia, multilingual
  - Dynamic
    - Temporal (e.g., blogs, wikis, sensor streams)
    - Conditional (e.g., computed links, recommendations)
- Content relationships
  - Hyperlinks, new metadata, aggregations
  - Digital libraries, personal collections
- Content acquires history context retrieval

HCI-IR: Responses to Content Trend
- Link analysis
- Multiple sources of evidence (fusion)
  - Authors' words (e.g., full text IR)
  - Indexer/abstractor words (e.g., OPACs)
  - Authors' citations/links (e.g., Google)
  - Readers' search paths (e.g., recommenders, opinion miners: "collaborative filtering")
  - Machine-generated features and relationships ("mining")
- Three key challenges:
  - How do we generate references?
  - What new relationships can we leverage (human and machine)?
  - How can we integrate multiple sources of evidence?

HCI-IR: User Trend
- Technical advances and technical literacy allow us to leverage information seeker intelligence
- Rather than depending solely on matching algorithms, focus on the flow of representations and actions in situ as people think with these new tools and information resources
- To leverage human intelligence and effort, people must assume responsibilities: beyond the two-word, single query
- Web and TV remotes have legitimized browsing as human-controlled information seeking
- Aim at understanding rather than retrieval
- Responses to User Trend:
  - Adapt techniques to WWW
  - Relevance feedback
  - Query expansion
  - User modeling/profiles, SDI services
  - Recommender systems: explicit and implicit models
  - Capture everything (e.g., Lifebits)
  - User interfaces: dynamic queries, agile views, tuning of IR systems

HCI: HCC Model of HCI
- A user-oriented model that has driven R&D
- Evaluation based on user time, accuracy, and satisfaction

HCI: WWW Trends
- First decade of WWW as great equalizer (we all get impoverished, but we admit MANY more people)
- Universal access
- Platform independence (lots of devices)
- Enhanced browsers, specialized browsers
- Interface Servers
- Social awareness (user is not alone)

HCI-IR: An Expanded Model
Think of IR from the perspective of an active human with information needs, information skills, powerful IR resources (that include other humans), and situated in global and local connected communities, all of which evolve over time.
- Get people closer to the information they need
  - Closer to the backend
  - Closer to the meaning
- Involve information professionals as integral to the IR system
- Increase responsibility as well as control
- Leverage more demanding and knowledgeable installed base
- Consider ubiquity, digital libraries, e-commerce as extended memories and tools (personal and shared)

HCI-IR: Key Challenges
- Linking conceptual interface to system backend
- Metadata generation
- Alternative representations and control mechanisms
- Raising user literacy and involvement
- Engaging without insulting or annoying
- Adding human intelligence to the system
- Moving beyond retrieval to understanding
- Context

HCI Example 1: WordNet
WordNet is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. Synsets are interlinked by means of conceptual-semantic and lexical relations. The resulting network of meaningfully related words and concepts can be navigated with the browser. WordNet's structure makes it a useful tool for computational linguistics and natural language processing. WordNet relations can be expressed in OWL, RDFS or other ontology markup languages.

HCI Example 2: The ESP Game
- How to label images? On the web?
- A clever way to automate metadata generation
- Image annotation/recognition is very difficult
- Labeling the Web using Human Computation
  - Two-player game on the web
  - Players get points for generating keywords describing a picture, if the other player agrees
  - Taboo words exist, too
  - Accuracy assured by over-sampling
  - Social aspect ("become top labeler") and fun as motivation
- Funded by NSF, conceived by Luis von Ahn at CMU. Now sold to Google.

HCI Example 3: Latent Semantic Indexing (LSI/LSA)
How LSA works:
- LSA uses a term-document matrix which describes the occurrences of terms in documents
- It is a sparse matrix whose rows correspond to terms (typically stemmed words) and whose columns correspond to documents; the matrix elements are tf-idf weights
- LSA transforms the occurrence matrix into a relation between the terms and some concepts, and a relation between those concepts and the documents. Thus the terms and documents are now indirectly related through the concepts.
- LSA finds a low-rank approximation to the term-document matrix
- The consequence of the rank lowering is that some dimensions are combined and depend on more than one term:
  {(car), (truck), (flower)} -> {(w1 * car + w2 * truck), (flower)}, for some weights w1 and w2

The new concept space typically can be used to:
- Compare the documents in the concept space (data clustering, document classification)
- Find similar documents across languages, after analyzing a base set of translated documents (cross-language retrieval)
- Find relations between terms (synonymy and polysemy)
- Given a query of terms, translate it into the concept space, and find matching documents (information retrieval)

Synonymy and polysemy are fundamental problems in natural language processing:
- Synonymy is the phenomenon where different words describe the same idea
- Polysemy is the phenomenon where the same word has multiple meanings

Related technique: Principal Component Analysis (PCA) in term space
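
The rank-lowering step of LSA can be sketched on a toy matrix. This is only an illustration under stated assumptions: the counts are invented, raw counts stand in for tf-idf weights, and a simple power iteration with deflation replaces the full singular value decomposition a real system would take from a numerical library. All function names are hypothetical.

```python
import math


def transpose(m):
    return [list(col) for col in zip(*m)]


def matvec(m, x):
    return [sum(a * b for a, b in zip(row, x)) for row in m]


def norm(x):
    return math.sqrt(sum(v * v for v in x))


def top_singular_triplets(matrix, k, iters=200):
    """Approximate the k largest singular triplets (sigma, u, v)
    by power iteration on A^T A, deflating after each triplet."""
    a = [row[:] for row in matrix]
    triplets = []
    for _ in range(k):
        at = transpose(a)
        v = [1.0 / (j + 1) for j in range(len(a[0]))]  # fixed start vector
        for _step in range(iters):
            w = matvec(at, matvec(a, v))
            n = norm(w)
            if n == 0.0:
                break
            v = [x / n for x in w]
        u = matvec(a, v)
        sigma = norm(u)
        if sigma == 0.0:
            break
        u = [x / sigma for x in u]
        triplets.append((sigma, u, v))
        # Deflation: remove the component just found.
        a = [[a[i][j] - sigma * u[i] * v[j] for j in range(len(v))]
             for i in range(len(u))]
    return triplets


def doc_coordinates(triplets, doc_index):
    """Coordinates of one document in the concept space."""
    return [sigma * v[doc_index] for sigma, _u, v in triplets]


def cosine(x, y):
    nx, ny = norm(x), norm(y)
    return sum(a * b for a, b in zip(x, y)) / (nx * ny) if nx and ny else 0.0


# Rows are terms, columns are documents (invented counts).
terms = ["car", "truck", "flower"]
counts = [
    [2.0, 0.0, 1.0, 0.0],  # car
    [0.0, 2.0, 1.0, 0.0],  # truck
    [0.0, 0.0, 0.0, 3.0],  # flower
]

triplets = top_singular_triplets(counts, k=2)  # keep two concepts
raw_docs = transpose(counts)

# Document 0 mentions only "car", document 1 only "truck".
raw_similarity = cosine(raw_docs[0], raw_docs[1])
concept_similarity = cosine(doc_coordinates(triplets, 0),
                            doc_coordinates(triplets, 1))
print(raw_similarity, concept_similarity)
```

In the raw term space the "car" document and the "truck" document share no terms, so their cosine similarity is zero. After reduction to two concepts, one concept weights car and truck about equally, and the two documents end up with nearly identical coordinates, which is the synonymy effect described above.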

HCI and Computer Aided Interaction
- Automatic classification works best when its application is supported by humans with knowledge of the domain and the techniques at hand. (Gary Marchionini)
- Computers should learn!
- The Relation Browser tool for metadata mining

HCI: The Relation Browser
- A general purpose dynamic query interface for databases with a small number of facets (~10) and a small number of categories in each facet (~10)
- Easy to look ahead (overviews and previews)
- Couples interactive partitioning/exploration with string query
- Semi-automatic category generation and webpage classification
- Mousing over "Coal" reveals the distribution of coal-related web pages in the other categories
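
The look-ahead behaviour described for the Relation Browser (hovering over a category shows how its pages are distributed across the other facets) can be sketched as a simple cross-count. This is not the Relation Browser's actual implementation; the function `preview`, the facet names, and the records are hypothetical.

```python
from collections import Counter


def preview(records, facet, value):
    """For all records whose `facet` equals `value`, count how they
    are distributed over the categories of every other facet."""
    distribution = {}
    for record in records:
        if record.get(facet) != value:
            continue
        for other_facet, category in record.items():
            if other_facet != facet:
                distribution.setdefault(other_facet, Counter())[category] += 1
    return distribution


# Hypothetical page metadata with three facets.
pages = [
    {"topic": "Coal", "region": "East", "year": "2004"},
    {"topic": "Coal", "region": "West", "year": "2004"},
    {"topic": "Coal", "region": "East", "year": "2005"},
    {"topic": "Gas", "region": "East", "year": "2005"},
]

coal_preview = preview(pages, "topic", "Coal")
print(coal_preview["region"])
print(coal_preview["year"])
```

With about ten facets of about ten categories each, such counts are small enough to compute on the client, which is what makes look-ahead without a server round trip plausible.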

HCI: The Relation Browser
1) Acquire data:
- Crawl sites/Internet
  - Formats?
  - Mirror locally?
- Clean data
  - Remove non-alphabeticals
  - Lowercase all
  - WordNet validate words
  - Stem or not stem
2) Build representation:
- Select data to include
  - Pages to include/exclude
  - ASCII text from titles, link anchors, metadata tags
- Build raw term-document matrix
  - Pages as rows (observations)
  - Terms as columns (variables)
  - Frequencies or TF-IDF weights in cells

HCI: The Relation Browser
3) Filter data:
- Stop word lists
  - General terms
  - Domain specific terms
  - Web and navigation terms
  - Iteratively developed/refined
- Term discrimination filters (various)
  - Doc frequency interval
  - Interval augmented by 100 top freq
  - Empirical threshold (e.g., > 5 docs)
4) Project data onto lower dimensional space:
- First N principal components
- Latent semantic dimensions
- Independent components
- Reduces to narrower term-doc matrix
- Still kind of experimental
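
Steps 1 to 3 above can be sketched in a few lines: clean the text, drop stop words, apply a document-frequency threshold, and build a page-by-term matrix with TF-IDF weights. The stop word list, the sample pages, the `min_docs` parameter, and the weighting `tf * log(N / df)` (one common TF-IDF variant) are assumptions for illustration, not the settings used in the lecture.

```python
import math
import re
from collections import Counter

# Illustrative stop words: general terms plus web/navigation terms.
STOP_WORDS = {"the", "a", "of", "and", "in", "home", "click"}


def tokenize(text):
    """Lowercase, keep alphabetic runs only, drop stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower())
            if t not in STOP_WORDS]


def build_tfidf(pages, min_docs=2):
    """Return (vocabulary, matrix) with pages as rows and terms as columns.

    Terms occurring in fewer than `min_docs` pages are filtered out
    (an empirical document-frequency threshold).
    """
    counts = [Counter(tokenize(page)) for page in pages]
    doc_freq = Counter(term for c in counts for term in c)
    vocabulary = sorted(t for t, df in doc_freq.items() if df >= min_docs)
    n = len(pages)
    matrix = [[c[t] * math.log(n / doc_freq[t]) for t in vocabulary]
              for c in counts]
    return vocabulary, matrix


pages = [
    "Coal production in the USA",
    "Coal prices and coal exports",
    "Natural gas prices",
    "Click home",
]

vocabulary, matrix = build_tfidf(pages, min_docs=2)
print(vocabulary)
for row in matrix:
    print(row)
```

The resulting matrix is the input that step 4 would project onto a lower-dimensional space. Stemming and WordNet validation are left out of this sketch.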

HCI: The Relation Browser
5) Cluster documents:
- K-means, e.g., with k << 100
- EM yields a probability distribution for each document over the clusters (so a document has some probability of belonging to each cluster)
6) Evaluate clusters and name topics:
- Create usable output
  - A web page with the clusters and number of documents in each
  - For each cluster, a list of the top 10 most frequently occurring terms; a list of the top 10 log-odds ratio terms; and links to all the pages in that cluster
- Eyeball the terms, pick a cluster (topic) name (names); else iterate previous steps

HCI: The Relation Browser
7) Assign pages to topics:
- For every page, compute the probability distribution (using EM model) over each cluster/topic
- Select a threshold for placing pages into topics (most easily go into only one topic)
8) Create other facets (views) and display:
- Use a set of heuristic rules to place pages into geographic categories
- Use a set of heuristic rules to place pages into temporal categories (ad hoc at present)
- Map the files onto the RB relational scheme
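
Step 7 can be sketched as a threshold over per-page topic probabilities. The probabilities below are invented stand-ins for the output of an EM clustering model, and the function name `assign_pages`, the topic names, and the threshold value are hypothetical.

```python
def assign_pages(posteriors, topic_names, threshold=0.4):
    """Place each page into every topic whose probability reaches
    the threshold; pages below the threshold everywhere stay unassigned."""
    assignments = {}
    for page, probabilities in posteriors.items():
        assignments[page] = [
            name for name, p in zip(topic_names, probabilities)
            if p >= threshold
        ]
    return assignments


topic_names = ["energy", "prices", "environment"]

# Invented probability distributions over the three topics.
posteriors = {
    "coal.html": [0.90, 0.05, 0.05],
    "market.html": [0.45, 0.45, 0.10],
    "misc.html": [0.34, 0.33, 0.33],
}

assignments = assign_pages(posteriors, topic_names, threshold=0.4)
for page, topics in assignments.items():
    print(page, topics)
```

A higher threshold pushes most pages into a single topic, as the slide notes, at the cost of leaving ambiguous pages unassigned. How to treat such pages is a design choice this sketch leaves open.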

HCI: Interaction Principles and Caveats (Incomplete)
Principles:
- Look ahead without penalty
- Minimize scrolling and clicking
- Alternative ways to slice and dice
- Closely couple search, browse, and examine
- Continuous engagement: useful attractors
- Treasures to surface
Caveats:
- Scalability (getting metadata to client side)
- Metadata crucial: e.g., working on automatically creating partitions
- Increasing expectations about useful results (answers!)

HCI: Long-term IR paradigm
Information interaction as core life cycle process:
- The examples represent early ways to get the information seeker more involved in the information seeking process; there is plenty more to do.
- As with eating, we have varying expectations, invest different levels of effort, and use diverse and ubiquitous infrastructures.
- The key challenge is to span boundaries between cyberinfrastructure and the real world.
Coda:
- Our hopes that we can create systems (solutions) that do IR for us are unreasonable.
- Our expectations that people can find and understand information without thinking and investing effort are unreasonable.
- Aim to develop systems that involve people and machines continuously learning and changing together.
- Google would not work as well next month if there were not a large group of employees tuning the system, adding new spam filters, and crawlers checking out pages and links continuously.

Backup
More informationDATA MINING II - 1DL460. Spring 2017
DATA MINING II - 1DL460 Spring 2017 A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt17 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationWordNet-based User Profiles for Semantic Personalization
PIA 2005 Workshop on New Technologies for Personalized Information Access WordNet-based User Profiles for Semantic Personalization Giovanni Semeraro, Marco Degemmis, Pasquale Lops, Ignazio Palmisano LACAM
More informationSEO: SEARCH ENGINE OPTIMISATION
SEO: SEARCH ENGINE OPTIMISATION SEO IN 11 BASIC STEPS EXPLAINED What is all the commotion about this SEO, why is it important? I have had a professional content writer produce my content to make sure that
More informationOntology Based Prediction of Difficult Keyword Queries
Ontology Based Prediction of Difficult Keyword Queries Lubna.C*, Kasim K Pursuing M.Tech (CSE)*, Associate Professor (CSE) MEA Engineering College, Perinthalmanna Kerala, India lubna9990@gmail.com, kasim_mlp@gmail.com
More informationBig Data Analytics CSCI 4030
High dim. data Graph data Infinite data Machine learning Apps Locality sensitive hashing PageRank, SimRank Filtering data streams SVM Recommen der systems Clustering Community Detection Queries on streams
More informationMarketing & Back Office Management
Marketing & Back Office Management Menu Management Add, Edit, Delete Menu Gallery Management Add, Edit, Delete Images Banner Management Update the banner image/background image in web ordering Online Data
More informationDepartment of Computer Science and Engineering B.E/B.Tech/M.E/M.Tech : B.E. Regulation: 2013 PG Specialisation : _
COURSE DELIVERY PLAN - THEORY Page 1 of 6 Department of Computer Science and Engineering B.E/B.Tech/M.E/M.Tech : B.E. Regulation: 2013 PG Specialisation : _ LP: CS6007 Rev. No: 01 Date: 27/06/2017 Sub.
More informationPurpose, features and functionality
Topic 6 Purpose, features and functionality In this topic you will look at the purpose, features, functionality and range of users that use information systems. You will learn the importance of being able
More informationThings to consider when using Semantics in your Information Management strategy. Toby Conrad Smartlogic
Things to consider when using Semantics in your Information Management strategy Toby Conrad Smartlogic toby.conrad@smartlogic.com +1 773 251 0824 Some of Smartlogic s 250+ Customers Awards Trend Setting
More informationPart I: Data Mining Foundations
Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?
More informationKristina Lerman University of Southern California. This lecture is partly based on slides prepared by Anon Plangprasopchok
Kristina Lerman University of Southern California This lecture is partly based on slides prepared by Anon Plangprasopchok Social Web is a platform for people to create, organize and share information Users
More informationInformation Retrieval. Lecture 9 - Web search basics
Information Retrieval Lecture 9 - Web search basics Seminar für Sprachwissenschaft International Studies in Computational Linguistics Wintersemester 2007 1/ 30 Introduction Up to now: techniques for general
More informationVersion 11
The Big Challenges Networked and Electronic Media European Technology Platform The birth of a new sector www.nem-initiative.org Version 11 1. NEM IN THE WORLD The main objective of the Networked and Electronic
More informationD B M G Data Base and Data Mining Group of Politecnico di Torino
DataBase and Data Mining Group of Data mining fundamentals Data Base and Data Mining Group of Data analysis Most companies own huge databases containing operational data textual documents experiment results
More informationOverview MULTIMEDIA INFORMATION RETRIEVAL. Search Engines. Information Retrieval. Explanation. Van Rijsbergen
MULTIMEDIA INFORMATION RETRIEVAL Arjen P. de Vries arjen@acm.org Overview Information Retrieval Text Retrieval Multimedia Retrieval Recent Developments Research Topics Centrum voor Wiskunde en Informatica
More informationIntroduction to Information Retrieval
Introduction to Information Retrieval Mohsen Kamyar چهارمین کارگاه ساالنه آزمایشگاه فناوری و وب بهمن ماه 1391 Outline Outline in classic categorization Information vs. Data Retrieval IR Models Evaluation
More informationInstructor: Stefan Savev
LECTURE 2 What is indexing? Indexing is the process of extracting features (such as word counts) from the documents (in other words: preprocessing the documents). The process ends with putting the information
More informationApplying Semantic Web in Mobile and Ubiquitous Computing: Will Policy-Awareness Help?
Applying Semantic Web in Mobile and Ubiquitous Computing: Will Policy-Awareness Help? Ora Lassila Research Fellow Nokia Research Center Burlington, MA, USA Cambridge November 2005 1 NOKIA lassila-isvc2005-slides.ppt
More information: Semantic Web (2013 Fall)
03-60-569: Web (2013 Fall) University of Windsor September 4, 2013 Table of contents 1 2 3 4 5 Definition of the Web The World Wide Web is a system of interlinked hypertext documents accessed via the Internet
More informationInformation Retrieval. (M&S Ch 15)
Information Retrieval (M&S Ch 15) 1 Retrieval Models A retrieval model specifies the details of: Document representation Query representation Retrieval function Determines a notion of relevance. Notion
More information