Master Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala
|
|
- Tyler Waters
- 6 years ago
- Views:
Transcription
1 Master Project Various Aspects of Recommender Systems May 2nd, 2017 Master project SS17 Albert-Ludwigs-Universität Freiburg Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala
2 Agenda Organization Recommender Systems Topics - Finding complementary products (Anthony) - Cross-domain recommendations (Anthony) - Scientific Paper recommendation (Anas) - Recommending new Wikipedia articles (Michael) - Recommending references for (scientific) texts (Michael) 2
3 Requirements Study regulations (Studienordnung) - 16 ECTS 480 hours Master project - Team size: 1-3 students - Project report: ~10-12 pages per student - Short presentations: 2-3 (individual as needed) - Final presentation: 25 min Some preconditions - Recommended lecture Data Analysis and Query Language or similar 3
4 General goals Collective work on a project Gain experience in research and development method Improve individual programming skills Incorporate in new topics (Semantic Web, Recommender systems, ) Learn about problems of larger projects 4
5 Assessment Workload of every student must be clearly distinguishable Some Criteria - Methodology - The scope and difficulty of the work / implementation - Individual contribution - Team performance: a successful project has a positive effect - Role and participation in the team (coordination, etc.) - Quality of code (formatting, documentation, testing) - Individual report (project report) - Presentations (especially the final presentation) 5
6 Organization Meetings - Building 51 SR Website - Apply via HISinOne SVN repository Various Aspects of Recommender Systems SS17 6
7 Master projects 1. Finding complementary products (Anthony) 2. Cross-domain recommendations (Anthony) 3. Scientific Paper recommendation (Anas) 4. Recommending new Wikipedia articles (Michael) 5. Recommending references for (scientific) texts (Michael) Various Aspects of Recommender Systems SS17 7
8 Finding complementary products - 1 st project Products that are sold separately but that are used together, each creating a demand for the other Click 8
9 CP Traditional Approaches Data Mining (Association Rules) - Require transactions Limitations - Cold start for new items - Unpopular products - No explanations 10
10 CP Problem Predict if complementary relationship holds No transactions Using Semantic Web technologies - Linked Open Data (DBpedia): knowledge graph Based on product s meta-data - Publicly available 11
11 CP Solution scheme Learning to Identify Complementary Products from Dbpedia. Victor Anthony Arrascue Ayala, Trong-Nghia Cheng, Anas Alzoghbi, Georg Lausen Evaluation using Amazon s data 12
12 Goal: improving the scheme 1. Reproduce pipeline 2. Add new features - Observable graph-features - Meta-data: e.g. price 3. Extend evaluation - Other categories (Books, Movies and TV, etc.) - Ranking vs. classification 13
13 Compulsory task 1. Read the paper 2. Extract products attributes - Smallest category - Using NER tool (Alchemy / Spotlight) 3. Create knowledge graph - Crawl links between attributes from DBpedia 4. Data analysis - Products coverage - Interconnection s quality - Etc Various Aspects of Recommender Systems SS17 14
14 Submission of compulsory task Pre-requisite to participation Report - Introduction - Problem statement (1 page) - Solution proposal (1 page) - Data analysis (2 pages) - Related work (1 pages) 1 team, max. 3 students Deadline: , 12: Various Aspects of Recommender Systems SS17 15
15 Cross-domain recommendations - 2 nd project The research on cross-domain recommendation generally aims to exploit knowledge from a source domain D S to perform or improve recommendations in a target domain D T [RS Handbook]??? 16
16 CDRS Problem For each user - Given a set of likes for items in D S - Predict items in D T Using Semantic Web technologies - Linked Open Data (DBpedia): knowledge graph - Items are interconnected 17
17 CP Solution scheme (not assessed) Learning to Identify Complementary Products from Dbpedia. Victor Anthony Arrascue Ayala, Trong-Nghia Cheng, Anas Alzoghbi, Georg Lausen Evaluation using Facebook s data (likes) 18
18 CP Solution scheme (not assessed) Learning to Identify Complementary Products from Dbpedia. Victor Anthony Arrascue Ayala, Trong-Nghia Cheng, Anas Alzoghbi, Georg Lausen Evaluation using Facebook s data (likes) Liked? 19
19 Goal: try the scheme 1. Reproduce pipeline 2. Implement a recommender on top - Predict if a user would like the item - Predict top-k recommendations - *Optional: Integrate into RecRD4J 3. Evaluate the recommender - Use standard metrics: Precision, Recall 20
20 Compulsory task 1. Read the paper 2. Build infrastructure - Large dataset (approx. 15 GB) 3. Data analysis - For each domain (books, movies, music) - Interconnection s quality - Long-tail - Sparsity - Etc Various Aspects of Recommender Systems SS17 21
21 Submission of compulsory task Pre-requisite to participation Report - Introduction - Problem statement (1 page) - Solution proposal (1 page) - Data analysis (2 pages) - Related work (1 pages) 1 team, max. 3 students Deadline: , 12: Various Aspects of Recommender Systems SS17 22
22 Scientific Paper recommendation- 3 rd project Recommend Scientific papers to users Content-Based, Collaborative filtering and Hybrid Papers features (meta-data) - Textual features: Title, Abstract, Keyword list - Non-textual features: Publication year, Authors, Venue, Publisher, 23
23 Scientific Paper recommendation- 3 rd project Textual paper representation Term Extraction k 1 k i k i+1 k n 1 1 tf-idf i+1 tf-idf n Paper Paper Vector Various Aspects of Recommender Systems SS17 24
24 Scientific Paper recommendation- 3 rd project Rating Matrix 25
25 HyPRec Master Project WS 2016 Scientific papers recommender Probabilistic Topic Modeling (LDA) Matrix factorization (ALS Algorithm) Python GitHub 26
26 HyPRec - Architecture Evaluator Metrics Calculator Train-Test splitter MRR, NDCG, Recall User-Based K-Fold Split Recommender Papers Model Content-Based Filtering Collaborative Filtering Hybrid Item-based CBF Matrix Factorization CF Weighted (Linear Combination) Citeulike Dataset (csv files) Data Parser Mysql DB Textual representation Latent topics Features Tf-IDF LDA Publication year, authors, publisher,... 27
27 Regulations One team max 3 students Weekly meetings Programming language: Python Various Aspects of Recommender Systems SS17 28
28 Regulations Compulsory task (Deadline: , Pre-requisite to participation) - Get familiar with HyPRec - Implement a simple Recommender (User-based CF) - Submit evaluation results (small presentation) Starting Report (Submission: ) - Problem statement (1 page) - Solution proposal (1 page) Various Aspects of Recommender Systems SS17 29
29 New Wikipedia Article Recommendation - 4 th project 30
30 Motivation: Writing New Wikipedia Articles Dan Fredinburg Michael Slager What to write about? Adult Beginners Oleg Kalashnikov LG G4 What to do? 1. Use list of requested articles 2. Read news or consume other media. Automatically recommend relevant novel Wikipedia articles based on news stream. 31
31 Distinguish between notable and not-notable entities Various Aspects of Recommender Systems SS17
32 Approach: Use diff between Wikipedia dumps 33
33 Existing Approach for Recommending New Wikipedia Articles see Färber et al.: On Emerging Entity Detection, EKAW
34 Task Build a live system for Wikipedia article recommendation. 35
35 Task Improve the system via - Better selection of news sources - Distributed processing of news articles (especially text annotation) - Considering also very recently added Wikipedia pages - Find and implement better features / adapt existing features - Improve binary classification, e.g., by using a Recurrent Neural Network. - Using word embeddings for better representation of candidates in news articles. - Using other Knowledge Graphs, e.g., Wikidata or CrunchBase. 36
36 Compulsory task 1. Read related work (esp., On Emerging Entity Detection, EKAW 2016). 2. Extract Wikipedia articles which were inserted between two Wikipedia dumps (given the Wikipedia indices). 3. Annotate news articles (from between the Wikipedia versions) via an entity linking tool and extract noun phrases. 4. Calculate statistics about annotations. 5. Correlate new Wikipedia articles and their mentions with metainformation of news articles (e.g., which sources are suitable for predicting new Wikipedia articles). 37
37 Submission of compulsory task 1 team, max. 2 students Report, Deadline: , 12:00, Pre-requisite to participation - Introduction (1 page) - Data analysis (2 pages) - Related work (1 page) Project proposal ( ) - Additional sections: Problem statement (1 page), proposed approach/improvements of the system (2 pages), proposed evaluation (1 page) 38
38 Citation Recommendation - 5 th project Idea: Enrich (scientific) text with citation markers (e.g, [1] ) and references. 39
39 Approach 1. Create model: - Extract citations with context from publication corpus. - Develop & implement features for ranking publications. 2. Apply model: - Extract citation contexts from input text. - Determine which publications to cite in which context. - Add citations to text. 40
40 Useful Data Sets Scholarly - 101k papers in computer science domain, PDF+metadata arxiv.org - Over 1M papers (PDF+metadata) - Different fields: Physics, Mathematics, Computer Science, Quantitative Biology, Quantitative Finance and Statistics CiteSeerX - Database with publications and citations - Ca. 7M papers DBLP, Microsoft Academic Graph, 41
41 Compulsory task Read related work Analyze and compare existing data sets for citation recommendation, including - citation context extraction - publication meta-data retrieval - citation graph creation - incorporating external data sets (e.g., DBLP, PageRank, ) 42
42 Submission of compulsory task 1 team, max. 3 students Report, Deadline: , 12:00, Pre-requisite for participation - Introduction (1 page) - Analysis & comparison of data sets and tools (2 pages) - Related work (for task in general) (2 pages) Project proposal ( ) - Additional sections: Problem statement (1 page), proposed approach (2 pages), proposed evaluation (1 page) 43
43 Thank you! Any questions? 44
USC Viterbi School of Engineering
Introduction to Computational Thinking and Data Science USC Viterbi School of Engineering http://www.datascience4all.org Term: Fall 2016 Time: Tues- Thur 10am- 11:50am Location: Allan Hancock Foundation
More informationOpen Research Online The Open University s repository of research publications and other research outputs
Open Research Online The Open University s repository of research publications and other research outputs The Smart Book Recommender: An Ontology-Driven Application for Recommending Editorial Products
More informationRecommender Systems: Practical Aspects, Case Studies. Radek Pelánek
Recommender Systems: Practical Aspects, Case Studies Radek Pelánek 2017 This Lecture practical aspects : attacks, context, shared accounts,... case studies, illustrations of application illustration of
More informationNatural Language Processing. SoSe Question Answering
Natural Language Processing SoSe 2017 Question Answering Dr. Mariana Neves July 5th, 2017 Motivation Find small segments of text which answer users questions (http://start.csail.mit.edu/) 2 3 Motivation
More informationSchool of Computer Science
School of Computer Science Computer Science (CS) modules CS1002 Object-Oriented Programming Computer Science - 1000 & 2000 Level - 2016/7 - December 2016 SCOTCAT Credits: 20 SCQF Level 7 Semester: 1 3.00
More informationIRCE at the NTCIR-12 IMine-2 Task
IRCE at the NTCIR-12 IMine-2 Task Ximei Song University of Tsukuba songximei@slis.tsukuba.ac.jp Yuka Egusa National Institute for Educational Policy Research yuka@nier.go.jp Masao Takaku University of
More informationMultimedia Information Systems
Multimedia Information Systems Samson Cheung EE 639, Fall 2004 Lecture 6: Text Information Retrieval 1 Digital Video Library Meta-Data Meta-Data Similarity Similarity Search Search Analog Video Archive
More informationSearching the Deep Web
Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index
More informationSOURCERER: MINING AND SEARCHING INTERNET- SCALE SOFTWARE REPOSITORIES
SOURCERER: MINING AND SEARCHING INTERNET- SCALE SOFTWARE REPOSITORIES Introduction to Information Retrieval CS 150 Donald J. Patterson This content based on the paper located here: http://dx.doi.org/10.1007/s10618-008-0118-x
More informationMusic Recommendation with Implicit Feedback and Side Information
Music Recommendation with Implicit Feedback and Side Information Shengbo Guo Yahoo! Labs shengbo@yahoo-inc.com Behrouz Behmardi Criteo b.behmardi@criteo.com Gary Chen Vobile gary.chen@vobileinc.com Abstract
More informationCS54701: Information Retrieval
CS54701: Information Retrieval Basic Concepts 19 January 2016 Prof. Chris Clifton 1 Text Representation: Process of Indexing Remove Stopword, Stemming, Phrase Extraction etc Document Parser Extract useful
More informationEnhanced retrieval using semantic technologies:
Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008
More informationRecommender Systems - Content, Collaborative, Hybrid
BOBBY B. LYLE SCHOOL OF ENGINEERING Department of Engineering Management, Information and Systems EMIS 8331 Advanced Data Mining Recommender Systems - Content, Collaborative, Hybrid Scott F Eisenhart 1
More informationQuery Expansion using Wikipedia and DBpedia
Query Expansion using Wikipedia and DBpedia Nitish Aggarwal and Paul Buitelaar Unit for Natural Language Processing, Digital Enterprise Research Institute, National University of Ireland, Galway firstname.lastname@deri.org
More informationCourse Design Document: IS202 Data Management. Version 4.5
Course Design Document: IS202 Data Management Version 4.5 Friday, October 1, 2010 Table of Content 1. Versions History... 4 2. Overview of the Data Management... 5 3. Output and Assessment Summary... 6
More informationIntroduction to Data Mining
Introduction to Data Mining Lecture #7: Recommendation Content based & Collaborative Filtering Seoul National University In This Lecture Understand the motivation and the problem of recommendation Compare
More informationScience 2.0 VU Processing Science 2.0 Data, Content Mining
W I S S E N n T E C H N I K n L E I D E N S C H A F T Science 2.0 VU Processing Science 2.0 Data, Content Mining Elisabeth Lex KTI, TU Graz WS 2015/16 u www.tugraz.at Agenda Repetition from last time:
More informationQuestion Answering Systems
Question Answering Systems An Introduction Potsdam, Germany, 14 July 2011 Saeedeh Momtazi Information Systems Group Outline 2 1 Introduction Outline 2 1 Introduction 2 History Outline 2 1 Introduction
More informationUnstructured Data. CS102 Winter 2019
Winter 2019 Big Data Tools and Techniques Basic Data Manipulation and Analysis Performing well-defined computations or asking well-defined questions ( queries ) Data Mining Looking for patterns in data
More informationIntroduction to Information Retrieval
Introduction to Information Retrieval Mohsen Kamyar چهارمین کارگاه ساالنه آزمایشگاه فناوری و وب بهمن ماه 1391 Outline Outline in classic categorization Information vs. Data Retrieval IR Models Evaluation
More informationIALP 2016 Improving the Effectiveness of POI Search by Associated Information Summarization
IALP 2016 Improving the Effectiveness of POI Search by Associated Information Summarization Hsiu-Min Chuang, Chia-Hui Chang*, Chung-Ting Cheng Dept. of Computer Science and Information Engineering National
More informationSearching the Deep Web
Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index
More informationKristina Lerman University of Southern California. This lecture is partly based on slides prepared by Anon Plangprasopchok
Kristina Lerman University of Southern California This lecture is partly based on slides prepared by Anon Plangprasopchok Social Web is a platform for people to create, organize and share information Users
More informationKnowledge Discovery and Data Mining 1 (VO) ( )
Knowledge Discovery and Data Mining 1 (VO) (707.003) Data Matrices and Vector Space Model Denis Helic KTI, TU Graz Nov 6, 2014 Denis Helic (KTI, TU Graz) KDDM1 Nov 6, 2014 1 / 55 Big picture: KDDM Probability
More informationProperty1 Property2. by Elvir Sabic. Recommender Systems Seminar Prof. Dr. Ulf Brefeld TU Darmstadt, WS 2013/14
Property1 Property2 by Recommender Systems Seminar Prof. Dr. Ulf Brefeld TU Darmstadt, WS 2013/14 Content-Based Introduction Pros and cons Introduction Concept 1/30 Property1 Property2 2/30 Based on item
More informationInformation Retrieval. CS630 Representing and Accessing Digital Information. What is a Retrieval Model? Basic IR Processes
CS630 Representing and Accessing Digital Information Information Retrieval: Retrieval Models Information Retrieval Basics Data Structures and Access Indexing and Preprocessing Retrieval Models Thorsten
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationBrowser-Oriented Universal Cross-Site Recommendation and Explanation based on User Browsing Logs
Browser-Oriented Universal Cross-Site Recommendation and Explanation based on User Browsing Logs Yongfeng Zhang, Tsinghua University zhangyf07@gmail.com Outline Research Background Research Topic Current
More informationTriRank: Review-aware Explainable Recommendation by Modeling Aspects
TriRank: Review-aware Explainable Recommendation by Modeling Aspects Xiangnan He, Tao Chen, Min-Yen Kan, Xiao Chen National University of Singapore Presented by Xiangnan He CIKM 15, Melbourne, Australia
More informationPart I: Data Mining Foundations
Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web and the Internet 2 1.3. Web Data Mining 4 1.3.1. What is Data Mining? 6 1.3.2. What is Web Mining?
More informationIntroduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.
Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p. 6 What is Web Mining? p. 6 Summary of Chapters p. 8 How
More informationInformation Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science
Information Retrieval CS 6900 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Information Retrieval Information Retrieval (IR) is finding material of an unstructured
More informationOutline. Database Theory. Prerequisites and Admission. Classes VU , SS 2018
Database Theory Database Theory Outline Database Theory VU 181.140, SS 2018 0. General Information Reinhard Pichler Institut für Informationssysteme Arbeitsbereich DBAI Technische Universität Wien 6 March,
More informationChapter 2. Architecture of a Search Engine
Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them
More informationLecture 0: Overview of cs1106/cs6503
Lecture 0: Overview of cs1106/cs6503 cs1106+ Overview Dr Kieran T. Herley Department of Computer Science University College Cork 2018/19 KH (09/10/18) Lecture 0: Overview of cs1106/cs6503 2018/19 1 / 16
More informationSteering Committee Meeting
Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationKnowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.
Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European
More informationText Analytics (Text Mining)
CSE 6242 / CX 4242 Apr 1, 2014 Text Analytics (Text Mining) Concepts and Algorithms Duen Horng (Polo) Chau Georgia Tech Some lectures are partly based on materials by Professors Guy Lebanon, Jeffrey Heer,
More informationLeveraging open source web resources to improve retrieval of low text content items
Leveraging open source web resources to improve retrieval of low text content items A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Ayush Singhal IN PARTIAL FULFILLMENT
More informationANNUAL REPORT Visit us at project.eu Supported by. Mission
Mission ANNUAL REPORT 2011 The Web has proved to be an unprecedented success for facilitating the publication, use and exchange of information, at planetary scale, on virtually every topic, and representing
More informationOutline. Possible solutions. The basic problem. How? How? Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity
Outline Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity Lecture 10 CS 410/510 Information Retrieval on the Internet Query reformulation Sources of relevance for feedback Using
More informationNERD workshop. Luca ALMAnaCH - Inria Paris. Berlin, 18/09/2017
NERD workshop Luca Foppiano @ ALMAnaCH - Inria Paris Berlin, 18/09/2017 Agenda Introducing the (N)ERD service NERD REST API Usages and use cases Entities Rigid textual expressions corresponding to certain
More informationReal World Evaluation of Approaches to Research Paper Recommendation
Real World Evaluation of Approaches to Research Paper Recommendation Undergraduate Thesis Submitted in partial fulfillment of the requirements of BITS F422, Thesis by Siddharth Sankaran Dinesh 2012B3A7519G
More informationTIB AV-Portal. Margret Plank 19th of January 2015 TACC Meeting
TIB AV-Portal Margret Plank 19th of January 2015 TACC Meeting German National Library of Science and Technology (TIB) German National Library of Science and Technology for all areas of engineering as well
More informationELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 4. Prof. James She
ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 4 Prof. James She james.she@ust.hk 1 Selected Works of Activity 4 2 Selected Works of Activity 4 3 Last lecture 4 Mid-term
More informationSemantic Scholar. ICSTI Towards a More Efficient Review of Research Literature 11 September
Semantic Scholar ICSTI Towards a More Efficient Review of Research Literature 11 September 2018 Allen Institute for Artificial Intelligence (https://allenai.org/) Non-profit Research Institute in Seattle,
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationRecommendation Algorithms: Collaborative Filtering. CSE 6111 Presentation Advanced Algorithms Fall Presented by: Farzana Yasmeen
Recommendation Algorithms: Collaborative Filtering CSE 6111 Presentation Advanced Algorithms Fall. 2013 Presented by: Farzana Yasmeen 2013.11.29 Contents What are recommendation algorithms? Recommendations
More informationReference Framework for the FERMA Certification Programme
Brussels, 23/07/2015 Dear Sir/Madam, Subject: Invitation to Tender Reference Framework for the FERMA Certification Programme Background The Federation of European Risk Management Associations (FERMA) brings
More informationUniversity of Virginia Department of Computer Science. CS 4501: Information Retrieval Fall 2015
University of Virginia Department of Computer Science CS 4501: Information Retrieval Fall 2015 5:00pm-6:15pm, Monday, October 26th Name: ComputingID: This is a closed book and closed notes exam. No electronic
More informationEERQI Innovative Indicators and Test Results
This project is funded by the Socioeconomic Sciences and Humanities Section. EERQI Final Conference, Brussels, 15-16 March 2011 EERQI Innovative Indicators and Test Results Prof. Dr. Stefan Gradmann /
More informationKNOWLEDGE GRAPHS. Lecture 1: Introduction and Motivation. TU Dresden, 16th Oct Markus Krötzsch Knowledge-Based Systems
KNOWLEDGE GRAPHS Lecture 1: Introduction and Motivation Markus Krötzsch Knowledge-Based Systems TU Dresden, 16th Oct 2018 Introduction and Organisation Markus Krötzsch, 16th Oct 2018 Knowledge Graphs slide
More informationCitation Services for Institutional Repositories: Citebase Search. Tim Brody Intelligence, Agents, Multimedia Group University of Southampton
Citation Services for Institutional Repositories: Citebase Search Tim Brody Intelligence, Agents, Multimedia Group University of Southampton Content The Research Literature The Open Access Literature Why
More informationProgramming Technologies for Web Resource Mining
Programming Technologies for Web Resource Mining SoftLang Team, University of Koblenz-Landau Prof. Dr. Ralf Lämmel Msc. Johannes Härtel Msc. Marcel Heinz Motivation What are interesting web resources??
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW PAPER ON IMPLEMENTATION OF DOCUMENT ANNOTATION USING CONTENT AND QUERYING
More informationCreating Large-scale Training and Test Corpora for Extracting Structured Data from the Web
Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Robert Meusel and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {robert,heiko}@informatik.uni-mannheim.de
More informationEntity and Knowledge Base-oriented Information Retrieval
Entity and Knowledge Base-oriented Information Retrieval Presenter: Liuqing Li liuqing@vt.edu Digital Library Research Laboratory Virginia Polytechnic Institute and State University Blacksburg, VA 24061
More informationCITESEERX DATA: SEMANTICIZING SCHOLARLY PAPERS
CITESEERX DATA: SEMANTICIZING SCHOLARLY PAPERS Jian Wu, IST, Pennsylvania State University Chen Liang, IST, Pennsylvania State University Huaiyu Yang, EECS, Vanderbilt University C. Lee Giles, IST & CSE
More informationModern Retrieval Evaluations. Hongning Wang
Modern Retrieval Evaluations Hongning Wang CS@UVa What we have known about IR evaluations Three key elements for IR evaluation A document collection A test suite of information needs A set of relevance
More informationPart 11: Collaborative Filtering. Francesco Ricci
Part : Collaborative Filtering Francesco Ricci Content An example of a Collaborative Filtering system: MovieLens The collaborative filtering method n Similarity of users n Methods for building the rating
More informationRecommender Systems. Collaborative Filtering & Content-Based Recommending
Recommender Systems Collaborative Filtering & Content-Based Recommending 1 Recommender Systems Systems for recommending items (e.g. books, movies, CD s, web pages, newsgroup messages) to users based on
More informationUsing Linked Data and taxonomies to create a quick-start smart thesaurus
7) MARJORIE HLAVA Using Linked Data and taxonomies to create a quick-start smart thesaurus 1. About the Case Organization The two current applications of this approach are a large scientific publisher
More informationQuery Difficulty Prediction for Contextual Image Retrieval
Query Difficulty Prediction for Contextual Image Retrieval Xing Xing 1, Yi Zhang 1, and Mei Han 2 1 School of Engineering, UC Santa Cruz, Santa Cruz, CA 95064 2 Google Inc., Mountain View, CA 94043 Abstract.
More informationLecture 1: Introduction and Motivation Markus Kr otzsch Knowledge-Based Systems
KNOWLEDGE GRAPHS Introduction and Organisation Lecture 1: Introduction and Motivation Markus Kro tzsch Knowledge-Based Systems TU Dresden, 16th Oct 2018 Markus Krötzsch, 16th Oct 2018 Course Tutors Knowledge
More informationCreating a Recommender System. An Elasticsearch & Apache Spark approach
Creating a Recommender System An Elasticsearch & Apache Spark approach My Profile SKILLS Álvaro Santos Andrés Big Data & Analytics Solution Architect in Ericsson with more than 12 years of experience focused
More informationSeminar Recent Trends in Database Research
Seminar Recent Trends in Database Research Summer Term 2013 Lehrgebiet Informationssysteme Weiping Qu qu@cs.uni-kl.de AG Datenbanken und Informationssysteme AG Heterogene Informationssysteme Goals a) Familiarize
More informationMining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University Infinite data. Filtering data streams
/9/7 Note to other teachers and users of these slides: We would be delighted if you found this our material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify them
More informationCSE 494: Information Retrieval, Mining and Integration on the Internet
CSE 494: Information Retrieval, Mining and Integration on the Internet Midterm. 18 th Oct 2011 (Instructor: Subbarao Kambhampati) In-class Duration: Duration of the class 1hr 15min (75min) Total points:
More informationAUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS
AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS Nilam B. Lonkar 1, Dinesh B. Hanchate 2 Student of Computer Engineering, Pune University VPKBIET, Baramati, India Computer Engineering, Pune University VPKBIET,
More informationRecommender Systems 6CCS3WSN-7CCSMWAL
Recommender Systems 6CCS3WSN-7CCSMWAL http://insidebigdata.com/wp-content/uploads/2014/06/humorrecommender.jpg Some basic methods of recommendation Recommend popular items Collaborative Filtering Item-to-Item:
More informationIntroduction April 27 th 2016
Social Web Mining Summer Term 2016 1 Introduction April 27 th 2016 Dr. Darko Obradovic Insiders Technologies GmbH Kaiserslautern d.obradovic@insiders-technologies.de Outline for Today 1.1 1.2 1.3 1.4 1.5
More informationDatabases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016
+ Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis kostas.stefanidis@uta.fi http://www.uta.fi/sis/tie/dbir/index.html http://people.uta.fi/~kostas.stefanidis/dbir16/dbir16-main.html
More informationA Recommender System Based on Improvised K- Means Clustering Algorithm
A Recommender System Based on Improvised K- Means Clustering Algorithm Shivani Sharma Department of Computer Science and Applications, Kurukshetra University, Kurukshetra Shivanigaur83@yahoo.com Abstract:
More informationChrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO
Chrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO INDEX Proposal Recap Implementation Evaluation Future Works Proposal Recap Keyword Visualizer (chrome
More informationBing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer
Bing Liu Web Data Mining Exploring Hyperlinks, Contents, and Usage Data With 177 Figures Springer Table of Contents 1. Introduction 1 1.1. What is the World Wide Web? 1 1.2. A Brief History of the Web
More informationCriES 2010
CriES Workshop @CLEF 2010 Cross-lingual Expert Search - Bridging CLIR and Social Media Institut AIFB Forschungsgruppe Wissensmanagement (Prof. Rudi Studer) Organizing Committee: Philipp Sorg Antje Schultz
More informationUsing Linked Data to Reduce Learning Latency for e-book Readers
Using Linked Data to Reduce Learning Latency for e-book Readers Julien Robinson, Johann Stan, and Myriam Ribière Alcatel-Lucent Bell Labs France, 91620 Nozay, France, Julien.Robinson@alcatel-lucent.com
More informationSupervised classification of law area in the legal domain
AFSTUDEERPROJECT BSC KI Supervised classification of law area in the legal domain Author: Mees FRÖBERG (10559949) Supervisors: Evangelos KANOULAS Tjerk DE GREEF June 24, 2016 Abstract Search algorithms
More informationInformation Retrieval and Knowledge Organisation
Information Retrieval and Knowledge Organisation Knut Hinkelmann Content Information Retrieval Indexing (string search and computer-linguistic aproach) Classical Information Retrieval: Boolean, vector
More informationCS 124/LINGUIST 180 From Languages to Information
CS /LINGUIST 80 From Languages to Information Dan Jurafsky Stanford University Recommender Systems & Collaborative Filtering Slides adapted from Jure Leskovec Recommender Systems Customer X Buys CD of
More informationCitation Services for Institutional Repositories: Citebase Search. Tim Brody Intelligence, Agents, Multimedia Group University of Southampton
Citation Services for Institutional Repositories: Citebase Search Tim Brody Intelligence, Agents, Multimedia Group University of Southampton 28/04/2009 2 28/04/2009 3 Content The Open Access Literature
More informationCollaborative Filtering using Euclidean Distance in Recommendation Engine
Indian Journal of Science and Technology, Vol 9(37), DOI: 10.17485/ijst/2016/v9i37/102074, October 2016 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Collaborative Filtering using Euclidean Distance
More informationMGA Developing Interactive Systems (5 ECTS), spring 2017 (16 weeks)
MGA 672 - Developing Interactive Systems (5 ECTS), spring 2017 (16 weeks) Lecturer: Ilja Šmorgun ilja.smorgun@idmaster.eu, Sónia Sousa sonia.sousa@idmaster.eu Contact Details: All email communication regarding
More informationPre-Requisites: CS2510. NU Core Designations: AD
DS4100: Data Collection, Integration and Analysis Teaches how to collect data from multiple sources and integrate them into consistent data sets. Explains how to use semi-automated and automated classification
More informationSelf-tuning ongoing terminology extraction retrained on terminology validation decisions
Self-tuning ongoing terminology extraction retrained on terminology validation decisions Alfredo Maldonado and David Lewis ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin
More informationScopus. Information literacy in Chemistry. J une 14, 2011
Information literacy in Chemistry Scopus J une 14, 2011 BIBLIOGRAPHIC DATABASE electronic archive of bibliographic records that refer to published academic literature the records are structured and organized
More informationAutomatically Building Research Reading Lists
Automatically Building Research Reading Lists Michael D. Ekstrand 1 Praveen Kanaan 1 James A. Stemper 2 John T. Butler 2 Joseph A. Konstan 1 John T. Riedl 1 ekstrand@cs.umn.edu 1 GroupLens Research Department
More informationMultimedia Data Management M
ALMA MATER STUDIORUM - UNIVERSITÀ DI BOLOGNA Multimedia Data Management M Second cycle degree programme (LM) in Computer Engineering University of Bologna Course presentation Academic Year 2016/2017 Home
More informationMultimedia Data Management M
ALMA MATER STUDIORUM - UNIVERSITÀ DI BOLOGNA Multimedia Data Management M Second cycle degree programme (LM) in Computer Engineering University of Bologna Course presentation Academic Year 2016/2017 Home
More informationAuthor(s): Rahul Sami, 2009
Author(s): Rahul Sami, 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Noncommercial Share Alike 3.0 License: http://creativecommons.org/licenses/by-nc-sa/3.0/
More informationInformation Retrieval
Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,
More informationEffective Latent Space Graph-based Re-ranking Model with Global Consistency
Effective Latent Space Graph-based Re-ranking Model with Global Consistency Feb. 12, 2009 1 Outline Introduction Related work Methodology Graph-based re-ranking model Learning a latent space graph A case
More informationInternational Journal of Advance Engineering and Research Development. A Facebook Profile Based TV Shows and Movies Recommendation System
Scientific Journal of Impact Factor (SJIF): 4.72 International Journal of Advance Engineering and Research Development Volume 4, Issue 3, March -2017 A Facebook Profile Based TV Shows and Movies Recommendation
More informationJianyong Wang Department of Computer Science and Technology Tsinghua University
Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity
More informationSemantic Estimation for Texts in Software Engineering
Semantic Estimation for Texts in Software Engineering 汇报人 : Reporter:Xiaochen Li Dalian University of Technology, China 大连理工大学 2016 年 11 月 29 日 Oscar Lab 2 Ph.D. candidate at OSCAR Lab, in Dalian University
More informationDialog System & Technology Challenge 6 Overview of Track 1 - End-to-End Goal-Oriented Dialog learning
Dialog System & Technology Challenge 6 Overview of Track 1 - End-to-End Goal-Oriented Dialog learning Julien Perez 1 and Y-Lan Boureau 2 and Antoine Bordes 2 1 Naver Labs Europe, Grenoble, France 2 Facebook
More informationGDSA - Audiovisual Signal Management and Distribution
Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2018 205 - ESEIAAT - Terrassa School of Industrial, Aerospace and Audiovisual Engineering 739 - TSC - Department of Signal Theory
More informationAugust 2012 Daejeon, South Korea
Building a Web of Linked Entities (Part I: Overview) Pablo N. Mendes Free University of Berlin August 2012 Daejeon, South Korea Outline Part I A Web of Linked Entities Challenges Progress towards solutions
More informationText Analytics (Text Mining)
CSE 6242 / CX 4242 Text Analytics (Text Mining) Concepts and Algorithms Duen Horng (Polo) Chau Georgia Tech Some lectures are partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko,
More informationSAPIENT Automation project
Dr Maria Liakata Leverhulme Trust Early Career fellow Department of Computer Science, Aberystwyth University Visitor at EBI, Cambridge mal@aber.ac.uk 25 May 2010, London Motivation SAPIENT Automation Project
More information