Generation of Large Hypertext and Document Networks from Collective User Retrieval Patterns
|
|
- Eustace Hawkins
- 5 years ago
- Views:
Transcription
1 Generation of Large Hypertext and Document Networks from Collective User Retrieval Patterns Johan Bollen Old Dominion University Department of Computer Science jbollen July 5, 2002 Page 1
2 A Personal Introduction Johan Bollen 1. Principia Cybernetica Project, U. Brussels ( ) (a) Web site on philosophy of cybernetics (b) Designed to represent knowledge network, systems science encyclopedia (c) Collective effort of 4 editors (d) Adaptive Hypertext and models of navigation 2. Active Recommendation Project, Los Alamos National Laboratory ( ) (a) Focus on Distributed Information Systems (b) Generation of document proximities for DL recommendation systems (c) Application of spreading activation systems 3. Old Dominion University, Computer Science Department (a) Systems for scientometric analysis (b) Large scale applications of spreading activation for cross-resource recommendation (c) Temporal analysis of changes in user community preferences July 5, 2002 Page 2
3 Problem Statement 1. Initially: Hypertext (a) Design and use: Collection of documents need to be meaningfully linked and user retrieval needs to be made more efficient (b) Design: i. Manual Design: Uneconomical, and user preferences? ii. Taxonomies: Correspondance to user preferences? iii. IR methods: Dependance on text, and explicit queries (c) Use: i. Lost in Hyperspace ii. Users prefer Boolean searching 2. More generally: Digital Libraries (a) Desire to link collections and provide retrieval facilities beyond keyterm matching (b) Heterogeneous collections (format, language, etc) (c) Scalability issues July 5, 2002 Page 3
4 Basic Approach 1. Hypertext linking: (a) Shift from focus on design to usage (b) Patterns of retrieval determine generation of hyperlinks (c) Relations to Machine Learning: auto-associators and Hebbian learning 2. Hypertext retrieval: (a) Focus on use of network structure rather than document content (b) Recurrent spreading activation (c) Relations to CLEVER and PAGERANKER 3. Digital Libraries (a) Reconstruction of retrieval patterns from server logs (b) Generation of large scale document and journal networks (c) Network analysis for document and journal impact ranking July 5, 2002 Page 4
5 Presentation Structure 1. Background: Hypertext linking (a) The early experiments (b) Results and applications (c) Relevancy to general WWW interfaces 2. Digital Libraries (a) System for document linking from server logs (b) Journal and document impact ranking (as opposed to ISI IF) (c) Spreading Activation retrieval July 5, 2002 Page 5
6 USER USER MM Automated Hypertext Linking: first experiments MM Designer MM USER USER MM Learning Rules 0.03 MM USER USER MM MM Hypertext USER MM 0.4 Adaptive Hypertext USER MM USER MM 1. Replace explicit design or text generated links by user retrieval 2. Frequency of retrieval sequence indicates strength of associative relation 3. Shift from explicit author or designer preferences, to implicit user preferences July 5, 2002 Page 6
7 Experiment Setup 1. Representation (a) Weighted Hyperlinks: weights indicate degree of association (b) Directed Weighted Graph Representation of Network Structure i. Assumption is asymetric associations ii. Corresponds to word association research and WWW hyperlinks 2. Adaptation (a) Reinforcements based on Hyperlink Traversals (b) Learning Rules Operate on Hyperlink Weights 3. Interface i. Small additions to link weights for each traversal ii. Aaimed at gradual adaptation of structure (a) Weighted Ordered Ranking of Hyperlinks (b) Dynamic Re-Ordering (c) Display slices of 10 hyperlinks July 5, 2002 Page 7
8 Graph Representation Hypertext Graph Representation (Furner,1996), (Kleinberg, 1999) G V L V v 0 v 1 v n 1 L V V v i v j L represents hyperlink v i V v j V n n matrix, H, binary values h i j 0 1 hyperlink: v i v j L : h i j 1 absence hyperlink: v i v j L : h i j 0 July 5, 2002 Page 8
9 Graph Representation Assumptions 1. Hypertext Pages Graph Nodes? (a) Homogeneous content (b) Singular Concept 2. Hyperlinks Graph Edges (a) Singular anchor and target (b) Hyperlink Semantics (c) Directed Connections Simplification 1. Feature rather than bug 2. Goal is not to construct best hypertext representation but study larger class of linked information systems July 5, 2002 Page 9
10 Learning Rules G V L add weight function W, G V L W W : V 2 W v i v j User Request Sequence (RS): p RS Sub-path Triplet: p s v 0 v 1 v n 1 n v i v j v k i j k j i 1 k i 2 i n 2 July 5, 2002 Page 10
11 Learning Rules Frequency (F) or F p s W : W v i v j t W v i v j t 1 r f Symmetry (S), or S p s W : W v j v k t W v j v i t W v j v k t 1 r f W v j v i t 1 r s Transitivity (T), or T p s W : W v k v j t W v i v k t W v k v j t 1 r s W v j v k t 1 r t July 5, 2002 Page 11
12 Learning rules 1. Frequency: (a) Operates on link traversal frequency (b) Reinforce existing connections (c) Hebbian Learning 2. Transitivity: (a) Operates on transitive co-retrievals (b) Bridges plausible connections 3. Symmetry: (a) Reinforce connection symmetric to those traversed (b) Reinforce plausible connections July 5, 2002 Page 12
13 freq. freq. freq. house table stool bar Example symm. symm. symm. trans. trans. July 5, 2002 Page 13
14 house door table family stool chair bottle 0.9 board 0.8 bar 0.3 wood Example, cont. 0.6 chair 1.5 table bottle door 0.6 house bar stool 0.9 board 0.3 family 0.3 wood July 5, 2002 Page 14
15 Interface July 5, 2002 Page 15
16 Link Ranking weight rank hyperlink period moment history year change day section age situation thought man end experience week blood development action century evening light mind morning theory part movement life course attention service nature sort society music government way... GROUP 1 GROUP 2 GROUP 3 GROUP 4 Hyperlink Selection "More Items" Hyperlink Selection "More Items" Hyperlink Selection "More Items" Displaying links 1. Display shows groups of 10 links 2. Rank ordered: first slice displays 10 strongest links 3. User can shift from one group to the other July 5, 2002 Page 16
17 Adaptive Hypertext System Construction Representation learning rules Interface link order page generation path Adaptation request Users July 5, 2002 Page 17
18 Experiments 1. Open Participation (a) No Registration of Subjects (b) No Data Gathered on Characteristics Motives of Subjects 2. Instructions to Browse Experimental Hypertext System (a) Associative Browsing: Position based Hyperlink Selection (b) Request not to Apply High-level Strategies 3. Specific Choice of Node Labels (a) English nouns (b) 150 most frequent 4. Two Experiments (a) Frequency Transitivity (r f 1, r t 0 5) (b) Frequency Transitivity + Symmetry (r s 0 3) July 5, 2002 Page 18
19 Nodes act action age amount area art attention authority bed blood board body book boy building car case century change church city committee commonwealth company conference control council country course court day death development door doubt education effect end evening evidence example experience face fact family father field figure film food form friend girl government group hand head health heart history house idea industry influence interest job kind knowledge labor land language law level life light line love man market matter meeting method mind moment money morning mother movement music name nature need night number office order paper part party peace period person place point policy position power problem question rate reason research result road room school section sense service side situation society sort stage state story system table tax theory thing thought time town trade training type use value view voice water way week wife woman word work world year July 5, 2002 Page 19
20 Results 1. Participation (a) 6000 requests (b) 600 participants experiment 2. Resulting Networks: (a) Sensity (b) Weight values: x 3 07, max = 80 (c) Clustered structure (d) Wide range of generated connections July 5, 2002 Page 20
21 Network Matrix number label (1) act (2) action (3) age (4) amount (5) area (6) art (7) attention (8) authority (9) bed (10) blood Table 1: Sample of matrix representing final structure of adaptive hypertext network trained in experiment 1 July 5, 2002 Page 21
22 Centrality Un-normalized Centrality node i: n j 0 c w i j n j 0 w ji i 2 10 most central nodes: (1) label centrality knowledge experience development 261 mind education method thought system 181 body person 168 July 5, 2002 Page 22
23 action movement change influence power authority knowledge control system method experience reason development mind education thought body research Graph structure person life July 5, 2002 Page 23
24 Cluster Analysis Cluster Time Space Movement Control Cognition Intimacy Vitality Society Office Documents Age, time, century, day, evening, moment, period, week, year Place, area, point, stage Action, change, movement, road, car Authority, control, power, influence Knowledge, fact, idea, thought, interest, book, course, development, doubt, education, example, experience, language, mind, name, word, problem, question, reason, research, result, school, side, situation, story, theory, training, use, voice Love, family, house, peace, father, friend, girl, hand, body, face, head, figure, heart, church, kind, mother, woman, music, bed, wife Boy, man, wife, health Society, state, town, commonwealth Building, office, work, room July 5, 2002 Page 24
25 Hyperlink Development CONTROL Rank 1 Country Authority Authority Authority Power 2 Thing Society Rate Power Authority 3 Government Rate Society Effect Effect 4 Policy Theory Theory Knowledge Influence 5 Film Problem Problem Government Government 6 Heart Job Government Society Law 7 Group Service Job Money Knowledge 8 Room Money Money Father Society 9 Paper Knowledge Effect Law Order 10 Order Day Service Rate Money Requests: July 5, 2002 Page 25
26 Conclusion 1. Link generation: (a) Spontaneous proliferation of hyperlinks (b) Fast development of stable collection of links (c) Ranking reflects associative validity 2. Network structure: (a) Clustering of related items (b) In- and out-degree of ndoes reflects level of abstraction (c) Different from word association norms 3. Note: (a) Link generation independent from text or format (b) No requirement for explicit user preference statements (c) Distribution of link weight: few retrievals required July 5, 2002 Page 26
27 Digital Library Applications 1. Article and Journal Linking (a) Citation or IR based (b) Static (c) Non-specific (User Community) 2. Application of Adaptive HT: Off-line DL Setting (a) No intervention in actual library systems (b) Registration of User Requests: server logs (c) Reconstruction of User Paths 3. Large Number of Articles and Journals (a) Heterogeneous: Validity Atomic Concept representation? (b) Scalability? July 5, 2002 Page 27
28 Methodology 1. Reconstruction of co-retrieval events from DL Server logs (a) Use Sequence of Retrieval Requests to Derive Article and Journal Associations (b) Integrate access Patterns for Large Range of Applications, Users, etc. 2. Generate Associative Network of Journals (a) Every Article appears in a journal (b) Journal Relations subsume Article Relations 3. Use Network (a) Generate document links without resorting to text-content based IR (b) Map Preferences of LANL User Community (c) Recommendations Systems that Operate on Network Structure July 5, 2002 Page 28
29 Test Application 1. Los Alamos National Laboratory Research Library: (a) Large digital library with focus on management of own material (b) User community: focus on physics and nuclear science (c) SciSearch database: +20,000,000 articles (d) Downloadable: PDF format after search and metadata retrieval 2. Availability of server logs from 1998 to present (a) Download rates: 10,000/mo. (b) Server logs used: 2001 (c) Log file used: 5 months, 40,847 requests, 1829 journals and 20,720 articles July 5, 2002 Page 29
30 Interface July 5, 2002 Page 30
31 Interface July 5, 2002 Page 31
32 Network Generation Set of co-retrieval events: E e 1 e 2 e k Co-retrieval event e i : e i v i v j t v i v j where t v i v j represent time in seconds elapsed retrieval requests for document v i and v j e i v i v j t v i v j r, r f e i reinforcement value so that: m i j r i jm i j 0 for (i=1; i n+1; i++) e i v i v j t v i v j : m i j f e i July 5, 2002 Page 32
33 Retrieval Events D6 D5 D1 { Server Log D2 Document Network D3 D4 U100S1 U101S1 U102S1 U102S2 E(1,2) E(2,3) E(4,5) E(5,6) E(2,4) E(6,1) D1 + D2 + D3 W1,2 W2,3 D4 + D5 + D6 W4,5 W5,6 D2 + W2,4 D4 D6 + W6,1 D1 User100 Session1 User101 Session1 User102 Session1 US: User Session D: Document Retrieval E: Co retrieval Event User102 Session2 W + Link Reinforcement July 5, 2002 Page 33
34 SciSearch Retrieval Log userid data/time Document ID Service /Aug/2001:17:12: ;14;4;69 ddacfrtip SciServer /Aug/2001:17:14: ;25;4;499 prtuauma SciServer /Aug/2001:17:15: ;19;2;281 apiaaaortars SciServer /Jun/2001:12:03: ;38;3;347 otfrim SciServer /Jun/2001:12:04: ;19;3;211 aotdfmfct SciServer /Jun/2001:13:13: ;25;2;113 asrgt3tr SciServer Table 2: An example of the transformed Science Server log used to reconstruct journal and document co-retrieval events. July 5, 2002 Page 34
35 SciSearch Retrieval Log Analysis 100 ( , ) ( , ) 101 ( , ) ( , ) Table 3: Co-retrieval events derived from set of user retrieval requests in table 2. July 5, 2002 Page 35
36 Analysis application July 5, 2002 Page 36
37 Generated Network Journal ADV COLLOID INTERFAC ADVAN MATH ANALYT CHIM ANALYT BIOC ANIMAL BEHAVIOUR ANN PHYS-NEW YORK APPLIED ACOUSTICS APPL PHYS LETT ARCH RATION MECH AN ARCH BIOCHEM BIOPHYS ARTIF INTELL ASTRONOM J PUBL ASTRON SOC JPN ASTRON ASTROPHYS ASTROPHYSICAL JOURNAL July 5, 2002 Page 37
38 Generated Network 1. density 0.176% 2. mean link weight = 1.196, standard deviation=0.821 for all m i j : m i j 0 July 5, 2002 Page 38
39 Network Analysis and Comparison to Impact Factor Structural Features 1. Highest link weights 2. Graph-theoretical network characteristics compared to Impact Factor (a) Impact Factor: based on citation frquency (ratio of citations over publications for last 2 years) (b) Journal Consultation Frequency: F v i n j 1 m i j n j 1 m ji (c) Impact Factor Discrepancy Ratio: r f i JCF i IF i July 5, 2002 Page 39
40 Highest Link Weights Frequency Start node End Node 22.0 PHYSICA C PHYSICA B 18.0 PHYSICA B PHYSICA C 14.0 NUCL INSTRUM METH A IEEE T NUCL SCI 12.0 PHYSICA B J MAGN MAGN MATER 11.0 J MAGN MAGN MATER PHYSICA B Table 4: Five pairs of journals for which highest co-retrieval frequencies have been found. July 5, 2002 Page 40
41 JCF vs IF Frequency IF Journal Title PHYSICA B NUCL INSTRUM METH A APPL CATAL A-GEN MAT SCI ENG A-STRUCT J ALLOY COMPD IEEE T NUCL SCI CATAL TODAY J CATAL SURF SCI J NUCL MATER Table 5: List of 10 most frequent journals in LANL RL Science Server network and 2000 Impact Factors. July 5, 2002 Page 41
42 Scatterplot JCF vs. ISI IF (2000) 35 JCF vs. IF r s = 0.13 (p 0 05) IF(2000) JCF July 5, 2002 Page 42
43 Impact Factor Discrepancy Ratio r f JCF IF Journal Title ASTROPHYS SPACE SCI NUCL ENG DES PHYSICA B J MATER PROCESS TECH ADV SPACE RES ACTA ASTRONAUT J RADIOANAL NUCL CH PROG NUCL ENERG NUCL INSTRUM METH A MATH COMPUT SIMULAT Table 6: List of 10 journals for which highest ratio of JCF and ISI IF has been found. July 5, 2002 Page 43
44 Spreading Activation 1. Model Facilitated, Automatic Retrieval from human memory. 2. Assumes Associative Network of Concepts. 3. Concept Activation spreads through network, modulated by connection weights. July 5, 2002 Page 44
45 Spreading Activation for Information Retrieval 1. Promises (a) Text and Language Independent (b) Cue-based retrieval or retrieval by example (c) Use of Network structure: incomplete structure can still lead to complete retrieval 2. Problems (a) Establishment of Document Networks (b) Questions regarding efficiency (Salton, 1988) (c) Main Application in Concept Networks, Query Expansion July 5, 2002 Page 45
46 Spreading Activation on Journal Titles 1. Features (a) Transformation of initial keyterm query to network initial activation values (b) Recommendation Refinement by Query by example (c) Network interface (Servlet), marked-up output 2. Implementation (a) Java Servlet i. Third-party service for LANL DL ii. jbollen (b) Packed Query (c) Link to LANL RL journal pages i. Where available ii. Added link to LinkSeeker service 3. Prototype: SciS loop July 5, 2002 Page 46
47 Implementation July 5, 2002 Page 47
48 Query July 5, 2002 Page 48
49 Results July 5, 2002 Page 49
50 Query Expansion July 5, 2002 Page 50
51 Dynamic Network representation July 5, 2002 Page 51
52 Conclusion 1. What have we got? (a) Methodology to link documents across formats, content and languages. (b) Method to analyse networks and obtain information on preferences and structure of user community (c) Generation of alternatives for citation based impact factors (d) Spreading Activation Retrieval 2. Issues: (a) Applications to other information resources (b) Scalability (c) Sufficient server logs for network training (d) Hebbian learning assumption (invalid labeling, user relevance) July 5, 2002 Page 52
Lecture 9: I: Web Retrieval II: Webology. Johan Bollen Old Dominion University Department of Computer Science
Lecture 9: I: Web Retrieval II: Webology Johan Bollen Old Dominion University Department of Computer Science jbollen@cs.odu.edu http://www.cs.odu.edu/ jbollen April 10, 2003 Page 1 WWW retrieval Two approaches
More informationAdaptive Networks of Smart Objects
Adaptive Networks of Smart Objects Johan Bollen and Michael L. Nelson Computer Science Department Old Dominion University Norfolk, VA jbollen,mln @cs.odu.edu Abstract We propose the implementation of a
More informationInformation Retrieval CSCI
Information Retrieval CSCI 4141-6403 My name is Anwar Alhenshiri My email is: anwar@cs.dal.ca I prefer: aalhenshiri@gmail.com The course website is: http://web.cs.dal.ca/~anwar/ir/main.html 5/6/2012 1
More informationHow Big Data is Changing User Engagement
Strata Barcelona November 2014 How Big Data is Changing User Engagement Mounia Lalmas Yahoo Labs London mounia@acm.org This talk User engagement Definitions Characteristics Metrics Case studies: Absence
More informationToward Human-Computer Information Retrieval
Toward Human-Computer Information Retrieval Gary Marchionini University of North Carolina at Chapel Hill march@ils.unc.edu Samuel Lazerow Memorial Lecture The Information School University of Washington
More informationHigh Frequency Words. Flashcards. 1 st. McGraw-Hill Wonders
High Frequency Words Flashcards 1 st McGraw-Hill Wonders Credits KG Fonts www.kimberlygeswein.com ***IMPORTANT*** Ms. Mai Huynh is the sole creator of this product and does not claim endorsement or association
More informationEERQI Innovative Indicators and Test Results
This project is funded by the Socioeconomic Sciences and Humanities Section. EERQI Final Conference, Brussels, 15-16 March 2011 EERQI Innovative Indicators and Test Results Prof. Dr. Stefan Gradmann /
More informationIEEE Journal Titles and Reference Abbreviations Reference Abbreviation
IEEE Journal Titles and Reference Abbreviations Title Access, IEEE Aerospace and Electronic Systems, IEEE Transactions on Affective Computing, IEEE Transactions on Antennas and Propagation, IEEE Transactions
More informationChapter 6. Semi-metric Behavior in Document Networks and its Application to Recommendation Systems. Luis M. Rocha
Chapter 6 Semi-metric Behavior in Document Networks and its Application to Recommendation Systems Luis M. Rocha Recommendation systems for different Document Networks (DN) such as the World Wide Web (WWW),
More informationUser-Centered Analysis & Design
User-Centered Analysis & Design Section Topic Slides Study Guide Quick References (QR) Introduction UCA vs. UT 1-26 12 Comparing Analysis and Testing Approaches ROI 1-29 7 Formulas for Calculating ROI
More informationAdaptable and Adaptive Web Information Systems. Lecture 1: Introduction
Adaptable and Adaptive Web Information Systems School of Computer Science and Information Systems Birkbeck College University of London Lecture 1: Introduction George Magoulas gmagoulas@dcs.bbk.ac.uk October
More informationUNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai.
UNIT-V WEB MINING 1 Mining the World-Wide Web 2 What is Web Mining? Discovering useful information from the World-Wide Web and its usage patterns. 3 Web search engines Index-based: search the Web, index
More informationKnowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.
Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European
More informationBasic Network Concepts
Basic Network Concepts Basic Vocabulary Alice Graph Network Edges Links Nodes Vertices Chuck Bob Edges Alice Chuck Bob Edge Weights Alice Chuck Bob Apollo 13 Movie Network Main Actors in Apollo 13 the
More informationScopus. Quick Reference Guide
Scopus Quick Reference Guide Quick Reference Guide An eye on global research. Scopus is the largest abstract and citation database of peer-reviewed literature, with bibliometrics tools to track, analyze
More informationOverview of Web Mining Techniques and its Application towards Web
Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous
More informationWEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS
1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,
More informationTechnical Group Health and Health Interview Survey (HIS) Statistics
EUROPEAN COMMISSION EUROSTAT Directorate F: Social Statistics and Information Society Unit F-5: Health and food safety statistics Technical Group Health and Health Interview Survey (HIS) Statistics Luxembourg,
More informationHuman-Computer Information Retrieval
Human-Computer Information Retrieval Gary Marchionini University of North Carolina at Chapel Hill march@ils.unc.edu CSAIL MIT November 12, 2004 Message IR and HCI are related fields that have strong (staid?)
More informationThis release of Journal Citation Reports provides 2016 data. The 2017 data will be made available in the 2018 Journal Citation Reports release.
What's New in Journal Citation Reports? September 2014 Training Videos Journal Citation Reports on InCites training videos help explain the data, features and workflows of the system. Videos and registration
More informationVannevar Bush. Information Retrieval. Prophetic: Hypertext. Historic Vision 2/8/17
Information Retrieval Vannevar Bush Director of the Office of Scientific Research and Development (1941-1947) Vannevar Bush,1890-1974 End of WW2 - what next big challenge for scientists? 1 Historic Vision
More informationA quick guide to getting the most from your Idox Information Service membership
A quick guide to getting the most from your Idox Information Service membership The Idox Information Service The Idox Information Service is the most complete and accessible source of public sector knowledge
More informationJournal Status. February 1, 2008
Journal Status Johan Bollen, Marko A. Rodriguez, and Herbert Van de Sompel arxiv:cs/0601030v1 [cs.dl] 9 Jan 2006 February 1, 2008 Digital Library Research & Prototyping Team, Research Library, Los Alamos
More informationOn Complex Dynamical Networks. G. Ron Chen Centre for Chaos Control and Synchronization City University of Hong Kong
On Complex Dynamical Networks G. Ron Chen Centre for Chaos Control and Synchronization City University of Hong Kong 1 Complex Networks: Some Typical Examples 2 Complex Network Example: Internet (William
More informationImplementing Games User Research Processes Throughout Development: Beyond Playtesting
Implementing Games User Research Processes Throughout Development: Beyond Playtesting Graham McAllister Founder, Player Research @grmcall Introduction Founder - Player Research, a User Research studio
More informationSocial Information Filtering
Social Information Filtering Tersia Gowases, Student No: 165531 June 21, 2006 1 Introduction In today s society an individual has access to large quantities of data there is literally information about
More informationLearn SEO Copywriting
This is video 1.1 in the online course: Learn SEO Copywriting Module 1: An introduction to SEO copywriting What we ll cover in this session What is SEO? What is SEO copywriting? How does Google work Factors
More informationCS 533 Information Retrieval Systems
CS 533 Information Retrieval Systems Prepared by: Mehmet Ali Abbasoğlu Week No 1 Users query Information Retrieval Systems (IRS) to get their needed relevant documents Information Retrieval Systems are
More informationDATA MINING II - 1DL460. Spring 2014"
DATA MINING II - 1DL460 Spring 2014" A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt14 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationChapter 3: Google Penguin, Panda, & Hummingbird
Chapter 3: Google Penguin, Panda, & Hummingbird Search engine algorithms are based on a simple premise: searchers want an answer to their queries. For any search, there are hundreds or thousands of sites
More informationTaylor & Francis E-Books
Taylor & Francis E-Books (http://inflibnet.etailer.dpsl.net)* Taylor & Francis is one of the leading global academic publishers and part of the Academic Division of Informa plc, which is the leading provider
More informationDeep Web Crawling and Mining for Building Advanced Search Application
Deep Web Crawling and Mining for Building Advanced Search Application Zhigang Hua, Dan Hou, Yu Liu, Xin Sun, Yanbing Yu {hua, houdan, yuliu, xinsun, yyu}@cc.gatech.edu College of computing, Georgia Tech
More information*Note: To find a complete list of sources, click on the List of Sources link in the top portion of every Biography Resource Center search screen.
Biography Resource Center Navigation Guide OVERVIEW The Biography Resource Center (BioRC) is a comprehensive database of biographical information on over 380,000 people from throughout history, around
More informationA TEXT MINER ANALYSIS TO COMPARE INTERNET AND MEDLINE INFORMATION ABOUT ALLERGY MEDICATIONS Chakib Battioui, University of Louisville, Louisville, KY
Paper # DM08 A TEXT MINER ANALYSIS TO COMPARE INTERNET AND MEDLINE INFORMATION ABOUT ALLERGY MEDICATIONS Chakib Battioui, University of Louisville, Louisville, KY ABSTRACT Recently, the internet has become
More informationNavigating the RTL to System Continuum
Navigating the RTL to System Continuum Calypto Design Systems, Inc. www.calypto.com Copyright 2005 Calypto Design Systems, Inc. - 1 - The rapidly evolving semiconductor industry has always relied on innovation
More informationIntroduction & Administrivia
Introduction & Administrivia Information Retrieval Evangelos Kanoulas ekanoulas@uva.nl Section 1: Unstructured data Sec. 8.1 2 Big Data Growth of global data volume data everywhere! Web data: observation,
More informationPageRank for Product Image Search. Research Paper By: Shumeet Baluja, Yushi Jing
PageRank for Product Image Search Research Paper By: Shumeet Baluja, Yushi Jing Topics Motivation What is PageRank? ImageRank Algorithm Features generation & Similarity measure Concept of Centrality PageRank
More informationUnderstanding the use of Temporal Expressions on Persian Web Search
Understanding the use of Temporal Expressions on Persian Web Search Behrooz Mansouri Mohammad Zahedi Ricardo Campos Mojgan Farhoodi Alireza Yari Ricardo Campos TempWeb 2018 @ WWW Lyon, France, Apr 23,
More informationBeyond Content-Based Retrieval:
Beyond Content-Based Retrieval: Modeling Domains, Users, and Interaction Susan Dumais Microsoft Research http://research.microsoft.com/~sdumais IEEE ADL 99 - May 21, 1998 Research in IR at MS Microsoft
More informationIntelligent management of on-line video learning resources supported by Web-mining technology based on the practical application of VOD
World Transactions on Engineering and Technology Education Vol.13, No.3, 2015 2015 WIETE Intelligent management of on-line video learning resources supported by Web-mining technology based on the practical
More informationInternational Journal of Scientific & Engineering Research Volume 2, Issue 12, December ISSN Web Search Engine
International Journal of Scientific & Engineering Research Volume 2, Issue 12, December-2011 1 Web Search Engine G.Hanumantha Rao*, G.NarenderΨ, B.Srinivasa Rao+, M.Srilatha* Abstract This paper explains
More informationWelcome to CS120 Fall 2012
Welcome to CS120 Fall 2012 John Magee (jmagee@clarku.edu) 1 Welcome to CS120 Computing is ubiquitous Daily life, news, ecommerce Sciences and engineering fields Social sciences, humanity, Arts, music,
More information±» ²¼ ª² ª³ ±» ² ª±² ( 80 ±«) ± ³» ³ ª² ; ±² ²¼ ³² ³±² ± ¹ ³ ³ ³ ³ ² «³ ± ««¾±² - ± ³«³»µ «; ² ª±² ( ±², ±, ±±³ ), ± ±² µ ±² ³ª²³».» µ³ ±²» ² ª±² ( «¾
Ž ˆŸ Š ˆŸŒ Ž Ž ¹µ±, ±²³ ¾¹ 5- ª«±± ³ ² ² ª ±²¼ ; \ ¹ " ±²» ² ±ª ² ±ª» ³»µ ±² ±³¹ ±² ² «¼ ( ² ; ³ «±²¼{ ³ «±²¼, : ±«, ), ² ±ª«; «² «¼ ( ª -, ;, ±«, ); ±«² «¼ ( ±««ª «±² ² ; ² ); ±² ( «; «, ±«,, ); ««(,
More informationUser Interfaces for Information Retrieval on the WWW
User Interfaces for Information Retrieval on the WWW Gary Marchionini University of North Carolina at Chapel Hill march@ils.unc.edu INFORUM 2005 Prague May 24-27, 2005 Message On the WWW, the User Interface
More informationTime based search call logs using ESM
Anil Kumar Dasari, Ajay Bhaskar Dasari, K.Pradeep / International Journal of Engineering Time based search call logs using ESM Anil Kumar Dasari 1, Ajay Bhaskar Dasari 2 and K.Pradeep 3 1 Nova College
More informationEnglish 120 INFORMATION SEARCH PROCESS
English 120 INFORMATION SEARCH PROCESS Search Process Credo Reference Book Resources (searching LIBROS-WorldCat Discovery) Periodical Articles (searching online databases) Films on Demand World Wide Web
More informationAn Introduction to Search Engines and Web Navigation
An Introduction to Search Engines and Web Navigation MARK LEVENE ADDISON-WESLEY Ал imprint of Pearson Education Harlow, England London New York Boston San Francisco Toronto Sydney Tokyo Singapore Hong
More informationQuick Reference Guide
Quick Reference Guide www.scopus.com Scopus is the largest abstract and citation database of peer-reviewed literature with bibliometrics tools to track, analyze and visualize research. It contains over,000
More informationDIGITAL APPRENTICESHIPS
DIGITAL Networking, Infrastructure Technician, Unified Communication Technician and Software & Web Development Technician - Advanced and Higher Apprenticeships WHY CHOOSE US? OVERVIEW OF PROGRAMMES ESTABLISHED
More informationCase-based Recommendation. Peter Brusilovsky with slides of Danielle Lee
Case-based Recommendation Peter Brusilovsky with slides of Danielle Lee Where we are? Search Navigation Recommendation Content-based Semantics / Metadata Social Modern E-Commerce Site The Power of Metadata
More informationOpen-Corpus Adaptive Hypermedia. Adaptive Hypermedia
Open-Corpus Adaptive Hypermedia Peter Brusilovsky School of Information Sciences University of Pittsburgh, USA http://www.sis.pitt.edu/~peterb Adaptive Hypermedia Hypermedia systems = Pages + Links Adaptive
More informationA Parallel Computing Architecture for Information Processing Over the Internet
A Parallel Computing Architecture for Information Processing Over the Internet Wendy A. Lawrence-Fowler, Xiannong Meng, Richard H. Fowler, Zhixiang Chen Department of Computer Science, University of Texas
More information2005 University of California Undergraduate Experience Survey
2005 University of California Undergraduate Experience Survey This year's survey has three parts: I. Time use and overall satisfaction II. Your background III. Rotating modules PART I: TIME USE and OVERALL
More informationWeb-Based Electronic Death Registration (EDR) Texas Department of Health
Web-Based Electronic Death Registration (EDR) TX EDR THIN CLIENT Trainers Guide DEMOGRAPHIC SECTION Prepared for: Texas Department of Health December 03, 2005 Version 1.0 Prepared by: The Industry Standard
More information5 Choosing keywords Initially choosing keywords Frequent and rare keywords Evaluating the competition rates of search
Seo tutorial Seo tutorial Introduction to seo... 4 1. General seo information... 5 1.1 History of search engines... 5 1.2 Common search engine principles... 6 2. Internal ranking factors... 8 2.1 Web page
More informationDigital Forensic Text String Searching: Improving Information Retrieval Effectiveness by Thematically Clustering Search Results
Digital Forensic Text String Searching: Improving Information Retrieval Effectiveness by Thematically Clustering Search Results DFRWS 2007 Department of Information Systems & Technology Management The
More informationCyber Security School
Cyber Cyber Security School FUTURE PROOF Y SECURITY TALENT "The UK needs to tackle the systemic issues at the heart of the Cyber skills shortage..." National Cyber Security Strategy 2016-2021, HM Government
More informationThis is an author produced version of Mobile devices in palliative care services: a methodological approach to identifying use and implementation.
This is an author produced version of Mobile devices in palliative care services: a methodological approach to identifying use and implementation. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/110720/
More informationEVALUATION OF SEARCHER PERFORMANCE IN DIGITAL LIBRARIES
DEFINING SEARCH SUCCESS: EVALUATION OF SEARCHER PERFORMANCE IN DIGITAL LIBRARIES by Barbara M. Wildemuth Associate Professor, School of Information and Library Science University of North Carolina at Chapel
More informationQuick Reference Guide
The Cochrane Library on Wiley InterScience Quick Reference Guide Go to www.thecochranelibrary.com to discover the best single source of reliable evidence for healthcare decision-making Go to www.thecochranelibrary.com
More informationArchitectural Technology
Architectural Technology The Architectural Technology curriculum prepares individuals to apply technical knowledge and skills to the fields of architecture, construction, construction management, facility
More informationCABI Training Materials. Ovid Silver Platter (SP) platform. Simple Searching of CAB Abstracts and Global Health KNOWLEDGE FOR LIFE.
CABI Training Materials Ovid Silver Platter (SP) platform Simple Searching of CAB Abstracts and Global Health www.cabi.org KNOWLEDGE FOR LIFE Contents The OvidSP Database Selection Screen... 3 The Ovid
More informationINNOVATIONS CENTER. Fariz T. JAFAROV Director. e-gov Development Center of Azerbaijan
INNOVATIONS CENTER Fariz T. JAFAROV Director e-gov Development Center of Azerbaijan 1 Contextual environment 2 Billions GDP Growth in Azerbaijan four periods of development 100 80 60 40 20 0 Survival Explosive
More informationDevelopment of Search Engines using Lucene: An Experience
Available online at www.sciencedirect.com Procedia Social and Behavioral Sciences 18 (2011) 282 286 Kongres Pengajaran dan Pembelajaran UKM, 2010 Development of Search Engines using Lucene: An Experience
More informationThe Rise of the (Modelling) Bots: Towards Assisted Modelling via Social Networks
The Rise of the (Modelling) Bots: Towards Assisted Modelling via Social Networks Sara Perez-Soler, Esther Guerra, Juan de Lara, Francisco Jurado 2017 Presented by Laura Walsh 1 Overview 1. Background &
More informationWEB PAGE RE-RANKING TECHNIQUE IN SEARCH ENGINE
WEB PAGE RE-RANKING TECHNIQUE IN SEARCH ENGINE Ms.S.Muthukakshmi 1, R. Surya 2, M. Umira Taj 3 Assistant Professor, Department of Information Technology, Sri Krishna College of Technology, Kovaipudur,
More informationOpen-Corpus Adaptive Hypermedia. Peter Brusilovsky School of Information Sciences University of Pittsburgh, USA
Open-Corpus Adaptive Hypermedia Peter Brusilovsky School of Information Sciences University of Pittsburgh, USA http://www.sis.pitt.edu/~peterb Adaptive Hypermedia Hypermedia systems = Pages + Links Adaptive
More informationShape Classification and Cell Movement in 3D Matrix Tutorial (Part I)
Shape Classification and Cell Movement in 3D Matrix Tutorial (Part I) Fred Park UCI icamp 2011 Outline 1. Motivation and Shape Definition 2. Shape Descriptors 3. Classification 4. Applications: Shape Matching,
More informationIntroduction to The Cochrane Library
Introduction to The Cochrane Library What is The Cochrane Library? A collection of databases that include high quality, independent, reliable evidence from Cochrane and other systematic reviews, clinical
More informationEnhanced Web Log Based Recommendation by Personalized Retrieval
Enhanced Web Log Based Recommendation by Personalized Retrieval Xueping Peng FACULTY OF ENGINEERING AND INFORMATION TECHNOLOGY UNIVERSITY OF TECHNOLOGY, SYDNEY A thesis submitted for the degree of Doctor
More informationKDD 10 Tutorial: Recommender Problems for Web Applications. Deepak Agarwal and Bee-Chung Chen Yahoo! Research
KDD 10 Tutorial: Recommender Problems for Web Applications Deepak Agarwal and Bee-Chung Chen Yahoo! Research Agenda Focus: Recommender problems for dynamic, time-sensitive applications Content Optimization
More informationJMP and SAS : One Completes The Other! Philip Brown, Predictum Inc, Potomac, MD! Wayne Levin, Predictum Inc, Toronto, ON!
Paper JM08-2014 JMP and SAS : One Completes The Other Philip Brown, Predictum Inc, Potomac, MD Wayne Levin, Predictum Inc, Toronto, ON ABSTRACT Integrating JMP with SAS creates a formidable data management
More informationTable Of Contents. Getting Started Related Topics... 10
ScienceDirect Help Table Of Contents Getting Started... 1 Related Topics... 1 Home Page Overview... 3 ScienceDirect Home Page... 3 Navigation Bar... 3 Related Topics... 4 Browser Requirements and Preferences...
More informationBrief (non-technical) history
Web Data Management Part 2 Advanced Topics in Database Management (INFSCI 2711) Textbooks: Database System Concepts - 2010 Introduction to Information Retrieval - 2008 Vladimir Zadorozhny, DINS, SCI, University
More informationInformation Retrieval. Lecture 4: Search engines and linkage algorithms
Information Retrieval Lecture 4: Search engines and linkage algorithms Computer Science Tripos Part II Simone Teufel Natural Language and Information Processing (NLIP) Group sht25@cl.cam.ac.uk Today 2
More informationOnly the original curriculum in Danish language has legal validity in matters of discrepancy
CURRICULUM Only the original curriculum in Danish language has legal validity in matters of discrepancy CURRICULUM OF 1 SEPTEMBER 2007 FOR THE BACHELOR OF ARTS IN INTERNATIONAL BUSINESS COMMUNICATION (BA
More informationWorkshop: Modeling Scenarios Using Data. Fiona Charles BCS SIGIST 22-Sept-2009
Workshop: Modeling Scenarios Using Fiona Charles BCS SIGIST 22-Sept-2009 Who I Am: Fiona Charles - Test Consultant and Test Manager, President and Principal Consultant of Quality Intelligence, Inc. I teach
More informationDesign and Implementation of Movie Recommendation System Based on Knn Collaborative Filtering Algorithm
Design and Implementation of Movie Recommendation System Based on Knn Collaborative Filtering Algorithm Bei-Bei CUI School of Software Engineering, Beijing University of Technology, Beijing, China e-mail:
More informationThe Role of Information Scent in On-line Browsing:
The Role of Information Scent in On-line Browsing: Extensions of the ACT-R Utility and Concept Formation Mechanisms Peter Pirolli User Interface Research Area Supported in part by Office of Naval Research
More informationInformation Retrieval
Introduction Information Retrieval Information retrieval is a field concerned with the structure, analysis, organization, storage, searching and retrieval of information Gerard Salton, 1968 J. Pei: Information
More informationUnion of International Associations (UIA)
Union of International Associations (UIA) Brussels http://www.uia.org/ East-West Collaboration in the Development of Interactive Media Products (Hungary, May 1999) Anthony Judge, Director, Communications
More informationContent Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered.
Content Enrichment An essential strategic capability for every publisher Enriched content. Delivered. An essential strategic capability for every publisher Overview Content is at the centre of everything
More informationSIRS Issues Researcher
From the main screen of SIRS, click on the SIRS Issues Researcher link. 1 This tutorial will provide an overview of the following features available through SIRS Issues Researcher: 2. Search Tabs 3. Reference
More informationInformation Retrieval and Web Search
Information Retrieval and Web Search Link analysis Instructor: Rada Mihalcea (Note: This slide set was adapted from an IR course taught by Prof. Chris Manning at Stanford U.) The Web as a Directed Graph
More informationDesigning and Building an Automatic Information Retrieval System for Handling the Arabic Data
American Journal of Applied Sciences (): -, ISSN -99 Science Publications Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data Ibrahiem M.M. El Emary and Ja'far
More informationTeacher Website Training Chapin High School October 10, D5 Guidelines
Teacher Website Training Chapin High School October 10, 2011 D5 Guidelines The purpose of District sponsored web sites is twofold: to make parents, students, and the community aware of school, classroom,
More informationGVU s WWW User Surveys
GVU s WWW User Surveys Jim Pitkow & Colleen Kehoe Georgia Tech - CoC - GVU Special thanks to: CNS, Randy Carpenter, Kipp Jones, Mike Mealling, Dr. James Foley, GVU, Sun Microsystems GVU s WWW User Surveys
More informationSEMANTIC NETWORK AND SEARCH IN VEHICLE ENGINEERING
Martin Sturm, Sylke Rosenplaenter SEMANTIC NETWORK AND SEARCH IN VEHICLE ENGINEERING From Concept to Deployment Vehicle Design Operations & System Development GM Europe Engineering Adam Opel AG www.opel.com
More informationOptimizing Search Engines using Click-through Data
Optimizing Search Engines using Click-through Data By Sameep - 100050003 Rahee - 100050028 Anil - 100050082 1 Overview Web Search Engines : Creating a good information retrieval system Previous Approaches
More informationAdvanced Computer Graphics CS 525M: Understanding Mobile Web. Today s Dynamic Mobile Landscape
Advanced Computer Graphics CS 525M: Understanding Mobile Web and Mobile Search Use in Today s Dynamic Mobile Landscape Shary J. Llanos Antonio Computer Science Dept. Worcester Polytechnic Institute (WPI)
More informationInternet Usage Transaction Log Studies: The Next Generation
Internet Usage Transaction Log Studies: The Next Generation Sponsored by SIG USE Dietmar Wolfram, Moderator. School of Information Studies, University of Wisconsin-Milwaukee Milwaukee, WI 53201. dwolfram@uwm.edu
More informationOutline. Lecture 3: EITN01 Web Intelligence and Information Retrieval. Query languages - aspects. Previous lecture. Anders Ardö.
Outline Lecture 3: EITN01 Web Intelligence and Information Retrieval Anders Ardö EIT Electrical and Information Technology, Lund University February 5, 2013 A. Ardö, EIT Lecture 3: EITN01 Web Intelligence
More informationCooperative Games. Lecture 1: Introduction. Stéphane Airiau. ILLC - University of Amsterdam
Cooperative Games Lecture 1: Introduction Stéphane Airiau ILLC - University of Amsterdam Stéphane Airiau (ILLC) - Cooperative Games Lecture 1: Introduction 1 Why study coalitional games? Cooperative games
More informationTexting and driving. Texting and Driving Report Angel Reyes
Texting and Driving Report Angel Reyes Abstract Texting and Driving is a big issue we face today as a community killing many individuals and injuring even more. From teenagers to mid aged adults this a
More informationIT CUTS COSTS FOR US AND OUR CLIENTS LIKE NEVER BEFORE.
IT CUTS COSTS FOR US AND OUR CLIENTS LIKE NEVER BEFORE. KATHRYN CONDE PARTNER, NUTTER McCLENNEN & FISH LLP BOSTON Modern law firms and law departments are using modern technology to deliver better legal
More informationCOMP5331: Knowledge Discovery and Data Mining
COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani and Terry Winograd, Jon M. Kleinberg 1 1 PageRank
More informationArmtec Website Flowchart
Armtec Website Flowchart INTRODUCTION: The following document details the page layout and functionality for your web site project. Your website is displayed in a flowchart format, with each web page represented
More informationMobile Phones, Household Welfare and Women s Empowerment: Evidence from Rural Off-grid Regions of Bangladesh
Mobile Phones, Household Welfare and Women s Empowerment: Evidence from Rural Off-grid Regions of Bangladesh Monzur Hossain, BIDS Hussain Samad, World Bank Presented in BIDS Research Almanac, December
More informationCrossing the Archival Borders
IST-Africa 2008 Conference Proceedings Paul Cunningham and Miriam Cunningham (Eds) IIMC International Information Management Corporation, 2008 ISBN: 978-1-905824-07-6 Crossing the Archival Borders Fredrik
More informationResearch in the Real World
Research in the Real World Jane Larrington, Law Reference Librarian x4766 or jlarrington@sandiego.edu Melissa Fung, Foreign & Intn l Materials Law Librarian x4734 or mfung@sandiego.edu Judith Lihosit,
More information