The Journal of MacroTrends in Technology and Innovation
Enhancement of Indexing for Social Knowledge-Sharing Community

Pijitra Jomsri
Department of Information Technology, Suan Sunandha Rajabhat University, Bangkok, Thailand

Abstract

Internet technology provides an efficient way to store and share information. Search engines and social bookmarking systems are important tools for discovering web resources. This research investigated two indexing approaches applied to Diigo, a social bookmarking system for knowledge sharing: Tag Only and Tag with Title. The two approaches were evaluated using mean values of Normalized Discounted Cumulative Gain (NDCG). The results suggest that indexing with Tag and Title performed better. This initial evaluation implies that the design might improve the accuracy and efficiency of web resource searching on social bookmarking systems, and that the technique could be applied in other domains.

Keywords: social bookmarking; indexing; knowledge-sharing community

1. Introduction

The number of people using the internet to exchange information is increasing. A search engine is therefore an important tool that helps users find documents on the internet. A social bookmarking system is also an important tool, because it allows people to share interesting web resources. It not only provides web resource sharing functions but also allows people to attach a set of tags to each web resource. Diigo is a social bookmarking system and a multi-tool for personal knowledge management, intended to improve workflow and productivity; it is easy and intuitive, yet versatile and powerful. The name Diigo is an abbreviation of "Digest of Internet Information, Groups and Other stuff". Diigo provides a social annotation service: one can highlight text passages and add notes on any web page that one is reading at any time.
On a web page, one can not only read public comments published by others but also carry out discussions and interact with others. Diigo is not only a powerful personal tool and social sharing platform for knowledge workers; as it develops, the whole Web can become a writable, participatory, and interactive medium. While the primary goal of these applications is to serve the needs of individual users, the tags of each web resource, which link it to the knowledge-sharing community, should also help other users to categorize, browse, and find items. The tags can also be used for information discovery, sharing, and community ranking, and they can be useful for tasks such as search, navigation, or information extraction. It is therefore interesting to investigate how well the set of tags attached to a knowledge-sharing link on Diigo contributes to search results. In this research, social tagging was investigated as a way to improve knowledge-sharing indexing, and an indexing method that uses tagging information together with the title of the knowledge item was proposed. We refer to it as the Tag with Title (TT) indexing method. To evaluate the proposed method, it was compared with an indexing method that uses tagging information only, the Tag Only (T) indexing method.

The paper is structured as follows. Section 2 discusses related work. Section 3 describes the framework for social tagging based knowledge-sharing searching. Sections 4 and 5 present the experimental setting and the results and discussion. Finally, Section 6 contains the conclusion and future work.

2. Related Work

Several researchers have studied Diigo. Peng et al. analyzed the functions and features of Diigo in relation to collaborative learning and designed a collaborative learning model for the Diigo environment (Peng et al., 2010). Chen et al. explored whether a Web 2.0-based note-sharing cooperative learning method, using Diigo and Google Docs as tools, worked more effectively than classroom cooperative learning with the Student Teams Achievement Division (STAD) method and than traditional lectures in teaching Chinese rhetoric comprehension (Chen et al., 2011).
Khoii et al. investigated the impact of learning with Schoology (the LMS selected for their study) on learners' autonomy and use of reading strategies while incorporating Diigo, a social bookmarking website (Khoii et al.).

Other researchers have studied and improved social tagging. Suchanek et al. found that tags are meaningful and that the tagging process is influenced by tag suggestions (Suchanek et al., 2008), while Thom-Santelli et al. explored the use of tags for communication in social tagging systems (Thom-Santelli et al., 2008). Gelernter compared the information retrieval value of cloud-format tags and of the tag words themselves as found in the LibraryThing catalog; the results also show that, whether searchers are working toward research or personal ends, high recall matters (Gelernter, 2007). Budura et al. presented HAMLET to promote efficient and precise reuse of shared metadata in highly dynamic environments where tags are scarce (Budura et al., 2008). Gelernter (2009) offered a method of evaluating user tag preference and the relative strength of social tag versus LCSH string retrieval performance. Choochaiwattana and Spring (2009) examined the use of social annotations to improve the quality of web searches. Li and Zhu (2008) used the self-organizing characteristics of SOM neural networks to classify the popular tags on the "Del.icio.us" website. Jomsri (2009, 2015) investigated three different indexing approaches applied to CiteULike. The preliminary results illustrated that indexing using Tag, Title, and Abstract performed the best.
3. Framework for Social Tagging Based Knowledge-Sharing Searching

This section describes the experimental design and the evaluation method. The experiment was divided into five steps, as shown in Fig. 1.

Fig. 1. Framework for Social Tagging Based Knowledge-Sharing Searching.

A. Research Methods

1) Crawler: A knowledge-sharing crawler is a small computer program that browses the knowledge-sharing systems of the WWW in a predetermined manner. The crawler is responsible for gathering knowledge-sharing information such as the author and the tags used. This information helps the system to determine a user's interests and to create an index for each knowledge-sharing item. The crawler in this framework was implemented in Java.

2) Knowledge corpus: The corpus is a collection of knowledge-sharing items extracted from the knowledge-sharing system. The data were crawled from Diigo between January and September. The final set consisted of 32,450 records related to computer science.

3) Indexer: TF-IDF (term frequency, inverse document frequency) was used to create the indices. TF-IDF is a weight often used in information retrieval and text mining. It is a statistical measure of how important a word is to a document in a collection or corpus. The importance increases proportionally to the number of times a word appears in the document but is offset by the frequency of the word in the corpus. In this experiment, two different indexers were developed. Equations (1) and (2) show the modified TF-IDF formula for the two indexers, where T is Tag Only and TT is Tag with Title:

$$ \mathrm{tfidf}_{i,j} = \frac{n_{i,j}}{\sum_{k} n_{k,j}} \times \log \frac{|T|}{|\{d : t_i \in d\}|} \qquad (1) $$

$$ \mathrm{tfidf}_{i,j} = \frac{n_{i,j}}{\sum_{k} n_{k,j}} \times \log \frac{|TT|}{|\{d : t_i \in d\}|} \qquad (2) $$

Let $n_{i,j}$ be the number of occurrences of the considered term $t_i$ in document $d_j$, so that $\sum_{k} n_{k,j}$ is the total number of term occurrences in $d_j$. $|T|$ is the total number of Tag Only documents in the corpus, $|TT|$ is the total number of Tag with Title documents in the corpus, and $|\{d : t_i \in d\}|$ is the number of documents in which the term $t_i$ appears (that is, $n_{i,j} \neq 0$). If the term is not in the corpus, this leads to a division by zero. It is therefore common to use $1 + |\{d : t_i \in d\}|$ in the denominator.

4) Search Function: Cosine similarity is a similarity measurement between two vectors of n dimensions. The idea is to find the cosine of the angle between the two vectors. This measurement is often used to compare documents in text mining. Given two vectors of attributes, A and B, the cosine similarity, $\cos(\theta)$, is the dot product of the vectors divided by the product of their magnitudes, as in Equation (3):

$$ \mathrm{similarity} = \cos(\theta) = \frac{A \cdot B}{\|A\| \, \|B\|} \qquad (3) $$

5) Ranking: The similarity score is used as the ranking mechanism.

Knowledge-sharing Searching

Two search engines based on the two indexers were developed. For each result, subjects can see the title ID of the document and the title name, which links to the corresponding data on Diigo.

4. Experimental Setting

Thirty subjects, who were lecturers and students from Suan Sunandha Rajabhat University, were asked to participate. In the experiments, each subject was assigned to find knowledge using our search engines. Each subject was given two questions and formulated their own queries according to the given questions. They were asked to use the same query for each search engine. They were then asked to rate the relevancy of the search result set on a five-point scale: Score 0 is not relevant at all, Score 1 is probably not relevant, Score 2 is less relevant, Score 3 is probably relevant, and Score 4 is extremely relevant. The top 20 search results of each search engine were displayed for relevancy judgment, and the relevancy ratings given for each query were considered to be perfect.
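The indexing, search, and ranking steps of Section 3 can be illustrated with a small sketch. This is not the authors' implementation (their crawler was written in Java and their indexer code is not given); it is a minimal Python illustration of Equations (1) to (3) under stated assumptions. The bookmark records, the whitespace tokenization, and the function names `build_index`, `vectorize_query`, `cosine`, and `search` are all hypothetical. The sketch uses the unsmoothed document frequency, which is safe here because weights are computed only for terms that occur in the corpus.

```python
import math
from collections import Counter

def build_index(docs):
    """Build TF-IDF vectors as in Eq. (1)/(2); docs is a list of token lists."""
    n_docs = len(docs)                      # |T| or |TT|, depending on the corpus
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))              # |{d : t_i in d}| for each term
    idf = {t: math.log(n_docs / df[t]) for t in df}
    vectors = []
    for tokens in docs:
        counts = Counter(tokens)            # n_{i,j}
        total = sum(counts.values())        # sum_k n_{k,j}
        vectors.append({t: (c / total) * idf[t] for t, c in counts.items()})
    return vectors, idf

def vectorize_query(tokens, idf):
    """Weight a query with the same scheme; terms unknown to the index are dropped."""
    counts = Counter(t for t in tokens if t in idf)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {t: (c / total) * idf[t] for t, c in counts.items()}

def cosine(a, b):
    """Cosine similarity of two sparse vectors, Eq. (3)."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def search(query_tokens, vectors, idf, top_k=20):
    """Rank documents by cosine similarity; return (doc index, score) pairs."""
    q = vectorize_query(query_tokens, idf)
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(i, s) for s, i in scored[:top_k] if s > 0]

# Hypothetical bookmark records, for illustration only.
bookmarks = [
    {"title": "Introduction to Python programming", "tags": ["python", "tutorial"]},
    {"title": "Java crawler design", "tags": ["java", "crawler"]},
    {"title": "Cooking pasta at home", "tags": ["food", "recipe"]},
]

# Tag Only (T) documents versus Tag with Title (TT) documents.
t_docs = [b["tags"] for b in bookmarks]
tt_docs = [b["tags"] + b["title"].lower().split() for b in bookmarks]

t_vectors, t_idf = build_index(t_docs)
tt_vectors, tt_idf = build_index(tt_docs)

query = ["programming"]
t_results = search(query, t_vectors, t_idf)
tt_results = search(query, tt_vectors, tt_idf)
print("T :", t_results)
print("TT:", tt_results)
```

In this toy corpus the query term occurs only in a title, so the T index returns nothing while the TT index retrieves the first bookmark. This shows one way in which title terms can extend what a tag-only index can match; it says nothing about the size of the effect on real Diigo data.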
The evaluation metric is NDCG (Normalized Discounted Cumulative Gain), as originally proposed by Järvelin and Kekäläinen (Jäschke et al., 2007).
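For readers unfamiliar with the metric, the following sketch shows how NDCG at rank k can be computed from the five-point relevancy ratings. The paper does not state which NDCG variant was used, so the sketch assumes the common form with a log2(i + 1) discount and an ideal ordering obtained by sorting the same ratings in descending order. The function names and the example ratings are hypothetical.

```python
import math

def dcg(ratings, k):
    """Discounted cumulative gain over the first k ratings (ranks start at 1)."""
    return sum(r / math.log2(rank + 2) for rank, r in enumerate(ratings[:k]))

def ndcg(ratings, k):
    """DCG normalized by the DCG of the ideal (descending) ordering."""
    ideal = dcg(sorted(ratings, reverse=True), k)
    return dcg(ratings, k) / ideal if ideal > 0 else 0.0

def mean_ndcg(rating_lists, k):
    """Average NDCG@k over several queries, as plotted per rank in the results."""
    return sum(ndcg(r, k) for r in rating_lists) / len(rating_lists)

# Hypothetical ratings (0-4) for the top five results of two queries.
example = [[4, 3, 2, 1, 0], [0, 4, 0, 0, 0]]
print(mean_ndcg(example, 5))
```

A result list that is already in descending order of relevancy scores 1.0, and moving a highly rated document down the list lowers the score, which is why the metric rewards engines that place relevant items near the top.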
5. Experimental Result

This section is separated into two parts: the results of the experiment, followed by the discussion.

Results

The average NDCG scores for the first 20 ranks of the Tag Only (T) and Tag with Title (TT) indexing methods are shown in Fig. 2. The x-axis represents the first 20 documents of the search results, whereas the y-axis denotes the NDCG score. The figure suggests that the Tag with Title indexing method outperforms the Tag Only method.

Fig. 2. Comparison of the average NDCG for two indexing methods.

Furthermore, a paired-sample t-test was employed for the top 20 ranks. The samples are assumed to come from populations that are approximately normal with equal variances. The level of significance was set to 0.05 (α = 0.05). The paired differences were used to find the differences between the two ranking methods. The results in Table I indicate that, for ranks k = 1-20, the set of search results provided by the Tag with Title (TT) indexing method is statistically different from the set of search results provided by the Tag Only approach.

Table I. Result of Decision Tree

Rank (K) | Indexing (I) | Indexing (J) | Mean Difference (I-J) | Std. Error | Sig. (2-tailed)
1-20 | T-indexer | TT-indexer | | | 0.04

Discussion

There are some indications that the proposed heuristic ranking method, Tag with Title, can improve knowledge searching on social bookmarking systems. This might be because the method utilizes information about user behavior. The result also implies that the T indexer is still important for this particular study. Finally, the chosen experimental factor can help the system to adjust the ranking and improve the results of knowledge searching.

6. Conclusion and Future Works

This preliminary study focuses on the comparison of heuristic search engines. Here, the heuristic indexer implemented uses Tag with Title. Thirty subjects were assigned to evaluate the results obtained from the search engines. Each subject specified three different queries, and each query was applied to the two search engines. The first 20 documents from each search engine were displayed for relevancy judgment. Finally, the subjects were asked to rate the relevancy of the search results on a five-point scale. The results show that the TT indexer returns a higher NDCG score, which implies that TT performs better. To further analyze the results, a paired-sample t-test was used. However, the number of subjects in the experiment is considered small. To confirm the finding, more subjects may be needed. In addition, the experiment should be extended to different search domains. Improving indexing enhances not only the performance of academic knowledge searches but also document searches in general. Future research in the area consists of extending the scale of the experiments, developing ranking, and optimizing the parameters.

ACKNOWLEDGMENT

The authors would like to thank Suan Sunandha Rajabhat University for scholarship support.

REFERENCES

Peng, Z., Mei, L., Yuhua, N., and Yi, Z. (2010). The Application of the Diigo-based Collaborative Learning Model in the Course "Fundamentals of Computers". International Conference on Educational and Information Technology (ICEIT 2010).
Chen, C., Wang, C., and Shih, J. (2011). The Effects of Employing Web 2.0-based Note-sharing Strategy in Teaching Chinese Rhetoric for Elementary School Students. Electrical and Control Engineering (ICECE), International Conference.
Khoii, R., Ahmadi, N., and Gharib, M. The Effects of Integrating Diigo Social Bookmarking into Schoology Learning Management System on EFL Learners' Autonomy and Use of Reading Strategies. International Conference ICT for Language Learning.
Suchanek, F. M., Vojnović, M., and Gunawardena, D. (2008). Social Tags: Meaning and Suggestions. CIKM '08, Napa Valley, California, USA, October.
Thom-Santelli, J., Muller, M. J., and Millen, D. R. (2008). Social Tagging Roles: Publishers, Evangelists, Leaders. CHI 2008, Florence, Italy, 5-10 April.
Gelernter, J. (2007). A Quantitative Analysis of Collaborative Tags: Evaluation for Information Retrieval, a Preliminary Study. International Conference on Collaborative Computing: Networking, Applications and Worksharing, Nov. 2007, New York, NY.
Budura, A., Michel, S., Cudré-Mauroux, P., and Aberer, K. (2008). To Tag or Not to Tag: Harvesting Adjacent Metadata in Large-Scale Tagging Systems. SIGIR '08, Singapore, July.
Choochaiwattana, W., and Spring, M. B. (2009). Applying Social Annotations to Retrieve and Re-rank Web Resources. Proceedings of 2009 International Conference on Information Management and Engineering (ICIME 2009), Kuala Lumpur, Malaysia, 3-5 April.
Li, B., and Zhu, Q. (2008). The Determination of Semantic Dimension in Social Tagging System Based on SOM Model. Second International Symposium on Intelligent Information Technology Application 2008 (IITA '08), Dec. 2008, Shanghai.
Jomsri, P., Sanguansintukul, S., and Choochaiwattana, W. (2009). A Comparison of Search Engine Using Tag Title and Abstract with CiteULike: An Initial Evaluation. The 4th IEEE Int. Conf. for Internet Technology and Secured Transactions (ICITST-2009), United Kingdom, 2009.
Jäschke, R., Marinho, L. B., Hotho, A., Schmidt-Thieme, L., and Stumme, G. (2007). Tag Recommendations in Folksonomies. In Proceedings of PKDD 2007, volume 4702 of Lecture Notes in Computer Science, Springer Verlag, pp.
Jomsri, P., and Prangchumpol, D. (2015). A Hybrid Model Ranking Search Result for Research Paper Searching on Social Bookmarking. 1st International Conference on Industrial Networks and Intelligent Systems (INISCom), IEEE, pp.
More informationInformation Retrieval (IR) Introduction to Information Retrieval. Lecture Overview. Why do we need IR? Basics of an IR system.
Introduction to Information Retrieval Ethan Phelps-Goodman Some slides taken from http://www.cs.utexas.edu/users/mooney/ir-course/ Information Retrieval (IR) The indexing and retrieval of textual documents.
More informationFind, New, Copy, Web, Page - Tagging for the (Re-)Discovery of Web Pages
Find, New, Copy, Web, Page - Tagging for the (Re-)Discovery of Web Pages Martin Klein and Michael L. Nelson Old Dominion University, Department of Computer Science Norfolk VA 23529 {mklein, mln}@cs.odu.edu
More informationDynamic Visualization of Hubs and Authorities during Web Search
Dynamic Visualization of Hubs and Authorities during Web Search Richard H. Fowler 1, David Navarro, Wendy A. Lawrence-Fowler, Xusheng Wang Department of Computer Science University of Texas Pan American
More informationClassic IR Models 5/6/2012 1
Classic IR Models 5/6/2012 1 Classic IR Models Idea Each document is represented by index terms. An index term is basically a (word) whose semantics give meaning to the document. Not all index terms are
More informationWeb Information Retrieval using WordNet
Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT
More informationLarge scale corporate Web Analysis for Business Intelligence
Industrial Clusters in England Large scale corporate Web Analysis for Business Intelligence Michele Barbera, Andrey Bratus, Nicola Sambin {barbera,bratus,sambin}@spaziodati.eu 29 April, 2016 25 Software
More informationdoi: / _32
doi: 10.1007/978-3-319-12823-8_32 Simple Document-by-Document Search Tool Fuwatto Search using Web API Masao Takaku 1 and Yuka Egusa 2 1 University of Tsukuba masao@slis.tsukuba.ac.jp 2 National Institute
More informationTerm-Frequency Inverse-Document Frequency Definition Semantic (TIDS) Based Focused Web Crawler
Term-Frequency Inverse-Document Frequency Definition Semantic (TIDS) Based Focused Web Crawler Mukesh Kumar and Renu Vig University Institute of Engineering and Technology, Panjab University, Chandigarh,
More informationA Study on Metadata Extraction, Retrieval and 3D Visualization Technologies for Multimedia Data and Its Application to e-learning
A Study on Metadata Extraction, Retrieval and 3D Visualization Technologies for Multimedia Data and Its Application to e-learning Naofumi YOSHIDA In this paper we discuss on multimedia database technologies
More informationTag-based Semantic Website Recommendation for Turkish Language
Tag-based Semantic Website Recommendation for Turkish Language Onur Yılmaz Middle East Technical University - Computer Engineering Department onur@onuryilmaz.me, yilmaz.onur@metu.edu.tr Abstract With the
More informationFeature Selecting Model in Automatic Text Categorization of Chinese Financial Industrial News
Selecting Model in Automatic Text Categorization of Chinese Industrial 1) HUEY-MING LEE 1 ), PIN-JEN CHEN 1 ), TSUNG-YEN LEE 2) Department of Information Management, Chinese Culture University 55, Hwa-Kung
More informationAn Approach to Evaluate and Enhance the Retrieval of Web Services Based on Semantic Information
An Approach to Evaluate and Enhance the Retrieval of Web Services Based on Semantic Information Stefan Schulte Multimedia Communications Lab (KOM) Technische Universität Darmstadt, Germany schulte@kom.tu-darmstadt.de
More informationarxiv: v1 [cs.dl] 23 Feb 2012
Analyzing Tag Distributions in Folksonomies for Resource Classification Arkaitz Zubiaga, Raquel Martínez, and Víctor Fresno arxiv:1202.5477v1 [cs.dl] 23 Feb 2012 NLP & IR Group @ UNED Abstract. Recent
More informationCOLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA
COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA Vincent W. Zheng, Yu Zheng, Xing Xie, Qiang Yang Hong Kong University of Science and Technology Microsoft Research Asia WWW 2010
More informationRecommender Systems: Practical Aspects, Case Studies. Radek Pelánek
Recommender Systems: Practical Aspects, Case Studies Radek Pelánek 2017 This Lecture practical aspects : attacks, context, shared accounts,... case studies, illustrations of application illustration of
More informationBoolean Model. Hongning Wang
Boolean Model Hongning Wang CS@UVa Abstraction of search engine architecture Indexed corpus Crawler Ranking procedure Doc Analyzer Doc Representation Query Rep Feedback (Query) Evaluation User Indexer
More informationFACILITATING VIDEO SOCIAL MEDIA SEARCH USING SOCIAL-DRIVEN TAGS COMPUTING
FACILITATING VIDEO SOCIAL MEDIA SEARCH USING SOCIAL-DRIVEN TAGS COMPUTING ABSTRACT Wei-Feng Tung and Yan-Jun Huang Department of Information Management, Fu-Jen Catholic University, Taipei, Taiwan Online
More informationAcademic Paper Recommendation Based on Heterogeneous Graph
Academic Paper Recommendation Based on Heterogeneous Graph Linlin Pan, Xinyu Dai, Shujian Huang, and Jiajun Chen National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023,
More informationKEYWORD EXTRACTION FROM DESKTOP USING TEXT MINING TECHNIQUES
KEYWORD EXTRACTION FROM DESKTOP USING TEXT MINING TECHNIQUES Dr. S.Vijayarani R.Janani S.Saranya Assistant Professor Ph.D.Research Scholar, P.G Student Department of CSE, Department of CSE, Department
More informationChapter 2. Architecture of a Search Engine
Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them
More informationTag-based Social Interest Discovery
Tag-based Social Interest Discovery Xin Li / Lei Guo / Yihong (Eric) Zhao Yahoo!Inc 2008 Presented by: Tuan Anh Le (aletuan@vub.ac.be) 1 Outline Introduction Data set collection & Pre-processing Architecture
More informationDocument Retrieval using Predication Similarity
Document Retrieval using Predication Similarity Kalpa Gunaratna 1 Kno.e.sis Center, Wright State University, Dayton, OH 45435 USA kalpa@knoesis.org Abstract. Document retrieval has been an important research
More informationA RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH
A RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH A thesis Submitted to the faculty of the graduate school of the University of Minnesota by Vamshi Krishna Thotempudi In partial fulfillment of the requirements
More informationChapter 8. Evaluating Search Engine
Chapter 8 Evaluating Search Engine Evaluation Evaluation is key to building effective and efficient search engines Measurement usually carried out in controlled laboratory experiments Online testing can
More informationA Web Page Segmentation Method by using Headlines to Web Contents as Separators and its Evaluations
IJCSNS International Journal of Computer Science and Network Security, VOL.13 No.1, January 2013 1 A Web Page Segmentation Method by using Headlines to Web Contents as Separators and its Evaluations Hiroyuki
More informationActive Code Search: Incorporating User Feedback to Improve Code Search Relevance
Active Code Search: Incorporating User Feedback to Improve Code Search Relevance Shaowei Wang, David Lo, and Lingxiao Jiang School of Information Systems, Singapore Management University {shaoweiwang.2010,davidlo,lxjiang}@smu.edu.sg
More informationChrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO
Chrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO INDEX Proposal Recap Implementation Evaluation Future Works Proposal Recap Keyword Visualizer (chrome
More informationGoNTogle: A Tool for Semantic Annotation and Search
GoNTogle: A Tool for Semantic Annotation and Search Giorgos Giannopoulos 1,2, Nikos Bikakis 1, Theodore Dalamagas 2 and Timos Sellis 1,2 1 KDBS Lab, School of ECE, NTU Athens, Greece. {giann@dblab.ntua.gr,
More informationProf. Ahmet Süerdem Istanbul Bilgi University London School of Economics
Prof. Ahmet Süerdem Istanbul Bilgi University London School of Economics Media Intelligence Business intelligence (BI) Uses data mining techniques and tools for the transformation of raw data into meaningful
More informationRanking Techniques in Search Engines
Ranking Techniques in Search Engines Rajat Chaudhari M.Tech Scholar Manav Rachna International University, Faridabad Charu Pujara Assistant professor, Dept. of Computer Science Manav Rachna International
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationTSS: A Hybrid Web Searches
410 TSS: A Hybrid Web Searches Li-Xin Han 1,2,3, Gui-Hai Chen 3, and Li Xie 3 1 Department of Mathematics, Nanjing University, Nanjing 210093, P.R. China 2 Department of Computer Science and Engineering,
More informationEvaluating an Associative Browsing Model for Personal Information
Evaluating an Associative Browsing Model for Personal Information Jinyoung Kim, W. Bruce Croft, David A. Smith and Anton Bakalov Department of Computer Science University of Massachusetts Amherst {jykim,croft,dasmith,abakalov}@cs.umass.edu
More informationOleksandr Kuzomin, Bohdan Tkachenko
International Journal "Information Technologies Knowledge" Volume 9, Number 2, 2015 131 INTELLECTUAL SEARCH ENGINE OF ADEQUATE INFORMATION IN INTERNET FOR CREATING DATABASES AND KNOWLEDGE BASES Oleksandr
More informationNortheastern University in TREC 2009 Million Query Track
Northeastern University in TREC 2009 Million Query Track Evangelos Kanoulas, Keshi Dai, Virgil Pavlu, Stefan Savev, Javed Aslam Information Studies Department, University of Sheffield, Sheffield, UK College
More information