Information Retrieval Technique for MIR and P2P Network

Size: px
Start display at page:

Download "Information Retrieval Technique for MIR and P2P Network"

Transcription

1 2014 年 5 月第十七卷二期 Vol. 17, No. 2, May 2014 Information Retrieval Technique for MIR and P2P Network Huei-Chen Hsu Ya-Li Chung Hui-Chun Chan

2 Web Journal of Chinese Management Review Vol. 17 No 2 1 Information Retrieval Technique for MIR and P2P Network ABSTRACT Huei-Chen Hsu Ya-Li Chung Hui-Chun Chan Peer-to-Peer (P2P) systems are application layer networks which enable networked hosts to share resources in a distributed manner. An important problem in such networks is to be able to efficiently search the contents of other peers. The focus of this paper is the motivation how technological innovations enable new value propositions. In this paper, the authors describe a P2P system music metadata registration and query resolution and present a survey of search techniques for information retrieval in P2P networks. These studies showed the technical feasibility of these ideas in a prototypical P2P environment and also present a realistic experimental evaluation and comparison of these techniques, using a distributed middleware infrastructure we have designed and implemented. Keywords: Information Retrieval; P2P network; music information retrieval (MIR) Huei-Chen Hsu Department of Marketing Management, TransWorld University Ya-Li Chung Department of Business Administration, TransWorld University Hui-Chun Chan Department of Business Administration, TransWorld University

3 Web Journal of Chinese Management Review Vol. 17 No 2 2 Introduction The advances in public networks and the deployment of powerful personal computing units by end users have brought a shift from the traditional Client-Server computing model to the Peer-to-Peer (P2P) computing model. There are many advantages to P2P networks, such as distributed computing and storage power, fault-tolerance, and reliability. In the P2P model, each node (peer) has a symmetric role of both a client and a server. Large numbers of peers collaborate in a dynamic and ad-hoc manner and share information in large-scale distributed environments without any centralized coordination. Recently the P2P model has also been proposed in as an alternative model to WWW-crawling based systems to cope with information that changes frequently. One of the greatest potential benefits of P2P networks [Baumann, 2003] is the ability to harness the collaborative efforts of users to provide semantic, subjective, and community-based tags to describe musical content. The digital content industry has to think about new value propositions for customers. It leads to successful adoption by end users. New value propositions being enabled by technological innovations are possible. In this paper we consider the information retrieval problem in P2P networks. Assume that each peer has a database (or collection) of documents that it shares in the network. The documents can be collections of text, audio, video or other semi-structured documents. A node searches for information by sending query messages to its peers. We present recent work in the field of music information retrieval (MIR) satisfying the requirements. The information retrieval problem is a more complex operation than traditional search techniques based on object identifiers or filenames, currently being used in P2P systems. The Information Retrieval (IR) community has over the years developed algorithms [Baumann & Klüter, 2002] for precise document retrieval in static data environments (such as a corpus of documents). However these methods are not directly applicable to P2P systems where there is no central repository, there are large numbers of documents and nodes are joining and leaving in a dynamic and ad-hoc manner. We believe that such new capabilities are an important step for making P2P systems applicable to a wide set of applications than simply object storage. After the presentation of a recent approach [Burns & Richard, 2006] to value-based modeling in e-commerce scenarios we present a survey of search techniques for information retrieval in P2P networks including our recently proposed Intelligent Search Mechanism and introduce the middleware simulation infrastructure and present some of our experimental results.

4 Web Journal of Chinese Management Review Vol. 17 No 2 3 Information Retrieval techniques for MIR and P2P The main challenge for information retrieval in P2P networks is to be able to guide the query to the sources that contain the most relevant answers in a fast and efficient way. In particular, our objective is to decrease the number of messages sent per query while at the same time maintain a high recall rate (i.e. search result quality). One could argue that both the ideas of Music Information Retrieval (MIR) and Peer-to-Peer networks (P2P) essentially started with Napster ( While a variety of novel ways of searching and retrieving music, especially in audio format, have been proposed, they haven t found their way into P2P networks and remain largely academic exercises. We present the search techniques using object identifiers rather than keywords. The Breadth First Search (BFS) Technique BFS is a technique widely used in P2P file sharing applications such as Gnutella [Gnutella, 2012]. BFS sacrifices its performance and network utilization in the sake of its simplicity. The BFS search protocol (figure 1a) in a peer-to-peer network N works as follows. A node q generates a Query message which is propagated to all of its neighbors (peers). When a peer p receives a Query request, it first forwards the query to all the peers, other than the sender, and then searches its local repository for relevant matches. If some node r receives the query and has a match, r generates a QueryHit message to transmit the result. The QueryHit message includes information such the number of corresponding documents and the network connectivity of the answering peer. When node q receives QueryHits from more than one peer, it may decide to download the file from the peer with the best network connectivity. QueryHit messages are sent along the same path that carried the Query messages; therefore no path information needs to be stored in the transmitted messages. The Random Breadth-First-Search (RBFS) Technique The Random Breadth-First-Search (RBFS) technique can dramatically improve over the naive BFS approach. In RBFS (figure 1b) a peer q forwards a search message to only a fraction of its peers, selected at random. The fraction of peers is a parameter to the mechanism. The advantage of RBFS is that it does not require global knowledge; a node is able to make local decisions in a fast manner since it only needs to select a portion of its peers. On the other hand, this algorithm is probabilistic. Therefore some large segments of the network may become unreachable because a node was not able to understand that a particular link would lead the query to a large segment of the network. The Intelligent Search Mechanism (ISM) The Intelligent Search Mechanism (ISM) is a new mechanism for information retrieval in P2P networks (figure 1c). The objective of the algorithm, which is to help the querying peer is to find the most relevant answers to its query quickly and efficiently.

5 Web Journal of Chinese Management Review Vol. 17 No 2 4 Keys to improving the speed and efficiency of the information retrieval mechanism is to minimize the communication costs, that is, the number of messages sent between the peers, and to minimize the number of peers that are queried for each search request. To achieve this, a peer estimates for each query, which of its peers are more likely to reply to this query, and propagates the query message to those peers only. Directed BFS and the Most Results in Past (>RES) Heuristic In Yang [Yang & Garcia-Molina, 2002] present a technique where each node forwards a query to a subset of its peers based on some aggregated statistics (figure 1d). The authors compare a number of query routing heuristics and mention that the Most Results in Past (>RES) heuristic has the most satisfactory performance. A query is satisfied if Z, for some constant Z, or more results are returned. In >RES a peer q forwards a search message to k peers which returned the most results for the last m queries. In their experiments they chose k = 1 and m = 10 turning in that way their approach from a Directed BFS into a Depth-First-Search approach. Figure 1 Routing Query Messages in a P2P Network Using ISM works well in environments which exhibit strong degrees of query locality and where peers hold some specialized knowledge. One problem of the ISM mechanism is that search messages may get locked into a cycle and consequently fail to explore other parts of the network. To solve this problem, we pick a small random subset of peers and add it to the set of most relevant peers for each query. As a result, with high probability the mechanism will explore a larger part of the network and will learn about the contents of additional peers. P2P Community Platform P2P system that provides flexible search semantics based on attribute-value (AV) pairs

6 Web Journal of Chinese Management Review Vol. 17 No 2 5 and supports automatic extraction of musical features and content-based similarity retrieval. It is a very natural paradigm to process the content where it is, in this case at the private computers of the consumers. We will first make some remarks why a mainstream music service should follow a P2P paradigm in order to be successful. In addition to this a P2P protocol offers the possibility to implement peer clustering by content to realize such issues as collaborative retrieval on top. This step offers the potential to have automatic and dynamic community building (e.g. special genre interest groups) over the time the network evolves. These features fall into both categories of the value type framework: efficiency = reduction of search time and esthetics = exploring the beauty of recommended, previously unknown but relevant new tracks. JXTA was chosen as the platform for developing our integrated approach. We used straight forward the following functionalities of the client to realize our own version M-Peer: (1) Peer Groups: can be used to model different interest groups according to specific styles, genres or artist fan communities. (2) Group Chat, 1 to 1 Chat: supporting the verbal communication about preferred music has been one of the cornerstones in our design of an intelligent P2P music platform since we aim at extracting cultural metadata from chats in the future (see Fig. 2). (3) File Sharing: there is nothing special about this feature, but we realized that we would have a great benefit by adding our special additional tags (e.g. pre-computed audio profiles, extended meta-tags such as similar artists, etc.) into the existing MP3 file format which is supported at hand by its mime type. (4) Meta-tag representation: by using ID3v2 as part of the MP3 file format we were able to supply the interesting musical features using a well-established format, having a critical mass of users, resp. standard applications such as MP3 players, playlist and cataloguing tools to read and store this format. In this way simple file-sharing enters a totally new dimension of quality since each MP3 file can be interpreted as a self-contained music knowledge container. (5) File Search: at this point we added again our component for a fuzzy match of phonetic misspelled queries to correct entities in the artist field. (6) Presentation of metadata attributes and values: after a computation of similar songs by using the music similarity metrics we added the according artists as a new meta-tag. Furthermore standard ID3 tags such as artist and genre are presented (see Fig.3).

7 Web Journal of Chinese Management Review Vol. 17 No 2 6 Figure 2 MPeer Client: Embedded recommendations Figure 3 M-Peer Client: Meta-tag Search The aforementioned automatic audio analysis recognizes properties about loudness, tempo features. These features can be used for the determination of music similarity. Lots of authors have presented different approaches in the recent past to tackle this problem which is a research issue in music information retrieval (MIR). An excellent introduction to the entire MIR field and related communities can be found in [Futrelle & Downie, 2002]. While we are interested in providing recommendations for similar music other authors have coped with the problem of identifying audio tracks in large music databases even for distorted audio queries [Allamanche et. al., 2001]. Such methods are essential to realize stable digital right management systems (DRM) either for central server or P2P approaches. In our perspective on consumer value we focus on the value for end-users, which is not directly increased by using DRM technology. An initial description of the system presented in this paper can be found in Grimm and Nützel [Grimm & Nützel, 2002]. The main difference of this paper from previous work is presenting an alternative model without copy protection for a friendly P2P approach

8 Web Journal of Chinese Management Review Vol. 17 No 2 7 enabling users to re-distribute musical content. Users do not have to care for DRM and have the freedom to receive, consume and re-distribute multimedia content. Re-distribution allows for earning credits and represents therefore increased consumer value. In this way their approach can be seen as complementary to our work [Pachet & Aucouturier, 2003]. Wang, Li and Shi [Wang et al., 2002] evaluated four different P2P models with integrated content-based music retrieval. They showed how acceleration of retrieval could be achieved in large-scale distributed music networks. Therefore they reach an increase of consumer value by improving the extrinsic value of efficiency for music selection. The paper outlines a very generic content-based retrieval method missing details about the audio features and similarity measure. It seems to be optimized for exact retrieval using audio extracts or sung queries. In contrast to our approach recommendations for similar music based on sound or cultural context are neglected. Following recent work in P2P networks [Pfeiffer & Vincent, 2001]; our system decouples the process of locating content from the process of downloading it. Each node in the network not only stores music files for sharing but also information about the location of other music files in the network. Therefore when the user submits a search to the P2P system, the system returns a set of peers from which the music files that satisfy the search criteria can then be downloaded. Each file in the system is described by a Music File Description (MFD) which essentially is a set of attribute-value (AV) pairs. The user as well as automatically extracted features for describing musical content (underlined). The Music Feature Extraction Engine (MFEE) is the component that calculates these features from the audio files (see Figure 4). These automatically extracted features in addition to being used for content-based similarity search can also be used for various other types of audio analysis such as musical genre classification. There are two operations that are supported by the system and both take MFD as arguments. In registration, a new music file is made available for sharing and its associated MFD is registered to the P2P system. During searching, the user query is converted into an appropriate MFD which is then used to locate the nodes that contain files that match the search criteria. Once the nodes are located they are contacted directly to start the actual downloading for the file. The main concern in the design of our system is the efficient content discovery rather than the actual downloading mechanism [Logan, 2000].

9 Web Journal of Chinese Management Review Vol. 17 No 2 8 Figure 4 Software architecture on a peer node

10 Web Journal of Chinese Management Review Vol. 17 No 2 9 PeerWare Simulation Infrastructure & Experiments In order to benchmark the efficiency of the information retrieval algorithms, we have implemented PeerWare 3, a distributed middleware infrastructure which allows us to benchmark different query routing algorithms over large-scale P2P systems. For the experiments we deployed 104 nodes running on a network of 50 workstations, each of which has an AMD Athlon4 1.4 GHz processor with 1GB RAM running Mandrake Linux 8.0 (kernel ) all interconnected with a 10/100 LAN. Our evaluation metrics were: (i) the recall rate, that is, the fraction of documents each of the search mechanisms retrieves, and (ii) the efficiency of the techniques, that is, the number of messages used to find the results. To test the applicability of the information retrieval algorithms in P2P systems, we chose to implement only the algorithms that require local knowledge when searching for the documents (i.e. BFS, RBFS, >RES and ISM). RBFS, >RES and ISM all forward some query message to only half of the neighbors that would be contacted with BFS. Additionally for >RES, we set k = 0:5 * d and m = 100, where d is the degree of a node. In this way a peer propagates the request selectively to half of its peer which returned the most results in the past 100 queries. In our first experiment we performed 10 queries, each of which was run 10 consecutive times, and where each host has an average degree of 8 connections. The queries are keywords randomly sampled from the dataset. Figure 5 (top, left) shows the number of messages required by the four query routing techniques. The figure indicates that Breadth-First-Search (BFS) requires almost 2,5 times as many messages as its competitors with around 1050 messages per query. BFS's recall rate is used as the basis for comparing the recall rate of the other techniques and is therefore set to 100%. Random Breadth-First-Search (RBFS), the Intelligent Search Mechanism (ISM) and the Most Results in the Past (>RES) on the other hand use all significantly less messages but ISM is the one that finds the most documents. That is attributed to the fact that ISM improves its knowledge over time. More specifically, ISM achieves almost 90% recall rate while using only 38% of the BFS's messages. Figure 5 (top, right) illustrates that both >RES and ISM start out with a low recall rate (i.e %) because they are initially both choosing their neighbors at random. Therefore their recall rate performance is comparable to that of RBFS. The values shown are the averages of 10 consecutive requests. In the second experiment we investigated the effect of increasing the TTL parameter to our results. Figure 5 (bottom, left) shows that by increasing the value of TTL to 5 the ISM mechanism discovers almost the same documents with what BFS finds for TTL=4. More specifically, our experimental results show that our mechanism achieves 100% recall rate (Figure 2 (bottom, right)) while using only 57% of the number of messages used in BFS. We have also shown that ISM has approximately the same query response time (QRT) (~250ms) with its two competitors (RBFS and >RES) and far less QRT than BFS (which required 60% more

11 Web Journal of Chinese Management Review Vol. 17 No 2 10 time). Figure 5 Messages (left) and Recall Rate (right) used by the 4 Algorithms for 10x10 queries for (TTL=4) (top) and (TTL=5) (bottom)

12 Web Journal of Chinese Management Review Vol. 17 No 2 11 Conclusions In this paper we presented a number of different query routing techniques that enable efficient information retrieval in P2P systems. Existing techniques are not scaling well because they are either based on the idea of preceding the network with queries or because they require some form of global knowledge. We have shown the various tradeoffs and experimentally evaluated four of the techniques that require no global knowledge. The main challenge for a query routing technique is to query peers that contain the most relevant content with minimum messaging. We described technological solutions may act as value enabling tools in e-commerce scenarios for virtual music. Advanced features for artist recommendations based on audio and cultural features may act in the same way. We showed the technical feasibility of these ideas in a prototypical P2P environment. Another important aspect of the proposed system is that new attributes can be incorporated into the system with minimum effort. In addition, user studies need to be conducted to explore how the users interact with the system and what are their typical queries. In this paper we consider the information retrieval problem in P2P networks. In particular, our objective is to decrease the number of messages sent per query while at the same time maintain a high recall rate (i.e. search result quality). Also, we present a survey of search techniques for information retrieval in P2P networks, including recent techniques and express a realistic experimental evaluation and comparison of these techniques, using a distributed middleware infrastructure we have designed and implemented.

13 Web Journal of Chinese Management Review Vol. 17 No 2 12 REFERENCES Allamanche, E., Herre, J., Hellmuth, O., Froba, B.,Kastner, T., & Cremer, M. (2001). Content-based identification of audio material using MPEG-7 low level description. In proceedings of the 2 nd ISMIR 2001, Indiana University Bloomington, Indiana, USA. Baumann, S. (2003). Music similarity analysis in a P2P environment. In proceedings of the 4 th WIAMIS 2003, London, UK. Baumann, S., & Klüter, A. (2002). Super-convenience for non-musicians: Querying MP3 and the semantic web. In proceedings of the 3 rd ISMIR 2002, Paris, France. Burns, K. S., & Lutz, R. J. (2006). The function of format: Consumer responses to six on-line advertising formats. Journal of Advertising, 35(1), Futrelle, J., & Downie, J. S. (2002). Interdisciplinary communities and research issues in music information retrieval. In proceedings of the 3 rd ISMIR 2002, Paris, France. Gnutella (2012). Grimm, R., & Nützel, J. (2002). Peer-to-peer music-sharing with profit but without copy protection. In proceedings of the 2 nd WEDELMUSIC 2002, Darmstadt, Germany. Logan,S. (2000). Mel frequency cepstral coefficients for music modelling. In proceedings of the 1 st ISMIR 2000, Plymouth, USA. Pachet, F., & Aucouturier, J. (2003). Representing musical genre: A State of the Art. Journal of New Music Research, 32(1), Pfeiffer, S., & Vincent, T. (2001). Formalization of MPEG-1 compressed domain audio features. Report No 01/196 of CSIRO Mathematical and Information Sciences, Australia. Wang, C., Li, J., & Shi, S. (2002). A kind of content-based music information retrieval method in a peer-to-peer environment. In proceedings of the 3 rd ISMIR 2002, Paris, France. Yang, B., & Garcia-Molina, H. (2002). Efficient search in peer-to-peer networks. In proceedings of the 22 nd ICDCS2002, Vienna, Austria.

Exploiting Locality for Scalable Information Retrieval in Peer-to-Peer Networks

Exploiting Locality for Scalable Information Retrieval in Peer-to-Peer Networks Exploiting Locality for Scalable Information Retrieval in Peer-to-Peer Networks D. Zeinalipour-Yazti, Vana Kalogeraki, Dimitrios Gunopulos Department of Computer Science and Engineering University of California

More information

A Document-centered Approach to a Natural Language Music Search Engine

A Document-centered Approach to a Natural Language Music Search Engine A Document-centered Approach to a Natural Language Music Search Engine Peter Knees, Tim Pohle, Markus Schedl, Dominik Schnitzer, and Klaus Seyerlehner Dept. of Computational Perception, Johannes Kepler

More information

An Approach to Evaluate and Enhance the Retrieval of Web Services Based on Semantic Information

An Approach to Evaluate and Enhance the Retrieval of Web Services Based on Semantic Information An Approach to Evaluate and Enhance the Retrieval of Web Services Based on Semantic Information Stefan Schulte Multimedia Communications Lab (KOM) Technische Universität Darmstadt, Germany schulte@kom.tu-darmstadt.de

More information

Peer-to-Peer Systems. Chapter General Characteristics

Peer-to-Peer Systems. Chapter General Characteristics Chapter 2 Peer-to-Peer Systems Abstract In this chapter, a basic overview is given of P2P systems, architectures, and search strategies in P2P systems. More specific concepts that are outlined include

More information

Semantic Overlay Networks

Semantic Overlay Networks Semantic Overlay Networks Arturo Crespo and Hector Garcia-Molina Write-up by Pavel Serdyukov Saarland University, Department of Computer Science Saarbrücken, December 2003 Content 1 Motivation... 3 2 Introduction

More information

Information Retrieval in Peer to Peer Systems. Sharif University of Technology. Fall Dr Hassan Abolhassani. Author: Seyyed Mohsen Jamali

Information Retrieval in Peer to Peer Systems. Sharif University of Technology. Fall Dr Hassan Abolhassani. Author: Seyyed Mohsen Jamali Information Retrieval in Peer to Peer Systems Sharif University of Technology Fall 2005 Dr Hassan Abolhassani Author: Seyyed Mohsen Jamali [Slide 2] Introduction Peer-to-Peer systems are application layer

More information

Opportunistic Application Flows in Sensor-based Pervasive Environments

Opportunistic Application Flows in Sensor-based Pervasive Environments Opportunistic Application Flows in Sensor-based Pervasive Environments Nanyan Jiang, Cristina Schmidt, Vincent Matossian, and Manish Parashar ICPS 2004 1 Outline Introduction to pervasive sensor-based

More information

A Survey of Peer-to-Peer Content Distribution Technologies

A Survey of Peer-to-Peer Content Distribution Technologies A Survey of Peer-to-Peer Content Distribution Technologies Stephanos Androutsellis-Theotokis and Diomidis Spinellis ACM Computing Surveys, December 2004 Presenter: Seung-hwan Baek Ja-eun Choi Outline Overview

More information

An Approach to Automatically Tracking Music Preference on Mobile Players

An Approach to Automatically Tracking Music Preference on Mobile Players An Approach to Automatically Tracking Music Preference on Mobile Players Tim Pohle, 1 Klaus Seyerlehner 1 and Gerhard Widmer 1,2 1 Department of Computational Perception Johannes Kepler University Linz,

More information

UAPRIORI: AN ALGORITHM FOR FINDING SEQUENTIAL PATTERNS IN PROBABILISTIC DATA

UAPRIORI: AN ALGORITHM FOR FINDING SEQUENTIAL PATTERNS IN PROBABILISTIC DATA UAPRIORI: AN ALGORITHM FOR FINDING SEQUENTIAL PATTERNS IN PROBABILISTIC DATA METANAT HOOSHSADAT, SAMANEH BAYAT, PARISA NAEIMI, MAHDIEH S. MIRIAN, OSMAR R. ZAÏANE Computing Science Department, University

More information

pfusion: A P2P Architecture for Internet-Scale Content-Based Search and Retrieval

pfusion: A P2P Architecture for Internet-Scale Content-Based Search and Retrieval pfusion: A P2P Architecture for Internet-Scale Content-Based Search and Retrieval Demetrios Zeinalipour-Yazti, Vana Kalogeraki and Dimitrios Gunopulos Charalampos S. Nikolaou charnik@di.uoa.gr Department

More information

Improving Search In Peer-to-Peer Systems

Improving Search In Peer-to-Peer Systems Improving Search In Peer-to-Peer Systems Presented By Jon Hess cs294-4 Fall 2003 Goals Present alternative searching methods for systems with loose integrity constraints Probabilistically optimize over

More information

Yunfeng Zhang 1, Huan Wang 2, Jie Zhu 1 1 Computer Science & Engineering Department, North China Institute of Aerospace

Yunfeng Zhang 1, Huan Wang 2, Jie Zhu 1 1 Computer Science & Engineering Department, North China Institute of Aerospace [Type text] [Type text] [Type text] ISSN : 0974-7435 Volume 10 Issue 20 BioTechnology 2014 An Indian Journal FULL PAPER BTAIJ, 10(20), 2014 [12526-12531] Exploration on the data mining system construction

More information

RECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems

RECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems Rec. ITU-R BS.1693 1 RECOMMENDATION ITU-R BS.1693 Procedure for the performance test of automated query-by-humming systems (Question ITU-R 8/6) (2004) The ITU Radiocommunication Assembly, considering a)

More information

P2P Contents Distribution System with Routing and Trust Management

P2P Contents Distribution System with Routing and Trust Management The Sixth International Symposium on Operations Research and Its Applications (ISORA 06) Xinjiang, China, August 8 12, 2006 Copyright 2006 ORSC & APORC pp. 319 326 P2P Contents Distribution System with

More information

Scalable overlay Networks

Scalable overlay Networks overlay Networks Dr. Samu Varjonen 1 Lectures MO 15.01. C122 Introduction. Exercises. Motivation. TH 18.01. DK117 Unstructured networks I MO 22.01. C122 Unstructured networks II TH 25.01. DK117 Bittorrent

More information

SIPCache: A Distributed SIP Location Service for Mobile Ad-Hoc Networks

SIPCache: A Distributed SIP Location Service for Mobile Ad-Hoc Networks SIPCache: A Distributed SIP Location Service for Mobile Ad-Hoc Networks Simone Leggio Hugo Miranda Kimmo Raatikainen Luís Rodrigues University of Helsinki University of Lisbon August 16, 2006 Abstract

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.1 Introduction Pattern recognition is a set of mathematical, statistical and heuristic techniques used in executing `man-like' tasks on computers. Pattern recognition plays an

More information

Keywords Data alignment, Data annotation, Web database, Search Result Record

Keywords Data alignment, Data annotation, Web database, Search Result Record Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web

More information

Assignment 5. Georgia Koloniari

Assignment 5. Georgia Koloniari Assignment 5 Georgia Koloniari 2. "Peer-to-Peer Computing" 1. What is the definition of a p2p system given by the authors in sec 1? Compare it with at least one of the definitions surveyed in the last

More information

Simulating Search Strategies for Gnutella

Simulating Search Strategies for Gnutella ENSC 835: HIGH-PERFORMANCE NETWORKS FINAL PROJECT PRESENTATIONS Fall 2003 Simulating Search Strategies for Gnutella Chun Wai Chan http://www.sfu.ca/~cchany/ensc835 cchany@sfu.ca 1 Roadmap Introduction:

More information

Addressed Issue. P2P What are we looking at? What is Peer-to-Peer? What can databases do for P2P? What can databases do for P2P?

Addressed Issue. P2P What are we looking at? What is Peer-to-Peer? What can databases do for P2P? What can databases do for P2P? Peer-to-Peer Data Management - Part 1- Alex Coman acoman@cs.ualberta.ca Addressed Issue [1] Placement and retrieval of data [2] Server architectures for hybrid P2P [3] Improve search in pure P2P systems

More information

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Empowering People with Knowledge the Next Frontier for Web Search Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Important Trends for Web Search Organizing all information Addressing user

More information

A Tagging Approach to Ontology Mapping

A Tagging Approach to Ontology Mapping A Tagging Approach to Ontology Mapping Colm Conroy 1, Declan O'Sullivan 1, Dave Lewis 1 1 Knowledge and Data Engineering Group, Trinity College Dublin {coconroy,declan.osullivan,dave.lewis}@cs.tcd.ie Abstract.

More information

International Journal of Scientific & Engineering Research Volume 8, Issue 5, May ISSN

International Journal of Scientific & Engineering Research Volume 8, Issue 5, May ISSN International Journal of Scientific & Engineering Research Volume 8, Issue 5, May-2017 106 Self-organizing behavior of Wireless Ad Hoc Networks T. Raghu Trivedi, S. Giri Nath Abstract Self-organization

More information

Advanced Peer to Peer Discovery and Interaction Framework

Advanced Peer to Peer Discovery and Interaction Framework Advanced Peer to Peer Discovery and Interaction Framework Peeyush Tugnawat J.D. Edwards and Company One, Technology Way, Denver, CO 80237 peeyush_tugnawat@jdedwards.com Mohamed E. Fayad Computer Engineering

More information

A Content Based Image Retrieval System Based on Color Features

A Content Based Image Retrieval System Based on Color Features A Content Based Image Retrieval System Based on Features Irena Valova, University of Rousse Angel Kanchev, Department of Computer Systems and Technologies, Rousse, Bulgaria, Irena@ecs.ru.acad.bg Boris

More information

Two-layer Distance Scheme in Matching Engine for Query by Humming System

Two-layer Distance Scheme in Matching Engine for Query by Humming System Two-layer Distance Scheme in Matching Engine for Query by Humming System Feng Zhang, Yan Song, Lirong Dai, Renhua Wang University of Science and Technology of China, iflytek Speech Lab, Hefei zhangf@ustc.edu,

More information

The Comparative Study of Machine Learning Algorithms in Text Data Classification*

The Comparative Study of Machine Learning Algorithms in Text Data Classification* The Comparative Study of Machine Learning Algorithms in Text Data Classification* Wang Xin School of Science, Beijing Information Science and Technology University Beijing, China Abstract Classification

More information

Lily: Ontology Alignment Results for OAEI 2009

Lily: Ontology Alignment Results for OAEI 2009 Lily: Ontology Alignment Results for OAEI 2009 Peng Wang 1, Baowen Xu 2,3 1 College of Software Engineering, Southeast University, China 2 State Key Laboratory for Novel Software Technology, Nanjing University,

More information

Automatically Adapting the Structure of Audio Similarity Spaces

Automatically Adapting the Structure of Audio Similarity Spaces Automatically Adapting the Structure of Audio Similarity Spaces Tim Pohle 1, Peter Knees 1, Markus Schedl 1 and Gerhard Widmer 1,2 1 Department of Computational Perception Johannes Kepler University Linz,

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW PAPER ON IMPLEMENTATION OF DOCUMENT ANNOTATION USING CONTENT AND QUERYING

More information

A Decentralized Content-based Aggregation Service for Pervasive Environments

A Decentralized Content-based Aggregation Service for Pervasive Environments A Decentralized Content-based Aggregation Service for Pervasive Environments Nanyan Jiang, Cristina Schmidt, Manish Parashar The Applied Software Systems Laboratory Rutgers, The State University of New

More information

LARGE-VOCABULARY CHINESE TEXT/SPEECH INFORMATION RETRIEVAL USING MANDARIN SPEECH QUERIES

LARGE-VOCABULARY CHINESE TEXT/SPEECH INFORMATION RETRIEVAL USING MANDARIN SPEECH QUERIES LARGE-VOCABULARY CHINESE TEXT/SPEECH INFORMATION RETRIEVAL USING MANDARIN SPEECH QUERIES Bo-ren Bai 1, Berlin Chen 2, Hsin-min Wang 2, Lee-feng Chien 2, and Lin-shan Lee 1,2 1 Department of Electrical

More information

Early Measurements of a Cluster-based Architecture for P2P Systems

Early Measurements of a Cluster-based Architecture for P2P Systems Early Measurements of a Cluster-based Architecture for P2P Systems Balachander Krishnamurthy, Jia Wang, Yinglian Xie I. INTRODUCTION Peer-to-peer applications such as Napster [4], Freenet [1], and Gnutella

More information

DISTRIBUTED COMPUTER SYSTEMS ARCHITECTURES

DISTRIBUTED COMPUTER SYSTEMS ARCHITECTURES DISTRIBUTED COMPUTER SYSTEMS ARCHITECTURES Dr. Jack Lange Computer Science Department University of Pittsburgh Fall 2015 Outline System Architectural Design Issues Centralized Architectures Application

More information

A Fast and High Throughput SQL Query System for Big Data

A Fast and High Throughput SQL Query System for Big Data A Fast and High Throughput SQL Query System for Big Data Feng Zhu, Jie Liu, and Lijie Xu Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing, China 100190

More information

STUDYING OF CLASSIFYING CHINESE SMS MESSAGES

STUDYING OF CLASSIFYING CHINESE SMS MESSAGES STUDYING OF CLASSIFYING CHINESE SMS MESSAGES BASED ON BAYESIAN CLASSIFICATION 1 LI FENG, 2 LI JIGANG 1,2 Computer Science Department, DongHua University, Shanghai, China E-mail: 1 Lifeng@dhu.edu.cn, 2

More information

Overlay and P2P Networks. Unstructured networks. Prof. Sasu Tarkoma

Overlay and P2P Networks. Unstructured networks. Prof. Sasu Tarkoma Overlay and P2P Networks Unstructured networks Prof. Sasu Tarkoma 19.1.2015 Contents Unstructured networks Last week Napster Skype This week: Gnutella BitTorrent P2P Index It is crucial to be able to find

More information

Information mining and information retrieval : methods and applications

Information mining and information retrieval : methods and applications Information mining and information retrieval : methods and applications J. Mothe, C. Chrisment Institut de Recherche en Informatique de Toulouse Université Paul Sabatier, 118 Route de Narbonne, 31062 Toulouse

More information

Overlay and P2P Networks. Unstructured networks. Prof. Sasu Tarkoma

Overlay and P2P Networks. Unstructured networks. Prof. Sasu Tarkoma Overlay and P2P Networks Unstructured networks Prof. Sasu Tarkoma 20.1.2014 Contents P2P index revisited Unstructured networks Gnutella Bloom filters BitTorrent Freenet Summary of unstructured networks

More information

Model-Based Social Networking Over Femtocell Environments

Model-Based Social Networking Over Femtocell Environments Proc. of World Cong. on Multimedia and Computer Science Model-Based Social Networking Over Femtocell Environments 1 Hajer Berhouma, 2 Kaouthar Sethom Ben Reguiga 1 ESPRIT, Institute of Engineering, Tunis,

More information

Scalable Hybrid Search on Distributed Databases

Scalable Hybrid Search on Distributed Databases Scalable Hybrid Search on Distributed Databases Jungkee Kim 1,2 and Geoffrey Fox 2 1 Department of Computer Science, Florida State University, Tallahassee FL 32306, U.S.A., jungkkim@cs.fsu.edu, 2 Community

More information

Development of Generic Search Method Based on Transformation Invariance

Development of Generic Search Method Based on Transformation Invariance Development of Generic Search Method Based on Transformation Invariance Fuminori Adachi, Takashi Washio, Hiroshi Motoda and *Hidemitsu Hanafusa I.S.I.R., Osaka University, {adachi, washio, motoda}@ar.sanken.osaka-u.ac.jp

More information

Heading-Based Sectional Hierarchy Identification for HTML Documents

Heading-Based Sectional Hierarchy Identification for HTML Documents Heading-Based Sectional Hierarchy Identification for HTML Documents 1 Dept. of Computer Engineering, Boğaziçi University, Bebek, İstanbul, 34342, Turkey F. Canan Pembe 1,2 and Tunga Güngör 1 2 Dept. of

More information

P2P Knowledge Management: an Investigation of the Technical Architecture and Main Processes

P2P Knowledge Management: an Investigation of the Technical Architecture and Main Processes P2P Management: an Investigation of the Technical Architecture and Main Processes Oscar Mangisengi, Wolfgang Essmayr Software Competence Center Hagenberg (SCCH) Hauptstrasse 99, A-4232 Hagenberg, Austria

More information

SyncPlayer Public Demo

SyncPlayer Public Demo SyncPlayer 1.5.2 Public Demo Christian Fremerey and Frank Kurth Multimedia Signal Processing Group Department of Computer Science III University of Bonn, Römerstraße 164, 53117 Bonn, Germany e-mail: {fremerey,frank}@iai.uni-bonn.de

More information

A New Context Based Indexing in Search Engines Using Binary Search Tree

A New Context Based Indexing in Search Engines Using Binary Search Tree A New Context Based Indexing in Search Engines Using Binary Search Tree Aparna Humad Department of Computer science and Engineering Mangalayatan University, Aligarh, (U.P) Vikas Solanki Department of Computer

More information

An Intelligent Retrieval Platform for Distributional Agriculture Science and Technology Data

An Intelligent Retrieval Platform for Distributional Agriculture Science and Technology Data An Intelligent Retrieval Platform for Distributional Agriculture Science and Technology Data Xiaorong Yang 1,2, Wensheng Wang 1,2, Qingtian Zeng 3, and Nengfu Xie 1,2 1 Agriculture Information Institute,

More information

CS3242 assignment 2 report Content-based music retrieval. Luong Minh Thang & Nguyen Quang Minh Tuan

CS3242 assignment 2 report Content-based music retrieval. Luong Minh Thang & Nguyen Quang Minh Tuan CS3242 assignment 2 report Content-based music retrieval Luong Minh Thang & Nguyen Quang Minh Tuan 1. INTRODUCTION With the development of the Internet, searching for information has proved to be a vital

More information

Efficient Indexing and Searching Framework for Unstructured Data

Efficient Indexing and Searching Framework for Unstructured Data Efficient Indexing and Searching Framework for Unstructured Data Kyar Nyo Aye, Ni Lar Thein University of Computer Studies, Yangon kyarnyoaye@gmail.com, nilarthein@gmail.com ABSTRACT The proliferation

More information

ADAPTIVE AND DYNAMIC LOAD BALANCING METHODOLOGIES FOR DISTRIBUTED ENVIRONMENT

ADAPTIVE AND DYNAMIC LOAD BALANCING METHODOLOGIES FOR DISTRIBUTED ENVIRONMENT ADAPTIVE AND DYNAMIC LOAD BALANCING METHODOLOGIES FOR DISTRIBUTED ENVIRONMENT PhD Summary DOCTORATE OF PHILOSOPHY IN COMPUTER SCIENCE & ENGINEERING By Sandip Kumar Goyal (09-PhD-052) Under the Supervision

More information

Designing a System Engineering Environment in a structured way

Designing a System Engineering Environment in a structured way Designing a System Engineering Environment in a structured way Anna Todino Ivo Viglietti Bruno Tranchero Leonardo-Finmeccanica Aircraft Division Torino, Italy Copyright held by the authors. Rubén de Juan

More information

Efficient, Scalable, and Provenance-Aware Management of Linked Data

Efficient, Scalable, and Provenance-Aware Management of Linked Data Efficient, Scalable, and Provenance-Aware Management of Linked Data Marcin Wylot 1 Motivation and objectives of the research The proliferation of heterogeneous Linked Data on the Web requires data management

More information

Processing Rank-Aware Queries in P2P Systems

Processing Rank-Aware Queries in P2P Systems Processing Rank-Aware Queries in P2P Systems Katja Hose, Marcel Karnstedt, Anke Koch, Kai-Uwe Sattler, and Daniel Zinn Department of Computer Science and Automation, TU Ilmenau P.O. Box 100565, D-98684

More information

Web Data mining-a Research area in Web usage mining

Web Data mining-a Research area in Web usage mining IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 13, Issue 1 (Jul. - Aug. 2013), PP 22-26 Web Data mining-a Research area in Web usage mining 1 V.S.Thiyagarajan,

More information

Parallel Architecture & Programing Models for Face Recognition

Parallel Architecture & Programing Models for Face Recognition Parallel Architecture & Programing Models for Face Recognition Submitted by Sagar Kukreja Computer Engineering Department Rochester Institute of Technology Agenda Introduction to face recognition Feature

More information

A PROPOSAL OF USER AUTHENTICATION AND A CONTENT DISTRIBUTION MECHANISM USING P2P CONNECTION OVER A MOBILE AD HOC NETWORK

A PROPOSAL OF USER AUTHENTICATION AND A CONTENT DISTRIBUTION MECHANISM USING P2P CONNECTION OVER A MOBILE AD HOC NETWORK A PROPOSAL OF USER AUTHENTICATION AND A CONTENT DISTRIBUTION MECHANISM USING P2P CONNECTION OVER A MOBILE AD HOC NETWORK Masato Oguchi, Yoshiko Nakatsuka y, and Chiho Tomizawa z Department of Information

More information

Peer Clustering and Firework Query Model

Peer Clustering and Firework Query Model Peer Clustering and Firework Query Model Cheuk Hang Ng, Ka Cheung Sia Department of Computer Science and Engineering The Chinese University of Hong Kong Shatin, N.T., Hong Kong SAR {chng,kcsia}@cse.cuhk.edu.hk

More information

Improved Classification of Known and Unknown Network Traffic Flows using Semi-Supervised Machine Learning

Improved Classification of Known and Unknown Network Traffic Flows using Semi-Supervised Machine Learning Improved Classification of Known and Unknown Network Traffic Flows using Semi-Supervised Machine Learning Timothy Glennan, Christopher Leckie, Sarah M. Erfani Department of Computing and Information Systems,

More information

On Veracious Search In Unsystematic Networks

On Veracious Search In Unsystematic Networks On Veracious Search In Unsystematic Networks K.Thushara #1, P.Venkata Narayana#2 #1 Student Of M.Tech(S.E) And Department Of Computer Science And Engineering, # 2 Department Of Computer Science And Engineering,

More information

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016 + Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis kostas.stefanidis@uta.fi http://www.uta.fi/sis/tie/dbir/index.html http://people.uta.fi/~kostas.stefanidis/dbir16/dbir16-main.html

More information

Semi supervised clustering for Text Clustering

Semi supervised clustering for Text Clustering Semi supervised clustering for Text Clustering N.Saranya 1 Assistant Professor, Department of Computer Science and Engineering, Sri Eshwar College of Engineering, Coimbatore 1 ABSTRACT: Based on clustering

More information

Effects of Internet Path Selection on Video-QoE

Effects of Internet Path Selection on Video-QoE Effects of Internet Path Selection on Video-QoE by Mukundan Venkataraman & Mainak Chatterjee Dept. of EECS University of Central Florida, Orlando, FL 32826 mukundan@eecs.ucf.edu mainak@eecs.ucf.edu Streaming

More information

DESIGN AND ARCHITECTURE OF A DIGITAL MUSIC LIBRARY ON THE WEB

DESIGN AND ARCHITECTURE OF A DIGITAL MUSIC LIBRARY ON THE WEB DESIGN AND ARCHITECTURE OF A DIGITAL MUSIC LIBRARY ON THE WEB ABSTRACT In this paper, a Web-based digital music library based on a threetier architecture is presented. The digital library s primary goal

More information

Clean Living: Eliminating Near-Duplicates in Lifetime Personal Storage

Clean Living: Eliminating Near-Duplicates in Lifetime Personal Storage Clean Living: Eliminating Near-Duplicates in Lifetime Personal Storage Zhe Wang Princeton University Jim Gemmell Microsoft Research September 2005 Technical Report MSR-TR-2006-30 Microsoft Research Microsoft

More information

Similarity by Metadata

Similarity by Metadata Similarity by Metadata Simon Putzke Mirco Semper Freie Universität Berlin Institut für Informatik Seminar Music Information Retrieval Outline Motivation Metadata Tagging Similarities Webmining Example

More information

Multimodal Information Spaces for Content-based Image Retrieval

Multimodal Information Spaces for Content-based Image Retrieval Research Proposal Multimodal Information Spaces for Content-based Image Retrieval Abstract Currently, image retrieval by content is a research problem of great interest in academia and the industry, due

More information

INTRODUCTION. Chapter GENERAL

INTRODUCTION. Chapter GENERAL Chapter 1 INTRODUCTION 1.1 GENERAL The World Wide Web (WWW) [1] is a system of interlinked hypertext documents accessed via the Internet. It is an interactive world of shared information through which

More information

Music Similarity Clustering

Music Similarity Clustering Cognitive Science B.Sc. Program University of Osnabrück, Germany Music Similarity Clustering A Content-Based Approach in Clustering Music Files According to User Preference Thesis-Project for the Degree

More information

Sumantra Dutta Roy, Preeti Rao and Rishabh Bhargava

Sumantra Dutta Roy, Preeti Rao and Rishabh Bhargava 1 OPTIMAL PARAMETER ESTIMATION AND PERFORMANCE MODELLING IN MELODIC CONTOUR-BASED QBH SYSTEMS Sumantra Dutta Roy, Preeti Rao and Rishabh Bhargava Department of Electrical Engineering, IIT Bombay, Powai,

More information

Evaluation of Information Dissemination Characteristics in a PTS VANET

Evaluation of Information Dissemination Characteristics in a PTS VANET Evaluation of Information Dissemination Characteristics in a PTS VANET Holger Kuprian 1, Marek Meyer 2, Miguel Rios 3 1) Technische Universität Darmstadt, Multimedia Communications Lab Holger.Kuprian@KOM.tu-darmstadt.de

More information

Information Retrieval

Information Retrieval Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,

More information

Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web

Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Robert Meusel and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {robert,heiko}@informatik.uni-mannheim.de

More information

Improving Suffix Tree Clustering Algorithm for Web Documents

Improving Suffix Tree Clustering Algorithm for Web Documents International Conference on Logistics Engineering, Management and Computer Science (LEMCS 2015) Improving Suffix Tree Clustering Algorithm for Web Documents Yan Zhuang Computer Center East China Normal

More information

File Sharing in Less structured P2P Systems

File Sharing in Less structured P2P Systems File Sharing in Less structured P2P Systems. Bhosale S.P. 1, Sarkar A.R. 2 Computer Science And Engg. Dept., SVERI s College of Engineering Pandharpur Solapur, India1 Asst.Prof, Computer Science And Engg.

More information

SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE

SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE SEMANTIC WEB POWERED PORTAL INFRASTRUCTURE YING DING 1 Digital Enterprise Research Institute Leopold-Franzens Universität Innsbruck Austria DIETER FENSEL Digital Enterprise Research Institute National

More information

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines The Scientific World Journal Volume 2013, Article ID 596724, 6 pages http://dx.doi.org/10.1155/2013/596724 Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines Weizhe

More information

SHORTEST PATH ALGORITHM FOR QUERY PROCESSING IN PEER TO PEER NETWORKS

SHORTEST PATH ALGORITHM FOR QUERY PROCESSING IN PEER TO PEER NETWORKS SHORTEST PATH ALGORITHM FOR QUERY PROCESSING IN PEER TO PEER NETWORKS Abstract U.V.ARIVAZHAGU * Research Scholar, Sathyabama University, Chennai, Tamilnadu, India arivu12680@gmail.com Dr.S.SRINIVASAN Director

More information

Project Participants

Project Participants Annual Report for Period:10/2004-10/2005 Submitted on: 06/21/2005 Principal Investigator: Yang, Li. Award ID: 0414857 Organization: Western Michigan Univ Title: Projection and Interactive Exploration of

More information

An Efficient Methodology for Image Rich Information Retrieval

An Efficient Methodology for Image Rich Information Retrieval An Efficient Methodology for Image Rich Information Retrieval 56 Ashwini Jaid, 2 Komal Savant, 3 Sonali Varma, 4 Pushpa Jat, 5 Prof. Sushama Shinde,2,3,4 Computer Department, Siddhant College of Engineering,

More information

Adaptive Spatiotemporal Node Selection in Dynamic Networks

Adaptive Spatiotemporal Node Selection in Dynamic Networks Adaptive Spatiotemporal Node Selection in Dynamic Networks Pradip Hari, John B. P. McCabe, Jonathan Banafato, Marcus Henry, Ulrich Kremer, Dept. of Computer Science, Rutgers University Kevin Ko, Emmanouil

More information

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University CS473: CS-473 Course Review Luo Si Department of Computer Science Purdue University Basic Concepts of IR: Outline Basic Concepts of Information Retrieval: Task definition of Ad-hoc IR Terminologies and

More information

An Cross Layer Collaborating Cache Scheme to Improve Performance of HTTP Clients in MANETs

An Cross Layer Collaborating Cache Scheme to Improve Performance of HTTP Clients in MANETs An Cross Layer Collaborating Cache Scheme to Improve Performance of HTTP Clients in MANETs Jin Liu 1, Hongmin Ren 1, Jun Wang 2, Jin Wang 2 1 College of Information Engineering, Shanghai Maritime University,

More information

1 Multipath Node-Disjoint Routing with Backup List Based on the AODV Protocol

1 Multipath Node-Disjoint Routing with Backup List Based on the AODV Protocol 1 Multipath Node-Disjoint Routing with Backup List Based on the AODV Protocol Vahid Zangeneh i and Shahriar Mohammadi ii * ABSTRACT In recent years, routing has been the most focused area in ad hoc networks

More information

Making Gnutella-like P2P Systems Scalable

Making Gnutella-like P2P Systems Scalable Making Gnutella-like P2P Systems Scalable Y. Chawathe, S. Ratnasamy, L. Breslau, N. Lanham, S. Shenker Presented by: Herman Li Mar 2, 2005 Outline What are peer-to-peer (P2P) systems? Early P2P systems

More information

From Improved Auto-taggers to Improved Music Similarity Measures

From Improved Auto-taggers to Improved Music Similarity Measures From Improved Auto-taggers to Improved Music Similarity Measures Klaus Seyerlehner 1, Markus Schedl 1, Reinhard Sonnleitner 1, David Hauger 1, and Bogdan Ionescu 2 1 Johannes Kepler University Department

More information

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment Shigeo Sugimoto Research Center for Knowledge Communities Graduate School of Library, Information

More information

Baidu-I 2 R Research Centre Launches Singapore-developed Music Search Technology

Baidu-I 2 R Research Centre Launches Singapore-developed Music Search Technology Media Release For Immediate Release Total: 5 pages Baidu-I 2 R Research Centre Launches Singapore-developed Music Search Technology Singapore-developed music search technology provides new experiences

More information

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND 41 CHAPTER 5 TEXT W. Bruce Croft BACKGROUND Much of the information in digital library or digital information organization applications is in the form of text. Even when the application focuses on multimedia

More information

CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL

CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL STUDIA UNIV. BABEŞ BOLYAI, INFORMATICA, Volume LVII, Number 4, 2012 CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL IOAN BADARINZA AND ADRIAN STERCA Abstract. In this paper

More information

Research and Application of Unstructured Data Acquisition and Retrieval Technology

Research and Application of Unstructured Data Acquisition and Retrieval Technology 2018 2nd International Conference on Systems, Computing, and Applications (SYSTCA 2018) Research and Application of Unstructured Data Acquisition and Retrieval Technology Zhenjiang Lei1,*, Lin Qiao2, Lina

More information

Web Service Usage Mining: Mining For Executable Sequences

Web Service Usage Mining: Mining For Executable Sequences 7th WSEAS International Conference on APPLIED COMPUTER SCIENCE, Venice, Italy, November 21-23, 2007 266 Web Service Usage Mining: Mining For Executable Sequences MOHSEN JAFARI ASBAGH, HASSAN ABOLHASSANI

More information

Looking for Patterns in Content: From Design to End-Users Consumption.

Looking for Patterns in Content: From Design to End-Users Consumption. Looking for Patterns in Content: From Design to End-Users Consumption. Panel discussion, CONTENT 2011, Rome, Italy. Hans-Werner Sehring, T-Systems Multimedia Solutions GmbH. Patterns for and in Content?

More information

MURDOCH RESEARCH REPOSITORY

MURDOCH RESEARCH REPOSITORY MURDOCH RESEARCH REPOSITORY http://researchrepository.murdoch.edu.au/ This is the author s final version of the work, as accepted for publication following peer review but without the publisher s layout

More information

Routing XQuery in A P2P Network Using Adaptable Trie-Indexes

Routing XQuery in A P2P Network Using Adaptable Trie-Indexes Routing XQuery in A P2P Network Using Adaptable Trie-Indexes Florin Dragan, Georges Gardarin, Laurent Yeh PRISM laboratory Versailles University & Oxymel, France {firstname.lastname@prism.uvsq.fr} Abstract

More information

SPATIO-TEMPORAL SIGNATURES FOR VIDEO COPY DETECTION

SPATIO-TEMPORAL SIGNATURES FOR VIDEO COPY DETECTION SPATIO-TEMPORAL SIGNATURES FOR VIDEO COPY DETECTION Isabelle Simand, 2 Denis Pellerin, 3 Stephane Bres and 3 Jean-Michel Jolion Isabelle.Simand@liris.cnrs.fr 3 LIRIS, bat. J. Verne, INSA, 6962 Villeurbanne

More information

Quantifying and Assessing the Merge of Cloned Web-Based System: An Exploratory Study

Quantifying and Assessing the Merge of Cloned Web-Based System: An Exploratory Study Quantifying and Assessing the Merge of Cloned Web-Based System: An Exploratory Study Jadson Santos Department of Informatics and Applied Mathematics Federal University of Rio Grande do Norte, UFRN Natal,

More information

Characterizing Gnutella Network Properties for Peer-to-Peer Network Simulation

Characterizing Gnutella Network Properties for Peer-to-Peer Network Simulation Characterizing Gnutella Network Properties for Peer-to-Peer Network Simulation Selim Ciraci, Ibrahim Korpeoglu, and Özgür Ulusoy Department of Computer Engineering, Bilkent University, TR-06800 Ankara,

More information

A Privacy Preserving Model for Ownership Indexing in Distributed Storage Systems

A Privacy Preserving Model for Ownership Indexing in Distributed Storage Systems A Privacy Preserving Model for Ownership Indexing in Distributed Storage Systems Tiejian Luo tjluo@ucas.ac.cn Zhu Wang wangzhubj@gmail.com Xiang Wang wangxiang11@mails.ucas.ac.cn ABSTRACT The indexing

More information