The Future of P2P Audio/Visual Search

Size: px
Start display at page:

Download "The Future of P2P Audio/Visual Search"

Transcription

1 The Future of P2P Audio/Visual Search Fausto Rabitti ISTI-CNR, Pisa, Italy P2PIR Workshop - ACM CIKM 2006

2 Outline of the talk 1. Guessing the future of technologies 2. Today outlook 1. Peer-to-Peer Applications 2. Image and Video search on the Web 3. Improving effectiveness by Similarity Search 4. Scalability issue: P2P solution Search P2PIR Workshop - ACM CIKM

3 Future of Audio/Visual Search Everything we write, see, or hear can now be in a digital form (93% of produced data is digital) In the next three years, we will create more data than has been produced in all of human history, most of it in Audio/Visual form. New trend in MM content production: personal producer VS professional producers Dimensions of the search problem: Effectiveness Efficiency (scalability is the key issue) Search P2PIR Workshop - ACM CIKM

4 Future of Audio/Visual Search Economic dimension of the problem (e.g., personal journalism, cultural tourism, etc.) Social impact of solutions (e.g. community networks) Scientific research activities Results on innovation Is P2P a solution? Search P2PIR Workshop - ACM CIKM

5 What is Peer-to-Peer? P2P is a class of applications that takes advantage of resources storage, cycles, content, human presence available at the edges of the Internet (Shirky) A P2P system is a self-organizing system of equal, autonomous entities (peers) which aims for the shared usage of distributed resources in a networked environment avoiding central services (Steinmetz) P2P is about overcoming the barriers to the formation of ad hoc communities, whether of people, of programs, of devices, or of distributed resources (O Reilly) Search P2PIR Workshop - ACM CIKM

6 Peer-to-Peer systems The traditional client-server approaches require a tremendous amount of effort and resource to meet today challenges Scalability, security, flexibility are the main requirements of future Internet-based applications P2P systems are characterized by decentralized resource usage and decentralized selforganization Search P2PIR Workshop - ACM CIKM

7 Peer-to-Peer systems: Searching One of the major problems of P2P systems is: how to find a data item stored at some dynamic set of nodes in the system Three basic strategies can be used: centralized servers (first generation P2P: Napster) flooding (second generation P2P: Gnutella) distributed indexing (DHTs: Kademilia used in emule): structured P2P system Search P2PIR Workshop - ACM CIKM

8 Structured Peer-to-Peer systems Inspired by the significant possibilities of decentralized self-organizing systems, researches focused on approaches for distributed indexing structures Distributed Hash Tables were developed to provide scalability, reliability and fault tolerance Search P2PIR Workshop - ACM CIKM

9 Searching in the World of Peers Peer-to-peer systems are mostly used for file sharing. This task, which made the fortune of P2P, was not achievable by centralized servers Structured P2P networks such as DHTs, have produced a considerable amount of research but their usage is still limited Today centralized servers are still largely used for searching between (often illegal) file sharing communities data Search P2PIR Workshop - ACM CIKM

10 emule: Servers Search P2PIR Workshop - ACM CIKM

11 emule: Kademilia Search P2PIR Workshop - ACM CIKM

12 emule: Search Search P2PIR Workshop - ACM CIKM

13 emule: Searching from a web server Search P2PIR Workshop - ACM CIKM

14 BitTorrent: web servers Search P2PIR Workshop - ACM CIKM

15 Image and Video search on the Web Today Image and Video Search Engines are trivial applications of Web Search Engines Examples: Google, Yahoo, Ask, etc. Search is performed on the MM Object context (i.e. Web page) or on manually associated text Limits of this approach: who is going to manually tag all A/V material produced by personal devices? Search P2PIR Workshop - ACM CIKM

16 Searching for sea : flickr Search P2PIR Workshop - ACM CIKM

17 Improving effectiveness by Similarity Search on MM Content Exploiting automatically extracted metadata representing MM content: MM features (e.g. MPEG-7) Automatic Context information (e.g., GPS generated info) Solutions based on combination of traditional information (Manual text, Web pages) with automatically generated information (i.e. MM features, context) representing MM Content New searching paradigm based on Similarity Search P2PIR Workshop - ACM CIKM

18 Searching for sea : MILOS PhotoBook Query Search P2PIR Workshop - ACM CIKM

19 Detecting Coat of arms by components Search P2PIR Workshop - ACM CIKM

20 Search P2PIR Workshop - ACM CIKM

21 Face Recognition (TV) Query VideoFrame 2 result Search P2PIR Workshop - ACM CIKM

22 The Importance of Similarity Quotation: An ability to assess similarity lies close to the core of cognition. The sense of sameness is the very keel and backbone of our thinking. An understanding of problem solving, categorization, memory retrieval, inductive reasoning, and other cognitive processes require that we understand how humans assess similarity. MIT Encyclopedia of the Cognitive Sciences, Cambridge, MA, MIT Press 2006, pp Search P2PIR Workshop - ACM CIKM

23 Feature-based Approach similar? image database Search P2PIR Workshop - ACM CIKM

24 Feature-based Approach image layer B feature layer R G Search P2PIR Workshop - ACM CIKM

25 Specific Similarity concepts and definitions Similarity in Chemistry In order to assess the similarity between two molecules A and B we need to: first describe the molecules according to some scheme and, choose an appropriate measure to compare the descriptions of the molecules. Similarity in Social Psychology similarity refers to how closely attitudes, values, interests and personality match between people. similarity leads to interpersonal attraction, i.e. the attraction between people which leads to friendship and relationships. similarity forms social networks of individuals with ties mirrored as friends and acquaintances. Search P2PIR Workshop - ACM CIKM

26 Requirements of New Applications Medicine: Magnetic Resonance Images (MRI) Finance: stocks with similar time behavior Digital library: text retrieval multimedia information retrieval Search P2PIR Workshop - ACM CIKM

27 Similarity Searching Effectiveness the way of formulating the similarity measures - a model of human perception Efficiency the way of achieving the required performance over huge volumes of data index structure Search P2PIR Workshop - ACM CIKM

28 Metric Space an Abstraction of Similarity Metric space: M = (D,d) D domain distance function d(x,y) x,y,z D d(x,y) > 0 -non-negativity d(x,y) = 0 x = y - identity d(x,y) = d(y,x) - symmetry d(x,y) d(x,z) + d(z,y) - triangle inequality Search P2PIR Workshop - ACM CIKM

29 Similarity Search Problem For X D in metric space M, pre-process X so that the similarity queries are executed efficiently. similarity queries range search R(q,r) = { x X d(q,x) r } q D, r 0 q r Search P2PIR Workshop - ACM CIKM

30 Similarity Queries k-nearest neighbours k=5 NN(q,k) = A, q D, k > 0 A X, A = k x A, y X A, d(q,x) < d(q,y) q similarity join X = {x 1, x 2, x N }, Y = {y 1, y 2, y M } μ {(x i,y j ) d(x i,y j ) < μ} similarity self join X = Y Search P2PIR Workshop - ACM CIKM

31 Basic Partitioning Principles ball partitioning { x X d(p,x) r } { x X d(p,x) r } p r generalised hyperplane { x X d(p 1,x) d(p 2,x) } { x X d(p 1,x) >d(p 2,x) } p 2 p 1 Search P2PIR Workshop - ACM CIKM

32 The M-tree (an example) inherently dynamic structure disk-oriented (fixed-size nodes) built in a bottom-up fashion inspired by R-trees and B-trees all data in the leaf nodes internal nodes: pointers to subtrees and additional information Search P2PIR Workshop - ACM CIKM

33 M-tree: Example o 10 o 5 o 3 o 2 o 7 o 11 o 1 o 6 o 4 o 9 o 8 o _ o _ _ _ o _ o _ _ _ o _ o _ o _ o 0.0 o _ _ o 0. 0 o _ _ o 0.0 o _ _ o 0.0 o o 11 o 0.0 o _ _ Search P2PIR Workshop - ACM CIKM

34 Scalability: CPU Costs range query: r = 1,000; 2,000 k-nn query: k = 1; 100 labels: radius or k + D (D-index), M (M-tree), SEQ data: from 100,000 to 600,000 objects M-tree and D-index are faster (D-index slightly better) linear trends Search P2PIR Workshop - ACM CIKM

35 Scalability: I/O Costs the same trends as for CPU costs Search P2PIR Workshop - ACM CIKM

36 Similarity Search Scalability Similarity search is expensive. The scalability of centralized indexes is linear. cannot be applied to huge data archives become inefficient after a certain point Possible solutions: Sacrifice some precision: approximate techniques Use more storage & computational power: distributed data structures Search P2PIR Workshop - ACM CIKM

37 Similarity Search in the World of Peers With P2P systems able to perform similarity search: similarity search becomes scalable P2P communities have new search capabilities While preserving all structured P2P benefits, they will give new search capabilities not available from current centralized servers Search P2PIR Workshop - ACM CIKM

38 Implementation Postulates of Distributed Indexes scalability nodes (computers) can be added (removed) no hot-spots no centralized nodes, no flooding by messages update independence network update at one site does not require an immediate change propagation to all the other sites Search P2PIR Workshop - ACM CIKM

39 Distributed Similarity Search Structures Native metric structures: GHT* (Generalized Hyperplane Tree) VPT* (Vantage Point Tree) Transformation approaches (based on DHTs): M-CAN (Metric Content Addressable Network) M-Chord (Metric Chord) Search P2PIR Workshop - ACM CIKM

40 M-CAN: Range Query Execution Range query R(q,r) map the q on F(q) route the query towards F(q) Reach regions with candidate objects L (F(x),F(q)) r Propagate the query over the candidate regions using a multicast algorithm of CAN Check objects using d Search P2PIR Workshop - ACM CIKM

41 Scalability comparison (INFOSCALE 2006) Compared 4 distributed similarity search structures Query size scalability Dataset size scalability Capability of simultaneous query processing single query multiple queries GHT* excellent poor VPT* good satisfactory MCAN satisfactory good M-Chord satisfactory very good Search P2PIR Workshop - ACM CIKM

42 Further Research Challenges: Complex Similarity Search Problems: different types of queries, involving different features and different similarity measures multiple overlays over the same physical network, distributed incremental similarity search, high communication costs of naïve implementations, collaboration with the load balancing mechanism. Search P2PIR Workshop - ACM CIKM

43 Further Research Challenges: Load balancing Problems: one node contains data of different features, load balancing cost models to measure the load and estimate the reorganization costs, postulates of distributed processing must strictly be respected performance tuning Search P2PIR Workshop - ACM CIKM

44 P2P Solutions for A/V Search P2P-based solution to solve the fundamental Scalability Issue, concerning not only: Distributed Similarity Search structures but also: Cooperative A/V features extraction Support of highly dynamic applications (e.g. videoblogs, photoblogs, etc.) Push-based/cooperative crawling Search P2PIR Workshop - ACM CIKM

45 Technological requirements for MM Search Engines Media specific analysis and feature extraction: e.g. Music Information Retrieval Scalable, dynamic and distributed index structures supporting similarity search Complex/multi-feature query processing: combining evidence from different media indexes, using the similarity paradigm (together with the traditional Web search) Support of distributed push-based crawling, where containers are asked to publish and push information to the search engine (together with the traditional pull-based crawling) Scalable dynamic caching techniques to enhance performance Context based support (based on user location, activity, etc.) and Multi device support (search from PC, mobile phone, PDAs). Search P2PIR Workshop - ACM CIKM

46 P2P and push-based crawling Conventional pull-based crawling techniques face the high refresh rates and huge size of the Web with increasing difficulty and have limitations in dealing with multimedia information In a distributed push-based crawling model, content providers (both professional and personal) are asked to publish and push information to the P2P indexing nodes Collaborative crawling model can effectively deal with important multimedia content that is hidden to traditional crawlers because it is not directly hyperlinked from some HTML page or it is stored in on-line AV specialized repositories that cannot be visited by crawling agents.. Search P2PIR Workshop - ACM CIKM

47 P2P and push-based crawling (2) Multimedia content providers can be helped by the P2P infrastructure in the heavy process of multimedia feature extraction A collaborative and participatory P2P environment can give the providers the possibility of maintaining the control on the contributed material (publish what you want when you want): IPR-protected material is indexed and searched for, but its delivery controlled directly by the owner. New collaborative business models? Search P2PIR Workshop - ACM CIKM

48 Dynamic combination of crawling and feature extraction modes Search P2PIR Workshop - ACM CIKM

49 P2P and Dynamic Caching Caching and replication are routinely used in the Web since they allow bandwidth consumption to be reduced, and user-perceived quality of service to be improved. In decentralized P2P systems, caching and replication permit to achieve a better load distribution, shorter latency, and higher availability. These techniques can be applied to contents, query results, and index entries. Literature proposes several solutions in which caching and replication strategies are managed locally, at the peer or super peer levels, or globally, by deploying a distributed cache over several peers. Search P2PIR Workshop - ACM CIKM

50 P2P and Dynamic Caching (2) New dimension of the problem: due to the peculiarities of multimedia content (size, dynamicity, DRM constraints) It is necessary to enhance search responsiveness, and save computational and communication resources (e.g. by exploiting self-similarities among submitted queries which follow a zipfian distribution) The main problem to deal with is related to dynamicity. In fact, it is not clear how long cached information will remain valid. The variability of data and the dynamicity of the network itself make hard to predict freshness of information for cache entries. To design an on-line caching algorithm, it is necessary to investigate the trade-off between caching-time and validitytime, and explore whether and in which cases there is a correlation between the popularity and the time validity of a cached entry. Search P2PIR Workshop - ACM CIKM

51 Search P2PIR Workshop - ACM CIKM

52 Music Information Retrieval (MIR) Search Mainly based on melody, modeling possible mismatch between the query and the documents Retrieve a song given an excerpt, sung or recorded Retrieve songs being similar to some query excerpts/songs Classification Mainly based on timbre, timing and long-term features Identify author, performers, artist, genre, style, orchestration Recommendation Based on collaborative filtering mixed with content-based Suggest a number of items to purchase or to organize in a playlist; organize programs for Web radios Visualization Represent large personal music collections, for music browsing, audio preview Search P2PIR Workshop - ACM CIKM

53 MIR: Basic Techniques MIR tasks require audio processing for: Tempo identification Transcription of the main melody Recognition of harmonic structures Timbre characterization Similarity is computed using String matching - i.e. Dynamic Time Warping Statistical modeling - i.e. Hidden Markov Models Geometric approaches - i.e. Earth Movers Distance Techniques to visualize, classify, recommend k-nearest Neighbor, Gaussian Mixture Models, Self-Organizing Maps, Markov Models Search P2PIR Workshop - ACM CIKM

54 MIR: Evaluation Problems with copyright issues Researchers have difficulties to obtain large music collections Music Information Retrieval Evaluation exchange (MIREX) Common effort for a TREC-like evaluation framework Participants propose tasks and provide test collections Experiments are carried out by the organizers Main focus on preprocessing techniques Effectiveness of feature extraction, no real need of relevance judgments Initial efforts also for typical retrieval tasks Search P2PIR Workshop - ACM CIKM

55 MIR: Digital Rights Management Audio fingerprinting To recognize copyrighted material Can be exploited for retrieval tasks too Audio watermarking To include copyright ownership and to track users sharing behaviors With watermarks retrieval can be based on metadata Song similarity To identify plagiarism Artist identification Search P2PIR Workshop - ACM CIKM

56 MIR: P2P and Portable Devices The increasing number of large personal collections of digital music, allows for Music retrieval with distributed music indexes, stored in different peers Music recommendation using collaborative filtering based on the analysis of personal collection content Computation of music similarity based on users listening behaviors The audio channel is more suitable for the interaction with portable devices Music retrieval, classification and recommendation through aural interaction Techniques for extracting music snapshots and snippets Search P2PIR Workshop - ACM CIKM

57 Conclusions Starting from today situation Peer-to-Peer Applications Image and Video search on the Web In order to improve effectiveness by adopting the Similarity Search paradigm We need a highly scalable and dynamic solution P2P solution is feasible and promising, also with respect to: Cooperative A/V features extraction Push-based/cooperative crawling Search P2PIR Workshop - ACM CIKM

MUFIN Basics. MUFIN team Faculty of Informatics, Masaryk University Brno, Czech Republic SEMWEB 1

MUFIN Basics. MUFIN team Faculty of Informatics, Masaryk University Brno, Czech Republic SEMWEB 1 MUFIN Basics MUFIN team Faculty of Informatics, Masaryk University Brno, Czech Republic mufin@fi.muni.cz SEMWEB 1 Search problem SEARCH index structure data & queries infrastructure SEMWEB 2 The thesis

More information

Peer-to-Peer Systems. Chapter General Characteristics

Peer-to-Peer Systems. Chapter General Characteristics Chapter 2 Peer-to-Peer Systems Abstract In this chapter, a basic overview is given of P2P systems, architectures, and search strategies in P2P systems. More specific concepts that are outlined include

More information

SISTEMI PER LA RICERCA

SISTEMI PER LA RICERCA SISTEMI PER LA RICERCA DI DATI MULTIMEDIALI Claudio Lucchese e Salvatore Orlando Terzo Workshop di Dipartimento Dipartimento di Informatica Università degli studi di Venezia Ringrazio le seguenti persone,

More information

FROM PEER TO PEER...

FROM PEER TO PEER... FROM PEER TO PEER... Dipartimento di Informatica, Università degli Studi di Pisa HPC LAB, ISTI CNR Pisa in collaboration with: Alessandro Lulli, Emanuele Carlini, Massimo Coppola, Patrizio Dazzi 2 nd HPC

More information

Making Gnutella-like P2P Systems Scalable

Making Gnutella-like P2P Systems Scalable Making Gnutella-like P2P Systems Scalable Y. Chawathe, S. Ratnasamy, L. Breslau, N. Lanham, S. Shenker Presented by: Herman Li Mar 2, 2005 Outline What are peer-to-peer (P2P) systems? Early P2P systems

More information

Unit 8 Peer-to-Peer Networking

Unit 8 Peer-to-Peer Networking Unit 8 Peer-to-Peer Networking P2P Systems Use the vast resources of machines at the edge of the Internet to build a network that allows resource sharing without any central authority. Client/Server System

More information

Introduction to Similarity Search in Multimedia Databases

Introduction to Similarity Search in Multimedia Databases Introduction to Similarity Search in Multimedia Databases Tomáš Skopal Charles University in Prague Faculty of Mathematics and Phycics SIRET research group http://siret.ms.mff.cuni.cz March 23 rd 2011,

More information

A Scalable Content- Addressable Network

A Scalable Content- Addressable Network A Scalable Content- Addressable Network In Proceedings of ACM SIGCOMM 2001 S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker Presented by L.G. Alex Sung 9th March 2005 for CS856 1 Outline CAN basics

More information

Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards

Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards Jürgen Herre for Integrated Circuits (FhG-IIS) Erlangen, Germany Jürgen Herre, hrr@iis.fhg.de Page 1 Overview Extracting meaning

More information

Addressed Issue. P2P What are we looking at? What is Peer-to-Peer? What can databases do for P2P? What can databases do for P2P?

Addressed Issue. P2P What are we looking at? What is Peer-to-Peer? What can databases do for P2P? What can databases do for P2P? Peer-to-Peer Data Management - Part 1- Alex Coman acoman@cs.ualberta.ca Addressed Issue [1] Placement and retrieval of data [2] Server architectures for hybrid P2P [3] Improve search in pure P2P systems

More information

CS555: Distributed Systems [Fall 2017] Dept. Of Computer Science, Colorado State University

CS555: Distributed Systems [Fall 2017] Dept. Of Computer Science, Colorado State University CS 555: DISTRIBUTED SYSTEMS [P2P SYSTEMS] Shrideep Pallickara Computer Science Colorado State University Frequently asked questions from the previous class survey Byzantine failures vs malicious nodes

More information

Information Retrieval

Information Retrieval Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,

More information

Using peer to peer. Marco Danelutto Dept. Computer Science University of Pisa

Using peer to peer. Marco Danelutto Dept. Computer Science University of Pisa Using peer to peer Marco Danelutto Dept. Computer Science University of Pisa Master Degree (Laurea Magistrale) in Computer Science and Networking Academic Year 2009-2010 Rationale Two common paradigms

More information

Overlay networks. To do. Overlay networks. P2P evolution DHTs in general, Chord and Kademlia. Turtles all the way down. q q q

Overlay networks. To do. Overlay networks. P2P evolution DHTs in general, Chord and Kademlia. Turtles all the way down. q q q Overlay networks To do q q q Overlay networks P2P evolution DHTs in general, Chord and Kademlia Turtles all the way down Overlay networks virtual networks Different applications with a wide range of needs

More information

Crawling, Indexing, and Similarity Searching Images on the Web

Crawling, Indexing, and Similarity Searching Images on the Web Crawling, Indexing, and Similarity Searching Images on the Web (Extended Abstract) M. Batko 1 and F. Falchi 2 and C. Lucchese 2 and D. Novak 1 and R. Perego 2 and F. Rabitti 2 and J. Sedmidubsky 1 and

More information

March 10, Distributed Hash-based Lookup. for Peer-to-Peer Systems. Sandeep Shelke Shrirang Shirodkar MTech I CSE

March 10, Distributed Hash-based Lookup. for Peer-to-Peer Systems. Sandeep Shelke Shrirang Shirodkar MTech I CSE for for March 10, 2006 Agenda for Peer-to-Peer Sytems Initial approaches to Their Limitations CAN - Applications of CAN Design Details Benefits for Distributed and a decentralized architecture No centralized

More information

Designing Peer-to-Peer Systems for Business-to-Business Environments

Designing Peer-to-Peer Systems for Business-to-Business Environments Designing Peer-to-Peer Systems for Business-to-Business Environments Davide Carboni, Massimo Deriu, Cristian Lai and Antonio Pintus CRS4 (Center for Advanced Studies, Research and Development in Sardinia)

More information

Scalability Comparison of Peer-to-Peer Similarity-Search Structures

Scalability Comparison of Peer-to-Peer Similarity-Search Structures Scalability Comparison of Peer-to-Peer Similarity-Search Structures Michal Batko a David Novak a Fabrizio Falchi b Pavel Zezula a a Masaryk University, Brno, Czech Republic b ISTI-CNR, Pisa, Italy Abstract

More information

doc. RNDr. Tomáš Skopal, Ph.D. Department of Software Engineering, Faculty of Information Technology, Czech Technical University in Prague

doc. RNDr. Tomáš Skopal, Ph.D. Department of Software Engineering, Faculty of Information Technology, Czech Technical University in Prague Praha & EU: Investujeme do vaší budoucnosti Evropský sociální fond course: Searching the Web and Multimedia Databases (BI-VWM) Tomáš Skopal, 2011 SS2010/11 doc. RNDr. Tomáš Skopal, Ph.D. Department of

More information

CSE 5306 Distributed Systems

CSE 5306 Distributed Systems CSE 5306 Distributed Systems Naming Jia Rao http://ranger.uta.edu/~jrao/ 1 Naming Names play a critical role in all computer systems To access resources, uniquely identify entities, or refer to locations

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK PEER-TO-PEER FILE SHARING WITH THE BITTORRENT PROTOCOL APURWA D. PALIWAL 1, PROF.

More information

Search Engines. Information Retrieval in Practice

Search Engines. Information Retrieval in Practice Search Engines Information Retrieval in Practice All slides Addison Wesley, 2008 Web Crawler Finds and downloads web pages automatically provides the collection for searching Web is huge and constantly

More information

A Survey of Peer-to-Peer Content Distribution Technologies

A Survey of Peer-to-Peer Content Distribution Technologies A Survey of Peer-to-Peer Content Distribution Technologies Stephanos Androutsellis-Theotokis and Diomidis Spinellis ACM Computing Surveys, December 2004 Presenter: Seung-hwan Baek Ja-eun Choi Outline Overview

More information

Distributed Multi-modal Similarity Retrieval

Distributed Multi-modal Similarity Retrieval Distributed Multi-modal Similarity Retrieval David Novak Seminar of DISA Lab, October 14, 2014 David Novak Multi-modal Similarity Retrieval DISA Seminar 1 / 17 Outline of the Talk 1 Motivation Similarity

More information

Kademlia: A P2P Informa2on System Based on the XOR Metric

Kademlia: A P2P Informa2on System Based on the XOR Metric Kademlia: A P2P Informa2on System Based on the XOR Metric Today! By Petar Mayamounkov and David Mazières, presented at IPTPS 22 Next! Paper presentation and discussion Image from http://www.vs.inf.ethz.ch/about/zeit.jpg

More information

Nearest Neighbor Search by Branch and Bound

Nearest Neighbor Search by Branch and Bound Nearest Neighbor Search by Branch and Bound Algorithmic Problems Around the Web #2 Yury Lifshits http://yury.name CalTech, Fall 07, CS101.2, http://yury.name/algoweb.html 1 / 30 Outline 1 Short Intro to

More information

Efficient Indexing and Searching Framework for Unstructured Data

Efficient Indexing and Searching Framework for Unstructured Data Efficient Indexing and Searching Framework for Unstructured Data Kyar Nyo Aye, Ni Lar Thein University of Computer Studies, Yangon kyarnyoaye@gmail.com, nilarthein@gmail.com ABSTRACT The proliferation

More information

PEER-TO-PEER NETWORKS, DHTS, AND CHORD

PEER-TO-PEER NETWORKS, DHTS, AND CHORD PEER-TO-PEER NETWORKS, DHTS, AND CHORD George Porter May 25, 2018 ATTRIBUTION These slides are released under an Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) Creative Commons license

More information

Peer to Peer Networks

Peer to Peer Networks Sungkyunkwan University Peer to Peer Networks Prepared by T. Le-Duc and H. Choo Copyright 2000-2017 Networking Laboratory Presentation Outline 2.1 Introduction 2.2 Client-Server Paradigm 2.3 Peer-To-Peer

More information

Distributed Systems. 17. Distributed Lookup. Paul Krzyzanowski. Rutgers University. Fall 2016

Distributed Systems. 17. Distributed Lookup. Paul Krzyzanowski. Rutgers University. Fall 2016 Distributed Systems 17. Distributed Lookup Paul Krzyzanowski Rutgers University Fall 2016 1 Distributed Lookup Look up (key, value) Cooperating set of nodes Ideally: No central coordinator Some nodes can

More information

SIGIR Workshop Report. The SIGIR Heterogeneous and Distributed Information Retrieval Workshop

SIGIR Workshop Report. The SIGIR Heterogeneous and Distributed Information Retrieval Workshop SIGIR Workshop Report The SIGIR Heterogeneous and Distributed Information Retrieval Workshop Ranieri Baraglia HPC-Lab ISTI-CNR, Italy ranieri.baraglia@isti.cnr.it Fabrizio Silvestri HPC-Lab ISTI-CNR, Italy

More information

Scalability In Peer-to-Peer Systems. Presented by Stavros Nikolaou

Scalability In Peer-to-Peer Systems. Presented by Stavros Nikolaou Scalability In Peer-to-Peer Systems Presented by Stavros Nikolaou Background on Peer-to-Peer Systems Definition: Distributed systems/applications featuring: No centralized control, no hierarchical organization

More information

Peer-to-Peer Streaming Systems. Behzad Akbari

Peer-to-Peer Streaming Systems. Behzad Akbari Peer-to-Peer Streaming Systems Behzad Akbari 1 Outline Introduction Scaleable Streaming Approaches Application Layer Multicast Content Distribution Networks Peer-to-Peer Streaming Metrics Current Issues

More information

Today. Architectural Styles

Today. Architectural Styles Today Architectures for distributed systems (Chapter 2) Centralized, decentralized, hybrid Middleware Self-managing systems Lecture 2, page 1 Architectural Styles Important styles of architecture for distributed

More information

Assignment 5. Georgia Koloniari

Assignment 5. Georgia Koloniari Assignment 5 Georgia Koloniari 2. "Peer-to-Peer Computing" 1. What is the definition of a p2p system given by the authors in sec 1? Compare it with at least one of the definitions surveyed in the last

More information

CSE 5306 Distributed Systems. Naming

CSE 5306 Distributed Systems. Naming CSE 5306 Distributed Systems Naming 1 Naming Names play a critical role in all computer systems To access resources, uniquely identify entities, or refer to locations To access an entity, you have resolve

More information

Version 11

Version 11 The Big Challenges Networked and Electronic Media European Technology Platform The birth of a new sector www.nem-initiative.org Version 11 1. NEM IN THE WORLD The main objective of the Networked and Electronic

More information

Lecture 13: P2P Distributed Systems

Lecture 13: P2P Distributed Systems Lecture 13: P2P Distributed Systems Behzad Bordbar School of Computer Science, University of Birmingham, UK Lecture 13 1 Outline Characteristics of P2P How Napster works? Limitation of Napster and P2P

More information

Peer-to-Peer Internet Applications: A Review

Peer-to-Peer Internet Applications: A Review Peer-to-Peer Internet Applications: A Review Davide Quaglia 01/14/10 Introduction Key points Lookup task Outline Centralized (Napster) Query flooding (Gnutella) Distributed Hash Table (Chord) Simulation

More information

doc. RNDr. Tomáš Skopal, Ph.D. Department of Software Engineering, Faculty of Information Technology, Czech Technical University in Prague

doc. RNDr. Tomáš Skopal, Ph.D. Department of Software Engineering, Faculty of Information Technology, Czech Technical University in Prague Praha & EU: Investujeme do vaší budoucnosti Evropský sociální fond course: Searching the Web and Multimedia Databases (BI-VWM) Tomáš Skopal, 2011 SS2010/11 doc. RNDr. Tomáš Skopal, Ph.D. Department of

More information

Overlay Networks. Behnam Momeni Computer Engineering Department Sharif University of Technology

Overlay Networks. Behnam Momeni Computer Engineering Department Sharif University of Technology CE443 Computer Networks Overlay Networks Behnam Momeni Computer Engineering Department Sharif University of Technology Acknowledgments: Lecture slides are from Computer networks course thought by Jennifer

More information

Today CSCI Coda. Naming: Volumes. Coda GFS PAST. Instructor: Abhishek Chandra. Main Goals: Volume is a subtree in the naming space

Today CSCI Coda. Naming: Volumes. Coda GFS PAST. Instructor: Abhishek Chandra. Main Goals: Volume is a subtree in the naming space Today CSCI 5105 Coda GFS PAST Instructor: Abhishek Chandra 2 Coda Main Goals: Availability: Work in the presence of disconnection Scalability: Support large number of users Successor of Andrew File System

More information

Peer-to-Peer Systems. Network Science: Introduction. P2P History: P2P History: 1999 today

Peer-to-Peer Systems. Network Science: Introduction. P2P History: P2P History: 1999 today Network Science: Peer-to-Peer Systems Ozalp Babaoglu Dipartimento di Informatica Scienza e Ingegneria Università di Bologna www.cs.unibo.it/babaoglu/ Introduction Peer-to-peer (PP) systems have become

More information

Distributed Meta-data Servers: Architecture and Design. Sarah Sharafkandi David H.C. Du DISC

Distributed Meta-data Servers: Architecture and Design. Sarah Sharafkandi David H.C. Du DISC Distributed Meta-data Servers: Architecture and Design Sarah Sharafkandi David H.C. Du DISC 5/22/07 1 Outline Meta-Data Server (MDS) functions Why a distributed and global Architecture? Problem description

More information

Overlay networks. Today. l Overlays networks l P2P evolution l Pastry as a routing overlay example

Overlay networks. Today. l Overlays networks l P2P evolution l Pastry as a routing overlay example Overlay networks Today l Overlays networks l P2P evolution l Pastry as a routing overlay eample Network virtualization and overlays " Different applications with a range of demands/needs network virtualization

More information

EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks. Overlay Networks: Motivations

EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks. Overlay Networks: Motivations EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks Ion Stoica Computer Science Division Department of Electrical Engineering and Computer Sciences University of California, Berkeley

More information

Ossification of the Internet

Ossification of the Internet Ossification of the Internet The Internet evolved as an experimental packet-switched network Today, many aspects appear to be set in stone - Witness difficulty in getting IP multicast deployed - Major

More information

Comparing Hybrid Peer-to-Peer Systems. Hybrid peer-to-peer systems. Contributions of this paper. Questions for hybrid systems

Comparing Hybrid Peer-to-Peer Systems. Hybrid peer-to-peer systems. Contributions of this paper. Questions for hybrid systems Comparing Hybrid Peer-to-Peer Systems Beverly Yang and Hector Garcia-Molina Presented by Marco Barreno November 3, 2003 CS 294-4: Peer-to-peer systems Hybrid peer-to-peer systems Pure peer-to-peer systems

More information

DISTRIBUTED COMPUTER SYSTEMS ARCHITECTURES

DISTRIBUTED COMPUTER SYSTEMS ARCHITECTURES DISTRIBUTED COMPUTER SYSTEMS ARCHITECTURES Dr. Jack Lange Computer Science Department University of Pittsburgh Fall 2015 Outline System Architectural Design Issues Centralized Architectures Application

More information

P2P Applications. Reti di Elaboratori Corso di Laurea in Informatica Università degli Studi di Roma La Sapienza Canale A-L Prof.ssa Chiara Petrioli

P2P Applications. Reti di Elaboratori Corso di Laurea in Informatica Università degli Studi di Roma La Sapienza Canale A-L Prof.ssa Chiara Petrioli P2P Applications Reti di Elaboratori Corso di Laurea in Informatica Università degli Studi di Roma La Sapienza Canale A-L Prof.ssa Chiara Petrioli Server-based Network Peer-to-peer networks A type of network

More information

Motivation for peer-to-peer

Motivation for peer-to-peer Peer-to-peer systems INF 5040 autumn 2015 lecturer: Roman Vitenberg INF5040, Frank Eliassen & Roman Vitenberg 1 Motivation for peer-to-peer Ø Inherent restrictions of the standard client/ server model

More information

A Top Catching Scheme Consistency Controlling in Hybrid P2P Network

A Top Catching Scheme Consistency Controlling in Hybrid P2P Network A Top Catching Scheme Consistency Controlling in Hybrid P2P Network V. Asha*1, P Ramesh Babu*2 M.Tech (CSE) Student Department of CSE, Priyadarshini Institute of Technology & Science, Chintalapudi, Guntur(Dist),

More information

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS 1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,

More information

PUB-2-SUB: A Content-Based Publish/Subscribe Framework for Cooperative P2P Networks

PUB-2-SUB: A Content-Based Publish/Subscribe Framework for Cooperative P2P Networks PUB-2-SUB: A Content-Based Publish/Subscribe Framework for Cooperative P2P Networks Duc A. Tran Cuong Pham Network Information Systems Lab (NISLab) Dept. of Computer Science University of Massachusetts,

More information

Stratos Idreos. A thesis submitted in fulfillment of the requirements for the degree of. Electronic and Computer Engineering

Stratos Idreos. A thesis submitted in fulfillment of the requirements for the degree of. Electronic and Computer Engineering P2P-DIET: A QUERY AND NOTIFICATION SERVICE BASED ON MOBILE AGENTS FOR RAPID IMPLEMENTATION OF P2P APPLICATIONS by Stratos Idreos A thesis submitted in fulfillment of the requirements for the degree of

More information

Information Retrieval Spring Web retrieval

Information Retrieval Spring Web retrieval Information Retrieval Spring 2016 Web retrieval The Web Large Changing fast Public - No control over editing or contents Spam and Advertisement How big is the Web? Practically infinite due to the dynamic

More information

Chapter 6 PEER-TO-PEER COMPUTING

Chapter 6 PEER-TO-PEER COMPUTING Chapter 6 PEER-TO-PEER COMPUTING Distributed Computing Group Computer Networks Winter 23 / 24 Overview What is Peer-to-Peer? Dictionary Distributed Hashing Search Join & Leave Other systems Case study:

More information

Introduction on Peer to Peer systems

Introduction on Peer to Peer systems Introduction on Peer to Peer systems Georges Da Costa dacosta@irit.fr dacosta@irit.fr 1/55 Goal of this Lecture What can P2P do, not only as a buzzword What it can't do Shows some examples & algorithms

More information

A Content-Addressable Network for Similarity Search in Metric Spaces

A Content-Addressable Network for Similarity Search in Metric Spaces University of Pisa Masaryk University Faculty of Informatics Department of Information Engineering Doctoral Course in Information Engineering Department of Information Technologies Doctoral Course in Informatics

More information

Desktop Crawls. Document Feeds. Document Feeds. Information Retrieval

Desktop Crawls. Document Feeds. Document Feeds. Information Retrieval Information Retrieval INFO 4300 / CS 4300! Web crawlers Retrieving web pages Crawling the web» Desktop crawlers» Document feeds File conversion Storing the documents Removing noise Desktop Crawls! Used

More information

Information Retrieval Technique for MIR and P2P Network

Information Retrieval Technique for MIR and P2P Network 2014 年 5 月第十七卷二期 Vol. 17, No. 2, May 2014 Information Retrieval Technique for MIR and P2P Network Huei-Chen Hsu Ya-Li Chung Hui-Chun Chan http://cmr.ba.ouhk.edu.hk Web Journal of Chinese Management Review

More information

Peer-to-peer computing research a fad?

Peer-to-peer computing research a fad? Peer-to-peer computing research a fad? Frans Kaashoek kaashoek@lcs.mit.edu NSF Project IRIS http://www.project-iris.net Berkeley, ICSI, MIT, NYU, Rice What is a P2P system? Node Node Node Internet Node

More information

MPEG-7 Audio: Tools for Semantic Audio Description and Processing

MPEG-7 Audio: Tools for Semantic Audio Description and Processing MPEG-7 Audio: Tools for Semantic Audio Description and Processing Jürgen Herre for Integrated Circuits (FhG-IIS) Erlangen, Germany Jürgen Herre, hrr@iis.fhg.de Page 1 Overview Why semantic description

More information

Searching in variably connected P2P networks

Searching in variably connected P2P networks Searching in variably connected P2P networks Njål T. Borch Norut Informasjonsteknologi AS Tromsø, Norway Email: Njaal.Borch@itek.norut.no Lars Kristian Vognild Norut Informasjonsteknologi AS Tromsø, Norway

More information

Architectures for distributed systems (Chapter 2)

Architectures for distributed systems (Chapter 2) Today Architectures for distributed systems (Chapter 2) Architectural styles Client-server architectures Decentralized and peer-to-peer architectures Lecture 2, page!1 Module 1: Architectural Styles Important

More information

Collaborative Multi-Source Scheme for Multimedia Content Distribution

Collaborative Multi-Source Scheme for Multimedia Content Distribution Collaborative Multi-Source Scheme for Multimedia Content Distribution Universidad Autónoma Metropolitana-Cuajimalpa, Departament of Information Technology, Mexico City, Mexico flopez@correo.cua.uam.mx

More information

System models for distributed systems

System models for distributed systems System models for distributed systems INF5040/9040 autumn 2010 lecturer: Frank Eliassen INF5040 H2010, Frank Eliassen 1 System models Purpose illustrate/describe common properties and design choices for

More information

CHAPTER 8 Multimedia Information Retrieval

CHAPTER 8 Multimedia Information Retrieval CHAPTER 8 Multimedia Information Retrieval Introduction Text has been the predominant medium for the communication of information. With the availability of better computing capabilities such as availability

More information

Overlay Networks: Motivations. EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks. Motivations (cont d) Goals.

Overlay Networks: Motivations. EECS 122: Introduction to Computer Networks Overlay Networks and P2P Networks. Motivations (cont d) Goals. Overlay Networks: Motivations CS : Introduction to Computer Networks Overlay Networks and PP Networks Ion Stoica Computer Science Division Department of lectrical ngineering and Computer Sciences University

More information

Outline A Hierarchical P2P Architecture and an Efficient Flooding Algorithm

Outline A Hierarchical P2P Architecture and an Efficient Flooding Algorithm University of British Columbia Cpsc 527 Advanced Computer Communications Lecture 9b Hierarchical P2P Architecture and Efficient Multicasting (Juan Li s MSc Thesis) Instructor: Dr. Son Vuong The World Connected

More information

!!!!!! Portfolio Summary!! for more information July, C o n c e r t T e c h n o l o g y

!!!!!! Portfolio Summary!! for more information  July, C o n c e r t T e c h n o l o g y Portfolio Summary July, 2014 for more information www.concerttechnology.com bizdev@concerttechnology.com C o n c e r t T e c h n o l o g y Overview The screenplay project covers emerging trends in social

More information

Telematics Chapter 9: Peer-to-Peer Networks

Telematics Chapter 9: Peer-to-Peer Networks Telematics Chapter 9: Peer-to-Peer Networks Beispielbild User watching video clip Server with video clips Application Layer Presentation Layer Application Layer Presentation Layer Session Layer Session

More information

Introduction to Peer-to-Peer Systems

Introduction to Peer-to-Peer Systems Introduction Introduction to Peer-to-Peer Systems Peer-to-peer (PP) systems have become extremely popular and contribute to vast amounts of Internet traffic PP basic definition: A PP system is a distributed

More information

Topology Enhancement in Wireless Multihop Networks: A Top-down Approach

Topology Enhancement in Wireless Multihop Networks: A Top-down Approach Topology Enhancement in Wireless Multihop Networks: A Top-down Approach Symeon Papavassiliou (joint work with Eleni Stai and Vasileios Karyotis) National Technical University of Athens (NTUA) School of

More information

Bayeux: An Architecture for Scalable and Fault Tolerant Wide area Data Dissemination

Bayeux: An Architecture for Scalable and Fault Tolerant Wide area Data Dissemination Bayeux: An Architecture for Scalable and Fault Tolerant Wide area Data Dissemination By Shelley Zhuang,Ben Zhao,Anthony Joseph, Randy Katz,John Kubiatowicz Introduction Multimedia Streaming typically involves

More information

Towards an Integrated Approach to Music Retrieval

Towards an Integrated Approach to Music Retrieval Towards an Integrated Approach to Music Retrieval Emanuele Di Buccio 1, Ivano Masiero 1, Yosi Mass 2, Massimo Melucci 1, Riccardo Miotto 1, Nicola Orio 1, and Benjamin Sznajder 2 1 Department of Information

More information

Overview Computer Networking Lecture 16: Delivering Content: Peer to Peer and CDNs Peter Steenkiste

Overview Computer Networking Lecture 16: Delivering Content: Peer to Peer and CDNs Peter Steenkiste Overview 5-44 5-44 Computer Networking 5-64 Lecture 6: Delivering Content: Peer to Peer and CDNs Peter Steenkiste Web Consistent hashing Peer-to-peer Motivation Architectures Discussion CDN Video Fall

More information

Today. Architectural Styles

Today. Architectural Styles Today Architectures for distributed systems (Chapter 2) Centralized, decentralized, hybrid Middleware Self-managing systems Lecture 2, page 1 Architectural Styles Important styles of architecture for distributed

More information

Lecture 8: Internet and Online Services. CS 598: Advanced Internetworking Matthew Caesar March 3, 2011

Lecture 8: Internet and Online Services. CS 598: Advanced Internetworking Matthew Caesar March 3, 2011 Lecture 8: Internet and Online Services CS 598: Advanced Internetworking Matthew Caesar March 3, 2011 Demands of modern networked services Old approach: run applications on local PC Now: major innovation

More information

A Comparative study of On-Demand Data Delivery with Tables Driven and On-Demand Protocols for Mobile Ad-Hoc Network

A Comparative study of On-Demand Data Delivery with Tables Driven and On-Demand Protocols for Mobile Ad-Hoc Network A Comparative study of On-Demand Data Delivery with Tables Driven and On-Demand Protocols for Mobile Ad-Hoc Network Humayun Bakht Research Fellow, London School of Commerce, United Kingdom humayunbakht@yahoo.co.uk

More information

Overlay networks. T o do. Overlay networks. P2P evolution DHTs in general, Chord and Kademlia. q q q. Turtles all the way down

Overlay networks. T o do. Overlay networks. P2P evolution DHTs in general, Chord and Kademlia. q q q. Turtles all the way down Overlay networks T o do q q q Overlay networks P2P evolution DHTs in general, Chord and Kademlia Turtles all the way down Overlay networks virtual networks Different applications with a wide range of needs

More information

Module SDS: Scalable Distributed Systems. Gabriel Antoniu, KERDATA & Davide Frey, ASAP INRIA

Module SDS: Scalable Distributed Systems. Gabriel Antoniu, KERDATA & Davide Frey, ASAP INRIA Module SDS: Scalable Distributed Systems Gabriel Antoniu, KERDATA & Davide Frey, ASAP INRIA Staff Gabriel Antoniu, DR INRIA, KERDATA Team gabriel.antoniu@inria.fr Davide Frey, CR INRIA, ASAP Team davide.frey@inria.fr

More information

Distributed Hash Table

Distributed Hash Table Distributed Hash Table P2P Routing and Searching Algorithms Ruixuan Li College of Computer Science, HUST rxli@public.wh.hb.cn http://idc.hust.edu.cn/~rxli/ In Courtesy of Xiaodong Zhang, Ohio State Univ

More information

INTRODUCTION. Chapter GENERAL

INTRODUCTION. Chapter GENERAL Chapter 1 INTRODUCTION 1.1 GENERAL The World Wide Web (WWW) [1] is a system of interlinked hypertext documents accessed via the Internet. It is an interactive world of shared information through which

More information

DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN. Chapter 2 ARCHITECTURES

DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN. Chapter 2 ARCHITECTURES DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN Chapter 2 ARCHITECTURES Architectural Styles Important styles of architecture for distributed systems Layered

More information

EE 122: Peer-to-Peer (P2P) Networks. Ion Stoica November 27, 2002

EE 122: Peer-to-Peer (P2P) Networks. Ion Stoica November 27, 2002 EE 122: Peer-to-Peer (P2P) Networks Ion Stoica November 27, 22 How Did it Start? A killer application: Naptser - Free music over the Internet Key idea: share the storage and bandwidth of individual (home)

More information

Peer-to-Peer Systems and Distributed Hash Tables

Peer-to-Peer Systems and Distributed Hash Tables Peer-to-Peer Systems and Distributed Hash Tables CS 240: Computing Systems and Concurrency Lecture 8 Marco Canini Credits: Michael Freedman and Kyle Jamieson developed much of the original material. Selected

More information

Distributed Web Crawling over DHTs. Boon Thau Loo, Owen Cooper, Sailesh Krishnamurthy CS294-4

Distributed Web Crawling over DHTs. Boon Thau Loo, Owen Cooper, Sailesh Krishnamurthy CS294-4 Distributed Web Crawling over DHTs Boon Thau Loo, Owen Cooper, Sailesh Krishnamurthy CS294-4 Search Today Search Index Crawl What s Wrong? Users have a limited search interface Today s web is dynamic and

More information

Venugopal Ramasubramanian Emin Gün Sirer SIGCOMM 04

Venugopal Ramasubramanian Emin Gün Sirer SIGCOMM 04 The Design and Implementation of a Next Generation Name Service for the Internet Venugopal Ramasubramanian Emin Gün Sirer SIGCOMM 04 Presenter: Saurabh Kadekodi Agenda DNS overview Current DNS Problems

More information

Distributed Information Processing

Distributed Information Processing Distributed Information Processing 14 th Lecture Eom, Hyeonsang ( 엄현상 ) Department of Computer Science & Engineering Seoul National University Copyrights 2016 Eom, Hyeonsang All Rights Reserved Outline

More information

70 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 6, NO. 1, FEBRUARY ClassView: Hierarchical Video Shot Classification, Indexing, and Accessing

70 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 6, NO. 1, FEBRUARY ClassView: Hierarchical Video Shot Classification, Indexing, and Accessing 70 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 6, NO. 1, FEBRUARY 2004 ClassView: Hierarchical Video Shot Classification, Indexing, and Accessing Jianping Fan, Ahmed K. Elmagarmid, Senior Member, IEEE, Xingquan

More information

K Nearest Neighbor Wrap Up K- Means Clustering. Slides adapted from Prof. Carpuat

K Nearest Neighbor Wrap Up K- Means Clustering. Slides adapted from Prof. Carpuat K Nearest Neighbor Wrap Up K- Means Clustering Slides adapted from Prof. Carpuat K Nearest Neighbor classification Classification is based on Test instance with Training Data K: number of neighbors that

More information

Jim Mains Director of Business Strategy and Media Services Media Solutions Group, EMC Corporation

Jim Mains Director of Business Strategy and Media Services Media Solutions Group, EMC Corporation Media Asset Management Databases The Heart of the System and Critical Decisions and Steps for Success Jim Mains Director of Business Strategy and Media Services Media Solutions Group, EMC Corporation Agenda

More information

Query Processing Over Peer-To-Peer Data Sharing Systems

Query Processing Over Peer-To-Peer Data Sharing Systems Query Processing Over Peer-To-Peer Data Sharing Systems O. D. Şahin A. Gupta D. Agrawal A. El Abbadi Department of Computer Science University of California at Santa Barbara odsahin, abhishek, agrawal,

More information

KDD 10 Tutorial: Recommender Problems for Web Applications. Deepak Agarwal and Bee-Chung Chen Yahoo! Research

KDD 10 Tutorial: Recommender Problems for Web Applications. Deepak Agarwal and Bee-Chung Chen Yahoo! Research KDD 10 Tutorial: Recommender Problems for Web Applications Deepak Agarwal and Bee-Chung Chen Yahoo! Research Agenda Focus: Recommender problems for dynamic, time-sensitive applications Content Optimization

More information

Classifier Inspired Scaling for Training Set Selection

Classifier Inspired Scaling for Training Set Selection Classifier Inspired Scaling for Training Set Selection Walter Bennette DISTRIBUTION A: Approved for public release: distribution unlimited: 16 May 2016. Case #88ABW-2016-2511 Outline Instance-based classification

More information

DISTRIBUTED SYSTEMS CSCI 4963/ /4/2015

DISTRIBUTED SYSTEMS CSCI 4963/ /4/2015 1 DISTRIBUTED SYSTEMS CSCI 4963/6963 12/4/2015 2 Info Quiz 7 on Tuesday. Project 2 submission URL is posted on the web site Submit your source code and project report (PDF!!!) in a single zip file. If

More information

Introduction to Peer-to-Peer Networks

Introduction to Peer-to-Peer Networks Introduction to Peer-to-Peer Networks The Story of Peer-to-Peer The Nature of Peer-to-Peer: Generals & Paradigms Unstructured Peer-to-Peer Systems Sample Applications 1 Prof. Dr. Thomas Schmidt http:/www.informatik.haw-hamburg.de/~schmidt

More information

Naming in Distributed Systems

Naming in Distributed Systems Naming in Distributed Systems Dr. Yong Guan Department of Electrical and Computer Engineering & Information Assurance Center Iowa State University Outline for Today s Talk Overview: Names, Identifiers,

More information

Branch and Bound. Algorithms for Nearest Neighbor Search: Lecture 1. Yury Lifshits

Branch and Bound. Algorithms for Nearest Neighbor Search: Lecture 1. Yury Lifshits Branch and Bound Algorithms for Nearest Neighbor Search: Lecture 1 Yury Lifshits http://yury.name Steklov Institute of Mathematics at St.Petersburg California Institute of Technology 1 / 36 Outline 1 Welcome

More information