Distributed similarity search algorithm in distributed heterogeneous multimedia databases


Information Processing Letters 75 (2000)

Ju-Hong Lee (a,1), Deok-Hwan Kim (a,2), Seok-Lyong Lee (a,3), Chin-Wan Chung (b, corresponding author), Guang-Ho Cha (c,4)

a Department of Information and Communication Engineering, Korea Advanced Institute of Science and Technology, 373-1, Kusong-Dong, Yusong-Gu, Taejon, South Korea
b Department of Computer Science, Korea Advanced Institute of Science and Technology, 373-1, Kusong-Dong, Yusong-Gu, Taejon, South Korea
c IBM Almaden Research Center, San Jose, CA, USA

Corresponding author: chungcw@islab.kaist.ac.kr
1 jhlee@islab.kaist.ac.kr
2 dhkim@islab.kaist.ac.kr
3 sllee@islab.kaist.ac.kr
4 ghcha@almaden.ibm.com

Received 5 November 1999; received in revised form 30 March 2000
Communicated by K. Iwama

Abstract

The collection fusion problem in multimedia databases is concerned with merging the results retrieved by content based retrieval from distributed heterogeneous multimedia databases in order to find the objects most similar to a query object. We propose three distributed similarity search algorithms to solve this problem: two heuristic algorithms and an algorithm using the linear regression. To our knowledge, these algorithms are the first research results in the area of distributed content based retrieval for heterogeneous multimedia databases. © 2000 Elsevier Science B.V. All rights reserved.

Keywords: Distributed similarity search algorithms; Collection fusion; Multimedia databases; Information retrieval

1. Introduction

With the growth of the Internet and the Web, access to distributed multimedia databases has emerged as an important research issue. To retrieve information from numerous data sources, a global server is needed to integrate various resources and process queries in a distributed manner [2]. It distributes user queries to local databases, integrates the results to fit user requirements, and provides the illusion of a single database.
A key problem is how to extract relevant objects for a query from distributed heterogeneous databases that use different similarity measures. This issue is called the collection fusion problem. It has been studied extensively for text databases [1,3,5,8], but not for multimedia databases. The problem arises from the difference between similarity measures in a heterogeneous environment. The detailed scenario is as follows. At the global server, a user wants to retrieve objects similar to a query object from local databases using a global similarity measure. However, a local database supports only its own local similarity measure, not the global one. When the global similarity measure is completely different from the local similarity measure, for instance when the global similarity measure uses color and the local similarity measure uses texture, the user cannot get an appropriate result for a query. Therefore, the global similarity measure must be correlated with the local similarity measure.

In this paper, we show that there exist cases in which a linear relationship holds between two similarity measures, and we propose novel distributed similarity search algorithms that solve the collection fusion problem for such cases in distributed heterogeneous multimedia databases.

This paper is organized as follows. Section 2 defines the collection fusion problem and provides assumptions for the problem. In Section 3, we propose distributed similarity search algorithms to solve the collection fusion problem. The experimental results are shown in Section 4. Concluding remarks are made in Section 5.

2. Collection fusion for distributed multimedia databases

We discuss several assumptions concerning the global server and local databases. The algorithms proposed in this paper are based on these assumptions.

Assumption 1. The global server selects local databases supporting similarity measures that are correlated with the global similarity measure, and then submits the query to them.

Assumption 2. Local databases support incremental similarity ranking, such as the method using a get-more-objects facility described in [7].

Assumption 3. For a given query, local databases return the objects locally most similar to the query object, together with their local similarity values, as the query result.

The formal definition and objectives of the collection fusion problem are as follows.

Definition. The collection fusion problem in multimedia databases is how to retrieve and merge the results from distributed heterogeneous multimedia databases to find relevant objects, that is, the k objects most similar to a query object under a global similarity measure.

Objectives.
For the distributed similarity search of a given query Q, let $R_Q^i$ be the set of relevant objects in the ith local database and $I_Q^i$ be the set of irrelevant objects in the ith local database. Then $R_Q^i \cap I_Q^i = \emptyset$ and $R_Q^i \cup I_Q^i$ = {all objects in the ith local database}. Let $V_Q^i$ be the set of objects retrieved from the ith local database. The total number of objects retrieved from the local databases is fixed by the constraint

\[ \sum_{i=1}^{n} |V_Q^i| = ck, \]

where c is a constant larger than 1, k is the number of relevant objects that a user wants to retrieve, n is the number of local databases, and $|S|$ is the number of elements of set S. The objectives of the collection fusion problem with this constraint are as follows:

(1) The ratio of retrieved objects among relevant objects should be maximized. That is, maximize

\[ \frac{\sum_{i=1}^{n} |R_Q^i \cap V_Q^i|}{\sum_{i=1}^{n} |R_Q^i|} \quad \text{subject to} \quad \sum_{i=1}^{n} |V_Q^i| = ck. \]

(2) The ratio of irrelevant objects among retrieved objects should be minimized. That is, minimize

\[ \frac{\sum_{i=1}^{n} |I_Q^i \cap V_Q^i|}{\sum_{i=1}^{n} |V_Q^i|} \quad \text{subject to} \quad \sum_{i=1}^{n} |V_Q^i| = ck. \]

Objective (1) maximizes the recall and objective (2) maximizes the precision, because the precision is $1 - \sum_{i=1}^{n} |I_Q^i \cap V_Q^i| / \sum_{i=1}^{n} |V_Q^i|$.

Since we assume that servers are independent and autonomous, their similarity measures may differ from each other. Therefore the similarity value between an object from a local database and a query object under a local similarity measure may differ from the value under the global similarity measure at the global server. There are many similarity measures for content based image retrieval, and some of them are correlated. To show such cases, we present examples using the RGB color and the RGB texture.

Example 1. Let the global server and the local database support image similarity search using color. The global server extracts average RGB color features for the 6 × 6 subimages of an image and measures its similarity value against a query image using the inter-feature normalization described in MARS [6]. The local database extracts average RGB color features for the 4 × 4 subimages of an image and measures its similarity value as the global server does. From 3016 images, 3000 arbitrary pairs of images are selected. For each pair, the local similarity value x and the global similarity value y are measured. The scatter diagram of the set of (x, y) values for the 3000 selected pairs is shown in Fig. 1. In this case, the diagram shows the shape of a straight line.

Fig. 1. Scatter diagram for the average RGB color 4 × 4 and the average RGB color 5 × 5 in the case that arbitrary pairs among images are chosen.

Example 2. In Fig. 2, the similarity values on the y coordinate are obtained using the average RGB color of 5 × 5 subimages, while those on the x coordinate are obtained using the RGB texture of 6 × 6 subimages. Contrary to the previous case, the scatter diagram does not show any relationship between the two similarity measures, which use different attributes.

Fig. 2. Scatter diagram for the average RGB color 5 × 5 and the RGB texture 6 × 6 in the case that arbitrary pairs among images are chosen.

Although similarity measures differ between the global server and local databases, we observed that the scatter diagram of similarity values for some pairs of similarity measures showed the shape of a straight line. Since the relationship cannot be proved, we instead conducted extensive experiments that showed the linear relationship.
Table 1 shows the three groups of features used for similarity measures: RGB colors, RGB textures, and RGB colors & textures. We used the inter-feature normalization described in MARS [6] to calculate similarity values. The statistical linear regression method is used to obtain the equation of a straight line, and a test of statistical hypothesis is used to verify the linear relationship between two similarity measures. As test indicators, we used the scatter diagram, the sample coefficient of determination ($r^2$), and the analysis of variance ($F_0$, $F(\alpha)$), where $r^2$ is given by (sum of squares due to linear regression)/(total sum of squares), $F_0$ is given by (mean square due to linear regression)/(mean square of residual), and $F(\alpha)$ is obtained from the F-distribution for a level of significance $\alpha$. If the linear regression model is effective for two similarity measures, the scatter diagram should show the shape of a straight line, $r^2$ ($0 < r^2 < 1$) should be near 1, and $F_0$ should be larger than $F(\alpha)$ [9,10].

Table 2 shows the results of the experiments for pairs of similarity measures. For similarity measures from the same group, the scatter diagram shows the shape of a straight line, the $r^2$ value is near 1, and $F_0$ is much larger than $F(\alpha)$. For similarity measures from different groups, however, the scatter diagram does not show the shape of a straight line and the $r^2$ value is near 0. Although $F_0$ is still larger than $F(\alpha)$ in this case, it is much smaller than the $F_0$ obtained when the linear relationship is satisfied. In this case, we can say that the two similarity measures do not satisfy the linear relationship.
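The test indicators described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the function name `linear_fit_stats` is hypothetical, the fit is ordinary least squares on paired (local, global) similarity values, and the critical value $F(\alpha)$ is left to be looked up in an F-distribution table with 1 and n - 2 degrees of freedom.

```python
def linear_fit_stats(xs, ys):
    """Fit y = alpha + beta * x by least squares; return (alpha, beta, r2, f0).

    xs: local similarity values, ys: global similarity values (hypothetical inputs).
    """
    n = len(xs)
    if n != len(ys) or n < 3:
        raise ValueError("need at least three (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)  # total sum of squares
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0 or syy == 0:
        raise ValueError("x and y must each take more than one distinct value")
    beta = sxy / sxx
    alpha = mean_y - beta * mean_x
    ss_regression = beta * sxy            # sum of squares due to regression
    ss_residual = syy - ss_regression     # residual sum of squares
    r2 = ss_regression / syy
    # F0 = (mean square due to regression, 1 d.f.) / (mean square of residual, n - 2 d.f.)
    if ss_residual <= 0:
        f0 = float("inf")                 # perfect fit: residual mean square is zero
    else:
        f0 = ss_regression / (ss_residual / (n - 2))
    return alpha, beta, r2, f0


if __name__ == "__main__":
    alpha, beta, r2, f0 = linear_fit_stats([0, 1, 2, 3], [0, 1, 1, 2])
    print(alpha, beta, r2, f0)
```

A pair of similarity measures would be treated as linearly related when the returned `r2` is close to 1 and `f0` exceeds the tabulated $F(\alpha)$.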

Table 1
The description of three groups of features to be used for similarity measures

Feature name    Feature description
RGB color features
feat1           average RGB color feature for 2 × 2 subimages
feat2           average RGB color feature for 3 × 3 subimages
feat3           average RGB color feature for 4 × 4 subimages
feat4           average RGB color feature for 5 × 5 subimages
feat5           average RGB color feature for 6 × 6 subimages
RGB texture features
feat6           average RGB texture feature for 2 × 2 subimages
feat7           average RGB texture feature for 3 × 3 subimages
feat8           average RGB texture feature for 4 × 4 subimages
feat9           average RGB texture feature for 5 × 5 subimages
feat10          average RGB texture feature for 6 × 6 subimages
RGB color & texture
feat11          average RGB color & texture feature for 2 × 2 subimages
feat12          average RGB color & texture feature for 3 × 3 subimages
feat13          average RGB color & texture feature for 4 × 4 subimages
feat14          average RGB color & texture feature for 5 × 5 subimages

Table 2
Test of statistical hypothesis for linear relationship between two similarity measures

Features used for similarity measures   Scatter diagram   Correlation ρ   r²   F0   F(0.05)   Result
feat1 : feat2                           straight line                                          linear
feat1 : feat4                           straight line                                          linear
feat6 : feat8                           straight line                                          linear
feat3 : feat5                           straight line                                          linear
feat8 : feat10                          straight line                                          linear
feat11 : feat14                         straight line                                          linear
feat7 : feat9                           straight line                                          linear
feat12 : feat13                         straight line                                          linear
feat1 : feat9                           scattered                                              nonlinear
feat6 : feat12                          scattered                                              nonlinear
feat5 : feat10                          scattered                                              nonlinear
feat1 : feat10                          scattered                                              nonlinear

If any two similarity measures satisfy the linear relationship, we can use that property for the distributed similarity search.

3. Distributed similarity search algorithm

The distributed similarity search algorithm retrieves the k most similar objects, using a global similarity measure, from n local databases LD_i, i = 1, ..., n. The algorithm must achieve high recall and high precision to meet the objectives of the collection fusion problem stated in Section 2. We suggest two heuristic algorithms and an algorithm using linear regression for distributed similarity search. Table 3 shows the parameters used in the algorithms.

Table 3
Parameters used in the algorithms

q       query object of distributed similarity search
k       number of objects to find
c       multiplication ratio (>1) when more than k objects are retrieved
n       number of local databases
r       number of retrievals for one local database
LD_i    ith local database
p_i     number of objects to be retrieved from LD_i in one step

Heuristic Algorithm (q, c, k, n, LD_1, ..., LD_n)
(1) send the query object q to all LDs
(2) for each LD_i, initialize p_i
(3) while (number of retrieved objects < ck)
(4)     for each LD_i, get_more_objects(q, p_i, LD_i);
        let result_i be the set of objects that are retrieved from LD_i
(5)     merge_results(result_1, ..., result_n)
(6)     for each LD_i, recalculate p_i using the heuristic estimator of LD_i
(7) end while

Here, merge_results(result_1, ..., result_n) merges and ranks the results retrieved from all local databases using the global similarity measure, and get_more_objects(q, p_i, LD_i) requests LD_i to get p_i more objects similar to the query q using the local similarity measure of LD_i, as described in [7].

If the global server retrieves exactly k objects from local databases, the recall will be less than 1 because there will be some irrelevant objects among the retrieved objects. Therefore the global server must get more than k objects, that is, ck (c > 1) objects. The precision, however, decreases as c increases. The recall has a tradeoff relation to the precision.
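The tradeoff between recall and precision when ck objects are retrieved instead of k can be illustrated with a short Python sketch. The function name `retrieval_metrics` and the object identifiers in the usage example are hypothetical; the definitions follow the objectives in Section 2, with the combined metric taken as the product of recall and precision.

```python
def retrieval_metrics(retrieved, relevant):
    """Return (recall, precision, recall * precision) for one merged result.

    retrieved: identifiers of the objects fetched from all local databases.
    relevant:  identifiers of the k objects that are truly most similar
               under the global similarity measure.
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = len(retrieved & relevant)
    recall = hits / len(relevant) if relevant else 0.0
    precision = hits / len(retrieved) if retrieved else 0.0
    return recall, precision, recall * precision


if __name__ == "__main__":
    k, c = 10, 1.2
    relevant = set(range(1, k + 1))                 # ground truth: objects 1..10
    retrieved = set(range(1, 10)) | {20, 21, 22}    # ck = 12 objects, 9 of them relevant
    print(retrieval_metrics(retrieved, relevant))
```

In this made-up example, fetching 12 objects instead of 10 allows 9 of the 10 relevant objects to be found, at the cost of 3 irrelevant objects in the result.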
If all local databases show the same recall and the same precision, it is sufficient for the global server to get $p_i = [ck/n]$ objects only once from each local database, where [ ] is the rounding operator. However, these values differ for each LD_i and cannot be known in advance, so we must refine them repeatedly. If the number of repetitions is r, the initial value of $p_i$ is given by $p_i = [ck/(rn)]$. Step (6) of the above algorithm assigns a large value to $p_i$ of a local database whose heuristic estimator is high, in order to increase the recall and the precision.

3.1. Average ranking heuristic

A heuristic estimator $\alpha_i$ is defined as:

\[ \alpha_i = M_i \Big/ \sum_{j=1}^{M_i} \mathit{Rank}_{ij}, \]

where $\mathit{Rank}_{ij}$ is the merged rank of the jth object retrieved from the ith local database and $M_i$ is the number of objects retrieved in the last retrieval from the ith local database. This value is the reciprocal of the average merged rank of the objects retrieved from the ith local database. The global server gets more objects from a local database with a high value of $\alpha_i$ and fewer objects from one with a low value of $\alpha_i$. $p_i$ of the heuristic algorithm is given as follows:

\[ p_i = \left[ \frac{k}{r} \cdot \frac{\alpha_i}{\alpha_1 + \cdots + \alpha_n} \right]. \]

3.2. Average global similarity heuristic

This is similar to the average ranking heuristic. The rank is an integer value with a uniform difference between adjacently ranked objects. However, the similarity difference between adjacent objects may not be uniform. So, the heuristic estimator $\beta_i$ is defined as:

\[ \beta_i = \sum_{j=1}^{M_i} \mathit{Global\_Similarity}_{ij} \Big/ M_i. \]

This value is the average global similarity of the objects retrieved from the ith local database. $p_i$ of the heuristic algorithm is given as follows:

\[ p_i = \left[ \frac{k}{r} \cdot \frac{\beta_i}{\beta_1 + \cdots + \beta_n} \right]. \]
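The two estimators and the resulting allocation of $p_i$ can be sketched as follows. This is a minimal Python illustration, not the authors' code: the function names are hypothetical, and the rounding operator [ ] is assumed here to round half up, which the text does not specify.

```python
import math


def alpha_estimator(merged_ranks):
    """Average ranking heuristic: M_i divided by the sum of merged ranks."""
    return len(merged_ranks) / sum(merged_ranks)


def beta_estimator(global_similarities):
    """Average global similarity heuristic: mean global similarity value."""
    return sum(global_similarities) / len(global_similarities)


def allocate(estimators, k, r):
    """Return p_i = [(k / r) * e_i / (e_1 + ... + e_n)] for each local database.

    Assumption: [ ] rounds half up, implemented as floor(x + 0.5).
    """
    total = sum(estimators)
    return [math.floor((k / r) * e / total + 0.5) for e in estimators]


if __name__ == "__main__":
    # Hypothetical merged ranks of the objects last retrieved from two databases.
    alphas = [alpha_estimator([1, 2, 4]), alpha_estimator([3, 5, 6])]
    print(allocate(alphas, k=30, r=3))
```

In this made-up case the first database contributed better ranked objects, so it is asked for more objects in the next step than the second one.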

3.3. Distributed similarity search algorithm using the linear regression

In Section 2, we observed that there exist similarity measures that have a linear relationship between them. For these cases, we can apply the linear regression analysis to a distributed similarity search. The linear equation, $\hat{y} = \hat{\alpha} + \hat{\beta}x$, of the straight line in a scatter diagram is obtained by using the linear regression analysis. The algorithm retrieves the predefined number p of objects from each local database and analyzes the retrieved objects to find the linear equation and the global threshold (GT) corresponding to the local threshold (LT). The least of the local similarity values of the retrieved objects becomes the local threshold. The algorithm uses three different global thresholds, $gt^u$, $gt^m$, $gt^l$, corresponding to the local threshold. In Fig. 3, $gt^m$ is the y-coordinate of the intersection point of $\hat{y} = \hat{\alpha} + \hat{\beta}x$ and $x = LT$; $gt^u$ is that of the intersection point of $\hat{y} = \hat{\alpha} + \hat{\beta}x + d_y$ and $x = LT$, where $d_y$ is the $100(1 - \delta)\%$ confidence interval of y; and $gt^l$ is that of the intersection point of $\hat{y} = \hat{\alpha} + \hat{\beta}x - d_y$ and $x = LT$. T indicates the type of the global threshold, one of $gt^u$, $gt^m$, $gt^l$.

Algorithm (p, c, k, n, q, T, LD_1, ..., LD_n)
(1) for each LD_i, i = 1, ..., n, get_more_objects(q, p, LD_i)
(2) for each LD_i, i = 1, ..., n, analyze the objects retrieved from LD_i using the linear regression analysis, obtain the equation $\hat{y}_i = \hat{\alpha}_i + \hat{\beta}_i x_i$, and obtain $gt_i$, where $gt_i$ is one of $gt_i^u$, $gt_i^m$, $gt_i^l$ according to T
(3) let LD_l be the local database which has the largest GT among all local databases, and let its GT be $gt_l$
(4) if (the total number of retrieved objects with global similarity value greater than $gt_l$) ≥ k or (the total number of retrieved objects) ≥ ck then stop
(5) select the LD_l which has the largest GT among all local databases
(6) get_more_objects(q, p, LD_l)
(7) analyze the objects from LD_l using linear regression
(8) goto step (3)
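The threshold computation and the choice of the next database to query can be sketched in Python as below. The sketch rests on assumptions that go beyond the text: the function names are hypothetical, the regression coefficients and the confidence interval half-width `d_y` are taken as already computed inputs, and the thresholds are keyed by the strings "u", "m" and "l".

```python
def global_thresholds(alpha_hat, beta_hat, local_similarities, d_y):
    """Map the local threshold LT to the three global thresholds.

    LT is the least local similarity value among the retrieved objects;
    d_y is the confidence interval half-width of y, assumed to be given.
    """
    lt = min(local_similarities)
    gt_m = alpha_hat + beta_hat * lt
    return {"u": gt_m + d_y, "m": gt_m, "l": gt_m - d_y}


def pick_next_database(thresholds_by_db, kind="u"):
    """Return the name of the local database with the largest global threshold."""
    return max(thresholds_by_db, key=lambda db: thresholds_by_db[db][kind])


if __name__ == "__main__":
    # Hypothetical fits and local similarity values for two local databases.
    thresholds = {
        "LD1": global_thresholds(0.1, 0.8, [0.9, 0.75, 0.5], d_y=0.05),
        "LD2": global_thresholds(0.0, 1.0, [0.8, 0.7, 0.6], d_y=0.05),
    }
    print(pick_next_database(thresholds, kind="u"))
```

A database whose global threshold is still high may hold further objects with high global similarity, which is why the algorithm retrieves from it next.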
This algorithm selects the local database that has the largest global threshold and retrieves objects from that database in the next step. The algorithm uses one of the three global thresholds $gt^m$, $gt^u$, $gt^l$. With $gt^u$, the recall of the result is high and the precision is low. With $gt^m$, the recall is lower than with $gt^u$ while the precision is higher. With $gt^l$, the recall is the lowest among the three cases and the precision is the highest.

Fig. 3. Three global thresholds corresponding to the local threshold.

4. Experiment

In order to measure the effectiveness and performance of the proposed distributed similarity search algorithms, we conducted comprehensive experiments in an environment containing a large number of images and various queries. The test data consists of 3016 images with 256 RGB color bitmaps. The contents of the test images are shown in Table 4.

Table 4
The contents of test images

Category        # of images   Area
plants          720           flower, leaves, grass
pattern         680           glass, brick, woods
architecture    820           house, building
scene           796           water, sky, cloud

To show how accurate the linear regression is when only part of the objects has been retrieved, we present experimental results in Table 5, indicating that the partial results gradually approach the final result. The features used for the similarity measures are chosen from the RGB color group. As the number of retrieved objects increases, $r^2$, $\alpha$, and $\beta$ approach the final values.

Table 5
The preciseness of the linear regression of partly retrieved objects

# of retrieved objects   MSE*   r²   α   β
Total objects
* MSE (mean square error) is (residual sum of squares)/(number of retrieved objects).

We evaluated the effectiveness of the algorithms using the precision, the recall, and a combined metric that is the product of the recall and the precision. The combined metric measures the overall effectiveness. We made 10 queries for each test using various parameters and averaged their results.

We assume four local databases with one global server, where the images are distributed over the local databases. To allocate images to these local databases, we use two approaches: (1) random allocation and (2) clustered allocation. In the random allocation, all images are distributed randomly over the four databases. In the clustered allocation, similar images are likely to be allocated to the same local database. An equal number of images is allocated to each local database. Clusters are generated with randomly distributed centers. Each local database contains 4 to 5 clusters. About 60% of the data are allocated to clusters, while the rest are distributed randomly. These two cases are evaluated separately. The other test parameter values are a 99.9% confidence level for estimating the confidence interval of y, the upper type of the global threshold $gt^u$, 1.2 for the value of c, and 10 for p and the initial $p_i$.

Fig. 4. The precision and the recall of each algorithm in the clustered distribution.

Fig. 5. P × R of each algorithm in the clustered distribution.

Fig. 6. P × R of each algorithm in the random distribution.

The graphs of the precision, the recall, and their combined metric P × R for the three algorithms are summarized in Figs. 4-6. For the clustered distribution, the algorithm using the linear regression outperforms the average ranking heuristic algorithm (alpha)

and the average global similarity heuristic algorithm (beta), because the algorithm using the linear regression (linear) reflects the clustering effect of the data distribution well. For the random distribution, the algorithm using the linear regression shows slightly better results than the other algorithms. In a real situation, however, the data distributions of databases on the Web are generally clustered. Therefore, the algorithm using the linear regression is likely to be the more practical choice.

5. Conclusion

In this paper, we proposed novel distributed similarity search algorithms that solve the collection fusion problem for multimedia databases in a distributed heterogeneous environment such as the Web. Experiments show that the algorithm using the linear regression performs best among the three. To our knowledge, this is the first study of the collection fusion problem for distributed heterogeneous multimedia databases and the first to present algorithms as solutions. Searching multimedia databases on the Web is becoming a very important issue, so the algorithms proposed in this paper can serve as a basis for future research in this area.

References

[1] J. Callan, Z. Lu, W. Croft, Searching distributed collections with inference networks, in: Proc. 18th Annual Internat. ACM/SIGIR Conference, 1995.
[2] W. Chang, G. Sheikholeslami, J. Wang, A. Zhang, Data resource selection in distributed visual information systems, IEEE Trans. Knowledge Data Engrg. 10 (6) (1998).
[3] L. Gravano, H. Garcia-Molina, Merging ranks from heterogeneous internet sources, in: Proc. 23rd Internat. Conf. on Very Large Data Bases, August 1997.
[4] J.H. Lee, D.H. Kim, C.W. Chung, Multi-dimensional selectivity estimation using compressed histogram information, in: Proc. ACM SIGMOD Internat. Conf. on Management of Data, June 1999.
[5] W. Meng, K.L. Liu, C. Yu, X. Wang, Y. Chang, N. Rishe, Determining text databases to search in the Internet, in: Proc. Internat. Conf. on Very Large Data Bases, August 1998.
[6] M. Ortega, K. Chakrabarti, K. Porkaew, S. Mehrotra, Supporting ranked Boolean similarity queries in MARS, IEEE Trans. Knowledge Data Engrg. 10 (6) (1998).
[7] T. Seidl, H. Kriegel, Optimal multi-step k-nearest neighbor search, in: Proc. ACM SIGMOD Internat. Conf. on Management of Data, June 1998.
[8] E. Voorhees, N. Gupta, B. Johnson-Laird, The collection fusion problem, in: Proc. 3rd Text Retrieval Conference (TREC-3), 1994.
[9] R.V. Hogg, E.A. Tanis, Probability & Statistical Inference, MacMillan Publishing Co., New York.
[10] S.H. Park, Regression Analysis, DaeYoung Publishing Co., 1985.


LETTER Local and Nonlocal Color Line Models for Image Matting

LETTER Local and Nonlocal Color Line Models for Image Matting 1814 IEICE TRANS. FUNDAMENTALS, VOL.E97 A, NO.8 AUGUST 2014 LETTER Local and Nonlocal Color Line Models for Image Matting Byoung-Kwang KIM a), Meiguang JIN, Nonmembers, and Woo-Jin SONG, Member SUMMARY

More information

Obtaining Language Models of Web Collections Using Query-Based Sampling Techniques

Obtaining Language Models of Web Collections Using Query-Based Sampling Techniques -7695-1435-9/2 $17. (c) 22 IEEE 1 Obtaining Language Models of Web Collections Using Query-Based Sampling Techniques Gary A. Monroe James C. French Allison L. Powell Department of Computer Science University

More information

Story Unit Segmentation with Friendly Acoustic Perception *

Story Unit Segmentation with Friendly Acoustic Perception * Story Unit Segmentation with Friendly Acoustic Perception * Longchuan Yan 1,3, Jun Du 2, Qingming Huang 3, and Shuqiang Jiang 1 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing,

More information

Document Clustering using Feature Selection Based on Multiviewpoint and Link Similarity Measure

Document Clustering using Feature Selection Based on Multiviewpoint and Link Similarity Measure Document Clustering using Feature Selection Based on Multiviewpoint and Link Similarity Measure Neelam Singh neelamjain.jain@gmail.com Neha Garg nehagarg.february@gmail.com Janmejay Pant geujay2010@gmail.com

More information

Semantics-based Image Retrieval by Region Saliency

Semantics-based Image Retrieval by Region Saliency Semantics-based Image Retrieval by Region Saliency Wei Wang, Yuqing Song and Aidong Zhang Department of Computer Science and Engineering, State University of New York at Buffalo, Buffalo, NY 14260, USA

More information

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Improving the Efficiency of Fast Using Semantic Similarity Algorithm International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year

More information

Image Compression and Resizing Using Improved Seam Carving for Retinal Images

Image Compression and Resizing Using Improved Seam Carving for Retinal Images Image Compression and Resizing Using Improved Seam Carving for Retinal Images Prabhu Nayak 1, Rajendra Chincholi 2, Dr.Kalpana Vanjerkhede 3 1 PG Student, Department of Electronics and Instrumentation

More information

Digital Halftoning Algorithm Based o Space-Filling Curve

Digital Halftoning Algorithm Based o Space-Filling Curve JAIST Reposi https://dspace.j Title Digital Halftoning Algorithm Based o Space-Filling Curve Author(s)ASANO, Tetsuo Citation IEICE TRANSACTIONS on Fundamentals o Electronics, Communications and Comp Sciences,

More information

Cellular Learning Automata-Based Color Image Segmentation using Adaptive Chains

Cellular Learning Automata-Based Color Image Segmentation using Adaptive Chains Cellular Learning Automata-Based Color Image Segmentation using Adaptive Chains Ahmad Ali Abin, Mehran Fotouhi, Shohreh Kasaei, Senior Member, IEEE Sharif University of Technology, Tehran, Iran abin@ce.sharif.edu,

More information

Distributed k-nn Query Processing for Location Services

Distributed k-nn Query Processing for Location Services Distributed k-nn Query Processing for Location Services Jonghyeong Han 1, Joonwoo Lee 1, Seungyong Park 1, Jaeil Hwang 1, and Yunmook Nah 1 1 Department of Electronics and Computer Engineering, Dankook

More information

CS54701: Information Retrieval

CS54701: Information Retrieval CS54701: Information Retrieval Federated Search 10 March 2016 Prof. Chris Clifton Outline Federated Search Introduction to federated search Main research problems Resource Representation Resource Selection

More information

A new predictive image compression scheme using histogram analysis and pattern matching

A new predictive image compression scheme using histogram analysis and pattern matching University of Wollongong Research Online University of Wollongong in Dubai - Papers University of Wollongong in Dubai 00 A new predictive image compression scheme using histogram analysis and pattern matching

More information

CONTENT BASED IMAGE RETRIEVAL SYSTEM USING IMAGE CLASSIFICATION

CONTENT BASED IMAGE RETRIEVAL SYSTEM USING IMAGE CLASSIFICATION International Journal of Research and Reviews in Applied Sciences And Engineering (IJRRASE) Vol 8. No.1 2016 Pp.58-62 gopalax Journals, Singapore available at : www.ijcns.com ISSN: 2231-0061 CONTENT BASED

More information

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data Ms. Gayatri Attarde 1, Prof. Aarti Deshpande 2 M. E Student, Department of Computer Engineering, GHRCCEM, University

More information

A Fourier Extension Based Algorithm for Impulse Noise Removal

A Fourier Extension Based Algorithm for Impulse Noise Removal A Fourier Extension Based Algorithm for Impulse Noise Removal H. Sahoolizadeh, R. Rajabioun *, M. Zeinali Abstract In this paper a novel Fourier extension based algorithm is introduced which is able to

More information

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction Yongying Gao and Hayder Radha Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48823 email:

More information

An Efficient Methodology for Image Rich Information Retrieval

An Efficient Methodology for Image Rich Information Retrieval An Efficient Methodology for Image Rich Information Retrieval 56 Ashwini Jaid, 2 Komal Savant, 3 Sonali Varma, 4 Pushpa Jat, 5 Prof. Sushama Shinde,2,3,4 Computer Department, Siddhant College of Engineering,

More information

Extendible Chained Bucket Hashing for Main Memory Databases. Abstract

Extendible Chained Bucket Hashing for Main Memory Databases. Abstract Extendible Chained Bucket Hashing for Main Memory Databases Pyung-Chul Kim *, Kee-Wook Rim, Jin-Pyo Hong Electronics and Telecommunications Research Institute (ETRI) P.O. Box 106, Yusong, Taejon, 305-600,

More information

Federated Text Search

Federated Text Search CS54701 Federated Text Search Luo Si Department of Computer Science Purdue University Abstract Outline Introduction to federated search Main research problems Resource Representation Resource Selection

More information

Clustering Color/Intensity. Group together pixels of similar color/intensity.

Clustering Color/Intensity. Group together pixels of similar color/intensity. Clustering Color/Intensity Group together pixels of similar color/intensity. Agglomerative Clustering Cluster = connected pixels with similar color. Optimal decomposition may be hard. For example, find

More information

A Novel Image Retrieval Method Using Segmentation and Color Moments

A Novel Image Retrieval Method Using Segmentation and Color Moments A Novel Image Retrieval Method Using Segmentation and Color Moments T.V. Saikrishna 1, Dr.A.Yesubabu 2, Dr.A.Anandarao 3, T.Sudha Rani 4 1 Assoc. Professor, Computer Science Department, QIS College of

More information

Using Natural Clusters Information to Build Fuzzy Indexing Structure

Using Natural Clusters Information to Build Fuzzy Indexing Structure Using Natural Clusters Information to Build Fuzzy Indexing Structure H.Y. Yue, I. King and K.S. Leung Department of Computer Science and Engineering The Chinese University of Hong Kong Shatin, New Territories,

More information

TEVI: Text Extraction for Video Indexing

TEVI: Text Extraction for Video Indexing TEVI: Text Extraction for Video Indexing Hichem KARRAY, Mohamed SALAH, Adel M. ALIMI REGIM: Research Group on Intelligent Machines, EIS, University of Sfax, Tunisia hichem.karray@ieee.org mohamed_salah@laposte.net

More information

Short Run length Descriptor for Image Retrieval

Short Run length Descriptor for Image Retrieval CHAPTER -6 Short Run length Descriptor for Image Retrieval 6.1 Introduction In the recent years, growth of multimedia information from various sources has increased many folds. This has created the demand

More information

Unequal Error Recovery Scheme for Multimedia Streaming in Application-Level Multicast

Unequal Error Recovery Scheme for Multimedia Streaming in Application-Level Multicast Unequal Error Recovery Scheme for Multimedia Streaming in Application-Level Multicast Joonhyoung Lee, Youngha Jung, and Yoonsik Choe Department of Electrical and Electronic Engineering, Yonsei University,

More information

QUERY REGION DETERMINATION BASED ON REGION IMPORTANCE INDEX AND RELATIVE POSITION FOR REGION-BASED IMAGE RETRIEVAL

QUERY REGION DETERMINATION BASED ON REGION IMPORTANCE INDEX AND RELATIVE POSITION FOR REGION-BASED IMAGE RETRIEVAL International Journal of Technology (2016) 4: 654-662 ISSN 2086-9614 IJTech 2016 QUERY REGION DETERMINATION BASED ON REGION IMPORTANCE INDEX AND RELATIVE POSITION FOR REGION-BASED IMAGE RETRIEVAL Pasnur

More information

Adaptive Aggregation Scheduling Using. Aggregation-degree Control in Sensor Network

Adaptive Aggregation Scheduling Using. Aggregation-degree Control in Sensor Network Contemporary Engineering Sciences, Vol. 7, 2014, no. 14, 725-730 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ces.2014.4681 Adaptive Aggregation Scheduling Using Aggregation-degree Control in

More information

An Enhanced Image Retrieval Using K-Mean Clustering Algorithm in Integrating Text and Visual Features

An Enhanced Image Retrieval Using K-Mean Clustering Algorithm in Integrating Text and Visual Features An Enhanced Image Retrieval Using K-Mean Clustering Algorithm in Integrating Text and Visual Features S.Najimun Nisha 1, Mrs.K.A.Mehar Ban 2, 1 PG Student, SVCET, Puliangudi. najimunnisha@yahoo.com 2 AP/CSE,

More information

Observational Learning with Modular Networks

Observational Learning with Modular Networks Observational Learning with Modular Networks Hyunjung Shin, Hyoungjoo Lee and Sungzoon Cho {hjshin72, impatton, zoon}@snu.ac.kr Department of Industrial Engineering, Seoul National University, San56-1,

More information

Automatic Texture Segmentation for Texture-based Image Retrieval

Automatic Texture Segmentation for Texture-based Image Retrieval Automatic Texture Segmentation for Texture-based Image Retrieval Ying Liu, Xiaofang Zhou School of ITEE, The University of Queensland, Queensland, 4072, Australia liuy@itee.uq.edu.au, zxf@itee.uq.edu.au

More information

A Simple and Strong Algorithm for Reconfiguration of Hexagonal Metamorphic Robots

A Simple and Strong Algorithm for Reconfiguration of Hexagonal Metamorphic Robots 50 A Simple and Strong Algorithm for Reconfiguration of Hexagonal Metamorphic Robots KwangEui Lee Department of Multimedia Engineering, Dongeui University, Busan, Korea Summary In this paper, we propose

More information

Speed-up Multi-modal Near Duplicate Image Detection

Speed-up Multi-modal Near Duplicate Image Detection Open Journal of Applied Sciences, 2013, 3, 16-21 Published Online March 2013 (http://www.scirp.org/journal/ojapps) Speed-up Multi-modal Near Duplicate Image Detection Chunlei Yang 1,2, Jinye Peng 2, Jianping

More information

A Novel Algorithm for Associative Classification

A Novel Algorithm for Associative Classification A Novel Algorithm for Associative Classification Gourab Kundu 1, Sirajum Munir 1, Md. Faizul Bari 1, Md. Monirul Islam 1, and K. Murase 2 1 Department of Computer Science and Engineering Bangladesh University

More information

Spatial Index Keyword Search in Multi- Dimensional Database

Spatial Index Keyword Search in Multi- Dimensional Database Spatial Index Keyword Search in Multi- Dimensional Database Sushma Ahirrao M. E Student, Department of Computer Engineering, GHRIEM, Jalgaon, India ABSTRACT: Nearest neighbor search in multimedia databases

More information

Wavelet Based Image Retrieval Method

Wavelet Based Image Retrieval Method Wavelet Based Image Retrieval Method Kohei Arai Graduate School of Science and Engineering Saga University Saga City, Japan Cahya Rahmad Electronic Engineering Department The State Polytechnics of Malang,

More information

ADAPTIVE TEXTURE IMAGE RETRIEVAL IN TRANSFORM DOMAIN

ADAPTIVE TEXTURE IMAGE RETRIEVAL IN TRANSFORM DOMAIN THE SEVENTH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2002), DEC. 2-5, 2002, SINGAPORE. ADAPTIVE TEXTURE IMAGE RETRIEVAL IN TRANSFORM DOMAIN Bin Zhang, Catalin I Tomai,

More information

Searching non-text information objects

Searching non-text information objects Non-text digital objects Searching non-text information objects Music Speech Images 3D models Video? 1 2 Ways to query for something Query by describing content 1. Query by category/ theme easiest - work

More information

Single link clustering: 11/7: Lecture 18. Clustering Heuristics 1

Single link clustering: 11/7: Lecture 18. Clustering Heuristics 1 Graphs and Networks Page /7: Lecture 8. Clustering Heuristics Wednesday, November 8, 26 8:49 AM Today we will talk about clustering and partitioning in graphs, and sometimes in data sets. Partitioning

More information

Inverted Index for Fast Nearest Neighbour

Inverted Index for Fast Nearest Neighbour Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,

More information

Big Data Methods. Chapter 5: Machine learning. Big Data Methods, Chapter 5, Slide 1

Big Data Methods. Chapter 5: Machine learning. Big Data Methods, Chapter 5, Slide 1 Big Data Methods Chapter 5: Machine learning Big Data Methods, Chapter 5, Slide 1 5.1 Introduction to machine learning What is machine learning? Concerned with the study and development of algorithms that

More information

A New Feature Local Binary Patterns (FLBP) Method

A New Feature Local Binary Patterns (FLBP) Method A New Feature Local Binary Patterns (FLBP) Method Jiayu Gu and Chengjun Liu The Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA Abstract - This paper presents

More information

Statistical Matching using Fractional Imputation

Statistical Matching using Fractional Imputation Statistical Matching using Fractional Imputation Jae-Kwang Kim 1 Iowa State University 1 Joint work with Emily Berg and Taesung Park 1 Introduction 2 Classical Approaches 3 Proposed method 4 Application:

More information

Lecture on Modeling Tools for Clustering & Regression

Lecture on Modeling Tools for Clustering & Regression Lecture on Modeling Tools for Clustering & Regression CS 590.21 Analysis and Modeling of Brain Networks Department of Computer Science University of Crete Data Clustering Overview Organizing data into

More information

NOVEL APPROACH TO CONTENT-BASED VIDEO INDEXING AND RETRIEVAL BY USING A MEASURE OF STRUCTURAL SIMILARITY OF FRAMES. David Asatryan, Manuk Zakaryan

NOVEL APPROACH TO CONTENT-BASED VIDEO INDEXING AND RETRIEVAL BY USING A MEASURE OF STRUCTURAL SIMILARITY OF FRAMES. David Asatryan, Manuk Zakaryan International Journal "Information Content and Processing", Volume 2, Number 1, 2015 71 NOVEL APPROACH TO CONTENT-BASED VIDEO INDEXING AND RETRIEVAL BY USING A MEASURE OF STRUCTURAL SIMILARITY OF FRAMES

More information

Color Image Segmentation

Color Image Segmentation Color Image Segmentation Yining Deng, B. S. Manjunath and Hyundoo Shin* Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 93106-9560 *Samsung Electronics Inc.

More information

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN 2016 International Conference on Artificial Intelligence: Techniques and Applications (AITA 2016) ISBN: 978-1-60595-389-2 Face Recognition Using Vector Quantization Histogram and Support Vector Machine

More information

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation Optimizing the Deblocking Algorithm for H.264 Decoder Implementation Ken Kin-Hung Lam Abstract In the emerging H.264 video coding standard, a deblocking/loop filter is required for improving the visual

More information

CS47300: Web Information Search and Management

CS47300: Web Information Search and Management CS47300: Web Information Search and Management Federated Search Prof. Chris Clifton 13 November 2017 Federated Search Outline Introduction to federated search Main research problems Resource Representation

More information

Link Prediction for Social Network

Link Prediction for Social Network Link Prediction for Social Network Ning Lin Computer Science and Engineering University of California, San Diego Email: nil016@eng.ucsd.edu Abstract Friendship recommendation has become an important issue

More information

Multiresponse Sparse Regression with Application to Multidimensional Scaling

Multiresponse Sparse Regression with Application to Multidimensional Scaling Multiresponse Sparse Regression with Application to Multidimensional Scaling Timo Similä and Jarkko Tikka Helsinki University of Technology, Laboratory of Computer and Information Science P.O. Box 54,

More information

2.1 Optimized Importance Map

2.1 Optimized Importance Map 3rd International Conference on Multimedia Technology(ICMT 2013) Improved Image Resizing using Seam Carving and scaling Yan Zhang 1, Jonathan Z. Sun, Jingliang Peng Abstract. Seam Carving, the popular

More information

REDUNDANCY REMOVAL IN WEB SEARCH RESULTS USING RECURSIVE DUPLICATION CHECK ALGORITHM. Pudukkottai, Tamil Nadu, India

REDUNDANCY REMOVAL IN WEB SEARCH RESULTS USING RECURSIVE DUPLICATION CHECK ALGORITHM. Pudukkottai, Tamil Nadu, India REDUNDANCY REMOVAL IN WEB SEARCH RESULTS USING RECURSIVE DUPLICATION CHECK ALGORITHM Dr. S. RAVICHANDRAN 1 E.ELAKKIYA 2 1 Head, Dept. of Computer Science, H. H. The Rajah s College, Pudukkottai, Tamil

More information

Block-based Watermarking Using Random Position Key

Block-based Watermarking Using Random Position Key IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.2, February 2009 83 Block-based Watermarking Using Random Position Key Won-Jei Kim, Jong-Keuk Lee, Ji-Hong Kim, and Ki-Ryong

More information

A METHOD FOR CONTENT-BASED SEARCHING OF 3D MODEL DATABASES

A METHOD FOR CONTENT-BASED SEARCHING OF 3D MODEL DATABASES A METHOD FOR CONTENT-BASED SEARCHING OF 3D MODEL DATABASES Jiale Wang *, Hongming Cai 2 and Yuanjun He * Department of Computer Science & Technology, Shanghai Jiaotong University, China Email: wjl8026@yahoo.com.cn

More information

NDoT: Nearest Neighbor Distance Based Outlier Detection Technique

NDoT: Nearest Neighbor Distance Based Outlier Detection Technique NDoT: Nearest Neighbor Distance Based Outlier Detection Technique Neminath Hubballi 1, Bidyut Kr. Patra 2, and Sukumar Nandi 1 1 Department of Computer Science & Engineering, Indian Institute of Technology

More information

Sumantra Dutta Roy, Preeti Rao and Rishabh Bhargava

Sumantra Dutta Roy, Preeti Rao and Rishabh Bhargava 1 OPTIMAL PARAMETER ESTIMATION AND PERFORMANCE MODELLING IN MELODIC CONTOUR-BASED QBH SYSTEMS Sumantra Dutta Roy, Preeti Rao and Rishabh Bhargava Department of Electrical Engineering, IIT Bombay, Powai,

More information

TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback

TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback RMIT @ TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback Ameer Albahem ameer.albahem@rmit.edu.au Lawrence Cavedon lawrence.cavedon@rmit.edu.au Damiano

More information

THE EFFECT OF JOIN SELECTIVITIES ON OPTIMAL NESTING ORDER

THE EFFECT OF JOIN SELECTIVITIES ON OPTIMAL NESTING ORDER THE EFFECT OF JOIN SELECTIVITIES ON OPTIMAL NESTING ORDER Akhil Kumar and Michael Stonebraker EECS Department University of California Berkeley, Ca., 94720 Abstract A heuristic query optimizer must choose

More information

Inverted List Caching for Topical Index Shards

Inverted List Caching for Topical Index Shards Inverted List Caching for Topical Index Shards Zhuyun Dai and Jamie Callan Language Technologies Institute, Carnegie Mellon University {zhuyund, callan}@cs.cmu.edu Abstract. Selective search is a distributed

More information

Data Distortion for Privacy Protection in a Terrorist Analysis System

Data Distortion for Privacy Protection in a Terrorist Analysis System Data Distortion for Privacy Protection in a Terrorist Analysis System Shuting Xu, Jun Zhang, Dianwei Han, and Jie Wang Department of Computer Science, University of Kentucky, Lexington KY 40506-0046, USA

More information

Integrated Framework for Keyword-based Text Data Collection and Analysis

Integrated Framework for Keyword-based Text Data Collection and Analysis Sensors and Materials, Vol. 30, No. 3 (2018) 439 445 MYU Tokyo 439 S & M 1506 Integrated Framework for Keyword-based Text Data Collection and Analysis Minki Cha, 1 Jung-Hyok Kwon, 1 Sol-Bee Lee, 1 Jaehoon

More information

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem. STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,

More information

Clustering. Robert M. Haralick. Computer Science, Graduate Center City University of New York

Clustering. Robert M. Haralick. Computer Science, Graduate Center City University of New York Clustering Robert M. Haralick Computer Science, Graduate Center City University of New York Outline K-means 1 K-means 2 3 4 5 Clustering K-means The purpose of clustering is to determine the similarity

More information

A Bayesian Approach to Hybrid Image Retrieval

A Bayesian Approach to Hybrid Image Retrieval A Bayesian Approach to Hybrid Image Retrieval Pradhee Tandon and C. V. Jawahar Center for Visual Information Technology International Institute of Information Technology Hyderabad - 500032, INDIA {pradhee@research.,jawahar@}iiit.ac.in

More information

Content Based Image Retrieval Using Color Quantizes, EDBTC and LBP Features

Content Based Image Retrieval Using Color Quantizes, EDBTC and LBP Features Content Based Image Retrieval Using Color Quantizes, EDBTC and LBP Features 1 Kum Sharanamma, 2 Krishnapriya Sharma 1,2 SIR MVIT Abstract- To describe the image features the Local binary pattern (LBP)

More information

A Real Time GIS Approximation Approach for Multiphase Spatial Query Processing Using Hierarchical-Partitioned-Indexing Technique

A Real Time GIS Approximation Approach for Multiphase Spatial Query Processing Using Hierarchical-Partitioned-Indexing Technique International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2017 IJSRCSEIT Volume 2 Issue 6 ISSN : 2456-3307 A Real Time GIS Approximation Approach for Multiphase

More information

Neural Network based textural labeling of images in multimedia applications

Neural Network based textural labeling of images in multimedia applications Neural Network based textural labeling of images in multimedia applications S.A. Karkanis +, G.D. Magoulas +, and D.A. Karras ++ + University of Athens, Dept. of Informatics, Typa Build., Panepistimiopolis,

More information

Novel Cluster Based Routing Protocol in Wireless Sensor Networks

Novel Cluster Based Routing Protocol in Wireless Sensor Networks ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 32 Novel Cluster Based Routing Protocol in Wireless Sensor Networks Bager Zarei 1, Mohammad Zeynali 2 and Vahid Majid Nezhad 3 1 Department of Computer

More information

Mining Data Streams. Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction. Summarization Methods. Clustering Data Streams

Mining Data Streams. Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction. Summarization Methods. Clustering Data Streams Mining Data Streams Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction Summarization Methods Clustering Data Streams Data Stream Classification Temporal Models CMPT 843, SFU, Martin Ester, 1-06

More information

CFMTL: Clustering Wireless Sensor Network Using Fuzzy Logic and Mobile Sink In Three-Level

CFMTL: Clustering Wireless Sensor Network Using Fuzzy Logic and Mobile Sink In Three-Level CFMTL: Clustering Wireless Sensor Network Using Fuzzy Logic and Mobile Sink In Three-Level Ali Abdi Seyedkolaei 1 and Ali Zakerolhosseini 2 1 Department of Computer, Shahid Beheshti University, Tehran,

More information

Temperature Calculation of Pellet Rotary Kiln Based on Texture

Temperature Calculation of Pellet Rotary Kiln Based on Texture Intelligent Control and Automation, 2017, 8, 67-74 http://www.scirp.org/journal/ica ISSN Online: 2153-0661 ISSN Print: 2153-0653 Temperature Calculation of Pellet Rotary Kiln Based on Texture Chunli Lin,

More information