Document Summarization on Handheld Device:

Size: px
Start display at page:

Download "Document Summarization on Handheld Device:"

Transcription

1 Document Summarization on Handheld Device: An Information Visualization Tool for Mobile Commerce Christopher C. Yang Dept. of Systems Eng. and Eng. Management The Chinese University of Hong Kong Hong Kong SAR, China Fu Lee Wang Dept. of Systems Eng. and Eng. Management The Chinese University of Hong Kong Hong Kong SAR, China Abstract: Wireless access with mobile (or handheld) devices is a promising addition to the WWW and traditional electronic business. Much information is generated or accessed on the road, in conversation, while shopping in stores or meeting with business clients although traditional electronic business requires users to be stationary with network connection. Mobile devices provide convenience and portable access to the huge information space on the Internet. With the advance of mobile technologies, it creates added values and business opportunities to mobile commerce. However, the limited screen size, narrow network bandwidth, small memory capacity and low computing power are the shortcomings of handheld devices. Loading and visualizing large documents on handheld devices become impossible. The limited resolution restricts the amount of information to be displayed. The download time is intolerably long. All these shortcomings make vendors and users hesitate to explore the enormous opportunities in mobile commerce. In this paper, we introduce the fractal summarization model for document summarization on handheld devices. Three-tier architecture with the middle-tier conducting the major computation is also proposed. Visualization of summary on handheld devices is also investigated. The automatic summarization, three-tier architecture, information visualization are potential solutions to the existing problems in handheld devices for mobile commerce. Keyword: Document summarization, mobile commerce, fisheye view, handheld devices. 1. Introduction The advance of mobile network creates business opportunities and provides value-added services to users. Access to the Internet through mobile phones and other handheld devices is growing significantly in recent years. The Wireless Application Protocol (WAP) and Wireless Markup Language (WML) provide the universal open standard and markup language. In this age of information, many information-centric applications have been developed for the handheld devices [3][18][22]. For example, users can now surf the web, check , read news, quote stock price, etc. using handheld devices.

2 The convenience of handheld devices allows information access without geometric limitation. Executive officers may need to access important documents for making decision while they are on the roads out of their office in a dynamic business society. However, there may not be any network connection to access the Internet and the decision making cannot be deferred. In these situations, mobile devices, which provide Internet access anywhere, are able to solve the problem. Internet access with wireless handheld devices eliminates the geometric limitation; however there are other limitations of handheld devices that restrict its capability. 1.1 Shortcomings of Wireless Handheld Devices Although the development of wireless handheld devices is fast in recent years, there are many shortcomings associated with these devices, such as screen size, bandwidth, and memory capacity. There are two major categories of wireless handheld devices, namely WAP-enabled mobile phones and wireless PDAs. The products of Nokia and Palm are taken as examples here since both of them are the leading companies in mobile technologies. Screen Resolution Popular Wireless Handheld Devices Nokia 3320, 3330, 3360, 5510, 5210, 8310, 8390, Nokia Nokia 3350, 3410, 3510, 3590, 3610, 6310, 6510, 6590, Nokia 6610, Nokia Nokia 9110i, Palm i705 Table 1: Screen Resolutions of Wireless Handheld Devices. At present, the typical display size of popular WAP-enabled handsets and PDAs are pixels and pixels, respectively, which is approximately 1/126 to 1/30 of the display area of a standard personal computer ( pixels). The memory capacity of a handheld device greatly limits the amount of information that can be stored. The maximum WML deck size is 64 kilobyte (Nokia 9110i and Nokia 9210), and the maximum WML deck size for most of popular handset is about 1.4 kilobytes to 2.8 kilobytes binary. The typical memory capacity of PDAs is 8MB.

3 The current bandwidth available for WAP is 9.6Kbps and can be speedup to 40.2 kbps data speed with GPRS, however it is not comparable with the broadband internet connection for PC Potential Solutions by Automatic Summarization and Information Visualization Although handheld devices are convenient, they impose other constraints that do not exist on desktop computers. The low bandwidth and small resolution are major shortcomings of handheld devices. Information overloading is a critical problem; advance searching techniques solve the problem by filtering most of the irrelevant information. However, the precision of most of the commercial search engines is not high. Users may only find a few relevant documents out of a large pool of searching result. Given the large screen and high bandwidth for desktop computing, users may still be able to browse the searching result one by one and identify the relevant information using desktop computers. However, it is impossible to search and visualize the critical information on a small screen with an intolerable slow downloading speed using handheld devices. Automatic summarization summarizes a document for users to preview its major content. Users may determine if the information fits their needs by reading their summary instead of browsing the whole document one by one. The amount of information displayed and downloading time are significantly reduced. In this paper, we propose the fractal summarization model based on the statistical data and the structure of documents. Information visualization techniques are also employed to reduce the visual loads. Three-tier architecture which reduces the computing load of the handheld devices is discussed. 2. Three-tier Architecture Two-tier architecture is typically utilized for Internet access. The user s PC connects to the Internet directly, and the content loaded will be fed to the web browser and present to the user as illustrated in Figure 1. Web Server HTML Browser User User s PC Figure 1. Traditional Web Browsing The summarizer summarizes a document for users to preview before presenting the whole document. As shown in Figure 2, the content will be first fed to the summarizer after loading to the user s PC. The summarizer connects to database server when necessary and generates a summary to display on the

4 browser. Web Server HTML Document Server XML Summarizer Browser User DB Server for Summarizer SQL User s PC Figure 2 Document Browsing with Summarizer on PC Web Server Document Server HTML XML Summarizer WAP Server WAP Gateway Wireless WML Handheld PDA Local Synchronization User DB Server for SQL WAP Gateway Summarizer Figure 3 Document Browsing with Summarizer on WAP The two-tier architecture cannot be applied on handheld device since the computing power of handheld devices is insufficient to perform summarization and the network connection of mobile network does not provide sufficient bandwidth for navigation between the summarizer and other servers. The three-tier architecture as illustrated in Figure 3 is proposed. A WAP gateway is setup to process the summarization. The WAP gateway connects to Internet trough broadband network. The wireless handheld devices can conduct interactive navigation with the gateway through wireless network to retrieve the summary piece by piece. Alternatively, if the PDA is equipped with more memory, the complete summary can be downloaded to PDA through local synchronization.

5 3. Automatic Summarization 3.1 Automatic Summarization Traditional automatic text summarization is the selection of sentences from the source document based on their significance to the document [5][19]. The selection of sentences is conducted based on the salient features of the document. The thematic, location, title, and cue features are the most widely used summarization features. The thematic feature is first identified by Luhn [19]. Edmundson proposed to assign the thematic weight to keyword based on term frequency, and the sentence weight as the sum of thematic weight of constituent keywords [5]. In information retrieval, absolute term frequency by itself is considered as less useful than term frequency normalized to the document length and term frequency in the collection [11]. As a result, the tfidf (Term Frequency, Inverse Document Frequency) method is purposed to calculate the thematic weight of keyword [23]. The significance of sentence is indicated by its location [2] based on the hypotheses that topic sentences tend to occur at the beginning or in the end of documents or paragraphs [5]. Edmondson proposed to assign positive weights to sentences according to their ordinal position in the document, i.e., the sentences in the first and last paragraphs and the first and last sentences of the paragraphs. There are several functions purposed to calculate the location weight of sentence. Alternatively, the preference of sentence location can be stored in a list called Optimum Position Policy, and the sentence will be selected base on their order in the list [17]. The title feature is proposed based on the hypothesis that the author conceives the title as circumscribing the subject matter of the document. When the author partitions the document into major section, he summarizes it by choosing appropriate heading [5]. The weight of heading is very similar to the keyword approach. A title glossary is a list consisting of all the words in title, sub-title and heading. Positive weights are assigned to the title glossary, where the title words will be assigned a weight relatively prime to the heading words. The heading weight of sentence is calculated by the sum of heading weight of its constituent words. The cue phrase approach is proposed by Edmundson [5] based on the hypothesis that the probable relevance of a sentence is affected by the presence of pragmatic words such as significant, impossible, and hardly. A stored cue dictionary is used to identify the cue phases, which comprise of three sub-dictionaries: (i) bonus word, that are positively relevant; (ii) stigma words, that are negatively relevant; and (iii) null words, that are irrelevant. The cue weigh of sentence is calculated by the sum of cue weight of its constituent words

6 Typical summarization systems select a combination of summarization features [5][16][17], W sentence (s n )= a 1 w cue (s n )+ a 2 w keyword (s n )+ a 3 w title (s n )+ a 4 w location (s n ). where a 1, a 2, a 3, and a 4 are positive integers to adjust the weighting of four summarization methods. The sentences with sentence weight higher than a threshold are selected as part of the summary. 3.2 Fractal Summarization The Traditional Summarization Model considers the source document as a sequence of sentences, the sentences extracted will be concatenated as a summary of a sequence of sentences but ignore the structure of the document. Advance summarization techniques make use of the document structure to calculate the probability of a sentence to be included in the summary, for example, the sentence in the conclusion section should receive a higher score, and some other systems use the graph theory to calculate the weights of sentence. Many studies of human abstraction process has show that the human abstractor search the topic sentences according to the document structure, they start to search the topic sentences from the top-level document structure, they will extend their search to the lower level until they found enough information [6][9]. Therefore the document structure is an important factor in summarization; a high quality summary cannot be generated without consideration of the document structure. In this paper, we propose the fractal summarization model. Besides, the traditional summarization model requires the whole summary to be generated at a single batch. It takes a long time to generate a summary for a long document; therefore it is not suitable for M-Commerce. The fractal summarization model can generate a brief skeleton of summary, and the details of the summary will be generated only on demand of users, and hence, provides an interactive interface. Document Chapter Chapter... Section Section... Sub-section Sub-section... Paragraph Paragraph... Sentence Sentence... Term Term... Word Word... Character Character... Figure 4. Document Structure

7 A document can be represented by a hierarchical structure as shown on Figure 4. A document consists of chapters. A chapter consists of sections. A section may consist of subsections. A section or subsection consists of paragraphs. A paragraph consists of sentences. A sentence consists of terms. A term consists of words. A word consists of characters. A document structure can be considered as a Fractal [20] structure. At the lower abstraction level of a document, more specific information can be obtained. Although a document is not a true mathematical fractal object since a document cannot be viewed in an infinite abstraction level, we may consider a document as a Prefractal [7]. The smallest unit in a document is character; however, neither a character nor a word will convey any meaningful information concerning the overall content of a document. The lowest abstraction level in our consideration is a term. The Fractal Summarization Model applies a similar technique as fractal image compression [1][13]. An image is regularly segmented into sets of non-overlapping square blocks, called range blocks, and then each range block is subdivided into sub range blocks, until a contractive mapping can be found to represent this sub range block. The Fractal Summarization Model generates the summary by a simple recursive deterministic algorithm based on the iterated representation of a document. The original document is partitioned by the document structure, and each block is iteratively partitioned to child blocks until each block can be transformed to some key sentences by traditional summarization methods (Figure 5). Fractal Summarization Algorithm 1. Choose a Compression Ratio. 2. Choose a Threshold Value. 3. Calculate the Sentence Number Quota of the Summary. 4. Divide the document into range blocks 5. Repeat 5.1 For each range block, Calculate the sum of weight of the sentence under the range block. 5.2 Allocate Quota to each block in proportion to their sum of weights. 5.3 For each range block, If the quota is less than threshold value Select the sentence in the range block by traditional summarization method Else Divide the range block into sub range block Repeat Step 5.1, 5.2, Until all the range block is processed

8 Document Weight 1000 Quota 40 Chapter 1 Weight 300 Quota 12 Chapter 2 Weight 500 Quota 20 Chpater 3 Weight 200 Quota 8 Section 1.1 Weight 100 Quota 4 Section 1.2 Weight 150 Quota 6 Section 1.3 Weight 50 Quota 2 Section 2.1 Weight 100 Quota 3 Section 2.2 Weight 250 Quota 10 Section 2.3 Weight 150 Quota 7 Section 3.1 Weight 120 Quota 5 Section 3.2 Weight 80 Quota 3 Paragraphs... Paragraphs... Paragraphs... Figure 5. An Example of Fractal Summarization Model The compression ratio of summarization is defined as the ratio of number of sentences in the summary to the number of sentences in the source document. It was chosen as 25% in most literatures because it is proved that extraction of 20% sentences can be informative as the full text of the source document [21], those summarization systems can achieve up to a recall of 96% [5][14][25]. However, Teufel pointed out the high-compression ratio abstracting is more useful, and he reported a recall of 49.6% at around 4% compression ratio [24]. In order to minimize the bandwidth requirement and reduce the pressure on computing power of handheld devices, the default value of compression ratio is chosen as 4%. By the definition of compression ratio, the sentence quota of the summary can be calculated by the number of sentences in the source document times the compression ratio. The weights of sentences under a range block are calculated by the traditional summarization methods described in Section 3.1. The total sentence weight is a weighted sum of all the summarization features [5]. It is confirmed that the weighting of different feature do not have any substantial effect on the average precision [16], therefore the total weights of sentence is calculated as the sum of all the scores of summarization methods. A threshold value is the maximum number of sentences can be selected from a range block, if the quota is larger than the threshold value, and the range block must be divided into sub-range block. Document summarization is different from image compression, more than one attractor can be chosen in one range block. It is proven that the summarization by extraction of fixed number of sentences, the optimal length of summary is 3 to 5 sentences [9]. The default value of threshold is chosen as 5 in our system. It is believed that a full-length text document contains a set of subtopics [12] and a good quality summary should cover as many subtopic as possible, the fractal summarization will produce a summary with a

9 wider coverage of information subtopic than traditional summarization model. The traditional summarization model extracted most of sentences from few chapters, take the Hong Kong Annual Report as an example (Table 2), the traditional summarization model extracted 29 sentences from one chapter when the sentence quota is 80 sentences, and total 53 sentences extracted from top 3 chapters out of total 23 chapters, 8 chapters without sentence been extracted at all. However, the fractal summarization model extracts the sentences evenly from each chapter. In our example, it extracts maximum 8 sentences from one single chapter, and at least 1 sentences from each the chapter. Fractal Summarization Traditional Summarization Maximum No of sentences extracted from one single chapter 8 29 Minimum No of sentences extracted from one single chapter 1 0 Standard deviation of No of sentences extracted from chapters Table 2: No of sentences extracted by two summarization models from Hong Kong Annual Report 2000 An experiment is conducted to benchmark the fractal summarization with the traditional summarization models. Five college graduates are selected as subjects. They were asked to evaluate the quality of summaries without knowing the generation methods of the summaries. The fractal summarization achieves precision of 0.84 on average, while the traditional summarization achieves precision of 0.67 on average. The fractal summarization outperforms the traditional summarization. 4. Summary Visualization on Handheld Devices The summary generated by Fractal Summarization Model is represented in a tree structure. WML is the markup language supported by wireless handheld devices. The basic unit of a WML file is a desk; each desk must contain one or more cards. The card element defines the content display to users, and the card cannot be nested. Each card links to another card within or across decks. Nodes on the same level of the fractal summarization model are converted into card, and anchor links are utilized to implement the tree structure. Given a card of a summary node, there may be a lot of sentences or child-nodes, a large number of sentences in a small display area makes it difficult to read. Fisheye view is a visualization technique to enlarge the focus of interest and diminish the information that is less important [8] (Figure. 7). The objects in neighborhood of the focus of interest will be displayed with a larger visual size. The visual size of objects that are further away will be decreased inversely proportional to its distance to the focus point. In our system, we have implemented the fisheye view with 3-scale font mode available for WML. The prototype system using Nokia 6590 Handset Simulator is presented on Figure 7. The document

10 presenting is the Hong Kong Annual Report There are totally 23 chapters in the annual report, 6 of them are in large font, which means that they are more important, and the rest are in normal font or small font according to their importance to the report (Figure 7a). The Figure 7b shows the summary of Chapter 19. Figure 6: Example of Fisheye View (a) Chapters of HKAR 2000 (b) Chapter 19 of HKAR 2000 Figure 7. Screen Capture of the WAP Summarization System Figure 8. KWML on Palm V A handheld PDA is usually equipped with more memory, and the complete summary can downloaded as a single WML file to the PDA through local synchronization. To read the summary, the PDA is required to install a standard WML file reader, i.e. KWML as shown in Figure 8 [15].

11 5. Conclusion Mobile commerce is a promising addition to the electronic commerce by the adoption of portable handheld devices. However, there are many shortcomings of the handheld devices, such as limited resolution and low bandwidth. In order to overcome the shortcomings, fractal summarization and visualization by fisheye views are proposed in this paper. The fractal summarization creates a summary in tree structure and presents the summary to the handheld devices through cards in WML. Users may browse the selected summary by clicking the anchor links from the highest abstraction level to the lowest abstraction level. Based on the sentence weight computed by the summarization technique, fisheye views are employed to enlarge the focus of interest and diminish the less significant sentences. Such visualization effect draws users attention on the important content. The three-tier architecture is presented to reduce the computing load of the handheld devices. The proposed system creates an information visualization environment to avoid the existing shortcomings of handheld devices for mobile commerce. 6. References [1] M. F. Barnsley and A. E. Jacquin, Application of recurrent iterated function systems to images. In Proceedings SPIE Visual Communications and Image Processing '88, vol. 1001, pp [2] P. B. Baxendale, Machine-Made Index for Technical Literature - An Experiment. IBM Journal (October). pp [3] O. Buyukkokten, H. Garcia-Molina, A. Paepcke, and T. Winograd, Power Browser: Efficient Web Browsing for PDAs. Human-Computer Interaction Conference The Hague, The Netherlands. [4] O. Buyukkokten, H. Garcia-Molina and A. Paepcke, Text Summarization of Web pages on Handheld Devices. In Proc. of Workshop on Automatic Summarization 2001 held in conjunction with NAACL [5] H.P. Edmundson, New Method in Automatic Extraction. Journal of the ACM, vol. 16, no. 2 pp [6] B. Endres-Niggemeyer, E. Maier, and A. Sigel, How to Implement a Naturalistic Model of Abstracting: Four Core Working Steps of an Expert Abstractor. Information Processing & Management, vol. 31 no. 5, pp [7] J. Feder, Fractals. Plenum, New York. [8] G. W. Furnas, Generalized Fisheye Views. In Proceedings of the SIGCHI Conference on Human Factors in Computing System. [9] B.G. Glaser, and A. L. Strauss, The discovery of grounded theory; strategies for qualitative research. Aldine de Gruyter, New York. [10] J. Goldstein, M. Kantrowitz, V. Mittal, and J. Carbonell, Summarizing text documents: Sentence selection and evaluation metrics. In Proceedings of SIGIR, pp

12 [11] D.K. Harman, Ranking algorithms. In W.B. Frakes and R. Baeza-Yates, editors, Information Retrieval: Data Structures and Algorithms, chapter 14, pp Prentice-Hall. [12] Hearst Marti A, Subtopic Structuring for Full-Length Document Access. In Proc. of the 16th Annual International ACM SIGIR Conf. on Research and Development in Information Retrieval, pp [13] A. Jacquin Fractal image coding: A review. In Proceeding of the IEEE, vol. 81, no. 10, pp [14] J. Kepiec, J. Pedersen, and F. Chen, A Trainable Document Summarizer. In Proc. of the 18th Annual International ACM Conf. on Research and Development in Information Retrieval (SIGIR). pp [15] KWML, KWML - KVM WML (WAP) Browser on Palm. [16] A. M. Lam-Adesina, and G J. F. Jones, Applying summarization Techniques for Term Selection in Relevance Feedback, In Proceeding of SIGIR pp.1-9. [17] C. Y. Lin, and E.H. Hovy Identifying Topics by Position. In Proc. of the Applied Natural Language Processing Conference (ANLP-97), Washington, DC, pp [18] H. Lo M-Mail: A Case Study of Dynamic Application Partitioning in Mobile Computing, Master Thesis of Waterloo University. [19] H.P Luhn The Automatic Creation of Literature Abstracts. IBM Journal of Research and Development, pp [20] B. Mandelbrot, The fractal geometry of nature, New York: W.H. Freeman. [21] G. Morris, G. M. Kasper, and D. A Adams, The effect and limitation of automated text condensing on reading comprehension performance. Information System Research, pp [22] PALM, PALM: Providing Fluid Connectivity in a Wireless World, White Paper of Palm Inc., [23] G. Salton, and C. Buckley, Term-Weighting Approaches in Automatic Text Retrieval, Information Processing and Management, vol. 24, pp [24] S. Teufel, and M. Moens, Sentence Extraction and rhetorical classification for flexible abstracts, AAAI Spring Symposium on Intelligent Text summarization, Stanford. [25] S. Teufel, and M. Moens, Sentence Extraction as a Classification Task, In Workshop Intelligent and scalable Text summarization, ACL/EACL.

Domain Specific Search Engine for Students

Domain Specific Search Engine for Students Domain Specific Search Engine for Students Domain Specific Search Engine for Students Wai Yuen Tang The Department of Computer Science City University of Hong Kong, Hong Kong wytang@cs.cityu.edu.hk Lam

More information

Heading-Based Sectional Hierarchy Identification for HTML Documents

Heading-Based Sectional Hierarchy Identification for HTML Documents Heading-Based Sectional Hierarchy Identification for HTML Documents 1 Dept. of Computer Engineering, Boğaziçi University, Bebek, İstanbul, 34342, Turkey F. Canan Pembe 1,2 and Tunga Güngör 1 2 Dept. of

More information

A Novel PAT-Tree Approach to Chinese Document Clustering

A Novel PAT-Tree Approach to Chinese Document Clustering A Novel PAT-Tree Approach to Chinese Document Clustering Kenny Kwok, Michael R. Lyu, Irwin King Department of Computer Science and Engineering The Chinese University of Hong Kong Shatin, N.T., Hong Kong

More information

Classification and Summarization: A Machine Learning Approach

Classification and Summarization: A Machine Learning Approach Email Classification and Summarization: A Machine Learning Approach Taiwo Ayodele Rinat Khusainov David Ndzi Department of Electronics and Computer Engineering University of Portsmouth, United Kingdom

More information

Inferring User Search for Feedback Sessions

Inferring User Search for Feedback Sessions Inferring User Search for Feedback Sessions Sharayu Kakade 1, Prof. Ranjana Barde 2 PG Student, Department of Computer Science, MIT Academy of Engineering, Pune, MH, India 1 Assistant Professor, Department

More information

Chapter 3. Technology Adopted. 3.1 Introduction

Chapter 3. Technology Adopted. 3.1 Introduction Chapter 3 Technology Adopted 3.1 Introduction The previous chapter described difference between the propose system and traditional methods and also about the existing similar systems. In this chapter,

More information

A Model for Information Retrieval Agent System Based on Keywords Distribution

A Model for Information Retrieval Agent System Based on Keywords Distribution A Model for Information Retrieval Agent System Based on Keywords Distribution Jae-Woo LEE Dept of Computer Science, Kyungbok College, 3, Sinpyeong-ri, Pocheon-si, 487-77, Gyeonggi-do, Korea It2c@koreaackr

More information

Performance Evaluation of XHTML encoding and compression

Performance Evaluation of XHTML encoding and compression Performance Evaluation of XHTML encoding and compression Sathiamoorthy Manoharan Department of Computer Science, University of Auckland, Auckland, New Zealand Abstract. The wireless markup language (WML),

More information

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS 1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,

More information

A Miniature-Based Image Retrieval System

A Miniature-Based Image Retrieval System A Miniature-Based Image Retrieval System Md. Saiful Islam 1 and Md. Haider Ali 2 Institute of Information Technology 1, Dept. of Computer Science and Engineering 2, University of Dhaka 1, 2, Dhaka-1000,

More information

Iterated Functions Systems and Fractal Coding

Iterated Functions Systems and Fractal Coding Qing Jun He 90121047 Math 308 Essay Iterated Functions Systems and Fractal Coding 1. Introduction Fractal coding techniques are based on the theory of Iterated Function Systems (IFS) founded by Hutchinson

More information

A Combined Method of Text Summarization via Sentence Extraction

A Combined Method of Text Summarization via Sentence Extraction Proceedings of the 2007 WSEAS International Conference on Computer Engineering and Applications, Gold Coast, Australia, January 17-19, 2007 434 A Combined Method of Text Summarization via Sentence Extraction

More information

A Web Page Segmentation Method by using Headlines to Web Contents as Separators and its Evaluations

A Web Page Segmentation Method by using Headlines to Web Contents as Separators and its Evaluations IJCSNS International Journal of Computer Science and Network Security, VOL.13 No.1, January 2013 1 A Web Page Segmentation Method by using Headlines to Web Contents as Separators and its Evaluations Hiroyuki

More information

Supervised classification of law area in the legal domain

Supervised classification of law area in the legal domain AFSTUDEERPROJECT BSC KI Supervised classification of law area in the legal domain Author: Mees FRÖBERG (10559949) Supervisors: Evangelos KANOULAS Tjerk DE GREEF June 24, 2016 Abstract Search algorithms

More information

A Constrained Spreading Activation Approach to Collaborative Filtering

A Constrained Spreading Activation Approach to Collaborative Filtering A Constrained Spreading Activation Approach to Collaborative Filtering Josephine Griffith 1, Colm O Riordan 1, and Humphrey Sorensen 2 1 Dept. of Information Technology, National University of Ireland,

More information

Deep Web Crawling and Mining for Building Advanced Search Application

Deep Web Crawling and Mining for Building Advanced Search Application Deep Web Crawling and Mining for Building Advanced Search Application Zhigang Hua, Dan Hou, Yu Liu, Xin Sun, Yanbing Yu {hua, houdan, yuliu, xinsun, yyu}@cc.gatech.edu College of computing, Georgia Tech

More information

Domain-specific Concept-based Information Retrieval System

Domain-specific Concept-based Information Retrieval System Domain-specific Concept-based Information Retrieval System L. Shen 1, Y. K. Lim 1, H. T. Loh 2 1 Design Technology Institute Ltd, National University of Singapore, Singapore 2 Department of Mechanical

More information

Empirical Analysis of Single and Multi Document Summarization using Clustering Algorithms

Empirical Analysis of Single and Multi Document Summarization using Clustering Algorithms Engineering, Technology & Applied Science Research Vol. 8, No. 1, 2018, 2562-2567 2562 Empirical Analysis of Single and Multi Document Summarization using Clustering Algorithms Mrunal S. Bewoor Department

More information

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN 2016 International Conference on Artificial Intelligence: Techniques and Applications (AITA 2016) ISBN: 978-1-60595-389-2 Face Recognition Using Vector Quantization Histogram and Support Vector Machine

More information

A New Approach to Fractal Image Compression Using DBSCAN

A New Approach to Fractal Image Compression Using DBSCAN International Journal of Electrical Energy, Vol. 2, No. 1, March 2014 A New Approach to Fractal Image Compression Using DBSCAN Jaseela C C and Ajay James Dept. of Computer Science & Engineering, Govt.

More information

CADIAL Search Engine at INEX

CADIAL Search Engine at INEX CADIAL Search Engine at INEX Jure Mijić 1, Marie-Francine Moens 2, and Bojana Dalbelo Bašić 1 1 Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000 Zagreb, Croatia {jure.mijic,bojana.dalbelo}@fer.hr

More information

Comment Extraction from Blog Posts and Its Applications to Opinion Mining

Comment Extraction from Blog Posts and Its Applications to Opinion Mining Comment Extraction from Blog Posts and Its Applications to Opinion Mining Huan-An Kao, Hsin-Hsi Chen Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan

More information

REDUNDANCY REMOVAL IN WEB SEARCH RESULTS USING RECURSIVE DUPLICATION CHECK ALGORITHM. Pudukkottai, Tamil Nadu, India

REDUNDANCY REMOVAL IN WEB SEARCH RESULTS USING RECURSIVE DUPLICATION CHECK ALGORITHM. Pudukkottai, Tamil Nadu, India REDUNDANCY REMOVAL IN WEB SEARCH RESULTS USING RECURSIVE DUPLICATION CHECK ALGORITHM Dr. S. RAVICHANDRAN 1 E.ELAKKIYA 2 1 Head, Dept. of Computer Science, H. H. The Rajah s College, Pudukkottai, Tamil

More information

Creating a Classifier for a Focused Web Crawler

Creating a Classifier for a Focused Web Crawler Creating a Classifier for a Focused Web Crawler Nathan Moeller December 16, 2015 1 Abstract With the increasing size of the web, it can be hard to find high quality content with traditional search engines.

More information

Web Page Fragmentation for Personalized Portal Construction

Web Page Fragmentation for Personalized Portal Construction Web Page Fragmentation for Personalized Portal Construction Bouras Christos Kapoulas Vaggelis Misedakis Ioannis Research Academic Computer Technology Institute, 6 Riga Feraiou Str., 2622 Patras, Greece

More information

Automatic Text Summarization System Using Extraction Based Technique

Automatic Text Summarization System Using Extraction Based Technique Automatic Text Summarization System Using Extraction Based Technique 1 Priyanka Gonnade, 2 Disha Gupta 1,2 Assistant Professor 1 Department of Computer Science and Engineering, 2 Department of Computer

More information

Extraction of Web Image Information: Semantic or Visual Cues?

Extraction of Web Image Information: Semantic or Visual Cues? Extraction of Web Image Information: Semantic or Visual Cues? Georgina Tryfou and Nicolas Tsapatsoulis Cyprus University of Technology, Department of Communication and Internet Studies, Limassol, Cyprus

More information

Relevance Feature Discovery for Text Mining

Relevance Feature Discovery for Text Mining Relevance Feature Discovery for Text Mining Laliteshwari 1,Clarish 2,Mrs.A.G.Jessy Nirmal 3 Student, Dept of Computer Science and Engineering, Agni College Of Technology, India 1,2 Asst Professor, Dept

More information

A New Mode of Browsing Web Tables on Small Screens

A New Mode of Browsing Web Tables on Small Screens A New Mode of Browsing Web Tables on Small Screens Wenchang Xu, Xin Yang, Yuanchun Shi Department of Computer Science and Technology, Tsinghua University, Beijing, P.R. China stefanie8806@gmail.com; yang-x02@mails.tsinghua.edu.cn;

More information

Statistical Performance Comparisons of Computers

Statistical Performance Comparisons of Computers Tianshi Chen 1, Yunji Chen 1, Qi Guo 1, Olivier Temam 2, Yue Wu 1, Weiwu Hu 1 1 State Key Laboratory of Computer Architecture, Institute of Computing Technology (ICT), Chinese Academy of Sciences, Beijing,

More information

A Linear Regression Model for Assessing the Ranking of Web Sites Based on Number of Visits

A Linear Regression Model for Assessing the Ranking of Web Sites Based on Number of Visits A Linear Regression Model for Assessing the Ranking of Web Sites Based on Number of Visits Dowming Yeh, Pei-Chen Sun, and Jia-Wen Lee National Kaoshiung Normal University Kaoshiung, Taiwan 802, Republic

More information

A Study for Documents Summarization based on Personal Annotation

A Study for Documents Summarization based on Personal Annotation A Study for Documents Summarization based on Personal Annotation Haiqin Zhang University of Science and Technology of China face@mail.ustc.edu. cn ZhengChen Wei-yingMa Microsoft Research Asia zhengc@microsoft.com

More information

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating Dipak J Kakade, Nilesh P Sable Department of Computer Engineering, JSPM S Imperial College of Engg. And Research,

More information

Image Classification Using Wavelet Coefficients in Low-pass Bands

Image Classification Using Wavelet Coefficients in Low-pass Bands Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August -7, 007 Image Classification Using Wavelet Coefficients in Low-pass Bands Weibao Zou, Member, IEEE, and Yan

More information

A Practical Passage-based Approach for Chinese Document Retrieval

A Practical Passage-based Approach for Chinese Document Retrieval A Practical Passage-based Approach for Chinese Document Retrieval Szu-Yuan Chi 1, Chung-Li Hsiao 1, Lee-Feng Chien 1,2 1. Department of Information Management, National Taiwan University 2. Institute of

More information

Lecture #3: PageRank Algorithm The Mathematics of Google Search

Lecture #3: PageRank Algorithm The Mathematics of Google Search Lecture #3: PageRank Algorithm The Mathematics of Google Search We live in a computer era. Internet is part of our everyday lives and information is only a click away. Just open your favorite search engine,

More information

Information Retrieval

Information Retrieval Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,

More information

Image Similarity Measurements Using Hmok- Simrank

Image Similarity Measurements Using Hmok- Simrank Image Similarity Measurements Using Hmok- Simrank A.Vijay Department of computer science and Engineering Selvam College of Technology, Namakkal, Tamilnadu,india. k.jayarajan M.E (Ph.D) Assistant Professor,

More information

Fractal Compression. Related Topic Report. Henry Xiao. Queen s University. Kingston, Ontario, Canada. April 2004

Fractal Compression. Related Topic Report. Henry Xiao. Queen s University. Kingston, Ontario, Canada. April 2004 Fractal Compression Related Topic Report By Henry Xiao Queen s University Kingston, Ontario, Canada April 2004 Fractal Introduction Fractal is first introduced in geometry field. The birth of fractal geometry

More information

Genetic Algorithm based Fractal Image Compression

Genetic Algorithm based Fractal Image Compression Vol.3, Issue.2, March-April. 2013 pp-1123-1128 ISSN: 2249-6645 Genetic Algorithm based Fractal Image Compression Mahesh G. Huddar Lecturer, Dept. of CSE,Hirasugar Institute of Technology, Nidasoshi, India

More information

Developing a Mobile Information Service

Developing a Mobile Information Service Developing a Mobile Information Service Proceedings of the 15 th Annual NACCQ, Hamilton New Zealand July, 2002 www.naccq.ac.nz ABSTRACT T.T. Goh Chris Chen Victoria University Wellington Wellington, New

More information

Feature Selecting Model in Automatic Text Categorization of Chinese Financial Industrial News

Feature Selecting Model in Automatic Text Categorization of Chinese Financial Industrial News Selecting Model in Automatic Text Categorization of Chinese Industrial 1) HUEY-MING LEE 1 ), PIN-JEN CHEN 1 ), TSUNG-YEN LEE 2) Department of Information Management, Chinese Culture University 55, Hwa-Kung

More information

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Thaer Samar 1, Alejandro Bellogín 2, and Arjen P. de Vries 1 1 Centrum Wiskunde & Informatica, {samar,arjen}@cwi.nl

More information

An Empirical Study of Web Interface Design on Small Display Devices

An Empirical Study of Web Interface Design on Small Display Devices An Empirical Study of Web Interface Design on Small Display Devices Mei Kang QIU Kang ZHANG Maolin HUANG Department of Computer Science Department of Computer Science Department of Computer Systems University

More information

A Hierarchical Document Clustering Approach with Frequent Itemsets

A Hierarchical Document Clustering Approach with Frequent Itemsets A Hierarchical Document Clustering Approach with Frequent Itemsets Cheng-Jhe Lee, Chiun-Chieh Hsu, and Da-Ren Chen Abstract In order to effectively retrieve required information from the large amount of

More information

Integrating Image Content and its Associated Text in a Web Image Retrieval Agent

Integrating Image Content and its Associated Text in a Web Image Retrieval Agent From: AAAI Technical Report SS-97-03. Compilation copyright 1997, AAAI (www.aaai.org). All rights reserved. Integrating Image Content and its Associated Text in a Web Image Retrieval Agent Victoria Meza

More information

Fault Identification from Web Log Files by Pattern Discovery

Fault Identification from Web Log Files by Pattern Discovery ABSTRACT International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2017 IJSRCSEIT Volume 2 Issue 2 ISSN : 2456-3307 Fault Identification from Web Log Files

More information

Canalization and Personalization in Mobile Wireless Application

Canalization and Personalization in Mobile Wireless Application Proceedings of the Postgraduate Annual Research Seminar 2005 301 Canalization and Personalization in Mobile Wireless Application Herman Department of Computer System & Communication Faculty of Computer

More information

Integration of Handwriting Recognition in Butterfly Net

Integration of Handwriting Recognition in Butterfly Net Integration of Handwriting Recognition in Butterfly Net Sye-Min Christina Chan Department of Computer Science Stanford University Stanford, CA 94305 USA sychan@stanford.edu Abstract ButterflyNet allows

More information

A Review on Identifying the Main Content From Web Pages

A Review on Identifying the Main Content From Web Pages A Review on Identifying the Main Content From Web Pages Madhura R. Kaddu 1, Dr. R. B. Kulkarni 2 1, 2 Department of Computer Scienece and Engineering, Walchand Institute of Technology, Solapur University,

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things.

Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. + What is Data? Data is a collection of facts. Data can be in the form of numbers, words, measurements, observations or even just descriptions of things. In most cases, data needs to be interpreted and

More information

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Thaer Samar 1, Alejandro Bellogín 2, and Arjen P. de Vries 1 1 Centrum Wiskunde & Informatica, {samar,arjen}@cwi.nl

More information

An Elevated Area Classification Scheme Situated on Regional Fractal Dimension Himanshu Tayagi Trinity College, Tublin, Ireland

An Elevated Area Classification Scheme Situated on Regional Fractal Dimension Himanshu Tayagi Trinity College, Tublin, Ireland An Elevated Area Classification Scheme Situated on Regional Fractal Dimension Himanshu Tayagi Trinity College, Tublin, Ireland ================================================================= Abstract

More information

ARCHITECTURE AND IMPLEMENTATION OF A NEW USER INTERFACE FOR INTERNET SEARCH ENGINES

ARCHITECTURE AND IMPLEMENTATION OF A NEW USER INTERFACE FOR INTERNET SEARCH ENGINES ARCHITECTURE AND IMPLEMENTATION OF A NEW USER INTERFACE FOR INTERNET SEARCH ENGINES Fidel Cacheda, Alberto Pan, Lucía Ardao, Angel Viña Department of Tecnoloxías da Información e as Comunicacións, Facultad

More information

Distributed T-Way Test Suite Data Generation using Exhaustive Search Method with Map and Reduce Framework

Distributed T-Way Test Suite Data Generation using Exhaustive Search Method with Map and Reduce Framework 2010 IEEE Symposium on Industrial Electronics and Applications (ISIEA 2010), October 3-5, 2010, Penang, Malaysia Distributed T-Way Test Suite Data using Exhaustive Search Method with Map and Reduce Framework

More information

FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M.

FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M. 322 FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING Moheb R. Girgis and Mohammed M. Talaat Abstract: Fractal image compression (FIC) is a

More information

An Application of Knowledge Management for Construction Using Mobile PDA System

An Application of Knowledge Management for Construction Using Mobile PDA System An Application of Knowledge Management for Construction Using Mobile PDA System Sheng-Tai LIN a, Yu-Cheng LIN b, and H. Ping TSERNG c a Graduate Student, Department of Civil Engineering, National Taiwan

More information

An Adaptive Threshold LBP Algorithm for Face Recognition

An Adaptive Threshold LBP Algorithm for Face Recognition An Adaptive Threshold LBP Algorithm for Face Recognition Xiaoping Jiang 1, Chuyu Guo 1,*, Hua Zhang 1, and Chenghua Li 1 1 College of Electronics and Information Engineering, Hubei Key Laboratory of Intelligent

More information

Hybrid image coding based on partial fractal mapping

Hybrid image coding based on partial fractal mapping Signal Processing: Image Communication 15 (2000) 767}779 Hybrid image coding based on partial fractal mapping Zhou Wang, David Zhang*, Yinglin Yu Department of Electrical and Computer Engineering, University

More information

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines The Scientific World Journal Volume 2013, Article ID 596724, 6 pages http://dx.doi.org/10.1155/2013/596724 Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines Weizhe

More information

A Framework for Delivery of Thai Content through Mobile Devices

A Framework for Delivery of Thai Content through Mobile Devices A Framework for Delivery of Thai Content through Mobile Devices Chuleerat Jaruskulchai, Atichart Khanthong, and Wanlapa Tantiprasongchai Intelligent Information Retrieval and Database Department of Computer

More information

A A A. Fig.1 image patch. Then the edge gradient magnitude is . (1)

A A A. Fig.1 image patch. Then the edge gradient magnitude is . (1) International Conference on Information Science and Computer Applications (ISCA 013) Two-Dimensional Barcode Image Super-Resolution Reconstruction Via Sparse Representation Gaosheng Yang 1,Ningzhong Liu

More information

Software Testing Fundamentals. Software Testing Techniques. Information Flow in Testing. Testing Objectives

Software Testing Fundamentals. Software Testing Techniques. Information Flow in Testing. Testing Objectives Software Testing Fundamentals Software Testing Techniques Peter Lo Software Testing is a critical element of software quality assurance and represents the ultimate review of specification, design and coding.

More information

Weighted Page Rank Algorithm Based on Number of Visits of Links of Web Page

Weighted Page Rank Algorithm Based on Number of Visits of Links of Web Page International Journal of Soft Computing and Engineering (IJSCE) ISSN: 31-307, Volume-, Issue-3, July 01 Weighted Page Rank Algorithm Based on Number of Visits of Links of Web Page Neelam Tyagi, Simple

More information

SECURE SMART GRID DEVICE for HOME AREA NETWORKS Using WIRELESS APPLICATION PROTOCOL

SECURE SMART GRID DEVICE for HOME AREA NETWORKS Using WIRELESS APPLICATION PROTOCOL 826 SECURE SMART GRID DEVICE for HOME AREA NETWORKS Using WIRELESS APPLICATION PROTOCOL S. RAM KANNAN B.E, CSE Chennai, TN India ramkannan1991@gmail.com R. SABARISH B.E, CSE Chennai, TN India sabarish.raghu@gmail.com

More information

Using Query History to Prune Query Results

Using Query History to Prune Query Results Using Query History to Prune Query Results Daniel Waegel Ursinus College Department of Computer Science dawaegel@gmail.com April Kontostathis Ursinus College Department of Computer Science akontostathis@ursinus.edu

More information

News Filtering and Summarization System Architecture for Recognition and Summarization of News Pages

News Filtering and Summarization System Architecture for Recognition and Summarization of News Pages Bonfring International Journal of Data Mining, Vol. 7, No. 2, May 2017 11 News Filtering and Summarization System Architecture for Recognition and Summarization of News Pages Bamber and Micah Jason Abstract---

More information

Information Discovery, Extraction and Integration for the Hidden Web

Information Discovery, Extraction and Integration for the Hidden Web Information Discovery, Extraction and Integration for the Hidden Web Jiying Wang Department of Computer Science University of Science and Technology Clear Water Bay, Kowloon Hong Kong cswangjy@cs.ust.hk

More information

STUDYING OF CLASSIFYING CHINESE SMS MESSAGES

STUDYING OF CLASSIFYING CHINESE SMS MESSAGES STUDYING OF CLASSIFYING CHINESE SMS MESSAGES BASED ON BAYESIAN CLASSIFICATION 1 LI FENG, 2 LI JIGANG 1,2 Computer Science Department, DongHua University, Shanghai, China E-mail: 1 Lifeng@dhu.edu.cn, 2

More information

Navigating Product Catalogs Through OFDAV Graph Visualization

Navigating Product Catalogs Through OFDAV Graph Visualization Navigating Product Catalogs Through OFDAV Graph Visualization Mao Lin Huang Department of Computer Systems Faculty of Information Technology University of Technology, Sydney NSW 2007, Australia maolin@it.uts.edu.au

More information

Improving Suffix Tree Clustering Algorithm for Web Documents

Improving Suffix Tree Clustering Algorithm for Web Documents International Conference on Logistics Engineering, Management and Computer Science (LEMCS 2015) Improving Suffix Tree Clustering Algorithm for Web Documents Yan Zhuang Computer Center East China Normal

More information

I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING. Andrii Donchenko

I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING. Andrii Donchenko International Journal "Information Technologies and Knowledge" Vol.1 / 2007 293 I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING Andrii Donchenko Abstract: This article considers

More information

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN: IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T

More information

HYBRIDIZED MODEL FOR EFFICIENT MATCHING AND DATA PREDICTION IN INFORMATION RETRIEVAL

HYBRIDIZED MODEL FOR EFFICIENT MATCHING AND DATA PREDICTION IN INFORMATION RETRIEVAL International Journal of Mechanical Engineering & Computer Sciences, Vol.1, Issue 1, Jan-Jun, 2017, pp 12-17 HYBRIDIZED MODEL FOR EFFICIENT MATCHING AND DATA PREDICTION IN INFORMATION RETRIEVAL BOMA P.

More information

CIS UDEL Working Notes on ImageCLEF 2015: Compound figure detection task

CIS UDEL Working Notes on ImageCLEF 2015: Compound figure detection task CIS UDEL Working Notes on ImageCLEF 2015: Compound figure detection task Xiaolong Wang, Xiangying Jiang, Abhishek Kolagunda, Hagit Shatkay and Chandra Kambhamettu Department of Computer and Information

More information

Multi-scale Techniques for Document Page Segmentation

Multi-scale Techniques for Document Page Segmentation Multi-scale Techniques for Document Page Segmentation Zhixin Shi and Venu Govindaraju Center of Excellence for Document Analysis and Recognition (CEDAR), State University of New York at Buffalo, Amherst

More information

SEQUENTIAL PATTERN MINING FROM WEB LOG DATA

SEQUENTIAL PATTERN MINING FROM WEB LOG DATA SEQUENTIAL PATTERN MINING FROM WEB LOG DATA Rajashree Shettar 1 1 Associate Professor, Department of Computer Science, R. V College of Engineering, Karnataka, India, rajashreeshettar@rvce.edu.in Abstract

More information

Train schedule diagram drawing algorithm considering interrelationship between labels

Train schedule diagram drawing algorithm considering interrelationship between labels Train schedule diagram drawing algorithm considering interrelationship between labels H. Izumi', N. Tomii',2 The University of Electro-Communications, Japan. 2Railway Technical Research Institute, Japan.

More information

Mobile Data Management Structure Design based on Mobile Database

Mobile Data Management Structure Design based on Mobile Database International Conference on Materials Engineering and Information Technology Applications (MEITA 2015) Mobile Data Management Structure Design based on Mobile Database Xin Hu Library, Nanchang Institute

More information

Visualizing the Thematic Update Status of Web and WAP Sites on Mobile Phones *

Visualizing the Thematic Update Status of Web and WAP Sites on Mobile Phones * Visualizing the Thematic Update Status of Web and WAP Sites on Mobile Phones * Luca Chittaro HCI Lab, Dept. of Math and Computer Science, University of Udine via delle Scienze, 206, 33100 Udine, ITALY

More information

Semi supervised clustering for Text Clustering

Semi supervised clustering for Text Clustering Semi supervised clustering for Text Clustering N.Saranya 1 Assistant Professor, Department of Computer Science and Engineering, Sri Eshwar College of Engineering, Coimbatore 1 ABSTRACT: Based on clustering

More information

High Capacity Reversible Watermarking Scheme for 2D Vector Maps

High Capacity Reversible Watermarking Scheme for 2D Vector Maps Scheme for 2D Vector Maps 1 Information Management Department, China National Petroleum Corporation, Beijing, 100007, China E-mail: jxw@petrochina.com.cn Mei Feng Research Institute of Petroleum Exploration

More information

The Unified Segment Tree and its Application to the Rectangle Intersection Problem

The Unified Segment Tree and its Application to the Rectangle Intersection Problem CCCG 2013, Waterloo, Ontario, August 10, 2013 The Unified Segment Tree and its Application to the Rectangle Intersection Problem David P. Wagner Abstract In this paper we introduce a variation on the multidimensional

More information

Accordion Summarization for End-Game Browsing on PDAs and Cellular Phones

Accordion Summarization for End-Game Browsing on PDAs and Cellular Phones Accordion Summarization for End-Game Browsing on PDAs and Cellular Phones Orkut Buyukkokten, Hector Garcia-Molina, Andreas Paepcke Digital Libraries Lab (InfoLab), Stanford University, Stanford, CA, 94305

More information

International Journal of Video& Image Processing and Network Security IJVIPNS-IJENS Vol:10 No:02 7

International Journal of Video& Image Processing and Network Security IJVIPNS-IJENS Vol:10 No:02 7 International Journal of Video& Image Processing and Network Security IJVIPNS-IJENS Vol:10 No:02 7 A Hybrid Method for Extracting Key Terms of Text Documents Ahmad Ali Al-Zubi Computer Science Department

More information

A Machine Learning Approach for Displaying Query Results in Search Engines

A Machine Learning Approach for Displaying Query Results in Search Engines A Machine Learning Approach for Displaying Query Results in Search Engines Tunga Güngör 1,2 1 Boğaziçi University, Computer Engineering Department, Bebek, 34342 İstanbul, Turkey 2 Visiting Professor at

More information

Enhancing Web Page Skimmability

Enhancing Web Page Skimmability Enhancing Web Page Skimmability Chen-Hsiang Yu MIT CSAIL 32 Vassar St Cambridge, MA 02139 chyu@mit.edu Robert C. Miller MIT CSAIL 32 Vassar St Cambridge, MA 02139 rcm@mit.edu Abstract Information overload

More information

From Passages into Elements in XML Retrieval

From Passages into Elements in XML Retrieval From Passages into Elements in XML Retrieval Kelly Y. Itakura David R. Cheriton School of Computer Science, University of Waterloo 200 Univ. Ave. W. Waterloo, ON, Canada yitakura@cs.uwaterloo.ca Charles

More information

Graph Fractals. An Honors Thesis (Honrs 499) by Ben J. Kelly. Thesis Advisor. Ball State University Muncie, Indiana. May 1995

Graph Fractals. An Honors Thesis (Honrs 499) by Ben J. Kelly. Thesis Advisor. Ball State University Muncie, Indiana. May 1995 Graph Fractals An Honors Thesis (Honrs 499) by Ben J. Kelly Thesis Advisor Ball State University Muncie, Indiana May 1995 Expected Date Of Graduation: May 6, 1995 ~, 5fCol! rj,e5;s ~ 7) 2 J+B(). L~ if

More information

Beyond the desktop interactive visualizations

Beyond the desktop interactive visualizations Beyond the desktop interactive visualizations Hauptseminar Information Visualization - Wintersemester 2008/2009" Till Ballendat LFE Medieninformatik 17.02.2009 LMU Department of Media Informatics Hauptseminar

More information

A new fractal algorithm to model discrete sequences

A new fractal algorithm to model discrete sequences A new fractal algorithm to model discrete sequences Zhai Ming-Yue( 翟明岳 ) a) Heidi Kuzuma b) and James W. Rector b)c) a) School of EE Engineering North China Electric Power University Beijing 102206 China

More information

Keywords Data alignment, Data annotation, Web database, Search Result Record

Keywords Data alignment, Data annotation, Web database, Search Result Record Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web

More information

Knowledge Engineering in Search Engines

Knowledge Engineering in Search Engines San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research Spring 2012 Knowledge Engineering in Search Engines Yun-Chieh Lin Follow this and additional works at:

More information

Channel-Adaptive Error Protection for Scalable Audio Streaming over Wireless Internet

Channel-Adaptive Error Protection for Scalable Audio Streaming over Wireless Internet Channel-Adaptive Error Protection for Scalable Audio Streaming over Wireless Internet GuiJin Wang Qian Zhang Wenwu Zhu Jianping Zhou Department of Electronic Engineering, Tsinghua University, Beijing,

More information

The RAM Computation Model

The RAM Computation Model Department of Computer Science and Engineering Chinese University of Hong Kong This is not a programming course. Take-away message 1 from this course Programming is the last step in software development,

More information

Effect of log-based Query Term Expansion on Retrieval Effectiveness in Patent Searching

Effect of log-based Query Term Expansion on Retrieval Effectiveness in Patent Searching Effect of log-based Query Term Expansion on Retrieval Effectiveness in Patent Searching Wolfgang Tannebaum, Parvaz Madabi and Andreas Rauber Institute of Software Technology and Interactive Systems, Vienna

More information

Development of a Web based Education Module for Verilog HDL

Development of a Web based Education Module for Verilog HDL Proceedings of the 2010 International Conference on Industrial Engineering and Operations Management Dhaka, Bangladesh, January 9 10, 2010 Development of a Web based Education Module for Verilog HDL Rifat

More information

Pak. J. Biotechnol. Vol. 13 (special issue on Innovations in information Embedded and Communication Systems) Pp (2016)

Pak. J. Biotechnol. Vol. 13 (special issue on Innovations in information Embedded and Communication Systems) Pp (2016) FRACTAL IMAGE COMPRESSIO USIG QUATUM ALGORITHM T Janani* and M Bharathi* Department of Electronics and Communication Engineering, Kumaraguru College of Technology, Coimbatore, India - 641049. Email: bharathi.m.ece@kct.ac.in,

More information

AN IMPROVED DOMAIN CLASSIFICATION SCHEME BASED ON LOCAL FRACTAL DIMENSION

AN IMPROVED DOMAIN CLASSIFICATION SCHEME BASED ON LOCAL FRACTAL DIMENSION AN IMPROVED DOMAIN CLASSIFICATION SCHEME BASED ON LOCAL FRACTAL DIMENSION JAYAMOHAN M. Department of Computer Science, College of Applied Science, Adoor, Kerala, India, 691523. jmohanm@gmail.com K. REVATHY

More information