Preprocessing of Web Usage Data for Application in Prefetching to Reduce Web Latency

Size: px
Start display at page:

Download "Preprocessing of Web Usage Data for Application in Prefetching to Reduce Web Latency"

Transcription

1 Internatonal Journal of Electrcal& Computer Scences IJECS-IJENS Vol:14 No:04 1 Preprocessng of Web Usage Data for Applcaton n Prefetchng to Reduce Web Latency G T Raju Professor, Department of CSE, RNS Insttute of Technology, Bangalore Vsvesvaraya Technologcal Unversty, Karnataka, INDIA gtraju1990@yahoo.com Nandn N Research Scholar, Bharathyar Unversty Lecturer, Department of CSE RNS Insttute of Technology, BangaloreVsvesvaraya Technologcal Unversty,Karnataka, INDIA nandu_8449@redffmal.com Abstract--The Popularty of Web resulted n heavy traffc n the Internet. Ths ntense ncrease n nternet traffc has caused sgnfcant ncrease n the user perceved latency. In order to reduce ths effectvely. Prefetchng technques are found to be best sutable. Prefetchng technque s motvated by the fact that, n general, once a user goes to a Web ste; he/she generally browses around for several pages before leavng for another ste. Snce the user follows hyperlnks upon hs/her nterests, t s lkely that lnks are not followed unformly. It s possble to ether predct each user s nterest usng cookes or mne a consensus of nterests (.e., generally what pages wll be requested after the current page) wth some confdence from access log fles recorded by the Web server. Ths nformaton not only s valuable for the Web admnstrator to elmnate unnterested pages, or balance load among the servers, but also can help to mprove Web-browsng tme. In ths paper, we propose a comprehensve preprocessng methodology as a prerequste and frst stage for Prefetchng applcaton, whch has four steps: Data Cleanng, Identfcaton of users & Sessons, and fnally the Data Formattng and Summarzaton. An attempt s made to reduce the quantty of the WUD and thereby mprove the qualty of WUD for effectve use n Prefetchng applcaton. Several heurstcs have been proposed for cleanng the WUD whch s then aggregated and recorded n the relatonal data model. To valdate the effcency of our preprocessng methodology, several experments were conducted log fles on three dfferent web stes: Academc, Research, Commercal and the results shows that our methodology reduces the Web access log fles sze down to 72-83% of the ntal sze and offer rcher logs that are structured for applcaton n Prefetchng. Index Term-- Preprocessng, Prefetchng, Web Usage Data, Web Usage Mnng. 1. INTRODUCTION The exponental growth of the Web n terms of Web stes and ther users durng the last decade has generated huge amount of data related to the user s nteractons wth the Web stes. Ths data s recorded n the Web access log fles of Web servers and usually referred as Web Usage Data (WUD). Web Usage Mnng (WUM) deals wth the applcaton of data mnng technques to extract nterestng knowledge from the WUD. Data preprocessng has a fundamental role n Web Usage Mnng (WUM) applcatons. A sgnfcant problem wth most of the pattern dscovery methods s that, ther dffculty n handlng very large scales of WUD. Despte the fact that, most of the WUM processes done off-lne, the sze of WUD s n the orders of magntude larger than those met n common applcatons of machne learnng. Rushng to analyze usage data wthout a proper preprocessng method wll lead to poor results or even to falure. Preprocessng methodology has not receved enough analyss efforts. Hence managng the quantty of data that s contnuously ncreasng, and the great dversty of pages on Web ste has become crtcal for WUM applcatons. Prefetchng means fetchng of Web pages before the users request them. Prefetchng technque s lke a Web cachng technque to reduce the user perceved latency. In Web usage mnng s the area of data mnng whch deals wth the dscovery and analyss of usage patterns from web logs, n order to mprove web based applcatons. Web usage mnng conssts of three phases, preprocessng, pattern dscovery, and pattern analyss. Web Usage Mnng [22,23] data can be collected n server logs, applcaton server logs. A complete preprocessng technque s beng proposed to preprocess the web log for extracton of user patterns. In ths paper, we propose a comprehensve preprocessng methodology that allows the analyst to transform any collecton of web server log fles nto structured collecton of tables n relatonal database model for further use n Prefetchng applcaton. The log fles from dfferent web stes of the same organzaton are merged to apprehend the behavors of the users that navgate n a transparent way. Afterwards, ths fle s cleaned by removng all unnecessary requests, such as mplct requests for the objects embedded n the Web pages and the requests generated by non-human clents of the Web ste (.e. Web robots). Then, the remanng requests are grouped by user, user sessons, page vews, and vsts. Fnally, the cleaned and transformed collectons of requests are saved onto a relatonal database model. We have provded flters to flter the unwanted, rrelevant, and unused data. Analyst can select the log fles from dfferent web servers and decde what entres he/she s nterested (HTML, PDF, and TXT). The objectve of ths research work s to consderably reduce the large quantty of Web usage data avalable and, at the same tme, to ncrease ts qualty by structurng t and provdng addtonal aggregated varables for the data mnng analyss that follow. The rest of the paper s organzed as follows. We formalze the problem of data preprocessng n secton 2. An overvew of the data preprocessng process s gven n secton 3. We

2 Internatonal Journal of Electrcal& Computer Scences IJECS-IJENS Vol:14 No:04 2 present the four steps System Archtecture: Data cleanng (secton 3.1), Identfcaton (secton 3.2) of Users, Sessons, Page vews and data formattng & summarzaton (Secton 3.3). The expermental results and observatons are provded n secton 4. Related work s gven n secton 5. Fnally, n secton 6, we conclude and provde some perspectves n Prefetchng applcatons. 2. PROBLEM FORMALIZATION Consder the set R = {r 1, r 2, r nr } of all Web resources from a Web ste. If U = {u 1, u 2, u nu } s the set of all the users who have accessed that ste, we can defne a log entry as l =<u, t, s, r, [ref ] >, where u U; r R; t represents the access tme, and s represents the request status. ref represents the referrng page and s optonal as n some Web log formats, lke the CLF format[1]where the referrng page s not recorded. s s a threedgt code ndcatng the request's success or falure. In the latter case, t also ndcates the cause of the falure. A status wth a value of 200 represents a succeeded request; whle a status of 404 shows that the requested fle was not found at the expected locaton. L = {l 1, l 2, l nl }, ordered ascendng by the tme value of l, consttutes a Web server log. The preprocessng frst groups the log entres nto page vews by usng the log. A page vew p conssts of p = {r 1, r 2, r np }, where r j R. The compacted log entry s lp =<u, t, p, ref >. Consderng a tme nterval t, we can defne a vst v for a user u as v =<u, t, pv >, where pv =<(t 1,p 1 ); (t 2,p 2, (t n,p n ) >; t +1 t and t +1 - t <t; =1:n-1.Usng these notatons, the preprocessng problem s formalzed as follows: Gven a thatlog fle for gven Web ste L. The objectve s to extract the Users, User sessons, vsts, and page vews of the Web stes' users wth a gven t. The sesson dentfcaton process Varable w R D N r j t rj S S L S j j n Table I Summary of Varables and Notatons Descrpton The total number of users The ordered lst of th user access records n the web log fle The set of all R The total number of access records n R The j th access record of user The tme when r j was logged n the web log Sessons for all the users Sessons for the user Total number of sessons for user j th sesson for user The length for sesson S j 3. SYSTEM ARCHITECTURE System Archtecture of the proposed work s shown n fgure 1. It conssts of Pre processng, Sequental Pattern Mnng and Prefetchng Modules. In ths paper we focus on Web usage data pre processng, whch s an essental step for Sequental Pattern Mnng and n turn for prefetchng applcatons. can be formulated as follows: LetR ={r 1,r 2, r N } be the ordered lst of th user access records n the log (sorted by the ascendng order of the access frequency), and t rj (0 < j N ) be the tme when r j was logged n the web log fle. Let A= { A 1, A 2,.A v } be the lst of categores and V the total number of categores. We assume that each r j s represented by one of these categores and the user generates L sessons (possbly unequal length ordered sequences of pages). Assumng that w s the total number of users n the web log fle, let D = {R 1, R 2,...,R w } be the observed access records for all the users n the log. The sessons for ndvdual user are defned as a lst of subsets: S 1 = { r 1, r 2,.r n 1 }, S 2 ={ r ( n 1 +1),, r ( S 1 = {r ( n 1 nl ).. r ( Where t r( n 1 +1) t r( n 1 ) 30 mnutes, N= S 2 S L S n 1 n L + + n 1 + n 2 ) }, S L n )} n L and S1 Then the set S={S 1, S 2,..S w } represents the sessons for all the users n our data set. A summary of all the above varables and notatons s gven n the Table I. Fg. 1. System Archtecture Data preprocessng of web logs s usually complex and tme demandng. It comprses of four steps: Data cleanng, User Identfcaton, Sesson dentfcaton, and Data formattng &Summarzaton. At the begnnng of the data preprocessng, we have the Log contanng the Web server log fles collected by several Web servers. When sharng logs fles or publshng results, for prvacy reasons, we need to remove the host names or the IP addresses. Therefore, we replace the orgnal host name wth an dentfer that keeps the nformaton about the doman extenson (.e. the country code or organzaton type, such as.com,.edu, and.org). We can also use these parameters later n the analyss, and the log fles can be shared wthout revealng senstve nformaton. 3.1 Data Cleanng The frst step of data preprocessng conssts of removng useless requests from the log fle. Snce all the log entres are not vald, we need to elmnate the rrelevant entres. Usually, ths process removes requests concernng non-analyzed resources such as mages, multmeda fles, and page style fles. For example, requests for graphcal page content (*.jpg & *.gf mages) and requests for any other fle whch mght be ncluded nto a web page or even navgaton sessons performed by robots and web spders. By flterng out useless

3 Internatonal Journal of Electrcal& Computer Scences IJECS-IJENS Vol:14 No:04 3 data, we can reduce the log fle sze to use less storage space and to facltate upcomng tasks For example, by flterng out mage requests, the sze of Web server log fles reduced to less than 50% of ther orgnal sze. Thus, data cleanng ncludes the elmnaton of rrelevant entres lke: Requests executed by automated programs, such as web robots, spders and crawlers; these programs generate the traffc to web stes, can dramatcally bas the ste statstcs, and are also not the desred category whch WUM nvestgates. Requests for mage fles assocated wth requests for partcular pages; an user s request to vew a partcular page often results n several log entres because that page ncludes other graphcs, whle we are only nterested n what the users explctly request, whch are usually text fles. Entres wth unsuccessful HTTP status codes; HTTP status codes are used to ndcate the success or falure of a requested event, and we only consder successful entres wth codes between 200 and 299. Entres wth request methods except GET and POST. 3.2 Identfcaton Ths step groups the unstructured requests of a log fle by user, user sesson, page vew and vsts. At the end of ths step, the log fle wll be a set of transactons, where by transacton we refer to a user sesson or a vst User In most cases, the log fle provdes only the computer address (name or IP) and the user agent (for the ECLF[2] log fles). For Web stes requrng user regstraton, the log fle also contans the user logn (as the thrd record n a log entry). In ths case, we use ths nformaton for the user dentfcaton. When the user logn s not avalable, we consder (f necessary) each IP as a user, although we know that an IP address can be used by several users. In ths paper, we approxmate users n terms of IP address, type of OS and browsng software User Sessons Identfyng the user sessons from the log fle s not a smple task due to proxy servers, dynamc addresses, and cases where multple users access the same computer (at a lbrary, Internet cafe, etc.) or one user uses multple browsers or computers. A user sesson s defned as a sequence of requests made by a sngle user over a certan navgaton perod and a user may have a sngle (or multple) sesson(s) durng a perod of tme. Sesson dentfcaton s the process of segmentng the access log of each user nto ndvdual access sessons [5]. Two tmeorented heurstc methods: sesson-duraton based method and page-stay-tme based method have been specfcally proposed by [6, 7, and 8] for sesson dentfcaton. In ths paper, we use the tmeout threshold n order to defne the users sessons Page Vews We dentfy the page vews by usng the tme of the request. For requests made at the same tme (.e. the same second), we keep only the frst request (as ordered n the log fle) and dscard the followng ones. After the page vew dentfcaton, the log fle wll contan, normally, only one request for each user acton. The page vew dentfcaton step determnes whch page fle requests are part of the same page vew and what content was served. Ths s necessary to provde meanngful results n the pattern analyss phase. If ths step s not performed, the dscovered patterns can be domnated by page fles that make up a sngle popular page vew. Each sesson must start wth a seed page vew.e., an ntal page fle or set of page fles from whch all subsequent page vews wll be derved. In the vast majorty cases, the seed page vew s made up of a sngle fle, or starts wth a sngle fle that defnes the frame structure and mmedately causes addtonal page fles to be requested. It s very rare for an unrelated ste to lnk to more than one page fle of a dfferent ste n a sngle hypertext lnk. However, t s possble, and for these cases all of the page fles contrbutng to the seed page vew would have to be explctly entered nto the algorthm. Once the seed page vew s specfed for a sesson, each subsequent request s checked aganst the ste structure to determne whch frame s beng replaced. The new page vew conssts of the requested fle plus all of the page fles that are remanng n the browser dsplay from the prevous page vew. Each dstnct page vew dscovered n a log s gven an dentfer. 3.3 Data Formattng and Summarzaton Ths s the last step of data preprocessng. In ths step, frst, we transfer the structured fle contanng sessons and vsts to a relatonal database. Afterwards, we apply the data generalzaton at the request level (for URLs) and the aggregated data computaton for vsts and user sessons to completely fll n the database. The data summarzaton step concerns wth the computaton of aggregated varables at dfferent abstracton levels (e.g. request, vst, and user sesson). These aggregated varables are later used n the data mnng step. They represent statstcal values that characterze the objects analyzed. For nstance, f the object analyzed s a user sesson, n the aggregated data computaton process, we propose to calculate the followng varables: The number of vsts for that sesson; The length of the sesson n seconds (the dfference between the last and the frst date of the vst) or n pages vewed (the total number of page vews); 4. EXPERIMENTAL RESULTS Several experments have been conducted on log fles collected from NASA Web ste, Academc Web ste and MSNBC Web ste. Results show that our preprocessng methodology reduces sgnfcantly the sze of the ntal log fles by elmnatng unnecessary requests and ncreases ther qualty through better structurng. It s observed from the Table II that, the sze for s reduced to 72-83% of the ntal sze NASA log fle. Unque mage/css related statstcs are

4 Internatonal Journal of Electrcal& Computer Scences IJECS-IJENS Vol:14 No:04 4 presented n the fgures Fgure 2. Day wse unque vstors statstcs are presented n the Fgure3, and Page vews statstcs are presented n the Fgure4. The user sessons for Academc web ste are shown n the Table III.Smlarly for Academc web ste these statstcs are shown n Fgure 5,Fgure 6, and Fgure 7. The user sessons for NASA web ste are shown n the Table IV. Web transacton data for Academc web ste of50 pages s shown n the Table V. Sequence data for Academc web ste of 50 pages s shown n the Table VI. 5. RELATED WORK In the recent years, we have seen much research on Web usage mnng [3,4,5,6,7,8, 9,10,11,12,13,14,15,16]. However, as descrbed below, data preprocessng n WUM has receved far less attenton than t deserves. Methods for user dentfcaton, sessonzng, page vew dentfcaton, path completon, and epsode dentfcaton are presented n [3]. However, some of the heurstcs proposed are not approprate for larger and more complex Web stes. For example, they propose to use the ste topology n conjuncton wth the ECLF [2] fle for what they call the user dentfcaton. The proposed heurstc ams to dstngush between users wth the same IP address, OS and Browser by checkng every page requested n a chronologcal order. If a page requested s not referred by any prevous page requested, then t belongs to a new user sesson. The drawback of ths approach s that t consders only one way of navgatng n a Web ste, by followng lnks. However, n order to change the current page, the users can, for nstance, type the new URL n the address bar (most browsers have an auto completon feature that facltates ths functon) or they can select t from ther bookmarks. In another work [15], the authors compared tme-based and referrer-based heurstcs for vst reconstructon. They found out that a heurstc's approprateness depends on the desgn of the Web ste (.e. whether the ste s frame-based or frame-free) and on the length of the vsts (the referrer-based heurstc performs better for shorter vsts). In [16], Marquardt et al. addressed the applcaton of WUM n the e-learnng area wth a focus on the preprocessng phase. In ths context, they redefned the noton of vst from the e-learnng pont of vew. Frequent access patterns are determned from reconstructed sessons. There are several algorthms n the lterature for the sequental pattern mnng,the dffculty of mnng sequental patterns from a large fxed database has been wdely addressed n [24]. Baoyao Zhou [17] proposed a smple algorthm that to access sesson s created as a par of URL and the requested tme n a sequence of requests wth a tmestamp. The duraton of an URL s estmated as the dfference of request tme of successor entry and current entry. Ths algorthm s sutable when there are more number of URL s n a sesson. The default tme set by author s 30 mnutes per sesson. Murat Al et.al., [18] proposed, Smart Mner: A New framework for Mnng Large Scale Web Usage Data. Ths frame work s a part of Web Analytcs Software. Smart Mner sessons are set of paths traversed n the Web graph that corresponds to user navgatons among web pages. The Author developed an effcent verson of the Apror-All Technque whch uses the structure of web graph to ncrease the performance. The sessons constructed by SMART-SRA contans sequental pages accessed from server-sde works n two stages. 1. Tmestamp Orderng Rule: the tme stamp orderng condton smply represents the standard user sesson defnton. 2. Topology rule: The topology condton s ntroduced to force each user navgaton path to correspond to a path n the web ste graph. In Smart SRA there are two phases. In the frst phase, the access data stream of web users are parttoned nto shorter page request sequences called canddate sessons. In the second phase canddate sessons are dvded nto maxmal sub sessons such that for each consecutve page par n the sequence there exsts a lnk from prevous one to latter one. At the same tme, page stay tme rule for consecutve pages s also satsfed. An agent smulator s developed by authors to smulate an actual web user. It randomly generates a typcal web ste topology and a user agent to accesses the same from ts clent sde and acts lke a real user. An mportant feature of the agent smulator s ts ablty to model dynamc behavors of a web agent. Tme constrant s also consdered as the dfference between two consecutve pages s smaller than 10 mnutes. Robert F.Dell et.al [19] proposed, web user sesson reconstructon usng nteger programmng. The man advantage of ths method s constructon of all sesson s smultaneously Author approach s groups log regsters from same IP address and agent ensurng the lnk structure of the ste s followed n any constructed sesson. Each constructed sesson from a web log s an ordered lst of log regsters where each regster can only be used once n only one sesson. Experment s focused wth IP address wth hgh dversty and a hgher number of regsters. Sessons produced better match an expected emprcal dstrbuton. Navn Kumar Tyag et al., [21] proposed An Algorthmc approach to Data Pre processng n Web Usage Mnng. The author survey some data preprocessng actvtes lke data cleanng and data reducton and. It s mportant that before applyng data mnng technques to dscover user access patterns from web log, data must be processed to mantan qualty of results s based on data to be mned. An Effcent Algorthm for Data Cleanng of Log Fle usng Fle Extensons proposed [25], algorthm tested on the log fles for data cleanng. The approach show a somewhat salent reducton n the number of records and n the log fles sze and ncreases the qualty of the avalable data.

5 Internatonal Journal of Electrcal& Computer Scences IJECS-IJENS Vol:14 No:04 5 Webste NASA July 95 ACADE MIC Dec MSNBC Aug 95 Orgnal Sze (20.6M B) (2.9MB ) (7.6MB ) Table II Results after Preprocessng Sze after Preproce ssng % Reduct on n Sze No. of Sesson s % No. of Use rs % % Fg. 2.Unque Image/CSS related statstcs (NASA Log fle, July 1995)(Academc Log fle, Dec. 2013) Fg. 5. Unque Image/CSS related statstcs Fg. 3.Unque Vstors (NASA Log fle, July 1995) Fg. 6. Unque Image/CSS related statstcs (Academc Log fle, Dec. 2013)

6 Internatonal Journal of Electrcal& Computer Scences IJECS-IJENS Vol:14 No:04 6 Fg. 4. Page Vews statstcs (NASA Log fle, July 1995) Fg. 7. Unque Image/CSS related statstcs (Academc Log fle, Dec. 2013) Table III Sample User Sessons for Academc Web ste Sesson Id IP Address Date &Tme URL Accessed :37: :36: :37: :37:22 /ece-dep.html :22:24 /mages/placements.jpg :20:06 /mages/placements.jpg :30:25 /Alumn.html :31:06 /Alumn.html :30:34 /bestteachers.html :30:35 /bestteachers.html Table IV Sample User Sessons for NASA Web ste Sesson Id IP Address Date &Tme URL Accessed :16:58 /shuttle/mssons/sts-73/msson-sts-73.html :17:25 /shuttle/mssons/sts-74/msson-sts-74.html :26:33 /dsclamer.html :09:32 /dsclamer.html :27:49 /shuttle/countdown/countdown.html :28:11 /shuttle/technology/sts-newsref/stsref-toc.html :05:11 /shuttle/countdown/lftoff.html :06:33 /shuttle/countdown/lftoff.html

7 Internatonal Journal of Electrcal& Computer Scences IJECS-IJENS Vol:14 No:04 7 Table V Web Transacton Data for Academc Web ste of 50 pages User Id Page Id User Id Page Id U32 P38 U199 P47 U32 P15 U199 P24 U32 P35 U199 P36 U57 P29 U199 P37 U57 P26 U93 P39 U57 P15 U93 P9 U30 P38 U93 P4 U30 P26 U100 P43 U30 P15 U100 P33 U80 P28 U9 P32 U80 P21 U9 P29 U80 P30 U393 P34 U80 P36 U393 P18 U14 P23 U393 P8 U14 P14 U379 P28 U14 P46 U379 P28 U101 P9 U379 P20 U101 P4 U153 P10 U101 P15 U153 P12 U163 P15 U491 P1 U163 P44 U491 P9 U163 P13 U491 P4 In ther approach, a learnng sesson (LS), vst n our case, can span over several days f ths perod corresponds to a gven learnng perod. As shown above, the preprocessng step s mportant and should be present n all WUM analyss. Therefore, we compared our preprocessng methodology wth the preprocessng descrbed n other general WUM research works [3, 9, 10, 11, 12, 15 and16]. 6. CONCLUSION In ths paper, we have presented Pre-processng as a pre requste for Prefetchng applcaton. The expermental results presented n secton 4, llustrates the mportance of the data preprocessng step and the effectveness of our methodology, by reducng not only the sze of the log fle but also by ncreasng the qualty of the data avalable through the new data structures that we obtaned. Although the preprocessng methodology presented allows us to reassemble most of the ntal vsts, the process tself does not fully guarantee that we dentfy correctly all the transactons (.e. user sessons & vsts). Ths s due to the poor qualty of the ntal log fle ths may affect the data mnng, resultng n erroneous Web access patterns. Therefore, we need a sold procedure that guarantees the qualty and the accuracy of the data obtaned at the end of data preprocessng. Future work n ths regard n the applcaton of Frequent Pattern Mnng algorthms on the preprocessed web transacton data to get nterestng Frequent Sequental Patterns, further these patterns would be used as gudng rules for prefetchng applcatons. Table VI Sequence Data for Academc Web ste of 50 pages Use Id U32 Page Id P38, P15, P35 U57 P29, P23,P40 U30 U80 U14 U101 U163 U199 U93 U100 U9 U393 U379 U153 U491 P38, P26, P15 P28, P21, P30, P36 P23, P14, P46 P9, P4, P15 P15, P44,P13 P47, P24, P36, P37 P39, P9, P4 P43, P33 P32, P29 P34, P18, P8 P28, P28, P20 P10, P12 P1, P9, P4 REFERENCES [1] Confguraton fles of W3C httpd, (1995). [2] W3C Extended Log Fle Format, (1996). [3] J. Srvastava, R. Cooley, M. Deshpande, P.-N. Tan, Web usage mnng: dscovery and applcatons of usage patterns from web data, SIGKDD Exploratons, 1(2), 2000, [4] R. Kosala, H. Blockeel, Web mnng research: a survey, SIGKDD: SIGKDD exploratons: newsletter of the specal nterest group (SIG) on knowledge dscovery & data mnng, ACM 2 (1), 2000, 1 15 [5] R. Kohav, R. Parekh, Ten supplementary analyses to mprove e- commerce web stes, n: Proceedngs of the Ffth WEBKDD workshop, [6] B. Mobasher R. Cooley, and J. Srvastava, Creatng Adaptve Web Stes through usage based clusterng of URLs, n IEEE knowledge & Data Engg work shop (KDEX 99), 1999 [7] Bettna Berendt, Web usage mnng, ste semantcs, and the support of navgaton, n Proceedngs of the Workshop WEBKDD Web Mnng for E-Commerce - Challenges and Opportuntes, 6th ACM SIGKDD Int. Conf. on Knowledge Dscovery and Data Mnng, 2000, Boston, MA [8] B. Berendt and M. Splopoulou. Analyss of Navgaton Behavour n Web Stes Integratng Multple Informaton Systems. VLDB, 9(1), 2000, [9] A. Josh and R. Krshnapuram. On Mnng Web Access Logs. In ACM SIGMOD Workshop on Research Issues n Data Mnng and Knowledge Dscovery, pages 2000, [10] C. Shahab and F. B. Kashan. A Framework for Effcent and Anonymous Web Usage Mnng Based on Clent-Sde Trackng. In WEBKDD Mnng Web Log Data Across All Customers Touch Ponts, Thrd Internatonal Workshop, San Francsco, CA, USA, August 26, 2001, Revsed Papers, volume 2356 of LNCS, Sprnger, 2002, [11] Y. Fu, K. Sandhu, and M. Shh. A Generalzaton-Based Approach to Clusterng ofweb Usage Sessons. In Proceedngs of the 1999 KDD Workshop on Web Mnng, San Dego, CA, vol of LNAI,. Sprnger, 2000, [12] M. S. Chen, J. S. Park, and P. S. Yu. Effcent Data Mnng for Path Traversal Patterns. Knowledge and Data Engneerng, 10(2), 1998,

8 Internatonal Journal of Electrcal& Computer Scences IJECS-IJENS Vol:14 No:04 8 [13] B. Mobasher, H. Da, T. Luo, and M. Nakagawa. Dscovery and Evaluaton of Aggregate Usage Profles for Web Personalzaton. Data Mnng and Knowledge Dscovery, 6(1), 2002, 61-82, [14] M. El-Sayed, C. Ruz, and E. A. Rundenstener. FS-Mner: Effcent and Incremental Mnng of Frequent Sequence Patterns n Web Logs. In Proceedngs of the Sxth Annual ACM Internatonal Workshop on Web Informaton and Data Management (WIDM '04), ACM Press, 2004, [15] B. Berendt, B. Mobasher, M. Nakagawa, and M. Splopoulou. The Impactof Ste Structure and User Envronment on Sesson reconstructon n Web Usage Analyss. In Proceedngs of the Forth WebKDD 2002 Workshop, at the ACM-SIGKDD Conference on Knowledge Dscovery n Databases (KDD'2002), Edmonton, Alberta, Canada, 2002 [16] C. Marquardt, K. Becker, and D. Ruz. A Pre-Processng Tool for Web Usage Mnng n the Dstance Educaton Doman. In Proceedngs of the Internatonal Database Engneerng and Applcatons Symposum (IDEAS'04), 2004, [17] BaoyaoZhou,Su Cheung Hu and AlyssC.M.Fong, An Effectve Approach for Perodc Web Personalzaton,Proceedngs of the IEEE/ACM Internatonal Conference on Web Intellgence.IEEE,2006 [18] Murat Al Bayr,IsmalHakkToroslu, AhmetCosar and GveyFdan Dscoverng more accurate Frequent Web Usage Patterns., arxv v1,2008 [19] Lu, L., & Lu, J., Mnng maxmal sequental patterns wth layer coded Breadth-Frst lnked WAP-tree., Asa-Pacfc Conference on Computatonal Intellgence and Industral Applcatons (PACIIA), IEEE, pp , [20] Chetna Chand, Amt Thakkar, Amt Ganatra, Sequental Pattern Mnng: Survey and Current Research Challenges, Internatonal Journal of Soft Computng and Engneerng (IJSCE) ISSN: , Volume-2, Issue-1, March 2012 [21] Navn Kumar Tyag, A.K. Solank & Sanjay Tyag. An Algorthmc approach to data preprocessng n Web usage mnng, Internatonal Journal of Informaton Technology and Knowledge Management July- December 2010, Volume 2, No. 2, pp [22] Yogsh H K, Dr. G T Raju, Manjunath T N, The Descrptve Study of Knowledge Dscovery from Web Usage Mnng, IJCSI Internatonal Journal of Computer Scence Issues, Vol. 8, Issue 5, No 1, September 2011 [23] Vjayashr Losarwar, Dr. Madhur Josh, Data Preprocessng n Web Usage Mnng, Internatonal Conference on Artfcal Intellgence and Embedded Systems (ICAIES'2012) July 15-16, 2012 Sngapore [24] R. Agrawal and R. Srkant, Mnng sequental patterns, In 11 th Internatonal Conference of Data Engneerng (ICDE 95), pp. 3-14, 1995 [25] Surbh Anand, Rnkle Ran Aggarwal, An Effcent Algorthm for Data Cleanng of Log Fle usng Fle Extensons, Internatonal Journal of Computer Applcatons ( ),Volume 48 No.8, June 2012

Knowledge Discovery from Web Usage Data: Complete Preprocessing Methodology

Knowledge Discovery from Web Usage Data: Complete Preprocessing Methodology IJCSNS International Journal of Computer Science and Network Security, VOL.8 No.1, January 2008 179 Knowledge Discovery from Web Usage Data: Complete Preprocessing Methodology G T Raju 1 and P S Satyanarayana

More information

Concurrent Apriori Data Mining Algorithms

Concurrent Apriori Data Mining Algorithms Concurrent Apror Data Mnng Algorthms Vassl Halatchev Department of Electrcal Engneerng and Computer Scence York Unversty, Toronto October 8, 2015 Outlne Why t s mportant Introducton to Assocaton Rule Mnng

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

LinkSelector: A Web Mining Approach to. Hyperlink Selection for Web Portals

LinkSelector: A Web Mining Approach to. Hyperlink Selection for Web Portals nkselector: A Web Mnng Approach to Hyperlnk Selecton for Web Portals Xao Fang and Olva R. u Sheng Department of Management Informaton Systems Unversty of Arzona, AZ 8572 {xfang,sheng}@bpa.arzona.edu Submtted

More information

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory Background EECS. Operatng System Fundamentals No. Vrtual Memory Prof. Hu Jang Department of Electrcal Engneerng and Computer Scence, York Unversty Memory-management methods normally requres the entre process

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Video Proxy System for a Large-scale VOD System (DINA)

Video Proxy System for a Large-scale VOD System (DINA) Vdeo Proxy System for a Large-scale VOD System (DINA) KWUN-CHUNG CHAN #, KWOK-WAI CHEUNG *# #Department of Informaton Engneerng *Centre of Innovaton and Technology The Chnese Unversty of Hong Kong SHATIN,

More information

TF 2 P-growth: An Efficient Algorithm for Mining Frequent Patterns without any Thresholds

TF 2 P-growth: An Efficient Algorithm for Mining Frequent Patterns without any Thresholds TF 2 P-growth: An Effcent Algorthm for Mnng Frequent Patterns wthout any Thresholds Yu HIRATE, Ego IWAHASHI, and Hayato YAMANA Graduate School of Scence and Engneerng, Waseda Unversty {hrate, ego, yamana}@yama.nfo.waseda.ac.jp

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1) Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A

More information

Effective Page Recommendation Algorithms Based on. Distributed Learning Automata and Weighted Association. Rules

Effective Page Recommendation Algorithms Based on. Distributed Learning Automata and Weighted Association. Rules Effectve Page Recommendaton Algorthms Based on Dstrbuted Learnng Automata and Weghted Assocaton Rules R. Forsat 1*, M. R. Meybod 2 1 Department of Computer Engneerng, Islamc Azad Unversty, Karaj Branch,

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Discovering similar user navigation behavior in Web log data

Discovering similar user navigation behavior in Web log data Dscoverng smlar user navgaton behavor n Web log data Tawfq A. Al-asad 1 and Ahmed J. Obad 2 1 Department of Computer Scence, Drector, College of IT, Babylon Unversty, Iraq. 2 Department of Computer Scence,

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

A Heuristic for Mining Association Rules In Polynomial Time

A Heuristic for Mining Association Rules In Polynomial Time A Heurstc for Mnng Assocaton Rules In Polynomal Tme E. YILMAZ General Electrc Card Servces, Inc. A unt of General Electrc Captal Corporaton 6 Summer Street, MS -39C, Stamford, CT, 697, U.S.A. egemen.ylmaz@gecaptal.com

More information

Query Clustering Using a Hybrid Query Similarity Measure

Query Clustering Using a Hybrid Query Similarity Measure Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

A Heuristic for Mining Association Rules In Polynomial Time*

A Heuristic for Mining Association Rules In Polynomial Time* Complete reference nformaton: Ylmaz, E., E. Trantaphyllou, J. Chen, and T.W. Lao, (3), A Heurstc for Mnng Assocaton Rules In Polynomal Tme, Computer and Mathematcal Modellng, No. 37, pp. 9-33. A Heurstc

More information

Data Preprocessing Based on Partially Supervised Learning Na Liu1,2, a, Guanglai Gao1,b, Guiping Liu2,c

Data Preprocessing Based on Partially Supervised Learning Na Liu1,2, a, Guanglai Gao1,b, Guiping Liu2,c 6th Internatonal Conference on Informaton Engneerng for Mechancs and Materals (ICIMM 2016) Data Preprocessng Based on Partally Supervsed Learnng Na Lu1,2, a, Guangla Gao1,b, Gupng Lu2,c 1 College of Computer

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier Some materal adapted from Mohamed Youns, UMBC CMSC 611 Spr 2003 course sldes Some materal adapted from Hennessy & Patterson / 2003 Elsever Scence Performance = 1 Executon tme Speedup = Performance (B)

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Oracle Database: SQL and PL/SQL Fundamentals Certification Course

Oracle Database: SQL and PL/SQL Fundamentals Certification Course Oracle Database: SQL and PL/SQL Fundamentals Certfcaton Course 1 Duraton: 5 Days (30 hours) What you wll learn: Ths Oracle Database: SQL and PL/SQL Fundamentals tranng delvers the fundamentals of SQL and

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Petri Net Based Software Dependability Engineering

Petri Net Based Software Dependability Engineering Proc. RELECTRONIC 95, Budapest, pp. 181-186; October 1995 Petr Net Based Software Dependablty Engneerng Monka Hener Brandenburg Unversty of Technology Cottbus Computer Scence Insttute Postbox 101344 D-03013

More information

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution Real-tme Moton Capture System Usng One Vdeo Camera Based on Color and Edge Dstrbuton YOSHIAKI AKAZAWA, YOSHIHIRO OKADA, AND KOICHI NIIJIMA Graduate School of Informaton Scence and Electrcal Engneerng,

More information

Related-Mode Attacks on CTR Encryption Mode

Related-Mode Attacks on CTR Encryption Mode Internatonal Journal of Network Securty, Vol.4, No.3, PP.282 287, May 2007 282 Related-Mode Attacks on CTR Encrypton Mode Dayn Wang, Dongda Ln, and Wenlng Wu (Correspondng author: Dayn Wang) Key Laboratory

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

Virtual Machine Migration based on Trust Measurement of Computer Node

Virtual Machine Migration based on Trust Measurement of Computer Node Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on

More information

A Webpage Similarity Measure for Web Sessions Clustering Using Sequence Alignment

A Webpage Similarity Measure for Web Sessions Clustering Using Sequence Alignment A Webpage Smlarty Measure for Web Sessons Clusterng Usng Sequence Algnment Mozhgan Azmpour-Kv School of Engneerng and Scence Sharf Unversty of Technology, Internatonal Campus Ksh Island, Iran mogan_az@ksh.sharf.edu

More information

Life Tables (Times) Summary. Sample StatFolio: lifetable times.sgp

Life Tables (Times) Summary. Sample StatFolio: lifetable times.sgp Lfe Tables (Tmes) Summary... 1 Data Input... 2 Analyss Summary... 3 Survval Functon... 5 Log Survval Functon... 6 Cumulatve Hazard Functon... 7 Percentles... 7 Group Comparsons... 8 Summary The Lfe Tables

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

A Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning

A Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning A Post Randomzaton Framework for Prvacy-Preservng Bayesan Network Parameter Learnng JIANJIE MA K.SIVAKUMAR School Electrcal Engneerng and Computer Scence, Washngton State Unversty Pullman, WA. 9964-75

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK L-qng Qu, Yong-quan Lang 2, Jng-Chen 3, 2 College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology,

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images Internatonal Journal of Informaton and Electroncs Engneerng Vol. 5 No. 6 November 015 Usng Fuzzy Logc to Enhance the Large Sze Remote Sensng Images Trung Nguyen Tu Huy Ngo Hoang and Thoa Vu Van Abstract

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Feature Selection as an Improving Step for Decision Tree Construction

Feature Selection as an Improving Step for Decision Tree Construction 2009 Internatonal Conference on Machne Learnng and Computng IPCSIT vol.3 (2011) (2011) IACSIT Press, Sngapore Feature Selecton as an Improvng Step for Decson Tree Constructon Mahd Esmael 1, Fazekas Gabor

More information

TPL-Aware Displacement-driven Detailed Placement Refinement with Coloring Constraints

TPL-Aware Displacement-driven Detailed Placement Refinement with Coloring Constraints TPL-ware Dsplacement-drven Detaled Placement Refnement wth Colorng Constrants Tao Ln Iowa State Unversty tln@astate.edu Chrs Chu Iowa State Unversty cnchu@astate.edu BSTRCT To mnmze the effect of process

More information

The Shortest Path of Touring Lines given in the Plane

The Shortest Path of Touring Lines given in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 262 The Open Cybernetcs & Systemcs Journal, 2015, 9, 262-267 The Shortest Path of Tourng Lnes gven n the Plane Open Access Ljuan Wang 1,2, Dandan He

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Resolving Ambiguity in Depth Extraction for Motion Capture using Genetic Algorithm

Resolving Ambiguity in Depth Extraction for Motion Capture using Genetic Algorithm Resolvng Ambguty n Depth Extracton for Moton Capture usng Genetc Algorthm Yn Yee Wa, Ch Kn Chow, Tong Lee Computer Vson and Image Processng Laboratory Dept. of Electronc Engneerng The Chnese Unversty of

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

The Effect of Similarity Measures on The Quality of Query Clusters

The Effect of Similarity Measures on The Quality of Query Clusters The effect of smlarty measures on the qualty of query clusters. Fu. L., Goh, D.H., Foo, S., & Na, J.C. (2004). Journal of Informaton Scence, 30(5) 396-407 The Effect of Smlarty Measures on The Qualty of

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

Alignment Results of SOBOM for OAEI 2010

Alignment Results of SOBOM for OAEI 2010 Algnment Results of SOBOM for OAEI 2010 Pegang Xu, Yadong Wang, Lang Cheng, Tany Zang School of Computer Scence and Technology Harbn Insttute of Technology, Harbn, Chna pegang.xu@gmal.com, ydwang@ht.edu.cn,

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

Brushlet Features for Texture Image Retrieval

Brushlet Features for Texture Image Retrieval DICTA00: Dgtal Image Computng Technques and Applcatons, 1 January 00, Melbourne, Australa 1 Brushlet Features for Texture Image Retreval Chbao Chen and Kap Luk Chan Informaton System Research Lab, School

More information

PYTHON IMPLEMENTATION OF VISUAL SECRET SHARING SCHEMES

PYTHON IMPLEMENTATION OF VISUAL SECRET SHARING SCHEMES PYTHON IMPLEMENTATION OF VISUAL SECRET SHARING SCHEMES Ruxandra Olmd Faculty of Mathematcs and Computer Scence, Unversty of Bucharest Emal: ruxandra.olmd@fm.unbuc.ro Abstract Vsual secret sharng schemes

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

Distributed Resource Scheduling in Grid Computing Using Fuzzy Approach

Distributed Resource Scheduling in Grid Computing Using Fuzzy Approach Dstrbuted Resource Schedulng n Grd Computng Usng Fuzzy Approach Shahram Amn, Mohammad Ahmad Computer Engneerng Department Islamc Azad Unversty branch Mahallat, Iran Islamc Azad Unversty branch khomen,

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility IJCSNS Internatonal Journal of Computer Scence and Network Securty, VOL.15 No.10, October 2015 1 Evaluaton of an Enhanced Scheme for Hgh-level Nested Network Moblty Mohammed Babker Al Mohammed, Asha Hassan.

More information

Keyword-based Document Clustering

Keyword-based Document Clustering Keyword-based ocument lusterng Seung-Shk Kang School of omputer Scence Kookmn Unversty & AIrc hungnung-dong Songbuk-gu Seoul 36-72 Korea sskang@kookmn.ac.kr Abstract ocument clusterng s an aggregaton of

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

A fast algorithm for color image segmentation

A fast algorithm for color image segmentation Unersty of Wollongong Research Onlne Faculty of Informatcs - Papers (Arche) Faculty of Engneerng and Informaton Scences 006 A fast algorthm for color mage segmentaton L. Dong Unersty of Wollongong, lju@uow.edu.au

More information

Classic Term Weighting Technique for Mining Web Content Outliers

Classic Term Weighting Technique for Mining Web Content Outliers Internatonal Conference on Computatonal Technques and Artfcal Intellgence (ICCTAI'2012) Penang, Malaysa Classc Term Weghtng Technque for Mnng Web Content Outlers W.R. Wan Zulkfel, N. Mustapha, and A. Mustapha

More information

Study of Data Stream Clustering Based on Bio-inspired Model

Study of Data Stream Clustering Based on Bio-inspired Model , pp.412-418 http://dx.do.org/10.14257/astl.2014.53.86 Study of Data Stream lusterng Based on Bo-nspred Model Yngme L, Mn L, Jngbo Shao, Gaoyang Wang ollege of omputer Scence and Informaton Engneerng,

More information

PRÉSENTATIONS DE PROJETS

PRÉSENTATIONS DE PROJETS PRÉSENTATIONS DE PROJETS Rex Onlne (V. Atanasu) What s Rex? Rex s an onlne browser for collectons of wrtten documents [1]. Asde ths core functon t has however many other applcatons that make t nterestng

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

PHYSICS-ENHANCED L-SYSTEMS

PHYSICS-ENHANCED L-SYSTEMS PHYSICS-ENHANCED L-SYSTEMS Hansrud Noser 1, Stephan Rudolph 2, Peter Stuck 1 1 Department of Informatcs Unversty of Zurch, Wnterthurerstr. 190 CH-8057 Zurch Swtzerland noser(stuck)@f.unzh.ch, http://www.f.unzh.ch/~noser(~stuck)

More information

Mining Web Logs with PLSA Based Prediction Model to Improve Web Caching Performance

Mining Web Logs with PLSA Based Prediction Model to Improve Web Caching Performance JOURAL OF COMPUTERS, VOL. 8, O. 5, MAY 2013 1351 Mnng Web Logs wth PLSA Based Predcton Model to Improve Web Cachng Performance Chub Huang Department of Automaton, USTC Key laboratory of network communcaton

More information

CAN COMPUTERS LEARN FASTER? Seyda Ertekin Computer Science & Engineering The Pennsylvania State University

CAN COMPUTERS LEARN FASTER? Seyda Ertekin Computer Science & Engineering The Pennsylvania State University CAN COMPUTERS LEARN FASTER? Seyda Ertekn Computer Scence & Engneerng The Pennsylvana State Unversty sertekn@cse.psu.edu ABSTRACT Ever snce computers were nvented, manknd wondered whether they mght be made

More information

Utilizing Content to Enhance a Usage-Based Method for Web Recommendation based on Q-Learning

Utilizing Content to Enhance a Usage-Based Method for Web Recommendation based on Q-Learning Proceedngs of the Twenty-Frst Internatonal FLAIS Conference (2008) Utlzng Content to Enhance a Usage-Based Method for Web ecommendaton based on Q-Learnng Nma Taghpour Department of Computer Engneerng Amrkabr

More information

Shared Running Buffer Based Proxy Caching of Streaming Sessions

Shared Running Buffer Based Proxy Caching of Streaming Sessions Shared Runnng Buffer Based Proxy Cachng of Streamng Sessons Songqng Chen, Bo Shen, Yong Yan, Sujoy Basu Moble and Meda Systems Laboratory HP Laboratores Palo Alto HPL-23-47 March th, 23* E-mal: sqchen@cs.wm.edu,

More information

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation Intellgent Informaton Management, 013, 5, 191-195 Publshed Onlne November 013 (http://www.scrp.org/journal/m) http://dx.do.org/10.36/m.013.5601 Qualty Improvement Algorthm for Tetrahedral Mesh Based on

More information

IP Camera Configuration Software Instruction Manual

IP Camera Configuration Software Instruction Manual IP Camera 9483 - Confguraton Software Instructon Manual VBD 612-4 (10.14) Dear Customer, Wth your purchase of ths IP Camera, you have chosen a qualty product manufactured by RADEMACHER. Thank you for the

More information

Analysis of Continuous Beams in General

Analysis of Continuous Beams in General Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION 1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute

More information

On Some Entertaining Applications of the Concept of Set in Computer Science Course

On Some Entertaining Applications of the Concept of Set in Computer Science Course On Some Entertanng Applcatons of the Concept of Set n Computer Scence Course Krasmr Yordzhev *, Hrstna Kostadnova ** * Assocate Professor Krasmr Yordzhev, Ph.D., Faculty of Mathematcs and Natural Scences,

More information

IP mobility support is becoming very important as the

IP mobility support is becoming very important as the 706 IEEE TRANSACTIONS ON COMPUTERS, VOL. 52, NO. 6, JUNE 2003 A New Scheme for Reducng Lnk and Sgnalng Costs n Moble IP Young J. Lee and Ian F. Akyldz, Fellow, IEEE Abstract IP moblty support s provded

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Parallel and Distributed Association Rule Mining - Dr. Giuseppe Di Fatta. San Vigilio,

Parallel and Distributed Association Rule Mining - Dr. Giuseppe Di Fatta. San Vigilio, Parallel and Dstrbuted Assocaton Rule Mnng - Dr. Guseppe D Fatta fatta@nf.un-konstanz.de San Vglo, 18-09-2004 1 Overvew Assocaton Rule Mnng (ARM) Apror algorthm Hgh Performance Parallel and Dstrbuted Computng

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

A Combined Approach for Mining Fuzzy Frequent Itemset

A Combined Approach for Mining Fuzzy Frequent Itemset A Combned Approach for Mnng Fuzzy Frequent Itemset R. Prabamaneswar Department of Computer Scence Govndammal Adtanar College for Women Truchendur 628 215 ABSTRACT Frequent Itemset Mnng s an mportant approach

More information

VISUAL SELECTION OF SURFACE FEATURES DURING THEIR GEOMETRIC SIMULATION WITH THE HELP OF COMPUTER TECHNOLOGIES

VISUAL SELECTION OF SURFACE FEATURES DURING THEIR GEOMETRIC SIMULATION WITH THE HELP OF COMPUTER TECHNOLOGIES UbCC 2011, Volume 6, 5002981-x manuscrpts OPEN ACCES UbCC Journal ISSN 1992-8424 www.ubcc.org VISUAL SELECTION OF SURFACE FEATURES DURING THEIR GEOMETRIC SIMULATION WITH THE HELP OF COMPUTER TECHNOLOGIES

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information