Algorithms for Frequent Pattern Mining of Big Data

Size: px
Start display at page:

Download "Algorithms for Frequent Pattern Mining of Big Data"

Transcription

1 Research Inda Publcatons. Algorthms for Frequent Pattern Mnng of Bg Data Syed Zubar Ahmad Shah 1, Mohammad Amjad 1, Ahmad Al Habeeb 2, Mohd Huzafa Faruqu 1 and Mudasr Shaf 3 1 Department of Computer Engneerng, Jama Mlla Islama, New Delh, Inda. 2 Department of Informaton Technology & Cloud, Ercsson Inda Global Servces, Gurgaon, Haryana, Inda. 3 Department of Electroncs and Communcaton Engneerng, Maharsh Dayanand Unversty, Rohtak, Haryana, Inda. 1 Orcd ID: Abstract Frequent pattern mnng s a feld of data mnng amed at unsheathng frequent patterns n data n order to deduce knowledge that may help n decson makng. Numerous algorthms for frequent pattern mnng have been developed durng the last two decades most of whch have been found to be non-scalable for Bg Data. For Bg Data some scalable dstrbuted algorthms have also been proposed. In ths paper, we analyze such algorthms n detal. We start wth age old algorthms lke Count Dstrbuton and move on to traverse other algorthms one by one and fnally to algorthms based on MapReduce programmng model. Keywords: Frequent Pattern Mnng, Dstrbuted Systems, Bg Data Analytcs INTRODUCTION Data mnng s the task of scannng large datasets wth the am to generate new nformaton or wth the am of knowledge dscovery. Broadly speakng, data mnng ncludes felds lke clusterng, frequent pattern mnng (FPM), classfcaton and outler detecton [1, 2, 3]. FPM s a popular data mnng approach [4] whch s used to fnd attrbutes that occur together frequently. Some of ts mportant applcatons are market basket analyss, share market analyss, drug dscovery, etc. [5]. Several FPM algorthms have been proposed n the last few decades such as Apror [6], Eclat [7], FP-Growth [8], etc. [9, 1]. Identfyng all frequent patterns s a complex task and s also computatonally costly due to a large number of canddate patterns and so most algorthms become neffcent when t comes to the knd of data we deal wth n ths age, the Bg Data. Socal networkng places lke Facebook produce over 3TB of data every sngle day [11]. Amazon, Walmart and other gants regster bllons of transactons every year. Gene sequencng platforms can generate terabytes of data. To cope up wth the problem of huge volumes of data, researchers proposed several parallel algorthms for the FPM problem. Ths paper revews some of the classcal as well as recent parallel FPM algorthms. Varous parallel FPM algorthms have been dscussed and analyzed n detal startng wth the classcal algorthms and then movng onto the state of the art algorthms. ALGORITHMS FOR PERFORMING FREQUENT PATTERN MINING ON A DISTRIBUTED ENVIRONMENT A number of algorthms have been proposed for effcent frequent pattern mnng on a dstrbuted system whch nclude Count Dstrbuton, Data Dstrbuton and Canddate Dstrbuton algorthms by Agrawal and Shafer [12], Parallel Eclat by Zak el. al. [13], Parallel FP-growth algorthm by L el. al. [14], Sngle Pass Countng, Fxed Passes Combnedcountng and Dynamc Passes Combned-countng algorthms by Ln et. al. [15] and Dstrbuted Eclat algorthm by Moens et. al. [16]. We dscuss a few of these heren. Count Dstrbuton Algorthm Notatons: k-temset Freq k Cand k Proc DS DSR Cand k Itemset wth k tems Set of frequent k-temsets Canddate temset of sze k Processor Dataset local to Proc Dataset local to Proc after re-partton Canddate set of Proc durng pass k The Count Dstrbuton (CD) algorthm tres to mnmze nteracton. It parttons the dataset n horzontal manner nto equal parts for the varous processors. Each processor autonomously runs the Apror algorthm and creates the hashtree on the locally stored transactons. After each teraton, global count of canddates s calculated by addng all counts 7355

2 Research Inda Publcatons. over all processors. Ths algorthm has a lmtaton that every processor has to keep the whole hash-tree n ts memory. Data dstrbuton algorthm has to be used f the whole hash-tree cannot be stored n the memory. Ths algorthm permts superfluous calculatons n parallel to prevent nteracton. In the frst pass, every processor produces ts local canddate temset Cand 1. For remanng passes, the algorthm s gven below: 1. Every processor Proc produces canddate set Cand k by makng use of the frequent temset Freq k-l that was produced n the prevous teraton. 2. Proc performs a pass on ts DS and to fnd local support counts for canddates n Cand k. 3. Proc synchronzes wth all other processors and broadcasts local Cand k counts to develop global Cand k counts. 4. Every processor Proc calculates Freq k from Cand k. usng the global support count. 5. If Freq k. s a null set then the algorthm termnates, or else t contnues. Every processor Proc wll ndepently make ths decson. Fgure 1 shows the speedup of CD algorthm (on datasets D1456K.T15.14 and D114K.T2.16) as the number of processors s ncreased. Response Tme(sec) Number of processors Fgure 1: Speedup of Count Dstrbuton Algorthm wth ncrease n the number of processors Data Dstrbuton Algorthm D1456K.T15.14 D114K.T2.16 Ths algorthm uses memory of the system extra effcently. The temsets are parttoned and shared wth processors. Each processor must fnd the count of ts local subset of canddate temsets and broadcast local data to remanng processors. Parttonng and the dstrbuton of temsets may affect performance of the algorthm, so t s desrable to have balanced load. Pass 1: Ths step s same as the step for k=1 n CD algorthm. Pass k>1: 1. Processor Proc produces canddate temset Cand k from Freq k-1 but keeps only a subset Cand k whch has only 1 N temsets of Cand k. Unon of all subsets generated by every processor s Cand k and ther ntersecton s a. 2. Proc uses both local data and data receved from remanng processors to develop support counts for the temsets n ts Cand k. 3. Processor Proc calculates Cand k usng the local canddate set Cand k after gettng complete support count data. Agan, the ntersecton of all Cand k sets s and ther unon s Freq k. 4. All processors synchronze and exchange ther local Cand k wth other processors. Every processor has the complete Freq k and can decde whether to contnue the algorthm or to termnate t ndepently. Processors fnd the support count for local temsets n step 2 and broadcast or receve local data. Network congeston may become a problem n ths setup so steps must be taken to prevent that. Parallel Eclat In Parallel Eclat (Par-Eclat), data s n vertcal layout whch s done by transformng the horzontal data nto vertcal temset lst. The two phases n the Parallel Eclat are gven below: Intalzaton Phase: There are 3 sub-steps n the ntalzaton step: 1. The support count for 2-temsets are read after the preprocessng step has been done and all the frequent temsets havng count>= mnmum support are ntroduced nto Freq 2. (Freq k s the set of frequent k-temsets). 2. Any of the clusterng schemes between these two - the equvalence class or maxmal hypergraph clque clusterng - s appled to Freq 2 to produce the set of possble maxmal frequent temsets whch are then parttoned among all the processors to acheve load-balancng. 3. The database s reparttoned so that every processor obtans the td-lsts of all the 1-temsets n the cluster apponted to t. Asynchronous Phase: After the ntalzaton phase, the td-lsts are accessble on local storage on every processor, so every processor can use 7356

3 Research Inda Publcatons. ts assgned maxmal cluster to generate frequent temsets wthout synchronzng wth any other processor. A cluster s processed completely before the processor moves onto the subsequent cluster. Local database s examned one tme to complete ths step. Therefore, there s less I/O overhead. Snce a sublattce s nduced by each cluster, ether bottom-up traversal s used to produce all frequent temsets or hybrd traversal s used to produce the maxmal frequent temsets, depng on the algorthm. Intally only td-lst for 1-temset are avalable locally usng whch, the td-lsts for 2-temsets clusters can be generated. These clusters are generally not bg and so keepng the emergng td-lsts n memory creates no problem. 3-temsets are produced by ntersecton of the tdlsts for 2-temsets n the bottom-up approach. If the cardnalty of the emergng td-lst >= mnmum support, the new temset s added to Freq 3. Then the resultng frequent 3- temsets, Freq 3 are dvded nto sets of equvalence classes on the bass of common pre-fxes of sze 2. Intersecton of all pars of 3-temsets wthn an equvalence class s used to determne Freq 4. The process can be repeated tll all frequent temsets are found. After fndng Freq k Freq k-1 s deleted. Memory space s thus needed for the temsets n Freq k-1 only wthn one maxmal cluster. Ths algorthm s thus man memory space effcent. For top-down stage of the hybrdtraversal, memory needs to keep track of only the maxmal element seen so far, along wth the temsets not yet seen. Par-Eclat Algorthm 1. Create Freq 2 usng 2-temset support counts 2. Produce clusters from Freq 2 usng Equvalence Classes 3. Assgn clusters to processors 4. Scan local dataset part 5. Share approprate td-lsts wth remanng processors 6. Procure td-lsts from remanng processors 7. foreach assgned cluster C fnd frequent temsets usng Bottom-Up traversal or Hybrd traversal Fgure 2 compares the executon tme of Par-Eclat algorthm wth CD algorthm [12] (for T1.I4.D284K dataset). Par- Eclat clearly outperforms CD algorthm. Fgure 2: Comparson of Executon Tme of Par-Eclat and Count Dstrbuton Sngle Pass Countng It s the parallel mplementaton of Apror algorthm usng MapReduce. It makes Apror effcent for mnng assocaton rules from massve datasets. The man step n Apror algorthm s fndng the frequency of canddate temsets. Ths process of countng can be parallelzed. In each step, the mapper program emts <x, 1> for every canddate x that s there n the transacton. The reducer collects the key-value par emtted by the mapper and sums the value for each key, whch gves the total frequency of the canddate n the database. The reducer outputs all the canddates whch have enough support count. Sngle Pass Countng (SPC), at k-th pass of database scannng n a MapReduce phase, fnds out the frequent k-temsets. Map(key, value = temset n transacton t ): Input : Dataset D foreach transacton tr D do foreach tem j tr do Executon Tme (sec) emt <j, 1> X1.Y1.Z1 X2.Y2.Z2 X2.Y1.Z2 T1.I4.D248K X1.Y4.Z4 X2.Y2.Z4 X4.Y1.Z4 X2.Y4.Z8 X4.Y2.Z8 X8.Y2.Z16 X= No. of Hosts, Y= No. of Processors, Z=H*P X8.Y4.Z32 CD Par-Eclat Reduce(key=tem, value = count): foreach key k do foreach value val n k do k.count+=val f k.count>=mn. supp. count 7357

4 Research Inda Publcatons. emt <k, k.count> Phase-1 of SPC algorthm Dstrbuted Eclat Dstrbuted Eclat (Dst-Eclat) algorthm s based on Eclat algorthm for mnng frequent temsets from large datasets n parallel usng MapReduce. Ths algorthm focuses on speed. Dst-Eclat s qute fast but t requres huge man memory as t stores the complete dataset n memory n vertcal format. In phase-2, the algorthm generates frequent 2-temset usng frequent temsets Freq 1 whch are located at the Dstrbuted Cache n Hadoop. Map(key, value = temset n transacton t ): Input : Dataset D and Freq k-1(k>=2). Fetch Freq k-1 from DstrbutedCache. Buld a hash tree for Cand k=gen-apror(freq k-1). foreach transacton tr D do Cand t = subset(cand k, tr ) foreach tem Cand t do emt <, 1> Reduce(key=tem, value = count): foreach key k do foreach value val n k do k.count+=val f k.count>=mnmum_support_count emt <k, k.count> Phase-2 of SPC algorthm Algorthm SPC: 1. Run Phase- 1 to fnd Freq 1 2. Run Phase-2 to fnd Freq 2 3. forall k>2 and Freq k-1 φ Map functon of phase-2 Reduce functon of phase-2 Fgure 3: Eclat on MapReduce Framework Dst-Eclat s 3-step method (fgure 3). Each step can be dstrbuted between mappers to maxmse effcency. 1. Fndng the Frequent Items: In ths step, the vertcal database s parttoned nto equal-szed sub-databases called shards and dstrbuted among several mappers. Mappers extract the frequent sngletons from ther shard and s them to reducer. All frequent tems are gathered n the reduce phase. 2. k-frequent Itemsets Generaton: The set of frequent k- temsets, Freq k, s generated. Frst, each mapper s assgned the combned form of local frequent tem sets. Mappers fnd the frequent supersets of sze k of the tems usng Eclat algorthm and runnng t up to level k. Then a reducer assgns the frequent temsets to a new set of mappers. Dstrbuton to mappers s performed by employng Round-Robn algorthm. 3. Subtree Mnng: Ths step uses Eclat algorthm to mne the prefx tree from the assgned subsets. Subtrees can be mned ndepently by mappers snce sub-trees do not need nformaton from other sub-trees. 7358

5 Research Inda Publcatons. Tme(sec) Fgure 4: Comparson of Computaton Tme of Dst-Eclat and PFP Fgure 4 compares the runtme of Dst-Eclat wth Parallel FP- Growth (PFP) [14] over del.co.us dataset [17]. Mnmum support threshold s denoted by the x-axs and the y-axs denotes the tme taken by computaton n seconds. Dst-Eclat s faster of the two algorthms. CONCLUSION In ths paper, we traversed through many dstrbuted frequent pattern mnng algorthms n order to examne them and to present a pcture of how these algorthms grew over tme. We also presented a comparson of ther runtmes on dfferent datasets. We presented both types of algorthms those based on message passng and those based on MapReduce approaches. REFERENCES MnSup [1] Han, J., Pe, J., and Kamber, M., 211, Data Mnng: Concepts and Technques, Morgan Kaufmann Publshers, Calforna, USA. [2] Shah, S. Z. A., and Amjad, M., 216, Classcal and Relearnng based Clusterng Algorthms for Huge Data Warehouses, Proc. 2nd Internatonal Conference on Computatonal Intellgence & Communcaton Technology, Ghazabad, pp [3] Aggarwal, C. C., 215, Data Mnng, Sprnger Internatonal Publshng, Swtzerland. [4] Aggarwal, C. C., and Han, J., 214, Frequent Pattern Mnng, Sprnger Internatonal Publshng, Swtzerland. [5] Berry, M. J. A., and Lnoff, G. S., 24, Data Mnng Technques: For Marketng, Sales, and Customer Relatonshp Management, John Wley & Sons Inc., Indana, USA. [6] Agrawal, R., and Srkant, R., 1994, Fast Algorthms for Mnng Assocaton Rules, Proc. 2th Internatonal Conference on Very Large Data Bases, Santago, Chle, PFP Dst-Eclat 1215, pp [7] Han, J., Pe, J., and Yn, Y., 2, Mnng Frequent Patterns Wthout Canddate Generaton, Proc. ACM SIGMOD Internatonal Conference on Management of Data, Dallas, Texas, pp [8] Zak, M. J., 2, Scalable Algorthms for Assocaton Mnng, IEEE Transactons on Knowledge and Data Engneerng, 12(3), pp [9] Sucahyo, Y. G., and Gopalan, R. P, 24, CT-PRO: A Bottom Up Non Recursve Frequent Itemset Mnng Algorthm usng Compressed FP-Tree Data Structures, Proc. ICDM Workshop on Frequent Itemset Mnng Implementatons, Brghton. [1] Uno, T., Kyom, M., and Armura, H., 24, Effcent Mnng Algorthms for Frequent/Closed/Maxmal Itemsets, Proc. ICDM Workshop on Frequent Itemset Mnng Implementatons, Brghton. [11] Ln, J., and Ryaboy, D., 213, Scalng Bg Data Mnng Infrastructure: The Twtter Experence, ACM SIGKDD Exploratons Newsletter, 14(2), pp [12] Agrawal, R., and Shafer, J., 1996, Parallel Mnng of Assocaton Rules, IEEE Transactons on Knowledge and Data Engneerng, 8(6), pp [13] Zak, M. J., Parthasarathy, S., Oghara, M., and L, W., 1997, Parallel Algorthms for Dscovery of Assocaton Rules, Data Mnng and Knowledge Dscovery, 1(4), pp [14] L, H., Wang, Y., Zhang, D., Zhang, M., and Chang, E. Y., 28, Pfp: Parallel FP-growth for Query Recommaton, Proc. ACM Conference on Recommer Systems, Lausanne, pp [15] Ln, M. Y., Lee, P. Y., and Hsueh, S. C., 212, Aprorbased Frequent Itemset Mnng Algorthms on MapReduce, Proc. 6th Internatonal Conference on Ubqutous Informaton Management and Communcaton, Kuala Lumpur, Artcle no. 76. [16] Moens, S., Aksehrl, E., and Goethals, B., 213, Frequent Itemset Mnng for Bg Data, Proc. Internatonal Conference on Bg Data, Santa Clara, Calforna, pp [17] Wetzker, R., Zmmermann C., and Bauckhage, C., 28, Analyzng Socal Bookmarkng Systems: A del. co. us Cookbook, Proc. Mnng Socal Data Workshop, Patras, pp

Concurrent Apriori Data Mining Algorithms

Concurrent Apriori Data Mining Algorithms Concurrent Apror Data Mnng Algorthms Vassl Halatchev Department of Electrcal Engneerng and Computer Scence York Unversty, Toronto October 8, 2015 Outlne Why t s mportant Introducton to Assocaton Rule Mnng

More information

Parallel and Distributed Association Rule Mining - Dr. Giuseppe Di Fatta. San Vigilio,

Parallel and Distributed Association Rule Mining - Dr. Giuseppe Di Fatta. San Vigilio, Parallel and Dstrbuted Assocaton Rule Mnng - Dr. Guseppe D Fatta fatta@nf.un-konstanz.de San Vglo, 18-09-2004 1 Overvew Assocaton Rule Mnng (ARM) Apror algorthm Hgh Performance Parallel and Dstrbuted Computng

More information

TF 2 P-growth: An Efficient Algorithm for Mining Frequent Patterns without any Thresholds

TF 2 P-growth: An Efficient Algorithm for Mining Frequent Patterns without any Thresholds TF 2 P-growth: An Effcent Algorthm for Mnng Frequent Patterns wthout any Thresholds Yu HIRATE, Ego IWAHASHI, and Hayato YAMANA Graduate School of Scence and Engneerng, Waseda Unversty {hrate, ego, yamana}@yama.nfo.waseda.ac.jp

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Outline. CHARM: An Efficient Algorithm for Closed Itemset Mining. Introductions. Introductions

Outline. CHARM: An Efficient Algorithm for Closed Itemset Mining. Introductions. Introductions CHARM: An Effcent Algorthm for Closed Itemset Mnng Authors: Mohammed J. Zak and Chng-Ju Hsao Presenter: Junfeng Wu Outlne Introductons Itemset-Tdset tree CHARM algorthm Performance study Concluson Comments

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Innovation Typology. Collaborative Authoritativeness. Focused Web Mining. Text and Data Mining In Innovation. Generational Models

Innovation Typology. Collaborative Authoritativeness. Focused Web Mining. Text and Data Mining In Innovation. Generational Models Text and Data Mnng In Innovaton Joseph Engler Innovaton Typology Generatonal Models 1. Lnear or Push (Baroque) 2. Pull (Romantc) 3. Cyclc (Classcal) 4. Strategc (New Age) 5. Collaboratve (Polyphonc) Collaboratve

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

A Combined Approach for Mining Fuzzy Frequent Itemset

A Combined Approach for Mining Fuzzy Frequent Itemset A Combned Approach for Mnng Fuzzy Frequent Itemset R. Prabamaneswar Department of Computer Scence Govndammal Adtanar College for Women Truchendur 628 215 ABSTRACT Frequent Itemset Mnng s an mportant approach

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Association Rule Mining with Parallel Frequent Pattern Growth Algorithm on Hadoop

Association Rule Mining with Parallel Frequent Pattern Growth Algorithm on Hadoop Assocaton Rule Mnng wth Parallel Frequent Pattern Growth Algorthm on Hadoop Zhgang Wang 1,2, Guqong Luo 3,*,Yong Hu 1,2, ZhenZhen Wang 1 1 School of Software Engneerng Jnlng Insttute of Technology Nanng,

More information

Performance Study of Parallel Programming on Cloud Computing Environments Using MapReduce

Performance Study of Parallel Programming on Cloud Computing Environments Using MapReduce Performance Study of Parallel Programmng on Cloud Computng Envronments Usng MapReduce Wen-Chung Shh, Shan-Shyong Tseng Department of Informaton Scence and Applcatons Asa Unversty Tachung, 41354, Tawan

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Mining Vehicles Frequently Appearing Together from Massive Passing Records

Mining Vehicles Frequently Appearing Together from Massive Passing Records Appl. Math. Inf. Sc. 9, No. 3, 1427-1433 (2015) 1427 Appled Mathematcs & Informaton Scences An Internatonal Journal http://dx.do.org/10.12785/ams/090337 Mnng Vehcles Frequently Appearng Together from Massve

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

The Shortest Path of Touring Lines given in the Plane

The Shortest Path of Touring Lines given in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 262 The Open Cybernetcs & Systemcs Journal, 2015, 9, 262-267 The Shortest Path of Tourng Lnes gven n the Plane Open Access Ljuan Wang 1,2, Dandan He

More information

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe CSCI 104 Sortng Algorthms Mark Redekopp Davd Kempe Algorthm Effcency SORTING 2 Sortng If we have an unordered lst, sequental search becomes our only choce If we wll perform a lot of searches t may be benefcal

More information

A METHOD FOR FACTOR SCREENING OF SIMULATION EXPERIMENTS BASED ON ASSOCIATION RULE MINING

A METHOD FOR FACTOR SCREENING OF SIMULATION EXPERIMENTS BASED ON ASSOCIATION RULE MINING A METHOD FOR FACTOR SCREENING OF SIMULATION EXPERIMENTS BASED ON ASSOCIATION RULE MINING Lngyun Lu (a), We L (b), Png Ma (c), Mng Yang (d) Control and Smulaton Center, Harbn Insttute of Technology, Harbn

More information

Related-Mode Attacks on CTR Encryption Mode

Related-Mode Attacks on CTR Encryption Mode Internatonal Journal of Network Securty, Vol.4, No.3, PP.282 287, May 2007 282 Related-Mode Attacks on CTR Encrypton Mode Dayn Wang, Dongda Ln, and Wenlng Wu (Correspondng author: Dayn Wang) Key Laboratory

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

Wireless Sensor Networks Fault Identification Using Data Association

Wireless Sensor Networks Fault Identification Using Data Association Journal of Computer Scence 8 (9): 1501-1505, 2012 ISSN 1549-3636 2012 Scence Publcatons Wreless Sensor Networks Fault Identfcaton Usng Data Assocaton 1 Abram Kongu, T., 2 P. Thangaraj and 1 P. Prakanth

More information

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming CS 4/560 Desgn and Analyss of Algorthms Kent State Unversty Dept. of Math & Computer Scence LECT-6 Dynamc Programmng 2 Dynamc Programmng Dynamc Programmng, lke the dvde-and-conquer method, solves problems

More information

F Geometric Mean Graphs

F Geometric Mean Graphs Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 10, Issue 2 (December 2015), pp. 937-952 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) F Geometrc Mean Graphs A.

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Security Enhanced Dynamic ID based Remote User Authentication Scheme for Multi-Server Environments

Security Enhanced Dynamic ID based Remote User Authentication Scheme for Multi-Server Environments Internatonal Journal of u- and e- ervce, cence and Technology Vol8, o 7 0), pp7-6 http://dxdoorg/07/unesst087 ecurty Enhanced Dynamc ID based Remote ser Authentcaton cheme for ult-erver Envronments Jun-ub

More information

Load-Balanced Anycast Routing

Load-Balanced Anycast Routing Load-Balanced Anycast Routng Chng-Yu Ln, Jung-Hua Lo, and Sy-Yen Kuo Department of Electrcal Engneerng atonal Tawan Unversty, Tape, Tawan sykuo@cc.ee.ntu.edu.tw Abstract For fault-tolerance and load-balance

More information

ApproxMGMSP: A Scalable Method of Mining Approximate Multidimensional Sequential Patterns on Distributed System

ApproxMGMSP: A Scalable Method of Mining Approximate Multidimensional Sequential Patterns on Distributed System ApproxMGMSP: A Scalable Method of Mnng Approxmate Multdmensonal Sequental Patterns on Dstrbuted System Changha Zhang, Kongfa Hu, Zhux Chen, Lng Chen Department of Computer Scence and Engneerng, Yangzhou

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET 1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School

More information

Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems

Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems Determnng Fuzzy Sets for Quanttatve Attrbutes n Data Mnng Problems ATTILA GYENESEI Turku Centre for Computer Scence (TUCS) Unversty of Turku, Department of Computer Scence Lemmnkäsenkatu 4A, FIN-5 Turku

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

On Some Entertaining Applications of the Concept of Set in Computer Science Course

On Some Entertaining Applications of the Concept of Set in Computer Science Course On Some Entertanng Applcatons of the Concept of Set n Computer Scence Course Krasmr Yordzhev *, Hrstna Kostadnova ** * Assocate Professor Krasmr Yordzhev, Ph.D., Faculty of Mathematcs and Natural Scences,

More information

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION 24 CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION The present chapter proposes an IPSO approach for multprocessor task schedulng problem wth two classfcatons, namely, statc ndependent tasks and

More information

A User Selection Method in Advertising System

A User Selection Method in Advertising System Int. J. Communcatons, etwork and System Scences, 2010, 3, 54-58 do:10.4236/jcns.2010.31007 Publshed Onlne January 2010 (http://www.scrp.org/journal/jcns/). A User Selecton Method n Advertsng System Shy

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

ASSOCIATION RULE MINING BASED ON IMAGE CONTENT

ASSOCIATION RULE MINING BASED ON IMAGE CONTENT Internatonal Journal of Informaton Technology and Knowledge Management January-June 011, Volume 4, No. 1, pp. 143-146 ASSOCIATION RULE MINING BASED ON IMAGE CONTENT Deepa S. Deshpande Image mnng s concerned

More information

Fast Computation of Shortest Path for Visiting Segments in the Plane

Fast Computation of Shortest Path for Visiting Segments in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 4 The Open Cybernetcs & Systemcs Journal, 04, 8, 4-9 Open Access Fast Computaton of Shortest Path for Vstng Segments n the Plane Ljuan Wang,, Bo Jang

More information

Feature Selection as an Improving Step for Decision Tree Construction

Feature Selection as an Improving Step for Decision Tree Construction 2009 Internatonal Conference on Machne Learnng and Computng IPCSIT vol.3 (2011) (2011) IACSIT Press, Sngapore Feature Selecton as an Improvng Step for Decson Tree Constructon Mahd Esmael 1, Fazekas Gabor

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE

ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE Yordzhev K., Kostadnova H. Інформаційні технології в освіті ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE Yordzhev K., Kostadnova H. Some aspects of programmng educaton

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

Partial Restreaming Approach for Massive Graph Partitioning.

Partial Restreaming Approach for Massive Graph Partitioning. Partal Restreamng Approach for Massve Graph Parttonng. Ghzlane Echbarth, Hamamache Kheddouc To cte ths verson: Ghzlane Echbarth, Hamamache Kheddouc. Partal Restreamng Approach for Massve Graph Parttonng..

More information

Sorting Review. Sorting. Comparison Sorting. CSE 680 Prof. Roger Crawfis. Assumptions

Sorting Review. Sorting. Comparison Sorting. CSE 680 Prof. Roger Crawfis. Assumptions Sortng Revew Introducton to Algorthms Qucksort CSE 680 Prof. Roger Crawfs Inserton Sort T(n) = Θ(n 2 ) In-place Merge Sort T(n) = Θ(n lg(n)) Not n-place Selecton Sort (from homework) T(n) = Θ(n 2 ) In-place

More information

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Supervsed vs. Unsupervsed Learnng Up to now we consdered supervsed learnng scenaro, where we are gven 1. samples 1,, n 2. class labels for all samples 1,, n Ths s also

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

Fuzzy Weighted Association Rule Mining with Weighted Support and Confidence Framework

Fuzzy Weighted Association Rule Mining with Weighted Support and Confidence Framework Fuzzy Weghted Assocaton Rule Mnng wth Weghted Support and Confdence Framework M. Sulaman Khan, Maybn Muyeba, Frans Coenen 2 Lverpool Hope Unversty, School of Computng, Lverpool, UK 2 The Unversty of Lverpool,

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

CE 221 Data Structures and Algorithms

CE 221 Data Structures and Algorithms CE 1 ata Structures and Algorthms Chapter 4: Trees BST Text: Read Wess, 4.3 Izmr Unversty of Economcs 1 The Search Tree AT Bnary Search Trees An mportant applcaton of bnary trees s n searchng. Let us assume

More information

PARTITIONING POSITIONAL AND NORMAL SPACE FOR FAST OCCLUSION DETECTION

PARTITIONING POSITIONAL AND NORMAL SPACE FOR FAST OCCLUSION DETECTION Proceedngs of DETC 3 ASME 23 Desgn Engneerng Techncal Conferences and Computers and Informaton n Engneerng Conference Chcago, Illnos USA, September 2-6, 23 DETC23/DAC-48779 PARTITIONING POSITIONAL AND

More information

Keyword-based Document Clustering

Keyword-based Document Clustering Keyword-based ocument lusterng Seung-Shk Kang School of omputer Scence Kookmn Unversty & AIrc hungnung-dong Songbuk-gu Seoul 36-72 Korea sskang@kookmn.ac.kr Abstract ocument clusterng s an aggregaton of

More information

Application of Clustering Algorithm in Big Data Sample Set Optimization

Application of Clustering Algorithm in Big Data Sample Set Optimization Applcaton of Clusterng Algorthm n Bg Data Sample Set Optmzaton Yutang Lu 1, Qn Zhang 2 1 Department of Basc Subjects, Henan Insttute of Technology, Xnxang 453002, Chna 2 School of Mathematcs and Informaton

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

A Simple Methodology for Database Clustering. Hao Tang 12 Guangdong University of Technology, Guangdong, , China

A Simple Methodology for Database Clustering. Hao Tang 12 Guangdong University of Technology, Guangdong, , China for Database Clusterng Guangdong Unversty of Technology, Guangdong, 0503, Chna E-mal: 6085@qq.com Me Zhang Guangdong Unversty of Technology, Guangdong, 0503, Chna E-mal:64605455@qq.com Database clusterng

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING An Improved K-means Algorthm based on Cloud Platform for Data Mnng Bn Xa *, Yan Lu 2. School of nformaton and management scence, Henan Agrcultural Unversty, Zhengzhou, Henan 450002, P.R. Chna 2. College

More information

Cracking of the Merkle Hellman Cryptosystem Using Genetic Algorithm

Cracking of the Merkle Hellman Cryptosystem Using Genetic Algorithm Crackng of the Merkle Hellman Cryptosystem Usng Genetc Algorthm Zurab Kochladze 1 * & Lal Besela 2 1 Ivane Javakhshvl Tbls State Unversty, 1, I.Chavchavadze av 1, 0128, Tbls, Georga 2 Sokhum State Unversty,

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Today s Outline. Sorting: The Big Picture. Why Sort? Selection Sort: Idea. Insertion Sort: Idea. Sorting Chapter 7 in Weiss.

Today s Outline. Sorting: The Big Picture. Why Sort? Selection Sort: Idea. Insertion Sort: Idea. Sorting Chapter 7 in Weiss. Today s Outlne Sortng Chapter 7 n Wess CSE 26 Data Structures Ruth Anderson Announcements Wrtten Homework #6 due Frday 2/26 at the begnnng of lecture Proect Code due Mon March 1 by 11pm Today s Topcs:

More information

GIVEN two graphs G and H, the subgraph isomorphism

GIVEN two graphs G and H, the subgraph isomorphism IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS 1 Fndng and countng tree-lke subgraphs usng MapReduce Zhao Zhao, Langsh Chen, Mha Avram, Meng L, Guanyng Wang, Al Butt, Maleq Khan, Madhav Marathe, Judy

More information

USING GRAPHING SKILLS

USING GRAPHING SKILLS Name: BOLOGY: Date: _ Class: USNG GRAPHNG SKLLS NTRODUCTON: Recorded data can be plotted on a graph. A graph s a pctoral representaton of nformaton recorded n a data table. t s used to show a relatonshp

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Efficient Distributed File System (EDFS)

Efficient Distributed File System (EDFS) Effcent Dstrbuted Fle System (EDFS) (Sem-Centralzed) Debessay(Debsh) Fesehaye, Rahul Malk & Klara Naherstedt Unversty of Illnos-Urbana Champagn Contents Problem Statement, Related Work, EDFS Desgn Rate

More information

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Overvew 2 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Introducton Mult- Smulator MASIM Theoretcal Work and Smulaton Results Concluson Jay Wagenpfel, Adran Trachte Motvaton and Tasks Basc Setup

More information

Non-Split Restrained Dominating Set of an Interval Graph Using an Algorithm

Non-Split Restrained Dominating Set of an Interval Graph Using an Algorithm Internatonal Journal of Advancements n Research & Technology, Volume, Issue, July- ISS - on-splt Restraned Domnatng Set of an Interval Graph Usng an Algorthm ABSTRACT Dr.A.Sudhakaraah *, E. Gnana Deepka,

More information

Video Proxy System for a Large-scale VOD System (DINA)

Video Proxy System for a Large-scale VOD System (DINA) Vdeo Proxy System for a Large-scale VOD System (DINA) KWUN-CHUNG CHAN #, KWOK-WAI CHEUNG *# #Department of Informaton Engneerng *Centre of Innovaton and Technology The Chnese Unversty of Hong Kong SHATIN,

More information

Loop Transformations, Dependences, and Parallelization

Loop Transformations, Dependences, and Parallelization Loop Transformatons, Dependences, and Parallelzaton Announcements Mdterm s Frday from 3-4:15 n ths room Today Semester long project Data dependence recap Parallelsm and storage tradeoff Scalar expanson

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network A New Token Allocaton Algorthm for TCP Traffc n Dffserv Network A New Token Allocaton Algorthm for TCP Traffc n Dffserv Network S. Sudha and N. Ammasagounden Natonal Insttute of Technology, Truchrappall,

More information

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility IJCSNS Internatonal Journal of Computer Scence and Network Securty, VOL.15 No.10, October 2015 1 Evaluaton of an Enhanced Scheme for Hgh-level Nested Network Moblty Mohammed Babker Al Mohammed, Asha Hassan.

More information

METHODS FOR BATCH PROCESSING OF DATA MINING QUERIES

METHODS FOR BATCH PROCESSING OF DATA MINING QUERIES ETHOS FOR TH PROESSING OF T INING QUERIES arek Wojcechowsk and acej Zakrzewcz Insttute of omputng Scence Poznan Unversty of Technology ul. Potrowo 3a Poznan, Poland bstract: ey words: ata mnng s a useful

More information

A Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning

A Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning A Post Randomzaton Framework for Prvacy-Preservng Bayesan Network Parameter Learnng JIANJIE MA K.SIVAKUMAR School Electrcal Engneerng and Computer Scence, Washngton State Unversty Pullman, WA. 9964-75

More information

A Heuristic for Mining Association Rules In Polynomial Time

A Heuristic for Mining Association Rules In Polynomial Time A Heurstc for Mnng Assocaton Rules In Polynomal Tme E. YILMAZ General Electrc Card Servces, Inc. A unt of General Electrc Captal Corporaton 6 Summer Street, MS -39C, Stamford, CT, 697, U.S.A. egemen.ylmaz@gecaptal.com

More information

A Heuristic for Mining Association Rules In Polynomial Time*

A Heuristic for Mining Association Rules In Polynomial Time* Complete reference nformaton: Ylmaz, E., E. Trantaphyllou, J. Chen, and T.W. Lao, (3), A Heurstc for Mnng Assocaton Rules In Polynomal Tme, Computer and Mathematcal Modellng, No. 37, pp. 9-33. A Heurstc

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory Background EECS. Operatng System Fundamentals No. Vrtual Memory Prof. Hu Jang Department of Electrcal Engneerng and Computer Scence, York Unversty Memory-management methods normally requres the entre process

More information

Array transposition in CUDA shared memory

Array transposition in CUDA shared memory Array transposton n CUDA shared memory Mke Gles February 19, 2014 Abstract Ths short note s nspred by some code wrtten by Jeremy Appleyard for the transposton of data through shared memory. I had some

More information

Security Vulnerabilities of an Enhanced Remote User Authentication Scheme

Security Vulnerabilities of an Enhanced Remote User Authentication Scheme Contemporary Engneerng Scences, Vol. 7, 2014, no. 26, 1475-1482 HIKARI Ltd, www.m-hkar.com http://dx.do.org/10.12988/ces.2014.49186 Securty Vulnerabltes of an Enhanced Remote User Authentcaton Scheme Hae-Soon

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented

More information

Keywords: classifier, Association rules, data mining, healthcare, Associative Classifiers, CBA, CMAR, CPAR, MCAR

Keywords: classifier, Association rules, data mining, healthcare, Associative Classifiers, CBA, CMAR, CPAR, MCAR Mrs. Suwarna Gothane, Dr. G.R.Bamnote / Internatonal Journal of Engneerng Research and Applcatons (IJERA) ISSN: 2248-9622 www.era.com An Automated Weghted Support Approach Based Assocatve Classfcaton Wth

More information

OPTIMAL CONFIGURATION FOR NODES IN MIXED CELLULAR AND MOBILE AD HOC NETWORK FOR INET

OPTIMAL CONFIGURATION FOR NODES IN MIXED CELLULAR AND MOBILE AD HOC NETWORK FOR INET OPTIMAL CONFIGURATION FOR NODE IN MIED CELLULAR AND MOBILE AD HOC NETWORK FOR INET Olusola Babalola D.E. Department of Electrcal and Computer Engneerng Morgan tate Unversty Dr. Rchard Dean Faculty Advsor

More information

CS1100 Introduction to Programming

CS1100 Introduction to Programming Factoral (n) Recursve Program fact(n) = n*fact(n-) CS00 Introducton to Programmng Recurson and Sortng Madhu Mutyam Department of Computer Scence and Engneerng Indan Insttute of Technology Madras nt fact

More information