ANALYSIS OF THE FEATURES OF THE OPTIMAL LOGICAL STRUCTURE OF DISTRIBUTED DATABASES

Size: px
Start display at page:

Download "ANALYSIS OF THE FEATURES OF THE OPTIMAL LOGICAL STRUCTURE OF DISTRIBUTED DATABASES"

Transcription

1 ANALYSIS OF THE FEATURES OF THE OPTIMAL LOGICAL STRUCTURE OF DISTRIBUTED DATABASES E.V. Nurmatova 1,a, V.V. Gusev 2, V.V. Kotliar 2 1 «MIREA Russian Technological University», Moscow 2 NRC "Kurchatov Institute" IHEP a nurmatova@mirea.ru The uestions of constructing optimal logical structure of a distributed database (DDB) are considered. Solving these issues will mae it possible to increase the speed of processing reuests in DDB in comparison with a traditional database. Optimal logical structure of DDB will ensure the efficiency of the information system on computational resources. The problem of constructing an optimal logical structure of DDB is reduced to the problem of uadratic integer programming. As a result of its solution, the local networ of the DDB is decomposed into a number of clusters that have minimal information connectivity with each other. In particular, such tass arise for the organization of systems for processing huge amounts of information from the Large Hadron Collider. In these systems various DDBs are used to store information about:1) the system of triggers of data collection from physical perimental installations (ATLAS, CMS, LHCb, Alice), 2) the geometry and the operating conditions of the detector while collecting perimental data. Keywords: optimal logical structure, uery implementation scheme, distributed databases, grid architecture. 218 Elena V. Nurmatova, Victor V. Gusev, Victor V. Kotliar 579

2 1. Introduction Optimal logical structure of DDB will ensure the efficiency of the information system on computational resources.such tass arise in various areas of information and communication technologies, where distributed systems, systems with high loads, or systems of increased reliability are already used. In particular, such tass arise for the organization of systems for processing huge amounts of information from the Large Hadron Collider (LHC). In these systems various DDBs are used to store information about: the system of triggers for collecting data from physical perimental installations (ATLAS, CMS, LHCb, Alice), the geometry of the detector while collecting perimental data, description of the metadata of the events on the detectors (for ample, maintenance wor on the installation), the operating conditions of the detector while collecting perimental data [1]. The last DDB, the Conditions database, is the most compl in its organization. For access to it a special program interface CORAL [2] was written. The use of DDB directly in the process of data processing of the LHC in the distributed computing environment Grid should also be noted. To optimize the transfer of the results of calculations of individual tass over computer networs, a multilevel hierarchical system is used. This system eeps trac of where data is stored and where it is necessary to transfer them [3]. 2. Synthesis of the optimal logical structure of a distributed database DDB construction technologies can be used for storing data in monitoring systems of compl computing infrastructures, which are a combination of engineering, networ, software environments[4]. The constant search for methods to reduce the costs of data storage and their processing served as an objective need to deepen research on these issues and determined the relevance of the topic of this wor. Synthesis of the optimal logical structure of a distributed database is the process of finding the optimal mapping of the canonical structure of the DDB to a logical one, which provides the optimal value of a given criterion for the performance of information systems and satisfies the basic system, networ, and structural constraints [5]. When mapping a canonical structure into a logical one the groups of data of the canonical structure of the DDB are combined into types of logical records with their simultaneous distribution among the nodes of the computing system. The logical structure of DDB will be understood as an ordered set of logical records and connections (relations) between them, distributed over the nodes of the computing system. These logical records and connections reflect the semantic and functional properties and features of a given subject area of the information system. Let the logical structure of the database be given by the graph G ( N, L). To implement the -th uery on the graph G ( N, L), it is necessary to select the search tree G N, L ), where ( N N is a subset of vertices, and L L is the subset of connections of the graph G chosen by the search tree G. It should be noted that one of the vertices structure of the database. N is the entry point to the logical Figure 1 shows an ample of a search tree G and the direction of its bypass. Reducing the response time to a uery can be achieved by eliminating the vertices to which it is necessary to return for passing to the nt branch. The graph G u thus obtained will contain only those vertices (records) whose elements form the output structures of the uery. Thus, the tas of 58

3 minimizing the response time to a uery of the -th type is reduced to the problem of choosing the optimal connection between the records of the graph G. u Figure 1. The scheme of the graph G ( N, L ) and of direction of its bypass Figure 2. The scheme of a complete graph G u and connections between all vertices of a subset The scheme of a complete graph (see Figure 2) can be used to pose the problem of choosing the optimal structure of connections between records used to form the output structures of the -th uery with restrictions on the possibility of using a certain access path. The created DDBs can have a large dimension and therefore they are loaded and introduced in parts. To this end, the LS of DDB should be divided into a number of substructures or clusters that have minimal mutual connectivity under the following restrictions: on the dimension of clusters, on types of storage devices used, on the degree of semantic proximity of logical records included in clusters, etc. The source for the formulation and solution of this problem is the information obtained as a result of the synthesis of the LS of DDB (see Table 1). N { n j / j 1, J} W F w jj ' f K ' jj u N Table 1. The notation for source variables. the multiplicity of the record types of the LS of DB the matrix of connections of records structure the matrix of access paths for implementing ueries of users R r, r j,, r } the vector of a set of logical records { 1 J, j,, } the vector of characteristics of records length in bytes { 1 J To formalize the tas, we introduce the variables X 1, if the j-th record is included in the e-th cluster; X - otherwise. je je 581

4 The tas of decomposition of the logical structure of the RDB into the multiplicity of clusters that provide minimal connectivity between them is formulated as follows: J {min x ' ' j, }, 1 j, j 1 x ' jj j x ' ' j, with restrictions on the one-time inclusion of a logical record in the cluster, x j 1 1, j 1, J, and with restrictions on the total number of logical records in a cluster j j1 x j M 1, 1,, where M is an allowable number of records in the cluster. This tas is a problem of uadratic integer programming. As a result of solving the posed problem, the LS of DDB is decomposed into a number of clusters, which have minimal information connectivity with each other. The development of a database of a separate cluster can be further carried out taing into account the importance of the logical records included in it in terms of user reuirements, development complity, and other factors. The solution of such a tas is of great practical importance for computer-aided design of the LS of DDB and for generating specifications for ueries and corrections of DDB. This is especially important when organizing a DDB in the client-server architecture, in which the uery language SQL is used as an interface. The uery specifications for it include two main parts: uery objects and search conditions. In the reuest objects the information elements and the logical records reuired by the user are listed. It should be noted that the manual construction of ueries taes a long time, because the user reuires a detailed nowledge of the composition and structure of logical records, the interconnections between them, the characteristics of the organization, etc. 3. The logic circuit implementation of the uery Queries in the DDB are characterized by the composition of the reuested data, the freuency of their use, and the performance characteristics of the average values of the number of analyzed and selected when searching for instances records. The logic scheme of the implementation of a uery of DDB user includes the following seuence of operations: Designing and initiating a reuest by the user in the node of the computing system to which it is attached, in the uery language of the selected control system of DDB. Transferring the information to the database server via communication channels for the implementation of the uery. Processing the uery using the appropriate methods and by means of the database and solving the following main tass: selecting from the database the data reuired in the uery; decomposition of a uery for subueries (tass), the number of which is determined by the number of reuired database servers; selecting optimal access ways to the reuired database servers; establishing logical connections with the nodes of the computing system on which the reuired database servers are located. Transferring subueries to the reuired database servers via communication channels. Servicing subueries with database servers. Transferring the blocs of selected data from the database servers to the server of the node that initiated the reuest via communication channels. Assembling blocs of the data into an array reuired in the uery. 582

5 Evaluation criteria, as a tool of designing a DDB, are necessary for choosing a rational database structure among several alternative possibilities. It can be said that most of the problems and failures in developing a database model arise from a vague idea of what is meant by optimal database design. At present, as well as in the nearest future, uncertainty in the choice of these criteria will remain the weaest lin in the development of a database model. 4. Time, cost and volume characteristics of DB functioning Difficulties in determining the criteria for choosing alternative solutions are mainly caused by two factors [5]: 1. The first problem is that almost infinite number of different database structures that satisfy the same set of system reuirements can be built. Selection criteria should allow differentiation of all currently available alternatives. 2. The second problem is that alternatives are difficult to evaluate, since the criteria have the property of sensitivity and the duration of the various criteria is different. When solving the problem of the synthesis of the LS of DDB, the following basic time, cost and volume characteristics of the functioning of the DDB are used. The main time characteristics are: the duration of the implementation of a given set of ueries T and the duration of the implementation of a given set of transactions Т. In sum, these two indicators give the total duration of the worload of the DDB: where p T p p1 s T p T s1 T - the duration of the implementation of the p-th user uery; implementation of the s-th transaction. The components of the times T p and s, Ts - the duration of the T s depend on many factors: the networ and telecommunications euipment used, the technical parameters of the servers, the networ and telecommunications system-wide software, the characteristics of the relational database management system (RDBMS), the bandwidth of the communication channels, the running times of programs at various levels of networ protocols, etc. The main cost characteristics of a DDB are: the cost of storing information in the DDB E xp ; the uery and transaction costs at a given time interval E ; the cost of transmitting information via communication channels E. The sum of these components determines the total cost of the functioning of the DDB: E E E E. xp The cost of storing information in the DDB is determined by the physical amount of information V xp and the cost of storing a unit of information (one logical record) on the server xp. If we accept that the cost of storing information in all nodes of the computing system is a constant value, then E V xp In practice, xp approximately euals to And finally, the volumetric characteristics of the functioning of the DDB, the sets of user ueries and transactions are calculated from the following assumption: the amount of memory occupied by one instance of the record euals to the sum of the amount of useful information and the product of length of one address reference and the number of pointers in the record. xp xp. 583

6 5. Conclusion The problem of constructing an optimal logical structure of DDB is reduced to the problem of uadratic integer programming. As a result of its solution, the local networ of the DDB is decomposed into a number of clusters that have minimal information connectivity with each other. Further research will be related to the design of an algorithm for the synthesis of the optimal logical structure of a distributed database. The algorithm will consist of two interrelated stages.at the first stage, it is supposed to solve the problem of distribution of clusters of a distributed database between a server and clients, then the solution of the problem of optimal distribution of data groups of each node to logical record types follows. At the second stage of the algorithm, the problem of localizing the database by computer networ nodes is solved, and in addition to the results of the first stage, the characteristics of the distributed database are taen into account. References [1] A. V. Vaniachine LHC Databases on the Grid: Achievements and Open Issues The Proceedings of the IV International Conference on Distributed computing and Gridtechnologies in science and education (Grid21), JINR, Dubna, Russia, 28 June - 3 July, 21. [2] Radovan Chytrace, Dir Düllmann, Giacomo Govi, Alander Kalhof, Zsolt Molnár, Andrea Valassi Distributed Database Access in the LHC Computing Grid with CORAL IEEE Nuclear Science Symposium conference record. Nuclear Science Symposium, 28. [3] D. Ciangottini, J. Balcas, M. Mascheroni, E. A. Rupeia, E. Vaandering, H. Riahi, J. M. D. Silva, J. M. Hernandez, S. Belforte, T. T. Ivanov A comparison of different database technologies for the CMS AsyncStageOut transfer database J. Phys.: Conf. Ser., 898, 4248, 217. [4] V. Kotliar, V. Anshuov, V. Ezhovac, V. Gusev, A. Kotliar, G. Latyshev, A. Shishov Development of the active monitoring system for the computer center at IHEP, CEUR-WS, ISSN , 217. [5] N. A. Kuznetsov Methods of analysis and synthesis of modular information management systems. - M.: Fizmatlit, 8 p.,

RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE

RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE Viacheslav Ilyin Alexander Kryukov Vladimir Korenkov Yuri Ryabov Aleksey Soldatov (SINP, MSU), (SINP,

More information

CLOUDS OF JINR, UNIVERSITY OF SOFIA AND INRNE JOIN TOGETHER

CLOUDS OF JINR, UNIVERSITY OF SOFIA AND INRNE JOIN TOGETHER CLOUDS OF JINR, UNIVERSITY OF SOFIA AND INRNE JOIN TOGETHER V.V. Korenkov 1, N.A. Kutovskiy 1, N.A. Balashov 1, V.T. Dimitrov 2,a, R.D. Hristova 2, K.T. Kouzmov 2, S.T. Hristov 3 1 Laboratory of Information

More information

GEOMETRY DATABASE FOR THE CBM EXPERIMENT AND ITS FIRST APPLICATION TO THE EXPERIMENTS OF THE NICA PROJECT

GEOMETRY DATABASE FOR THE CBM EXPERIMENT AND ITS FIRST APPLICATION TO THE EXPERIMENTS OF THE NICA PROJECT GEOMETRY DATABASE FOR THE CBM EXPERIMENT AND ITS FIRST APPLICATION TO THE EXPERIMENTS OF THE NICA PROJECT E.P. Akishina 1, E.I. Alexandrov 1, I.N. Alexandrov 1, I.A. Filozova 1,2,3,a, V. Friese 4, K.V.

More information

LCG Conditions Database Project

LCG Conditions Database Project Computing in High Energy and Nuclear Physics (CHEP 2006) TIFR, Mumbai, 13 Feb 2006 LCG Conditions Database Project COOL Development and Deployment: Status and Plans On behalf of the COOL team (A.V., D.Front,

More information

Federated data storage system prototype for LHC experiments and data intensive science

Federated data storage system prototype for LHC experiments and data intensive science Federated data storage system prototype for LHC experiments and data intensive science A. Kiryanov 1,2,a, A. Klimentov 1,3,b, D. Krasnopevtsev 1,4,c, E. Ryabinkin 1,d, A. Zarochentsev 1,5,e 1 National

More information

Meta-cluster of distributed computing Dubna-Grid

Meta-cluster of distributed computing Dubna-Grid Meta-cluster of distributed computing Dubna-Grid Creation of a multi-purpose town scale network of distributed computing Peter Zrelov, JINR on behalf of the Dubna-Grid Collaboration AGREEMENT Between -

More information

CouchDB-based system for data management in a Grid environment Implementation and Experience

CouchDB-based system for data management in a Grid environment Implementation and Experience CouchDB-based system for data management in a Grid environment Implementation and Experience Hassen Riahi IT/SDC, CERN Outline Context Problematic and strategy System architecture Integration and deployment

More information

TERNARY TREES USAGE FOR COMPUTATIONAL EXPERIMENT DATA STORAGE

TERNARY TREES USAGE FOR COMPUTATIONAL EXPERIMENT DATA STORAGE TERNARY TREES USAGE FOR COMPUTATIONAL EXPERIMENT DATA STORAGE A.N. Kovartsev 1, D.A. Popova-Kovartseva 1, E.E. Gorshkova 1 1 Samara National Research University, Samara, Russia Abstract. Repository for

More information

New strategies of the LHC experiments to meet the computing requirements of the HL-LHC era

New strategies of the LHC experiments to meet the computing requirements of the HL-LHC era to meet the computing requirements of the HL-LHC era NPI AS CR Prague/Rez E-mail: adamova@ujf.cas.cz Maarten Litmaath CERN E-mail: Maarten.Litmaath@cern.ch The performance of the Large Hadron Collider

More information

CMS - HLT Configuration Management System

CMS - HLT Configuration Management System Journal of Physics: Conference Series PAPER OPEN ACCESS CMS - HLT Configuration Management System To cite this article: Vincenzo Daponte and Andrea Bocci 2015 J. Phys.: Conf. Ser. 664 082008 View the article

More information

Development of DKB ETL module in case of data conversion

Development of DKB ETL module in case of data conversion Journal of Physics: Conference Series PAPER OPEN ACCESS Development of DKB ETL module in case of data conversion To cite this article: A Y Kaida et al 2018 J. Phys.: Conf. Ser. 1015 032055 View the article

More information

The JINR Tier1 Site Simulation for Research and Development Purposes

The JINR Tier1 Site Simulation for Research and Development Purposes EPJ Web of Conferences 108, 02033 (2016) DOI: 10.1051/ epjconf/ 201610802033 C Owned by the authors, published by EDP Sciences, 2016 The JINR Tier1 Site Simulation for Research and Development Purposes

More information

CMS users data management service integration and first experiences with its NoSQL data storage

CMS users data management service integration and first experiences with its NoSQL data storage Journal of Physics: Conference Series OPEN ACCESS CMS users data management service integration and first experiences with its NoSQL data storage To cite this article: H Riahi et al 2014 J. Phys.: Conf.

More information

LHC Grid Computing in Russia: present and future

LHC Grid Computing in Russia: present and future Journal of Physics: Conference Series OPEN ACCESS LHC Grid Computing in Russia: present and future To cite this article: A Berezhnaya et al 2014 J. Phys.: Conf. Ser. 513 062041 Related content - Operating

More information

WLCG Transfers Dashboard: a Unified Monitoring Tool for Heterogeneous Data Transfers.

WLCG Transfers Dashboard: a Unified Monitoring Tool for Heterogeneous Data Transfers. WLCG Transfers Dashboard: a Unified Monitoring Tool for Heterogeneous Data Transfers. J Andreeva 1, A Beche 1, S Belov 2, I Kadochnikov 2, P Saiz 1 and D Tuckett 1 1 CERN (European Organization for Nuclear

More information

Intelligent flexible query answering Using Fuzzy Ontologies

Intelligent flexible query answering Using Fuzzy Ontologies International Conference on Control, Engineering & Information Technology (CEIT 14) Proceedings - Copyright IPCO-2014, pp. 262-277 ISSN 2356-5608 Intelligent flexible query answering Using Fuzzy Ontologies

More information

The High-Level Dataset-based Data Transfer System in BESDIRAC

The High-Level Dataset-based Data Transfer System in BESDIRAC The High-Level Dataset-based Data Transfer System in BESDIRAC T Lin 1,2, X M Zhang 1, W D Li 1 and Z Y Deng 1 1 Institute of High Energy Physics, 19B Yuquan Road, Beijing 100049, People s Republic of China

More information

INDEXING OF ATLAS DATA MANAGEMENT AND ANALYSIS SYSTEM

INDEXING OF ATLAS DATA MANAGEMENT AND ANALYSIS SYSTEM INDEXING OF ATLAS DATA MANAGEMENT AND ANALYSIS SYSTEM METADATA M.A. Grigoryeva 1,2,а, M.V. Golosova 1, A.A. Klimentov 3, M.S. Borodin 4, A.A. Alekseev 2, I.A. Tkachenko 1 1 National Research Center "Kurchatov

More information

AGIS: The ATLAS Grid Information System

AGIS: The ATLAS Grid Information System AGIS: The ATLAS Grid Information System Alexey Anisenkov 1, Sergey Belov 2, Alessandro Di Girolamo 3, Stavro Gayazov 1, Alexei Klimentov 4, Danila Oleynik 2, Alexander Senchenko 1 on behalf of the ATLAS

More information

PARAMETERS OF OPTIMUM HIERARCHY STRUCTURE IN AHP

PARAMETERS OF OPTIMUM HIERARCHY STRUCTURE IN AHP Analytic Hierarchy 014, Washington D.C., U.S.A. PARAMETERS OF OPTIMUM HIERARCHY STRUCTURE IN AHP Stan Lipovetsky GfK Custom Research North America Mineapolis, MN, USA E-mail: stan.lipovetsky@gfk.edu ABSTRACT

More information

Maximizing Bichromatic Reverse Nearest Neighbor for L p -Norm in Two- and Three-Dimensional Spaces

Maximizing Bichromatic Reverse Nearest Neighbor for L p -Norm in Two- and Three-Dimensional Spaces Noname manuscript No. (will be inserted by the editor) Raymond Chi-Wing Wong M. Tamer Özsu Ada Wai-Chee Fu Philip S. Yu Lian Liu Yubao Liu Maximizing Bichromatic Reverse Nearest Neighbor for L p -Norm

More information

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez Scientific data processing at global scale The LHC Computing Grid Chengdu (China), July 5th 2011 Who I am 2 Computing science background Working in the field of computing for high-energy physics since

More information

Lecture 2 The k-means clustering problem

Lecture 2 The k-means clustering problem CSE 29: Unsupervised learning Spring 2008 Lecture 2 The -means clustering problem 2. The -means cost function Last time we saw the -center problem, in which the input is a set S of data points and the

More information

Chapter 4: Multithreaded Programming

Chapter 4: Multithreaded Programming Chapter 4: Multithreaded Programming Chapter 4: Multithreaded Programming Overview Multicore Programming Multithreading Models Threading Issues Operating System Examples Objectives To introduce the notion

More information

Common Parallel Programming Paradigms

Common Parallel Programming Paradigms Parallel Program Models Last Time» Message Passing Model» Message Passing Interface (MPI) Standard» Examples Today» Embarrassingly Parallel» Master-Worer Reminders/Announcements» Homewor #3 is due Wednesday,

More information

arxiv: v1 [cs.dc] 12 May 2017

arxiv: v1 [cs.dc] 12 May 2017 GRID Storage Optimization in Transparent and User-Friendly Way for LHCb Datasets arxiv:1705.04513v1 [cs.dc] 12 May 2017 M Hushchyn 1,2, A Ustyuzhanin 1,3, P Charpentier 4 and C Haen 4 1 Yandex School of

More information

Chapter 4: Multithreaded Programming

Chapter 4: Multithreaded Programming Chapter 4: Multithreaded Programming Chapter 4: Multithreaded Programming Overview Multicore Programming Multithreading Models Threading Issues Operating System Examples Objectives To introduce the notion

More information

CSCS CERN videoconference CFD applications

CSCS CERN videoconference CFD applications CSCS CERN videoconference CFD applications TS/CV/Detector Cooling - CFD Team CERN June 13 th 2006 Michele Battistin June 2006 CERN & CFD Presentation 1 TOPICS - Some feedback about already existing collaboration

More information

Parallel Rendering. Johns Hopkins Department of Computer Science Course : Rendering Techniques, Professor: Jonathan Cohen

Parallel Rendering. Johns Hopkins Department of Computer Science Course : Rendering Techniques, Professor: Jonathan Cohen Parallel Rendering Molnar, Cox, Ellsworth, and Fuchs. A Sorting Classification of Parallel Rendering. IEEE Computer Graphics and Applications. July, 1994. Why Parallelism Applications need: High frame

More information

The CORAL Project. Dirk Düllmann for the CORAL team Open Grid Forum, Database Workshop Barcelona, 4 June 2008

The CORAL Project. Dirk Düllmann for the CORAL team Open Grid Forum, Database Workshop Barcelona, 4 June 2008 The CORAL Project Dirk Düllmann for the CORAL team Open Grid Forum, Database Workshop Barcelona, 4 June 2008 Outline CORAL - a foundation for Physics Database Applications in the LHC Computing Grid (LCG)

More information

Exploring Domains of Approximation in R 2 : Expository Essay

Exploring Domains of Approximation in R 2 : Expository Essay Exploring Domains of Approximation in R 2 : Expository Essay Nicolay Postarnakevich August 12, 2013 1 Introduction In this paper I explore the concept of the Domains of Best Approximations. These structures

More information

Reliability Engineering Analysis of ATLAS Data Reprocessing Campaigns

Reliability Engineering Analysis of ATLAS Data Reprocessing Campaigns Journal of Physics: Conference Series OPEN ACCESS Reliability Engineering Analysis of ATLAS Data Reprocessing Campaigns To cite this article: A Vaniachine et al 2014 J. Phys.: Conf. Ser. 513 032101 View

More information

Indexing and selection of data items in huge data sets by constructing and accessing tag collections

Indexing and selection of data items in huge data sets by constructing and accessing tag collections Indexing and selection of data items in huge data sets by constructing and accessing tag collections Sébastien Ponce CERN, Geneva LHCb Experiment sebastien.ponce@cern.ch tel +1-41-22-767-2143 Roger D.

More information

Grid Computing a new tool for science

Grid Computing a new tool for science Grid Computing a new tool for science CERN, the European Organization for Nuclear Research Dr. Wolfgang von Rüden Wolfgang von Rüden, CERN, IT Department Grid Computing July 2006 CERN stands for over 50

More information

DIRAC distributed secure framework

DIRAC distributed secure framework Journal of Physics: Conference Series DIRAC distributed secure framework To cite this article: A Casajus et al 2010 J. Phys.: Conf. Ser. 219 042033 View the article online for updates and enhancements.

More information

CS 345A Data Mining. MapReduce

CS 345A Data Mining. MapReduce CS 345A Data Mining MapReduce Single-node architecture CPU Machine Learning, Statistics Memory Classical Data Mining Dis Commodity Clusters Web data sets can be ery large Tens to hundreds of terabytes

More information

Detector Control LHC

Detector Control LHC Detector Control Systems @ LHC Matthias Richter Department of Physics, University of Oslo IRTG Lecture week Autumn 2012 Oct 18 2012 M. Richter (UiO) DCS @ LHC Oct 09 2012 1 / 39 Detectors in High Energy

More information

A host selection model for a distributed bandwidth broker

A host selection model for a distributed bandwidth broker A host selection model for a distributed bandwidth broker Christos Bouras Dimitris Primpas Research Academic Computer Technology Institute, Ν.Κazantzaki Str., Patras University 26500 Rion, Patras, Greece

More information

Monitoring System for the GRID Monte Carlo Mass Production in the H1 Experiment at DESY

Monitoring System for the GRID Monte Carlo Mass Production in the H1 Experiment at DESY Journal of Physics: Conference Series OPEN ACCESS Monitoring System for the GRID Monte Carlo Mass Production in the H1 Experiment at DESY To cite this article: Elena Bystritskaya et al 2014 J. Phys.: Conf.

More information

arxiv: v1 [physics.comp-ph] 25 Dec 2010

arxiv: v1 [physics.comp-ph] 25 Dec 2010 APS/123-QED Iteration Procedure for the N-Dimensional System of Linear Equations Avas V. Khugaev a arxiv:1012.5444v1 [physics.comp-ph] 25 Dec 2010 Bogoliubov Laboratory of Theoretical Physics, Joint Institute

More information

Monitoring of large-scale federated data storage: XRootD and beyond.

Monitoring of large-scale federated data storage: XRootD and beyond. Monitoring of large-scale federated data storage: XRootD and beyond. J Andreeva 1, A Beche 1, S Belov 2, D Diguez Arias 1, D Giordano 1, D Oleynik 2, A Petrosyan 2, P Saiz 1, M Tadel 3, D Tuckett 1 and

More information

Monte Carlo Production on the Grid by the H1 Collaboration

Monte Carlo Production on the Grid by the H1 Collaboration Journal of Physics: Conference Series Monte Carlo Production on the Grid by the H1 Collaboration To cite this article: E Bystritskaya et al 2012 J. Phys.: Conf. Ser. 396 032067 Recent citations - Monitoring

More information

Large Scale Software Building with CMake in ATLAS

Large Scale Software Building with CMake in ATLAS 1 Large Scale Software Building with CMake in ATLAS 2 3 4 5 6 7 J Elmsheuser 1, A Krasznahorkay 2, E Obreshkov 3, A Undrus 1 on behalf of the ATLAS Collaboration 1 Brookhaven National Laboratory, USA 2

More information

Implementation Principles of File Management System for Omega Parallel DBMS *

Implementation Principles of File Management System for Omega Parallel DBMS * Implementation Principles of File Management System for Omega Parallel DBMS * Mikhail L. Zymbler Chelyabinsk State University Russia mzym@cgu.chel.su Leonid B. Sokolinsky Chelyabinsk State University Russia

More information

Inverted Index for Fast Nearest Neighbour

Inverted Index for Fast Nearest Neighbour Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,

More information

Research on Quantitative and Semi-Quantitative Training Simulation of Network Countermeasure Jianjun Shen1,a, Nan Qu1,b, Kai Li1,c

Research on Quantitative and Semi-Quantitative Training Simulation of Network Countermeasure Jianjun Shen1,a, Nan Qu1,b, Kai Li1,c 2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) Research on Quantitative and Semi-Quantitative Training Simulation of Networ Countermeasure Jianjun

More information

Unified System for Processing Real and Simulated Data in the ATLAS Experiment

Unified System for Processing Real and Simulated Data in the ATLAS Experiment Unified System for Processing Real and Simulated Data in the ATLAS Experiment Mikhail Borodin Big Data Laboratory, National Research Centre "Kurchatov Institute", Moscow, Russia National Research Nuclear

More information

Automated Clustering-Based Workload Characterization

Automated Clustering-Based Workload Characterization Automated Clustering-Based Worload Characterization Odysseas I. Pentaalos Daniel A. MenascŽ Yelena Yesha Code 930.5 Dept. of CS Dept. of EE and CS NASA GSFC Greenbelt MD 2077 George Mason University Fairfax

More information

The LHC Computing Grid

The LHC Computing Grid The LHC Computing Grid Gergely Debreczeni (CERN IT/Grid Deployment Group) The data factory of LHC 40 million collisions in each second After on-line triggers and selections, only 100 3-4 MB/event requires

More information

Introduction to DBMS

Introduction to DBMS Introduction to DBMS Purpose of Database Systems View of Data Data Models Data Definition Language Data Manipulation Language Transaction Management Storage Management Database Administrator Database Users

More information

Summary of the LHC Computing Review

Summary of the LHC Computing Review Summary of the LHC Computing Review http://lhc-computing-review-public.web.cern.ch John Harvey CERN/EP May 10 th, 2001 LHCb Collaboration Meeting The Scale Data taking rate : 50,100, 200 Hz (ALICE, ATLAS-CMS,

More information

MANAGEMENT AND PLACEMENT OF REPLICAS IN A HIERARCHICAL DATA GRID

MANAGEMENT AND PLACEMENT OF REPLICAS IN A HIERARCHICAL DATA GRID MANAGEMENT AND PLACEMENT OF REPLICAS IN A HIERARCHICAL DATA GRID Ghalem Belalem 1 and Bakhta Meroufel 2 1 Department of Computer Science, Faculty of Sciences, University of Oran (Es Senia), Algeria ghalem1dz@gmail.com

More information

Evaluation of the computing resources required for a Nordic research exploitation of the LHC

Evaluation of the computing resources required for a Nordic research exploitation of the LHC PROCEEDINGS Evaluation of the computing resources required for a Nordic research exploitation of the LHC and Sverker Almehed, Chafik Driouichi, Paula Eerola, Ulf Mjörnmark, Oxana Smirnova,TorstenÅkesson

More information

ISTITUTO NAZIONALE DI FISICA NUCLEARE

ISTITUTO NAZIONALE DI FISICA NUCLEARE ISTITUTO NAZIONALE DI FISICA NUCLEARE Sezione di Perugia INFN/TC-05/10 July 4, 2005 DESIGN, IMPLEMENTATION AND CONFIGURATION OF A GRID SITE WITH A PRIVATE NETWORK ARCHITECTURE Leonello Servoli 1,2!, Mirko

More information

PARALLEL FRAMEWORK FOR PARTIAL WAVE ANALYSIS AT BES-III EXPERIMENT

PARALLEL FRAMEWORK FOR PARTIAL WAVE ANALYSIS AT BES-III EXPERIMENT PARALLEL FRAMEWORK FOR PARTIAL WAVE ANALYSIS AT BES-III EXPERIMENT V.A. Tokareva a, I.I. Denisenko Laboratory of Nuclear Problems, Joint Institute for Nuclear Research, 6 Joliot-Curie, Dubna, Moscow region,

More information

RDMS CMS Computing Activities before the LHC start

RDMS CMS Computing Activities before the LHC start RDMS CMS Computing Activities before the LHC start RDMS CMS computing model Tiers 1 CERN Collaborative centers: RCC RCC MSU, MSU, RRC RRC KI KI Kharkov Minsk JINR Erevan SINP MSU RDMS CMS Tier2 Tbilisi

More information

Data Integration and Data Warehousing Database Integration Overview

Data Integration and Data Warehousing Database Integration Overview Data Integration and Data Warehousing Database Integration Overview Sergey Stupnikov Institute of Informatics Problems, RAS ssa@ipi.ac.ru Outline Information Integration Problem Heterogeneous Information

More information

An SQL-based approach to physics analysis

An SQL-based approach to physics analysis Journal of Physics: Conference Series OPEN ACCESS An SQL-based approach to physics analysis To cite this article: Dr Maaike Limper 2014 J. Phys.: Conf. Ser. 513 022022 View the article online for updates

More information

Coded Distributed Computing: Straggling Servers and Multistage Dataflows

Coded Distributed Computing: Straggling Servers and Multistage Dataflows Coded Distributed Computing: Straggling Servers and Multistage Dataflows Songze Li, Mohammad Ali Maddah-Ali, and A. Salman Avestimehr Department of Electrical Engineering, University of Southern California,

More information

Andrea Sciabà CERN, Switzerland

Andrea Sciabà CERN, Switzerland Frascati Physics Series Vol. VVVVVV (xxxx), pp. 000-000 XX Conference Location, Date-start - Date-end, Year THE LHC COMPUTING GRID Andrea Sciabà CERN, Switzerland Abstract The LHC experiments will start

More information

Decomposition into modules

Decomposition into modules Programming Languages Seminar Program Structure and readability Lefel Yaniv Hagay Pollak 1 Decomposition into modules On the criteria to be used in decomposing systems into modules by D.L.Parnas.(1972)

More information

COMP3702/7702 Artificial Intelligence Week2: Search (Russell & Norvig ch. 3)" Hanna Kurniawati"

COMP3702/7702 Artificial Intelligence Week2: Search (Russell & Norvig ch. 3) Hanna Kurniawati COMP3702/7702 Artificial Intelligence Week2: Search (Russell & Norvig ch. 3)" Hanna Kurniawati" Last week" What is Artificial Intelligence?" Some history" Agent defined" The agent design problem" Search:

More information

GRID SCHEDULING USING ENHANCED PSO ALGORITHM

GRID SCHEDULING USING ENHANCED PSO ALGORITHM GRID SCHEDULING USING ENHANCED PSO ALGORITHM Mr. P.Mathiyalagan 1 U.R.Dhepthie 2 Dr. S.N.Sivanandam 3 1 Lecturer 2 Post Graduate Student 3 Professor and Head Department of Computer Science and Engineering

More information

CONTROL SYSTEM OF THE BOOSTER INJECTION POWER SUPPLY

CONTROL SYSTEM OF THE BOOSTER INJECTION POWER SUPPLY CONTROL SYSTEM OF THE BOOSTER INJECTION POWER SUPPLY A.A. Fateev 1, V.V. Tarasov 1, E.V. Gorbachev 1, H.P. Nazlev 1, 2 1 Laboratory of High Energy Physics, Joint Institute for Nuclear Research, 6 Joliot-Curie,

More information

Trigger and Data Acquisition at the Large Hadron Collider

Trigger and Data Acquisition at the Large Hadron Collider Trigger and Data Acquisition at the Large Hadron Collider Acknowledgments (again) This overview talk would not exist without the help of many colleagues and all the material available online I wish to

More information

The LCG 3D Project. Maria Girone, CERN. The 23rd Open Grid Forum - OGF23 4th June 2008, Barcelona. CERN IT Department CH-1211 Genève 23 Switzerland

The LCG 3D Project. Maria Girone, CERN. The 23rd Open Grid Forum - OGF23 4th June 2008, Barcelona. CERN IT Department CH-1211 Genève 23 Switzerland The LCG 3D Project Maria Girone, CERN The rd Open Grid Forum - OGF 4th June 2008, Barcelona Outline Introduction The Distributed Database (3D) Project Streams Replication Technology and Performance Availability

More information

A B2B Search Engine. Abstract. Motivation. Challenges. Technical Report

A B2B Search Engine. Abstract. Motivation. Challenges. Technical Report Technical Report A B2B Search Engine Abstract In this report, we describe a business-to-business search engine that allows searching for potential customers with highly-specific queries. Currently over

More information

A customizable approach to full lifecycle variability management

A customizable approach to full lifecycle variability management Science of Computer Programming 53 (2004) 259 284 www.elsevier.com/locate/scico A customizable approach to full lifecycle variability management Klaus Schmid, Isabel John Fraunhofer Institute for Experimental

More information

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Improving the Efficiency of Fast Using Semantic Similarity Algorithm International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year

More information

Analysis of Dynamic QoS Routing Algorithms for MPLS Networks

Analysis of Dynamic QoS Routing Algorithms for MPLS Networks Analysis of Dynamic QoS Routing Algorithms for MPLS Networks Antonio Capone, Luigi Fratta, Fabio Martignon Dipartimento Elettronica e Informazione Politecnico di Milano Pzza L Da Vinci 32, 2133 Milano

More information

Distributed Databases Systems

Distributed Databases Systems Distributed Databases Systems Lecture No. 01 Distributed Database Systems Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro

More information

Early experience with the Run 2 ATLAS analysis model

Early experience with the Run 2 ATLAS analysis model Early experience with the Run 2 ATLAS analysis model Argonne National Laboratory E-mail: cranshaw@anl.gov During the long shutdown of the LHC, the ATLAS collaboration redesigned its analysis model based

More information

Traffic engineering for MPLS-based virtual private networks

Traffic engineering for MPLS-based virtual private networks University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2002 Traffic engineering for MPLS-based virtual private networs Chuntung

More information

A New Segment Building Algorithm for the Cathode Strip Chambers in the CMS Experiment

A New Segment Building Algorithm for the Cathode Strip Chambers in the CMS Experiment EPJ Web of Conferences 108, 02023 (2016) DOI: 10.1051/ epjconf/ 201610802023 C Owned by the authors, published by EDP Sciences, 2016 A New Segment Building Algorithm for the Cathode Strip Chambers in the

More information

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques Doaa M. Alebiary Department of computer Science, Faculty of computers and informatics Benha University

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Chapter 2: Intro. To the Relational Model Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Database Management System (DBMS) DBMS is Collection of

More information

Improving Image Segmentation Quality Via Graph Theory

Improving Image Segmentation Quality Via Graph Theory International Symposium on Computers & Informatics (ISCI 05) Improving Image Segmentation Quality Via Graph Theory Xiangxiang Li, Songhao Zhu School of Automatic, Nanjing University of Post and Telecommunications,

More information

3D Kinematics. Consists of two parts

3D Kinematics. Consists of two parts D Kinematics Consists of two parts D rotation D translation The same as D D rotation is more complicated than D rotation (restricted to z-ais) Net, we will discuss the treatment for spatial (D) rotation

More information

Machine Learning for Video-Based Rendering

Machine Learning for Video-Based Rendering Machine Learning for Video-Based Rendering Arno Schödl arno@schoedl.org Irfan Essa irfan@cc.gatech.edu Georgia Institute of Technology GVU Center / College of Computing Atlanta, GA 30332-0280, USA. Abstract

More information

COOL performance optimization using Oracle hints

COOL performance optimization using Oracle hints COOL performance optimization using Oracle hints Andrea deavalassi ass and Romain Basset (IT-DM) With many thanks to Luca Canali for his help! IT-DM Database Developers Workshop, 8 th July 2008 CERN IT

More information

Automating usability of ATLAS Distributed Computing resources

Automating usability of ATLAS Distributed Computing resources Journal of Physics: Conference Series OPEN ACCESS Automating usability of ATLAS Distributed Computing resources To cite this article: S A Tupputi et al 2014 J. Phys.: Conf. Ser. 513 032098 Related content

More information

Verification and Diagnostics Framework in ATLAS Trigger/DAQ

Verification and Diagnostics Framework in ATLAS Trigger/DAQ Verification and Diagnostics Framework in ATLAS Trigger/DAQ M.Barczyk, D.Burckhart-Chromek, M.Caprini 1, J.Da Silva Conceicao, M.Dobson, J.Flammer, R.Jones, A.Kazarov 2,3, S.Kolos 2, D.Liko, L.Lucio, L.Mapelli,

More information

Introduction To Computers

Introduction To Computers Introduction To Computers Chapter No 7 Introduction To Databases Overview Introduction to database To make use of information, you have to be able to find the information Data files and databases are no

More information

Feature Selection Using Principal Feature Analysis

Feature Selection Using Principal Feature Analysis Feature Selection Using Principal Feature Analysis Ira Cohen Qi Tian Xiang Sean Zhou Thomas S. Huang Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign Urbana,

More information

A FAST CLUSTERING-BASED FEATURE SUBSET SELECTION ALGORITHM

A FAST CLUSTERING-BASED FEATURE SUBSET SELECTION ALGORITHM A FAST CLUSTERING-BASED FEATURE SUBSET SELECTION ALGORITHM Akshay S. Agrawal 1, Prof. Sachin Bojewar 2 1 P.G. Scholar, Department of Computer Engg., ARMIET, Sapgaon, (India) 2 Associate Professor, VIT,

More information

8 th International Conference on DEVELOPMENT AND APPLICATION SYSTEMS S u c e a v a, R o m a n i a, M a y 25 27,

8 th International Conference on DEVELOPMENT AND APPLICATION SYSTEMS S u c e a v a, R o m a n i a, M a y 25 27, 8 th International Conference on DEVELOPMENT AND APPLICATION SYSTEMS S u c e a v a, R o m a n i a, M a y 25 27, 2 0 0 6 A HARDWARE IMPLEMENTATION OF SAFE PETRI NET MODELS Viorica SUDACEVSCHI 1, Victor

More information

Unit 4: Formal Verification

Unit 4: Formal Verification Course contents Unit 4: Formal Verification Logic synthesis basics Binary-decision diagram (BDD) Verification Logic optimization Technology mapping Readings Chapter 11 Unit 4 1 Logic Synthesis & Verification

More information

INFORMATION RETRIEVAL SYSTEM: CONCEPT AND SCOPE

INFORMATION RETRIEVAL SYSTEM: CONCEPT AND SCOPE 15 : CONCEPT AND SCOPE 15.1 INTRODUCTION Information is communicated or received knowledge concerning a particular fact or circumstance. Retrieval refers to searching through stored information to find

More information

PARALLEL ALGORITHMS FOR SOLVING ORDINARY DIFFERENTIAL EQUATIONS

PARALLEL ALGORITHMS FOR SOLVING ORDINARY DIFFERENTIAL EQUATIONS Journal of Pure and Applied Mathematics: Advances and Applications Volume 1, Number 2, 2009, Pages 129-141 PARALLEL ALGORITHMS FOR SOLVING ORDINARY DIFFERENTIAL EQUATIONS Dipartimento Di Scienze Economiche

More information

and Photonics" of Russian Academy of Sciences, Samara, Russia

and Photonics of Russian Academy of Sciences, Samara, Russia APPLICATION OF THE METHOD OF PYRAMID FOR SYNTHESIS OF PARALLEL ALGORITHM FOR DIFFERENCE SOLUTION OF THE TWO- DIMENSIONAL PARTIAL DIFFERENTIALS EQUATION D.L. Golovashkin 1,2, L.V. Yablokova 1, E.V. Belova

More information

Programming II. Modularity 2017/18

Programming II. Modularity 2017/18 Programming II Modularity 2017/18 Module? Lecture Outline Evolution and history of programming languages Modularity Example History of Programming Programming Paradigms How and why languages develop? How

More information

Efficient Synthesis of Production Schedules by Optimization of Timed Automata

Efficient Synthesis of Production Schedules by Optimization of Timed Automata Efficient Synthesis of Production Schedules by Optimization of Timed Automata Inga Krause Institute of Automatic Control Engineering Technische Universität München inga.krause@mytum.de Joint Advanced Student

More information

Optimized Implementation of Logic Functions

Optimized Implementation of Logic Functions June 25, 22 9:7 vra235_ch4 Sheet number Page number 49 black chapter 4 Optimized Implementation of Logic Functions 4. Nc3xe4, Nb8 d7 49 June 25, 22 9:7 vra235_ch4 Sheet number 2 Page number 5 black 5 CHAPTER

More information

A Tool for Conditions Tag Management in ATLAS

A Tool for Conditions Tag Management in ATLAS A Tool for Conditions Tag Management in ATLAS A. Sharmazanashvili 1, G. Batiashvili 1, G. Gvaberidze 1, L. Shekriladze 1, A. Formica 2 on behalf of ATLAS collaboration 1 Georgian CADCAM Engineering Center

More information

Event cataloguing and other database applications in ATLAS

Event cataloguing and other database applications in ATLAS Event cataloguing and other database applications in ATLAS Dario Barberis Genoa University/INFN 1 Topics Event cataloguing: the new EventIndex DB Database usage by ATLAS in LHC Run2 PCD WS Ideas for a

More information

INTELLIGENT PROCESS SELECTION FOR NTM - A NEURAL NETWORK APPROACH

INTELLIGENT PROCESS SELECTION FOR NTM - A NEURAL NETWORK APPROACH International Journal of Industrial Engineering Research and Development (IJIERD), ISSN 0976 6979(Print), ISSN 0976 6987(Online) Volume 1, Number 1, July - Aug (2010), pp. 87-96 IAEME, http://www.iaeme.com/iierd.html

More information

The CMS Computing Model

The CMS Computing Model The CMS Computing Model Dorian Kcira California Institute of Technology SuperComputing 2009 November 14-20 2009, Portland, OR CERN s Large Hadron Collider 5000+ Physicists/Engineers 300+ Institutes 70+

More information

(Type your answer in radians. Round to the nearest hundredth as needed.)

(Type your answer in radians. Round to the nearest hundredth as needed.) 1. Find the exact value of the following expression within the interval (Simplify your answer. Type an exact answer, using as needed. Use integers or fractions for any numbers in the expression. Type N

More information

The V*-Diagram: A Query-Dependent Approach to Moving KNN Queries

The V*-Diagram: A Query-Dependent Approach to Moving KNN Queries The V*-Diagram: A Query-Dependent Approach to Moving KNN Queries Sarana Nutanong Rui Zhang Egemen Tanin Lars Kuli {sarana,rui,egemen,lars}@csse.unimelb.edu.au Department of Computer Science and Software

More information

ONLINE INDEXING FOR DATABASES USING QUERY WORKLOADS

ONLINE INDEXING FOR DATABASES USING QUERY WORKLOADS International Journal of Computer Science and Communication Vol. 2, No. 2, July-December 2011, pp. 427-433 ONLINE INDEXING FOR DATABASES USING QUERY WORKLOADS Shanta Rangaswamy 1 and Shobha G. 2 1,2 Department

More information