Database Roma Tre

Size: px
Start display at page:

Download "Database Roma Tre"

Transcription

1 Database Roma Tre DIPARTIMENTO DI INFORMATICA E AUTOMAZIONE July 2012 The Database Group is one of the six research groups of the Department of Computer Science and Automation (DIA Dipartimento di Informatica e Automazione) of Università Roma Tre. It is based in Via della Vasca Navale 79-81, 00146, Rome. The Department has currently 21 permanent faculty members, 3 temporary ones, 10 postdoc or other collaborators, and about 25 PhD students. All the faculty members belong to the School of Engineering and have the responsibility for all the core courses of the undergraduate and master s programs in Computer Engineering and in Automation Engineering, which have, overall (as of April 2012) 1330 students (1033 undergraduates and 307 at the master s level). They also teach service courses in the other engineering programs. The faculty members also form the Computer Engineering section of the Doctoral School in Engineering. The School of Engineering currently has 135 faculty members and 4100 students (cumulative undergraduate and master s). Overall, the university has eight schools, 920 faculty members and (undergraduate and master s) students. The Database Group is currently composed of five faculty members and eight other full-time members: Faculty Paolo Atzeni (Professore) Luca Cabibbo (Professore Associato) Valter Crescenzi (Ricercatore) Paolo Merialdo (Professore Associato) Riccardo Torlone (Professore) Postdocs, Ph.D. Students, Collaborators Roberto De Virgilio (Assegnista di ricerca) Mirko Bronzi (Assegnista di ricerca) Francesca Bugiotti (Assegnista di ricerca) Daniele Toti (Assegnista di ricerca) Celine Badr (PhD student) Antonio Maccioni (PhD student) Disheng Qiu (PhD student) Luca Rossi (PhD student) Recent former members Paolo Papotti (now with Qatar Computing Research Institute) Lorenzo Blanco (now with Google, UK) Pierluigi Del Nostro (now with CRMPA, Roma) Stefano Paolozzi (now with CRMPA, Roma) Paolo Cappellari (now with Collective[i], New York, USA) Giorgio Gianforme (now with Almawave s.r.l., Roma) Luigi Bellomarini (now with Banca d Italia) Fabrizio Celli (now with FAO) 1

2 The group is devoted to the development of new principles, methods and tools for the organization and management of information, in the form of databases. The focus is on the new requirements generated by the growth of the Internet and of the World-Wide- Web, with the possible availability, in most settings, of various sources of information. The sources can be heterogeneous (and need not be just databases, but Web sites or files) and it is important to offer users integrated and personalized views over them. The overall approach is to tackle problems that have a practical significance, providing both general solutions (with a theoretical background if relevant) as well as concrete tools that demonstrate the approach. Past topics of interest include Management of Web sites and applications by means of a database approach (Araneus) Wrappers for extraction of data from Web sites (RoadRunner) Updating object-oriented databases Management of data warehouses Database theory Recent major topics of interest are briefly described below. MIDST (Model-Independent Data and Schema Translation) The MIDST project has the goal of developing tools for the translation of database schemas and instances from a model to another. The approach is based on a "metamodel" notion: data models are described with reference to a small set of metaconstructs, and translations are specified on metaconstructs as well, so that they are reusable. The project started in its present form in 2003, as a follow-up of a previous project carried out in In MIDST, new techniques have been proposed for database translations from a model to another, for example from object oriented to SQL or from SQL to XML schema descriptions. The approach leverages a predefined, but large and extensible, set of models: given a source schema S expressed in a source model, and a target model TM, it generates a schema S expressed in TM that is "equivalent" to S. A wide family of models is handled by using a metamodel in which models can be succinctly and precisely described. The approach expresses the translation as Datalog rules and exposes the source and target of the translation in a generic relational dictionary. This makes the translation transparent, easy to customize and model-independent. The proposal includes automatic generation of translations, on the basis of a formal system that supports reasoning on signatures of modules and elementary translations. The original version of the approach generates offline translations, in the sense that schemas and databases are imported in the tool, translated and then exported in the target system. A second version has later been produced, with a run-time approach, where the translation of data is performed by views whose definition is generated by the tool, again with the metamodel approach. In this case, only schemas are imported in the tool. As a side topic, the same model-independent approach has been applied to other model management operators (merge and diff), leading to the proposal of the MISM (Model Independent Schema Management) platform. and most recent ones P. Atzeni, L. Bellomarini, F. Bugiotti, F. Celli, G. Gianforme, A runtime approach to model-generic translation of schema and data, Information Systems 37(3), May 2012, Paolo Atzeni, Giorgio Gianforme, Paolo Cappellari: Data model descriptions and translation signatures in a multi-model framework. Annals of Mathematics and Artificial Intelligence, 63(3-4): (2011). P. Atzeni, L. Bellomarini, F. Bugiotti, G. Gianforme: A runtime approach to model-independent schema and data translation. EDBT 2009: P. Atzeni, L. Bellomarini, F. Bugiotti, G. Gianforme: MISM: A Platform for Model-Independent Solutions to Model Management Problems. J. Data Semantics 14: (2009) P. Atzeni, P. Cappellari, R. Torlone, P. A. Bernstein,, G. Gianforme: Model-independent schema translation. VLDB J. 17(6): (2008) 2

3 SOS (Save Our Systems) This began as a follow-up of MIDST, but it has now become an independent project. It considers interoperability of systems in the so called nosql family. A first result here is the SOS platform, which allows for the uniform access to different systems in the family. Paolo Atzeni, Francesca, Bugiotti, Luca Rossi. Uniform Access to Non-relational Database Systems: The SOS Platform. CAiSE 2012, LNCS 7328, pp Paolo Atzeni, Francesca, Bugiotti, Luca Rossi. SOS (Save Our Systems): A uniform programming interface for non-relational systems. EDBT 2012, ACM, PRAISED: Automatic Abbreviations Discovery and Resolution A methodology for discovering and resolving protein names abbreviations from the full-text versions of scientific articles, was proposed and implemented in the PRAISED framework with the ultimate purpose of building up a publicly available abbreviation repository. Three processing steps lie at the core of the framework: i) an abbreviation identification phase, carried out via domain-independent metrics, whose purpose is to identify all possible abbreviations within a scientific text; ii) an abbreviation resolution phase, which takes into account a number of syntactical and semantic criteria in order to match an abbreviation with its potential explanation; and iii) a dictionary-based protein name identification, which is meant to select only those abbreviations belonging to the protein science domain. A more general approach, not related to specific domains is currently being developed. Daniele Toti, Paolo Atzeni, Fabio Polticelli, Automatic Protein Abbreviations Discovery and Resolution from Full-Text Scientific Papers: The PRAISED Framework. Bio-Algorithms and Medical-Systems Vol. 8, Issue 1 (Mar 2012), pp doi: /bams ISSN: X Paolo Atzeni, Fabio Polticelli, Daniele Toti Experimentation of an automatic resolution method for protein abbreviations in full-text papers, ACM BCB 2011: 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine Pages: Doi / Temporal Content Management A follow-up of previous work on Web data management. Paolo Atzeni, Stefano Paolozzi, Pierluigi Del Nostro: Temporal Content Management and Web Sites Modeling: Putting Them Together. T. Large-Scale Data- and Knowledge-Centered Systems, 5: (2012) Doi: dx.doi.org/ / _7. Nyaya: Reasoning over Large Semantic Datasets Nyaya is a system for the management of Semantic-Web data which couples a general-purpose and extensible storage mechanism with efficient ontology reasoning and querying capabilities. Nyaya processes large Semantic-Web datasets, expressed in multiple formalisms, by transforming them into a collection of Semantic Data Kiosks. The native meta-data of each kiosk is uniformly exposed using the Datalog± language, a powerful rule based modelling language for ontological databases. The kiosks form a Semantic Data Market where the data in each kiosk can be uniformly accessed using conjunctive queries and where users can specify user-defined constraints over the data. Nyaya is easily extensible and robust to updates of both data and meta-data in the kiosk and can readily adapt to different logical organization of the persistent storage. The approach has been experimented using well-known benchmarks, and compared to state-of-the-art research prototypes and commercial systems. 3

4 Roberto De Virgilio, Giorgio Orsi, Letizia Tanca, Riccardo Torlone: NYAYA: a System Supporting the Uniform Management of Large Sets of Semantic Data. ICDE Database preferences User preferences are a fundamental ingredient of personalized database applications, in particular those in which the user context plays a key role. Given a set of preferences defined in different contexts, we have studied the problem of deriving the preferences that hold in one of them, that is, how preferences propagate through contexts. For the sake of generality, the approach relies on an abstract context model, which only requires that the contexts form a poset. We have formalized the basic properties of the propagation process and have introduced an algebraic model for preference propagation that relies on two well-known operators for combining preferences: Pareto and Prioritized composition. We have also studied three alternative propagation methods and precisely characterize them in terms of the fairness and specificity properties. To our knowledge, these are the first results providing a theoretical foundation to the management of contextual preferences in database systems. Paolo Ciaccia, Riccardo Torlone: Modeling the Propagation of User Preferences. ER 2011: (Best paper award) Query relaxation Traditional information search in which queries are posed against a known and rigid schema over a structured database is shifting towards a Web scenario in which exposed schemas are vague or absent and therefore query answering cannot be precise, but needs to be relaxed in order to match user requests with accessible data. In this framework, we have proposed a logical model and an abstract query language as a foundation for querying data sets with vague schemas. Our approach takes advantages of the availability of taxonomies, that is, simple classifications of terms arranged in a hierarchical structure. The model is a natural extension of the relational model in which data domains are organized in hierarchies, according to different levels of generalization. The query language is a conservative extension of relational algebra where special operators allow the specification of relaxed queries over vaguely structured information. We have also studied equivalence and rewriting properties of the query language that can be used for query optimization. Davide Martinenghi, Riccardo Torlone: Querying Databases with Taxonomies. ER 2010: GAIA: Generic Mappings for Data Exchange We have addressed the novel problem of schema exchange, which naturally extends the data exchange process to collections of similar schemas: while the data exchange process operates over specific source and target schemas, the goal of schema exchange is rather the definition of generic transformations of data under structurally similar schemas. To this aim, we have introduced the notion of schema template, which is used to represent a class of different database schemas sharing the same structure. Then, given a mapping between the components of a source and a target template, the goal is the translation of any database whose schema conforms to the source template into a format conforming to the target template. This framework can be used to support several activities involved in the management of heterogeneous data sources: (i) the definition, once for all, of generic transformations that work for different but similar schemas, such as the denormalization of a pair of relation tables based on a foreign key between them; (ii) the reuse of a data exchange setting, since a mapping between templates can be derived from a mapping between schemas for later use in similar scenarios, and (iii) the specification of model translations, that is, translations of schemas and data from one data model to another (e.g., from relational to XML), a problem largely studied in recent years. 4

5 Paolo Papotti, Riccardo Torlone: Schema exchange: Generic mappings for transforming data and metadata. Data Knowl. Eng. 68(7): (2009) Paolo Papotti, Riccardo Torlone: Automatic Generation of Model Translations. CAiSE 2007: RFID data management Radio Frequency Identification (RFID) technology plays a key role in supply chains and a challenging problem is the effective and efficient management of the enormous volume of data generated by such systems. In this scenario, we have studied the problem of storing and querying a large amount of RFID data. Our approach relies on a compression technique which allows a significant saving of space according to a notion of aggregates over RFID data and a logical representation of these aggregates. We have proposed an indexing technique for aggregates of RFID data that guarantees the efficient execution of an important class of queries. Finally, we have defined the architecture of a tool implementing our approach and demonstrated, with a number of experimental results made with this tool, the feasibility and effectiveness of the underlying techniques. Roberto De Virgilio, Pierpaolo Sugamiele, Riccardo Torlone: Incremental aggregation of RFID data. IDEAS 2009: Flint: extraction and integration of Web data A large and increasing number of web sites publish structured data about recognizable concepts (such as stock quotes, movies, restaurants, etc.). The great chance to create applications that rely on the huge amount of data taken from these sites has been discussed for more than a decade now, but in practice only a small fraction of such information is currently used. The main reason is that extracting, curating and integrating web data is an expensive task, which often requires human intervention. The Flint project aims at developing automatic and domain independent tools to support the main tasks to benefit from Web data: discovering data intensive web sites containing information about entities of interest, extracting and integrating the published data, and performing a probabilistic analysis to characterize the impreciseness of the data and the accuracy of the sources. Lorenzo Blanco, Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti: Automatically building probabilistic databases from the web. WWW (Companion Volume) 2011: Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti: Wrapper Generation for Overlapping Web Sources. ACM Web Intelligence 2011: Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti: Probabilistic Models to Reconcile Complex Data from Inaccurate Data Sources. CAiSE 2010: Paolo Papotti, Valter Crescenzi, Paolo Merialdo, Mirko Bronzi, Lorenzo Blanco: Redundancy- Driven Web Data Extraction and Integration. WebDB 2010 Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti: Flint: Google-basing the Web. EDBT 2008: Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti: Supporting the automatic construction of entity aware search engines. ACM WIDM 2008: Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo: Structure and Semantics of Data-IntensiveWeb Pages: An Experimental Study on their Relationships. J. UCS 14(11): (2008) Schema mapping A mapping system is a tool supporting the following scenario: given a source schema, a target schema, and a mapping between these two schemas, expressed as a set of attribute correspondences, generate an executable transformation (i.e., a set of queries) to compute target instances from source instances. In the context of relational mapping systems, we have proposed an extension of the well know Clio framework, to take into account mappings between relational schemas with keys, foreign keys and 5

6 nullable attributes. Specifically, we extended the two main components of a mapping system (a schema mapping generation algorithm and a query generation algorithm) to deal with such integrity constraints. As a further contribution, we have introduced referenced-attribute correspondences, which permit to specify more precise mappings than traditional attribute correspondences, while retaining a simple and intuitive semantics. Luca Cabibbo: On keys, foreign keys and nullable attributes in relational mapping systems, Proceedings of the 12th International Conference on Extending Database Technology, EDBT 2009, 2009, Textbooks for the database field and other general publications Memebers of the group maintain a set of textbooks and courses material for teaching databases in universities. It is carried out together with colleagues at Politecnico di Milano, and has led to the most popular pair of textbooks in the field in Italy, with reference to both introductory and advanced courses in databases methodsand technology. A new edition of each of the books is produced every two or three years, with attention to new development of the technology and to new teaching techniques. Additional writing material is produced from time to time. and most recent ones P. Atzeni, S. Ceri, S. Paraboschi, R. Torlone Basi di dati: modelli e linguaggi di interrogazione McGraw-Hill Italia, terza edizione, P. Atzeni, S. Ceri, P. Fraternali, S. Paraboschi, R. Torlone Basi di dati: Architetture e linee di evoluzione, seconda edizione, McGraw-Hill Italia, P. Atzeni, S. Ceri, S. Paraboschi, R. Torlone: Database Systems - Concepts, Languages and Architectures McGraw-Hill Book Company 1999 Brief CVs for the permanent members of the group Paolo Atzeni is a database professor at Università Roma Tre. He received his Dr. Ing. degree in Electrical Engineering from Università di Roma "La Sapienza" in Before joining Università Roma Tre, he was with IASI-CNR in Rome, then a faculty member at Università di Napoli and later a professor at Università di Roma La Sapienza. He also had visiting appointments at the University of Toronto, at Università dell'aquila, at Microsoft Research and at the National University of Singapore. He has worked on various topics in the database field, including relational database theory, conceptual models and design tools, deductive databases, databases and the Web, model management, cooperation of database systems. He is the leader of the database group at Roma Tre, which includes six faculty members and various postdocs and students. They collaborate with various groups in Italy and abroad, on topics that include data models, data warehouses, data in the Web world. He is currently the vicepresident of the VLDB Endowment and a member of the Executive Board of the EDBT Association, of which he is also past President. Luca Cabibbo is associate professor at the School of Engineering of Università Roma Tre. He is with Università Roma Tre since 1997, previously as a research associate. He graduated with honors in Electrical Engineering in 1992 from Università di Roma La Sapienza. In 1996 he received his PhD, also from Università di Roma La Sapienza, under the supervision of Paolo Atzeni, with a thesis on Querying and updating complex-object databases. His main research interests are in the area of databases and information systems and include: models and languages for object-oriented databases; cooperative database systems; methods and tools for data warehousing and multi-dimensional analysis; models and tools for object-relational mapping (that is, for the transparent management of object persistence by means of relational databases). On these topics, he has published several papers on important international database journals, including ACM Transaction on Database Systems and Information and Computation, as well as on the proceedings of important international database conferences (ACM PODS, IEEE-ICDE, ICDT, EDBT). Valter Crescenzi is Assistant Professor at Università degli Studi Roma Tre from He received his 6

7 Computer Engineering degree (Laurea in Ingegneria Informatica) from Università degli Studi Roma Tre, in In 2001 he received his PhD, from Università degli Studi di Roma "La Sapienza", under the supervision of prof. Paolo Atzeni. During his PhD program he also spent six months at the UCSD of San Diego, working with prof. Bertram Ludaescher. His research interests include information extraction and data management techniques for Web data. He has published his research results in important journals of the field, including Journal of the ACM, IEEE Transactions on Knowledge and Data Engineering, Journal of Applied Artificial Intelligence, and in the refereed proceedings of the major conferences (VLDB). Paolo Merialdo is Associate Professor at Università degli Studi Roma Tre from He received his Computer Engineering degree (Laurea in Ingegneria Elettronica) from Università degli Studi di Genova, in In 1998 he received his PhD, from Università degli Studi di Roma "La Sapienza", under the supervision of prof. Paolo Atzeni. During his PhD program he also spent six months at the University of Toronto, working with prof. Alberto Mendelzon. His research interests include information extraction and data management techniques for Web data. He has published his research results in important journals of the field, including ACM Transactions on Internet Technology, IEEE Transactions on Knowledge and Data Engineering, IEEE Internet Computing, Journal of Applied Artificial Intelligence, and in the refereed proceedings of the major conferences (ACM-SIGMOD, VLDB, EDBT). He has been program committee member for many international conferences. He served as Associate Director for ACM SIGMOD-RECORD ( ). He is co-founder of InnovAction Lab, an Entrepreneurship Program for master students. Riccardo Torlone is a professor in the area of Information Systems at Università Roma Tre. He received his Dr. Ing. degree in Electrical Engineering from Università di Roma "La Sapienza". Before joining Università Roma Tre, he was member of the research staff at IASI-CNR in Rome, where he has still a research appointment. He also had a visiting research appointment at the University of California Los Angeles. His research has considered various topics in the database field, including the following: relational database theory, active and deductive databases, CASE tools for database design, models and languages for object-oriented databases, data warehouses and OLAP systems, Web based information systems, data and metadata exchange, adaptive information systems and personalization. He has published his research results in the major journals of the field, including ACM Transactions of Database Systems, VLDB Journal, Information Systems, SIAM Journal of Computing, IEEE Transaction on Data and Knowledge Engineering, Distributed and Parallel Databases and in the refereed proceedings of all the major conferences (ACM-SIGMOD, VLDB, EDBT, ACM-PODS, IEEE-ICDE, ICDT, ER, CIKM). He has authored the most spread book on databases in Italy, published also in an international edition and in several versions. He has also authored two other books. 7

MISM: A platform for model-independent solutions to model management problems

MISM: A platform for model-independent solutions to model management problems MISM: A platform for model-independent solutions to model management problems Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, and Giorgio Gianforme Dipartimento di informatica e automazione Università

More information

On the Integration of Autonomous Data Marts

On the Integration of Autonomous Data Marts On the Integration of Autonomous Data Marts Luca Cabibbo and Riccardo Torlone Dipartimento di Informatica e Automazione Università di Roma Tre {cabibbo,torlone}@dia.uniroma3.it Abstract We address the

More information

is that the question?

is that the question? To structure or not to structure, is that the question? Paolo Atzeni Based on work done with (or by!) G. Mecca, P.Merialdo, P. Papotti, and many others Lyon, 24 August 2009 Content A personal (and questionable

More information

Management of schema translations in a model generic framework

Management of schema translations in a model generic framework Management of schema translations in a model generic framework Paolo Atzeni Università Roma Tre Joint work with Paolo Cappellari, Giorgio Gianforme (Università Roma Tre) and Phil Bernstein (Microsoft Research)

More information

WEB SITES NEED MODELS AND SCHEMES

WEB SITES NEED MODELS AND SCHEMES WEB SITES NEED MODELS AND SCHEMES Paolo Atzeni atzeni@dia.uniroma3.it http://www.dia.uniroma3.it/~atzeni Dipartimento di informatica e automazione Università di Roma Tre Outline Databases and information

More information

Paolo Merialdo Associate Professor Roma Tre University Department of Engineering - Computer Engineering and Automation

Paolo Merialdo Associate Professor Roma Tre University Department of Engineering - Computer Engineering and Automation Paolo Merialdo Associate Professor Roma Tre University Department of Engineering - Computer Engineering and Automation ASN Full Professor 09/H1 Sistemi di Elaborazione delle informazioni obtained on 04/04/2017

More information

Sangam: A Framework for Modeling Heterogeneous Database Transformations

Sangam: A Framework for Modeling Heterogeneous Database Transformations Sangam: A Framework for Modeling Heterogeneous Database Transformations Kajal T. Claypool University of Massachusetts-Lowell Lowell, MA Email: kajal@cs.uml.edu Elke A. Rundensteiner Worcester Polytechnic

More information

Introduction to Databases Fall-Winter 2009/10. Syllabus

Introduction to Databases Fall-Winter 2009/10. Syllabus Introduction to Databases Fall-Winter 2009/10 Syllabus Werner Nutt Syllabus Lecturer Werner Nutt, nutt@inf.unibz.it, Room TRA 2.01 Office hours: Thursday, 16:00 18:00 (If you want to meet up with me, send

More information

A Framework for the Internationalization of Data-Intensive Web Applications

A Framework for the Internationalization of Data-Intensive Web Applications A Framework for the Internationalization of Data-Intensive Web Applications Alberto Belussi and Roberto Posenato Dipartimento di Informatica - Università degli Studi di Verona Ca Vignal, Strada le Grazie,

More information

Data and Metadata Management

Data and Metadata Management Data and Metadata Management Paolo Atzeni and Riccardo Torlone Abstract In this chapter we illustrate fundamental notions of Data Management. We start from the basic concepts of database schema and instance,

More information

Lessons Learned and Research Agenda for Big Data Integration of Product Specifications (Discussion Paper)

Lessons Learned and Research Agenda for Big Data Integration of Product Specifications (Discussion Paper) Lessons Learned and Research Agenda for Big Data Integration of Product Specifications (Discussion Paper) Luciano Barbosa 1, Valter Crescenzi 2, Xin Luna Dong 3, Paolo Merialdo 2, Federico Piai 2, Disheng

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW PAPER ON IMPLEMENTATION OF DOCUMENT ANNOTATION USING CONTENT AND QUERYING

More information

A runtime approach to model-independent schema and data translation

A runtime approach to model-independent schema and data translation A runtime approach to model-independent schema and data translation Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, Giorgio Gianforme Università Roma Tre atzeni@dia.uniroma3.it, bellomarini@yahoo.it,

More information

Course on Database Design Carlo Batini University of Milano Bicocca, Italy

Course on Database Design Carlo Batini University of Milano Bicocca, Italy Course on Database Design Carlo Batini University of Milano Bicocca, Italy 1 Course on Database Design The course is made of six parts: Part 0 What you will learn in this course Part 1 Introduction to

More information

Database Design for NoSQL Systems

Database Design for NoSQL Systems DIPARTIMENTO DI INFORMATICA E AUTOMAZIONE Via della Vasca Navale, 79 00146 Roma, Italy Database Design for NoSQL Systems FRANCESCA BUGIOTTI 1, LUCA CABIBBO 2, PAOLO ATZENI 2, RICCARDO TORLONE 2 RT-DIA-210-2014

More information

Introduction to Databases Fall-Winter 2010/11. Syllabus

Introduction to Databases Fall-Winter 2010/11. Syllabus Introduction to Databases Fall-Winter 2010/11 Syllabus Werner Nutt Syllabus Lecturer Werner Nutt, nutt@inf.unibz.it, Room POS 2.09 Office hours: Tuesday, 14:00 16:00 and by appointment (If you want to

More information

Redundancy-Driven Web Data Extraction and Integration

Redundancy-Driven Web Data Extraction and Integration Redundancy-Driven Web Data Extraction and Integration Lorenzo Blanco, Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti Università degli Studi Roma Tre Dipartimento di Informatica e Automazione

More information

ANDREAS PIERIS JOURNAL PAPERS

ANDREAS PIERIS JOURNAL PAPERS ANDREAS PIERIS School of Informatics, University of Edinburgh Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, Scotland, UK apieris@inf.ed.ac.uk PUBLICATIONS (authors in alphabetical order) JOURNAL

More information

Reasoning on Business Processes and Ontologies in a Logic Programming Environment

Reasoning on Business Processes and Ontologies in a Logic Programming Environment Reasoning on Business Processes and Ontologies in a Logic Programming Environment Michele Missikoff 1, Maurizio Proietti 1, Fabrizio Smith 1,2 1 IASI-CNR, Viale Manzoni 30, 00185, Rome, Italy 2 DIEI, Università

More information

ISSN (Online) ISSN (Print)

ISSN (Online) ISSN (Print) Accurate Alignment of Search Result Records from Web Data Base 1Soumya Snigdha Mohapatra, 2 M.Kalyan Ram 1,2 Dept. of CSE, Aditya Engineering College, Surampalem, East Godavari, AP, India Abstract: Most

More information

Introduction to the course

Introduction to the course Database Management Systems Introduction to the course 1 Transaction processing On Line Transaction Processing (OLTP) Traditional DBMS usage Characterized by snapshot of current data values detailed data,

More information

Schema Exchange: a Template-based Approach to Data and Metadata Translation

Schema Exchange: a Template-based Approach to Data and Metadata Translation Schema Exchange: a Template-based Approach to Data and Metadata Translation Paolo Papotti and Riccardo Torlone Università Roma Tre {papotti,torlone}@dia.uniroma3.it Abstract. In this paper we study the

More information

XXII. Website Design. The Web

XXII. Website Design. The Web XXII. Website Design The Web Hypertext Data Independence Data Models for Hypertext Documents The Araneus Data Model (ADM) The Navigational Conceptual Model (NCM) The Araneus Methodology for Website Design

More information

Schema Exchange: a Template-based Approach to Data and Metadata Translation

Schema Exchange: a Template-based Approach to Data and Metadata Translation Schema Exchange: a Template-based Approach to Data and Metadata Translation Paolo Papotti and Riccardo Torlone Università Roma Tre {papotti,torlone}@dia.uniroma3.it Abstract. We study the schema exchange

More information

SHORT SURVEY ON GRAPHICAL DATABASE

SHORT SURVEY ON GRAPHICAL DATABASE SHORT SURVEY ON GRAPHICAL DATABASE Harsha R Vyavahare 1 Sipna COET, Computer Department, SGBAU University,Amravati, Amravati, Maharastra 444607, India hvyawahare01@gmail.com Dr.P.P.Karde 2 HOD, GOVT polytechnic

More information

Schema and Data Translation

Schema and Data Translation Schema and Data Translation Paolo Atzeni Università Roma Tre Based on Paper in EDBT 1996 (with R. Torlone) Paper in EDBT 2006 (with P. Cappellari and P. Bernstein) Demo in Sigmod 2007 (with P. Cappellari

More information

Keywords Data alignment, Data annotation, Web database, Search Result Record

Keywords Data alignment, Data annotation, Web database, Search Result Record Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web

More information

Database Design for NoSQL Systems

Database Design for NoSQL Systems Database Design for NoSQL Systems Francesca Bugiotti, Luca Cabibbo, Paolo Atzeni, Riccardo Torlone To cite this version: Francesca Bugiotti, Luca Cabibbo, Paolo Atzeni, Riccardo Torlone. Database Design

More information

Mobile robots and appliances to support the elderly people

Mobile robots and appliances to support the elderly people Microsoft Research Embedded Systems Invitation for Proposal Mobile robots and appliances to support the elderly people Luca Iocchi, Daniele Nardi Dipartimento di Informatica e Sistemistica Università di

More information

Data Quality Improvement in the DaQuinCIS System

Data Quality Improvement in the DaQuinCIS System Dipartimento di Informatica e Sistemistica Antonio Ruberti Università degli Studi di Roma La Sapienza Data Quality Improvement in the DaQuinCIS System Luca De Santis, Diego Milano, Gabriele Palmieri Monica

More information

Proposal for Implementing Linked Open Data on Libraries Catalogue

Proposal for Implementing Linked Open Data on Libraries Catalogue Submitted on: 16.07.2018 Proposal for Implementing Linked Open Data on Libraries Catalogue Esraa Elsayed Abdelaziz Computer Science, Arab Academy for Science and Technology, Alexandria, Egypt. E-mail address:

More information

Reuse of Schema Mappings for Data Transformation Design

Reuse of Schema Mappings for Data Transformation Design Dipartimento di Ingegneria Via della Vasca Navale, 79 00146 Roma, Italy Reuse of Schema Mappings for Data Transformation Design Paolo Atzeni 1, Luigi Bellomarini 1,2, Paolo Papotti 3, Riccardo Torlone

More information

MIWeb: Mediator-based Integration of Web Sources

MIWeb: Mediator-based Integration of Web Sources MIWeb: Mediator-based Integration of Web Sources Susanne Busse and Thomas Kabisch Technical University of Berlin Computation and Information Structures (CIS) sbusse,tkabisch@cs.tu-berlin.de Abstract MIWeb

More information

A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS

A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS A GML SCHEMA MAPPING APPROACH TO OVERCOME SEMANTIC HETEROGENEITY IN GIS Manoj Paul, S. K. Ghosh School of Information Technology, Indian Institute of Technology, Kharagpur 721302, India - (mpaul, skg)@sit.iitkgp.ernet.in

More information

Novel Materialized View Selection in a Multidimensional Database

Novel Materialized View Selection in a Multidimensional Database Graphic Era University From the SelectedWorks of vijay singh Winter February 10, 2009 Novel Materialized View Selection in a Multidimensional Database vijay singh Available at: https://works.bepress.com/vijaysingh/5/

More information

Semistructured Data and Mediation

Semistructured Data and Mediation Semistructured Data and Mediation Prof. Letizia Tanca Dipartimento di Elettronica e Informazione Politecnico di Milano SEMISTRUCTURED DATA FOR THESE DATA THERE IS SOME FORM OF STRUCTURE, BUT IT IS NOT

More information

First International Workshop on Managing Data Throughout its Lifecycle. in conjunction with ICDE 2011 April 11, 2011, Hannover, Germany

First International Workshop on Managing Data Throughout its Lifecycle. in conjunction with ICDE 2011 April 11, 2011, Hannover, Germany First International Workshop on Managing Data Throughout its Lifecycle (DaLí 2011) in conjunction with ICDE 2011 April 11, 2011, Hannover, Germany 40 ICDE Workshops 2011 Preface In the early days of data

More information

Inference in Hierarchical Multidimensional Space

Inference in Hierarchical Multidimensional Space Proc. International Conference on Data Technologies and Applications (DATA 2012), Rome, Italy, 25-27 July 2012, 70-76 Related papers: http://conceptoriented.org/ Inference in Hierarchical Multidimensional

More information

DL User Interfaces. Giuseppe Santucci Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza

DL User Interfaces. Giuseppe Santucci Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza DL User Interfaces Giuseppe Santucci Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza Delos work on DL interfaces Delos Cluster 4: User interfaces and visualization Cluster s goals:

More information

Using Data-Extraction Ontologies to Foster Automating Semantic Annotation

Using Data-Extraction Ontologies to Foster Automating Semantic Annotation Using Data-Extraction Ontologies to Foster Automating Semantic Annotation Yihong Ding Department of Computer Science Brigham Young University Provo, Utah 84602 ding@cs.byu.edu David W. Embley Department

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

Data integration supports seamless access to autonomous, heterogeneous information

Data integration supports seamless access to autonomous, heterogeneous information Using Constraints to Describe Source Contents in Data Integration Systems Chen Li, University of California, Irvine Data integration supports seamless access to autonomous, heterogeneous information sources

More information

Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD

Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD Matteo Golfarelli Stefano Rizzi Elisa Turricchia University of Bologna - Italy 13th International Conference on Data Warehousing

More information

MERGING BUSINESS VOCABULARIES AND RULES

MERGING BUSINESS VOCABULARIES AND RULES MERGING BUSINESS VOCABULARIES AND RULES Edvinas Sinkevicius Departament of Information Systems Centre of Information System Design Technologies, Kaunas University of Lina Nemuraite Departament of Information

More information

Web Security Vulnerabilities: Challenges and Solutions

Web Security Vulnerabilities: Challenges and Solutions Web Security Vulnerabilities: Challenges and Solutions A Tutorial Proposal for ACM SAC 2018 by Dr. Hossain Shahriar Department of Information Technology Kennesaw State University Kennesaw, GA 30144, USA

More information

TITLE OF COURSE SYLLABUS, SEMESTER, YEAR

TITLE OF COURSE SYLLABUS, SEMESTER, YEAR TITLE OF COURSE SYLLABUS, SEMESTER, YEAR Instructor Contact Information Jennifer Weller Jweller2@uncc.edu Office Hours Time/Location of Course Mon 9-11am MW 8-9:15am, BINF 105 Textbooks Needed: none required,

More information

Paolo Terenziani PERSONAL DATA. Paolo Terenziani was born in Torino on BIO AND EDUCATION

Paolo Terenziani PERSONAL DATA. Paolo Terenziani was born in Torino on BIO AND EDUCATION Paolo Terenziani PERSONAL DATA Paolo Terenziani was born in Torino on 4.7.1963. BIO AND EDUCATION On 3/7/1987 he got the Master degree in Computer Science at the University of Turin with 110/110 and laude.

More information

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal Heinrich Widmann, DKRZ DI4R 2016, Krakow, 28 September 2016 www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020

More information

Conceptual Database Modeling

Conceptual Database Modeling Course A7B36DBS: Database Systems Lecture 01: Conceptual Database Modeling Martin Svoboda Irena Holubová Tomáš Skopal Faculty of Electrical Engineering, Czech Technical University in Prague Course Plan

More information

Modelling Data Warehouses with Multiversion and Temporal Functionality

Modelling Data Warehouses with Multiversion and Temporal Functionality Modelling Data Warehouses with Multiversion and Temporal Functionality Waqas Ahmed waqas.ahmed@ulb.ac.be Université Libre de Bruxelles Poznan University of Technology July 9, 2015 ITBI DC Outline 1 Introduction

More information

SEMANTIC WEB DATA MANAGEMENT. from Web 1.0 to Web 3.0

SEMANTIC WEB DATA MANAGEMENT. from Web 1.0 to Web 3.0 SEMANTIC WEB DATA MANAGEMENT from Web 1.0 to Web 3.0 CBD - 21/05/2009 Roberto De Virgilio MOTIVATIONS Web evolution Self-describing Data XML, DTD, XSD RDF, RDFS, OWL WEB 1.0, WEB 2.0, WEB 3.0 Web 1.0 is

More information

An Approach to Intensional Query Answering at Multiple Abstraction Levels Using Data Mining Approaches

An Approach to Intensional Query Answering at Multiple Abstraction Levels Using Data Mining Approaches An Approach to Intensional Query Answering at Multiple Abstraction Levels Using Data Mining Approaches Suk-Chung Yoon E. K. Park Dept. of Computer Science Dept. of Software Architecture Widener University

More information

Hierarchical Clustering of Process Schemas

Hierarchical Clustering of Process Schemas Hierarchical Clustering of Process Schemas Claudia Diamantini, Domenico Potena Dipartimento di Ingegneria Informatica, Gestionale e dell'automazione M. Panti, Università Politecnica delle Marche - via

More information

WEB APPLICATIONS: OUR IDEA FOR A DEVELOPMENT FRAMEWORK

WEB APPLICATIONS: OUR IDEA FOR A DEVELOPMENT FRAMEWORK WEB APPLICATIONS: OUR IDEA FOR A DEVELOPMENT FRAMEWORK Luca Contursi luca.contursi@unile.it Roberto Paiano roberto.paiano@unile.it Andrea Pandurino andrea.pandurino@unile.it ABSTRACT The increased complexity

More information

The Problem of Semantics in the Metadata Mess

The Problem of Semantics in the Metadata Mess The Problem of Semantics in the Metadata Mess V.M. David Maier Portland State University Figure: CMOP s Virtual Columbia River With thanks to the scientists at Center for Coastal Margin Observation and

More information

Information Management (IM)

Information Management (IM) 1 2 3 4 5 6 7 8 9 Information Management (IM) Information Management (IM) is primarily concerned with the capture, digitization, representation, organization, transformation, and presentation of information;

More information

Development of Educational Software

Development of Educational Software Development of Educational Software Rosa M. Reis Abstract The use of computer networks and information technology are becoming an important part of the everyday work in almost all professions, especially

More information

Specific Objectives Contents Teaching Hours 4 the basic concepts 1.1 Concepts of Relational Databases

Specific Objectives Contents Teaching Hours 4 the basic concepts 1.1 Concepts of Relational Databases Course Title: Advanced Database Management System Course No. : ICT. Ed 525 Nature of course: Theoretical + Practical Level: M.Ed. Credit Hour: 3(2T+1P) Semester: Second Teaching Hour: 80(32+8) 1. Course

More information

An Ontological Analysis of Metamodeling Languages

An Ontological Analysis of Metamodeling Languages An Ontological Analysis of Metamodeling Languages Erki Eessaar and Rünno Sgirka 2 Department of Informatics, Tallinn University of Technology, Estonia, eessaar@staff.ttu.ee 2 Department of Informatics,

More information

Maintenance of Data and Metadata in Web Site Management Systems

Maintenance of Data and Metadata in Web Site Management Systems Università degli Studi di Roma La Sapienza Dottorato di Ricerca in Informatica Medica XI Ciclo 1999 Maintenance of Data and Metadata in Web Site Management Systems Giuseppe Sindoni Università degli Studi

More information

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining. About the Tutorial Data Mining is defined as the procedure of extracting information from huge sets of data. In other words, we can say that data mining is mining knowledge from data. The tutorial starts

More information

Fausto Giunchiglia and Mattia Fumagalli

Fausto Giunchiglia and Mattia Fumagalli DISI - Via Sommarive 5-38123 Povo - Trento (Italy) http://disi.unitn.it FROM ER MODELS TO THE ENTITY MODEL Fausto Giunchiglia and Mattia Fumagalli Date (2014-October) Technical Report # DISI-14-014 From

More information

An Approach to Heterogeneous Data Translation based on XML Conversion

An Approach to Heterogeneous Data Translation based on XML Conversion An Approach to Heterogeneous Data Translation based on XML Conversion Paolo Papotti and Riccardo Torlone Dipartimento di Informatica e Automazione Università RomaTre {papotti,torlone}@dia.uniroma3.it Abstract.

More information

Which Role for an Ontology of Uncertainty?

Which Role for an Ontology of Uncertainty? Which Role for an Ontology of Uncertainty? Paolo Ceravolo, Ernesto Damiani, Marcello Leida Dipartimento di Tecnologie dell Informazione - Università degli studi di Milano via Bramante, 65-26013 Crema (CR),

More information

PRIOR System: Results for OAEI 2006

PRIOR System: Results for OAEI 2006 PRIOR System: Results for OAEI 2006 Ming Mao, Yefei Peng University of Pittsburgh, Pittsburgh, PA, USA {mingmao,ypeng}@mail.sis.pitt.edu Abstract. This paper summarizes the results of PRIOR system, which

More information

Making Semantic Web based-hypermedia Applications

Making Semantic Web based-hypermedia Applications Making Semantic Web based-hypermedia Applications Laura Montells, Susana Montero, Paloma Díaz, Ignacio Aedo Laboratorio DEI. Dpto. de Informática Universidad Carlos III de Madrid Avda. de la Universidad

More information

Ingegneria del Software Corso di Laurea in Informatica per il Management. Introduction to UML

Ingegneria del Software Corso di Laurea in Informatica per il Management. Introduction to UML Ingegneria del Software Corso di Laurea in Informatica per il Management Introduction to UML Davide Rossi Dipartimento di Informatica Università di Bologna Modeling A model is an (abstract) representation

More information

WEB DATA EXTRACTION METHOD BASED ON FEATURED TERNARY TREE

WEB DATA EXTRACTION METHOD BASED ON FEATURED TERNARY TREE WEB DATA EXTRACTION METHOD BASED ON FEATURED TERNARY TREE *Vidya.V.L, **Aarathy Gandhi *PG Scholar, Department of Computer Science, Mohandas College of Engineering and Technology, Anad **Assistant Professor,

More information

M 2 ORM 2 : A model for the transparent management of relationally persistent objects

M 2 ORM 2 : A model for the transparent management of relationally persistent objects M 2 ORM 2 : A model for the transparent management of relationally persistent objects Luca Cabibbo and Roberto Porcelli Dipartimento di Informatica e Automazione Università degli studi Roma Tre Via della

More information

Enterprise Big Data Platforms

Enterprise Big Data Platforms Enterprise Big Data Platforms + Big Data research @ Roma Tre Antonio Maccioni maccioni@dia.uniroma3.it 19 April 2017 Outline Polystores QUEPA project Data Lakes KAYAK project No one size fits all Polyglot

More information

XML Data in (Object-) Relational Databases

XML Data in (Object-) Relational Databases XML Data in (Object-) Relational Databases RNDr. Irena Mlýnková irena.mlynkova@mff.cuni.cz Charles University Faculty of Mathematics and Physics Department of Software Engineering Prague, Czech Republic

More information

College of Sciences. College of Sciences. Master s of Science in Computer Sciences Master s of Science in Biotechnology

College of Sciences. College of Sciences. Master s of Science in Computer Sciences Master s of Science in Biotechnology Master s of Science in Computer Sciences Master s of Science in Biotechnology Department of Computer Sciences 1. Introduction\Program Mission The Program mission is to prepare students to be fully abreast

More information

Horizontal Aggregations in SQL to Prepare Data Sets Using PIVOT Operator

Horizontal Aggregations in SQL to Prepare Data Sets Using PIVOT Operator Horizontal Aggregations in SQL to Prepare Data Sets Using PIVOT Operator R.Saravanan 1, J.Sivapriya 2, M.Shahidha 3 1 Assisstant Professor, Department of IT,SMVEC, Puducherry, India 2,3 UG student, Department

More information

ThinProway A Java client to a SAS application. A successful story. Exactly what you need?

ThinProway A Java client to a SAS application. A successful story. Exactly what you need? ThinProway A Java client to a SAS application. A successful story. Exactly what you need? Author: Riccardo Proni TXT Ingegneria Informatica Abstract ThinProway is a software solution dedicated to the manufacturing

More information

CONCENTRATIONS: HIGH-PERFORMANCE COMPUTING & BIOINFORMATICS CYBER-SECURITY & NETWORKING

CONCENTRATIONS: HIGH-PERFORMANCE COMPUTING & BIOINFORMATICS CYBER-SECURITY & NETWORKING MAJOR: DEGREE: COMPUTER SCIENCE MASTER OF SCIENCE (M.S.) CONCENTRATIONS: HIGH-PERFORMANCE COMPUTING & BIOINFORMATICS CYBER-SECURITY & NETWORKING The Department of Computer Science offers a Master of Science

More information

Schema Repository Database Evolution And Metamodeling

Schema Repository Database Evolution And Metamodeling Schema Repository Database Evolution And Metamodeling Evolving XML schemas and documents using UML class diagrams. E Domínguez, J 31, 2005. MeDEA: A database evolution architecture with traceability 21,

More information

Advanced Geographic Information Systems Vol. II - GIS Interoperability, from Problems to Solutions - R. Laurini, K. Yétongnon, D.

Advanced Geographic Information Systems Vol. II - GIS Interoperability, from Problems to Solutions - R. Laurini, K. Yétongnon, D. GIS INTEROPERABILITY, FROM PROBLEMS TO SOLUTIONS R. Laurini LIRIS Laboratory, INSA de Lyon, Villeurbanne, France K. Yétongnon LE2I-Equipe Ingénierie Informatique, Université de Bourgogne, Dijon, France

More information

Supporting Fuzzy Keyword Search in Databases

Supporting Fuzzy Keyword Search in Databases I J C T A, 9(24), 2016, pp. 385-391 International Science Press Supporting Fuzzy Keyword Search in Databases Jayavarthini C.* and Priya S. ABSTRACT An efficient keyword search system computes answers as

More information

Ontology for Exploring Knowledge in C++ Language

Ontology for Exploring Knowledge in C++ Language Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,

More information

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

An Overview of various methodologies used in Data set Preparation for Data mining Analysis An Overview of various methodologies used in Data set Preparation for Data mining Analysis Arun P Kuttappan 1, P Saranya 2 1 M. E Student, Dept. of Computer Science and Engineering, Gnanamani College of

More information

Databases and the World Wide Web

Databases and the World Wide Web Databases and the World Wide Web Paolo Atzeni D.I.A. - Università di Roma Tre http://www.dia.uniroma3.it/~atzeni thanks to the Araneus group: G. Mecca, P. Merialdo, A. Masci, V. Crescenzi, G. Sindoni,

More information

Annotating Multiple Web Databases Using Svm

Annotating Multiple Web Databases Using Svm Annotating Multiple Web Databases Using Svm M.Yazhmozhi 1, M. Lavanya 2, Dr. N. Rajkumar 3 PG Scholar, Department of Software Engineering, Sri Ramakrishna Engineering College, Coimbatore, India 1, 3 Head

More information

KEY BASED APPROACH FOR INTEGRATION OF HETEROGENEOUS DATA SOURCES

KEY BASED APPROACH FOR INTEGRATION OF HETEROGENEOUS DATA SOURCES KEY BASED APPROACH FOR INTEGRATION OF HETEROGENEOUS DATA SOURCES 1 KAMSURIAH AHMAD, 1 TENGKU SITI FATIMAH TENGKU WOOK, 2 REDUAN SAMAD 1 Strategic Management Research Group Faculty of Information Science

More information

A Hybrid Unsupervised Web Data Extraction using Trinity and NLP

A Hybrid Unsupervised Web Data Extraction using Trinity and NLP IJIRST International Journal for Innovative Research in Science & Technology Volume 2 Issue 02 July 2015 ISSN (online): 2349-6010 A Hybrid Unsupervised Web Data Extraction using Trinity and NLP Anju R

More information

Computer-based Tracking Protocols: Improving Communication between Databases

Computer-based Tracking Protocols: Improving Communication between Databases Computer-based Tracking Protocols: Improving Communication between Databases Amol Deshpande Database Group Department of Computer Science University of Maryland Overview Food tracking and traceability

More information

Enhanced Retrieval of Web Pages using Improved Page Rank Algorithm

Enhanced Retrieval of Web Pages using Improved Page Rank Algorithm Enhanced Retrieval of Web Pages using Improved Page Rank Algorithm Rekha Jain 1, Sulochana Nathawat 2, Dr. G.N. Purohit 3 1 Department of Computer Science, Banasthali University, Jaipur, Rajasthan ABSTRACT

More information

Xin ZHANG. Thesis Defense Talk passed (December 1998). Thesis Proposal presented (April 1998).

Xin ZHANG. Thesis Defense Talk passed (December 1998). Thesis Proposal presented (April 1998). Xin ZHANG Computer Science Department Worcester Polytechnic Institute 100 Institute Road, Worcester, MA 01609 Phone: (508) 831-5857 Email: xinz@wpi.edu WWW: http://www.wpi.edu/ xinz April 22, 2002 Education

More information

Category Theory in Ontology Research: Concrete Gain from an Abstract Approach

Category Theory in Ontology Research: Concrete Gain from an Abstract Approach Category Theory in Ontology Research: Concrete Gain from an Abstract Approach Markus Krötzsch Pascal Hitzler Marc Ehrig York Sure Institute AIFB, University of Karlsruhe, Germany; {mak,hitzler,ehrig,sure}@aifb.uni-karlsruhe.de

More information

A Data warehouse within a Federated database architecture

A Data warehouse within a Federated database architecture Association for Information Systems AIS Electronic Library (AISeL) AMCIS 1997 Proceedings Americas Conference on Information Systems (AMCIS) 8-15-1997 A Data warehouse within a Federated database architecture

More information

A Novel Method for the Comparison of Graphical Data Models

A Novel Method for the Comparison of Graphical Data Models 3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DEVELOPMENT (ISD01 CROATIA) A Novel Method for the Comparison of Graphical Data Models Katarina Tomičić-Pupek University of Zagreb, Faculty of Organization

More information

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment Paul Watry Univ. of Liverpool, NaCTeM pwatry@liverpool.ac.uk Ray Larson Univ. of California, Berkeley

More information

Remotely Sensed Image Processing Service Automatic Composition

Remotely Sensed Image Processing Service Automatic Composition Remotely Sensed Image Processing Service Automatic Composition Xiaoxia Yang Supervised by Qing Zhu State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University

More information

Enriching Query Routing Processes in PDMS with Semantic Information and Information Quality

Enriching Query Routing Processes in PDMS with Semantic Information and Information Quality Enriching Query Routing Processes in PDMS with Semantic Information and Information Quality Crishane Freire 2, Damires Souza 1, Bernadette F. Lóscio 2 and Ana Carolina Salgado 2 1 Federal Institute of

More information

Computation Independent Model (CIM): Platform Independent Model (PIM): Platform Specific Model (PSM): Implementation Specific Model (ISM):

Computation Independent Model (CIM): Platform Independent Model (PIM): Platform Specific Model (PSM): Implementation Specific Model (ISM): viii Preface The software industry has evolved to tackle new approaches aligned with the Internet, object-orientation, distributed components and new platforms. However, the majority of the large information

More information

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online):

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online): IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online): 2321-0613 Tanzeela Khanam 1 Pravin S.Metkewar 2 1 Student 2 Associate Professor 1,2 SICSR, affiliated

More information

NeOn Methodology for Building Ontology Networks: a Scenario-based Methodology

NeOn Methodology for Building Ontology Networks: a Scenario-based Methodology NeOn Methodology for Building Ontology Networks: a Scenario-based Methodology Asunción Gómez-Pérez and Mari Carmen Suárez-Figueroa Ontology Engineering Group. Departamento de Inteligencia Artificial. Facultad

More information

What is database? Types and Examples

What is database? Types and Examples What is database? Types and Examples Visit our site for more information: www.examplanning.com Facebook Page: https://www.facebook.com/examplanning10/ Twitter: https://twitter.com/examplanning10 TABLE

More information

Ontology Creation and Development Model

Ontology Creation and Development Model Ontology Creation and Development Model Pallavi Grover, Sonal Chawla Research Scholar, Department of Computer Science & Applications, Panjab University, Chandigarh, India Associate. Professor, Department

More information

Advances in Database Technology EDBT'98

Advances in Database Technology EDBT'98 Hans-Jorg Schek Felix Saltor Isidro Ramos Gustavo Alonso (Eds.) Advances in Database Technology EDBT'98 6th International Conference on Extending Database Technology Valencia, Spain, March 23-27, 1998

More information

EXTRACTION AND ALIGNMENT OF DATA FROM WEB PAGES

EXTRACTION AND ALIGNMENT OF DATA FROM WEB PAGES EXTRACTION AND ALIGNMENT OF DATA FROM WEB PAGES Praveen Kumar Malapati 1, M. Harathi 2, Shaik Garib Nawaz 2 1 M.Tech, Computer Science Engineering, 2 M.Tech, Associate Professor, Computer Science Engineering,

More information