Schema Evolution in Scalable Cloud Data Management (SCDM)

Similar documents
NoSQL Schema Evolution and Big Data Migration at Scale

ControVol: A Framework for Controlled Schema Evolution in NoSQL Application Development

NOSQL DATABASE SYSTEMS: APPLICATION DEVELOPMENT. Big Data Technologies: NoSQL DBMS (Application Development) - SoSe

Safely Managing Data Variety in Big Data Software Development

NOSQL DATABASE SYSTEMS: DECISION GUIDANCE AND TRENDS. Big Data Technologies: NoSQL DBMS (Decision Guidance) - SoSe

SCHEMA INFERENCE FOR MASSIVE JSON DATASETS!

Introduction to NoSQL Databases

Ashok Kumar P S, Md Ateeq Ur Rahman Department of CSE, JNTU/ SCET, Hyderabad, Andra Pradesh, India

Advances in Data Management - NoSQL, NewSQL and Big Data A.Poulovassilis

Realtime visitor analysis with Couchbase and Elasticsearch

Announcements. JSon Data Structures. JSon Syntax. JSon Semantics: a Tree! JSon Primitive Datatypes. Introduction to Database Systems CSE 414

A Review to the Approach for Transformation of Data from MySQL to NoSQL

5/2/16. Announcements. NoSQL Motivation. The New Hipster: NoSQL. Serverless. What is the Problem? Database Systems CSE 414

Database Systems CSE 414

Introduction to NoSQL by William McKnight

Non-Relational Databases. Pelle Jakovits

NoSQL database and its business applications

CSE 344 APRIL 16 TH SEMI-STRUCTURED DATA

10/18/2017. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414

NOSQL DATABASE PERFORMANCE BENCHMARKING - A CASE STUDY

PROFESSIONAL. NoSQL. Shashank Tiwari WILEY. John Wiley & Sons, Inc.

Performance Evaluation of a MongoDB and Hadoop Platform for Scientific Data Analysis

Appropches used in efficient migrption from Relptionpl Dptpbpse to NoSQL Dptpbpse

Column Stores and HBase. Rui LIU, Maksim Hrytsenia

A data-driven framework for archiving and exploring social media data

Chapter 8 - Sql-99 Schema Definition Constraints Queries And Views

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Data Warehousing and Data Mining. Announcements (December 1) Data integration. CPS 116 Introduction to Database Systems

Oracle Sql Describe Schemas Query To Find Database

Chapter 11 Outline. A Simple PHP Example Overview of Basic Features of PHP Overview of PHP Database Programming. Slide 11-2

Real-time Fraud Detection with Innovative Big Graph Feature. Gaurav Deshpande, VP Marketing, TigerGraph; Mingxi Wu, VP Engineering, TigerGraph

IoT Data Storage: Relational & Non-Relational Database Management Systems Performance Comparison

Final Exam Logistics. CS 133: Databases. Goals for Today. Some References Used. Final exam take-home. Same resources as midterm

Cassandra- A Distributed Database

Migrating from Oracle to Espresso

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu

Advances in Data Management - NoSQL, NewSQL and Big Data A.Poulovassilis

Oracle Big Data Science IOUG Collaborate 16

relational Key-value Graph Object Document

Topics. History. Architecture. MongoDB, Mongoose - RDBMS - SQL. - NoSQL

Challenges for Data Driven Systems

Database Evolution. DB NoSQL Linked Open Data. L. Vigliano

5/1/17. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414

CSE 344 JULY 9 TH NOSQL

CSE 544 Principles of Database Management Systems. Fall 2016 Lecture 4 Data models A Never-Ending Story

NoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre

Introduction to Computer Science. William Hsu Department of Computer Science and Engineering National Taiwan Ocean University

Intro Cassandra. Adelaide Big Data Meetup.

PSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets

Evolution of Database Systems

Distributed Non-Relational Databases. Pelle Jakovits

Getting to know. by Michelle Darling August 2013

Cloud Spanner. Rohit Gupta, Solutions

Data Lakes: A Solution or a newchallenge for Big Data Integration. Christoph Quix, DATA 2016

MarkLogic 8 Overview of Key Features COPYRIGHT 2014 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

How do we build TiDB. a Distributed, Consistent, Scalable, SQL Database

NOSQL DATABASE SYSTEMS: DATA MODELING. Big Data Technologies: NoSQL DBMS (Data Modeling) - SoSe

Latest Trends in Database Technology NoSQL and Beyond

Introduction to Big Data. NoSQL Databases. Instituto Politécnico de Tomar. Ricardo Campos

Software Quality Understanding by Analysis of Abundant Data (SQUAAD)

Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Performance Comparison of NOSQL Database Cassandra and SQL Server for Large Databases

/ Cloud Computing. Recitation 10 March 22nd, 2016

Database Technology Introduction. Heiko Paulheim

Goal of the presentation is to give an introduction of NoSQL databases, why they are there.

Safe Harbor Statement

JSON - Overview JSon Terminology

8/24/2017 Week 1-B Instructor: Sangmi Lee Pallickara

Chapter 24 NOSQL Databases and Big Data Storage Systems

L22: NoSQL. CS3200 Database design (sp18 s2) 4/5/2018 Several slides courtesy of Benny Kimelfeld

CSE 544 Principles of Database Management Systems. Lecture 4: Data Models a Never-Ending Story

Databases and Big Data Today. CS634 Class 22

What is database? Types and Examples

E(xtract) T(ransform) L(oad)

Seminar Recent Trends in Database Research

Scalable Storage: The drive for web-scale data management

Intuitive and Interactive Query Formulation to Improve the Usability of Query Systems for Heterogeneous Graphs

Databases : Lectures 11 and 12: Beyond ACID/Relational databases Timothy G. Griffin Lent Term 2013

Database Design for NoSQL Systems

A Non-Relational Storage Analysis

The age of Big Data Big Data for Oracle Database Professionals

How we build TiDB. Max Liu PingCAP Amsterdam, Netherlands October 5, 2016

Databases : Lecture 1 2: Beyond ACID/Relational databases Timothy G. Griffin Lent Term Apologies to Martin Fowler ( NoSQL Distilled )

An UML-XML-RDB Model Mapping Solution for Facilitating Information Standardization and Sharing in Construction Industry

Keywords Data alignment, Data annotation, Web database, Search Result Record

Relational model continued. Understanding how to use the relational model. Summary of board example: with Copies as weak entity

Using and Developing with Azure. Joshua Drew

FACULTY OF ENGINEERING B.E. 4/4 (CSE) II Semester (Old) Examination, June Subject : Information Retrieval Systems (Elective III) Estelar

Architectural Styles. Software Architecture Lecture 5. Copyright Richard N. Taylor, Nenad Medvidovic, and Eric M. Dashofy. All rights reserved.

Utilization of UML diagrams in designing an events extraction system

NOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

Fault Identification from Web Log Files by Pattern Discovery

Schema Repository Database Evolution In

CompSci 516 Database Systems

Encyclopedia of Database Systems, Editors-in-chief: Özsu, M. Tamer; Liu, Ling, Springer, MAINTENANCE OF RECURSIVE VIEWS. Suzanne W.

Database Fundamentals Chapter 1

What is a Database? Peter Wood

CMPT 354: Database System I. Lecture 1. Course Introduction

COMPARISON AND EVALUATION ON METRICS

Decentralized Control of Large-Scale Distributed System

Transcription:

Schema Evolution in Scalable Cloud Data Management (SCDM) Meike Klettke Universität Rostock Uta Störl Hochschule Darmstadt Stefanie Scherzinger OTH Regensburg

Motivation NoSQL databases are often used in Big data applications Volume (Scalability) Variety (Schema Flexibility) Main Reasons for using NoSQL: Scalability Schema-flexibility 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 2

Why Schema Flexibility? 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 3

Advantages and Disadvantages of Schema Flexibility Advantages: Storage of heterogeneous data Storage of different data versions (with structural changes over time) application Disadvantage: high effort to query and process the heterogonous data 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 4

Why Schema Management? 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 5

Schema Management Compromise between strict predefined schema (like in relational databases) and schema flexibility (like in NoSQL databases) can be realized inside the NoSQL database or complexity of structure oo db XML db on-top of the database system checking of structural constraints relational db structure variability 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 6

Schema Management Subtasks in Agile Development Agile application developement (without schema) Schema Extraction Schema Evolution, Data Migration 1. Usage of NoSQL database for storing data of the agile application (data science) 2. Extraction of an explicit schema from available datasets 3. in the following: in case of structural changes: schema evolution and (scalable) data migration 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 7

Subtasks of Schema Management application version v continuous deployment application version v+1 implicitly declared in data model derived explicitly declared by developer schema S v schema evolution operations ** schema S v+1 obtained by schema extraction * derived migration operations * Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Extraction and Structural Outlier Detection for NoSQL Data Stores, BTW 2015 ** Introduced in: Stefanie Scherzinger, Meike Klettke, Uta Störl: Managing Schema Evolution in NoSQL Data Store, DBPL@VLDB, 2013 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 8

How to realize Schema Extraction? 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 9

Heterogeneous JSON documents (blogposts) {"_id": 7, "title": "NoSQL Data Modeling Techniques", "content": "NoSQL databases...", "author": "Alice", "date": "2014 01 22", "likes": 34 } {"_id": 97, "title": "NoSQL Schema Design", "content": "Schema flexibility...", "author": "Bob", "date": "2014 02 24", "comments": [ { "commentcontent": "Not sure...", "commentdate": "2014 02 24" }, { "commentcontent": "Let me...", "commentdate": "2014 02 26" } ] } Schema Extraction { "type": "object", "properties": { "_id": {"type": "integer"}, "title": {"type": "string"}, "content": {"type": "string"},... "comments": {"type": "array", "items": {"type": "object", "properties": { "commentcontent": {"type": "string"}, "commentdate": {"type": "string"}} "required": ["commentcontent", "commentdate"] } } } "required": ["title", "content", "author", "date"] } 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 10

Extraction of the Structure Identification Graph 3 { "type": "object", "properties": {.. } 2 2 {"_id": 7, "title": "NoSQL Data Modeling Techniques", "content": "NoSQL databases...",... 1 1 {"_id": 97, "title": "NoSQL Schema Design", "content": "Schema flexibility...", "author": "Bob", "date": "2014 02 24",... 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 11

Application of the Structure Identification Graph We can derive: JSON Schema { "type": "object", "properties": {.. } Statistics about the Data Outliers 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 13

Scalability 120 100 schema extraction is often a preprocessing step not time critical bottleneck are the node and edge lists (with references to the id) Time in ms 80 60 40 20 0 0 5000 10000 15000 Input data size (total number of properties) Tested with experimental data from IPP Greifswald, Wendelstein 7-X (world s largest fusion device), mongodb, 180 Collections 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 14

Reduced Structure Identification Graph (RG) instead of the SG: blogpostobj [ 7:0, 97:0 ] length: 2 [ 7:0 ] length: 1 likes [ 7:6 ] length: 1 node and edge lists do not contain references to the original documents only counters (frequencies) RG for the example: 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 15

Comparison of both Approaches SG Structure Identifikation Graph RG Reduced Structure Identifikation Graph JSON schema extraction + + Document statistics + + Outlier detection + - can be found in two steps 1. Finding document statistics 2. Outlier detection, with queries (e.g. in MongoDB) db.collection.find({pi: {$exists: true}} db.collection.find({pj: {$exists: false}}) 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 16

Some Experiments IPP Greifswald (MongoDB): based on the SG (references) based on the RG (counters) Web Performance Data (MongoDB): 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 17

Two different interpretations of structural differences Step 1 data selection and structure extraction...... schema represents all variants of the datasets deriving schema versions over time and candidates for schema evolution operations 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 18

Subtasks of Schema Management application version v continuous deployment application version v+1 implicitly declared in data model derived explicitly declared by developer schema S v schema evolution operations ** schema S v+1 obtained by schema extraction * derived migration operations * Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Extraction and Structural Outlier Detection for NoSQL Data Stores, BTW 2015 ** Introduced in: Stefanie Scherzinger, Meike Klettke, Uta Störl: Managing Schema Evolution in NoSQL Data Store, DBPL@VLDB, 2013 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 19

How to realize Evolution? 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 20

State-of-the-Art and Motivation State-of-the-Art in NoSQL databases: hand-written migration scripts application version v continuous deployment application version v+1 migration scripts In relational databases: schema evolution is a well studied field (alter table..) In NoSQL: consider data volume and data migration costs 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 21

NoSQL Schema Evolution Language Schema evolution language for describing structural changes of the data Single type operations: add property rename property delete property Multi-type operations: copy property (denormalization) move property (refactoring) schema S v schema S v+1 Schema Evolution Data Migration Introduced in: S. Scherzinger, M. Klettke, U. Störl: Managing Schema Evolution in NoSQL Data Store, DBPL@VLDB, 2013, implemented in 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 22

How to scale Data Migration? 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 23

Basic Strategies of Data Migration Eager Migration after introduction of a new schema version, all entities are migrated Lazy Migration evolution operations are stored, data migration is done on request schema S v schema S v+1 Schema Evolution schema S v schema S v+1 Schema Evolution Data Migration Data Migration 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 25

Basic Strategies of Data Migration Eager Migration Lazy Migration 100 million entities, 5 versions 100 million entities, 5 versions, access to 25 million entities per version 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 26

Advantages and Disadvantages Eager Migration Advantages: + all entities are in the current version + low latency (if entities are accessed) Disadvantages: even entities which are not in use are migrated high number (and costs) of migration operations Lazy Migration Advantages: + only entities which are in use are migrated + no unnecessary data migration operations + composition of operations is possible (see next slides) Disadvantages: entities in the NoSQL database in different versions increased latency 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 27

Optimizing Lazy Migration In case, entities are migrated from older versions: evolve 2-5 evolve 3-5 evolve 4-5 evolve 5 E t,1 E t,2 E t,3 E t,4 E t,5 e t,1 e t,1 e t,1 e t,1 e t,1 e t,2 e t,1 e t,1 e t,3 e t,1 e t,1 e t,4 e t,1 e t,1 e t,5 migrate 2-5 migrate 3-5 migrate 4-5 migrate 5 Composition of operations is possible in some cases: + composition of evolution operations + smaller number of data migration operations (lower migration costs) Meike Klettke, Uta Störl, Manuel Shenavai, Stefanie Scherzinger: NoSQL Schema Evolution and Big Data Migration at Scale, 4th Scalable Cloud Data Management Workshop (SCDM) @ IEEE Big Data Conference, Washington, D.C., 2016 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 28

Example for Lazy composed migration can be composed to: 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 29

Rules for Lazy composed migration Two schema evolution operations can be composed if they match op1 and op2 result of the composition is given in the table cells 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 30

Effects of Composition Lazy stepwise Migration Lazy composed Migration = 100 million entities, 5 versions, access to 25 million entities per version 100 million entities, 5 versions, access to 25 million entities per version 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 32

Hybrid Migration: Predictive Migration Basic idea: forecast function delivers predictions Predictive migration combines in each version eager migration (for the predictions) and lazy migration (for all other entities) Success depends on the quality of the forecast function forecast function has to consider: access statistics in past Relationships between entities Meike Klettke, Uta Störl, Manuel Shenavai, Stefanie Scherzinger: NoSQL Schema Evolution and Big Data Migration at Scale, 4th Scalable Cloud Data Management Workshop (SCDM) @ IEEE Big Data Conference, Washington, 2016 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 33

A Combination of Eager and Lazy Migration: Predictive Migration Forecast function, which entities are accessed in near future (bases on heuristics) predictive migration of these entities in current version in bulk Advantage: decreased average latency Disadvantage: additional migration operations in case of wrong predictions 100 million entities, 5 versions, access to 25 million entities per version, prediction of 25 million entities, 50% correct 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 34

Another Hybrid Migration: Incremental Migration Lazy data migration is applied (stepwise or composed) in some version, an eager migration is applied, in case of 1. disruptive schema changes 2. "technical debts" of the NoSQL database needs a function for evaluating the 1. amount of changes 2. state of the NoSQL database 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 35

Comparison of different approaches 1. Number of migration operations Eager Migration 2. Effort for query rewriting 3. Average latency Predictive Migration 4. Application downtime Incremental Migration Lazy Migration Versioning high medium medium low none none medium medium medium high none low low medium high high medium medium none none 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 36

Conclusion Schema management components support application development with NoSQL databases Two different methods for schema extraction especially realizing structural changes schema evolution operations data migration strategies eager migration (in bulk) lazy migration (on request) combined (hybrid migrations) 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 39

Future Work Provide Schema Evolution and Data Migration for a wide range of applications from eager migrations up to lazy migrations even versioning and rewriting of queries an results Development of an advisor for the data migration strategy its parametrization, and choice of the NoSQL database system Schema Evolution and Data Migration predictive migration - development and proving of a forecast function incremental migration - cost function for decision about lazy or eager data migration in each version Schema Extraction finding and consideration integrity constraints (keys, inclusion dependencies, foreign keys,..) for the next 3-4 years for the next year 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 40

More details? 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 41

Demo Sessions Please visit our software demonstration at BTW 2017 Wednesday, 8. March: 6:00 pm 7:30 pm Thursday, 9. March: 10:45 am 12:15 am 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 42

Literature (NoSQL databases) J. Baker, C. Bondç, J. C. Corbett, J. J. Furman, et al. Megastore: Providing Scalable, Highly Available Storage for Interactive Services. In Proc. CIDR 11, 2011. Apache Cassandra, 2013. http://cassandra.apache.org/. V. Benzaken, G. Castagna, K. Nguyen, and J. Simeon. Static and dynamic semantics of NoSQL languages. In Proc. POPL, pages 101 114, 2013. M. Brown. Developing with Couchbase Server. O Reilly, 2013. R. Cattell. Scalable SQL and NoSQL data stores. SIGMOD Record, 39(4):12 27, 2010. F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, et al. Bigtable: A Distributed Storage System for Structured Data. In Proc. OSDI, pages 205 218, 2006. K. Chodorow. MongoDB: The Definitive Guide. O Reilly, 2013. Couch Potato, 2010. https://github.com/langalex/couch_potato/issues/14. Google Inc. Google Datastore Admin, 2013. https://developers.google.com/appengine/docs/adminconsole/datastoreconsole. Google Inc., Google Cloud Datastore: Pricing and Quota, Oct. 2016, https://cloud.google.com/datastore/docs/pricing S. Tiwari. Professional NoSQL. John Wiley & Sons, 2011. 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 43

Literature (Schema Management) JSON Schema, 2013. http://json-schema.org/. MongoDB, MongoDB Manual Version 3.2: Document Validation, 2016, https://docs.mongodb.com/manual/core/document-validation/ U. Störl, T. Hauff, M. Klettke, and S. Scherzinger, Schemaless NoSQL Data Stores Object-NoSQL Mappers to the Rescue? in Proc. BTW 15, 2015 Morphia. A type-safe java library for MongoDB, 2013. http://code.google.com/p/morphia/. Literature (Schema Extraction) C.-H. Moh, E.-P. Lim, and W. K. Ng. DTD-Miner: A Tool for Mining DTD from XML Documents. In WECWIS, pages 144{151, 2000. Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Extraction and Structural Outlier Detection for JSON-based NoSQL Data Stores, Wissenschaftliches Programm, BTW 2015 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 44

Literature (Schema Evolution) D. L. Parnas. Software Aging. In Proc. ICSE, pages 279 287, 1994. G. Dong and J. Su. First-Order Incremental Evaluation of Datalog Queries. In DBPL 93, 1993. Objectify AppEngine. Migrating Schemas, 2012. https://code.google.com/p/objectifyappengine/wiki/schemamigration. J. McWilliams and M. Ivey. Google Developers: Updating your model s schema, 2012. https://developers.google.com/appengine/articles/update_schema. Meike Klettke: Modellierung, Bewertung und Evolution von XML-Dokumentkollektionen. Habilitationsschrift, Universität Rostock, Fakultät für Informatik und Elektrotechnik, 2007 S. Scherzinger, M. Klettke, and U. Störl. Managing Schema Evolution in NoSQL Data Stores. In Proc. DBPL 13, 2013. U. Störl, T. Hauff, M. Klettke, and S. Scherzinger, Schemaless NoSQL Data Stores Object-NoSQL Mappers to the Rescue? in Proc. BTW 15, 2015 T. Cerqueus, E. C. de Almeida, and S. Scherzinger, Safely Managing Data Variety in Big Data Software Development, in Proc. BIGDSE 15, 2015 S. Scherzinger, U. Störl, and M. Klettke, A Datalog-based Protocol for Lazy Data Migration in Agile NoSQL Application Development, in Proc. DBPL 15, 2015 Stefanie Scherzinger, Meike Klettke, Uta Störl: Cleager: Eager Schema Evolution in NoSQL Document Stores, Demoprogramm, BTW 2015 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 45

Literature (Schema Evolution, cont.) Meike Klettke, Stefanie Scherzinger, Uta Störl, Stephanie Sombach, Katharina Wiech: Challenges with Continuous Deployment of NoSQL-backed Database Applications, LWDA 2016 Stefanie Scherzinger, Stephanie Sombach, Katharina Wiech, Meike Klettke and Uta Störl: Datalution: A Tool for Continuous Schema Evolution in NoSQL-backed Web Applications, QUDOS 2016 Literature (Optimization) C. A. R. Hoare, An Axiomatic Basis for Computer Programming, Commun. ACM, vol. 12, no. 10, 1969 S. Abiteboul and V. Vianu, Equivalence and Optimization of Relational Transactions, J. ACM, vol. 35, no. 1, Jan. 1988 M. Liu. Extending Datalog with Declarative Updates. In M. Ibrahim, J. Köng, and N. Revell, editors, Database and Expert Systems Applications, volume 1873 of LNCS, pages 752 763. Springer, 2000 Meike Klettke, Uta Störl, Manuel Shenavai, Stefanie Scherzinger: NoSQL Schema Evolution and Big Data Migration at Scale, 4th Scalable Cloud Data Management Workshop (SCDM) @ IEEE Big Data Conference, Washington, D.C., USA, December 2016 06.03.2017 Meike Klettke, Uta Störl, Stefanie Scherzinger: Schema Evolution in SCDM 46