Clustering in OpenDaylight

1 Clustering in OpenDaylight
Colin Dixon, Technical Steering Committee Chair, OpenDaylight; Distinguished Engineer, Brocade
Borrowed ideas and content from Jan Medved, Moiz Raja, and Tom Pantelis

2 Multi-Protocol SDN Controller Architecture
[Figure: layered controller architecture, top to bottom. Applications: OSS/BSS and external apps. Northbound interfaces: NETCONF server, REST/RESTCONF. Controller: applications on top of the Service Adaptation Layer (SAL) / Core. Protocol plugins/adapters: NETCONF client, OpenFlow, OVSDB, ... Network devices.]

3 Software Architecture
[Figure: software architecture. OSS/BSS and external applications reach the controller through the NETCONF server and REST/RESTCONF. Applications, protocol plugins, and the NETCONF client appear as apps/services that interact through RPCs, notifications, and data change notifications over the Model-Driven SAL (MD-SAL). Further labels: Data Store, Namespace, Kernel. Network devices sit below the protocol plugins.]

4 Data Store Sharding
[Figure: a data tree whose subtrees under the root map to shards Shard1.1, Shard1.2, Shard1.3, Shard2.1, ..., ShardN.1, which are then placed on nodes Node1, Node2, ..., NodeM.]
ShardX.Y denotes shard Y within service X.
Select data subtrees. Currently, you can only pick a subtree directly under the root; subtrees at arbitrary levels are being worked on.
Map subtrees onto shards.
Map shards onto nodes.
Shard layout algorithm: place shards on M nodes.
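The two mapping steps on this slide (subtree to shard, shard to nodes) can be sketched as follows. This is a hypothetical round-robin policy for illustration only, not OpenDaylight's actual shard layout algorithm; the function names, the `replicas` parameter, and the example subtree name are all assumptions.

```python
def place_shards(shards, nodes, replicas=3):
    """Assign each shard to `replicas` distinct nodes, round-robin.

    Hypothetical layout policy; the real algorithm is not described here.
    """
    if replicas > len(nodes):
        raise ValueError("cannot place more replicas than there are nodes")
    layout = {}
    for i, shard in enumerate(shards):
        # Start each shard at a different node so replicas spread evenly.
        layout[shard] = [nodes[(i + r) % len(nodes)] for r in range(replicas)]
    return layout


def shard_for_path(path, subtree_to_shard):
    """Resolve a data-tree path to a shard using its top-level subtree.

    Mirrors the stated limitation that shards are picked directly under the root.
    """
    top_level = path.strip("/").split("/")[0]
    return subtree_to_shard[top_level]
```

For example, `place_shards(["Shard1.1", "Shard1.2"], ["Node1", "Node2", "Node3", "Node4"])` puts the first shard on Node1 to Node3 and the second on Node2 to Node4.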

5 Shard Replication
Replication uses Raft [1], which provides strong consistency.
Transactional data access: snapshot reads, snapshot writes, and read/write transactions.
Tolerates f failures with 2f+1 nodes: 3 nodes => 1 failure, 5 => 2, etc.
Leaders handle all writes and send them to followers before committing.
Leaders are distributed across nodes to spread load.
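The "f failures with 2f+1 nodes" rule is simple arithmetic. A minimal sketch, with function names chosen here for illustration:

```python
def max_failures(cluster_size):
    """Largest f such that 2f + 1 <= cluster_size."""
    return (cluster_size - 1) // 2


def quorum(cluster_size):
    """Smallest number of replicas that forms a strict majority."""
    return cluster_size // 2 + 1
```

Note that an even-sized cluster tolerates no more failures than the next smaller odd size: `max_failures(4)` equals `max_failures(3)`.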

6 Strong Consistency
Serializability: everyone always reads the most recent write. Loosely, everyone is at the same point on the same timeline.
Causal consistency: loosely, you won't see the result of any event without seeing everything that could have caused that event. [Typically in the confines of reads/writes to a single datastore.]
Eventual consistency: loosely, everyone will eventually see the same events in some order, but not necessarily the same order. [Eventual in the same way that your kid will eventually clean their room.]

7 Why strong consistency matters
A flapping port generates events: port up, port down, port up, port down, port up, port down, ...
Is the port up or down? If the events are not ordered, you have no idea.
Sure, but these are events from a single device, so you can keep them ordered easily.
More complex examples: switches (or ports on those switches) attached to different nodes go up and down.
If you don't have ordering, different nodes will come to different conclusions about reachability, paths, etc.
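The flapping-port problem can be made concrete with a small sketch. The sequence numbers below are an assumption added for illustration; the point is only that two nodes applying the same events in different arrival orders disagree, while a single agreed order makes them converge.

```python
def apply_in_arrival_order(events):
    """Return the port state after applying events as they arrive."""
    state = None
    for _seq, new_state in events:
        state = new_state
    return state


def apply_in_sequence_order(events):
    """Sort by sequence number first, so every node applies the same order."""
    return apply_in_arrival_order(sorted(events))


events = [(1, "up"), (2, "down"), (3, "up"), (4, "down")]
node_a = list(events)
# Node B sees the same events, but the last two arrive swapped.
node_b = [events[0], events[1], events[3], events[2]]

unordered_views = (apply_in_arrival_order(node_a), apply_in_arrival_order(node_b))
ordered_views = (apply_in_sequence_order(node_a), apply_in_sequence_order(node_b))
```

Here `unordered_views` is `("down", "up")`, so the nodes disagree, while `ordered_views` is `("down", "down")`.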

8 Why everyone doesn't use it
Strong consistency can't work in the face of partitions. If your network splits in two, either one side stops or you lose strong consistency.
Strong consistency requires coordination. Effectively, you need some single entity to order everything.
This has performance costs, i.e., a single strongly consistent data store is limited by the performance of a single node.
The question is: do we start with strong consistency and relax it, or start weak and strengthen it?
OpenDaylight has started strong.
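Under a majority-quorum scheme such as the one on the Shard Replication slide, "one side stops" can be sketched as below. The function name is hypothetical, and the model ignores everything except how many replicas land on each side of the split.

```python
def sides_that_can_commit(partition_sizes):
    """For each side of a network split, report whether it still holds a
    strict majority of the full cluster and can keep committing writes."""
    total = sum(partition_sizes)
    majority = total // 2 + 1
    return [size >= majority for size in partition_sizes]
```

A 3-node cluster split 2/1 keeps running on the larger side only. A 4-node cluster split 2/2 stops on both sides, since neither half holds a majority.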

9 Service/Application Model
Logically, each service or application (code) has a primary subtree (YANG model) and a shard it is associated with (data).
One instance of the code is co-located with each replica of the data.
All instances are stateless; all state is stored in the data store.
The instance that is co-located with the shard leader handles writes.
[Figure: Node1, Node2, and Node3 each run Service1, Service2, and Service3 on top of the Data Store API. Legend: SX.Y shard leader replica, SX.Y shard follower replica.]

10 Service/Application Model (cont'd)
The Entity Ownership Service allows related tasks to be co-located, e.g., the tasks related to a given OpenFlow switch should happen where it's connected.
It also handles HA/failover, with automatic election of a new entity owner.
RPCs and notifications are directed to the entity owner.
New cluster-aware data change listeners provide integration into the data store.
[Figure: Node1, Node2, and Node3 each run Service1, Service2, and Service3 on top of the Data Store API. Legend: SX.Y shard leader replica, SX.Y shard follower replica.]
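The failover behavior described on this slide can be illustrated with a toy registry. This is not the OpenDaylight EntityOwnershipService API; the class, its methods, and the "first registered candidate wins" election policy are assumptions made for the sketch.

```python
class OwnershipRegistry:
    """Toy stand-in for an entity ownership service (hypothetical, not the ODL API)."""

    def __init__(self):
        # entity name -> candidate nodes, in registration order
        self._candidates = {}

    def register_candidate(self, entity, node):
        candidates = self._candidates.setdefault(entity, [])
        if node not in candidates:
            candidates.append(node)

    def owner(self, entity):
        """Assumed policy: the earliest surviving candidate is the owner."""
        candidates = self._candidates.get(entity, [])
        return candidates[0] if candidates else None

    def node_failed(self, node):
        """Drop a failed node everywhere; the next candidate takes over."""
        for candidates in self._candidates.values():
            if node in candidates:
                candidates.remove(node)
```

With Node1 and Node2 registered as candidates for a switch, `owner` returns Node1 until `node_failed("Node1")` is called, after which it returns Node2.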

11 Handling RPCs and Notifications in a Cluster
Data change notifications: flexibly delivered to the shard leader or to any subset of nodes.
YANG-modeled notifications: delivered to the node on which they were generated; typically guided to the entity owner.
Global RPCs: delivered to the node where they are called.
Routed RPCs: delivered to the node that registered to handle them.
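The difference between global and routed RPC delivery can be sketched with a small lookup table. The class and method names are hypothetical and do not correspond to MD-SAL interfaces; the `context` key stands in for whatever identifies the routed target (for example, a particular switch).

```python
class RpcRouter:
    """Toy model of RPC delivery in a cluster (hypothetical, not the MD-SAL API)."""

    def __init__(self, local_node):
        self.local_node = local_node
        # (rpc_name, context) -> node that registered to handle it
        self._routes = {}

    def register_routed(self, rpc_name, context, node):
        self._routes[(rpc_name, context)] = node

    def target_node(self, rpc_name, context=None):
        """Global RPCs (no context) run where they are called;
        routed RPCs go to the node that registered for that context."""
        if context is None:
            return self.local_node
        return self._routes[(rpc_name, context)]
```

A call with no context resolves to the caller's own node, while a call with a registered context resolves to the registering node, wherever the caller is.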

12 Service/Shard Interactions
[Figure: Service-x, Service1, and Service-y on Node1, Node2, and Node3, each above the Data Store API, with arrows for resolve, read, write, and notification numbered to match the steps below. Legend: SX.Y shard leader replica, SX.Y shard follower replica.]
1. Service-x resolves a read or write to a subtree/shard.
2. Reads are sent to the leader.
   2.1. Work is under way to allow local reads.
3. Writes are sent to the leader to be ordered.
4. Notifications of changed data are sent to the shard leader,
   4.1. and to anyone registered for remote notification.
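The numbered steps above can be sketched as a toy shard in which the leader orders writes, copies them to every replica, and then notifies listeners. This is a simplification under stated assumptions: there are no failures, no real Raft log replication or acknowledgements, and all names are hypothetical.

```python
class Shard:
    """Toy shard: one leader, some followers, no failure handling."""

    def __init__(self, leader, followers):
        self.leader = leader
        self.replicas = {node: {} for node in [leader, *followers]}
        self.log = []        # writes in the order the leader accepted them
        self.listeners = []  # callbacks registered for change notifications

    def write(self, key, value):
        # Step 3: the leader orders the write by appending it to its log.
        self.log.append((key, value))
        # Replicate to every replica (assumed to always succeed here).
        for data in self.replicas.values():
            data[key] = value
        # Step 4: notify registered listeners of the changed data.
        for listener in self.listeners:
            listener(key, value)
        return len(self.log)  # position of this write in the order

    def read(self, key):
        # Step 2: reads are served from the leader's copy.
        return self.replicas[self.leader].get(key)
```

Because every write passes through the single `log`, all replicas and listeners observe the same order of changes.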

13 Major additions in Beryllium
Entity Ownership Service: EntityOwnershipService.
Clustered data change listeners: ClusteredDataChangeListener and ClusteredDataTreeChangeListener.
Significant application/plugin adoption: OVSDB, OpenFlow, NETCONF, Neutron, etc.

14 Work in progress
Dynamic, multi-level sharding.
Multi-level: e.g., OpenFlow should be able to say its subtrees start at the switch node.
Dynamic: e.g., an OpenFlow subtree should be moved if the connection moves.
Improve performance, scale, stability, etc., as always.
Faster, but stale, reads from a local replica vs. always reading from the leader.
Pipelined transactions for better cluster write throughput.
Whatever else you're interested in helping with.

15 Longer term things
Helper code for building common app patterns: run once in the cluster and fail over if that node goes down; run everywhere and handle things.
Different consistency models: giving people the knob is easy; dealing with the ramifications is hard.
Federated/hierarchical clustering.
