summaryrefslogtreecommitdiffstats
path: root/doc/developer/link-state.rst
blob: 2072595e36dac8e4969842a8002aa4a5be2a3e28 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
Link State API Documentation
============================

Introduction
------------

The Link State (LS) API aims to provide a set of structures and functions to
build and manage a Traffic Engineering Database for the various FRR daemons.
This API has been designed for several use cases:

- BGP Link State (BGP-LS): where BGP protocol need to collect the link state
  information from the routing daemons (IS-IS and/or OSPF) to implement RFC7572
- Path Computation Element (PCE): where path computation algorithms are based
  on Traffic Engineering Database
- ReSerVation Protocol (RSVP): where signaling need to know the Traffic
  Engineering topology of the network in order to determine the path of
  RSVP tunnels

Architecture
------------

The main requirements from the various uses cases are as follow:

- Provides a set of data model and function to ease Link State information
  manipulation (storage, serialize, parse ...)
- Ease and normalize Link State information exchange between FRR daemons
- Provides database structure for Traffic Engineering Database (TED)

To ease Link State understanding, FRR daemons have been classified into two
categories:

- **Consumer**: Daemons that consume Link State information e.g. BGPd
- **Producer**: Daemons that are able to collect Link State information and
  send them to consumer daemons e.g. OSPFd IS-ISd

Zebra daemon, and more precisely, the ZAPI message is used to convey the Link
State information between *producer* and *consumer*, but, Zebra acts as a
simple pass through and does not store any Link State information. A new ZAPI
**Opaque** message has been design for that purpose.

Each consumer and producer daemons are free to store or not Link State data and
organise the information following the Traffic Engineering Database model
provided by the API or any other data structure e.g. Hash, RB-tree ...

Link State API
--------------

This is the low level API that allows any daemons manipulate the Link State
elements that are stored in the Link State Database.

Data structures
^^^^^^^^^^^^^^^

3 types of Link State structure have been defined:

.. c:struct:: ls_node

   that groups all information related to a node

.. c:struct:: ls_attributes

   that groups all information related to a link

.. c:struct:: ls_prefix

   that groups all information related to a prefix

These 3 types of structures are those handled by BGP-LS (see RFC7752) and
suitable to describe a Traffic Engineering topology.

Each structure, in addition to the specific parameters, embed the node
identifier which advertises the Link State and a bit mask as flags to
indicates which parameters are valid i.e. for which the value is valid and
corresponds to a Link State information conveyed by the routing protocol.

.. c:struct:: ls_node_id

   defines the Node identifier as router ID IPv4 address plus the area ID for
   OSPF or the ISO System ID plus the IS-IS level for IS-IS.

Functions
^^^^^^^^^

A set of functions is provided to create, delete and compare Link State
Node, Atribute and Prefix:

.. c:function:: struct ls_node *ls_node_new(struct ls_node_id adv, struct in_addr router_id, struct in6_addr router6_id)
.. c:function:: struct ls_attributes *ls_attributes_new(struct ls_node_id adv, struct in_addr local, struct in6_addr local6, uint32_t local_id)
.. c:function:: struct ls_prefix *ls_prefix_new(struct ls_node_id adv, struct prefix p)

   Create respectively a new Link State Node, Attribute or Prefix.
   Structure is dynamically allocated. Link State Node ID (adv) is mandatory
   and:

   - at least one of IPv4 or IPv6 must be provided for the router ID
     (router_id or router6_id) for Node
   - at least one of local, local6 or local_id must be provided for Attribute
   - prefix is mandatory for Link State Prefix.

.. c:function:: void ls_node_del(struct ls_node *node)
.. c:function:: void ls_attributes_del(struct ls_attributes *attr)
.. c:function:: void ls_prefix_del(struct ls_prefix *pref)

   Remove, respectively Link State Node, Attributes or Prefix.
   Data structure is freed.

.. c:function:: void ls_attributes_srlg_del(struct ls_attributes *attr)

   Remove SRLGs attribute if defined. Data structure is freed.

.. c:function:: int ls_node_same(struct ls_node *n1, struct ls_node *n2)
.. c:function:: int ls_attributes_same(struct ls_attributes *a1, struct ls_attributes *a2)
.. c:function:: int ls_prefix_same(struct ls_prefix *p1, struct ls_prefix*p2)

   Check, respectively if two Link State Nodes, Attributes or Prefix are equal.
   Note that these routines have the same return value sense as '==' (which is
   different from a comparison).


Link State TED
--------------

This is the high level API that provides functions to create, update, delete a
Link State Database to build a Traffic Engineering Database (TED).

Data Structures
^^^^^^^^^^^^^^^

The Traffic Engineering is modeled as a Graph in order to ease Path Computation
algorithm implementation. Denoted **G(V, E)**, a graph is composed by a list of
**Vertices (V)** which represents the network Node and a list of **Edges (E)**
which represents Link. An additional list of **prefixes (P)** is also added and
also attached to the *Vertex (V)* which advertise it.

*Vertex (V)* contains the list of outgoing *Edges (E)* that connect this Vertex
with its direct neighbors and the list of incoming *Edges (E)* that connect
the direct neighbors to this Vertex. Indeed, the *Edge (E)* is unidirectional,
thus, it is necessary to add 2 Edges to model a bidirectional relation between
2 Vertices. Finally, the *Vertex (V)* contains a pointer to the corresponding
Link State Node.

*Edge (E)* contains the source and destination Vertex that this Edge
is connecting and a pointer to the corresponding Link State Attributes.

A unique Key is used to identify both Vertices and Edges within the Graph.


::

          --------------     ---------------------------    --------------
          | Connected  |---->| Connected Edge Va to Vb |--->| Connected  |
      --->|  Vertex    |     ---------------------------    |  Vertex    |---->
          |            |                                    |            |
          | - Key (Va) |                                    | - Key (Vb) |
      <---| - Vertex   |     ---------------------------    | - Vertex   |<----
          |            |<----| Connected Edge Vb to Va |<---|            |
          --------------     ---------------------------    --------------


4 data structures have been defined to implement the Graph model:

.. c:struct:: ls_vertex
.. c:struct:: ls_edge
.. c:struct:: ls_ted

 - :c:struct:`ls_prefix`

TED stores Vertex, Edge and Subnet elements with a RB Tree structure.
The Vertex key corresponds to the Router ID for OSPF and ISO System ID for
IS-IS. The Edge key corresponds to the IPv4 address, the lowest 64 bits of
the IPv6 address or the combination of the local & remote ID of the interface.
The Subnet key corresponds to the Prefix address (v4 or v6).

An additional status for Vertex, Edge and Subnet allows to determine the state
of the element in the TED: UNSET, NEW, UPDATE, DELETE, SYNC, ORPHAN. Normal
state is SYNC. NEW, UPDATE and DELETE are temporary state when element is
processed. UNSET is normally never used and ORPHAN serves to identify elements
that must be remove when TED is cleaning.

Vertex, Edges and Subnets management functions
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

.. c:function:: struct ls_vertex *ls_vertex_add(struct ls_ted *ted, struct ls_node *node)
.. c:function:: struct ls_edge *ls_edge_add(struct ls_ted *ted, struct ls_attributes *attributes)
.. c:function:: struct ls_subnet *ls_subnet_add(struct ls_ted *ted, struct ls_prefix *pref)

   Add, respectively new Vertex, Edge or Subnet to the Link State Datebase.
   Vertex, Edge or Subnet are created from, respectively the Link State Node,
   Attribute or Prefix structure. Data structure are dynamically allocated.

.. c:function:: struct ls_vertex *ls_vertex_update(struct ls_ted *ted, struct ls_node *node)
.. c:function:: struct ls_edge *ls_edge_update(struct ls_ted *ted, struct ls_attributes *attributes)
.. c:function:: struct ls_subnet *ls_subnet_update(struct ls_ted *ted, struct ls_prefix *pref)

   Update, respectively Vertex, Edge or Subnet with, respectively the Link
   State Node, Attribute or Prefix. A new data structure is created if no one
   corresponds to the Link State Node, Attribute or Prefix. If element already
   exists in the TED, its associated Link State information is replaced by the
   new one if there are different and the old associated Link State information
   is deleted and memory freed.

.. c:function:: void ls_vertex_del(struct ls_ted *ted, struct ls_vertex *vertex)
.. c:function:: void ls_vertex_del_all(struct ls_ted *ted, struct ls_vertex *vertex)
.. c:function:: void ls_edge_del(struct ls_ted *ted, struct ls_edge *edge)
.. c:function:: void ls_edge_del_all(struct ls_ted *ted, struct ls_edge *edge)
.. c:function:: void ls_subnet_del(struct ls_ted *ted, struct ls_subnet *subnet)
.. c:function:: void ls_subnet_del_all(struct ls_ted *ted, struct ls_subnet *subnet)

   Delete, respectively Link State Vertex, Edge or Subnet. Data structure are
   freed but not the associated Link State information with the simple `_del()`
   form of the function while the `_del_all()` version freed also associated
   Link State information. TED is not modified if Vertex, Edge or Subnet is
   NULL or not found in the Data Base. Note that references between Vertices,
   Edges and Subnets are removed first.

.. c:function:: struct ls_vertex *ls_find_vertex_by_key(struct ls_ted *ted, const uint64_t key)
.. c:function:: struct ls_vertex *ls_find_vertex_by_id(struct ls_ted *ted, struct ls_node_id id)

   Find Vertex in the TED by its unique key or its Link State Node ID.
   Return Vertex if found, NULL otherwise.

.. c:function:: struct ls_edge *ls_find_edge_by_key(struct ls_ted *ted, const uint64_t key)
.. c:function:: struct ls_edge *ls_find_edge_by_source(struct ls_ted *ted, struct ls_attributes *attributes);
.. c:function:: struct ls_edge *ls_find_edge_by_destination(struct ls_ted *ted, struct ls_attributes *attributes);

   Find Edge in the Link State Data Base by its key, source or distination
   (local IPv4 or IPv6 address or local ID) informations of the Link State
   Attributes. Return Edge if found, NULL otherwise.

.. c:function:: struct ls_subnet *ls_find_subnet(struct ls_ted *ted, const struct prefix prefix)

   Find Subnet in the Link State Data Base by its key, i.e. the associated
   prefix. Return Subnet if found, NULL otherwise.

.. c:function:: int ls_vertex_same(struct ls_vertex *v1, struct ls_vertex *v2)
.. c:function:: int ls_edge_same(struct ls_edge *e1, struct ls_edge *e2)
.. c:function:: int ls_subnet_same(struct ls_subnet *s1, struct ls_subnet *s2)

   Check, respectively if two Vertices, Edges or Subnets are equal.
   Note that these routines has the same return value sense as '=='
   (which is different from a comparison).


TED management functions
^^^^^^^^^^^^^^^^^^^^^^^^

Some helpers functions have been also provided to ease TED management:

.. c:function:: struct ls_ted *ls_ted_new(const uint32_t key, char *name, uint32_t asn)

   Create a new Link State Data Base. Key must be different from 0.
   Name could be NULL and AS number equal to 0 if unknown.

.. c:function:: void ls_ted_del(struct ls_ted *ted)
.. c:function:: void ls_ted_del_all(struct ls_ted *ted)

   Delete existing Link State Data Base. Vertices, Edges, and Subnets are not
   removed with ls_ted_del() function while they are with ls_ted_del_all().

.. c:function:: void ls_connect_vertices(struct ls_vertex *src, struct ls_vertex *dst, struct ls_edge *edge)

   Connect Source and Destination Vertices by given Edge. Only non NULL source
   and destination vertices are connected.

.. c:function:: void ls_connect(struct ls_vertex *vertex, struct ls_edge *edge, bool source)
.. c:function:: void ls_disconnect(struct ls_vertex *vertex, struct ls_edge *edge, bool source)

   Connect / Disconnect Link State Edge to the Link State Vertex which could be
   a Source (source = true) or a Destination (source = false) Vertex.

.. c:function:: void ls_disconnect_edge(struct ls_edge *edge)

   Disconnect Link State Edge from both Source and Destination Vertex.
   Note that Edge is not removed but its status is marked as ORPHAN.

.. c:function:: void ls_vertex_clean(struct ls_ted *ted, struct ls_vertex *vertex, struct zclient *zclient)

   Clean Vertex structure by removing all Edges and Subnets marked as ORPHAN
   from this vertex. Corresponding Link State Update message is sent if zclient
   parameter is not NULL. Note that associated Link State Attribute and Prefix
   are also removed and memory freed.

.. c:function:: void ls_ted_clean(struct ls_ted *ted)

   Clean Link State Data Base by removing all Vertices, Edges and SubNets
   marked as ORPHAN. Note that associated Link State Node, Attributes and
   Prefix are removed too.

.. c:function:: void ls_show_vertex(struct ls_vertex *vertex, struct vty *vty, struct json_object *json, bool verbose)
.. c:function:: void ls_show_edge(struct ls_edeg *edge, struct vty *vty, struct json_object *json, bool verbose)
.. c:function:: void ls_show_subnet(struct ls_subnet *subnet, struct vty *vty, struct json_object *json, bool verbose)
.. c:function:: void ls_show_vertices(struct ls_ted *ted, struct vty *vty, struct json_object *json, bool verbose)
.. c:function:: void ls_show_edges(struct ls_ted *ted, struct vty *vty, struct json_object *json, bool verbose)
.. c:function:: void ls_show_subnets(struct ls_ted *ted, struct vty *vty, struct json_object *json, bool verbose)
.. c:function:: void ls_show_ted(struct ls_ted *ted, struct vty *vty, struct json_object *json, bool verbose)

   Respectively, show Vertex, Edge, Subnet provided as parameter, all Vertices,
   all Edges, all Subnets and the whole TED if not specified. Output could be
   more detailed with verbose parameter for VTY output. If both JSON and VTY
   output are specified, JSON takes precedence over VTY.

.. c:function:: void ls_dump_ted(struct ls_ted *ted)

   Dump TED information to the current logging output.

Link State Messages
-------------------

This part of the API provides functions and data structure to ease the
communication between the *Producer* and *Consumer* daemons.

Communications principles
^^^^^^^^^^^^^^^^^^^^^^^^^

Recent ZAPI Opaque Message is used to exchange Link State data between daemons.
For that purpose, Link State API provides new functions to serialize and parse
Link State information through the ZAPI Opaque message. A dedicated flag,
named ZAPI_OPAQUE_FLAG_UNICAST, allows daemons to send a unicast or a multicast
Opaque message and is used as follow for the Link State exchange:

- Multicast: To send data update to all daemons that have subscribed to the
  Link State Update message
- Unicast: To send initial Link State information from a particular daemon. All
  data are send only to the daemon that request Link State Synchronisatio

Figure 1 below, illustrates the ZAPI Opaque message exchange between a
*Producer* (an IGP like OSPF or IS-IS) and a *Consumer* (e.g. BGP). The
message sequences are as follows:

- First, both *Producer* and *Consumer* must register to their respective ZAPI
  Opaque Message: **Link State Sync** for the *Producer* in order to receive
  Database synchronisation request from a *Consumer*, **Link State Update** for
  the *Consumer* in order to received any Link State update from a *Producer*.
  These register messages are stored by Zebra to determine to which daemon it
  should redistribute the ZAPI messages it receives.
- Then, the *Consumer* sends a **Link State Synchronistation** request with the
  Multicast method in order to receive the complete Link State Database from a
  *Producer*. ZEBRA daemon forwards this message to any *Producer* daemons that
  previously registered to this message. If no *Producer* has yet registered,
  the request is lost. Thus, if the *Consumer* receives no response whithin a
  given timer, it means that no *Producer* are available right now. So, the
  *Consumer* must send the same request until it receives a Link State Database
  Synchronistation message. This behaviour is necessary as we can't control in
  which order daemons are started. It is up to the *Consumer* daemon to fix the
  timeout and the number of retry.
- When a *Producer* receives a **Link State Synchronisation** request, it
  starts sending all elements of its own Link State Database through the
  **Link State Database Synchronisation** message. These messages are send with
  the Unicast method to avoid flooding other daemons with these elements. ZEBRA
  layer ensures to forward the message to the right daemon.
- When a *Producer* update its Link State Database, it automatically sends a
  **Link State Update** message with the Multicast method. In turn, ZEBRA
  daemon forwards the message to all *Consumer* daemons that previously
  registered to this message. if no daemon is registered, the message is lost.
- A daemon could unregister from the ZAPI Opaque message registry at any time.
  In this case, the ZEBRA daemon stops to forward any messages it receives to
  this daemon, even if it was previously converns.

::

       IGP                           ZEBRA                        Consumer
    (OSPF/IS-IS)               (ZAPI Opaque Thread)              (e.g. BGP)
        |                              |                             |           \
        |                              |      Register LS Update     |            |
        |                              |<----------------------------|   Register Phase
        |                              |                             |            |
        |                              |      Request LS Sync        |            |
        |                              |<----------------------------|            |
        :                              :                             :  A         |
        |    Register LS Sync          |                             |  |         |
        |----------------------------->|                             |  |        /
        :                              :                             :  |TimeOut
        :                              :                             :  |
        |                              |                             |  |
        |                              |      Request LS Sync        |  v        \
        |    Request LS Sync           |<----------------------------|            |
        |<-----------------------------|                             |   Synchronistation
        |    LS DB Update              |                             |           Phase
        |----------------------------->|      LS DB Update           |            |
        |                              |---------------------------->|            |
        |    LS DB Update (cont'd)     |                             |            |
        |----------------------------->|      LS DB Update (cont'd)  |            |
        |            .                 |---------------------------->|            |
        |            .                 |             .               |            |
        |            .                 |             .               |            |
        |    LS DB Update (end)        |             .               |            |
        |----------------------------->|      LS DB Update (end)     |            |
        |                              |---------------------------->|            |
        |                              |                             |           /
        :                              :                             :
        :                              :                             :
        |    LS DB Update              |                             |           \
        |----------------------------->|      LS DB Update           |            |
        |                              |---------------------------->|      Update Phase
        |                              |                             |            |
        :                              :                             :           /
        :                              :                             :
        |                              |                             |           \
        |                              |      Unregister LS Update   |            |
        |                              |<----------------------------|      Deregister Phase
        |                              |                             |            |
        |    LS DB Update              |                             |            |
        |----------------------------->|                             |            |
        |                              |                             |           /
        |                              |                             |

        Figure 1: Link State messages exchange


Data Structures
^^^^^^^^^^^^^^^

The Link State Message is defined to convey Link State parameters from
the routing protocol (OSPF or IS-IS) to other daemons e.g. BGP.

.. c:struct:: ls_message

The structure is composed of:

- Event of the message:

  - Sync: Send the whole LS DB following a request
  - Add: Send the a new Link State element
  - Update: Send an update of an existing Link State element
  - Delete: Indicate that the given Link State element is removed

- Type of Link State element: Node, Attribute or Prefix
- Remote node id when known
- Data: Node, Attributes or Prefix

A Link State Message can carry only one Link State Element (Node, Attributes
of Prefix) at once, and only one Link State Message is sent through ZAPI
Opaque Link State type at once.

Functions
^^^^^^^^^

.. c:function:: int ls_register(struct zclient *zclient, bool server)
.. c:function:: int ls_unregister(struct zclient *zclient, bool server)

   Register / Unregister daemon to received ZAPI Link State Opaque messages.
   Server must be set to true for *Producer* and to false for *Consumer*.

.. c:function:: int ls_request_sync(struct zclient *zclient)

   Request initial Synchronisation to collect the whole Link State Database.

.. c:function:: struct ls_message *ls_parse_msg(struct stream *s)

   Parse Link State Message from stream. Used this function once receiving a
   new ZAPI Opaque message of type Link State.

.. c:function:: void ls_delete_msg(struct ls_message *msg)

   Delete existing message. Data structure is freed.

.. c:function:: int ls_send_msg(struct zclient *zclient, struct ls_message *msg, struct zapi_opaque_reg_info *dst)

   Send Link State Message as new ZAPI Opaque message of type Link State.
   If destination is not NULL, message is sent as Unicast otherwise it is
   broadcast to all registered daemon.

.. c:function:: struct ls_message *ls_vertex2msg(struct ls_message *msg, struct ls_vertex *vertex)
.. c:function:: struct ls_message *ls_edge2msg(struct ls_message *msg, struct ls_edge *edge)
.. c:function:: struct ls_message *ls_subnet2msg(struct ls_message *msg, struct ls_subnet *subnet)

   Create respectively a new Link State Message from a Link State Vertex, Edge
   or Subnet. If Link State Message is NULL, a new data structure is
   dynamically allocated. Note that the Vertex, Edge and Subnet status is used
   to determine the corresponding Link State Message event: ADD, UPDATE,
   DELETE, SYNC.

.. c:function:: int ls_msg2vertex(struct ls_ted *ted, struct ls_message *msg)
.. c:function:: int ls_msg2edge(struct ls_ted *ted, struct ls_message *msg)
.. c:function:: int ls_msg2subnet(struct ls_ted *ted, struct ls_message *msg)

   Convert Link State Message respectively in Vertex, Edge or Subnet and
   update the Link State Database accordingly to the message event: SYNC, ADD,
   UPDATE or DELETE.

.. c:function:: struct ls_element *ls_msg2ted(struct ls_ted *ted, struct ls_message *msg, bool delete)
.. c:function:: struct ls_element *ls_stream2ted(struct ls_ted *ted, struct ls_message *msg, bool delete)

   Convert Link State Message or Stream Buffer in a Link State element (Vertex,
   Edge or Subnet) and update the Link State Database accordingly to the
   message event: SYNC, ADD, UPDATE or DELETE. The function return the generic
   structure ls_element that point to the Vertex, Edge or Subnet which has been
   added, updated or synchronous in the database. Note that the delete boolean
   parameter governs the action for the DELETE action: true, Link State Element
   is removed from the database and NULL is return. If set to false, database
   is not updated and the function sets the Link State Element status to
   Delete and return the element for futur deletion by the calling function.

.. c:function:: int ls_sync_ted(struct ls_ted *ted, struct zclient *zclient, struct zapi_opaque_reg_info *dst)

   Send all the content of the Link State Data Base to the given destination.
   Link State content is sent is this order: Vertices, Edges then Subnet.
   This function must be used when a daemon request a Link State Data Base
   Synchronization.