
# Extension pipelining

`websocket-extensions` models the extension negotiation and processing pipeline
of the WebSocket protocol. Between the driver parsing messages from the TCP
stream and handing those messages off to the application, there may exist a
stack of extensions that transform the message somehow.

In the parlance of this framework, a *session* refers to a single instance of an
extension, acting on a particular socket on either the server or the client
side. A session may transform messages both incoming to the application and
outgoing from the application, for example the `permessage-deflate` extension
compresses outgoing messages and decompresses incoming messages. Message streams
in either direction are independent; that is, incoming and outgoing messages
cannot be assumed to 'pair up' as in a request-response protocol.

Asynchronous processing of messages poses a number of problems that this
pipeline construction is intended to solve.

## Overview

Logically, we have the following:

    +-------------+  out  +---+     +---+     +---+       +--------+
    |             |------>|   |---->|   |---->|   |------>|        |
    | Application |       | A |     | B |     | C |       | Driver |
    |             |<------|   |<----|   |<----|   |<------|        |
    +-------------+  in   +---+     +---+     +---+       +--------+

                          \                       /
                           +----------o----------+
                                      |
                                   sessions

For outgoing messages, the driver receives the result of

        C.outgoing(B.outgoing(A.outgoing(message)))

    or, [A, B, C].reduce(((m, ext) => ext.outgoing(m)), message)

For incoming messages, the application receives the result of

        A.incoming(B.incoming(C.incoming(message)))

    or, [C, B, A].reduce(((m, ext) => ext.incoming(m)), message)

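
As a concrete illustration of the two orderings, here is a minimal sketch using
synchronous toy sessions. The `tagger` helper and the string messages are
assumptions made purely for illustration; real sessions transform message
objects and may be asynchronous.

```js
// Hypothetical helper: builds a toy session that appends its name to a message
function tagger(name) {
  return {
    outgoing: function(message) { return message + ' >' + name; },
    incoming: function(message) { return message + ' <' + name; }
  };
}

var A = tagger('A'), B = tagger('B'), C = tagger('C');

// Outgoing messages visit A, then B, then C
var sent = [A, B, C].reduce(function(m, ext) {
  return ext.outgoing(m);
}, 'msg');

// Incoming messages visit C, then B, then A
var received = [C, B, A].reduce(function(m, ext) {
  return ext.incoming(m);
}, 'msg');

console.log(sent);     // msg >A >B >C
console.log(received); // msg <C <B <A
```
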
A session is of the following type, to borrow notation from pseudo-Haskell:

    type Session = {
      incoming :: Message -> Message
      outgoing :: Message -> Message
      close    :: () -> ()
    }

(That `() -> ()` syntax is intended to mean that `close()` is a nullary void
method; I apologise to any Haskell readers for not using the right monad.)

The `incoming()` and `outgoing()` methods perform message transformation in the
respective directions; `close()` is called when a socket closes so the session
can release any resources it's holding, for example a DEFLATE de/compression
context.

However because this is JavaScript, the `incoming()` and `outgoing()` methods
may be asynchronous (indeed, `permessage-deflate` is based on `zlib`, whose API
is stream-based). So their interface is strictly:

    type Session = {
      incoming :: Message -> Callback -> ()
      outgoing :: Message -> Callback -> ()
      close    :: () -> ()
    }

    type Callback = Either Error Message -> ()

This means a message *m2* can be pushed into a session while it's still
processing the preceding message *m1*. The messages can be processed
concurrently but they *must* be given to the next session in line (or to the
application) in the same order they came in. Applications will expect to receive
messages in the order they arrived over the wire, and sessions require this too.
So ordering of messages must be preserved throughout the pipeline.

Consider the following highly simplified extension that deflates messages on the
wire. `message` is a value conforming to the type:

    type Message = {
      rsv1   :: Boolean
      rsv2   :: Boolean
      rsv3   :: Boolean
      opcode :: Number
      data   :: Buffer
    }

Here's the extension:

```js
var zlib = require('zlib');

var deflate = {
  outgoing: function(message, callback) {
    zlib.deflateRaw(message.data, function(error, result) {
      message.rsv1 = true;
      message.data = result;
      callback(error, message);
    });
  },

  incoming: function(message, callback) {
    // decompress inbound messages (elided)
  },

  close: function() {
    // no state to clean up
  }
};
```

We can call it with a large message followed by a small one, and the small one
will be returned first:

```js
var crypto = require('crypto'),
    large  = crypto.randomBytes(1 << 14),
    small  = Buffer.from('hi');

deflate.outgoing({data: large}, function() {
  console.log(1, 'large');
});

deflate.outgoing({data: small}, function() {
  console.log(2, 'small');
});

/* prints:  2 'small'
            1 'large' */
```

So a session that processes messages asynchronously may fail to preserve message
ordering.

Now, this extension is stateless, so it can process messages in any order and
still produce the same output. But some extensions are stateful and require
message order to be preserved.

For example, when using `permessage-deflate` without `no_context_takeover` set,
the session retains a DEFLATE de/compression context between messages, which
accumulates state as it consumes data (later messages can refer to sections of
previous ones to improve compression). Reordering parts of the DEFLATE stream
will result in a failed decompression. Messages must be decompressed in the same
order they were compressed by the peer in order for the DEFLATE protocol to
work.

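
To make the dependence on ordering concrete, here is a toy stand-in for a
stateful compression context. It is not DEFLATE: the `DictionaryCodec` class
is a hypothetical illustration in which a repeated message is replaced by a
back-reference to an earlier one, which is enough to show why decoding out of
order fails.

```js
// Toy stateful codec (an assumption for illustration, not real DEFLATE):
// a message seen before is sent as a back-reference to its first occurrence
function DictionaryCodec() {
  this._seen = [];
}

DictionaryCodec.prototype.encode = function(text) {
  var index = this._seen.indexOf(text);
  if (index >= 0) return {ref: index};
  this._seen.push(text);
  return {literal: text};
};

DictionaryCodec.prototype.decode = function(packet) {
  if ('literal' in packet) {
    this._seen.push(packet.literal);
    return packet.literal;
  }
  if (packet.ref >= this._seen.length) {
    throw new Error('Unknown back-reference: ' + packet.ref);
  }
  return this._seen[packet.ref];
};

var encoder = new DictionaryCodec(),
    encoded = ['hello', 'hello'].map(function(text) {
      return encoder.encode(text);
    });

// Decoding in the order the packets were encoded works
var decoder        = new DictionaryCodec(),
    decodedInOrder = encoded.map(function(packet) {
      return decoder.decode(packet);
    });

// Decoding the second packet first fails: its context has not been built yet
var failure = null;
try {
  new DictionaryCodec().decode(encoded[1]);
} catch (error) {
  failure = error.message;
}

console.log(decodedInOrder); // [ 'hello', 'hello' ]
console.log(failure);        // Unknown back-reference: 0
```
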
Finally, there is the problem of closing a socket. When a WebSocket is closed by
the application, or receives a closing request from the other peer, there may be
messages outgoing from the application and incoming from the peer in the
pipeline. If we close the socket and pipeline immediately, two problems arise:

* We may send our own closing frame to the peer before all prior messages we
  sent have been written to the socket, and before we have finished processing
  all prior messages from the peer
* The session may be instructed to close its resources (e.g. its de/compression
  context) while it's in the middle of processing a message, or before it has
  received messages that are upstream of it in the pipeline

Essentially, we must defer closing the sessions and sending a closing frame
until after all prior messages have exited the pipeline.

## Design goals

* Message order must be preserved between the protocol driver, the extension
  sessions, and the application
* Messages should be handed off to sessions and endpoints as soon as possible,
  to maximise throughput of stateless sessions
* The closing procedure should block any further messages from entering the
  pipeline, and should allow all existing messages to drain
* Sessions should be closed as soon as possible to prevent them holding memory
  and other resources when they have no more messages to handle
* The closing API should allow the caller to detect when the pipeline is empty
  and it is safe to continue the WebSocket closing procedure
* Individual extensions should remain as simple as possible to facilitate
  modularity and independent authorship

The final point about modularity is an important one: this framework is designed
to facilitate extensions existing as plugins, by decoupling the protocol driver,
extensions, and application. In an ideal world, plugins should only need to
contain code for their specific functionality, and not solve these problems that
apply to all sessions. Also, solving some of these problems requires
consideration of all active sessions collectively, which an individual session
is incapable of doing.

For example, it is entirely possible to take the simple `deflate` extension
above and wrap its `incoming()` and `outgoing()` methods in two `Transform`
streams, producing this type:

    type Session = {
      incoming :: TransformStream
      outgoing :: TransformStream
      close    :: () -> ()
    }

The `Transform` class makes it easy to wrap an async function such that message
order is preserved:

```js
var stream  = require('stream'),
    session = new stream.Transform({objectMode: true});

session._transform = function(message, _, callback) {
  var self = this;
  deflate.outgoing(message, function(error, result) {
    self.push(result);
    callback();
  });
};
```

However, this has a negative impact on throughput: it works by deferring
`callback()` until the async function has 'returned', which blocks `Transform`
from passing further input into the `_transform()` method until the current
message is dealt with completely. This would prevent sessions from processing
messages concurrently, and would unnecessarily reduce the throughput of
stateless extensions.

So, input should be handed off to sessions as soon as possible, and all we need
is a mechanism to reorder the output so that message order is preserved for the
next session in line.

## Solution

We now describe the model implemented here and how it meets the above design
goals. The above diagram where a stack of extensions sit between the driver and
application describes the data flow, but not the object graph. That looks like
this:

            +--------+
            | Driver |
            +---o----+
                |
                V
          +------------+      +----------+
          | Extensions o----->| Pipeline |
          +------------+      +-----o----+
                                    |
                    +---------------+---------------+
                    |               |               |
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+

A driver using this framework holds an instance of the `Extensions` class, which
it uses to register extension plugins, negotiate headers and transform messages.
The `Extensions` instance itself holds a `Pipeline`, which contains an array of
`Cell` objects, each of which wraps one of the sessions.

### Message processing

Both the `Pipeline` and `Cell` classes have `incoming()` and `outgoing()`
methods; the `Pipeline` interface pushes messages into the pipe, delegates the
message to each `Cell` in turn, then returns it back to the driver. Outgoing
messages pass through `A` then `B` then `C`, and incoming messages in the
reverse order.

Internally, a `Cell` contains two `Functor` objects. A `Functor` wraps an async
function and makes sure its output messages maintain the order of its input
messages. This name is due to [@fronx](https://github.com/fronx), on the basis
that, by preserving message order, the abstraction preserves the *mapping*
between input and output messages. To use our simple `deflate` extension from
above:

```js
var functor = new Functor(deflate, 'outgoing');

functor.call({data: large}, function() {
  console.log(1, 'large');
});

functor.call({data: small}, function() {
  console.log(2, 'small');
});

/*  ->  1 'large'
        2 'small' */
```

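
One way such an order-preserving wrapper could work is to queue a record for
every call and only release results from the head of the queue. The sketch
below illustrates that idea under stated assumptions; it is not the library's
`Functor` source. `OrderedCall`, `manualSession` and `pendingCalls` are
hypothetical names, and the fake session completes calls only when told to, so
the reordering is deterministic.

```js
// Hypothetical order-preserving wrapper around session[method]
function OrderedCall(session, method) {
  this._session = session;
  this._method  = method;
  this._queue   = [];
}

OrderedCall.prototype.call = function(message, callback) {
  var record = {done: false, error: null, message: null, callback: callback},
      self   = this;

  this._queue.push(record);

  // Start work immediately so that calls can run concurrently
  this._session[this._method](message, function(error, result) {
    record.done    = true;
    record.error   = error;
    record.message = result;
    self._flush();
  });
};

// Release finished records, but only from the head of the queue
OrderedCall.prototype._flush = function() {
  while (this._queue.length > 0 && this._queue[0].done) {
    var record = this._queue.shift();
    record.callback(record.error, record.message);
  }
};

// A fake session whose calls complete only when we invoke them by hand
var pendingCalls  = [];
var manualSession = {
  outgoing: function(message, callback) {
    pendingCalls.push(function() { callback(null, message); });
  }
};

var ordered = new OrderedCall(manualSession, 'outgoing'),
    output  = [];

ordered.call('large', function(error, m) { output.push(m); });
ordered.call('small', function(error, m) { output.push(m); });

pendingCalls[1](); // 'small' finishes first but is held back
pendingCalls[0](); // 'large' finishes, releasing both in input order

console.log(output); // [ 'large', 'small' ]
```

Both calls are started before either completes, so the wrapped function still
runs concurrently; only the delivery of results is serialised.
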
A `Cell` contains two of these, one for each direction:

                            +-----------------------+
                      +---->| Functor [A, incoming] |
    +----------+      |     +-----------------------+
    | Cell [A] o------+
    +----------+      |     +-----------------------+
                      +---->| Functor [A, outgoing] |
                            +-----------------------+

This satisfies the message transformation requirements: the `Pipeline` simply
loops over the cells in the appropriate direction to transform each message.
Because each `Cell` will preserve message order, we can pass a message to the
next `Cell` in line as soon as the current `Cell` returns it. This gives each
`Cell` all the messages in order while maximising throughput.
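
The loop over the cells can be sketched as a chain of callbacks. This is an
illustration rather than the library's `Pipeline` code: `runThrough` and
`toyCell` are hypothetical names, the toy cells call back synchronously, and
errors simply short-circuit the chain here.

```js
// Hypothetical helper: pass a message through each cell in turn
function runThrough(cells, direction, message, callback) {
  // Incoming messages traverse the cells in reverse order
  var order = (direction === 'incoming') ? cells.slice().reverse() : cells;

  function step(index, error, current) {
    if (error) return callback(error, null);
    if (index === order.length) return callback(null, current);

    // Hand the message to the next cell as soon as this one returns it
    order[index][direction](current, function(err, result) {
      step(index + 1, err, result);
    });
  }

  step(0, null, message);
}

// Toy cell that records its name on the message (an array of names)
function toyCell(name) {
  var visit = function(message, callback) {
    callback(null, message.concat(name));
  };
  return {incoming: visit, outgoing: visit};
}

var cells = [toyCell('A'), toyCell('B'), toyCell('C')],
    trace = {};

runThrough(cells, 'outgoing', [], function(error, message) {
  trace.outgoing = message.join('');
});

runThrough(cells, 'incoming', [], function(error, message) {
  trace.incoming = message.join('');
});

console.log(trace); // { outgoing: 'ABC', incoming: 'CBA' }
```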

### Session closing

We want to close each session as soon as possible, after all existing messages
have drained. To do this, each `Cell` begins with a pending message counter in
each direction, labelled `in` and `out` below.

                              +----------+
                              | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                    |               |               |
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 0          out: 0          out: 0

When a message *m1* enters the pipeline, say in the `outgoing` direction, we
increment the `pending.out` counter on all cells immediately.

                              +----------+
                        m1 => | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                    |               |               |
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 1          out: 1          out: 1

*m1* is handed off to `A`, meanwhile a second message *m2* arrives in the same
direction. All `pending.out` counters are again incremented.

                              +----------+
                        m2 => | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                m1  |               |               |
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 2          out: 2          out: 2

When the first cell's `A.outgoing` functor finishes processing *m1*, the first
`pending.out` counter is decremented and *m1* is handed off to cell `B`.

                              +----------+
                              | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                m2  |           m1  |               |
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 1          out: 2          out: 2


As `B` finishes with *m1*, and as `A` finishes with *m2*, the `pending.out`
counters continue to decrement.

                              +----------+
                              | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                    |           m2  |           m1  |
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 0          out: 1          out: 2


Say `C` is a little slow, and begins processing *m2* while still processing
*m1*. That's fine, the `Functor` mechanism will keep *m1* ahead of *m2* in the
output.

                              +----------+
                              | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                    |               |           m2  | m1
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 0          out: 0          out: 2

Once all messages are dealt with, the counters return to `0`.

                              +----------+
                              | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                    |               |               |
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 0          out: 0          out: 0

The same process applies in the `incoming` direction, the only difference being
that messages are passed to `C` first.

This makes closing the sessions quite simple. When the driver wants to close the
socket, it calls `Pipeline.close()`. This *immediately* calls `close()` on all
the cells. If a cell has `in == out == 0`, then it immediately calls
`session.close()`. Otherwise, it stores the closing call and defers it until
`in` and `out` have both ticked down to zero. The pipeline will not accept new
messages after `close()` has been called, so we know the pending counts will not
increase after this point.

This means each session is closed as soon as possible: `A` can close while the
slow `C` session is still working, because it knows there are no more messages
on the way. Similarly, `C` will defer closing if `close()` is called while *m1*
is still in `B`, and *m2* in `A`, because its pending count means it knows it
has work yet to do, even if it's not received those messages yet. This concern
cannot be addressed by extensions acting only on their own local state, unless
we pollute individual extensions by making them all implement this same
mechanism.

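
The deferred close can be sketched with a pair of counters per cell. The
`CountingCell` class and its `expect()` and `finished()` methods below are
hypothetical names chosen for this illustration; they are not the library's
`Cell` API, and message processing itself is left out.

```js
// Hypothetical cell that defers session.close() until its counters drain
function CountingCell(session) {
  this._session = session;
  this._pending = {in: 0, out: 0};
  this._closing = false;
  this._closed  = false;
}

// Called when a message enters the pipeline in the given direction
CountingCell.prototype.expect = function(direction) {
  this._pending[direction] += 1;
};

// Called when this cell has finished with one message
CountingCell.prototype.finished = function(direction) {
  this._pending[direction] -= 1;
  this._closeIfDrained();
};

CountingCell.prototype.close = function() {
  this._closing = true;
  this._closeIfDrained();
};

CountingCell.prototype._closeIfDrained = function() {
  if (!this._closing || this._closed) return;
  if (this._pending.in > 0 || this._pending.out > 0) return;
  this._closed = true;
  this._session.close();
};

var log = [];

function namedSession(name) {
  return {close: function() { log.push(name + ' closed'); }};
}

var a = new CountingCell(namedSession('A')),
    c = new CountingCell(namedSession('C'));

// One outgoing message enters the pipeline: every cell counts it at once
a.expect('out');
c.expect('out');

a.finished('out');           // A is done; the message has yet to reach C

a.close();                   // A has nothing pending, so it closes at once
c.close();                   // C still expects one message, so it defers
log.push('close requested');

c.finished('out');           // C drains and closes now

console.log(log); // [ 'A closed', 'close requested', 'C closed' ]
```
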
The actual closing API at each level is slightly different:

    type Session = {
      close :: () -> ()
    }

    type Cell = {
      close :: () -> Promise ()
    }

    type Pipeline = {
      close :: Callback -> ()
    }

This might appear inconsistent so it's worth explaining. Remember that a
`Pipeline` holds a list of `Cell` objects, each wrapping a `Session`. The driver
talks (via the `Extensions` API) to the `Pipeline` interface, and it wants
`Pipeline.close()` to do two things: close all the sessions, and tell me when
it's safe to start the closing procedure (i.e. when all messages have drained
from the pipe and been handed off to the application or socket). A callback API
works well for that.

At the other end of the stack, `Session.close()` is a nullary void method with
no callback or promise API because we don't care what it does, and whatever it
does do will not block the WebSocket protocol; we're not going to hold off
processing messages while a session closes its de/compression context. We just
tell it to close itself, and don't want to wait while it does that.

In the middle, `Cell.close()` returns a promise rather than using a callback.
This is for two reasons. First, `Cell.close()` might not do anything
immediately, it might have to defer its effect while messages drain. So, if
given a callback, it would have to store it in a queue for later execution.
Callbacks work fine if your method does something and can then invoke the
callback itself, but if you need to store callbacks somewhere so another method
can execute them, a promise is a better fit. Second, it better serves the
purposes of `Pipeline.close()`: it wants to call `close()` on each of a list of
cells, and wait for all of them to finish. This is simple and idiomatic using
promises:

```js
var closed = cells.map((cell) => cell.close());
Promise.all(closed).then(callback);
```

(We don't actually use a full *Promises/A+* compatible promise here, we use a
much simplified construction that acts as a callback aggregator and resolves
synchronously and does not support chaining, but the principle is the same.)
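
A simplified construction of that kind might look like the sketch below. The
`Latch` name and its methods are assumptions for illustration and do not
reflect the library's internal class; the point is only that callbacks are
stored until resolution, run synchronously, and cannot be chained.

```js
// Hypothetical stand-in for a promise: stores callbacks until resolved
function Latch() {
  this._resolved  = false;
  this._callbacks = [];
}

// Registers a callback; deliberately returns nothing, so there is no chaining
Latch.prototype.then = function(callback) {
  if (this._resolved) callback();
  else this._callbacks.push(callback);
};

// Runs the stored callbacks synchronously, unlike a Promises/A+ promise
Latch.prototype.resolve = function() {
  if (this._resolved) return;
  this._resolved = true;
  this._callbacks.splice(0).forEach(function(callback) { callback(); });
};

// Returns a latch that resolves once every latch in the list has resolved
Latch.all = function(latches) {
  var combined  = new Latch(),
      remaining = latches.length;

  if (remaining === 0) combined.resolve();

  latches.forEach(function(latch) {
    latch.then(function() {
      remaining -= 1;
      if (remaining === 0) combined.resolve();
    });
  });

  return combined;
};

var first  = new Latch(),
    second = new Latch(),
    events = [];

Latch.all([first, second]).then(function() { events.push('all closed'); });

first.resolve();
events.push('first resolved');

second.resolve();
events.push('second resolved');

console.log(events);
// [ 'first resolved', 'all closed', 'second resolved' ]
```

Note that `'all closed'` is recorded before `second.resolve()` returns, which is
what resolving synchronously means here.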

### Error handling

We've not mentioned error handling so far but it bears some explanation. The
above counter system still applies, but behaves slightly differently in the
presence of errors.

Say we push three messages into the pipe in the outgoing direction:

                              +----------+
                m3, m2, m1 => | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                    |               |               |
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 3          out: 3          out: 3

They pass through the cells successfully up to this point:

                              +----------+
                              | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                m3  |           m2  |           m1  |
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 1          out: 2          out: 3

At this point, session `B` produces an error while processing *m2*, that is *m2*
becomes *e2*. *m1* is still in the pipeline, and *m3* is queued behind *m2*.
What ought to happen is that *m1* is handed off to the socket, then *m2* is
released to the driver, which will detect the error and begin closing the
socket. No further processing should be done on *m3* and it should not be
released to the driver after the error is emitted.

To handle this, we allow errors to pass down the pipeline just like messages do,
to maintain ordering. But, once a cell sees its session produce an error, or it
receives an error from upstream, it should refuse to accept any further
messages. Session `B` might have begun processing *m3* by the time it produces
the error *e2*, but `C` will have been given *e2* before it receives *m3*, and
can simply drop *m3*.

Now, say *e2* reaches the slow session `C` while *m1* is still present,
meanwhile *m3* has been dropped. `C` will never receive *m3* since it will have
been dropped upstream. Under the present model, its `out` counter will be `3`
but it is only going to emit two more values: *m1* and *e2*. In order for
closing to work, we need to decrement `out` to reflect this. The situation
should look like this:

                              +----------+
                              | Pipeline |
                              +-----o----+
                                    |
                    +---------------+---------------+
                    |               |           e2  | m1
              +-----o----+    +-----o----+    +-----o----+
              | Cell [A] |    | Cell [B] |    | Cell [C] |
              +----------+    +----------+    +----------+
                 in: 0           in: 0           in: 0
                out: 0          out: 0          out: 2

When a cell sees its session emit an error, or when it receives an error from
upstream, it sets its pending count in the appropriate direction to equal the
number of messages it is *currently* processing. It will not accept any messages
after it sees the error, so this will allow the counter to reach zero.

Note that while *e2* is in the pipeline, `Pipeline` should drop any further
messages in the outgoing direction, but should continue to accept incoming
messages. Until *e2* makes it out of the pipe to the driver, behind previous
successful messages, the driver does not know an error has happened, and a
message may arrive over the socket and make it all the way through the incoming
pipe in the meantime. We only halt processing in the affected direction to avoid
doing unnecessary work since messages arriving after an error should not be
processed.

Some unnecessary work may happen, for example any messages already in the
pipeline following *m2* will be processed by `A`, since it's upstream of the
error. Those messages will be dropped by `B`.
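
The counter adjustment for cell `C` in the scenario above can be traced with a
small sketch. `DirectionCounter` and its methods are hypothetical names, and
the sketch assumes that an error received from upstream occupies a slot in the
cell (queued behind *m1*) in the same way a message would; the library's
actual bookkeeping may differ.

```js
// Hypothetical bookkeeping for one direction of a single cell
function DirectionCounter() {
  this.pending  = 0;     // values this cell is still expected to emit
  this.inFlight = 0;     // values currently held by this cell
  this.stopped  = false; // true once an error has been seen
}

// A message entered the pipeline somewhere upstream of this cell
DirectionCounter.prototype.announce = function() {
  if (!this.stopped) this.pending += 1;
};

// A message or error reaches this cell; returns false if it is dropped
DirectionCounter.prototype.accept = function(error) {
  if (this.stopped) return false;
  this.inFlight += 1;
  if (error) this._stop();
  return true;
};

// This cell hands a value on to the next cell or to the driver
DirectionCounter.prototype.emit = function(error) {
  this.inFlight -= 1;
  this.pending  -= 1;
  if (error && !this.stopped) this._stop();
};

// After an error, expect only the values currently being processed
DirectionCounter.prototype._stop = function() {
  this.stopped = true;
  this.pending = this.inFlight;
};

var c = new DirectionCounter(), history = [];

c.announce(); c.announce(); c.announce();    // m1, m2, m3 enter the pipeline
history.push(c.pending);                     // 3

c.accept();                                  // m1 arrives at C
c.accept(new Error('e2'));                   // e2 arrives behind m1
history.push(c.pending);                     // 2, as in the diagram above

var acceptedM3 = c.accept();                 // m3 would be dropped

c.emit();                                    // m1 leaves C
history.push(c.pending);                     // 1
c.emit(new Error('e2'));                     // e2 leaves C
history.push(c.pending);                     // 0

console.log(history, acceptedM3); // [ 3, 2, 1, 0 ] false
```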

## Alternative ideas

I am considering implementing `Functor` as an object-mode transform stream
rather than what is essentially an async function. Being object-mode, a stream
would preserve message boundaries and would also possibly help address
back-pressure. I'm not sure whether this would require external API changes so
that such streams could be connected to the downstream driver's streams.

## Acknowledgements

Credit is due to [@mnowster](https://github.com/mnowster) for helping with the
design and to [@fronx](https://github.com/fronx) for helping name things.