Skip to content

Latest commit

 

History

History
 
 

automerge-repo-network-websocket

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Automerge-Repo Network: Websocket

Includes two implementations, a Websocket client and a Websocket server. These are used by the example sync-server.

The package uses isomorphic-ws to share code between node and the browser, but the server code is node only due to lack of browser support.

Wire Protocol

Peer naming

Whilst currently this wire protocol is only used for the websocket, it is probably generically useful for any stream oriented transport (e.g. TCP or QUIC). To make translating this spec to other transports easier we refer to the two parties in the protocol as the "initiating" and "receiving" peers. In the WebSocket case the "initiating" peer is the client and the "receiving" peer is the server.

Overview

The websocket wire protocol consists of a handshake where each peer tells the other what their peer ID - and some other metadata - is followed by the sync loop where each peer can send the other sync messages and ephemeral messages.

Handshake

Before sync can continue each peer needs to exchange peer IDs and agree on the protocol version they are using (although currently there is only one version). Handshake is the following steps:

  • Once a connection is established the initiating peer sends a join message with the senderId set to the initiating peers ID and the protocolVersion set to "1"
  • The receiving peer waits until it receives a message from the initiating peer, if the initiating peer receives a message before sending the join message the initiating peer SHOULD terminate the connection.
  • When the receiving peer receives the join message
    • if the protocolVersion is not "1" the receiving peer sends an error message and terminates the connection
    • otherwise
      • store the senderId as the peer ID of the initiating peer
      • emit a "peer-candidate" event with the sender ID as the peer
      • respond with a peer message with the targetId set to the initiating peers peer ID, the senderId set to the receiving peers peer ID and the selectedProtocolVersion set to "1"
      • begin the sync phase
  • Once the initiating peer has sent the join message it waits for a peer message in response. If it receives any other message type before the join message the receiving peer should send an error message and terminates the connection
  • When the initiating peer receives the peer message
    • it stores the senderId as the peer ID of the receiving peer.
    • it emits a "peer-candidate" event with the sender ID as the peer
    • if the selectedProtocolVersion is anything other than "1" the initiating peer sends an error message and terminates the connection
    • it begins the sync phase

Peer IDs and storage IDs

The peer ID is an ephemeral ID which is assumed to only live for the lifetime of the process which advertises the given ID (e.g. a browser tab). Peers may optionally advertise a storage ID in the join and peer messages, this is an ID which is assumed to be tied to a persistent storage of some kind (e.g. an IndexedDB in a browser). Many peer IDs can advertise the same storage ID (as in the case of many browser tabs). The use of a storage ID allows other peers to know whether to save and reload sync states for a given peer (if the peer advertises a storage ID, then save and reload the sync state attached to that storage ID).

Sync Phase

In the sync phase either side may send a request, sync, unavailable, or ephemeral message. Sending these corresponds to implementing NetworkAdapter.send and receiving is emitting the corresponding event from the NetworkAdapter on receipt.

Remote heads gossiping

In some cases peers wish to know about the state of peers who are separated from them by several intermediate peers. For example, a tab running a text editor may wish to show whether the contents of the editor are up to date with respect to a tab running in a browser on another users device. This is achieved by gossiping remote heads across intermediate nodes. The logic for this is the following:

  • For a given connection each peer maintains a list of the storage IDs the remote peer is interested in (note this is storage IDs, not peer IDs)
  • Any peer can send a remote-subscription-changed message to change the set of storage IDs they want the recipient to watch on the sender's behalf
  • Any time a peer receives a sync message it checks:
    • Is the sync message from a peer with a storage ID which some other remote peer has registered interest in
    • Is the remote peer permitted access to the document which the message pertains to (i.e. either the sharePolicy return true or the local peer is already syncing the document with the remote)
  • The local peer sends a remote-heads-changed message to each remote peer who passes these checks
  • Additionally, whenever the local peer receives a remote-heads-changed message it performs the same checks and additionally checks if the timestamp on the remote-heads-changed message is greater than the last timestamp for the same storage ID/document combination and if so it forwards it.

In the browser <-> sync server <-> browser text editor example above each browser tab would send a remote-subscription-changed message to the sync server adding the other browsers storage ID (presumably communicated out of band) to their subscriptions with the sync server. The sync server will then send remote-heads-changed messages to each tab when their heads change.

In a more elaborate example such as browser <-> sync server <-> sync server <-> browser the intermediate sync servers could be configured to have their sharePolicy return true for every document when syncing with each other so that remote-heads-changed messages are forwarded between them unconditionally, allowing the browser tabs to still learn of each others heads.

Message Types

All messages are encoded using CBOR and are described in this document using cddl

Preamble

These type definitions are used in every message type

; The base64 encoded bytes of a Peer ID
peer_id = str
; The base64 encoded bytes of a Storage ID
storage_id = str
; The possible protocol versions (currently always the string "1")
protocol_version = "1"
; The bytes of an automerge sync message
sync_message = bstr
; The base58check encoded bytes of a document ID
document_id = str
; Metadata sent in either the join or peer message types
peer_metadata = {
    ; The storage ID of this peer
    ? storageId: storage_id,
    ; Whether the sender expects to connect again with this storage ID
    isEphemeral: bool
}

Join

Sent by the initiating peer in the handshake phase.

{
    type: "join",
    senderId: peer_id,
    supportedProtocolVersions: protocol_version
    ? metadata: peer_metadata,
}

Peer

Sent by the receiving peer in response to the join message in the handshake phase,

{
    type: "peer",
    senderId: peer_id,
    selectedProtocolVersion: protocol_version,
    targetId: peer_id,
    ? metadata: peer_metadata,
}

Leave

An advisory message sent by a peer when they are planning to disconnect.

{
    type: "leave",
    senderId: peer_id,
}

Request

Sent when the senderId is asking to begin sync for the given documentid. Identical to sync but indicates that the senderId would like an unavailable message if the targetId does not have the document.

{
    type: "request",
    documentId: document_id,
    ; The peer requesting to begin sync
    senderId: peer_id,
    targetId: peer_id,
    ; The initial automerge sync message from the sender
    data: sync_message
}

Sync

Sent any time either peer wants to send a sync message about a given document

{
    type: "sync",
    documentId: document_id,
    ; The peer requesting to begin sync
    senderId: peer_id,
    targetId: peer_id,
    ; The initial automerge sync message from the sender
    data: sync_message
}

Unavailable

Sent when a peer wants to indicate to the targetId that it doesn't have a given document and all of it's peers have also indicated that they don't have it

{
  type: "doc-unavailable",
  senderId: peer_id,
  targetId: peer_id,
  documentId: document_id,
}

Ephemeral

Sent when a peer wants to send an ephemeral message to another peer

{
  type: "ephemeral",
  ; The peer who sent this message
  senderId: peer_id,
  ; The target of this message
  targetId: peer_id,
  ; The sequence number of this message within its session
  count: uint,
  ; The unique session identifying this stream of ephemeral messages
  sessionId: str,
  ; The document ID this ephemera relates to
  documentId: document_id,
  ; The data of this message (in practice this is arbitrary CBOR)
  data: bstr
}

Error

Sent to inform the other end that there has been a protocol error and the connection will close

{
    type: "error",
    message: str,
}

Remote subscription changed

Sent when the sender wishes to change the set of storage IDs they wish to be notified of when the given remotes heads change.

{
  type: "remote-subscription-change"
  senderId: peer_id
  targetId: peer_id

  ; The storage IDs to add to the subscription
  ? add: [* storage_id]

  ; The storage IDs to remove from the subscription
  remove: [* storage_id]
}

Remote heads changed

Sent when the sender wishes to inform the receiver that a peer with a storage ID in the receivers remote heads subscription has changed heads. This is either sent when the local peer receives a new sync message directly from the listened-to peer, or when the local peer receives a remote-heads-changed message relating to the listened-to peer from another peer.

{
  type: "remote-heads-changed"
  senderId: peer_id
  targetId: peer_id

  ; The document ID of the document that has changed
  documentId: document_id

  ; A map from storage ID to the heads advertised for a given storage ID
  newHeads: {
    * storage_id => {
      ; The heads of the new document for the given storage ID as
      ; a list of base64 encoded SHA2 hashes
      heads: [* string]
      ; The local time on the node which initially sent the remote-heads-changed
      ; message as milliseconds since the unix epoch
      timestamp: uint
    }
  }
}