Request for Comments #23

hackergrrl · 2024-01-31T01:17:28Z

The Cable Protocol is a new, proposed protocol specification. It's made up of two smaller protocols: the Cable Wire Protocol and the Cable Handshake Protocol.

The purpose of the Cable Protocol is to facilitate the setup of a secure connection between two members of a cabal chat, and the creation and sync of that cabal, by allowing peers to exchange cryptographically signed documents with each other, such as chat messages, spread across various user-defined channels.

For small edits, please consider using the GitHub feature for "suggested changes": https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/reviewing-changes-in-pull-requests/commenting-on-a-pull-request

Thank you for reviewing our work!

AljoschaMeyer

Quick reviewing pass on the handshake protocol. No compensation needed 💜

I hope github lets me open a second review for the wire protocol, about to find out.

README.md

handshake.md

AljoschaMeyer · 2024-02-01T15:34:57Z

handshake.md

+
+The pre-shared key MUST be mixed into the handshake state as per the rules in
+*9. Pre-shared symmetric keys* of the Noise specification.
+


If me and you both share an interest in 4 cabals, how do we know which connection attempts match to which cabal? Do we just attempt 16 different connections on different transport channels? What if we both are interested in 100 cabals each, 10 of which are shared?

Yeah feels like this won't scale great for a bunch of cabals being replicated on one machine. I assume you'd need separate ports for each advertisement in the peer discovery process?

It's like @RangerMauve says: you'd need to be listening and doing discovery for each cabal you are interested in.

Doesn't that simply make the ports you use part of the shared secret? That might allow me to probe the ports of several machines to find out whether they belong to the same cabals or not.

I didn't understand this. Could you rephrase? Are you describing an attack vector? What do you mean by port as part of the shared secret?

@hackergrrl Sorry, missed your reply there.

Suppose each cabal was identified only by its key, but there was no canonic information on where (i.e., on which port) peers expect connection attempts for that cabal. Suppose further, there is a peer that is interested in five cabals. That peer probably listens for incoming connections on five different ports. Given that there is no "official" port for each cabal, and they want to keep secret which cabals they belong to, they only advertise "you can connect to my cabals on ports 8000 to 8004". Now, if I come along, interested in only a single cabal, and want to connect to this peer, I have to try out all five ports. And if I'm interested in 20 cabals, none of which the other peer also has (which I don't know though), I make 5*20 connection attempts.

The most obvious way to restore efficiency with the current protocol is to have all members of a cabal agree on a port that they will all use for that cabal. But at that point, the port becomes an identifying secret, just like the actual secret key: if I'm not connecting to the correct port, I cannot join, just as if I didn't know the secret key. And it is a secret that a nosy participant can snoop by probing which ports people expose (for cable, if that is detectable); the nosy peer can then obtain information on who shares cabals with whom.

Does that clarify things?

Ah, ok! I can see your understanding now.

Something not mentioned in this specification, but that is planned, is having a discovery mechanism for finding other members of cabals, which will include both an IP and port. This will resolve any uncertainty about what port to find which cabal on.
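To make the arithmetic in this thread concrete, here is a minimal sketch. It assumes the worst case described above, where nothing maps a cabal to a port, so every cabal key has to be tried against every advertised port. `connection_attempts` is a hypothetical helper for illustration, not something from the spec.

```python
def connection_attempts(my_cabals: int, advertised_ports: int) -> int:
    """Worst case: try each cabal key I hold against each advertised port."""
    return my_cabals * advertised_ports


# One cabal of interest, peer advertises ports 8000 to 8004.
print(connection_attempts(1, 5))
# Twenty cabals of interest against the same five ports (the 5*20 case).
print(connection_attempts(20, 5))
# Four cabals each, as in the opening question of this thread.
print(connection_attempts(4, 4))
```

With a discovery mechanism that hands out an IP and port per cabal, the count would presumably drop to one attempt per shared cabal.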

handshake.md

AljoschaMeyer · 2024-02-01T15:51:44Z

handshake.md

+these steps:
+
+1. Compute the total length of all of the cipertexts fragments, in bytes, as
+   `len`, a 32-bit unsigned little endian integer.


Isn't the "total length of all ciphertext fragments" just plaintext.len? If yes, write that instead. If no, what is it and how do I compute it?

Each ciphertext also carries a 16 byte MAC tag for authentication. That's also why the maximum plaintext length isn't 65535 (== u16::MAX) but 65519 (== u16::MAX - 16). So I suppose the total length of all ciphertexts is plaintext.len + 16 * ceil(plaintext.len/65519), or when using integer division, plaintext.len + 16 * ((plaintext.len + 65518) / 65519).

Upon encrypting a plaintext to a ciphertext, Noise adds 16 bytes of authentication data to each resulting ciphertext, so the total length here depends on the number of fragments. I agree about making the computation explicit though!

I added a brief explanation of the calculation. It is shown in the WriteMsg pseudocode, also.

I rewrote this section to be more clear. I'm curious what y'all think.
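For readers following along, the length calculation discussed in this thread could be sketched roughly as below. This is not the spec's WriteMsg pseudocode; the function and constant names are made up for illustration, and it assumes only what is stated above: at most 65519 plaintext bytes per fragment, 16 bytes of authentication data per ciphertext, and a 32-bit unsigned little endian `len`.

```python
import struct

MAX_PLAINTEXT_PER_FRAGMENT = 65519  # u16::MAX - 16, per the discussion above
TAG_LEN = 16                        # authentication data added to each ciphertext


def total_ciphertext_len(plaintext_len: int) -> int:
    """Total bytes across all ciphertext fragments for one plaintext."""
    # Integer ceiling of plaintext_len / 65519. Adding 1 to the floor instead
    # would overcount by one tag when the length is an exact multiple of 65519.
    fragments = (plaintext_len + MAX_PLAINTEXT_PER_FRAGMENT - 1) // MAX_PLAINTEXT_PER_FRAGMENT
    # Note: an empty plaintext yields zero fragments under this formula; how
    # the spec treats that case is not settled in this thread.
    return plaintext_len + TAG_LEN * fragments


def encode_len(total: int) -> bytes:
    """Encode `len` as a 32-bit unsigned little endian integer."""
    return struct.pack("<I", total)
```

For example, a plaintext of exactly 65519 bytes fits in one fragment, while 65520 bytes needs two and therefore carries two tags.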

handshake.md

AljoschaMeyer

And the wire spec. Pedantic to the point of obnoxiousness in parts, but hopefully helpful nonetheless.

Overall, this was an enjoyable read, kudos. I have to run for now, but enjoy =)

wire.md

AljoschaMeyer · 2024-02-01T17:49:16Z

wire.md

+requested time range, though they may desire not to in certain circumstances,
+particularly if a channel has a very long history and the responding client
+lacks sufficient resources at the time to return thousands or hundreds of
+thousands of chat message hashes.


Any recommendation on which messages to transmit then (probably the newest ones?)?

I appreciate this catch. Specifying this makes pagination possible!

A responder is RECOMMENDED to send posts in reverse chronological order by post timestamp, so that, if the requester sees that they received a number of hashes equal to their limit, they can be assured that they now have all posts newer than the oldest post hash the responder did send, and can make subsequent requests to paginate backwards in time.
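A rough sketch of how a requester might use that ordering to paginate backwards in time. Everything here is a simplification for illustration: posts are `(timestamp, hash)` tuples, `respond` stands in for a remote peer, the time range is treated as half-open (`time_start <= t < time_end`), and timestamps are assumed unique. None of those details come from the spec text quoted here.

```python
from typing import List, Tuple

Post = Tuple[int, str]  # (timestamp, hash); a hypothetical stand-in


def respond(posts: List[Post], time_start: int, time_end: int, limit: int) -> List[Post]:
    """Responder: newest posts first, truncated to `limit` entries."""
    in_range = [p for p in posts if time_start <= p[0] < time_end]
    in_range.sort(key=lambda p: p[0], reverse=True)
    return in_range[:limit]


def fetch_all(posts: List[Post], time_start: int, time_end: int, limit: int) -> List[Post]:
    """Requester: keep asking for older pages until a short page arrives."""
    collected: List[Post] = []
    while True:
        page = respond(posts, time_start, time_end, limit)
        collected.extend(page)
        if len(page) < limit:
            # Fewer hashes than the limit: nothing older remains in range.
            return collected
        # A full page: everything newer than its oldest post is now known,
        # so the next request ends at that post's timestamp.
        time_end = page[-1][0]
```

If several posts shared a timestamp, this naive loop could skip some of them at a page boundary, so a real implementation would need a tie-breaking rule.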

AljoschaMeyer · 2024-02-01T17:51:49Z

wire.md

+
+If `future = 0`, only the latest state posts will be included, and the request
+MUST NOT be held open.
+


Why is future boolean and not simply a limit int analogous to that of Channel Time Range Requests?

I don't think a limit would be useful here without a means of paginating through the response set of hashes.

AljoschaMeyer · 2024-02-01T17:53:05Z

wire.md

+skipping the first `offset` entries).
+
+The `offset` field can be combined with the `limit` field to allow clients to
+paginate through the list of all channel names known by a peer.


To paginate, they must be consistently sorted. The spec could require this? It might even require a particular ordering, to make pagination consistent irrespective of the peer.

I mention the stable sort requirement in the Channel List Response message, where it's used. I'll think about whether it makes sense to specify a sort method.

I thought about it, and agree that a sort should be specified.

The channel names SHOULD be sorted lexicographically in ascending order, so that requesters can effectively send subsequent requests to paginate through the results.
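As an illustration of why the consistent sort matters, here is a minimal sketch of `offset`/`limit` pagination over a sorted channel list. The function names are hypothetical, Python's default string ordering (by code point) stands in for "lexicographically", and `limit` is assumed to be at least 1; the spec may define these details differently.

```python
from typing import List


def channel_list_response(known: List[str], offset: int, limit: int) -> List[str]:
    """Responder: sort ascending, skip `offset` entries, return up to `limit`."""
    ordered = sorted(known)
    return ordered[offset:offset + limit]


def list_all_channels(known: List[str], limit: int) -> List[str]:
    """Requester: advance `offset` by `limit` until a short page arrives."""
    names: List[str] = []
    offset = 0
    while True:
        page = channel_list_response(known, offset, limit)
        names.extend(page)
        if len(page) < limit:
            return names
        offset += limit
```

Without a stable order on the responder's side, consecutive pages could overlap or skip names.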

AljoschaMeyer · 2024-02-01T17:55:35Z

wire.md

+
+A responder MUST send a Hash Response message with `hash_count = 0` to indicate
+that they do not intend to return any further hashes for the given `req_id` and
+they have concluded the request on their side.


This makes responses that consist of zero hashes indistinguishable from there-might-be-data-but-I-wont-deliver-it.

I think that is true more generally -- the recipient can't distinguish the "i don't want to send you more data" from the "i don't have any more data to send you" case.

@AljoschaMeyer Is there a case where knowing the difference would be helpful?

Fair =D I can't immediately come up with one.

wire.md

keks · 2024-02-03T23:39:25Z

Agreed, this was a nice read! I have left a few comments inline.

Regarding the version negotiation: I think right now it's not great that there is no cryptographic agreement on the major version, and I suggest using the Noise prologue to achieve it. And as long as there are no features to be negotiated, maybe it's not needed? Either way, the minor version negotiation could theoretically happen in the ciphertext now.

Regarding the paper I posted on fragmentation: I only skimmed it a long time ago, and the only thing I took away from it was "boundary-hiding is harder than just encrypting the length prefixes", because you usually can see where ciphertexts start and end by watching transmission patterns. It's definitely a vector of leakage, but when I think about what the actual concern is, then I think it boils down to fingerprinting: I see that you received a lot of ciphertexts that were the same size as that other person's, so you are likely in a chat group. If I recall correctly, the InterMAC stuff was a bit complicated to implement, so I'd understand if you consider this out of scope. But maybe you can add some padding inside the ciphertext, so messages look more uniform?

Looking forward to any follow-ups :)

Oh and yeah, no compensation please.
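To illustrate the padding idea mentioned above, here is one possible shape it could take. This is purely a sketch and not something the spec defines: the 4-byte length prefix, the zero-byte filler, and the 256-byte bucket size are all assumptions chosen for the example. The padded bytes would then be what gets encrypted.

```python
import struct


def pad(message: bytes, bucket: int = 256) -> bytes:
    """Prefix the real length, then zero-fill up to a multiple of `bucket`."""
    framed = struct.pack("<I", len(message)) + message
    remainder = len(framed) % bucket
    if remainder:
        framed += b"\x00" * (bucket - remainder)
    return framed


def unpad(padded: bytes) -> bytes:
    """Recover the original message using the length prefix."""
    (length,) = struct.unpack_from("<I", padded, 0)
    return padded[4:4 + length]
```

With this scheme, every message of up to 252 bytes produces the same 256-byte plaintext, so their ciphertexts would be the same size as well.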

hackergrrl · 2024-02-06T01:44:04Z

@AljoschaMeyer @keks Thank you so much for your notes, wow! It's going to take me a bit to work through all of this, but I feel tremendous gratitude for y'all taking the time. 💜

RangerMauve

Cool! I like how it's focused on gossip. It'd be neat to compare performance against the hypercore version.

handshake.md

RangerMauve · 2024-02-21T17:14:43Z

handshake.md

+
+The pre-shared key MUST be mixed into the handshake state as per the rules in
+*9. Pre-shared symmetric keys* of the Noise specification.
+


Yeah feels like this won't scale great for a bunch of cabals being replicated on one machine. I assume you'd need separate ports for each advertisement in the peer discovery process?

wire.md

RangerMauve · 2024-02-21T17:40:26Z

wire.md

+cause ordering problems if a client's hardware clock is skewed, or a timestamp
+is spoofed.
+
+Implementations are RECOMMENDED to set and utilize links on chat messages


Mind elaborating on why it's recommended instead of mandatory?

It's recommended rather than mandatory because requiring it would add to the work implementors need to do, which is an accessibility concern. Keeping it optional makes it possible for more minimal implementations to exist.

wire.md

Co-authored-by: Aljoscha Meyer <[email protected]>

hackergrrl · 2024-02-21T19:25:51Z

Thanks very much for the notes @RangerMauve!

keks

Oh my god I just realized somehow my review hasn't been posted?? I thought I posted it with my comment! One of these days I'll learn to github...

handshake.md

keks · 2024-02-03T19:20:03Z

handshake.md

+Protocol must be encoded and decoded.
+
+At a high level, all Cable Wire Protocol messages need to be passed through
+Noise for encryption, and then prefixed with an encrypted length indicator.


and then prefixed with an encrypted length indicator

I don't think this came up again, or did I miss it? I only saw plaintext length prefixes, and only for fragmentation, not for the simple case.

Oh, I see the confusion. I originally used encryption for length prefixes, but then removed it... and now have re-added it. 😅

keks · 2024-02-03T19:23:55Z

handshake.md

+these steps:
+
+1. Compute the total length of all of the cipertexts fragments, in bytes, as
+   `len`, a 32-bit unsigned little endian integer.


Each ciphertext also carries a 16 byte MAC tag for authentication. That's also why the maximum plaintext length isn't 65535 (== u16::MAX) but 65519 (== u16::MAX - 16). So I suppose the total length of all ciphertexts is plaintext.len + 16 * ceil(plaintext.len/65519), or when using integer division, plaintext.len + 16 * ((plaintext.len + 65518) / 65519).

keks · 2024-02-03T22:36:03Z

wire.md

+- `varint`: a variable-length unsigned integer. 
+
+### 6.2 Post Formats
+


I also found this pretty confusing.

keks · 2024-02-03T22:39:57Z

wire.md

+
+Regarding the above section (5.2.3 Limits), hashes that are deduplicated by an
+intermediary peer, and thus not transmitted back to the requester, do not count
+against the `limit`.


From the wording I would assume that is the case. They can request the posts/hashes of a time range that does not contain the message they just received 100 times.

wire.md

keks · 2024-02-03T23:09:35Z

wire.md

+
+A responder MUST send a Hash Response message with `hash_count = 0` to indicate
+that they do not intend to return any further hashes for the given `req_id` and
+they have concluded the request on their side.


I think that is true more generally -- the recipient can't distinguish the "i don't want to send you more data" from the "i don't have any more data to send you" case.

handshake.md

Co-authored-by: Jan Winkelmann <[email protected]>

hackergrrl · 2024-04-19T01:23:58Z

After consideration, I've decided to remove TTL from this version of the protocol. It's not yet clear that it's a necessary feature, and its removal simplifies implementation logic.

hackergrrl · 2024-04-22T18:18:40Z

@AljoschaMeyer @keks Hey y'all! How would you like to be credited as contributors (if at all)? If you can give me the verbatim name you'd like to use I can joyfully add it. :)

hackergrrl · 2024-04-22T18:40:20Z

Hey y'all! I'm going to consider feedback concluded at 8am Pacific on Wednesday, April 24th, and merge in this work. :)

AljoschaMeyer · 2024-04-23T08:28:00Z

@AljoschaMeyer @keks Hey y'all! How would you like to be credited as contributors (if at all)? If you can give me the verbatim name you'd like to use I can joyfully add it. :)

Huh, "contributor" feels a bit strong to me. If you want to mention me (as "Aljoscha Meyer") for "providing feedback", or something along those lines, that works.

hackergrrl · 2024-04-23T18:59:15Z

@AljoschaMeyer

Huh, "contributor" feels a bit strong to me. If you want to mention me (as "Aljoscha Meyer") for "providing feedback", or something along those lines, that works.

For me, Contributor feels right. You definitely contributed. I see a lot of value in your comments, and the spec gained tremendously from them in my opinion. So far "Contributor" has included everyone who has contributed anything, including fixes to typos. It all counts to me!

If you still feel strongly about it though, I will of course honour your request and add an additional category for you called "Providing Feedback". 💙

AljoschaMeyer · 2024-04-23T21:35:57Z

In that case, "contributor" is fine for me =)

keks · 2024-04-24T14:42:06Z

If you feel like I contributed, I'll take it, but I don't have strong feelings either way. If you want to mention me (in whatever role), you can call me "Jan Winkelmann".

hackergrrl · 2024-04-25T00:24:46Z

Merged! Thank you so much y'all for your valuable time and comments. If y'all end up having thoughts re: any of the revision comments I made, please feel free to respond here still, or open a new issue.

cblgh · 2024-04-25T06:25:20Z

🌼🌼🌼🌼🌼🌼🌼🌼🌼🌼🌼🌼🌼🌼🌼!!!!

AljoschaMeyer · 2024-04-25T06:46:35Z

Woop, congratulations!

Cable Protocol 1.0-draft

4e2e46b

AljoschaMeyer reviewed Feb 1, 2024

hackergrrl changed the title ~~[DRAFT] Request for Comments~~ Request for Comments Feb 6, 2024

RangerMauve reviewed Feb 21, 2024

hackergrrl and others added 17 commits February 21, 2024 10:33

Fix grammar mistake

72ed7ef

Co-authored-by: Aljoscha Meyer <[email protected]>

Fix grammar mistake

d529a85

Co-authored-by: Aljoscha Meyer <[email protected]>

Fix grammar mistake

4a7838a

Co-authored-by: Aljoscha Meyer <[email protected]>

Improve sentence clarity

01c0cf2

Co-authored-by: Aljoscha Meyer <[email protected]>

typo

05a2d47

Fix grammar mistake

c010d1b

Fix grammar mistake

d0cdbf5

Clarify "head" definition

e47e0d5

Fix semantic mistake

8f81b75

Attempt to clarify sentence

e06727c

Attempt to clarify description

488d8cb

Clarify sentence

d2d96a5

Co-authored-by: Aljoscha Meyer <[email protected]>

Improve sentence clarity

9d7a457

Co-authored-by: Aljoscha Meyer <[email protected]>

Make more precise the action of signing a post

93b92b6

Co-authored-by: Aljoscha Meyer <[email protected]>

Remove redundant clause.

3494ea8

Remove circuit_id mention

f91667d

Change ttl varint use to u8

bd68c76

keks reviewed Feb 22, 2024

hackergrrl and others added 4 commits March 1, 2024 14:01

Attempt to improve clarity of intro paragraph.

c410719

Remove the Version Exchange phase.

6177a60

Make the prologue bytes very clear.

26fb95e

Fix Noise spec reference.

a61a14b

Co-authored-by: Jan Winkelmann <[email protected]>

hackergrrl added 5 commits April 12, 2024 18:23

Change req_id to be 64-bit.

642e512

Revise expected responses for a Cancel Request.

769725e

Remove MUST about ignoring ttl on Cancel Requests.

8def29e

Revise logic of time_end.

daf50e5

Remove TTL.

61003dc

hackergrrl added 11 commits April 18, 2024 18:25

Fix section numberings.

6ef408a

Remove Deduplication section.

e409c19

Add specificity to user membership clauses.

ba7633b

Specify what posts to send if a limit is reached.

dbca9c8

Specify sort for channel list responses.

fd7c217

Curve25519 -> Ed25519

e396c4f

Fix typo in element-of operator character.

58e999a

Define and use BLAKE2b function for clarity.

2b1360b

Add sections on post ingesting and a special case of returning state.

bb05ffc

Fix typos.

ba6f52d

Regenerate table of contents.

52d6921

Add section on keeping/discarding posts.

eec2a6b

Add contributors.

56a2ef8

hackergrrl force-pushed the request-for-comments branch from 4b59a57 to 56a2ef8 on April 24, 2024 17:00

hackergrrl closed this Apr 25, 2024

