TLV testcases by rustyrussell · Pull Request #631 · lightning/bolts

rustyrussell · 2019-07-09T11:06:52Z

Came across some new requirements and a few tool fixes along the way...

t-bast

LGTM, thanks for this!

cfromknecht

Thanks for getting the ball rolling on the test vectors @rustyrussell! I'm about half way through, but the main thing i'm noticing so far is a discrepancy in the way that the varints are being encoded. CompactSize encodes any multi-byte values using little endian, while the tests are using big-endian.

cfromknecht · 2019-07-10T04:01:37Z

+1. Invalid stream: 0xfd01
+2. Reason: type truncated
+
+1. Invalid stream: 0xfd0001 01


Suggested change

1. Invalid stream: 0xfd0001 01

1. Invalid stream: 0xfd0100

the current value is minimally encoded, remove trailing 0x01

Thanks for getting the ball rolling on the test vectors @rustyrussell! I'm about half way through, but the main thing i'm noticing so far is a discrepancy in the way that the varints are being encoded. CompactSize encodes any multi-byte values using little endian, while the tests are using big-endian.

You're right :( Bitcoin strikes again...

I assume we should follow (and document!) Bitcoin CompactSize endian here, since that was the rationale for not inventing our own?

It should be implicit from the test vectors, but if we want to be more specific fine by me :)

fwiw I have a series of varint test vectors, should I make a separate pr with those?

Good catch @cfromknecht !
That shows I went too quickly when comparing my test suite and this PR, I'll spend more time on it today.

Unfortunately, my tests passed after I fixed the endian without fixing the data. I'll add some test vectors which only work in the correct endian.

And we need the trailing 01, otherwise it is invalid for a different reason: no length field. Though that should be 00 not 01.

shouldn't the check for whether the type is minimally encoded be applied before attempting to parse the length?

t-bast · 2019-07-10T11:45:44Z

+1. Invalid stream: 0x1f 00 0f 01 2a
+2. Reason: valid (ignored) tlv records but invalid ordering
+
+1. Invalid stream: 0x02 08 0000000000000231 02 08 0000000000000451


I think this is a duplicate of line 613.
Maybe replace by 0x0f 01 2a 0f 01 2b to disallow duplicate of unknown odd types?

Next line tests duplicate ignored. I'll just remove this one, good spotting!

cfromknecht · 2019-07-11T01:33:06Z

would also be useful to add this to the decoding failures:

Invalid stream: 0xffffffffffffffffff 00 00
Reason: type overflow

rustyrussell · 2019-07-11T02:27:33Z

would also be useful to add this to the decoding failures:
Invalid stream: 0xffffffffffffffffff 00 00
Reason: type overflow

I'm confused, why is that an overflow? That's just the max possible type, no? Which is odd, so that is valid (assuming unknown)?

rustyrussell · 2019-07-11T02:33:06Z

Please test the New Hotness, which contains all the fixes and some new vectors.

cfromknecht · 2019-07-11T02:35:23Z

I'm confused, why is that an overflow? That's just the max possible type, no? Which is odd, so that is valid (assuming unknown)?

The type is max uint64 and has length 0, which is optional and so we ignore it. The following type is 0, which wraps around and breaks monotonicity, and so is not canonical

cfromknecht · 2019-07-11T02:43:46Z

@rustyrussell here's how we detect this case https://github.com/lightningnetwork/lnd/pull/3061/files#diff-b776b065cfabb730273c018554f81204R218

t-bast · 2019-07-11T07:24:15Z

The type is max uint64 and has length 0, which is optional and so we ignore it. The following type is 0, which wraps around and breaks monotonicity, and so is not canonical

I agree, but the error message shouldn't be type overflow, it should be tlv records must be ordered by monotonically-increasing types or something like that.

cfromknecht · 2019-07-11T07:27:03Z

@t-bast sure not married to the name

t-bast

I integrated all of those in my test suite and everything is green!
ACK 99d3440 (and good luck with the annoying spell-checking failures)

Roasbeef · 2019-07-11T23:17:47Z

Please, let's not mix endianness in the protocol. There's no reason to inherit this wart of Bitcoin which makes it that much harder to understand.

cfromknecht · 2019-07-12T01:45:35Z

I made a PR that adds test vectors for the varint scheme. Given the recent discussion i made them using a big-endian encoding, but either way it's probably good to have something like this: #640

rustyrussell · 2019-07-16T01:53:07Z

OK, I'm about to add @cfromknecht 's max-then-min type test, and then rebase on top of #640...

rustyrussell · 2019-07-16T02:12:00Z

OK, so @cfromknecht 's ffff to 0 test fails for other reasons: 0 is even and unknown, and there's no length field. I prefer tests which test one thing at a time; I'll change n2s tlv1 to be 0 type, then this can be an invalid under n2 test: 0xffffffffffffffffff 00 00 00

rustyrussell · 2019-07-16T03:32:46Z

"This time for sure!" Please re-retest!

cfromknecht · 2019-07-16T03:56:21Z

OK, so @cfromknecht 's ffff to 0 test fails for other reasons: 0 is even and unknown, and there's no length field. I prefer tests which test one thing at a time; I'll change n2s tlv1 to be 0 type, then this can be an invalid under n2 test: 0xffffffffffffffffff 00 00 00

The way i have it implemented, we check that the types are canonically ordered before parsing the length (or checking if the type is known).

I think this is also relevant to my other comment where the canonical varint check for the type is being applied after parsing the length. In our implementation we check that each varint is canonical at the time it is parsed

(I think this last bit might be resolved if the varint also passed the BigSize test vectors?)

cfromknecht · 2019-07-16T03:57:05Z

I'll proceed in updating to the latest test vectors and report back :)

rustyrussell · 2019-07-16T04:14:30Z

OK, so @cfromknecht 's ffff to 0 test fails for other reasons: 0 is even and unknown, and there's no length field. I prefer tests which test one thing at a time; I'll change n2s tlv1 to be 0 type, then this can be an invalid under n2 test: 0xffffffffffffffffff 00 00 00

The way i have it implemented, we check that the types are canonically ordered before parsing the length (or checking if the type is known).

I think this is also relevant to my other comment where the canonical varint check for the type is being applied after parsing the length. In our implementation we check that the varint is canonical at the time it is parsed

Ah, OK. We run our checks in the following order instead (kinda arbitrary, really):

check type decodes.
check length decodes.
check there's enough data for length.
check type ordering.
lookup type.
if found:
1. Decode value. If that fails, fail.
2. Check no bytes remain.
Otherwise:
1. If type is even, fail.
2. if type is odd, skip.

It's nice to have implementations do it in different orders, though...

cfromknecht · 2019-07-16T04:24:55Z

That makes sense then :)

We do

check type decodes and canonical
check type ordering
check length decodes and canonical
lookup type
if found:
1. Decode value. If that fails, fail.
Otherwise:
1. If type is even, fail.
2. If type is odd, skip.

The check that there's enough data for length is applied while decoding or discarding the value

lightning/bolts#631 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

t-bast · 2019-07-16T07:48:06Z

Tested-ACK d70fe60

cfromknecht

@rustyrussell @t-bast updated our implementation with the latest test vectors, I think we're about ready! Noticed a few small things, but as is our implementation is all green 🚀

cfromknecht · 2019-07-17T01:30:01Z

+1. Invalid stream: 0x1f 00 1f 01 2a
+2. Reason: duplicate TLV type (ignored)
+
+The following TLV stream in namespace `n2` should trigger a decoding


i'm wondering, should this produce an error in either namespace? sending a non-canonical stream is more severe than sending an unknown required type, so failing with unknown required type on n1 would effectively downgrade the error. if we were to standardize something like:

- if stream is not canonical: - close channel - if stream has unknown required type: - disconnect

our implementations will behave differently

The purpose of the toy example was to just to highlight that we might need take different actions when encountering a parsing failure vs a negotiation failure.

With the current proposal feature negotiation is bundled alongside message deserialization, so we are forced to differentiate the errors at the tlv encoding level and be consistent across implementations.

t-bast · 2019-07-17T07:55:41Z

It's probably a nit, but I'm not a big fan of the name BigSize. It's not obvious at all that the Big is for big-endian especially since we're doing big-endian everywhere (so big-endian should be implicit). It sounds a lot like a BigInt type which is really something else.
What about CompactSizeBE instead? Or we don't name it at all and just mention that varint is a variable-length, unsigned integer encoding using Bitcoin's CompactSize format with big-endian encoding instead of little-endian?
WDYT?

cfromknecht · 2019-07-17T21:22:21Z

It's not obvious at all that the Big is for big-endian especially since we're doing big-endian everywhere (so big-endian should be implicit). It sounds a lot like a BigInt type which is really something else.

It's supposed to be a somewhat of a pun :P

What about CompactSizeBE instead?

Was hoping for something a little less bland tbh

Or we don't name it at all and just mention that varint is a variable-length, unsigned integer encoding using Bitcoin's CompactSize format with big-endian encoding instead of little-endian?
WDYT?

I'd prefer not to have this verbose description everywhere we want to reference it, I think we need some name just to make it easy to refer to in the spec. It was already painful enough to write it twice.

Didn't want to waste too much time on a name when drafting the proposal, and so just went with BigSize. Definitely open to more suggestions if you have any! :)

lightning/bolts#631 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

t-bast · 2019-07-18T07:51:07Z

I don't feel very strongly about the name and don't have a better choice than either BigSize or CompactSizeBE, I just want it to be simple to understand for spec readers without too much context.
If you and @rustyrussell like BigSize, let's go for it ;)

For some reason (typo?) we only allowed "2", not other numbers! Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

We were swallowing the unused line after `data`, but it's normal to do: ``` 1. tlvs: `n1` 2. types: 1. type: 1 (`tlv1`) 2. data: * [`tu64`:`amount_msat`] 1. type: 2 (`tlv2`) 2. data: * [`short_channel_id`:`scid`] ``` Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

We didn't explicitly say that the TLV is bad if length exceeds the message length! We didn't specify whether to ignore extra bytes: we should. Similarly, contents of values must be minimal (i.e. tu64 etc). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

@t-bast

These are based on @t-bast's vectors from lightning#607, with a few more cases: 1. Explicitly test encodings for 253, 254 and 255. 2. Use BigSize and make sure tests break badly if endian parsing is wrong.' 3. Test wrap-around of type encodings in stream. Many thanks to @t-bast and @cfromknecht for their contributions and testing Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

rustyrussell · 2019-07-22T23:24:31Z

Rebased and squashed commits ready for merge.

rustyrussell requested a review from t-bast July 9, 2019 11:06

t-bast approved these changes Jul 9, 2019

View reviewed changes

cfromknecht requested changes Jul 10, 2019

View reviewed changes

t-bast reviewed Jul 10, 2019

View reviewed changes

t-bast self-requested a review July 10, 2019 11:47

rustyrussell requested a review from cfromknecht July 11, 2019 02:33

t-bast approved these changes Jul 11, 2019

View reviewed changes

t-bast mentioned this pull request Jul 15, 2019

bolt04: Variable hop_payload for the sphinx onion #619

Merged

rustyrussell force-pushed the tlv-testcases branch from 99d3440 to d70fe60 Compare July 16, 2019 03:32

rustyrussell added a commit to rustyrussell/lightning that referenced this pull request Jul 16, 2019

run-tlvstream: update to use latest BOLT draft.

7388cd7

lightning/bolts#631 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

cfromknecht reviewed Jul 17, 2019

View reviewed changes

rustyrussell added a commit to rustyrussell/lightning that referenced this pull request Jul 18, 2019

tests: add test for tlvstream (from BOLT 1 test vectors).

bf7cb15

lightning/bolts#631 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

rustyrussell added a commit to ElementsProject/lightning that referenced this pull request Jul 18, 2019

tests: add test for tlvstream (from BOLT 1 test vectors).

3477034

lightning/bolts#631 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

rustyrussell mentioned this pull request Jul 18, 2019

Base AMP #643

Merged

rustyrussell mentioned this pull request Jul 19, 2019

Simple tooling fixes #650

Merged

niftynei added Meeting Discussion Raise at next meeting and removed Meeting Discussion Raise at next meeting labels Jul 22, 2019

rustyrussell added 3 commits July 23, 2019 08:49

tools/extract-formats.py: recognize numerics in field names.

654ea50

For some reason (typo?) we only allowed "2", not other numbers! Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

rustyrussell force-pushed the tlv-testcases branch from dd7893b to c499db4 Compare July 22, 2019 23:20

rustyrussell added 2 commits July 23, 2019 08:52

spellcheck: allow space-separated hex, and a few new terms.

2cc62a0

Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

rustyrussell force-pushed the tlv-testcases branch from c499db4 to fa09bfe Compare July 22, 2019 23:24

rustyrussell merged commit aa33af0 into lightning:master Jul 22, 2019

Conversation

rustyrussell commented Jul 9, 2019

Uh oh!

t-bast left a comment

Choose a reason for hiding this comment

Uh oh!

cfromknecht left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cfromknecht commented Jul 11, 2019

Uh oh!

rustyrussell commented Jul 11, 2019

Uh oh!

rustyrussell commented Jul 11, 2019

Uh oh!

cfromknecht commented Jul 11, 2019

Uh oh!

cfromknecht commented Jul 11, 2019

Uh oh!

t-bast commented Jul 11, 2019

Uh oh!

cfromknecht commented Jul 11, 2019

Uh oh!

t-bast left a comment

Choose a reason for hiding this comment

Uh oh!

Roasbeef commented Jul 11, 2019

Uh oh!

cfromknecht commented Jul 12, 2019

Uh oh!

rustyrussell commented Jul 16, 2019

Uh oh!

rustyrussell commented Jul 16, 2019

Uh oh!

rustyrussell commented Jul 16, 2019

Uh oh!

cfromknecht commented Jul 16, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cfromknecht commented Jul 16, 2019

Uh oh!

rustyrussell commented Jul 16, 2019

Uh oh!

cfromknecht commented Jul 16, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

t-bast commented Jul 16, 2019

Uh oh!

cfromknecht left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

t-bast commented Jul 17, 2019

cfromknecht commented Jul 16, 2019 •

edited

Loading

cfromknecht commented Jul 16, 2019 •

edited

Loading