Write only as many chunk bytes as needed by bboreham · Pull Request #1044 · cortexproject/cortex

bboreham · 2018-09-30T15:01:12Z

Fixes #631

~~Note we had not previously edited the chunk code copy-pasted from Prometheus (v1).~~

Only fully implemented for varbit chunks at present; I don't think we're intending to use other kinds.

Varbit chunks hold metadata in a header and a footer at the end of the chunk, however the footer is only used while appending more samples, so can be discarded when the chunk is "closed" and written to store.

A flag is added to make the new behaviour optional on writing, since shorter chunks cannot be read by older versions of Cortex so if you roll back after turning it on, you lose the ability to query any data written with short chunks.

The shorter encoding is also used when transferring chunks from one ingester to another on rolling updates.

tomwilkie · 2018-09-30T17:51:06Z

Would you mind adding some unit tests?

bboreham · 2018-09-30T17:56:07Z

I should have given some thought to making this optional. Once you write some smaller chunks, you need the code change to read them back, so you'd need to be sure you weren't rolling back past this change.

bboreham · 2018-09-30T17:56:44Z

Would you mind adding some unit tests?

The existing unit tests exercise this code. What extra were you thinking of?

tomwilkie · 2018-10-01T10:40:50Z

The existing unit tests exercise this code.

Theres nothing in this package which exercises this code. The tests in pkg/chunk/ cover it, but aren't exactly looking for bugs in the chunk encoding - for instance, TestChunkCodec only puts a single sample in the chunk.

I think it would be good to have a test which appends an increasing number of samples to a chunk, encodes it via a Marshal, decodes it and then checks all the samples are there.

tomwilkie · 2018-10-01T10:41:07Z

Also needs a DCO.

tomwilkie · 2018-10-01T10:41:39Z

Note we had not previously edited the chunk code copy-pasted from Prometheus (v1).

#1029 edited the code here too.

tomwilkie · 2018-10-02T18:44:42Z

I think it would be good to have a test which append..

I added such a test in #1048

tomwilkie · 2018-11-19T13:12:02Z

@bboreham now that #1048 is merged, want me to update this?

bboreham · 2018-11-19T13:27:38Z

Sure, if you like. I left it alone because I saw only a 5% overall reduction in storage size, but I still think it's worth doing if you have time.

- Only effective for varbit chunks at present. - Also allow undersized varbit chunks to be unmarshalled. - Use varbit encoding in chunk tests, since that's what we use most commonly in production - Remove chunk.Unmarshal(io.Reader), its only used in tests. Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>

tomwilkie · 2018-11-19T16:38:19Z

There you go @bboreham; I ended up remove the Unmarshal func on chunks, as it was only used in tests.

tomwilkie · 2018-11-19T16:50:25Z

(Its an LGTM from me, but I'll let you merge it @bboreham)

bboreham · 2018-11-22T17:46:20Z

Current status is I believe I have tried this and I believe it isn't working, in that it still outputs a bunch of zeros at the end of the chunk.
Further research required.

bboreham · 2018-11-24T14:53:00Z

I take it back: the code where I thought this had been used was turned off.

bboreham · 2018-11-29T14:52:27Z

I did a more effective test. Everything worked when I had ingesters rolled forward and queriers to match, but when I tried to roll back the ingesters the chunk hand-over failed:

level=warn ts=2018-11-29T14:31:32.59474578Z caller=gokit.go:46 method=/cortex.Ingester/TransferChunks duration=25.357151ms err="insufficient bytes copied from buffer during unmarshaling, want 1024, got 155" msg="gRPC\n"

this is something I hadn't thought about: it's actually a nice efficiency gain for the old-style chunks to do the hand-over without padding, but you can't fail back to older ingesters.

So, what to do? Make it optional on the write path so we can update everything and then turn it on?

tomwilkie · 2018-11-29T17:14:39Z

cmd/ingester/main.go

Could this be done more like PreallocConfig in the client pkg? https://github.com/cortexproject/cortex/blob/master/pkg/ingester/client/timeseries.go#L13

tomwilkie · 2018-11-29T17:15:43Z

One nit, otherwise LGTM

This allows all components to be rolled out in a mode which accepts either size of chunk, then changed over to write the new way at a later date. Signed-off-by: Bryan Boreham <bryan@weave.works>

Also un-export MarshalLen since it is not called from outside. Signed-off-by: Bryan Boreham <bryan@weave.works>

tomwilkie · 2018-11-30T07:22:09Z

Nice, thanks! LGTM

bboreham · 2018-11-30T08:47:41Z

This is a bit broken, because you need the flag on the reading side but I only plumbed it in to ingester.

What I wanted was that all components could read chunks of either style, and the flag changes what ingester writes.

bboreham · 2018-12-02T16:17:55Z

Still broken - the chunks passed from one ingester to another in hand-over cannot be appended to.
Don't turn the feature on until this is fixed!

tomwilkie force-pushed the shrunk-chunks branch from 8ab1310 to 74cd091 Compare November 19, 2018 16:35

bboreham force-pushed the shrunk-chunks branch from 0e2fe2f to b8f3a78 Compare November 29, 2018 15:36

tomwilkie reviewed Nov 29, 2018

View reviewed changes

bboreham added 2 commits November 29, 2018 17:47

Add an option to control whether varbit chunks are saved full-size

a2a5a3f

This allows all components to be rolled out in a mode which accepts either size of chunk, then changed over to write the new way at a later date. Signed-off-by: Bryan Boreham <bryan@weave.works>

Make varbitChunk.Size() more accurate

5f8de0c

Also un-export MarshalLen since it is not called from outside. Signed-off-by: Bryan Boreham <bryan@weave.works>

bboreham force-pushed the shrunk-chunks branch from 7e07298 to 5f8de0c Compare November 29, 2018 17:48

bboreham merged commit 3c868b8 into master Nov 30, 2018

bboreham mentioned this pull request Nov 30, 2018

Allow smaller varbit chunks on read #1137

Merged

tomwilkie deleted the shrunk-chunks branch January 3, 2019 11:44

bboreham mentioned this pull request Jan 4, 2019

-store.fullsize-chunks=false breaks ingester hand-overs #1163

Closed

Conversation

bboreham commented Sep 30, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tomwilkie commented Sep 30, 2018

Uh oh!

bboreham commented Sep 30, 2018

Uh oh!

bboreham commented Sep 30, 2018

Uh oh!

tomwilkie commented Oct 1, 2018

Uh oh!

tomwilkie commented Oct 1, 2018

Uh oh!

tomwilkie commented Oct 1, 2018

Uh oh!

tomwilkie commented Oct 2, 2018

Uh oh!

tomwilkie commented Nov 19, 2018

Uh oh!

bboreham commented Nov 19, 2018

Uh oh!

tomwilkie commented Nov 19, 2018

Uh oh!

tomwilkie commented Nov 19, 2018

Uh oh!

bboreham commented Nov 22, 2018

Uh oh!

bboreham commented Nov 24, 2018

Uh oh!

bboreham commented Nov 29, 2018

Uh oh!

tomwilkie Nov 29, 2018

Choose a reason for hiding this comment

Uh oh!

tomwilkie commented Nov 29, 2018

Uh oh!

tomwilkie commented Nov 30, 2018

Uh oh!

bboreham commented Nov 30, 2018

Uh oh!

bboreham commented Dec 2, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bboreham commented Sep 30, 2018 •

edited

Loading