Batch index lookups on BigTable #971
Merged
tomwilkie merged 4 commits into cortexproject:master on Sep 3, 2018
Conversation
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
csmarchbanks (Contributor) left a comment:
Some small comments/questions. Overall LGTM
@@ -0,0 +1,77 @@
package util
Contributor:
Why the util package instead of another thing in chunk? I think everywhere that uses this right now uses chunk as well.
Contributor (Author):
I agree util packages suck, but I'm trying to break the chunk package up; it's a bit of a behemoth, and I regularly run into circular imports with it. Hence this got stuck in a new package.
	return entries, nil
})
return entries, err
Contributor:
Keeping the log message might be nice.
pkg/chunk/gcp/storage_client.go (Outdated)
	strings.TrimPrefix(b.items[index].Column, b.columnPrefix),
)
func (b *bigtableReadBatchColumnKey) RangeValue() []byte {
	return []byte(strings.TrimPrefix(b.items[b.i].Column, b.columnPrefix))
Contributor:
Consider inlining the columnPrefix since it is static? Then it would not need to be on bigtableReadBatchColumnKey.
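A minimal sketch of what the reviewer is suggesting: since the prefix never changes, it can be a package-level constant instead of a struct field. The constant name and value here are placeholders; the actual prefix in the PR may differ, and `readItem` stands in for the `bigtable.ReadItem` type.

```go
package main

import (
	"fmt"
	"strings"
)

// Hypothetical constant standing in for the static column prefix;
// the real name and value in the PR may differ.
const columnPrefix = "c/"

// readItem is a minimal stand-in for bigtable.ReadItem.
type readItem struct {
	Column string
}

// With the prefix inlined as a constant, the struct no longer needs
// to carry a columnPrefix field.
type bigtableReadBatchColumnKey struct {
	items []readItem
	i     int
}

func (b *bigtableReadBatchColumnKey) RangeValue() []byte {
	return []byte(strings.TrimPrefix(b.items[b.i].Column, columnPrefix))
}

func main() {
	b := &bigtableReadBatchColumnKey{items: []readItem{{Column: "c/foo"}}}
	fmt.Println(string(b.RangeValue())) // prints "foo"
}
```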
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
tomwilkie added a commit to grafana/cortex that referenced this pull request on Sep 3, 2018:
…ch-index-lookups"" This reverts commit 8b74f90. Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
tomwilkie added a commit that referenced this pull request on Sep 3, 2018:
Revert "Merge pull request #971 from grafana/batch-index-lookups"
tomwilkie added a commit to grafana/cortex that referenced this pull request on Sep 11, 2018:
…ch-index-lookups"" This reverts commit 8b74f90. Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
Contributor:
This was reverted and replaced by #981
With the series index, resolving series IDs to chunk IDs can issue hundreds of thousands of queries to the index. The cache does a good job of handling most of these, but we should batch the rest.
As part of this I changed the BigTable backend to only read full rows. This shouldn't impact performance, since the index cache was already forcing us to read full rows anyway. I also changed the ReadBatch interface to be iterator-style, so we can filter it down without copies.
Fixes #969
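A rough sketch of what an iterator-style ReadBatch can look like, to illustrate why it allows filtering without copies: callers walk entries in place rather than receiving a materialized slice. The interface and type names here are illustrative assumptions, not the actual Cortex API.

```go
package main

import "fmt"

// Hypothetical iterator-style read batch, modeled on the ReadBatch change
// described above; the real interface in cortexproject/cortex may differ.
type ReadBatchIterator interface {
	Next() bool
	RangeValue() []byte
	Value() []byte
}

type entry struct{ rangeValue, value []byte }

// sliceReadBatch is a toy in-memory implementation backing the iterator.
type sliceReadBatch struct {
	entries []entry
	i       int // current position; starts before the first entry
}

func newSliceReadBatch(entries []entry) *sliceReadBatch {
	return &sliceReadBatch{entries: entries, i: -1}
}

func (b *sliceReadBatch) Next() bool         { b.i++; return b.i < len(b.entries) }
func (b *sliceReadBatch) RangeValue() []byte { return b.entries[b.i].rangeValue }
func (b *sliceReadBatch) Value() []byte      { return b.entries[b.i].value }

func main() {
	it := newSliceReadBatch([]entry{
		{[]byte("r1"), []byte("v1")},
		{[]byte("r2"), []byte("v2")},
	})
	// Callers can skip entries during iteration instead of copying
	// the batch into a filtered slice.
	for it.Next() {
		fmt.Printf("%s=%s\n", it.RangeValue(), it.Value())
	}
}
```

The design point is that filtering becomes a predicate applied inside the loop, so only the surviving entries are ever copied out.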