One configurable cache per `repository pool` by kuba-- · Pull Request #464 · src-d/gitbase

kuba-- · 2018-09-10T11:09:04Z

This PR closes #440

Changes:

- Updated go-git
- The repository interface now requires `Cache() cache.Object`
- One cache object per `RepositoryPool`

Signed-off-by: kuba-- <kuba@sourced.tech>

ajnavarro · 2018-09-10T11:26:03Z

+	Password      string         `short:"P" long:"password" default:"" description:"Password used for connection."`
+	PilosaURL     string         `long:"pilosa" default:"http://localhost:10101" description:"URL to your pilosa server." env:"PILOSA_ENDPOINT"`
+	IndexDir      string         `short:"i" long:"index" default:"/var/lib/gitbase/index" description:"Directory where the gitbase indexes information will be persisted." env:"GITBASE_INDEX_DIR"`
+	CacheSize     cache.FileSize `long:"cache" default:"536870912" description:"Object cache size" env:"GITBASE_CACHE_SIZE"`


Could you change the default to something more human-friendly? For example 100, and accept the value only in MB.

Yeah, sure. How about 512 MB?
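
The exchange above is about taking the cache size in megabytes rather than raw bytes. A minimal sketch of that conversion follows, assuming the option value is a plain integer number of MiB. `FileSize` and `MiByte` are local stand-ins that mirror go-git's `cache.FileSize` and `cache.MiByte`, and `cacheSizeFromMB` is a hypothetical helper, not the code merged in this PR.

```go
package main

import "fmt"

// FileSize is a local stand-in for go-git's cache.FileSize (a size in bytes).
type FileSize int64

// MiByte mirrors go-git's cache.MiByte: 1024 * 1024 bytes.
const MiByte FileSize = 1 << 20

// cacheSizeFromMB is a hypothetical helper: it converts a value given in MiB
// (what a user would type on the command line) into a size in bytes.
func cacheSizeFromMB(mb int) (FileSize, error) {
	if mb <= 0 {
		return 0, fmt.Errorf("cache size must be positive, got %d", mb)
	}
	return FileSize(mb) * MiByte, nil
}

func main() {
	size, err := cacheSizeFromMB(512)
	if err != nil {
		panic(err)
	}
	fmt.Println(size) // 536870912, the byte-valued default shown in the diff
}
```

With this shape the user-facing default can read `512` while the pool still receives a byte count.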

jfontan · 2018-09-10T12:44:05Z

+func NewRepositoryPool(maxCacheSize cache.FileSize) *RepositoryPool {
 	return &RepositoryPool{
 		repositories: make(map[string]repository),
+		cache:        cache.NewObjectLRU(maxCacheSize),


If I understand correctly, we are using the same cache for all repositories. While this may be OK now, we may have problems when partitions are used:

- ~~I'm not really sure of the cache implementation thread safety~~
- Recent objects from one repo may be evicted by others, making it run slower
- Concurrent use of the same cache may be slower (locking?)

I would have one cache per repo, a pool of caches that can be evicted, or one cache per partition. I think we can have one per repo for now and get back to it when partitions are in use.

The problem with having one cache per repo is that we will not be able to know how much memory gitbase will use. Maybe we can use 96 MiB * number of repositories as a default value. Alternatively, we could implement an LRU with two key layers, so that keys can be evicted from a specific repository. WDYT?

So maybe we can keep one cache in the pool, but add a mutex and prefix keys by repo ID?
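
The last suggestion (one shared cache, guarded by a mutex, with keys prefixed by repository ID) could look roughly like the sketch below. It is an illustration under assumptions, not what was merged and not go-git's `ObjectLRU`: `SharedLRU`, `entry`, and `key` are hypothetical names, objects are plain byte slices, and the budget is counted in bytes.

```go
package main

import (
	"container/list"
	"fmt"
	"sync"
)

// entry is one cached object together with its namespaced key.
type entry struct {
	key  string
	data []byte
}

// SharedLRU is a hypothetical byte-bounded LRU shared by all repositories.
type SharedLRU struct {
	mu      sync.Mutex
	maxSize int
	size    int
	order   *list.List               // front = most recently used
	items   map[string]*list.Element // namespaced key -> element in order
}

func NewSharedLRU(maxSize int) *SharedLRU {
	return &SharedLRU{maxSize: maxSize, order: list.New(), items: make(map[string]*list.Element)}
}

// key namespaces an object hash by repository ID.
func key(repoID, hash string) string { return repoID + "/" + hash }

// Put stores data, evicting least recently used entries until it fits.
func (c *SharedLRU) Put(repoID, hash string, data []byte) {
	c.mu.Lock()
	defer c.mu.Unlock()
	k := key(repoID, hash)
	if el, ok := c.items[k]; ok { // drop the old value before re-inserting
		c.size -= len(el.Value.(*entry).data)
		c.order.Remove(el)
		delete(c.items, k)
	}
	if len(data) > c.maxSize {
		return // larger than the whole budget: do not cache
	}
	for c.size+len(data) > c.maxSize {
		oldest := c.order.Back()
		e := oldest.Value.(*entry)
		c.order.Remove(oldest)
		delete(c.items, e.key)
		c.size -= len(e.data)
	}
	c.items[k] = c.order.PushFront(&entry{key: k, data: data})
	c.size += len(data)
}

// Get returns the cached data and marks the entry as recently used.
func (c *SharedLRU) Get(repoID, hash string) ([]byte, bool) {
	c.mu.Lock()
	defer c.mu.Unlock()
	el, ok := c.items[key(repoID, hash)]
	if !ok {
		return nil, false
	}
	c.order.MoveToFront(el)
	return el.Value.(*entry).data, true
}

func main() {
	c := NewSharedLRU(8)
	c.Put("repo-a", "h1", []byte("aaaa"))
	c.Put("repo-b", "h1", []byte("bbbb")) // same hash, different repository
	c.Put("repo-a", "h2", []byte("cccc")) // over budget: evicts repo-a/h1
	_, ok := c.Get("repo-a", "h1")
	fmt.Println(ok)
}
```

A single budget keeps total memory predictable, which is the concern raised against per-repo caches. The trade-off named in the review remains: every repository contends on the same lock, and a busy repository can still push out another repository's entries.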

jfontan · 2018-09-10T12:46:34Z

 // AddGitWithID adds a git repository to the pool. ID should be specified.
 func (p *RepositoryPool) AddGitWithID(id, path string) error {
-	return p.Add(gitRepo(id, path))
+	return p.Add(gitRepo(id, path, p.cache))


I wouldn't store the cache in the git/sivaRepo. If they are different instances (one per repo), it can use lots of memory.
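
The memory concern depends on whether each repository holds its own cache instance or only a reference to the pool's. A small sketch of the shared-reference case, under assumptions: `objectCache`, `repo`, and `pool` are simplified hypothetical stand-ins for go-git's `cache.Object` and gitbase's repository and `RepositoryPool` types, and the paths are made up.

```go
package main

import "fmt"

// objectCache is a simplified stand-in for an object cache with a size budget.
type objectCache struct{ maxSize int64 }

// repo keeps a pointer to the pool's cache, not a cache of its own.
type repo struct {
	id, path string
	cache    *objectCache
}

// pool owns the single cache and hands the same pointer to every repository.
type pool struct {
	repos map[string]*repo
	cache *objectCache
}

func newPool(maxSize int64) *pool {
	return &pool{repos: make(map[string]*repo), cache: &objectCache{maxSize: maxSize}}
}

// addGitWithID registers a repository that shares the pool's cache.
func (p *pool) addGitWithID(id, path string) {
	p.repos[id] = &repo{id: id, path: path, cache: p.cache}
}

func main() {
	p := newPool(512 << 20)
	p.addGitWithID("a", "/repos/a")
	p.addGitWithID("b", "/repos/b")
	// Both repositories point at the same cache instance.
	fmt.Println(p.repos["a"].cache == p.repos["b"].cache)
}
```

In this arrangement each repository costs one extra pointer, and the memory bound is the pool's single budget.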

kuba-- · 2018-09-14T13:54:00Z

@jfontan - rebased. Let's go with the simplest approach for now.

jfontan

Let's go with this approach and iterate over it. I agree.

kuba-- added 2 commits September 10, 2018 10:10

Update go-git

0876aa6

Signed-off-by: kuba-- <kuba@sourced.tech>

One cache for repository pool.

b1c8628

Signed-off-by: kuba-- <kuba@sourced.tech>

kuba-- added the proposal and performance labels Sep 10, 2018

Merge branch 'master' into cache-440

3d4dbb6

kuba-- requested review from a team and jfontan September 10, 2018 11:25

erizocosmico approved these changes Sep 10, 2018

ajnavarro reviewed Sep 10, 2018

jfontan suggested changes Sep 10, 2018

kuba-- added 2 commits September 13, 2018 13:22

cache in MB

54c3ba0

Signed-off-by: kuba-- <kuba@sourced.tech>

Merge branch 'master' of https://github.com/src-d/gitbase into cache-440

1d8e0c3

Signed-off-by: kuba-- <kuba@sourced.tech>

jfontan approved these changes Sep 14, 2018

ajnavarro approved these changes Sep 18, 2018

ajnavarro merged commit 1bb9a66 into src-d:master Sep 18, 2018

ajnavarro merged 5 commits into src-d:master from kuba--:cache-440
