Add Caffeine cache layer extension#2417
The MapCache concurrency problems are a pain (see #1836). Having a "local" cache that is nice and performant would be awesome.

This will fail under the java7 tests, as Caffeine requires java8.
The review comment below refers to this excerpt from CaffeineCache:

```java
{
  private static final Logger log = new Logger(CaffeineCache.class);
  private final Cache<String, byte[]> cache;
  private final AtomicReference<com.github.benmanes.caffeine.cache.stats.CacheStats> priorStats = new AtomicReference<>(
```
I would put this extension under client.cache.caffeine so that the package types don't take precedence. Then Druid's CacheStats could be fully qualified only in getStats() and the overall verbosity reduced.
Moved around at https://github.com/metamx/druid-cache-caffeine
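
The priorStats field in the excerpt above suggests delta-based stats reporting: keep the last snapshot and report only the activity since then. A minimal sketch of that pattern, using a hypothetical SnapshotStats value class standing in for Caffeine's immutable CacheStats (which exposes a minus(...) method for exactly this kind of subtraction):

```java
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical immutable stats holder standing in for Caffeine's CacheStats,
// which provides minus(CacheStats) to compute deltas the same way.
final class SnapshotStats {
    final long hits, misses;
    SnapshotStats(long hits, long misses) { this.hits = hits; this.misses = misses; }
    SnapshotStats minus(SnapshotStats other) {
        return new SnapshotStats(hits - other.hits, misses - other.misses);
    }
}

public class DeltaStats {
    private final AtomicReference<SnapshotStats> prior =
        new AtomicReference<>(new SnapshotStats(0, 0));

    // Atomically swap in the current snapshot and return only the activity
    // since the previous call, so repeated reporting yields period deltas
    // rather than ever-growing totals.
    SnapshotStats deltaSince(SnapshotStats current) {
        SnapshotStats last = prior.getAndSet(current);
        return current.minus(last);
    }

    public static void main(String[] args) {
        DeltaStats d = new DeltaStats();
        SnapshotStats first = d.deltaSince(new SnapshotStats(10, 2));
        SnapshotStats second = d.deltaSince(new SnapshotStats(15, 3));
        System.out.println(first.hits + " " + second.hits); // prints "10 5"
    }
}
```

The AtomicReference getAndSet makes the snapshot swap race-free even if two reporting threads call in at once; each observed delta is counted exactly once.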

@drcrallen Are we sure that this will fix the locking problems? If yes, can you explain, or share some benchmarks?

@b-slim I have a high degree of confidence this will fix the locking issues, but almost zero information on whether this offers better latency than the guava cache. Luckily it is in an extension, so it is completely optional. I want to get numbers on the matter aside from the general Caffeine benchmarks, but will not be able to immediately.

@ben-manes One thing I couldn't find a good way to handle was "namespace"s. Does caffeine have any way to use a global retention rule, but divide the keys up into namespaces?

No, but it might be possible to build something like that on top. It is a little vague what that means and how generally useful it is. I've seen it discussed in the context of other caches, but it was never requested for Guava's. Namespaces used to be asked for a lot back in memcached's heyday. Most of the time it was to invalidate a bunch of inter-related keys together. The solution was to use generational caching (a version number as part of the key) and increment the generation when the data store was updated. That relied on the cache to lazily evict the old keys, which would no longer be used. Alternatively you could build indexes and rely on locking, since this is all part of the same process.

If namespaces are meant to create regions with sub and global thresholds, this is even messier. That's usually done to share a cluster of memcached and perform resource accounting (e.g. Google's internal rewrite did this), or to have automatic cache sizing. The former makes no sense here, and the latter is very messy in Java, where GC masks sizes. Attempts to do it automatically (soft references, heap monitoring, etc.) don't work well. It's better to be explicit and move large-scale caching outside of the JVM (off-heap, memcached, etc.). So messy and confusing from my side of the fence. I think the useful hooks are exposed for a custom job, though.
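
The generational approach described above can be sketched briefly. This is an illustrative stdlib-only version (a ConcurrentHashMap stands in for the cache, and the class and method names are made up); with Caffeine the same pattern would run against Cache.getIfPresent/put and rely on size-based eviction to discard stale generations:

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;

// Generational ("namespace") caching sketch: the namespace's current
// generation is baked into every key, so bumping the generation makes all
// of that namespace's old entries unreachable. They are then evicted lazily
// under normal size pressure rather than being scanned out eagerly.
public class GenerationalCache {
    private final ConcurrentHashMap<String, byte[]> cache = new ConcurrentHashMap<>();
    private final ConcurrentHashMap<String, AtomicLong> generations = new ConcurrentHashMap<>();

    private String versionedKey(String namespace, String key) {
        long gen = generations.computeIfAbsent(namespace, ns -> new AtomicLong()).get();
        return namespace + ":" + gen + ":" + key;
    }

    public void put(String namespace, String key, byte[] value) {
        cache.put(versionedKey(namespace, key), value);
    }

    public byte[] get(String namespace, String key) {
        return cache.get(versionedKey(namespace, key));
    }

    // "Invalidate" a whole namespace in O(1): lookups switch to a new
    // generation, and the old generation's keys simply age out.
    public void invalidateNamespace(String namespace) {
        generations.computeIfAbsent(namespace, ns -> new AtomicLong()).incrementAndGet();
    }
}
```

The trade-off is that invalidated entries still occupy space until eviction catches up, which is exactly the lazy-eviction behavior described in the comment.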

@ben-manes The big thing for use in druid is that data queries have the underlying data segment as part of the cache key. As such, if the segment is no longer present on a node (in the case of using local cache), then that node will no longer receive queries for that segment, so any sort of LRU-like eviction policy should favor destroying those no-longer-used keys. The current methodology in this PR is similar to the generational approach you described, but the existing cache keys are scanned and invalidated in a best-effort manner. It could just fall back to lazy eviction if actively scanning the keys turns out to be a performance concern.
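
The best-effort scan-and-invalidate approach could look roughly like this; a plain ConcurrentMap stands in for the cache's map view (Caffeine exposes one via Cache.asMap()), and the method name is illustrative:

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

public class PrefixInvalidation {
    // Best-effort removal of every key in a dropped segment's namespace by
    // scanning the cache's map view. With Caffeine this would iterate over
    // cache.asMap(). The scan is racy but safe: any entry missed due to a
    // concurrent write is simply left for normal eviction to reclaim.
    static void invalidateNamespace(ConcurrentMap<String, byte[]> mapView, String namespacePrefix) {
        mapView.keySet().removeIf(key -> key.startsWith(namespacePrefix));
    }

    public static void main(String[] args) {
        ConcurrentMap<String, byte[]> view = new ConcurrentHashMap<>();
        view.put("seg1:a", new byte[0]);
        view.put("seg1:b", new byte[0]);
        view.put("seg2:a", new byte[0]);
        invalidateNamespace(view, "seg1:");
        System.out.println(view.size()); // prints "1"
    }
}
```

The cost is O(total keys) per dropped segment, which is the performance concern mentioned above; the lazy-eviction fallback avoids the scan entirely.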

Thanks for the insight.

If you know when a segment is no longer present, then you could maintain an index as described above. Then when the segment is dropped, you invalidate all of the associated keys. This could be done in a racy fashion (depending on eviction to clean up) or atomically (a bit of locking), depending on your preference. The atomic approach might use a CacheWriter (synchronous) or a RemovalListener (asynchronous). The complexity of doing it "correctly" and discarding data proactively may not turn into a performance win, though.
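
The index-based alternative can be sketched as below. This is the racy variant (concurrent writers may slip a key in during the drop, which is acceptable if eviction cleans up stragglers); keeping the index consistent with the cache's own evictions would use Caffeine's removal hooks, such as a RemovalListener, omitted here. A ConcurrentHashMap again stands in for the cache, and the class and method names are hypothetical:

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Index-based invalidation sketch: maintain a side index from segment to
// the cache keys written under it, then drop all of them together when the
// segment is unloaded, instead of scanning every key in the cache.
public class IndexedInvalidation {
    private final ConcurrentHashMap<String, byte[]> cache = new ConcurrentHashMap<>();
    private final ConcurrentHashMap<String, Set<String>> segmentIndex = new ConcurrentHashMap<>();

    public void put(String segment, String key, byte[] value) {
        // Record the key under its segment before publishing the entry.
        segmentIndex.computeIfAbsent(segment, s -> ConcurrentHashMap.newKeySet()).add(key);
        cache.put(key, value);
    }

    public byte[] get(String key) {
        return cache.get(key);
    }

    // Invalidate every key recorded for the segment: O(keys in segment)
    // rather than O(all keys). Racy with respect to concurrent writers.
    public void dropSegment(String segment) {
        Set<String> keys = segmentIndex.remove(segment);
        if (keys != null) {
            keys.forEach(cache::remove);
        }
    }
}
```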

This has transient failures in the new UT. Please, please try to reduce transient failures.

@fjy The failures in the java7 UT are not transient. See the comment about master.

@drcrallen IMO the best way to resolve the java7 issue is to have this live not in the druid repo, but in a different repo, labeled as requiring java8.

I have moved this over to https://github.com/metamx/druid-cache-caffeine since it requires java8.

👍
Tests are all based on #1937
TODO: