Part of #624
Cache.FetchChunkData() deserialises all chunks returned by memcached in a simple loop. This could be parallelised when there are thousands to work through.
As noted by jml we already have a few implementations of "parallel map" - maybe we can re-use one here?