Speed up sorting / reduce overhead of sorting

About a year ago @KaiBot3000, @aarthykc, and me put carmen under load, profiled with `perf`,  and noticed a single, significant bottleneck in carmen-cache.

The bottleneck was the sorting of `Context` objects, specifically `contextSortByRelev`. This makes sense given `Context` objects are large (copying them is expensive), there may be a lot of them collected in memory, and the sort function is not simple.

After https://github.com/mapbox/carmen-cache/pull/93 we should see a slight speedup / reduction in the cost of this sorting because `Context` objects are now moved rather than copied. However I would not be surprised if the top bottleneck in carmen is still this sorting in carmen-cache. So this ticket stands to:

 - remind us that this is worthwhile of more investigation
 - document what we saw in case optimization work around sorting is picked up again
 - reference https://github.com/mapbox/carmen-cache/pull/93 which includes some ideas for optimizations not yet attempted, including using `std::partial_sort`

Details:

 - The sort function: https://github.com/mapbox/carmen-cache/blob/dfa468a7c17e0f8ba51de2a437d62a90f5a31295/src/cpp_util.hpp#L192-L206
 - The place the sort takes place: (https://github.com/mapbox/carmen-cache/blob/dfa468a7c17e0f8ba51de2a437d62a90f5a31295/src/coalesce.cpp#L647) was identified as a bottleneck during profiling of carmen with `perf`. This is no surprise, sorting is expensive
 - The `perf` output that previously highlighted sorting as the primary bottleneck:

![0707a524-f469-11e6-9296-302ac929396b](https://user-images.githubusercontent.com/20300/37373363-89cb5742-26d3-11e8-91be-0937f5d4379e.png)


	inline bool contextSortByRelev(Context const& a, Context const& b) noexcept {
	if (b.relev > a.relev)
	return false;
	else if (b.relev < a.relev)
	return true;
	else if (b.coverList[0].scoredist > a.coverList[0].scoredist)
	return false;
	else if (b.coverList[0].scoredist < a.coverList[0].scoredist)
	return true;
	else if (b.coverList[0].idx < a.coverList[0].idx)
	return false;
	else if (b.coverList[0].idx > a.coverList[0].idx)
	return true;
	return (b.coverList[0].id > a.coverList[0].id);
	}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up sorting / reduce overhead of sorting #120

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Speed up sorting / reduce overhead of sorting #120

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions