Skip to content

Reduce hostd mem usage via event type filter#7001

Merged
mdubya66 merged 3 commits intovmware:masterfrom
hickeng:event-filters-and-big-page-size
Dec 29, 2017
Merged

Reduce hostd mem usage via event type filter#7001
mdubya66 merged 3 commits intovmware:masterfrom
hickeng:event-filters-and-big-page-size

Conversation

@hickeng
Copy link
Contributor

@hickeng hickeng commented Dec 22, 2017

This bumps eventPageSize back to 200 to determine if the event type filter has significant effect on memory footprint of task history.
Having run full CI against this branch we saw a maximum of 124880 which is directly inline with the numbers found for 1.2.1 full CI and an unfiltered pagesize of 25.
The fact these two numbers are almost identical implies to me that we're not saturating the 25 pagesize in CI and in fact the number is dominated at this time by the taskRetention interval in hostd.

https://ci.vcna.io/vmware/vic/15511 is the CI build used for reference. There is a failure in the log test which looks like it's related to either a local stall or clock skew - will confirm once logs are uploaded.

@dougm has tests for the filtering in vmware/govmomi and I've confirmed that we're getting only the expected events delivered. The docker ps and out-of-band tests already exercise the function that events support.

This is based off #7000 and should be merged instead of that, or after having dropped the govmomi portion of this PR.

Follow up for : #6886 and #6384

@hickeng hickeng changed the title WIP: event filters and big page size [specific ci=Group3-Docker-Compose] Large eventPageSize with filtered types [full ci] Dec 22, 2017
@hickeng hickeng force-pushed the event-filters-and-big-page-size branch 2 times, most recently from 6005cb2 to 73d83f2 Compare December 28, 2017 20:28
An alternative to increasing the collector page size.  It will reduce the throughput to the event collector and hence reduce event misses.

See issues vmware#6937 and vmware#6998
@hickeng hickeng force-pushed the event-filters-and-big-page-size branch from 73d83f2 to 613ac6f Compare December 29, 2017 00:32
Hypothesis is that it was primarily the reconfigure events that were
overwhelming the heap due to associated config spec. This has not been
confirmed by code inspection or similar, but this commit aims to
experientially determine the impact on hostd memory.
Experiments have shown that a filtered collector with pagesize of 200
has the same memory profile across full CI as the 1.2.1 release code
with an unfiltered pagesize of 25 (maxes out at approx 125000).
Without the filtering, significant deviation was observed.
Copy link
Member

@dougm dougm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@hickeng hickeng changed the title Large eventPageSize with filtered types [full ci] Reduce hostd mem usage via event type filter Dec 29, 2017
@mdubya66
Copy link
Contributor

mdubya66 commented Dec 29, 2017

LGTM

Approved with PullApprove Approved with PullApprove

@mdubya66
Copy link
Contributor

Merging to close the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants