Bug 1945017: Increase system reserved memory from 1Gi to 1.8Gi to support single node clusters#2504
Bug 1945017: Increase system reserved memory from 1Gi to 1.8Gi to support single node clusters#2504omertuc wants to merge 1 commit intoopenshift:masterfrom omertuc:more_system_reserved_memory
Conversation
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: omertuc The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
@omertuc: This pull request references Bugzilla bug 1945017, which is invalid:
Comment DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
2 similar comments
|
@omertuc: This pull request references Bugzilla bug 1945017, which is invalid:
Comment DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
@omertuc: This pull request references Bugzilla bug 1945017, which is invalid:
Comment DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/test e2e-aws-workers-rhel7 |
|
@rphillips any issues with this? @omertuc please update the BZ as bot requested |
|
/bugzilla refresh |
|
/bugzilla refresh |
|
/test ? |
|
@omertuc: An error was encountered querying GitHub for users with public email (wabouham@redhat.com) for bug 1945017 on the Bugzilla server at https://bugzilla.redhat.com. No known errors were detected, please see the full error message for details. Full error message.
Post "http://ghproxy/graphql": dial tcp 172.30.229.2:80: i/o timeout
Please contact an administrator to resolve this issue, then request a bug refresh with DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/test e2e-aws-serial |
|
@omertuc: An error was encountered querying GitHub for users with public email (wabouham@redhat.com) for bug 1945017 on the Bugzilla server at https://bugzilla.redhat.com. No known errors were detected, please see the full error message for details. Full error message.
Post "http://ghproxy/graphql": dial tcp 172.30.229.2:80: i/o timeout
Please contact an administrator to resolve this issue, then request a bug refresh with DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
@omertuc: The following commands are available to trigger jobs:
Use
DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/test e2e-aws-single-node |
|
@omertuc: The following commands are available to trigger jobs:
Use
DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
…ode clusters When running E2E tests on single node clusters, the 1Gi reserved for system memory is insufficient. During this PR: #2501 - I had 3 e2e test runs on AWS single node, the peak recorded system memory usage during those tests was 1.40, 1.31 and 1.19 GiB respectively. In this PR I also saw a run that peaked at 1.56 GiB. The SystemMemoryExceedsReservation alerts demands that the actual usage would be less than 90% of the amount reserved, so that means the corresponding thresholds that should be set are at least 1.44, 1.46, 1.32 and 1.74 GiB. Or in short, the reserved memory should be increased to 1.8GiB to support single node (with some hopefully sufficient padding). Possible future improvements - 1) Different threshold depending on whether the cluster is a single node cluster or not 2) Find a way to lower single node system memory usage
|
@omertuc: This pull request references Bugzilla bug 1945017, which is invalid:
Comment DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@kikisdeliveryservice It has to has to be triaged before I can do that, see comment in bz |
|
/test e2e-aws-workers-rhel7 |
|
/test okd-e2e-aws |
|
/test e2e-aws |
|
@omertuc: The following tests failed, say
Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
|
The PR will somehow need to detect if SNO is enabled to set the reserve to 1.8GB |
|
/hold |
|
Closed in favor of openshift/release#17403 |
|
@omertuc: This pull request references Bugzilla bug 1945017. The bug has been updated to no longer refer to the pull request using the external bug tracker. All external bug links have been closed. The bug has been moved to the NEW state. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
When running E2E tests on single node clusters, the 1Gi reserved for
system memory is insufficient.
During this PR: #2501 -
I had 3 e2e test runs on AWS single node, the peak recorded system memory
usage during those tests was 1.40, 1.31 and 1.19 GiB respectively. In
this PR I also saw a run that peaked at 1.56 GiB:
The SystemMemoryExceedsReservation alerts demands that the actual usage
would be less than 90% of the amount reserved, so that means the
corresponding thresholds that should be set are at least 1.44, 1.46, 1.32 and
1.74 GiB.
Or in short, the reserved memory should be increased to 1.8GiB to
support single node (with some hopefully sufficient padding).
Possible future improvements -
Different threshold depending on whether the cluster is a single node
cluster or not
Find a way to lower single node system memory usage