kernel: enable config CONFIG_MEMORY_HOTPLUG_DEFAULT_ONLINE by default#335
Conversation
jodh-intel
left a comment
There was a problem hiding this comment.
lgtm
We might want input from @grahamwhaley in case this has an impact on the fragements work (#314).
|
:-) I think more importantly is how this does or does not fit in with the memory hotplug flow of kata - @jcvenegas @devimc for input from the 'how is kata hotplug meant to work' side. |
|
@wenlxie great job! do you know why kata-agent receives OOM? |
|
@wenlxie intesting what is the maximal amount of memory you can hotplug currently with out get the agent killed ? |
In my env, the limitation is 120Gi. I tested 119Gi memory, it is ok. |
|
@devimc |
|
@chavafg I am afraid we dont have a full kata testing in this repository we need to add one to fully test.
|
|
@jcvenegas @Pennyzct confirmed this feature works well for ARM and we don't need the manual logic of doing We need to try this out with |
|
@jcvenegas @chavafg Can we add full CI testing to this PR? |
2533a47 to
313e8b9
Compare
@jcvenegas Thanks, now the SoB check had passed. |
|
@wenlxie you can remove the option for s390x. The memory hotplug is not supported by qemu KVM yet |
|
/test |
28abbef to
78cb669
Compare
|
/test |
|
kata_config_version is also need to be changed. Or the Ci job will fail. |
|
/test |
|
@jodh-intel thanks. |
|
Need to run CI job again. Can someone help to trigger it? |
|
Sure.... /retest |
|
/test |
| @@ -1 +1 @@ | |||
| 25 | |||
| 26 | |||
There was a problem hiding this comment.
Hi @wenlxie - sorry about this, but I think the kernel config version has moved ahead in the master branch again - can you rebase and repush... sorry for the hassle - let's see if we can get this one moving again...
e05719c to
12d3cd6
Compare
|
@grahamwhaley I rebase it and need someone help to trigger the test, thank. |
|
/test |
42263e0 to
ee28a84
Compare
|
The kernel config version bumps again. :( |
|
/test |
|
@jodh-intel could you merge to another bump race condition :P ? |
|
Restarted CI which failed due to a timeout. |
|
time out again copy here to check if is a not stable test |
|
/test |
|
18.04 timed out. Prodded it with a sharper stick this time... |
|
@wenlxie This needs another update to |
Fixes: kata-containers#343 Signed-off-by: Wenli Xie <wenlxie@ebay.com>
|
@amshinde I rebased it. But the ci job always fails with timeout, no more infos to debug the issue. |
|
/test |
|
/me pounces... |
@grahamwhaley @jodh-intel Thanks |
In my test env, I assigned 120Gi memory to the container, but found that Sandbox can't be launched. For details: kata-containers/runtime#1244
kata-agent and udev process was killed because of OOM triggerd.
https://lwn.net/Articles/668944/
Signed-off-by: Wenli Xie wenlxie@ebay.com