Skip to content
This repository was archived by the owner on May 6, 2026. It is now read-only.
This repository was archived by the owner on May 6, 2026. It is now read-only.

ResourceSlice should be compatible with Cluster Autoscaler scale-from-existing-node #178

@gauravkghildiyal

Description

@gauravkghildiyal

Currently, DraNet removes devices from the ResourceSlice upon allocation. When a node's entire pool of devices is consumed, its ResourceSlice becomes empty. To decide whether to scale up, cluster autoscaler, creates a "template" of a new node by inspecting existing nodes of that type (and their respective ResourceSlices):

When it finds that all representative nodes have empty ResourceSlices (since DraNet removed all allocated devices from the ResourceSlice), it concludes that adding another node of this type would also yield no devices and hence not be able to schedule pending pods indefinitely.

This is an issue in how DraNet communicates state. It is not providing enough information for the autoscaler to differentiate between "a node that has no devices to begin with" and "a node whose devices are simply all in use."


Prior discussions related to this at https://kubernetes.slack.com/archives/C0409NGC1TK/p1748465553284579 (previous discussion was hypothetical while this is now a practical limitation)

Metadata

Metadata

Labels

No labels
No labels

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions