Add nvidia.com/gpu.present=true as a default label in NVIDIA variants #4756

@arnaldo2792

What I'd like:
The GPU Operator automatically applies this label to hosts in the cluster that actually have GPUs (it relies on NFD to detect them). Bottlerocket NVIDIA variants should apply this label to the nodes they run on, as we know the variant will only run when NVIDIA GPUs are present. The label is required to run the NVIDIA DRA Plugin, because the official Helm chart provided by NVIDIA uses nvidia.com/gpu.present=true in an affinity rule.
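To illustrate why the missing label matters, here is a minimal Python sketch of how a required node-affinity rule filters nodes. It is a simplified model, not the Kubernetes scheduler or NVIDIA's actual chart: the function `matches_affinity`, the rule `gpu_affinity`, and the sample node label sets are hypothetical, and only the `In` and `Exists` operators are handled.

```python
# Simplified model of a required node-affinity term: every expression must match.
REQUIRED_LABEL = "nvidia.com/gpu.present"


def matches_affinity(node_labels, expressions):
    """Return True if the node labels satisfy all match expressions (In / Exists only)."""
    for expr in expressions:
        key, op = expr["key"], expr["operator"]
        if op == "In":
            # A missing label yields None, which is never in the allowed values.
            if node_labels.get(key) not in expr["values"]:
                return False
        elif op == "Exists":
            if key not in node_labels:
                return False
        else:
            raise ValueError(f"unsupported operator: {op}")
    return True


# Hypothetical affinity rule, shaped like a Kubernetes matchExpressions entry.
gpu_affinity = [{"key": REQUIRED_LABEL, "operator": "In", "values": ["true"]}]

# Hypothetical nodes: one carries the label, the other does not.
labeled_node = {"kubernetes.io/os": "linux", REQUIRED_LABEL: "true"}
unlabeled_node = {"kubernetes.io/os": "linux"}

print(matches_affinity(labeled_node, gpu_affinity))    # True
print(matches_affinity(unlabeled_node, gpu_affinity))  # False
```

Under this model, a GPU node that lacks the label is never selected, which is why the plugin would not be scheduled on a Bottlerocket NVIDIA node unless something applies the label.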

The EKS Optimized AMIs recently applied this change too: awslabs/amazon-eks-ami#2607

Any alternatives you've considered:
Administrators can apply the label to the nodes themselves, but that's too manual, as they would have to do it every time a node joins a cluster.
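A less manual variant of this workaround might be to set the label through node configuration at launch. The sketch below renders a user-data fragment in Python, assuming the label is set through Bottlerocket's `settings.kubernetes.node-labels` table. The helper `node_labels_user_data` is hypothetical, and it does no escaping of quotes in keys or values.

```python
def node_labels_user_data(labels):
    """Render a TOML fragment that sets Kubernetes node labels in user data."""
    lines = ["[settings.kubernetes.node-labels]"]
    # Sort keys so the rendered fragment is deterministic.
    for key, value in sorted(labels.items()):
        lines.append(f'"{key}" = "{value}"')
    return "\n".join(lines) + "\n"


print(node_labels_user_data({"nvidia.com/gpu.present": "true"}))
```

This still has to be added to every launch template or node group, so it does not remove the need for a default label in the variant itself.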

Metadata

Labels

area/kubernetes: K8s including EKS, EKS-A, and including VMW
status/needs-proposal: Needs a more detailed proposal for next steps
type/enhancement: New feature or request
