GPU Resources
GPU models, request rules, and YAML examples in VKS
VKS offers several GPU models. Choose the one that fits your workload when provisioning a cluster.
GPU model table
| Model | GPU label | GPU memory | Request / limit |
|---|---|---|---|
| NVIDIA-H800A-NV-96G | Provisioned on request | 96 G | 18c : 200G : 1 GPU |
| NVIDIA-H800A-NV-80G | Provisioned on request | 80 G | 13c : 200G : 1 GPU |
| NVIDIA-H800-NV-80G | nvidia.com/gpu-h800 | 80 G | 18c : 200G : 1 GPU |
| NVIDIA-L40S-PCIE-48G | nvidia.com/gpu-l40s | 48 G | 10c : 100G : 1 GPU |
Using GPUs
In a container you can request and limit GPUs by setting the resource key (the GPU label) and the GPU count.
Request example:
- GPU label:
nvidia.com/gpu-h800 - GPU count: 1
- cpu: 18
- memory: 200G
Limit example:
- GPU label:
nvidia.com/gpu-h800 - GPU count: 1
- cpu: 18
- memory: 200G
Tip: For each GPU, the CPU and memory must not exceed the per-GPU caps in the table above. With multiple GPUs, scale CPU and memory in proportion. Write the CPU as a plain number — no
csuffix (e.g.cpu: 18).
Code example
spec:
imagePullSecrets:
- name: your-harbor-secret
containers:
- name: your-container-name
image: your-image:tag # replace with your own image
command: your-command
resources:
requests:
memory: "200G"
cpu: "18"
nvidia.com/gpu-h800: 1 # request 1 GPU
limits:
memory: "200G"
cpu: "18"
nvidia.com/gpu-h800: 1 # limit 1 GPULast updated on
Was this page helpful?
