Alaya NeW Cloud

Elastic Kubernetes Service (VKS)

Managed Kubernetes for GPU workloads — GPU labels, RDMA, persistent volumes, service publishing

Virtual Kubernetes Service (VKS) is the managed Kubernetes offering on Alaya NeW Cloud, with first-class support for GPU scheduling, cross-node RDMA, shared / dynamic storage, and service publishing — so training and inference workloads consume GPU on demand.

Capabilities

  • GPU resources — H800 / L40S and more, requested by label, with CPU and memory pre-sized to GPU count.
  • Cross-node RDMA — IB / RoCE built in, enabled with one line rdma/rdma_shared_device_a in your YAML.
  • Persistent storage — shared static PVCs across accounts plus on-demand dynamic PVCs; underlying NAS / block details are abstracted away.
  • Service publishing — built-in ServiceExporter auto-mints a public domain; no separate load balancer needed.
  • Multi-tenancy — primary-account-driven authorization, with per-user parallelism and consumption rules on dedicated clusters.

Learning path

Advanced recipes

Common extensions on top of VKS — full guides live in the Cookbook:

VKS vs CCI

CCI is for single-container ad-hoc workloads. VKS is for multi-container, multi-service orchestration. They share the same underlying GPU pool and can be combined.

Last updated on

Was this page helpful?

On this page