Alaya NeW Cloud

Product Overview

One-stop intelligent compute platform — compute, storage, APIs, and the LLM toolchain

Alaya NeW Cloud serves AI training, inference, agent development, and HPC (High-Performance Computing) workloads, providing a full-stack offering from low-level GPUs to the upper-layer model toolchain. This page lays out every product, what it does, and when to use it.

1. Product matrix

Core compute platforms

ProductWhat it isWhen to use it
Cloud Container Instance (CCI)Kubernetes-based serverless container service. Bring an image plus your business logic — no need to manage the underlying cluster.Isolated, single-user GPU experimentation; rapid deployment and operations of inference services
Alaya LabCloud-based one-stop AI development platform with VS Code and Jupyter as the entry points; integrates compute, storage, and training services.Ready-to-use pre-integrated environment; unified development across local IDE, console, and logs
HyperTrainKubernetes-based serverless distributed training service. Submit a job and resources are allocated automatically; built-in intelligent scheduling and self-healing on failure.Automated cluster and environment management; training stability and high availability
Virtual Kubernetes Service (VKS)Unified scheduling for large-scale heterogeneous GPU compute, with cluster scheduling and automated operations.Multi-node, multi-GPU training; full lifecycle management of training and inference
DKS (Dedicated Kubernetes Service)Enterprise-grade, physically isolated Kubernetes clusters with full self-control and a graphical operations UI.User-controlled physical resources; scenarios with strict data security and compliance requirements
DSC (Dedicated Slurm Cluster)Standardized, highly available, elastically scaling Slurm-based compute service with integrated visual operations and multi-dimensional monitoring.Standardized cloud HPC environments; large-scale parallel job scheduling

Development and toolchain

ProductWhat it isWhen to use it
OpenClawOpen-source AI Agent framework written in Node.js. Through a Gateway, it maps LLM inference into executable control over the local OS, file system, and APIs. Supports Feishu, DingTalk, WhatsApp, and other messaging tools as interaction front-ends; provides long-term memory and 24/7 background execution.Upgrade AI from "conversational" to "actionable"; bridge AI with local OS-level permissions

Storage and image services

ProductWhat it isWhen to use it
Mass storageCost-effective elastic shared file storage. Multi-protocol support, linear capacity scaling, distributed parallel architecture.Multi-scenario integration; sharing and centralized management of storage resources across applications
High-performance storageParallel file storage with extreme throughput, global namespace, single-directory view, and sustained performance under massively concurrent access.High-throughput training-data reads; enterprise-grade high-availability requirements
NAS storageElastic mounting of mass-storage or high-performance volumes with unified permission management; decouples compute from storage.Pick mass storage (cost-effective) or high-performance storage (low latency) by scenario; unified shared storage with permissions
AI-accelerated storageCloud-native high-speed storage deeply optimized for LLM training. Intelligent caching plus multi-protocol access solves bottlenecks in massive file reads.Multi-tier caching (memory / SSD / cluster) smoothly absorbs checkpoint write spikes
Object storageElastic object store for massive unstructured data. Flat structure with the standard S3 interface.Data backup, disaster recovery, compliance; data sources for AI frameworks
Image registryFully managed container image management with shared access plus permission management; enterprise-grade encrypted storage and image isolation.Sharing images within teams or across projects; rapid image-based deployment

API services

ProductWhat it isWhen to use it
Open APIWraps GPU compute, distributed scheduling, and container technology into standardized, programmable APIs.Reduce AI training to API calls; serverless per-second billing
Model Service API"Model as a Service" — wraps model deployment, operations, and scheduling as RESTful APIs.Minute-level one-click deployment from model file to live inference
Image Registry APIProgrammatic interface to the image registry.Automated image-registry management

LLM development and inference

ProductWhat it isWhen to use it
LM LabOne-stop LLM development and training platform. Pre-integrates mainstream open-source models, distributed training environments, and visual tuning tools; supports multi-source data ingestion and full model lifecycle management.Lower the barrier to LLM development; unified data and task management
InferenceEnterprise-grade model inference platform covering deployment, evaluation, release, and monitoring. Supports one-click deployment, version control, and elastic auto-scaling.Bring inference services online in minutes; real-time monitoring and evaluation

2. Typical scenarios and product combinations

1. LLM training and fine-tuning

For iterative training workloads on large datasets (images, audio, text), use VKS or HyperTrain for Tensor Core accelerated cards; use LM Lab for data management, model training, task management, model management, and environment management — covering the full development pipeline.

2. AI model inference

For online inference workloads with strict requirements on response latency and stability, use VKS to receive inference requests in real time and balance load automatically; use Inference for deployment, compression, and inference of LLM, CV, and NLP applications. To call models directly via API, see AlayaCode (OpenAI / Anthropic compatible).

3. AI Agent development

Use Alaya Studio, which integrates LLMs, embedding models, and multi-modal model APIs. Fine-tune open-source models with your own data and publish agents as APIs. Common applications: customer service, medical diagnosis, financial services, education assistance, and smart home.

4. HPC (High-Performance Computing)

VKS elastic scalability matches the parallelism and data intensity of HPC workloads, adjusting compute resource allocation in real time. Common domains: image processing and computer vision, engineering and industrial applications, machine learning, etc.

3. Get started

Sign up and verify

  • Register: visit the Alaya NeW website, click Sign Up, and fill in the basic information. See Sign up for the full procedure.
  • Real-name verification: complete enterprise or individual real-name verification per the prompts to unlock full platform privileges.
  • Top up: top up the account to activate it and start using all services.

Quick start

Last updated on

Was this page helpful?

On this page