Offline Update (Expert)

Updates an offline deployment in Expert mode.
PUT
https://api.alayanew.com/api/serverless-infer/v1/deployment/expert/{serviceId}
Authorizations
Authorization (String, Header, Required)

Authenticate with the Serverless API Key you have obtained, for example: plain Credential=[YOUR_AK],Signature=[YOUR_SK].
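
As a minimal sketch of the header format described above, the snippet below assembles the Authorization header from an access key and a secret key. The variable names `AK`, `SK`, and `AUTH_HEADER` are illustrative assumptions, and the values are placeholders rather than real credentials.

```shell
#!/bin/sh
# Placeholder credentials: replace with your own Serverless API Key pair.
AK="YOUR_AK"
SK="YOUR_SK"

# Header in the documented form: plain Credential=<AK>,Signature=<SK>
AUTH_HEADER="Authorization: plain Credential=${AK},Signature=${SK}"

# Print the header so it can be inspected before use.
printf '%s\n' "$AUTH_HEADER"
```

The resulting value can then be passed to curl with `--header "$AUTH_HEADER"`.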

Path Parameters
serviceId (String, Required)

Service ID.

Body
application/json
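vksId (String, Required)

Elastic container cluster (VKS) ID.

namespace (String, Required)

Namespace within the elastic container cluster (VKS).

name (String, Required)

Service name.

servedName (List<String>, Required)

Internal model identifiers.

modelId (String)

Model ID.

headConfig (Object, Required)

workerConfig (Object)

scale (Object, Required)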

Response
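Status code: application/json
code (Int)

Result code indicating the outcome of the operation.

0 indicates that the operation completed successfully.
data (Object)

msg (String)

When code is -1, msg contains the error message.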
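
The response fields above suggest a simple success check: treat `code` equal to 0 as success and anything else as failure. The sketch below is one way to do that in plain shell. The function name `is_success` and both sample bodies (including the error text) are hypothetical, and the pattern match is a simplification; a real JSON parser would be more robust.

```shell
#!/bin/sh
# is_success BODY: returns 0 when BODY contains "code": 0.
# Whitespace is stripped first so formatting differences do not matter.
is_success() {
    printf '%s' "$1" | tr -d ' \t\r\n' | grep -q '"code":0[,}]'
}

# Hypothetical sample bodies shaped like the documented response.
ok_body='{"code": 0, "data": {}, "msg": ""}'
err_body='{"code": -1, "data": {}, "msg": "service not found"}'

for body in "$ok_body" "$err_body"; do
    if is_success "$body"; then
        echo "update accepted"
    else
        # On failure, msg carries the error information.
        echo "update failed: $body" >&2
    fi
done
```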

curl --location --request PUT 'https://api.alayanew.com/api/serverless-infer/v1/deployment/expert/38fbfc3d-6a88-4c35-b8b6-9efc83949d47' \
     --header 'Authorization: plain Credential=YOUR_AK,Signature=YOUR_SK' \
     --header 'Content-Type: application/json' \
     --data '{
        "name":"test-expert",
        "namespace": "default",
        "vksId": "vcacb50arkk4",
        "servedName": ["testsglang"],
        "modelId": "c486cdee-c316-4fc1-9f75-0d1741940f27",
        "scale": {
            "max": 3,
            "min": 1,
            "rpsValue": 10,
            "idleTime": 60
        },
        "headConfig": {
            "image": "registry.cn-hangzhou.aliyuncs.com/ls-2018/sglang:0.4.6",
            "cmd": ["sh", "-c", "test.sh"],
            "labels": {
                "usage": "test"
            },
            "env": {
                "VLLM_ALLOW_RUNTIME_LORA_UPDATING": "true"
            },
            "args": [],
            "resource": {
                "workers": 3,
                "mem": 32,
                "cpu": 4,
                "gpu": {
                    "gpuType": "nvidia.com/gpu-l40s",
                    "count": 1
                }
            },
            "pvcMounts":[
                {
                    "containerPath": "/scripts",
                    "pvcName": "test-name"
                }
            ]
        },
        "workerConfig": {
            "workers": 3,
            "image": "registry.cn-hangzhou.aliyuncs.com/ls-2018/sglang:0.4.6",
            "cmd": ["sh", "-c", "test.sh"],
            "labels": {
                "usage": "test"
            },
            "env": {
                "VLLM_ALLOW_RUNTIME_LORA_UPDATING": "true"
            },
            "args": [""],
            "resource": {
                "workers": 3,
                "mem": 8,
                "cpu": 4,
                "gpu": {
                    "gpuType": "nvidia.com/gpu-l40s",
                    "count": 1
                }
            },
            "pvcMounts":[
                {
                    "containerPath": "/scripts",
                    "pvcName": "test-name"
                }
            ]
        },
        "extensions": {
            "usage": "test"
        }
    }'

{
    "code": 0,
    "data": {},
    "msg": "string"
}