Offline update (QuickStart)
Users can update model deployment (QuickStart) information offline after stopping the instance.
PUT
https://api.alayanew.com/api/serverless-infer/v1/deployment/{serviceId}
Authorizations
Authorization: String, Header, Required
Users authenticate with a previously obtained Open API Key, for example: plain Credential=[YOUR_AK],Signature=[YOUR_SK].
Path Parameters
serviceId: String, Required
Service ID.
Body
application/json
vksId: String, Required
Vital Kubernetes Engine (VKS) ID.
namespace: String, Required
Vital Kubernetes Engine (VKS) namespace.
name: String, Required
Service name.
servedName: List<String>, Required
Internal model identifier.
modelId: String, Required
Model ID.
backend: String, Required
Backend service: vllm or sglang.
backendVersion: String, Required
Backend service version.
backendArgs: String, Required
Backend service parameters.
resource: Object, Required
Response
Status code (application/json):
200
code: Int
Common return value that represents the execution result of the operation. Possible values: 0, -1.
0 is the success flag, indicating that the operation completed successfully.
data: Object
msg: String, Required
Returns an error message when the code value is -1.
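The return-code convention described above can be checked on the client side. The following is a minimal Python sketch, not part of the official SDK: the helper name parse_update_response is hypothetical, and it assumes that any code other than 0 should be treated as a failure (the source only documents 0 and -1).

```python
import json


def parse_update_response(body: str) -> dict:
    """Return the `data` object on success; raise RuntimeError otherwise.

    Hypothetical helper: the API reference only documents code 0 (success)
    and code -1 (failure, with an error message in `msg`).
    """
    payload = json.loads(body)
    code = payload.get("code")
    if code == 0:
        # `data` may be empty or null for an update call; normalize to a dict.
        return payload.get("data") or {}
    # Assumption: every non-zero code is handled like the documented -1.
    raise RuntimeError(f"update failed (code={code}): {payload.get('msg', '')}")
```

A caller would pass the raw response body to this function and catch RuntimeError to surface the msg field to the user.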
cURL
curl --location --request PUT 'https://api.alayanew.com/api/serverless-infer/v1/deployment/38fbfc3d-6a88-4c35-b8b6-9efc83949d47' \
  --header 'Authorization: plain Credential=YOUR_AK,Signature=YOUR_SK' \
  --header 'Content-Type: application/json' \
  --data '{
    "vksId": "vcacb50arkk4",
    "namespace": "default",
    "name": "testsglang",
    "servedName": [
      "testsglang"
    ],
    "modelId": "c486cdee-c316-4fc1-9f75-0d1741940f27",
    "backend": "sglang",
    "backendVersion": "0.4.6",
    "backendArgs": [],
    "resource": {
      "workers": 2,
      "cpu": 4,
      "gpu": {
        "count": 1,
        "gpuType": "nvidia.com/gpu-l40s"
      },
      "mem": 10
    }
  }'
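For readers who prefer Python, the same request can be sketched with the standard library only. This is an illustrative equivalent of the cURL call, not an official client: the function name build_update_request is hypothetical, and the body values are copied from the cURL example. The request is only constructed here; sending it is left as a commented-out step.

```python
import json
import urllib.request

BASE_URL = "https://api.alayanew.com/api/serverless-infer/v1/deployment"


def build_update_request(service_id, access_key, secret_key, body):
    """Build (but do not send) the PUT request for an offline update."""
    headers = {
        # Credential format taken from the Authorizations section.
        "Authorization": f"plain Credential={access_key},Signature={secret_key}",
        "Content-Type": "application/json",
    }
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/{service_id}", data=data, headers=headers, method="PUT"
    )


# Body values copied from the cURL example.
body = {
    "vksId": "vcacb50arkk4",
    "namespace": "default",
    "name": "testsglang",
    "servedName": ["testsglang"],
    "modelId": "c486cdee-c316-4fc1-9f75-0d1741940f27",
    "backend": "sglang",
    "backendVersion": "0.4.6",
    "backendArgs": [],
    "resource": {
        "workers": 2,
        "cpu": 4,
        "gpu": {"count": 1, "gpuType": "nvidia.com/gpu-l40s"},
        "mem": 10,
    },
}

request = build_update_request(
    "38fbfc3d-6a88-4c35-b8b6-9efc83949d47", "YOUR_AK", "YOUR_SK", body
)

# To actually send the request (requires valid credentials):
# with urllib.request.urlopen(request) as response:
#     print(response.read().decode("utf-8"))
```

Separating request construction from sending makes it easy to inspect the URL, headers, and payload before calling the live endpoint.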
200
400
401
403
404
500
{
  "code": 0,
  "data": {},
  "msg": "string"
}