Deployment Details (QuickStart)
Returns the details of a base model service.
GET
https://api.alayanew.com/api/serverless-infer/v1/deployment/${serviceId}
Authorizations
Authorizations: String, Header, Required
Authenticate with the Serverless API Key you have obtained, for example: plain Credential=[YOUR_AK],Signature=[YOUR_SK].
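The header value described above can be assembled with a small helper. This is a minimal sketch: the function name `build_auth_header` is hypothetical, and it assumes the square brackets in the example are placeholders only, which matches the cURL sample on this page.

```python
def build_auth_header(access_key: str, secret_key: str) -> dict:
    """Return the Authorization header in the documented 'plain' form.

    Assumption: [YOUR_AK] and [YOUR_SK] are placeholders, so the brackets
    themselves are not sent.
    """
    if not access_key or not secret_key:
        raise ValueError("both access_key and secret_key are required")
    value = f"plain Credential={access_key},Signature={secret_key}"
    return {"Authorization": value}
```

Keeping this in one helper avoids formatting mistakes such as a stray space after the comma.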
Path Parameters
serviceId: String, Required
Service ID.
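As an illustration of how the path parameter fits into the endpoint, the sketch below substitutes `serviceId` into the URL shown at the top of this page. The helper name `deployment_url` is hypothetical, and percent-encoding the ID is a defensive assumption rather than a documented requirement.

```python
from urllib.parse import quote

# Endpoint prefix taken from the request URL on this page.
BASE_URL = "https://api.alayanew.com/api/serverless-infer/v1/deployment"


def deployment_url(service_id: str) -> str:
    """Build the deployment-details URL for a given service ID."""
    if not service_id:
        raise ValueError("service_id must not be empty")
    # safe="" also encodes "/" so the ID cannot add extra path segments.
    return f"{BASE_URL}/{quote(service_id, safe='')}"
```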
Response
Status code (content type: application/json)
200
serviceUrl:String
Service URL.
apiKey:String
API Key.
status:String
Status.
serviceId:String
Service ID.
vksId:String
Elastic container cluster (VKS) ID.
namespace:String
Elastic container cluster (VKS) namespace.
name:String
Service name.
servedName:List<String>
Internal model identifier.
modelId:String
Model ID.
mode:String
Launch mode, for example: quickStart/expert.
quickStart:Object
cURL
curl --location --request GET 'https://api.alayanew.com/api/serverless-infer/v1/deployment/38fbfc3d-6a88-4c35-b8b6-9efc83949d47' \
--header 'Authorization:plain Credential=YOUR_AK,Signature=YOUR_SK' \
--header 'Content-Type: application/json'
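The same request can be issued from Python using only the standard library. This is a sketch, not an official client: `build_request` and `get_deployment` are hypothetical names, the 10-second timeout is an arbitrary choice, and the headers mirror the cURL sample.

```python
import json
import urllib.request


def build_request(service_id: str, access_key: str, secret_key: str) -> urllib.request.Request:
    """Prepare the GET request without sending it."""
    url = f"https://api.alayanew.com/api/serverless-infer/v1/deployment/{service_id}"
    headers = {
        "Authorization": f"plain Credential={access_key},Signature={secret_key}",
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, headers=headers, method="GET")


def get_deployment(service_id: str, access_key: str, secret_key: str, timeout: float = 10.0) -> dict:
    """Send the request and decode the JSON body.

    urlopen raises urllib.error.HTTPError for 4xx and 5xx responses.
    """
    request = build_request(service_id, access_key, secret_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Splitting request construction from sending makes the headers easy to inspect before any network call is made.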
Example response for 200 (the page also lists status codes 400, 401, 403, 404, and 500):
{
  "code": 0,
  "data": {
    "serviceUrl": "string",
    "apikey": "String",
    "status": "String, model deployment status: staring, running, stoping, stoped, failed",
    "vksId": "",
    "namespace": "",
    "name": "string, user-defined display name of the service",
    "servedName": ["string, internal model identifier"],
    "modelId": "String, model ID",
    "mode": "quickStart",
    "quickStart": {
      "backend": "vllm/sglang",
      "backendVersion": "0.8.4...",
      "backenArgs": [],
      "resource": {
        "workers": "optional, int, number of workers",
        "cpu": "required,int",
        "mem": "required,int",
        "gpu": {
          "gpuType": "required, string, gpu type name",
          "count": "required, int, number of gpu to be use"
        }
      },
      "expert": {...}
    }
  }
}
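A decoded response of this shape can be mapped onto a small typed record. The sketch below rests on two assumptions: that `"code": 0` indicates success (as the 200 sample suggests), and that missing fields should default to empty values. The `Deployment` class and the `parse_deployment` function are hypothetical names, not part of the API.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class Deployment:
    service_url: str
    status: str
    name: str
    mode: str
    served_names: List[str] = field(default_factory=list)
    gpu_type: Optional[str] = None
    gpu_count: Optional[int] = None


def parse_deployment(body: Dict[str, Any]) -> Deployment:
    """Extract commonly used fields from the response envelope."""
    # Assumption: a non-zero "code" signals an error.
    if body.get("code") != 0:
        raise ValueError(f"unexpected response code: {body.get('code')!r}")
    data = body.get("data") or {}
    # GPU settings sit under quickStart.resource.gpu in the sample response.
    gpu = data.get("quickStart", {}).get("resource", {}).get("gpu", {})
    return Deployment(
        service_url=data.get("serviceUrl", ""),
        status=data.get("status", ""),
        name=data.get("name", ""),
        mode=data.get("mode", ""),
        served_names=list(data.get("servedName", [])),
        gpu_type=gpu.get("gpuType"),
        gpu_count=gpu.get("count"),
    )
```

A caller might poll until `status` equals `running` before sending inference traffic to `service_url`.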