Cortex Inference
Perform LLM text completion inference¶
POST/api/v2/cortex/inference:complete
Perform LLM text completion inference, similar to snowflake.cortex.Complete.
Parameter | Type | Description |
---|
{
"model": "string",
"messages": [
{
"role": "user",
"content": "string"
}
],
"temperature": 0,
"top_p": 1,
"max_tokens": 4096,
"max_output_tokens": 0,
"guardrails": {
"enabled": true,
"response_when_unsafe": "Response filtered by Cortex Guard"
}
}
Response¶
Code | Description |
---|---|
200 | OK {} |
400 | Bad Request. The request payload is invalid or malformed. This happens if the application didn't send the correct request payload. The response body may include the error code and message indicating the actual cause. The application must reconstruct the request body for retry. |
401 | Unauthorized. The request is not authorized. This happens if the attached access token is invalid or missing. The response body may include the error code and message indicating the actual cause, e.g., expired, invalid token. The application must obtain a new access token for retry. |
403 | Forbidden. The request is forbidden. This can also happen if the request is made even if the API is not enabled. |
404 | Not Found. The request endpoint is not valid. This happens if the API endpoint does not exist, or if the API is not enabled. |
405 | Method Not Allowed. The request method doesn't match the supported API. This happens, for example, if the application calls the API with GET method but the endpoint accepts only POST. |
500 | Internal Server Error. The server hit an unrecoverable system error. The response body may include the error code and message for further guidance. The application owner may need to reach out the customer support. |
503 | Service Unavailable. The request was not processed due to server side timeouts. The application may retry with backoff. The jittered backoff is recommended. |
504 | Gateway Timeout. The request was not processed due to server side timeouts. The application may retry with backoff. The jittered backoff is recommended. |
Parameter | Type | Description |
---|