Skip to main content

serving_endpoints

Operations on a serving_endpoints resource.

Overview

Nameserving_endpoints
TypeResource
Iddatabricks_workspace.realtimeserving.serving_endpoints

Fields

NameDatatype
idstring
namestring
configobject
creation_timestampinteger
creatorstring
last_updated_timestampinteger
permission_levelstring
route_optimizedboolean
stateobject
tagsarray

Methods

NameAccessible byRequired ParamsDescription
getSELECTname, deployment_nameRetrieves the details for a single serving endpoint.
listSELECTdeployment_name
createINSERTdeployment_name
deleteDELETEname, deployment_name
patchUPDATEname, deployment_nameUsed to batch add and delete tags from a serving endpoint with a single API call.
updateconfigUPDATEname, deployment_nameUpdates any combination of the serving endpoint's served entities, the compute configuration of those served entities, and the endpoint's traffic config. An endpoint that already has an update in progress can not be updated until the current update completes or fails.
putREPLACEname, deployment_nameUsed to update the rate limits of a serving endpoint. NOTE: Only foundation model endpoints are currently supported. For external models, use AI Gateway to manage rate limits.
queryEXECname, deployment_name

SELECT examples

SELECT
id,
name,
config,
creation_timestamp,
creator,
last_updated_timestamp,
permission_level,
route_optimized,
state,
tags
FROM databricks_workspace.realtimeserving.serving_endpoints
WHERE deployment_name = '{{ deployment_name }}';

INSERT example

Use the following StackQL query and manifest file to create a new serving_endpoints resource.

/*+ create */
INSERT INTO databricks_workspace.realtimeserving.serving_endpoints (
deployment_name,
data__name,
data__config,
data__ai_gateway,
data__tags
)
SELECT
'{{ deployment_name }}',
'{{ name }}',
'{{ config }}',
'{{ ai_gateway }}',
'{{ tags }}'
;

UPDATE example

Updates a serving_endpoints resource.

/*+ update */
-- replace field1, field2, etc. with the fields you want to update
UPDATE databricks_workspace.realtimeserving.serving_endpoints
SET field1 = '{{ value1 }}',
field2 = '{{ value2 }}', ...
WHERE name = '{{ name }}' AND
deployment_name = '{{ deployment_name }}';

REPLACE example

Replaces a serving_endpoints resource.

/*+ update */
-- replace field1, field2, etc. with the fields you want to update
REPLACE databricks_workspace.realtimeserving.serving_endpoints
SET field1 = '{ value1 }',
field2 = '{ value2 }', ...
WHERE name = '{{ name }}' AND
deployment_name = '{{ deployment_name }}';

DELETE example

Deletes a serving_endpoints resource.

/*+ delete */
DELETE FROM databricks_workspace.realtimeserving.serving_endpoints
WHERE name = '{{ name }}' AND
deployment_name = '{{ deployment_name }}';