Updates an existing endpointβs configuration. You can modify the display name, autoscaling settings, or change the endpointβs state (start/stop).
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
The ID of the endpoint to update
A human-readable name for the endpoint
"My Llama3 70b endpoint"
The desired state of the endpoint
STARTED, STOPPED "STARTED"
New autoscaling configuration for the endpoint
The number of minutes of inactivity after which the endpoint will be automatically stopped. Set to 0 to disable automatic timeout.
60
200
Details about a dedicated endpoint deployment
The type of object
endpoint "endpoint"
Unique identifier for the endpoint
"endpoint-d23901de-ef8f-44bf-b3e7-de9c1ca8f2d7"
System name for the endpoint
"devuser/meta-llama/Llama-3-8b-chat-hf-a32b82a1"
Human-readable name for the endpoint
"My Llama3 70b endpoint"
The model deployed on this endpoint
"meta-llama/Llama-3-8b-chat-hf"
The hardware configuration used for this endpoint
"1x_nvidia_a100_80gb_sxm"
The type of endpoint
dedicated "dedicated"
The owner of this endpoint
"devuser"
Current state of the endpoint
PENDING, STARTING, STARTED, STOPPING, STOPPED, ERROR "STARTED"
Configuration for automatic scaling of the endpoint
Timestamp when the endpoint was created
"2025-02-04T10:43:55.405Z"