API Reference | Elastic Grid Service

Overview

EGS exposes APIs to create and manage slices and GPU provision requests (GPRs).

This document describes the authentication mechanism for accessing the EGS Core APIs.

This topic describes the REST APIs to manage API keys.

This topic describes the API used to get the list of clusters registered with the EGS Controller.

A workspace is associated with one or more namespaces and a user.

This topic describes APIs to list, retrieve, and update workspace policies.

This topic describes the API used to list the GPU nodes in the cluster that are available for GPU provision requests (GPRs). EGS manages and allocates these nodes.

This topic describes the REST APIs to manage GPR Templates.

A GPR template binding associates a GPR template with a workspace.

A GPR is a GPU provision request: a request to EGS to allocate GPUs to a workspace for a requested duration.

The GPU availability is determined by the current state of the GPU resources in EGS. EGS continuously monitors GPU usage and

An Inference Endpoint is a hosted service to perform inference tasks such as

This topic describes the steps to create a Workload Template in a workspace. You can use the Workload Template as a pre

This topic describes APIs used to create, get, update, and delete a Workload Placement.