ποΈ Overview
EGS exposes a number of APIs to create and manage slices and GPU provision request (GPRs)
ποΈ Authentication
The client needs to authenticate using the API Token to get a bearer token for the API calls.
ποΈ API Key
This topic describes the REST APIs to manage API keys.
ποΈ Workspace
A workspace is associated with one or more namespaces and a user.
ποΈ Inventory
List GPU nodes available for allocation requests (GPRs) in the cluster. These nodes are managed and allocated by EGS.
ποΈ GPR Templates
This topic describes the REST APIs to manage GPR Templates.
ποΈ GPR Template Binding
A GPR template binding is the GPR template that associates with the workspace.
ποΈ GPR
A GPR is a GPU Provisioning request to EGS to allocate GPUs to a workspace for a requested duration.
ποΈ GPR Wait Time
The GPU Provisioning Requests (GPR) wait time is calculated based on the current GPU availability and the requested GPU resources. If the
ποΈ Inference Endpoints
An Inference Endpoint is a hosted service to perform inference tasks such as