ποΈ Overview
EGS exposes a number of APIs to create and manage slices and GPU provision request (GPRs)
ποΈ Authentication
The client needs to authenticate using the API Token to get a bearer token for the API calls.
ποΈ API Key
This topic describes the REST APIs to manage API keys.
ποΈ Workspace
A workspace is associated with one or more namespaces and a user.
ποΈ Inventory
List GPU nodes available for allocation requests (GPRs) in the cluster. These nodes are managed and allocated by EGS.
ποΈ GPR Templates
This topic describes the REST APIs to manage GPR Templates.
ποΈ GPR Template Binding
A GPR template binding is the GPR template that associates with the workspace.
ποΈ GPR
A GPR is a GPU Provisioning request to the EGS to provision GPUs to workspace for a requested duration.
ποΈ GPR Wait Time
The GPR wait time is calculated based on the current GPU availability and the requested GPU resources. If the requested GPU resources
ποΈ Inference Endpoints
An Inference Endpoint is a hosted service to perform inference tasks such as