Concepts
This section of the documentation helps you learn about the components, APIs and abstractions that Kueue uses to represent your cluster and workloads.
APIs
Resource Flavor
An object that you can define to describe what resources are available
in a cluster. Typically, a ResourceFlavor
is associated with the characteristics
of a group of Nodes. It could distinguish among different characteristics of
resources such as availability, pricing, architecture, models, etc.
Cluster Queue
A cluster-scoped resource that governs a pool of resources, defining usage limits and fair sharing rules.
Local Queue
A namespaced resource that groups closely related workloads belonging to a single tenant.
Workload
An application that will run to completion. It is the unit of admission in Kueue. Sometimes referred to as job.
Workload Priority Class
WorkloadPriorityClass
defines a priority class for a workload,
independently from pod priority.
This priority value from a WorkloadPriorityClass
is only used for managing the queueing and preemption of Workloads.
Admission Check
A mechanism allowing internal or external components to influence the timing of workloads admission.
Topology Aware Scheduling
A mechanism allowing to schedule Workloads optimizing Pod placement for network throuput between the Pods.
Glossary
Quota Reservation
Quota reservation is the process during through which the kueue scheduler locks the resources needed by a workload within the targeted ClusterQueues ResourceGroups
Quota reservation is sometimes referred to as workload scheduling or job scheduling, but it should not to be confused with pod scheduling.
Admission
Admission is the process of allowing a Workload to start (Pods to be created). A Workload
is admitted when it has a Quota Reservation and all its AdmissionCheckStates
are Ready
.
Cohort
A cohort is a group of ClusterQueues that can borrow unused quota from each other.
Queueing
Queueing is the state of a Workload since the time it is created until Kueue admits it on a ClusterQueue. Typically, the Workload will compete with other Workloads for available quota based on the fair sharing rules of the ClusterQueue.
Preemption
Preemption is the process of evicting one or more admitted Workloads to accommodate another Workload. The Workload being evicted might be of a lower priority or might be borrowing resources that are now required by the owning ClusterQueue.
Feedback
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.