☸️Kubernetes

Performance Tuning in Kubernetes

Updated 2026-05-15

10 min read

Performance Tuning in Kubernetes

Introduction

Kubernetes is a powerful platform for managing containerized applications, but its performance can be influenced by various factors. Optimizing Kubernetes involves tuning both the cluster configuration and the application deployment to ensure efficient resource utilization and high availability. This tutorial will cover several tips and tricks to help you optimize your Kubernetes environment.

Concept

Resource Requests and Limits

Resource requests and limits are crucial for managing how much CPU and memory a container can use. Setting appropriate values ensures that containers have the resources they need without overcommitting them, which can lead to performance issues.

Requests: The minimum amount of CPU and memory required by a container.
Limits: The maximum amount of CPU and memory a container can consume.

Example

apiVersion: v1
kind: Pod
metadata:
  name: example-pod
spec:
  containers:
  - name: example-container
    image: nginx
    resources:
      requests:
        memory: "64Mi"
        cpu: "250m"
      limits:
        memory: "128Mi"
        cpu: "500m"

Horizontal Pod Autoscaler (HPA)

The HPA automatically scales the number of pods in a deployment or replica set based on observed CPU utilization or other select metrics.

Example

apiVersion: autoscaling/v2beta2
kind: HorizontalPodAutoscaler
metadata:
  name: example-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: example-deployment
  minReplicas: 1
  maxReplicas: 10
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 50

Node Affinity and Anti-Affinity

Node affinity allows you to control which nodes your pods can be scheduled on based on labels. This can help optimize performance by ensuring that certain workloads are placed on specific types of nodes.

Example

apiVersion: v1
kind: Pod
metadata:
  name: example-pod-with-affinity
spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: kubernetes.io/e2e-az-name
            operator: In
            values:
            - e2e-az1
            - e2e-az2
  containers:
  - name: example-container
    image: nginx

Pod Disruption Budgets (PDB)

PDBs ensure that a specified number of pods continue to run during voluntary disruptions, such as rolling updates or maintenance. This helps maintain application availability and performance.

Example

apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: example-pdb
spec:
  minAvailable: 2
  selector:
    matchLabels:
      app: example-app

Examples

Optimizing Resource Requests and Limits

To optimize resource requests and limits, monitor your application's CPU and memory usage. Adjust the values based on observed metrics to ensure that containers have enough resources without overcommitting.

kubectl top pod <pod-name>

Configuring Horizontal Pod Autoscaler

To configure an HPA, define the target CPU utilization or other metrics and set appropriate min and max replicas for your deployment.

kubectl get hpa

Using Node Affinity

To use node affinity, label your nodes appropriately and configure your pod specifications to match those labels.

kubectl label nodes <node-name> kubernetes.io/e2e-az-name=e2e-az1

Setting Pod Disruption Budgets

To set a PDB, define the minimum number of pods that must be available during disruptions.

kubectl get pdb

What's Next?

After optimizing performance, consider exploring cost optimization strategies to further reduce your Kubernetes infrastructure costs. This might include using spot instances, resizing clusters, or implementing efficient resource management practices.

By following these tips and tricks, you can significantly enhance the performance of your Kubernetes environment, ensuring that your applications run efficiently and reliably.

☸️Kubernetes

Performance Tuning in Kubernetes

Updated 2026-05-15

10 min read

Performance Tuning in Kubernetes

Introduction

Concept

Resource Requests and Limits

Requests: The minimum amount of CPU and memory required by a container.
Limits: The maximum amount of CPU and memory a container can consume.

Example

apiVersion: v1
kind: Pod
metadata:
  name: example-pod
spec:
  containers:
  - name: example-container
    image: nginx
    resources:
      requests:
        memory: "64Mi"
        cpu: "250m"
      limits:
        memory: "128Mi"
        cpu: "500m"

Horizontal Pod Autoscaler (HPA)

The HPA automatically scales the number of pods in a deployment or replica set based on observed CPU utilization or other select metrics.

Example

apiVersion: autoscaling/v2beta2
kind: HorizontalPodAutoscaler
metadata:
  name: example-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: example-deployment
  minReplicas: 1
  maxReplicas: 10
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 50

Node Affinity and Anti-Affinity

Example

apiVersion: v1
kind: Pod
metadata:
  name: example-pod-with-affinity
spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: kubernetes.io/e2e-az-name
            operator: In
            values:
            - e2e-az1
            - e2e-az2
  containers:
  - name: example-container
    image: nginx

Pod Disruption Budgets (PDB)

PDBs ensure that a specified number of pods continue to run during voluntary disruptions, such as rolling updates or maintenance. This helps maintain application availability and performance.

Example

apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: example-pdb
spec:
  minAvailable: 2
  selector:
    matchLabels:
      app: example-app

Examples

Optimizing Resource Requests and Limits

kubectl top pod <pod-name>

Configuring Horizontal Pod Autoscaler

To configure an HPA, define the target CPU utilization or other metrics and set appropriate min and max replicas for your deployment.

kubectl get hpa

Using Node Affinity

To use node affinity, label your nodes appropriately and configure your pod specifications to match those labels.

kubectl label nodes <node-name> kubernetes.io/e2e-az-name=e2e-az1

Setting Pod Disruption Budgets

To set a PDB, define the minimum number of pods that must be available during disruptions.

kubectl get pdb

What's Next?

By following these tips and tricks, you can significantly enhance the performance of your Kubernetes environment, ensuring that your applications run efficiently and reliably.