Master in Understanding StatefulSets in Kubernetes

When deploying applications on Kubernetes, most use cases involve stateless workloads. However, some applications require each instance to maintain a unique identity, persistent storage, or stable network identifiers. For such cases, Kubernetes provides StatefulSets, a resource designed specifically for managing stateful applications.

What are StatefulSets?

StatefulSets are a Kubernetes workload API object used to manage stateful applications. Unlike Deployments or ReplicaSets, StatefulSets provide guarantees about the order and uniqueness of pod creation, scaling, and deletion.

Key Features of StatefulSets

Stable Network Identity:
- Each pod in a StatefulSet gets a stable hostname that doesn’t change even if the pod is restarted.
- Hostnames follow the pattern: pod-name-[ordinal].
Persistent Storage:
- StatefulSets work seamlessly with persistent volumes (PVs), ensuring that storage is not lost even if pods are terminated or rescheduled.
Ordered Deployment and Scaling:
- Pods are created and terminated in a strict order.
- Kubernetes ensures that one pod is ready before proceeding to the next.
Use Case for Stateful Applications:
- Databases (e.g., MySQL, PostgreSQL).
- Distributed systems (e.g., Cassandra, Kafka).
- Applications requiring unique pod identities (e.g., Redis clusters).

When to Use StatefulSets?

When Each Instance Must Retain Its State: For example, a database where each instance stores different shards of data.
When Applications Depend on Unique Identifiers: Applications that use consistent hostnames or persistent storage tied to specific instances.
When Pods Need Stable, Persistent Storage: Ensuring data is retained even if a pod crashes.

StatefulSet vs Deployment

StatefulSet Example

Let’s deploy a simple example of a StatefulSet in Kubernetes.

Step 1: StatefulSet Manifest

Create a YAML file (statefulset.yaml) for the StatefulSet.

apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: my-statefulset
spec:
  serviceName: "my-service"
  replicas: 3
  selector:
    matchLabels:
      app: my-app
  template:
    metadata:
      labels:
        app: my-app
    spec:
      containers:
      - name: app-container
        image: nginx
        ports:
        - containerPort: 80
        volumeMounts:
        - name: app-data
          mountPath: /usr/share/nginx/html
  volumeClaimTemplates:
  - metadata:
      name: app-data
    spec:
      accessModes: ["ReadWriteOnce"]
      resources:
        requests:
          storage: 1Gi

Step 2: Create a Headless Service

StatefulSets require a headless service for stable network identities. Create a service.yaml file:

apiVersion: v1
kind: Service
metadata:
  name: my-service
spec:
  clusterIP: None
  selector:
    app: my-app
  ports:
  - protocol: TCP
    port: 80
    targetPort: 80

Step 3: Apply the Configuration

Deploy the StatefulSet and the Service.

kubectl apply -f service.yaml
kubectl apply -f statefulset.yaml

Step 4: Verify the StatefulSet

Check the StatefulSet and its pods:

kubectl get statefulsets
kubectl get pods

Expected output for pods:

kubectl get statefulsets
kubectl get pods

Step 5: Test Persistent Storage

Each pod has its own volume. You can verify this by checking the volumes created:

kubectl get pvc

Expected output:

NAME                 STATUS   VOLUME                                     CAPACITY   ACCESS MODES   STORAGECLASS   AGE
app-data-my-statefulset-0   Bound    pvc-1234abcd-5678-efgh-ijkl   1Gi        RWO            standard       3m
app-data-my-statefulset-1   Bound    pvc-5678abcd-1234-ijkl-efgh   1Gi        RWO            standard       2m
app-data-my-statefulset-2   Bound    pvc-9101abcd-5678-ijkl-efgh   1Gi        RWO            standard       1m

Explaining StatefulSets to a Layman

Imagine you’re managing a library system:

StatefulSet: Each library branch has a unique ID (e.g., branch-0, branch-1). The books (data) stored in each branch are unique and remain even if the branch is temporarily closed.
Deployment: Each branch is a pop-up library, and books (data) are not unique or persistent.

Best Practices for StatefulSets

Use Persistent Volumes:
- Always define volumeClaimTemplates for persistent storage.
Use Headless Services:
- Ensure stable networking for StatefulSet pods.
Scale Carefully:
- Pods are created and destroyed in order. Avoid frequent scaling for critical stateful applications.
Monitor Resources:

Stateful applications often require high I/O performance. Optimize resources accordingly.

Conclusion

StatefulSets are a powerful feature in Kubernetes that enable you to run stateful applications efficiently. Whether you’re deploying a database, a distributed cache, or any application requiring stable identifiers and persistent storage, StatefulSets simplify the management of these workloads.

Are you using StatefulSets in your projects? Share your use cases or questions in the comments!

DotNet Full Stack Dev

Search This Blog