Last Updated: 2024-06-25 13:04
This document covers some detailed cluster configurations that you may find useful in a production environment.
HyperShift with KubeVirt offers excellent flexibility: the hosted control plane pods can run on the same nodes where the worker VMs are scheduled. This is great for resource utilization, but sometimes you may want to isolate the hosted control planes on dedicated infra nodes and place the worker VMs elsewhere. To do that, we can add node selectors to the HyperConverged object, which lets us schedule these components onto the nodes that carry matching labels.
Hosted control planes run as ordinary pods, which the scheduler does not differentiate from the virt-launcher pods. Using the node-selector from the HyperShift create command alone is therefore not enough: it schedules the hosted control plane pods onto the designated nodes, but it does not prevent virt-launcher pods from being scheduled on those same nodes. In this example we use two labels:
- kubernetes.io/infra-virtualization=virt-components
- kubernetes.io/workers=vm-workload
oc patch HyperConverged kubevirt-hyperconverged -n openshift-cnv -p '{"spec":{"infra":{"nodePlacement":{"nodeSelector":{"kubernetes.io/infra-virtualization":"virt-components"}}}}}' --type merge
oc patch HyperConverged kubevirt-hyperconverged -n openshift-cnv -p '{"spec":{"workloads":{"nodePlacement":{"nodeSelector":{"kubernetes.io/workers":"vm-workload"}}}}}' --type merge
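To confirm the patches landed, you can read back the node placement (a quick check, not required):
oc get hyperconverged kubevirt-hyperconverged -n openshift-cnv -o jsonpath='{.spec.infra.nodePlacement}{"\n"}{.spec.workloads.nodePlacement}{"\n"}'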
Label workers 011 to 013 for the virtualization operator components:
for i in {011..013}; do oc label node worker"$i"-1029p kubernetes.io/infra-virtualization=virt-components;done
Label workers 000 to 010 for VM workloads:
for i in {000..010}; do oc label node worker"$i"-1029p kubernetes.io/workers=vm-workload;done
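A quick sanity check that both label sets are in place:
oc get nodes -l kubernetes.io/infra-virtualization=virt-components
oc get nodes -l kubernetes.io/workers=vm-workload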
Then we can pass the --node-selector param to the HyperShift create command:
hypershift create cluster kubevirt \
--name "$cluster" \
--node-pool-replicas 3 \
--control-plane-availability-policy 'HighlyAvailable' \
--node-selector "kubernetes.io/infra-virtualization=virt-components" \
--etcd-storage-class "local-worker011-013" \
--memory '8Gi' \
--cores '8' \
--release-image "$image" \
--pull-secret "$pull_secret"
This will place the hosted control plane pods on nodes 011-013 and the virt-launcher pods (worker VMs) on nodes 000-010.
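You can verify the placement from the management cluster. This assumes the hosted control plane namespace follows the clusters-<name> pattern used elsewhere in this document:
oc get pods -n "clusters-$cluster" -o wide
oc get vmi -n "clusters-$cluster"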
To make the etcd pods span three different nodes when the HighlyAvailable param is passed, we need to add the zone label:
for i in {013..011}; do oc label node worker"$i"-1029p topology.kubernetes.io/zone=zone-${i:2:1};done
Label the corresponding nodes with a role just to make them more visible when we examine them:
for i in {013..011}; do oc label node worker"$i"-1029p node-role.kubernetes.io/zone-${i:2:1}=;done
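A quick check that the zone labels took effect (the role shows up in the default ROLES column):
oc get nodes -L topology.kubernetes.io/zone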
To check which OCP versions the installed HyperShift operator supports:
oc extract configmap/supported-versions -n hypershift --to=- | jq
There are many storage options available.
Below is an example of configuring a storage class and PVC within the hosted cluster. Note this is only a temporary workaround for testing on 4.12.x.
oc --kubeconfig guest-kubeconfig create -f - <<EOF
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: kv-00-sc
  annotations:
    storageclass.kubernetes.io/is-default-class: "true"
provisioner: csi.kubevirt.io
parameters:
  infraStorageClassName: ocs-storagecluster-ceph-rbd
  bus: scsi
EOF
oc --kubeconfig guest-kubeconfig create -f - <<EOF
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: kv-00
spec:
  accessModes:
    - ReadWriteOnce
  storageClassName: kv-00-sc
  resources:
    requests:
      storage: 5Gi
  volumeMode: Filesystem
EOF
Once the storage class and PVC are created, the corresponding PVs/PVCs will be mirrored into the infra/mgmt cluster.
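For example, you can look for the mirrored objects from the mgmt side; this assumes they land in the hosted control plane namespace, clusters-kv-00 in this example:
oc get pvc,pv -n clusters-kv-00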
By default, the HyperShift operator is configured to capture only telemetry metrics. Edit the HyperShift operator deployment to set the METRICS_SET value to All:
oc patch deployment.apps/operator -n hypershift --type='json' -p='[{"op": "replace", "path": "/spec/template/spec/containers/0/env/2/value", "value": "All"}]'
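The patch above assumes METRICS_SET is the third env entry (index 2) of the operator container; if your deployment differs, list the env names first and adjust the index:
oc get deployment.apps/operator -n hypershift -o jsonpath='{.spec.template.spec.containers[0].env[*].name}'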
We might have multiple hosted clusters running on top of the management cluster, so it can be easier to manage all the metrics with the management cluster's Prometheus. To do so, edit the HyperShift operator deployment to enable OCP cluster monitoring:
oc patch deployment.apps/operator -n hypershift --type='json' -p='[{"op": "replace", "path": "/spec/template/spec/containers/0/args", "value": ["run", "--namespace=$(MY_NAMESPACE)", "--pod-name=$(MY_NAME)", "--metrics-addr=:9000", "--enable-ocp-cluster-monitoring=true", "--enable-ci-debug-output=false", "--private-platform=None", "--cert-dir=/var/run/secrets/serving-cert", "--enable-uwm-telemetry-remote-write"]}]'
This will add the label openshift.io/cluster-monitoring=true to all hosted cluster namespaces; metrics from those namespaces will then be available in the mgmt Prometheus database.
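To confirm, list the namespaces that carry the label:
oc get ns -l openshift.io/cluster-monitoring=true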
Note that a bug fix was merged by this PR; you might not be able to see etcd server metrics without it. We verified the availability of etcd server metrics on 4.12.8.
Prometheus stores metrics on ephemeral storage by default, so all metrics data is lost once the Prometheus pods are restarted or terminated. The following steps show how to manually provision PVs and use a ConfigMap so Prometheus consumes the manual storage class:
On both machines, format the disk and mount it:
sudo mkfs.ext4 /dev/nvme0n1 && sudo mkdir -p /var/promdb && sudo mount /dev/nvme0n1 /var/promdb
Create a PV for each machine (the example below is for worker027; repeat with the name and nodeAffinity values adjusted for worker026):
apiVersion: v1
kind: PersistentVolume
metadata:
  name: worker027-1029p
spec:
  capacity:
    storage: 894Gi
  volumeMode: Filesystem
  accessModes:
    - ReadWriteOnce
  persistentVolumeReclaimPolicy: Retain
  storageClassName: promdb-pv
  local:
    path: /var/promdb
  nodeAffinity:
    required:
      nodeSelectorTerms:
        - matchExpressions:
            - key: kubernetes.io/hostname
              operator: In
              values:
                - worker027-1029p
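A minimal sketch of applying the two manifests, assuming they are saved as pv-worker026.yaml and pv-worker027.yaml (hypothetical filenames):
oc create -f pv-worker026.yaml -f pv-worker027.yaml
oc get pv worker026-1029p worker027-1029p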
Label both nodes so the scheduler knows where to place the Prometheus pods:
oc label node worker026-1029p worker027-1029p nodePool=promdb
oc label node worker026-1029p worker027-1029p node-role.kubernetes.io/promdb=
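Both nodes should now show up under the label:
oc get nodes -l nodePool=promdb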
Create the ConfigMap:
apiVersion: v1
kind: ConfigMap
metadata:
  name: cluster-monitoring-config
  namespace: openshift-monitoring
data:
  config.yaml: |
    prometheusK8s:
      retention: 36w
      nodeSelector:
        nodePool: promdb
      volumeClaimTemplate:
        metadata:
          name: prometheusdb
        spec:
          storageClassName: promdb-pv
          resources:
            requests:
              storage: 894Gi
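After applying the ConfigMap (e.g. oc apply -f cluster-monitoring-config.yaml, a hypothetical filename), check that the Prometheus pods land on the promdb nodes and bind the manual PVs; the pod label selector below is an assumption that may vary by OCP version:
oc get pods -n openshift-monitoring -l app.kubernetes.io/name=prometheus -o wide
oc get pvc -n openshift-monitoring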
To access the hosted cluster console, we need to retrieve the kubeadmin password of the hosted cluster (different from the base cluster).
Use the following commands to generate the guest cluster kubeconfig, the kubeadmin password, and the console URL. Note you may need sshuttle
for port forwarding when accessing the console from a machine other than the bastion (see the sketch after the commands).
NAMESPACE="clusters"
KUBEVIRT_CLUSTER_NAME=kv-00
hypershift create kubeconfig --name="$KUBEVIRT_CLUSTER_NAME" > "${KUBEVIRT_CLUSTER_NAME}-kubeconfig"
kubeadmin_password=$(oc get secret -n "$NAMESPACE-$KUBEVIRT_CLUSTER_NAME" kubeadmin-password --template='{{.data.password | base64decode}}')
echo "kubeadmin password: $kubeadmin_password"
echo "https://$(oc --kubeconfig=${KUBEVIRT_CLUSTER_NAME}-kubeconfig -n openshift-console get routes console -o=jsonpath='{.spec.host}')"