Exploring Cluster Events and Troubleshooting
Objective
Delve into the realm of Openshift's cluster events and troubleshooting, illuminating the path to understanding and resolution within the fortress's walls.
Prerequisites
To embark on this journey of exploration and discovery, you'll need:
- Access to an Openshift cluster or a terminal emulator connected to one.
- Basic familiarity with command-line interfaces (CLI) and Kubernetes concepts.
Introduction
As we venture deeper into the fortified confines of the Openshift citadel, we encounter the realm of cluster events and troubleshooting. Here, amidst the ebb and flow of activity, lies a treasure trove of insights and solutions waiting to be unearthed.
Get Cluster Events
Adventurers have the power to unveil the chronicles of the cluster through commands like:
Openshift boasts a monitoring stack anchored by Prometheus, residing within the 'openshift-monitoring' namespace. Powered by the Prometheus platform and Alertmanager, it stands as a sentinel, ever vigilant over the citadel's affairs.Check Node Status and Troubleshooting
Armed with knowledge and determination, explorers can scrutinize the general status of the cluster and delve deeper into its inner workings with commands such as:
oc cluster-info
oc get nodes
oc get node <NODE_NAME> -o jsonpath=*'{"Allocatable:\n"}{.status.allocatable}{"\n\n"}{"Capacity:\n"}{.status.capacity}{"\n"}'
oc get node <NODE_NAME> -o json | jq '.status.conditions'
oc adm node-logs <NODE_NAME> -u <COMPONENT_NAME> --tail 10
# To run binaries in the host's executable path
chroot /host
# To confirm that the services are active
for SERVICES in kubelet crio; do echo ---- $SERVICES ---- ;
systemctl is-active $SERVICES ; echo ""; done
Embark on this journey of discovery, as we illuminate the shadows cast by the veil of uncertainty, and emerge victorious in the quest for knowledge and resolution within the timeless bastion of Openshift's citadel.