Troubleshooting
This section explains how to investigate common issues encountered during or after Helm chart deployments of the UltiHash cluster. It includes practical examples to help keep the cluster running smoothly and to recover from failures faster.
1. Helm chart install or upgrade failure
Purpose: Resolve failures when installing or upgrading the UltiHash cluster using Helm.
Symptoms:
helm install or helm upgrade hangs or returns an error
Application pods do not start
Helm status is stuck at pending-install or failed
Steps to resolve:
Inspect the Helm release status:
Check for resource creation errors or pending pods:
Describe a failing pod to view events and errors:
Debug with Helm’s dry run mode:
Once the issue has been identified and fixed, proceed with the install or upgrade.
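The steps above can be sketched as the following shell helpers. This is a minimal sketch, not the official procedure: the release name, namespace, chart reference, and values file are placeholders (assumptions that do not come from this page), so substitute the ones used in your deployment.

```shell
# Placeholder settings (assumptions): override them to match your deployment.
RELEASE="${RELEASE:-ultihash}"
NAMESPACE="${NAMESPACE:-default}"
CHART="${CHART:-./ultihash-cluster}"   # hypothetical chart reference
VALUES="${VALUES:-values.yaml}"

# Step 1: show the release status and its revision history.
release_status() {
  helm status "$RELEASE" -n "$NAMESPACE"
  helm history "$RELEASE" -n "$NAMESPACE"
}

# Step 2: list pods and recent events to spot creation errors or pending pods.
list_pods_and_events() {
  kubectl get pods -n "$NAMESPACE"
  kubectl get events -n "$NAMESPACE" --sort-by=.lastTimestamp
}

# Step 3: describe one pod (pod name passed as the first argument).
describe_pod() {
  kubectl describe pod "$1" -n "$NAMESPACE"
}

# Step 4: render and validate the release without changing the cluster.
dry_run_upgrade() {
  helm upgrade --install "$RELEASE" "$CHART" -n "$NAMESPACE" \
    -f "$VALUES" --dry-run --debug
}
```

A typical sequence is to run release_status first, then list_pods_and_events, and then describe_pod with the name of any pod that is shown as Pending or failing.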
Recommendation: Always use --dry-run and --debug to validate changes before applying them in production.
2. Missing or incorrect values in values.yaml
Purpose: Identify and correct configuration errors that prevent proper deployment.
Symptoms:
Helm fails with a rendering error
Application fails at runtime due to missing config (e.g., secrets, ports, env vars)
Steps to resolve:
Compare your values file with the chart defaults:
Test the rendered templates locally:
Reapply the corrected configuration:
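As a rough illustration of these three steps, the helpers below use standard Helm subcommands. The release name, namespace, chart reference, and file names are placeholders (assumptions, not values taken from this page). Note that a plain diff is noisy when the values file only holds overrides, but it still helps spot misspelled or misplaced keys.

```shell
# Placeholder settings (assumptions): override them to match your deployment.
RELEASE="${RELEASE:-ultihash}"
NAMESPACE="${NAMESPACE:-default}"
CHART="${CHART:-./ultihash-cluster}"   # hypothetical chart reference
VALUES="${VALUES:-values.yaml}"

# Step 1: save the chart defaults and diff them against your values file.
compare_with_defaults() {
  helm show values "$CHART" > chart-defaults.yaml
  diff chart-defaults.yaml "$VALUES"
}

# Step 2: render the templates locally; rendering errors surface here
# without anything being sent to the cluster.
render_templates() {
  helm template "$RELEASE" "$CHART" -n "$NAMESPACE" -f "$VALUES"
}

# Step 3: reapply the corrected configuration to the existing release.
apply_values() {
  helm upgrade "$RELEASE" "$CHART" -n "$NAMESPACE" -f "$VALUES"
}
```

Running render_templates before apply_values keeps rendering mistakes from reaching the cluster.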
Recommendation: Use a version-controlled values file and validate changes in a staging environment before rolling out to production.
3. Application pods stuck in CrashLoopBackOff or ImagePullBackOff
Purpose: Diagnose runtime pod failures due to misconfiguration or image issues.
Symptoms:
Pods keep restarting or cannot pull the container image
Steps to resolve:
Inspect the pod state:
Check the logs of the failing pod:
Correct the configuration causing the failure, then upgrade:
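The steps above might look like the following helpers. This is a sketch under stated assumptions: the release name, namespace, chart reference, and values file are placeholders that are not taken from this page, and pod names are passed in as arguments.

```shell
# Placeholder settings (assumptions): override them to match your deployment.
RELEASE="${RELEASE:-ultihash}"
NAMESPACE="${NAMESPACE:-default}"
CHART="${CHART:-./ultihash-cluster}"   # hypothetical chart reference
VALUES="${VALUES:-values.yaml}"

# Step 1: list pods, then describe one pod (first argument) to see its
# events, for example image pull errors behind ImagePullBackOff.
inspect_pod_state() {
  kubectl get pods -n "$NAMESPACE"
  kubectl describe pod "$1" -n "$NAMESPACE"
}

# Step 2: print the logs of the current container and of the previous,
# crashed instance. The second command reports an error if the container
# has not restarted yet.
pod_logs() {
  kubectl logs "$1" -n "$NAMESPACE"
  kubectl logs "$1" -n "$NAMESPACE" --previous
}

# Step 3: after correcting the configuration, upgrade the release.
upgrade_release() {
  helm upgrade "$RELEASE" "$CHART" -n "$NAMESPACE" -f "$VALUES"
}
```

For a pod in CrashLoopBackOff, the output of the --previous call is usually the more informative one, because it comes from the container instance that actually crashed.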
Recommendation: Ensure that image repositories are accessible and secrets for private registries are correctly configured in the cluster.