Create an automation to safely rebuild a cluster node #9

Open
opened 2026-01-17 12:21:43 +00:00 by lab · 0 comments
Owner

Develop an automated workflow to safely rebuild a cluster node without manual intervention. The solution should handle the full lifecycle of the node, including cluster operations, VM recreation, and rejoining the cluster.

Requirements:

  • Cordon and uncordon a node in the cluster
  • Drain workloads from the node safely
  • Delete the existing VM
  • Recreate the VM using Terraform (without manually commenting/uncommenting configuration)
  • Configure SSH access automatically
  • Reinstall k3s and rejoin the node to the cluster

Acceptance criteria:

  • Node can be safely recreated end to end by the automation, with no manual steps
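
As a starting point, the requirements above could be sketched as an ordered command plan. This is only a rough sketch under assumptions, not a settled design: the names `RebuildConfig`, `build_plan` and `run_plan`, the Terraform resource address, the SSH user, and all hosts and tokens in the example are hypothetical. It assumes `terraform apply -replace=ADDRESS` is acceptable for deleting and recreating the VM in one step (which avoids commenting and uncommenting configuration), and that the node rejoins as a k3s agent through the standard `get.k3s.io` install script.

```python
import shlex
import subprocess
from dataclasses import dataclass


@dataclass
class RebuildConfig:
    """All values are placeholders supplied by the caller (hypothetical names)."""
    node: str          # Kubernetes node name
    host: str          # address used for SSH
    tf_address: str    # Terraform resource address of the VM
    server_url: str    # k3s server URL, e.g. https://<server>:6443
    token: str         # k3s join token
    ssh_user: str = "root"


def build_plan(cfg: RebuildConfig) -> list[list[str]]:
    """Return the rebuild steps as argv lists, in execution order."""
    target = f"{cfg.ssh_user}@{cfg.host}"
    # Remote command: install the k3s agent and join the existing server.
    install = (
        "curl -sfL https://get.k3s.io | "
        f"K3S_URL={shlex.quote(cfg.server_url)} "
        f"K3S_TOKEN={shlex.quote(cfg.token)} sh -"
    )
    return [
        # Stop new pods from being scheduled, then evict existing ones.
        ["kubectl", "cordon", cfg.node],
        ["kubectl", "drain", cfg.node, "--ignore-daemonsets",
         "--delete-emptydir-data", "--timeout=300s"],
        # Destroy and recreate the VM in a single Terraform run.
        ["terraform", "apply", f"-replace={cfg.tf_address}", "-auto-approve"],
        # The new VM has a new host key: drop the stale one, accept the new one.
        ["ssh-keygen", "-R", cfg.host],
        ["ssh", "-o", "StrictHostKeyChecking=accept-new", target, install],
        # Allow scheduling on the node again.
        ["kubectl", "uncordon", cfg.node],
    ]


def run_plan(plan: list[list[str]], dry_run: bool = True) -> None:
    """Print each step; execute it only when dry_run is False."""
    for argv in plan:
        print(shlex.join(argv))
        if not dry_run:
            # check=True aborts the rebuild on the first failing step.
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    example = RebuildConfig(
        node="worker-1",
        host="192.0.2.11",
        tf_address="example_vm.worker",
        server_url="https://192.0.2.10:6443",
        token="example-token",
    )
    run_plan(build_plan(example), dry_run=True)
```

The dry run only prints the commands, so the order can be reviewed before anything touches the cluster. The sketch does not wait for SSH to become reachable after the VM is recreated, and depending on the k3s version the stale node object may need to be deleted before the new VM can rejoin under the same name. Both would need handling in a real implementation.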
Reference
lab/homelab#9