khook CLI reference

One binary, seven subcommands. Cluster access follows standard kubeconfig loading rules (KUBECONFIG, ~/.kube/config).

Global flags

Flag	Default	Notes
`--kubeconfig`	standard loading rules	path to a kubeconfig file
`--context`	current context	kubeconfig context to use
`--log-level`	`info`	`debug`, `info`, `warn`, `error` (`debug` includes Helm SDK output)
`--log-format`	`text`	`text` or `json`; logs go to stderr

Spec flags (`apply`, `destroy`, `plan`, `validate`, `graph`, `status`)

Flag	Notes
`-f, --file`	path to the Khook spec (required)
`--set NAME=value`	set a variable (repeatable)
`--var-file vars.yaml`	flat `NAME: value` YAML map
`--var-prefix`	env-var prefix consumed as variables (default `KHOOK_VAR_`)
`--secret-prefix`	env-var prefix consumed as secret variables (default `KHOOK_SECRET_`)

Variable precedence: --set > --var-file > secret env > prefixed env > ${NAME:-default} written in the spec. A ${NAME} with no source and no default fails validation, reporting all missing variables at once.

Secret variables: KHOOK_SECRET_TOKEN=x behaves exactly like KHOOK_VAR_TOKEN=x (it resolves ${TOKEN}), but the value is additionally masked as *** in everything khook prints — logs, plan output, plan --diff rendered manifests, summaries, and error messages. Sprig pipeline outputs of secret variables (e.g. ${TOKEN|b64enc}, see docs/dsl.md) are masked the same way. Masking applies only to khook’s own output; the substituted value still reaches the cluster, and anything that reads the created resources can see it. Masking is a textual best-effort (values split across lines in rendered YAML may not match); prefer keeping secrets out of specs entirely (external-secrets) and reserve this for bootstrap-time secrets.

Commands

`khook apply -f spec.yaml`

Parses, validates, resolves the DAG, then executes steps in parallel levels against the cluster. Output depends on where it runs:

Interactive terminal (stdout is a TTY): one live status line per step in DAG order — pending → running → ok/failed/skipped — with spinner, elapsed time, and retry attempt, redrawn in place. Full error details for failed steps print after the run. Step-level logging is silenced (raised to warn) so it doesn’t garble the display; pass --log-level explicitly to override. NO_COLOR disables colors; TERM=dumb disables live rendering.
Non-interactive (piped, CI): one log line per state change and a final summary table (step, type, status ok/failed/skipped, attempts, duration, detail — the error for failures, the reason for skips).
-o, --output json: instead of the table, the final results print as one JSON document on stdout — run name, run status (ok/failed), and per-step name, type, status, attempts, durationMs, error, skipReason. Logs still stream to stderr, so khook apply -o json 2>/dev/null is clean JSON for CI. Exit codes are unchanged.

When the spec enables state:, apply also maintains the run-state record: it loads the record Secret before the first step (proving it is writable — an unwritable record fails the run up front), skips steps a previous run completed whose inputs are unchanged — the comparison is per step, so editing one step re-runs only that step (summary/JSON show resumed steps skipped with reason unchanged since it succeeded in a previous run (state record)) — journals every step outcome as it happens, and stamps the final run status. A run whose steps all succeed but whose record cannot be written exits 1 — opting into state makes the journal part of the contract.

`khook destroy -f spec.yaml`

Tears down what the spec created, walking the DAG in reverse dependency order: each step’s resources are removed before the resources of the steps it needs, and independent branches tear down in parallel. Built for dev clusters and CI environments; output modes (live progress, summary table, -o json), retries, timeouts, and onError semantics are the same as apply.

What each step type tears down:

Step type	Teardown
`helm:`	uninstalls the release (waits until its resources are gone)
`apply:`	deletes the objects its manifests describe, last manifest first, and waits until each is gone
`job:`	deletes the step’s Job and its pods (refuses a Job not managed by khook)
`delete:` / `patch:`	skipped — khook does not restore deleted resources or revert patches
`wait:` / `rollout:`	skipped — nothing was created

Semantics worth knowing:

Idempotent: resources that are already gone count as success, so a destroy can be re-run after a partial failure and finishes the job.
A step whose when: condition is false is skipped, exactly as in apply (pass the same --set/env so the same steps are in play). Skipped steps of every kind still satisfy the teardown ordering.
Namespaces created via createNamespace: true are left in place — they may hold resources khook did not create. Delete the cluster (dev) or the namespaces themselves if you want them gone.
When the spec enables state:, a fully successful destroy also deletes the record Secret, so the next apply re-converges from scratch instead of resuming into an empty cluster. A teardown that succeeds but cannot remove the record exits 1.
A failed step stops new teardown work (onError: fail default); steps whose teardown depended on it are reported skipped, and the exit code is 1.

`khook plan -f spec.yaml`

Shows what apply would do. Prints the execution plan — levels, step types, one-line action summaries, needs edges — and checks every step against the live cluster (reads only; plan never mutates anything):

Step type	Predicted action
`helm:`	`install` (no release history), `upgrade` (shows current revision, chart version, status → target chart), or `skip` (`skipIf: installed`)
`apply:`	`create` / `configure`, listing which objects are new vs existing, or `skip` (`skipIf: exists`)
`delete:`	`delete` (named object or selector match count) or `no-op` (already absent)
`patch:`	`configure` (target exists) or `unknown` (target must exist by the time the step runs)
`wait:`	`no-op` when the condition already holds, `wait` otherwise (shows how many objects currently match)
`rollout:`	`restart`, `no-op` (rollout already complete), or `wait`
`job:`	`run` (first run or replacing a previous Job) or `skip` (`skipIf: succeeded`)

A step whose when: condition is false is reported skip (with the condition) without touching the cluster — --offline shows it too. Steps that can’t be assessed yet — e.g. a CRD or namespace an earlier step creates, a missing values file — are reported as unknown with the reason, not treated as errors. A closing Plan: line totals the actions. Variables are resolved, so the plan shows final values.

--diff adds kubectl-diff-style unified object diffs under each step that would change something (install/upgrade/create/configure):

apply: steps send each manifest as a server-side dry-run of the same request apply would make (create, merge patch, or server-side apply), so the diff includes server defaulting and admission effects. Managed fields are hidden, like kubectl diff.
helm: steps dry-run render the chart (server dry-run: real cluster capabilities, nothing stored) and diff it against the manifest of the release’s last revision. An install diffs against empty — all additions.
patch: steps send the patch as a server dry-run and diff the result against the live object.
delete: / wait: / rollout: / job: steps have no rendered objects; the plan line already says what happens.

An unchanged step prints diff: no changes — a re-run of an already-applied spec shows no diffs at all. A step whose diff can’t be computed (chart download failure, kind not on the cluster yet) prints diff: unavailable with the reason, and the plan still succeeds. Like plan itself, --diff never mutates the cluster; it cannot be combined with --offline.

--offline skips all cluster access and prints the DAG-only plan (no kubeconfig needed — useful in CI). An unreachable cluster without --offline exits 1.

`khook validate -f spec.yaml`

Parse + validation only (variables, schema, action keys, DAG cycles). No cluster access. Prints all problems at once, not just the first.

`khook status -f spec.yaml`

Reads the spec’s run-state record (state:) and shows the last run — read-only, no mutations. The record’s location is derived from the spec exactly as apply derives it (same variables, same defaulting), so point status at the same spec with the same --set/env. See the DSL page for when enabling state is worth it.

spec:    prod-bootstrap
record:  secret kube-system/khook-state-prod-bootstrap (khook v0.3.0)
run:     failed, started 2026-07-04T10:00:00+03:00, updated 2026-07-04T10:04:12+03:00
spec has changed since this run — 1 unchanged completed step(s) still resume on the next apply

STEP     TYPE   STATUS   ATTEMPTS  DURATION  DETAIL
cni      helm   ok       1         1m12s
ingress  helm   failed   3         2m40s     context deadline exceeded
smoke    job    skipped                      needs "ingress" which did not succeed

The summary line predicts the next apply: which completed steps still resume under the per-step change detection. A completed step whose inputs changed since it ran is flagged input changed — will re-run in the detail column; one that was edited out of the spec shows no longer in the spec.

A spec that does not enable state: is a validation error (exit 2) — there is no record to read.
No record found exits 0 with a message: “not applied yet” is a valid answer, not a failure. Scripts should use -o, --output json, which prints {"found": false} in that case and {"found": true, "specChanged": ..., "resumableSteps": [...], "record": {...}} otherwise (resumableSteps lists the steps the next apply resume-skips).
A step shown with status - was seeded but never finished — the run crashed or was interrupted while it was in flight; runStatus may also still read running (it is a marker, not a lock).

`khook graph -f spec.yaml`

Emits the step DAG as a diagram for docs and review. No cluster access. Default output is a Mermaid flowchart (renders directly in GitHub Markdown inside a ` ```mermaid ` fence); --format dot emits Graphviz DOT instead (khook graph -f spec.yaml --format dot | dot -Tsvg > dag.svg).

Each node shows the step name and type; edges are the needs relations. Variables are resolved (same spec flags as apply), so a step whose when: condition is false is drawn dashed/gray and labeled skipped. The DAG is validated first — a dependency cycle exits 2, same as validate.

$ khook graph -f examples/multi-app.yaml
flowchart TD
    n0["create-monitoring-namespace (apply)"]
    n1["prometheus (helm)"]
    n2["create-app-namespace (apply)"]
    n3["deploy-sample-app (apply)"]
    n0 --> n1
    n2 --> n3

`khook schema`

Prints the JSON Schema for the spec (draft 2020-12), including the constraints the type reflection alone can’t express: fixed apiVersion/kind, exactly one action key per step, exactly one source key per manifest/values entry, exactly one of restart/status (rollout) and manifests/resource/release (delete), and the onError/patch-type value sets.

The same schema is committed at docs/schema/v1/khook.json for editors (# yaml-language-server: $schema=...); make schema regenerates it, and a unit test fails when it drifts from the spec types. The schema validates specs as authored — ${VAR} references live inside string values and pass through untouched.

`khook version`

Version, commit, build date, platform (set via -ldflags -X github.com/dvrkn/khook/internal/cli.Version=... at release time).

Exit codes

Code	Meaning
0	success
1	execution failure (a step failed, cluster unreachable)
2	validation failure (bad spec, unresolved variables, DAG cycle, bad flags)

Execution semantics

Steps run in topologically sorted parallel levels. Per-step timeout bounds each attempt; retries/retryDelay control re-attempts; onError: fail (default) lets running steps finish but starts nothing new, onError: continue keeps scheduling other branches (dependents of the failed step are still skipped). A step whose when: condition is false is skipped but still satisfies its dependents’ needs (see docs/dsl.md). Skipped steps and the reason always appear in the summary. Ctrl-C cancels the run; steps not yet started are reported skipped.

Re-running the same spec is safe: helm: upgrades instead of installing, apply: patches existing resources, delete: treats absent resources as success (ignoreNotFound defaults true), job: replaces the previous run’s Job, and each type’s skipIf predicate (installed / exists / succeeded) short-circuits steps whose outcome already holds.

Logging

Structured log/slog output. --log-format json emits one JSON object per line for machine consumption; the summary (table, JSON results, or live progress lines) goes to stdout, logs to stderr. When apply renders live progress on a TTY, the default log level is raised to warn unless --log-level was given explicitly.

khook CLI reference

Global flags

Spec flags (apply, destroy, plan, validate, graph, status)

Commands

khook apply -f spec.yaml

khook destroy -f spec.yaml

khook plan -f spec.yaml

khook validate -f spec.yaml

khook status -f spec.yaml

khook graph -f spec.yaml

khook schema

khook version