learnclaude .dev
All courses

Paid course · £150

DevOps / Cloud Platform ready

28 lessons 560 min

An outcome-driven course for working engineers moving into a DevOps or Cloud Platform role — but with a specific modern angle: you don't just learn the domain, you learn it through Claude Code and leave with a library of seven reusable Claude skills you built and validated yourself. Seven modules, each ending in a hands-on task you drive through Claude on your own laptop. The payoff is cutting time-to-response and time-to-action when it matters: when a page comes in, you're reaching for a skill you already built and validated, not starting from a blank terminal. No theory you don't immediately apply, and no copy-pasting without reading.

By the end of the course you can, driving through Claude Code and understanding every file it writes:

  1. Deploy a containerised service to a Kubernetes cluster using kind locally.
  2. Package that service as a Helm chart with configurable values.
  3. Wire a GitHub Actions pipeline that builds the image, runs tests, pushes to a registry, and deploys.
  4. Provision a cluster with Terraform and know how state drift bites you in real work.
  5. Instrument observability — Prometheus metrics + a Grafana dashboard + a scrape-gap alert.
  6. Debug the three canonical pod failures (CrashLoopBackOff, ImagePullBackOff, OOMKilled) and know the shape of a healthy pod.
  7. Scope a ServiceAccount with least-privilege RBAC and sketch Workload Identity Federation for cloud IAM.

Each module has four lessons: a short concept explainer, a Claude-guided hands-on task with an acceptance test, a skill-building lesson where you codify the session as a reusable .claude/skills/<name>/ skill and validate it in a fresh Claude session against both valid and adversarial inputs, and a recall quiz. Expect ~9 hours of focused work across the course.

How this course works (the three principles you're signing up for)

  • Stay close to your problem. Claude types fast; that is never a license to disengage. Every task lesson requires you to read every file Claude produces before you run it, and to ask Claude to explain anything you don't understand. The skill you build at the end of each module is valuable only because you understood the domain underneath it.
  • Validate what Claude will NOT do, not just what it will. Every skill you produce ends with two sentences: "This skill handles: …" and "This skill does NOT handle: …". A skill that documents only its capabilities is dangerous in the hands of users who don't yet know its limits. You'll test every skill in a fresh Claude session against deliberately wrong inputs to confirm it fails loudly rather than silently doing the wrong thing.
  • Design for realistic work environments. Pristine demos lie. Every module includes a deliberate "break it on purpose" probe — a targetPort mismatch, a missing CI permission, a state-drifted cluster, a silently-broken ServiceMonitor — so the failure modes your skill documents come from lived experience, not guesses. The realistic moment we're designing for is the one with a clock on it: an alert fires, and you need to move from what's broken? to what's the fix? without starting from zero.

What you need before you start

  • A laptop with Docker Desktop (or Podman) installed.
  • Claude Code installed and authenticated — every lesson assumes you can open a Claude Code session in a working directory.
  • kubectl and kind on your PATH. Install steps are in module 1.
  • A GitHub account (used from module 3 onward).
  • A free-tier cloud account with one provider (GCP, AWS, or Azure) for module 4's bonus + module 7's federation sketch.
  • Comfort reading YAML and running terminal commands. Comfort reading what Claude wrote before running it.

What this course is not

  • Not a Kubernetes reference — the point is to build the reflexes, not memorise every resource type.
  • Not a prompt-engineering course — the focus is producing reliable skills that capture your process, not writing cleverer prompts.
  • Not a theory course — you won't get a full explanation of consensus, CNI internals, or the control plane unless a task needs it.
  • Not a "let Claude do the work" course. Every lesson insists you read, understand, and validate what Claude produces. If you're not willing to engage that way, this isn't the right course.
  • Not assessed by a certification — it's assessed by whether you can do the seven tasks on demand and hand over seven working skills that survive a fresh Claude session.
What you leave with 7 reusable Claude Code skills
  1. .claude/skills/kind-cluster-bootstrap/

    stateless HTTP services, single-cluster local development with kind, parameterised image/ports/replicas, preflight for docker, kubectl, and kind, and post-apply sanity checks for port-mapping drift.

  2. .claude/skills/helm-chart-scaffold/

    stateless HTTP services, parameterised image/replicas/port/nodePort, a single-cluster deployment, helm lint + install + post-install sanity check for port-mapping drift.

  3. .claude/skills/github-actions-k8s-deploy/

    Go / Python / Node services (parameterised toolchain setup), GHCR image push with short-lived GITHUB_TOKEN, kind-in-CI deploy via Helm chart, a single smoketest HTTP endpoint, and the packages: write permission most people forget.

  4. .claude/skills/terraform-kind-provision/

    local kind-based Kubernetes provisioning via Terraform, kubeconfig extraction to a local file, pre-apply plan display, post-apply state sanity, and the state-drift recovery pattern.

  5. .claude/skills/prom-grafana-instrument/

    HTTP-service metrics (counters / histograms on {path, code}), kube-prometheus-stack install or reuse, ServiceMonitor creation with selector hygiene, a RED-method Grafana panel, and a scrape-gap alerting rule.

  6. .claude/skills/k8s-pod-debug-triage/

    CrashLoopBackOff (with exit-code evidence), ImagePullBackOff / ErrImagePull, OOMKilled (exit 137), CreateContainerConfigError, healthy pods (explicit Healthy classification), and healthy-with-recent-restart. Classification + evidence only — no auto-fix.

  7. .claude/skills/rbac-iam-scaffold/

    a single namespace's ServiceAccount + Role + RoleBinding for a workload, with per-rule (verbs × resources × optional resourceNames) scoping, default-narrow wildcards, and an executable positive/negative assertion suite (kubectl auth can-i) that proves the scoping.

Each skill is validated in a fresh Claude Code session — happy path and adversarial input — before the module is considered done.

Paid course

Sign in first, then buy access to unlock every module.

Buy access · £150

Lessons

  1. 01

    Module 1 · Concepts — Pods, Deployments, Services, and why `kind`

    doc Sign in
    Open
  2. 02

    Module 1 · Task — Stand up a cluster and deploy podinfo (via Claude)

    doc Sign in
    Open
  3. 03

    Module 1 · Skill — Codify `kind-cluster-bootstrap`

    doc Sign in
    Open
  4. 04

    Module 1 · Recall check

    quiz Sign in
    Open
  5. 05

    Module 2 · Concepts — Helm, templating, and why you'll use charts anyway

    doc Sign in
    Open
  6. 06

    Module 2 · Task — Write a Helm chart for podinfo (via Claude)

    doc Sign in
    Open
  7. 07

    Module 2 · Skill — Codify `helm-chart-scaffold`

    doc Sign in
    Open
  8. 08

    Module 2 · Recall check

    quiz Sign in
    Open
  9. 09

    Module 3 · Concepts — What a CI/CD pipeline actually does

    doc Sign in
    Open
  10. 10

    Module 3 · Task — Ship a Go app through GitHub Actions to a cluster (via Claude)

    doc Sign in
    Open
  11. 11

    Module 3 · Skill — Codify `github-actions-k8s-deploy`

    doc Sign in
    Open
  12. 12

    Module 3 · Recall check

    quiz Sign in
    Open
  13. 13

    Module 4 · Concepts — State, providers, and why Terraform wins

    doc Sign in
    Open
  14. 14

    Module 4 · Task — Provision a cluster with Terraform (via Claude)

    doc Sign in
    Open
  15. 15

    Module 4 · Skill — Codify `terraform-kind-provision`

    doc Sign in
    Open
  16. 16

    Module 4 · Recall check

    quiz Sign in
    Open
  17. 17

    Module 5 · Concepts — Metrics, logs, and the three questions observability answers

    doc Sign in
    Open
  18. 18

    Module 5 · Task — Instrument your service and build a Grafana dashboard (via Claude)

    doc Sign in
    Open
  19. 19

    Module 5 · Skill — Codify `prom-grafana-instrument`

    doc Sign in
    Open
  20. 20

    Module 5 · Recall check

    quiz Sign in
    Open
  21. 21

    Module 6 · Concepts — Reading pod failures like a native speaker

    doc Sign in
    Open
  22. 22

    Module 6 · Task — Reproduce and fix the three canonical pod failures (via Claude)

    doc Sign in
    Open
  23. 23

    Module 6 · Skill — Codify `k8s-pod-debug-triage`

    doc Sign in
    Open
  24. 24

    Module 6 · Recall check

    quiz Sign in
    Open
  25. 25

    Module 7 · Concepts — RBAC, cloud IAM, and Workload Identity Federation

    doc Sign in
    Open
  26. 26

    Module 7 · Task — Scope a ServiceAccount, bind a Role, sketch Workload Identity Federation (via Claude)

    doc Sign in
    Open
  27. 27

    Module 7 · Skill — Codify `rbac-iam-scaffold`

    doc Sign in
    Open
  28. 28

    Module 7 · Recall quiz — RBAC, cloud IAM, and federation

    quiz Sign in
    Open