24 Guides

Technical Pillars of the Agentic Dev Loop

Each guide covers a real problem you will hit when building an agentic dev system. Practical, technical, and implementation-oriented.

Execution Environments

Where do humans run code? Where do agents run code? When is execution local vs remote?

Ephemeral Runtimes

Should every task run in a fresh environment? How long do workspaces live? What state is allowed to persist?

Reproducible Toolchains

How do we guarantee identical behavior across runs? How do we eliminate snowflake laptops?

Identity, Secrets & Trust Boundaries

Who is an agent, cryptographically? What credentials can it hold? How long do identities live?

Task Routing & Orchestration

Which agent handles which task? How is concurrency controlled? How are failures retried?

Who Writes Code?

When are agents allowed to author code? How is authorship recorded? Who owns the result?

Who Opens Pull Requests?

Can agents open PRs directly? How are branches named and scoped? How are tasks linked to specs?

Code Review in an Agentic World

Which reviews are automated? What do humans still review? How is drift prevented?

CI-Native Agents

How do agents integrate with pipelines? When do they run? How are flaky behaviors detected?

Policy & Guardrails

What actions are forbidden? What violations are blocking? How is drift detected early?

Human-in-the-Loop Design

Where must humans approve? How are confidence thresholds set? How are overrides handled?

Observability for the Dev Loop

What do we log? How do we trace actions across systems? How do we measure quality and cost?

Audit Trails & Provenance

Who did what? In which environment? With which credentials?

Writing Specs & Prompts

How do you write specs agents can execute against? When is a prompt enough vs. a structured charter?

Local-First Dev Environments

How do you set up a local environment that agents can also use? What does the inner loop look like?

Handing Off Environments to Agents

How do you swap your running dev environment with an agent? How does context transfer work?

Ingress & Public Access

How do agents access running services? How do you expose local or ephemeral environments to the outside?

Agent-in-the-Loop Implementation

How does an agent participate during the implementation phase? What does the human-agent handshake look like?

The Agentic Build Loop

How do you use agents to build agents? What does the meta-loop look like?

Reproduce Steps

Before any implementation begins, how do you verify the bug or behavior you are trying to fix actually exists?

Expect & Demo Steps

After implementation, how do you verify the change works? What does the acceptance loop look like?

Artifacts of Success & Failure

What evidence gets produced? Screenshots, logs, test results, diffs — how do you capture and store proof?

PR Review in Practice

What does a real agent-authored PR review look like? Who reviews what, and when does a human step in?

Building with the Claude Agent SDK

How do you use the Claude Agent SDK to wire up agentic workflows? Patterns, gotchas, and practical examples.

Recommended Starting Point

Begin with the foundations

Start with Execution Environments and Ephemeral Runtimes to establish the runtime foundation for agent-native systems.

Start with Guide 01