24 Guides
Technical Pillars of the Agentic Dev Loop
Each guide covers a real problem you will hit when building an agentic dev system. Practical, technical, and implementation-oriented.
Execution Environments
Where do humans run code? Where do agents run code? When is execution local vs remote?
02Ephemeral Runtimes
Should every task run in a fresh environment? How long do workspaces live? What state is allowed to persist?
03Reproducible Toolchains
How do we guarantee identical behavior across runs? How do we eliminate snowflake laptops?
04Identity, Secrets & Trust Boundaries
Who is an agent, cryptographically? What credentials can it hold? How long do identities live?
05Task Routing & Orchestration
Which agent handles which task? How is concurrency controlled? How are failures retried?
06Who Writes Code?
When are agents allowed to author code? How is authorship recorded? Who owns the result?
07Who Opens Pull Requests?
Can agents open PRs directly? How are branches named and scoped? How are tasks linked to specs?
08Code Review in an Agentic World
Which reviews are automated? What do humans still review? How is drift prevented?
09CI-Native Agents
How do agents integrate with pipelines? When do they run? How are flaky behaviors detected?
10Policy & Guardrails
What actions are forbidden? What violations are blocking? How is drift detected early?
11Human-in-the-Loop Design
Where must humans approve? How are confidence thresholds set? How are overrides handled?
12Observability for the Dev Loop
What do we log? How do we trace actions across systems? How do we measure quality and cost?
13Audit Trails & Provenance
Who did what? In which environment? With which credentials?
14Writing Specs & Prompts
How do you write specs agents can execute against? When is a prompt enough vs. a structured charter?
15Local-First Dev Environments
How do you set up a local environment that agents can also use? What does the inner loop look like?
16Handing Off Environments to Agents
How do you swap your running dev environment with an agent? How does context transfer work?
17Ingress & Public Access
How do agents access running services? How do you expose local or ephemeral environments to the outside?
18Agent-in-the-Loop Implementation
How does an agent participate during the implementation phase? What does the human-agent handshake look like?
19The Agentic Build Loop
How do you use agents to build agents? What does the meta-loop look like?
20Reproduce Steps
Before any implementation begins, how do you verify the bug or behavior you are trying to fix actually exists?
21Expect & Demo Steps
After implementation, how do you verify the change works? What does the acceptance loop look like?
22Artifacts of Success & Failure
What evidence gets produced? Screenshots, logs, test results, diffs — how do you capture and store proof?
23PR Review in Practice
What does a real agent-authored PR review look like? Who reviews what, and when does a human step in?
24Building with the Claude Agent SDK
How do you use the Claude Agent SDK to wire up agentic workflows? Patterns, gotchas, and practical examples.
Recommended Starting Point
Begin with the foundations
Start with Execution Environments and Ephemeral Runtimes to establish the runtime foundation for agent-native systems.
Start with Guide 01