The discipline guide to running multi-agent systems after the v0. Twelve chapters on orchestration, auth across agent boundaries, federated memory, and partial failure - the work that separates a fleet that ships from a fleet that falls over. Early access. PDF + EPUB + free updates. $39.
The moment a single-agent system becomes a two-agent system is where the engineering changes shape, not just the deployment topology. The temptation is to think of it as a small step. We have one agent. We're adding another. They'll talk via HTTP. Done.
The temptation is wrong. A two-agent system isn't a slightly-bigger one-agent system; it's the smallest possible distributed system. The fallacies of distributed computing -- the network is reliable, latency is zero, the topology doesn't change -- apply the same way they always have. The protocol is the easy part. Auth across hops, scope across agents, provenance through the telephone game, partial failure, observability that doesn't fall apart at the agent boundary -- these are the parts that cost you a quarter to discover and a year to retrofit.
Twelve chapters on multi-agent systems as distributed systems first and prompt-engineering second. Discipline-first, opinionated, war-stories. Early access -- all twelve chapters drafted and readable end to end as of v1.0.0-early.2; the early-access label remains in place through v1.0 while reader feedback shapes revisions.
Early access. PDF + EPUB. Free updates as chapters land. Secure checkout. Read Chapter 1 free.
Every one of these is a failure mode that didn't exist in the single-agent system and shows up the moment you add a second agent. The book gives you the architectural pattern, not the plug-and-play library.
Status: chapters marked [DRAFT] are finished and readable today; chapters marked [ADAPTED] are rewritten from the seeded Agent Memory in Production material into A2A-shape. All twelve chapters are drafted as of v1.0.0-early.2. The early-access label remains in place through v1.0 while reader feedback shapes revisions; buyers get every update at no extra cost.
You have shipped at least one agent to production. You know what an LLM call looks like, you've integrated tools, you've handled at least one production incident with an agent in it. You're now standing at the moment where one agent is becoming two -- and you can feel that the second agent isn't a small step. You're somewhere between mid and senior on the IC ladder, or a tech lead who needs to make architectural calls about whether to add an agent or add a tool.
You don't need to know the A2A spec by heart. The book references it (and the alternatives) where they constrain decisions. You do need to know what a distributed-systems fallacy looks like -- "the network is reliable" should ring a bell. If it doesn't, get a copy of Designing Data-Intensive Applications first; this book treats the distributed-systems half of multi-agent systems as load-bearing prior knowledge.
Not for: tutorials on building your first agent (Vol II covers operating an agent; the provider docs cover the API basics), protocol specifications (the A2A and MCP specs are documents in their own right), vendor comparison spreadsheets (the orchestration tooling moves quarterly), or pure prompt-engineering treatments (prompts matter; they aren't where multi-agent systems break in production).
A2A in Production is Volume IV of the Yaw Labs Production Series. Volume I, MCP in Production, is the protocol-and-server perspective on Model Context Protocol. Volume II, Claude Code in Production, is the operator's perspective on running an agent. Volume III, Semantic Search in Production, is the substrate the agent reaches into when it needs to find something. Volume IV is what happens when one agent becomes a fleet -- the discipline of multi-agent systems as their own engineering problem.
YawLabs/a2a-in-production-companion repo - starter code, exercises, and worked solutions at module-N-final tags, filling in by reader pull as chapters land.v1.0.0-early.1 launched with four chapters readable: Chapter 1 plus the three adapted state chapters in Part 4. v1.0.0-early.2 fills in the remaining eight (the protocol landscape, the orchestration shapes, the trust-between-agents chapters, observability, cost-latency-and-failure, and the closing forecasting chapter), so the book is now readable end to end. The early-access label remains in place through v1.0 -- the chapters are drafts, the companion repo is still being filled in tag by tag, and reader feedback is shaping revisions. The early-access price reflects that posture; you get every revision through v1.0 and beyond at no extra cost. If something in the book is wrong, unclear, or missing, file an issue against the companion repo -- early-access readers shape the priority more than the original outline does.
No. Each volume in the Production Series stands on its own. Volume IV is about multi-agent systems; if you're not building MCP servers, running Claude Code, or shipping semantic search, you don't need the other three to make sense of this one. The chapters that draw on prior-volume material (e.g. observability building on Vol II's event-log work) summarize what they need at the point of use.
No. The Google-authored A2A spec is one protocol option among several, and Chapter 2 covers it alongside LangGraph hand-offs, OpenAI Swarm-style routing, and raw HTTP between agents. The book is opinionated about which one to pick for which shape of problem, but the disciplines (auth, scope, provenance, observability, partial failure) apply across protocol choices. If you swap A2A for LangGraph next year, the chapters still apply.
Not yet, and especially not during early access -- the book is changing as chapters land, and a print edition only makes sense once v1.0 is feature-complete. The digital version is the canonical living one and keeps getting updates either way.
You enter your GitHub username at checkout. The order webhook fires an invite to that user, adding you as a collaborator to the private companion repo. You should see an email from GitHub with the accept-invitation link within a few minutes. If you don't get the invite within an hour, email contact@yaw.sh with the order ID and the GitHub username you want invited.
Buy A2A in Production (early access)
Twelve chapters drafted and readable end to end. PDF + EPUB. Free updates through v1.0 and beyond. $39 one-time, secure checkout.
Companion volumes: MCP in Production, Claude Code in Production, and Semantic Search in Production. Together they cover the agentic-tooling stack: protocol, operator, retrieval substrate, and now the multi-agent fleet.