A two-agent system is the smallest possible distributed system. Here's what changes when you cross that line.
The first agent you ship is software. The second is infrastructure. The moment you add a second agent that talks to the first, you've built a distributed system -- and every fallacy of distributed computing applies, plus a few new ones the AI layer adds on top.
This guide covers when a fleet is the right answer (it usually isn't), the four orchestration patterns when it is, the distributed-systems fallacies that apply doubly to agents, and the failure modes that show up only after launch. The full version is the early-access A2A in Production book.
Most "we should add another agent" instincts are wrong. The right shape for most problems is one bigger agent with more tools -- not because multi-agent is bad, but because:
The honest test: can a single agent with the union of all the tools you'd give the fleet handle this? If yes, don't build a fleet.
The patterns where multi-agent wins:
One agent receives the request and dispatches to a specialist. The router is the load-bearing piece -- if it routes wrong, the specialist does the wrong work confidently. The pattern earns its keep when specialists are genuinely different (different tools, different auth, different memory).
One agent owns the task end-to-end and decomposes it into subtasks for specialists. The supervisor sees the whole arc; the specialists each see their slice. The supervisor integrates results. Right for tasks that need plan-then-execute.
Multiple agents work in parallel; an aggregator combines results. No central orchestrator; each specialist is independent. Right for embarrassingly parallel work where one agent's result doesn't depend on another's.
Agents read from and write to a shared workspace; the next agent picks up where the previous left off. No central orchestrator; coordination is via shared state. Right for collaborative workflows where the shape of the work emerges from the interaction.
A2A in Production (early access) is twelve chapters on the discipline of multi-agent systems as distributed systems first and prompt-engineering second. The book covers the orchestration patterns in depth, the auth question, federated memory, observability across boundaries, partial failure under load, and when not to use A2A at all.
A2A in Production
The book on multi-agent systems and agent orchestration. Twelve chapters drafted; readable end to end as of v1.0.0-early.2. PDF + EPUB. Free updates through v1.0 and beyond. $39 (early access), secure checkout.
Read more & buy $39 →Published by Yaw Labs.