From Solo Agent to Agent Team
When one brilliant employee isn't enough, build a department
You've built one agent and it's incredible. It handles your morning briefing, writes content, monitors markets, and keeps your memory organized. But now you're hitting a wall: one agent can't do everything well. It's like having one employee who's your receptionist, accountant, marketer, and developer. Eventually, they start dropping balls.
When to Go Multi-Agent (Decision Framework)
Don't add agents just because you can. More agents = more complexity = more cost = more things that can break. Add another agent only when you hit one of these triggers:
Your agent's instructions + memory + current task exceed the context window. It starts "forgetting" parts of its job because there's too much to hold at once.
Your agent needs to be both a careful analyst AND a creative writer AND a blunt code reviewer. These personalities fight each other in a single system prompt.
You need things done simultaneously: research happening while content is being written while code is being reviewed. One agent means sequential, not parallel.
Some tasks need GPT-4/Claude Opus (complex reasoning). Others work fine with GPT-4o-mini/Haiku (simple extraction). One agent = one model tier for everything.
Shared vs. Private Memory
The most critical decision in multi-agent setups: what can each agent see?
workspace/ ├── shared/ # All agents read/write │ ├── MEMORY.md # Shared long-term memory │ ├── inbox/ # Cross-agent communication │ │ ├── research-to-writer.md │ │ └── analyst-to-chief.md │ └── knowledge/ # Shared knowledge base │ ├── agents/ │ ├── researcher/ # Only the researcher sees this │ │ ├── SOUL.md │ │ ├── memory/ │ │ └── scratch/ # Working drafts, raw data │ ├── writer/ # Only the writer sees this │ │ ├── SOUL.md │ │ ├── memory/ │ │ └── drafts/ │ └── analyst/ # Only the analyst sees this │ ├── SOUL.md │ ├── memory/ │ └── data/
Shared memory contains facts everyone needs: project status, user preferences, completed decisions. Private memory contains agent-specific working state: raw research data, draft iterations, intermediate analysis. Think of shared memory as the company wiki and private memory as each employee's notebook.
Communication Patterns
Agents need to talk to each other. There are two main patterns, and they have very different tradeoffs:
Agents write to a shared inbox/ folder. Other agents check the inbox periodically. Async, debuggable, auditable. Like internal memos.
One agent spawns or invokes another directly with a prompt. Faster but harder to debug. Like a phone call — no paper trail unless you create one.
Specialization Patterns
The four agent archetypes that cover 90% of use cases:
Searches the web, reads documents, summarizes findings. Optimized for breadth and accuracy. Model: can use cheaper models for bulk search, expensive for synthesis.
Takes research and turns it into content — blog posts, tweets, reports, emails. Optimized for tone, style, and persuasion. Needs the strongest language model.
Crunches numbers, spots patterns, makes recommendations. Optimized for accuracy and structured reasoning. Lives in spreadsheets and data files.
Executes tasks: deploys code, sends emails, updates databases, runs cron jobs. Optimized for reliability and tool use. Needs the highest trust level.
The "Chief of Staff" Pattern
The most powerful multi-agent pattern is deceptively simple: one main agent that delegates to specialized sub-agents.
# Main agent (Chief of Staff) receives:
"Analyze the AI agent market and write a blog post about it"
# It breaks this into sub-tasks:
1. → Researcher: "Find top 10 AI agent frameworks, their pricing,
user counts, and key differentiators. Output to
shared/inbox/research-ai-agents.md"
2. → Analyst: "Read shared/inbox/research-ai-agents.md. Compare
frameworks on cost, capability, and ease of use.
Score each 1-10. Output to shared/inbox/analysis-ai-agents.md"
3. → Writer: "Read research and analysis files. Write a 1500-word
blog post titled 'Best AI Agent Frameworks in 2026'.
Match our brand voice in SOUL.md.
Output to shared/inbox/draft-ai-agents-blog.md"
4. Chief compiles, reviews, delivers to human.Avoiding "Too Many Cooks"
More agents doesn't always mean better results. Here are the failure modes:
Real Example: Three Agents in Parallel
Here's a real setup running in production:
Scans Twitter sentiment, pulls price data, generates trading plan. Writes to shared/market-briefing.md. Cost: ~$0.80/day (GPT-4o-mini for scraping, Claude for analysis).
Reads market briefing + trending topics. Drafts 3-5 social posts. Writes to shared/content-drafts.md. Cost: ~$0.50/day (Claude Sonnet).
Mines Reddit/Twitter for pain points, validates against market briefings, scores opportunities. Writes to shared/ideas-scored.md. Cost: ~$2/week.
Total cost: ~$11/week for three specialized agents running autonomously. They share a knowledge base but each has private working space. The human reviews outputs for 15 minutes a day.
Cost Implications
Multi-agent setups multiply costs. Here's how to keep them sane:
Share this chapter
Chapter navigation
26 of 36
Previous lesson
Chapter 24: The Agent Workspace — Your Files Are Your API
Your agent doesn't need a fancy database — it needs a well-organized desk
10 min read
Next lesson
Chapter 26: The Agent's Daily Routine — Automation That Compounds
Your agent should have a better morning routine than you do
11 min read