Your AI Assistant Doesn't Need a Bigger Model. It Needs Colleagues

· 14:05
agent-architecture multi-agent mastra supervisor-pattern local-llm

Your AI assistant doesn’t need a bigger model — it needs colleagues. One agent doing CRM lookups, code review, meeting prep, and sales drafting at the same time will be mediocre at all four. The fix isn’t a larger context window or a more capable LLM. It’s specialization.

The supervisor + specialists pattern

This video walks through the multi-agent supervisor pattern in Mastra using the production system that runs my consulting business — my AI assistant, Emma. Same model across the whole fleet (Qwen3.6-27B running locally on Ollama), nine distinct agent identities, and a supervisor agent that delegates work via tool calls. No orchestration framework. No external service. Just Mastra’s agents: property and a routing table.

Tool sprawl is the real bottleneck

The inflection point wasn’t a plan — it was a failure. A single agent with too many tools started picking the wrong tool for the job. Not hallucinating, not hitting limits. Just wrong tool, repeatedly. I cover three rubrics for when to extract a new agent: domain size, the “would you hire a human for this role?” test, and whether the domain deserves its own memory. I also cover the over-delegation trap — going from one agent to fifteen and having to claw back.

Structural trust, not prompted trust

The ninth agent in the fleet (Envoy) is deliberately locked out of everything personal so my Practical AI community can interact with it safely. That isolation is enforced by tool access, not prompt instructions — which is the only kind of trust boundary that actually holds up in production.

If you’re designing agentic systems and want help with the architecture, see how I work with teams on AI agents. For the prerequisite patterns — single-agent foundations and persistent workspaces — see Build Your Own AI Agent from Scratch and I Gave My AI Agent Access to My Second Brain.

Building an AI agent?

I help teams design and ship agentic systems — from architecture to production.

See how I can help

Get new videos and posts by email

Weekly videos on AI engineering, plus deeper dives in the newsletter.

Occasional emails, no fluff.

Powered by Buttondown