Multi-Agent Systems: Why One AI Isn't Enough
When we started building MDFit Nova-Sonic, the first instinct was to use a single large language model for everything. One prompt, one model, handle all cases. It did not work.
The Single-Agent Problem
A single agent trying to handle scheduling, rescheduling, cancellations, FAQ, and escalation simultaneously suffers from context dilution. The more responsibilities you pile onto one prompt, the worse it performs at each individual task.
We saw this in testing: a general-purpose agent would occasionally book an appointment when the patient wanted to cancel, or provide scheduling information when they needed to be routed to a human.
The Multi-Agent Architecture
Our solution: 5 specialized agents, each with a focused prompt and dedicated tool set.
Agent Specialization
Each agent excels at exactly one thing:
- Narrow context: Smaller, focused prompts mean higher accuracy
- Dedicated tools: Each agent only has access to the tools it needs
- Clear boundaries: Intent classification routes to the right agent before any action is taken
The Orchestrator
The orchestrator is the brain of the system. It listens to the initial utterance, classifies intent, and routes to the appropriate agent. If the conversation shifts mid-call (patient starts by asking about hours, then wants to schedule), the orchestrator re-routes seamlessly.
Lessons Learned
1. Agent boundaries must be crisp
Ambiguous boundaries between agents lead to routing errors. We spent significant time defining exactly where one agent's responsibility ends and another begins.
2. Fallback to human is not failure
Our escalation agent handles cases that do not fit neatly into the other four categories. This is not a failure mode — it is a design choice. Some situations genuinely require a human.
3. Latency compounds
With multiple agents, every millisecond of routing overhead compounds. We optimized the orchestrator to make routing decisions in under 100ms.
Multi-agent systems are more complex to build, but they produce dramatically better results than monolithic AI approaches. The key is clear specialization and fast routing.