The Anthropic Claude MCP server allows you to call Claude models as a tool from within another Claude session — enabling powerful multi-agent architectures where specialised sub-agents handle specific tasks.
Features:
- Call Claude Haiku, Sonnet, or Opus from within any MCP client
- Pass custom system prompts to specialise each sub-agent
- Stream responses for long-form generation tasks
- Control temperature, max tokens, and other parameters
- Chain multiple Claude calls in a single workflow
- Use different models for different task types (fast Haiku for triage, Opus for reasoning)
- Built-in prompt caching for cost efficiency
Architecture patterns this enables:
- Orchestrator + worker: main Claude delegates to specialised sub-agents
- Critique and refine: one Claude generates, another reviews and improves
- Parallel processing: multiple Claudes working on parts of a problem simultaneously
- Specialised personas: legal Claude, code Claude, writing Claude all in one workflow