Agent Infrastructure Engineering at Bridge

The Fan-Out Problem: Why Grouping API Tools Isn't Enough

AI agents need fewer tools, not more. But grouping endpoints together creates a hidden trap, and the fix is simpler than you'd think.

Tomer Liran

CTO & Co-Founder

May 12, 2026·4 min read

The Fan-Out Problem: Why Grouping API Tools Isn't Enough

AI agents need fewer tools, not more. But grouping endpoints together creates a hidden trap, and the fix is simpler than you'd think.

If you've connected a real REST API to an AI agent, you've hit the tool-bloat problem. A typical SaaS API has 30-200 endpoints. Map them 1:1 to MCP tools and your agent is swimming in a flat list. Latency goes up, accuracy goes down, and half the tool calls are the agent picking the wrong one.

The obvious fix is grouping. Merge related endpoints into a single tool with an action field:

customers (tool)
  ├─ action: find
  ├─ action: search
  ├─ action: create
  ├─ action: update
  └─ action: delete

Clean. The agent sees 5 tools instead of 50. Everyone wins.

Until you hit the fan-out problem.

What fan-out looks like

Take a real-world API — say a CRM or an e-commerce backend. The "customers" resource is massive. It touches everything. If you naively group every endpoint that relates to customers under one tool, you get:

customers (tool)
  ├─ action: find
  ├─ action: search
  ├─ action: create
  ├─ action: update
  ├─ action: delete
  ├─ action: list_orders
  ├─ action: list_invoices
  ├─ action: merge_duplicates
  ├─ action: archive
  ├─ action: export
  ├─ action: import
  ├─ action: add_note
  ├─ action: assign_agent
  ├─ action: update_subscription
  ├─ action: send_email
  ├─ action: get_activity_log
  ├─ action: ... (you get the point)

Fan-out is the number of actions stuffed under one parent tool. And when that number gets high, you've recreated the original problem one level deeper. The agent is now scanning a giant action enum instead of a giant tool list. Same confusion, different shelf.

The grouping win disappears.

Why 8 is the right cap

This isn't arbitrary. LLMs read tool schemas linearly. Name, description, parameter list, action enum. When the action list is short (3-5 items), the agent scans it in one glance and picks the right action. When it's long (15+ items), the agent does what it does with the flat tool list. It guesses, sometimes wrong.

We landed on a maximum of 8 actions per grouped tool. Here's the reasoning:

Skimmability. 8 items fit in the agent's attention window without scrolling. The description + action list is digestible in one pass.
Forced clarity. Capping at 8 forces the optimizer to find meaningful sub-groupings. Instead of one bloated customers tool, you get:

More from the Bridge blog.

Agent InfrastructureEngineering at BridgeMCP Tooling

Add an MCP server to your SaaS in 10 minutes (free, no credit card)

70% of large SaaS brands now ship a remote MCP server. The build looks expensive — auth, transport, hosting, credential management. Here's how to skip all of it and go live in 10 minutes on the free tier.

Tomer Liran·May 12, 2026·5 min read

blog-post-photo-showing-icons-and-platform

Agent InfrastructureEngineering at Bridge

What I Learned Building a Managed MCP Infrastructure Layer

Building MCP servers is easy. Building the infrastructure around them like auth, multi-tenancy, credential management, context windows, etc. is where the real work lives. Lessons from shipping a managed MCP gateway, plus community insights from developers running MCP in production.

Tomer Liran·May 5, 2026·5 min read

← Back to all posts

The Fan-Out Problem: Why Grouping API Tools Isn't Enough

The Fan-Out Problem: Why Grouping API Tools Isn't Enough

What fan-out looks like

Why 8 is the right cap

More from the Bridge blog.

Add an MCP server to your SaaS in 10 minutes (free, no credit card)

What I Learned Building a Managed MCP Infrastructure Layer

What happens when you enforce this

This is one piece of a bigger picture

What we learned