How would "a million different tool calls at the same time" work? For instance, MCP is HTT...

ZYbCRq22HbJ2y7 • yesterday at 10:18 PM • 3 replies • view on HN

How would "a million different tool calls at the same time" work? For instance, MCP is HTTP based, even at low latency in incredibly parallel environments that would take forever.

Replies

Art9681 • today at 2:35 AM

It wouldn't. There is a difference between theory and practicality. Just because we could, doesnt mean we should, especially when costs per token are considered. Capability and scale are often at odds.

nkohari • today at 1:33 PM

There's a difference between discovery (asking an MCP server what capabilities it has) and use (actually using a tool on the MCP server).

I think the comment you're replying to is talking about discovery rather than use; that is, offering a million tools to the model, not calling a million tools simultaneously.

Jarwain • today at 12:19 AM

MCPs aren't the only way to embed tool calls into an LLM

➕ show 1 reply

alt Hacker News

Replies