Hacker News

busfahrer last Friday at 9:16 AM

> Low cost specialized models

Can you elaborate on this? Is this something that companies would train themselves?


Replies

tempoponet last Friday at 4:19 PM

You can fine-tune a model, but there are also smaller models fine-tuned for specific work like structured output and tool calling. You can build automated workflows that are largely deterministic and only slot in these models where you specifically need an LLM to do a bit of inference. If frontier models are a sledgehammer, this approach is the scalpel.
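To make that concrete, here is a minimal sketch of a mostly deterministic workflow with a single model slot. Everything in it is an assumption for illustration: the tag set, the regex rules, and the `small_model` callable, which stands in for whatever small model you run and is not any particular library's API.

```python
import re
from typing import Callable

# Hypothetical tag set; the model is only ever allowed to answer with one of these.
ALLOWED_TAGS = {"billing", "support", "newsletter", "other"}

# Deterministic rules handle the obvious cases with no model call at all.
RULES = [
    (re.compile(r"\binvoice\b|\breceipt\b", re.I), "billing"),
    (re.compile(r"\bunsubscribe\b", re.I), "newsletter"),
]


def tag_message(body: str, small_model: Callable[[str], str]) -> str:
    """Tag a message, calling the model only when no rule matches."""
    for pattern, tag in RULES:
        if pattern.search(body):
            return tag

    # Only ambiguous messages reach the model (the "scalpel" step).
    prompt = f"Reply with exactly one of {sorted(ALLOWED_TAGS)}.\n\n{body}"
    answer = small_model(prompt).strip().lower()

    # Validate the answer so everything downstream stays deterministic.
    return answer if answer in ALLOWED_TAGS else "other"
```

The model's output is treated as untrusted input: anything outside the allowed set collapses to `"other"`, so a bad generation can't leak into the rest of the pipeline.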

A common example: people are moving simple tasks in their OpenClaw setups, like tagging emails or summarizing articles, off expensive Anthropic APIs and onto cheaper models.
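A rough sketch of what that kind of split can look like, assuming you route by task name. The task names and the `cheap` / `frontier` callables are hypothetical placeholders, not OpenClaw or Anthropic APIs.

```python
from typing import Callable

ModelFn = Callable[[str], str]

# Hypothetical task names considered simple enough for a cheaper model.
CHEAP_TASKS = {"tag_email", "summarize_article"}


def route(task: str, prompt: str, cheap: ModelFn, frontier: ModelFn) -> str:
    """Send listed simple tasks to the cheap model, everything else to the frontier model."""
    model = cheap if task in CHEAP_TASKS else frontier
    return model(prompt)
```

Defaulting unknown tasks to the larger model is a deliberate choice here: a new task type costs more until someone decides it is safe to move, rather than silently getting worse results.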

Combined with memory systems, internal APIs, or just good documentation, a lot of tasks don't actually require much compute.