Nice one! Let's say I'm serving local models via vllm (because ollama comes with huge perf...

driese • yesterday at 4:49 PM • 1 reply • view on HN

Nice one! Let's say I'm serving local models via vllm (because ollama comes with huge performance hits), how would I implement that in gomodel?

Replies

devmor • yesterday at 5:02 PM

This is way more interesting to me as well. I have projects that use small limited-purpose language models that run on local network servers and something like this project would be a lot simpler than manually configuring API clients for each model in each project.

➕ show 1 reply

alt Hacker News

Replies