Probably long term each dev gets their own GPU and runs a model locally I expect. Seems like a more...

mattlondon • today at 5:12 PM • 1 reply • view on HN

Probably long term each dev gets their own GPU and runs a model locally I expect. Seems like a more sustainable approach, even if a local model is not absolute SOTA.

Replies

ianm218 • today at 6:12 PM

GPUs are much more efficient at parallelizing requests for LLMs so it's going to much more efficient to centrally host. Maybe big companies it would make sense to get their own though.

alt Hacker News

Replies