FOSS models have effectively caught up wrt. scale, see e.g. the latest DeepSeek V4 series - but they...

zozbot234 • yesterday at 7:01 PM • 0 replies • view on HN

FOSS models have effectively caught up wrt. scale, see e.g. the latest DeepSeek V4 series - but they still require major hardware resources (hundreds of gigabytes of RAM for a very lean deployment targeting single- or few-users inference) to run at acceptable throughput.

alt Hacker News