logoalt Hacker News

holografixlast Wednesday at 10:07 AM1 replyview on HN

The value in this really is running small custom models or the absolute latest open weight models.

Why bother when you can get payg API access to popular open weights models like Llama on Vertex AI model garden or at the edge on Cloudflare?


Replies

progbitslast Wednesday at 10:10 AM

Custom models.

We use this, pretty convenient and less hassle than managing our autoscaling GPU pools.