nabakin • yesterday at 7:34 PM • 1 reply

If OP meant they have the fastest implementation of Gemma 4 on Blackwell at the moment, I guess that is technically true. I doubt that will hold up once TensorRT-LLM finishes its implementation, though.

Replies

pama • yesterday at 7:57 PM

How is the sglang performance on Blackwell for this model?
