Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

114 points • by ag2718 • today at 7:21 PM • 14 comments

https://web.archive.org/web/20260609200156/https://aarushgup...

Comments

So for people wondering if it can be used to accelerate LLM inference, sadly not.

I've been trying to hit 100,000 tokens/s with a dumb 3.28M-parameter model, and even this is an order of magnitude too large to benefit.

It appears to be focused more on latency than throughput. Happy to be corrected.

RantyDave • today at 8:10 PM

Right. But... this would limit you to either extremely small models or extremely large FPGAs, yes? If there's a simple machine learning task that requires sub-microsecond latency I can see the point, but otherwise?

tomrod • today at 9:21 PM

Happy to hear that KANs continue to find solid footing.

Animats • today at 8:19 PM

This guy will be hired by a high-frequency trading firm, and the next time we hear about him, he will have a net worth in 9 figures.

babelfish • today at 8:26 PM

Archive link, as it looks like the original post was taken down: https://web.archive.org/web/20260609200156/https://aarushgup...
