Hacker News

vivahir215, last Sunday at 11:50 AM (1 reply)

Interesting approach. Curious about the latency tradeoff: OLS + SVD are much heavier than Top-K. Have you benchmarked end-to-end inference latency?


Replies

jchandra, last Sunday at 11:57 AM

[dead]