logoalt Hacker News

whimsicalismlast Thursday at 5:37 PM0 repliesview on HN

I imagine you have to start decoding many speculative completions in parallel to have true low latency.