haomingkoo, today at 2:56 PM

Really interesting approach. Curious how the 2-bit quantization affects the model's reasoning ability on longer chains of thought vs. shorter prompts. The benchmarks look solid, but real-world usage seems like a different story based on the comments here.