Hacker News

sinuhe69 last Wednesday at 10:02 AM

The demo they showed was full of repeated sentences. The 3B model looks quite dense, TBH. Did they just want to show the speed?


Replies

newswasboring last Wednesday at 12:50 PM

3B models, especially when quantized, almost always behave like this.