> 25K parameters is about 70 million times smaller than GPT-4. It will produce broken sentences. ...

wk_end • yesterday at 10:23 PM • 2 replies • view on HN

> 25K parameters is about 70 million times smaller than GPT-4. It will produce broken sentences. That's the point - the architecture works at this scale.

Since it seems to just produce broken and nonsensical sentences (at least based on the one example given) I'm not sure if it does work at this scale.

Anyway, as written this passage doesn't really make a whole lot of sense (the point is that it produces broken sentences?), and given that it was almost certainly written by an AI, it demonstrates that the architecture doesn't work especially well at any scale (I kid, I kid).

Replies

forinti • yesterday at 10:38 PM

How does it compare to a Markov chain generator I wonder.

➕ show 1 reply

pizza234 • yesterday at 11:36 PM

[dead]

alt Hacker News

Replies