logoalt Hacker News

wk_endyesterday at 10:23 PM2 repliesview on HN

> 25K parameters is about 70 million times smaller than GPT-4. It will produce broken sentences. That's the point - the architecture works at this scale.

Since it seems to just produce broken and nonsensical sentences (at least based on the one example given) I'm not sure if it does work at this scale.

Anyway, as written this passage doesn't really make a whole lot of sense (the point is that it produces broken sentences?), and given that it was almost certainly written by an AI, it demonstrates that the architecture doesn't work especially well at any scale (I kid, I kid).


Replies

forintiyesterday at 10:38 PM

How does it compare to a Markov chain generator I wonder.

show 1 reply
pizza234yesterday at 11:36 PM

[dead]