logoalt Hacker News

Der_Einzige12/10/20240 repliesview on HN

If you’re not actively publishing at top conferences (I.e. NeurIPS), than this is a trash question and shows the lack of knowledge that many who are now entering the field will have.

Anything that you or others can answer to this which isn’t some stupid “gotcha” puzzle shit (lol it’s video cus LLMs aren’t video models amiright?) will be wrong because of things like structured decoding and the fact that ultra high temperature works with better samplers like min_p.

https://openreview.net/forum?id=FBkpCyujtS&noteId=mY7FMnuuC9