logoalt Hacker News

Der_Einzigeyesterday at 8:48 PM1 replyview on HN

Use modern samplers and you don’t need to limit yourself to 8bit at half the context window. I could push it down to 1.58 bits and get decently good output easily by simply not using the garbage default top_p and top_k that vendors continue to wrongly recommend.


Replies

anon373839yesterday at 9:39 PM

Where do you find optimal samplers and sampler settings for these models? Very interested in this as I, too, use Q8.