> I did, and I fixed Qwen's issues with trivial sampling and loop detection hacks.
Wow, that's amazing! Care to share the changes? Would love to try them out.
It's not amazing at all.
What's amazing is that LLM technologies are so immature that even basic engineering diligence isn't being done. (Like detecting token loops, for example.)
It's not amazing at all.
What's amazing is that LLM technologies are so immature that even basic engineering diligence isn't being done. (Like detecting token loops, for example.)