Do you have a good resource on how to finetune a model like Qwen? I am curious to try it out.

mettamage • yesterday at 5:52 PM • 2 replies

Replies

Here is a dataset you can sample from: https://huggingface.co/datasets/Avtrkrb/combined-reasoning-o... Take about 10,000 samples from it according to your needs and go for it. The key (in my opinion), among other things, is not truncating the sequence length. Any traditional finetuning repo will do; if your hardware supports it, Unsloth is faster.
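A minimal sketch of that sampling step, as one possible reading of the advice rather than a recipe from any particular repo: keep only examples that fit within the training sequence length in full, then draw a random subset. The `text` field name, the `select_samples` helper, and the whitespace token counter are all assumptions for illustration; in practice you would count tokens with the model's own tokenizer.

```python
import random


def whitespace_tokens(text):
    # Stand-in for a real tokenizer (assumption): counts whitespace-separated words.
    return len(text.split())


def select_samples(records, max_seq_len, n_samples, count_tokens, seed=0):
    """Return up to n_samples records whose full text fits in max_seq_len tokens."""
    # Drop over-long examples instead of truncating them, so no example
    # is cut off partway through.
    fitting = [r for r in records if count_tokens(r["text"]) <= max_seq_len]
    rng = random.Random(seed)  # fixed seed so the subset is reproducible
    rng.shuffle(fitting)
    return fitting[:n_samples]


# Toy records with a hypothetical "text" field; real data would come from the dataset.
records = [{"text": "word " * n} for n in (5, 50, 500, 5000)]
subset = select_samples(
    records, max_seq_len=1000, n_samples=2, count_tokens=whitespace_tokens
)
```

For a real run you would raise `n_samples` to around 10,000 and set `max_seq_len` to whatever your hardware can train on without truncation.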

verdverm • yesterday at 5:57 PM

Unsloth has good resources
