logoalt Hacker News

mettamageyesterday at 5:52 PM2 repliesview on HN

Do you have a good resource on how to finetune a model like Qwen? I am curious to try it out.


Replies

trilogicyesterday at 6:03 PM

Here is a dataset you can choose from: https://huggingface.co/datasets/Avtrkrb/combined-reasoning-o... Get a 10000 samples from it according to your needs and go for it. The key (in my opinion) is not cutting the Sequence Length among other things. Whatever traditional finetuning repo will do, if your hardware supports it Unsloth is faster.

verdvermyesterday at 5:57 PM

Unsloth has good resources