If you want to go deeper on language models, try these project ideas: - Zero-shot encoders like ta...

deepsquirrelnet • today at 2:57 AM • 0 replies • view on HN

If you want to go deeper on language models, try these project ideas:

- Zero-shot encoders like tasksource or GliNER

- GRPO training

- GEPA prompt tuning Qwen 0.6B (or GEPA, then GRPO)

- Use an embedding model and train a classifier (MLP, logistic, svm)

- Use a larger LLM to generate a synthetic dataset (beware of lack of diversity, mine "seed text" from real sources first)

- Synthetically generate "hard examples" where more than one category may be valid and DPO tune your preferred responses

alt Hacker News