Does it generalize though? What a bag-of-words metaphor can say about a question "How many rein...

red75prime • today at 8:55 AM • 0 replies • view on HN

Does it generalize though? What a bag-of-words metaphor can say about a question "How many reinforcement learning training examples an LLM need to significantly improve performance on mathematical questions?"

alt Hacker News