As architectures evolve, I think we'll keep discovering more of these "side effects." Back in 2020, OpenAI researchers wrote that "GPT-3 is applied without any gradient updates or fine-tuning" — the task is specified purely in the prompt, and this in-context ability only emerges once the model reaches a certain scale...
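To make the "no gradient updates" point concrete, here's a minimal sketch of few-shot prompting: the task is described entirely through text demonstrations concatenated into the prompt, with no weight changes. The Q/A prompt format and `build_few_shot_prompt` helper are illustrative assumptions, not anything from the GPT-3 paper itself.

```python
def build_few_shot_prompt(examples, query):
    """Concatenate demonstrations and a new query into one prompt string.

    The model never sees a gradient update -- the "learning" happens
    purely through the text of the demonstrations (hypothetical format).
    """
    lines = [f"Q: {q}\nA: {a}" for q, a in examples]
    lines.append(f"Q: {query}\nA:")  # leave the answer for the model to complete
    return "\n\n".join(lines)

# Two in-context demonstrations of a translation task:
examples = [
    ("Translate 'chat' to English.", "cat"),
    ("Translate 'chien' to English.", "dog"),
]
prompt = build_few_shot_prompt(examples, "Translate 'oiseau' to English.")
print(prompt)
```

Feeding a prompt like this to a sufficiently large model is what the paper calls few-shot, in-context learning; smaller models given the same prompt often fail, which is the scale-dependent "emergence" referred to above.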