Hacker News

sterlind | yesterday at 5:49 PM | 0 replies

I'm extremely curious how these models learn to pack a lossily compressed representation of (more or less) the entire Internet into a few hundred billion parameters. Like, what's the ontology?