hodgehog11, today at 4:31 AM

Extremely well said. Universal approximation is necessary but not sufficient for the performance we are seeing. The secret sauce is implicit regularization, which comes about analogously to enforcing compression.
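For anyone who wants a concrete picture of implicit regularization, here is a toy sketch of my own (not from the comment above), assuming the simplest possible setting: one linear equation with two unknowns, so infinitely many weight vectors fit the data exactly. Plain gradient descent started from zero has no explicit penalty term, yet it settles on the minimum-norm solution. The function name and the numbers are made up for illustration, and this says nothing direct about how the effect plays out in deep networks.

```python
def gradient_descent_from_zero(a, b, lr=0.05, steps=2000):
    """Minimize 0.5 * (a . w - b)**2 with plain gradient descent, starting at w = 0.

    No regularization term appears anywhere in the loss or the update.
    """
    w = [0.0] * len(a)
    for _ in range(steps):
        residual = sum(ai * wi for ai, wi in zip(a, w)) - b
        # Gradient of the loss with respect to w is residual * a.
        w = [wi - lr * residual * ai for ai, wi in zip(a, w)]
    return w


# Hypothetical data: one equation, two unknowns (w1 + 2*w2 = 5).
a = [1.0, 2.0]
b = 5.0

w_gd = gradient_descent_from_zero(a, b)

# Closed-form minimum-norm solution of a . w = b is a * b / (a . a).
a_dot_a = sum(ai * ai for ai in a)
w_min_norm = [ai * b / a_dot_a for ai in a]

# Another exact fit with a larger norm, for comparison.
w_other = [5.0, 0.0]

print("gradient descent:", w_gd)
print("minimum norm:    ", w_min_norm)
print("other exact fit: ", w_other)
```

Both `w_min_norm` and `w_other` fit the data perfectly, so nothing in the loss prefers one over the other. Gradient descent lands on the smaller one because every update is a multiple of `a`, so the iterate never leaves the span of `a` when it starts at zero. That preference comes from the optimizer and the initialization rather than from the objective, which is the sense in which the regularization is "implicit".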