logoalt Hacker News

david-gputoday at 3:04 PM0 repliesview on HN

You don't need billions of parameters for that, precisely because the risk of being stuck at a local minimum decreases exponentially with the number of parameters. Right?