logoalt Hacker News

Hypernetworks: Neural Networks for Hierarchical Data

8 pointsby mkmccjrtoday at 4:55 PM2 commentsview on HN

Comments

joefouriertoday at 9:43 PM

Odd that the author didn’t try giving a latent embedding to the standard neural network (or modulated the activations with a FiLM layer) and had static embeddings as the baseline. There’s no real advantage to using a hypernetwork and they tend to be more unstable and difficult to train, and scale poorly unless you train a low rank adaptation.

show 1 reply