Hypernetworks: Neural Networks for Hierarchical Data

8 points - today at 4:55 PM

Source

Comments

joefourier today at 9:43 PM
Odd that the author didn’t try feeding a latent embedding into the standard neural network (or modulating its activations with a FiLM layer) and instead used static embeddings as the baseline. There’s no real advantage to using a hypernetwork. They tend to be more unstable and harder to train, and they scale poorly unless you train a low-rank adaptation.
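
For anyone who hasn't seen the two alternatives side by side, here is a rough NumPy sketch of what is being compared. It is not the article's code: the layer sizes, the parameter names (`W_gamma`, `W_beta`, `H_A`, `H_B`) and the choice of a single linear map from the latent to the generated quantities are all assumptions made for illustration. `film_layer` keeps one shared weight matrix and lets the per-group latent `z` scale and shift the activations; `lowrank_hyper_layer` lets `z` generate a low-rank update to the shared weights instead of a full weight matrix.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes only (assumed, not taken from the article).
D_IN, D_HID, D_LATENT, RANK = 4, 8, 3, 2

params = {
    "W": rng.normal(scale=0.1, size=(D_IN, D_HID)),              # shared weights
    "b": np.zeros(D_HID),                                        # shared bias
    "W_gamma": rng.normal(scale=0.1, size=(D_LATENT, D_HID)),    # latent -> FiLM scale
    "W_beta": rng.normal(scale=0.1, size=(D_LATENT, D_HID)),     # latent -> FiLM shift
    "H_A": rng.normal(scale=0.1, size=(D_LATENT, D_IN * RANK)),  # latent -> factor A
    "H_B": rng.normal(scale=0.1, size=(D_LATENT, RANK * D_HID)), # latent -> factor B
}

def film_layer(x, z, p):
    """Shared linear layer whose activations are scaled and shifted by the latent z."""
    h = x @ p["W"] + p["b"]                   # (batch, D_HID), same weights for every group
    gamma = 1.0 + z @ p["W_gamma"]            # (D_HID,) per-feature scale, equals 1 when z = 0
    beta = z @ p["W_beta"]                    # (D_HID,) per-feature shift
    return np.maximum(gamma * h + beta, 0.0)  # ReLU

def lowrank_hyper_layer(x, z, p):
    """Shared weights plus a rank-RANK update generated from the latent z."""
    A = (z @ p["H_A"]).reshape(D_IN, RANK)    # generated factor, (D_IN, RANK)
    B = (z @ p["H_B"]).reshape(RANK, D_HID)   # generated factor, (RANK, D_HID)
    W = p["W"] + A @ B                        # per-group weights, never stored in full per group
    return np.maximum(x @ W + p["b"], 0.0)    # ReLU

x = rng.normal(size=(5, D_IN))    # a batch of 5 inputs from one group
z = rng.normal(size=D_LATENT)     # that group's latent embedding (would be learned)

film_out = film_layer(x, z, params)
hyper_out = lowrank_hyper_layer(x, z, params)
```

With a zero latent both layers reduce to the same shared layer, which makes the difference easy to see: FiLM only ever adds `2 * D_HID` generated numbers per layer, while a full hypernetwork would have to generate all `D_IN * D_HID` weights, and the low-rank version sits in between with `RANK * (D_IN + D_HID)`.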