Hypernetworks: Neural Networks for Hierarchical Data
8 points - today at 4:55 PM
SourceComments
joefourier today at 9:43 PM
Odd that the author didnβt try giving a latent embedding to the standard neural network (or modulated the activations with a FiLM layer) and had static embeddings as the baseline. Thereβs no real advantage to using a hypernetwork and they tend to be more unstable and difficult to train, and scale poorly unless you train a low rank adaptation.