How should NNs adjust their connectivity structure to get an appropriate inductive bias for a particular problem?
Continuous parameterisations of the connectivity make it differentiable, so gradients can be used to answer this question.
Thanks to @tychovdo and David Romero for a great collaboration across the North Sea.
See https://qoto.org/web/statuses/109347006769035328 for more.