@moeller_ml @geomblog I did mean the weight init, because clumpy parameter initialization might take longer to break symmetry?
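
A minimal sketch of the intuition, under some loud assumptions: the tiny tanh network, the toy target `x0 * x1`, the learning rate, and the helper name `train_spread` are all hypothetical and chosen only for illustration. It trains the same network from different initializations and reports how far apart the hidden units' incoming weights are afterwards. With perfectly identical initial weights the units receive identical gradients and never separate; a "clumpy" init (tiny noise around one shared value) starts almost symmetric, so one would expect it to separate slowly compared with a spread-out init.

```python
import math
import random


def train_spread(init, steps=50, lr=0.05, hidden=4, seed=0):
    """Train a tiny 2-input tanh network with full-batch gradient descent.

    `init(rng)` returns one initial input->hidden weight. The return value is
    the largest gap between any two hidden units' weights on the same input,
    i.e. a crude measure of how much symmetry has been broken.
    """
    rng = random.Random(seed)
    # Toy regression data: y = x0 * x1 (hypothetical target, illustration only).
    points = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(32)]
    data = [((a, b), a * b) for a, b in points]

    w = [[init(rng), init(rng)] for _ in range(hidden)]  # input -> hidden
    v = [0.5] * hidden  # hidden -> output, deliberately the same for every unit

    for _ in range(steps):
        gw = [[0.0, 0.0] for _ in range(hidden)]
        gv = [0.0] * hidden
        for x, y in data:
            h = [math.tanh(w[j][0] * x[0] + w[j][1] * x[1]) for j in range(hidden)]
            err = sum(v[j] * h[j] for j in range(hidden)) - y
            for j in range(hidden):
                gv[j] += err * h[j]
                back = err * v[j] * (1.0 - h[j] ** 2)  # backprop through tanh
                gw[j][0] += back * x[0]
                gw[j][1] += back * x[1]
        for j in range(hidden):
            v[j] -= lr * gv[j] / len(data)
            w[j][0] -= lr * gw[j][0] / len(data)
            w[j][1] -= lr * gw[j][1] / len(data)

    # For each input, compare that input's weight across all hidden units.
    return max(max(col) - min(col) for col in zip(*w))


if __name__ == "__main__":
    inits = {
        "identical": lambda rng: 0.3,                          # fully symmetric
        "clumpy": lambda rng: 0.3 + rng.gauss(0.0, 1e-3),      # tight cluster
        "spread out": lambda rng: rng.gauss(0.0, 0.5),         # wide i.i.d. init
    }
    for name, init in inits.items():
        print(f"{name:>10}: spread after training = {train_spread(init):.6f}")
```

The identical case is the only one with a guaranteed outcome (the spread stays at exactly zero); how slowly the clumpy case separates depends on the noise scale, the learning rate, and the number of steps, so the numbers here are only suggestive.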