More from Ester Hlavin in Towards Data Science:

- Kaiming He Initialization in Neural Networks — Math Proof (Feb 15, 2023). Deriving the optimal initial variance of weight matrices in neural network layers with the ReLU activation function.
- Xavier Glorot Initialization in Neural Networks — Math Proof (Dec 23, 2022). A detailed derivation for finding optimal initial distributions of weight matrices in deep learning layers with the tanh activation function.
- 5 Derivatives to Excel in Your Machine Learning Interview (Sep 2, 2020). The calculus behind machine learning: a review of derivatives, the gradient, the Jacobian, and the Hessian.
- Activation Functions in Deep Learning: From Softmax to Sparsemax — Math Proof (Aug 26, 2020). A complete mathematical derivation of the Sparsemax activation function, a Softmax alternative for sparse outputs.
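For quick reference, the two initialization posts listed above derive simple variance rules: He initialization sets Var(W) = 2 / fan_in for ReLU layers, and Glorot initialization sets Var(W) = 2 / (fan_in + fan_out) for tanh layers. Below is a minimal NumPy sketch of those rules; the function names and the layer sizes in the example are illustrative and not taken from the posts.

```python
import numpy as np

def he_normal(fan_in, fan_out, rng=None):
    """Kaiming/He initialization for ReLU layers: Var(W) = 2 / fan_in."""
    rng = rng or np.random.default_rng()
    return rng.normal(0.0, np.sqrt(2.0 / fan_in), size=(fan_in, fan_out))

def glorot_normal(fan_in, fan_out, rng=None):
    """Xavier/Glorot initialization for tanh layers: Var(W) = 2 / (fan_in + fan_out)."""
    rng = rng or np.random.default_rng()
    return rng.normal(0.0, np.sqrt(2.0 / (fan_in + fan_out)), size=(fan_in, fan_out))

# Example: initialize a hypothetical 784 -> 256 fully connected layer.
W = he_normal(784, 256)
print(W.std())  # close to sqrt(2 / 784), roughly 0.0505
```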