Yuanrui Dong – Medium

Yuanrui Dong

Yuanrui Dong
in
AI³ | Theory, Practice, Business

ResBlock — A trick to impove the model

In deep learning, it’s common that the deeper network has stronger ability, and performance is better. However, the deeper network also…

3 min readSep 24, 2019

--

ResBlock — A trick to impove the model

--

Yuanrui Dong
in
AI³ | Theory, Practice, Business

Understanding Dropout

When training deep neural networks, we always encounter two major shortcomings:

3 min readSep 20, 2019

--

Understanding Dropout

--

Yuanrui Dong
in
AI³ | Theory, Practice, Business

AWD-LSTM

AWD-LSTM (ASGD Weight-Dropped LSTM) is one of the most popular language models. It has been used in many top papers, and its performance…

3 min readSep 9, 2019

--

1

AWD-LSTM

--

1

Yuanrui Dong
in
AI³ | Theory, Practice, Business

Hyper Parameter—Momentum

When we use the SGD (stochastic mini-batch gradient descent, commonly known as SGD in deep learning) to train parameters, sometimes it…

4 min readSep 9, 2019

--

Hyper Parameter—Momentum

--

Yuanrui Dong
in
AI³ | Theory, Practice, Business

Using softmax carefully

In deep learning, softmax is a very common and important function, especially in multi-classification image recognition. Originally, it’s…

3 min readSep 2, 2019

--

Using softmax carefully

--

Yuanrui Dong
in
AI³ | Theory, Practice, Business

Why Initializing a Neural Network is Important!

Generally, neural network models rely on stochastic gradient descent for model training and parameter updating. The final performance of…

3 min readAug 20, 2019

--

Why Initializing a Neural Network is Important!

--

Yuanrui Dong
in
AI³ | Theory, Practice, Business

Broadcasting Application

For the fastai Part 2 courses, it focous on how to rewrite the pytorch library. Therefroe, We know how the library works, and we can apply…

3 min readAug 19, 2019

--

Broadcasting Application

--

Yuanrui Dong

Yuanrui Dong

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams