Data Engineer @Skyscanner, AI writer @FloydHub, ex-biology teacher, language enthusiast.
Could The Transformer be another nail in the coffin for RNNs?
This follows my former post on how gradient descent works in linear regression.
Yesterday a magic moment arrived. After three days of cramming Andrew Ng machine learning videos, I…