Kiki Rizki
Understand and Implement Transformer from Scratch PART I (Attention Mechanism)
Many ground-breaking deep learning models in 2020 are based on the transformer architecture. Like an RNN, the transformer is designed to handle…
May 3, 2021