윤미연, 신지후, 자유주의, 장인정신, 문제해결

Medium member since March 2019

From Attention in RNNs by Nir Arbel

*…at it does not attempt to encode a whole input sentence into a single fixed-length vector. Instead, **it encodes the input sentence into a sequence of vectors and chooses a subset of these vectors adaptively while decoding the translation. **This frees a neural translation model from having to squash all the information of a source sentence…*

From The Poisson Distribution and Poisson Process Explained by Will Koehrsen

A Poisson Process is a model for a series of discrete event where the *average time* between events is known, but the exact timing of events is random. The arrival of an event is independent of the event before (waiting time between events is memoryless). For example, suppose we own a website which our content delivery network (CDN) tells us goes down …