Published in Artificialis

ViT: Vision Transformer, a PyTorch implementation

The paper "Attention Is All You Need" revolutionized the world of Natural Language Processing, and Transformer-based architectures became the de facto standard for natural language processing tasks.

It was only a matter of time before someone tried to reach the state of the art in computer vision using attention mechanisms and Transformer architectures.
