SyncedReview
Published in

SyncedReview

Google Brain Uncovers Representation Structure Differences Between CNNs and Vision Transformers

Although convolutional neural networks (CNNs) have dominated the field of computer vision for years, new vision transformer models (ViTs) have also shown remarkable abilities, achieving comparable and even better performance than CNNs on many computer vision tasks. The success of ViTs has raised a number of questions: How are…

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store