Dimitrios Gagatsis · A review on CVT: Introducing Convolutions to Vision Transformers · Original Abstract: We present in this paper a new architecture, named Convolutional vision Transformer (CvT), that improves Vision… · Dec 27, 2022
Dimitrios Gagatsis · BEIT: BERT Pre-Training of Image Transformers · Original Abstract: We introduce a self-supervised vision representation model BEIT, which stands for Bidirectional Encoder representation… · Dec 10, 2022
Dimitrios Gagatsis · DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection · Abstract: We present DINO (DETR with Improved deNoising anchOr boxes), a state-of-the-art end-to-end object detector. DINO improves over… · Nov 25, 2022
Dimitrios Gagatsis · A paper review on SoftTeacher · Original Abstract: This paper presents an end-to-end semi-supervised object detection approach, in contrast to previous more complex… · Nov 15, 2022
Dimitrios Gagatsis · A Summary of Swin Transformer V2 Summary · Original Abstract: We present techniques for scaling Swin Transformer up to 3 billion parameters and making it capable of training with… · Nov 15, 2022
Dimitrios Gagatsis · Swin Transformer V1 Summary · Original Abstract: This paper presents a new vision Transformer, called Swin Transformer, that capably serves as a general-purpose backbone… · Sep 12, 2022