PinnedFahim Rustamy, PhDVision Transformers vs. Convolutional Neural NetworksThis blog post is inspired by the paper titled AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE from google’s…Jun 4, 20238Jun 4, 20238
Fahim Rustamy, PhDinTowards Data ScienceCLIP Model and The Importance of Multimodal EmbeddingsCLIP, which stands for Contrastive Language-Image Pretraining, is a deep learning model developed by OpenAI in 2021. CLIP’s embeddings for…Dec 11, 20232Dec 11, 20232
Fahim Rustamy, PhDDEtection TRansformer (DETR) vs. YOLO for object detection.Ever wondered how computers can analyze images, identifying and localizing objects within them? That’s exactly what object detection…Aug 20, 20235Aug 20, 20235
Fahim Rustamy, PhDMachine Learning Platforms Using KubeflowMachine learning workflow is an iterative process, and machine learning’s complete lifecycle involves a lot of experimentation. If this…Jun 4, 2023Jun 4, 2023
Fahim Rustamy, PhDServerless Question Answering NLPHave you ever wondered what happens when you ask a question on Google and immediately get an answer? For example, if we ask who is the…Dec 11, 2022Dec 11, 2022