Antonio ConsiglioYOLOv10 — Breaking Speed Barriers with NMS-Free Detection (with code)YOLOv10, one of the latest iteration in the YOLO family, brings a new level of efficiency to real-time object detection. By removing the…3h ago
Tapan BabbarBuild an AI Image Similarity Search with Transformers — ViT, CLIP, DINO-v2, and BLIP-2This project uses vision models to generate image embeddings and performs similarity searches with FAISS.Oct 186
InTowards Data SciencebyRo IsachenkoAn Introduction to VLMs: The Future of Computer Vision ModelsBuilding a 28% more accurate multimodal image search engine with VLMs.4d ago14d ago1
Muhammad Rizwan MunawarParking Management using Ultralytics YOLO11Managing parking effectively is essential for busy cities and public spaces. Traditional methods often need to catch up, leading to…9h ago9h ago
InTowards Data SciencebyRuth CrastoZero-Shot Localization with CLIP-Style EncodersHow can we see what a vision encoder sees?Sep 244Sep 244
Antonio ConsiglioYOLOv10 — Breaking Speed Barriers with NMS-Free Detection (with code)YOLOv10, one of the latest iteration in the YOLO family, brings a new level of efficiency to real-time object detection. By removing the…3h ago
Tapan BabbarBuild an AI Image Similarity Search with Transformers — ViT, CLIP, DINO-v2, and BLIP-2This project uses vision models to generate image embeddings and performs similarity searches with FAISS.Oct 186
InTowards Data SciencebyRo IsachenkoAn Introduction to VLMs: The Future of Computer Vision ModelsBuilding a 28% more accurate multimodal image search engine with VLMs.4d ago1
Muhammad Rizwan MunawarParking Management using Ultralytics YOLO11Managing parking effectively is essential for busy cities and public spaces. Traditional methods often need to catch up, leading to…9h ago
InTowards Data SciencebyRuth CrastoZero-Shot Localization with CLIP-Style EncodersHow can we see what a vision encoder sees?Sep 244
InTowards Data SciencebyMatthew GuntonBuilding a Convolutional Neural Network (CNNs) from ScratchLine-by-Line, Let’s Build a ResNet Classifier on the MNIST-Fashion Dataset5d ago
Paulina Irene Velasquez FerrufinoUnderstanding Uncertainty Calculation in YOLOv9YOLO (You Only Look Once) has transformed object detection by allowing systems to identify objects in real-time with a single image pass…1d ago2
InTowards Data SciencebyDr. Leon EversbergRevisiting Karpathy’s “State of Computer Vision and AI”Looking back at AI progress since the 2012 blog post “The state of Computer Vision and AI: we are really, really far away”Oct 1810