Abdulkader HelwanQ-FormerThe ability to seamlessly integrate and process information from both visual and textual domains has emerged as a crucial capability In the…Dec 22, 2023
Shashwat AgarwalSalesforce/Blip Model: The Pinnacle of Multimodal AIIn the expansive realm of artificial intelligence, the Salesforce/Blip model has emerged as a paragon of multimodal learning models…Jul 20Jul 20
Yuki ShizuyaBLIP-2 paper review and Explore BLIP-2’s embedding spacePaper Review of the state-of-the-art Vision-Language researchMar 3Mar 3
TechcodewithkkThe Journey of Final Year ProjectIt all started in the 7th semester. We began discussing which project to make, and as EC students, we initially thought of doing something…May 22May 22
Abdulkader HelwanQ-FormerThe ability to seamlessly integrate and process information from both visual and textual domains has emerged as a crucial capability In the…Dec 22, 2023
Shashwat AgarwalSalesforce/Blip Model: The Pinnacle of Multimodal AIIn the expansive realm of artificial intelligence, the Salesforce/Blip model has emerged as a paragon of multimodal learning models…Jul 20
Yuki ShizuyaBLIP-2 paper review and Explore BLIP-2’s embedding spacePaper Review of the state-of-the-art Vision-Language researchMar 3
TechcodewithkkThe Journey of Final Year ProjectIt all started in the 7th semester. We began discussing which project to make, and as EC students, we initially thought of doing something…May 22
shashank JainBLIP-2: A Detailed Look at the Architecture, Training, and InferenceIntroductionJul 9, 2023
Akriti UpadhyayUnveiling the Power of Multimodal Language Models in Image CaptioningDecoding the Language of Images with Advanced Captioning ModelsDec 6, 20231
Tim SpannImage Processing with Custom Python and NiFi 2.0Apache NiFi, Image Processing, BLIP, HuggingFace, Transformers, Python, Image CaptioningMar 13