Akshay Kokane | Microsoft’s New vision based GUI agent — OmniParser | OmniParser: A Visionary Approach to GUI Interaction | Oct 26
Steve Jones | Using GPT-4-Vision and YOLOv8 to identify animals efficiently without additional training | Or why cost optimization really matters with GPT | Dec 19, 2023
Felix Kemeth | Using LlamaParse and Multimodal LLMs for Extracting and Interpreting Text and Images from PDFs | Querying and evaluating PDF text and image content in less than 50 lines of code | Oct 16
Manoj Mukherjee | Extracting Information from Images with OCR, Vision AI, and Language Models | In the digital age, extracting valuable information from images is crucial for various applications, ranging from document analysis to… | Feb 27
Tee Kai Feng | OpenAI Visual Tokenizer Explained | Similar to text tokenizers, GPT-4 also “tokenizes” visual inputs (images/videos) into tokens, and the number of tokens will, in turn… | Aug 5
Plaban Nayak, in AI Planet | Multimodal RAG using Langchain Expression Language And GPT4-Vision | Many documents contain a mixture of content types including images and text. Yet information captured in images is lost in most RAG… | Dec 28, 2023
Nipun dixit | Azure Open AI Models Collaboration | “The image on the left is from James Webb and the one on the right is generated by Dalle using a text description of the former.” | Jul 18