Columbia U’s ViperGPT Solves Complex Visual Queries via Python Execution
Mar 21
In the new paper ViperGPT: Visual Inference via Python Execution for Reasoning, a Columbia University research team presents ViperGPT, a framework that answers complex visual queries by having a code-generation model write Python programs that call vision modules, then running those programs in a Python interpreter. The approach requires no additional training and achieves state-of-the-art results.
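To make the idea concrete, here is a minimal sketch of the generate-then-execute loop. It is not the paper's implementation: `find`, `generate_program`, `run_query`, and `execute_command` are hypothetical names chosen for illustration, the "image" is just a list of labels, and the code generator is a stub that returns a fixed program where a real system would query a code-generation model.

```python
from typing import Callable, Dict, List

# Hypothetical stand-in for a vision module. A toy "image" is just a list of
# object labels, so "detection" is simple filtering (assumption for illustration).
def find(image: List[str], object_name: str) -> List[str]:
    return [obj for obj in image if obj == object_name]

# Hypothetical stand-in for a code-generation model. A real system would prompt
# a model with the module API and the query; this stub returns a fixed program.
def generate_program(query: str) -> str:
    return (
        "def execute_command(image):\n"
        "    muffins = find(image, 'muffin')\n"
        "    kids = find(image, 'kid')\n"
        "    return len(muffins) // len(kids)\n"
    )

def run_query(image: List[str], query: str, api: Dict[str, Callable]) -> int:
    source = generate_program(query)
    namespace = dict(api)   # expose only the provided modules to the program
    exec(source, namespace) # defines execute_command inside the namespace
    return namespace["execute_command"](image)

image = ["muffin"] * 8 + ["kid", "kid"]
answer = run_query(image, "How many muffins can each kid have?", {"find": find})
print(answer)  # 4
```

The point of the sketch is the division of labor: the generated program carries the reasoning (count, then divide), while the modules it calls handle perception. Running untrusted generated code with `exec` would need sandboxing in any real deployment.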