Michael XinArtificial Intelligence in Plain EnglishHow To Train Multimodal LLMs To Understand And Interact With Text, Image, Video And Audio: Model…Multimodal instruction models can be evaluated using closed-set and open-set questions, as well as qualitative assessments.6 min read·Apr 26, 2024----
Michael XinArtificial Intelligence in Plain EnglishHow To Train Multimodal LLMs To Understand And Interact With Text, Image, Video And Audio: Model…7 min read·Apr 2, 2024----
Michael XinArtificial Intelligence in Plain EnglishHow To Train Multimodal LLMs To Understand And Interact With Text, Image, Video And Audio: Model…Current MLLMs insert visual embeddings from vision experts into pre-trained language embedding space. Key works are introduced.9 min read·Mar 21, 2024----
Michael XinArtificial Intelligence in Plain EnglishHow To Train Multimodal LLMs To Understand And Interact With Text, Image, Video And AudioA concise introduction to the world of multimodal Large Language Models (LLMs), an overview of their background and how to train them.7 min read·Mar 8, 2024----
Michael XGenerative AI for Innovative Photo & Video Editing & Creation — APPs and Enabling DatasetsIn the ever-evolving realm of digital artistry, the emergence of Generative Artificial Intelligence (AI) has unlocked a vast array of…7 min read·Dec 19, 2023----
Michael XinArtificial Intelligence in Plain EnglishEnhancing LLMs With Vision Experts (Part 3)If you missed out on the previous articles of this series, please read:10 min read·Nov 22, 2023----
Michael XinArtificial Intelligence in Plain EnglishEnhancing LLMs With Vision Experts (Part 2)If you missed out on the first article of this series, please read:7 min read·Nov 15, 2023----
Michael XinArtificial Intelligence in Plain EnglishEnhancing LLMs With Vision Experts (Part 1)Abstract10 min read·Nov 9, 2023----
Michael XinArtificial Intelligence in Plain EnglishAutonomous Driving Technology Revolution : From SLAM+DL to BEV+Transformer (Part 3)If you missed out on the first article of this series, please click:6 min read·Oct 25, 2023----
Michael XinArtificial Intelligence in Plain EnglishAutonomous Driving Technology Revolution : From SLAM+DL to BEV+Transformer (Part 2)If you missed out on the first article of this series, please click:11 min read·Oct 18, 2023----