PinnedAstropomeaiGemini Pro API: Hey Gemini! Developing a Voice-Activated Multimodal AI AppYou may have dreamed of the future of AI after watching Deepmind’s demo videos, but the integration of voice and image remains a mystery to…Dec 27, 2023Dec 27, 2023
AstropomeaiTitle: Implementing RAG with Firebase GenkitFirebase is a mobile and web application development platform provided by Google. It offers various functions such as authentication…Jun 12Jun 12
AstropomeaiTitle: llama-3-vision-alpha: How to Convert LLaMA-3 into a Vision ModelLLaMA is a large-scale language model developed by Meta, but it doesn’t originally have vision capabilities. However, a method to extend…May 32May 32
AstropomeaiCreating a Custom YouTube Search Tool for Use with LangGraphLangChain is a framework designed to facilitate the development of applications utilizing Large Language Models (LLMs). LangGraph, built on…Feb 141Feb 141
AstropomeaiLaunching My GPT on the GPT Store: A Step-by-Step GuideWith the GPT Store now available, I’m excited to share the process I followed to publish my own GPT model. Interestingly, only TXT record…Jan 22Jan 22
AstropomeaiCreating Interactive AI Programs Using the Gemini APIWith the advent of Google’s latest multimodal AI, Gemini, a new user experience integrating text, voice, and image processing has become…Dec 16, 2023Dec 16, 2023
AstropomeaiGPT-4-vision: Realizing Autonomous Driving with LLMDisclaimer: The author does not possess specialized knowledge in automobiles or autonomous driving technologies, so please excuse any…Dec 4, 2023Dec 4, 2023
AstropomeaiGPT-4-vision: Trying Out Real-time Image Analysis Based on ContextIn this article, we delve into a process that captures video frames in real-time using a PC’s camera, encodes them into Base64 format, and…Nov 25, 2023Nov 25, 2023
AstropomeaiIntegrating YouTube API with GPTs for Video SearchOverview of the ActionNov 25, 2023Nov 25, 2023
AstropomeaiCreating an AI Agent mastering various tools with Streamlit × Langchain🦜Until now, we have been developing the LangChain AI Agent. This time, we decided to introduce a GUI to pursue a more intuitive operability…Sep 16, 2023Sep 16, 2023