Your Gemini Cheat Sheet: All AI Developments from Google I/O 2024

Siddharth Sudhakar
Published in Accredian
7 min read · May 29, 2024
Source: Google I/O 2024 (io.google)

A little more than 120: that is how many times the word “AI” was mentioned at Google I/O 2024. With so many tech conferences focusing on artificial intelligence, it has become a running joke to count how often presenters say “AI” on stage. This time, Sundar Pichai, the CEO of Google, shared the count himself.

The AI innovations on display were awe-inspiring. Google is working towards integrating AI into most of its current products and services. Let’s delve into the most exciting AI announcements from I/O and examine what they signify for the future of Google and AI.

Table of Contents

  1. Introduction
  2. Updates to Google Gemini
  3. Project Astra
  4. The New Google Search
  5. Adding a Pinch of AI to Google’s Products and Services
  6. Other Miscellaneous Developments
  7. Conclusion

Introduction

Source: Wikipedia

Google I/O is a highly anticipated annual developer conference hosted by Google. This event offers developers and tech enthusiasts a unique opportunity to gain insights into Google’s latest innovations, upcoming technologies, and plans.

However, the 2024 conference was something new. The focus on AI was more prominent than ever before at I/O. The event showcased various AI-powered innovations, including advanced language models and transformative applications across Google’s extensive product lineup.

Gemini, Google’s next-generation AI model, was at the heart of it all. With its enhanced capabilities and expanded reach, Gemini promises to reshape how we interact with technology and information.

But Gemini was just the tip of the iceberg. Google I/O 2024 unveiled a wide array of AI developments, including Project Astra’s vision for personalized AI assistants, the AI-powered makeover of Google Search, and the integration of AI into familiar tools like Gmail and Google Photos.

Google is making a big investment in AI and is encouraging everyone to join the journey. In the upcoming sections, we will explore the most significant AI advancements unveiled at Google I/O 2024.

Updates to Google Gemini

Gemini, Google’s next-generation AI model, took center stage. Here’s what’s new:

Gemini 1.5 Pro and Flash: Google has unveiled two versions of Gemini 1.5. The Pro version offers strong overall performance and features a 1 million token context window, equivalent to about 1,400 pages of text. A 2 million token context window for Gemini 1.5 Pro is also available in private preview. Gemini 1.5 Flash, on the other hand, is a lighter and faster model designed for large-scale deployments.

Source: Google
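To get a feel for those context window sizes, here is a rough back-of-the-envelope sketch. It uses only the article's own figure (1 million tokens is about 1,400 pages); the helper names are made up for illustration, and real token counts per page vary a lot with language and formatting.

```python
# Rough conversion between tokens and pages, based on the article's figure
# that 1 million tokens is about 1,400 pages. This is an approximation,
# not an official constant.
TOKENS_1M = 1_000_000
PAGES_PER_1M_TOKENS = 1_400


def tokens_per_page() -> float:
    """Average tokens per page implied by the article's equivalence."""
    return TOKENS_1M / PAGES_PER_1M_TOKENS


def pages_for(context_tokens: int) -> int:
    """Approximate number of pages that fit in a context window."""
    return round(context_tokens / tokens_per_page())


print(round(tokens_per_page()))  # roughly 714 tokens per page
print(pages_for(2_000_000))      # roughly 2,800 pages for the 2 million token preview
```

By the same estimate, the 2 million token window would hold roughly twice as much text as the 1 million token one.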

Gems from Gemini: A standout feature introduced with Gemini is Gems. These customizable AI chatbots can be tailored to your interests and needs. Whether you need a language tutor, a fitness coach, or a trivia expert, you can create a Gem to chat with and learn from. This level of personalization adds a new layer of engagement and utility to AI interactions.

Source: Google I/O 2024

Multimodal Capabilities: Gemini can now handle text, images, and audio, opening up new possibilities for creative applications and enhanced understanding.

Source: Google I/O 2024

API Access: Developers can now tap into Gemini’s power through the Gemini API, enabling them to build innovative AI-powered applications and services.
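As a minimal sketch of what a call to the Gemini API can look like, the snippet below builds a `generateContent` request using only Python's standard library. The endpoint path, the `gemini-1.5-flash` model name, and the request and response shapes reflect the public `v1beta` REST API as documented around the time of I/O 2024; treat them as assumptions and check the current documentation before relying on them. The function names (`build_request`, `send`, `extract_text`) are made up for this example.

```python
import base64
import json
import urllib.request

# Assumed base URL of the public Gemini REST API (v1beta at the time of writing).
API_ROOT = "https://generativelanguage.googleapis.com/v1beta"


def build_request(prompt, api_key, model="gemini-1.5-flash",
                  image_bytes=None, mime_type="image/png"):
    """Build (but do not send) a generateContent request.

    If image_bytes is given, it is attached as a base64-encoded inline part,
    which is how a multimodal (text + image) prompt is expressed.
    """
    parts = [{"text": prompt}]
    if image_bytes is not None:
        parts.append({
            "inline_data": {
                "mime_type": mime_type,
                "data": base64.b64encode(image_bytes).decode("ascii"),
            }
        })
    body = {"contents": [{"parts": parts}]}
    url = f"{API_ROOT}/models/{model}:generateContent?key={api_key}"
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request):
    """Send the request and return the parsed JSON response (needs network access)."""
    with urllib.request.urlopen(request) as resp:
        return json.load(resp)


def extract_text(response_json):
    """Join the text parts of the first candidate in a response."""
    parts = response_json["candidates"][0]["content"]["parts"]
    return "".join(part.get("text", "") for part in parts)


# Example usage (requires a real API key, so it is left commented out):
# req = build_request("Summarize Google I/O 2024 in one sentence.", "YOUR_API_KEY")
# print(extract_text(send(req)))
```

Google also ships official client libraries that wrap these calls. The raw request is shown here only to make the shape of a prompt visible.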

Project Astra

This ambitious project represents a paradigm shift in how we conceive and interact with AI. Astra aims to move beyond the current generation of assistants and create something far more sophisticated and integrated into our lives.

So, what sets Project Astra apart?

  • Personalized to the Core: Astra is designed to be highly personalized, learning your preferences, habits, and needs over time. This allows it to offer tailored recommendations, reminders, and suggestions that are genuinely relevant to you.
  • Proactive and Anticipatory: Astra aims to proactively anticipate your needs and offer assistance before you even ask. For example, it could automatically create a shopping list when it senses you’re running low on groceries or remind you of an upcoming appointment and offer to book transportation.
  • Contextual Understanding: Astra will go beyond simply processing words and phrases. It will strive to understand the context of your interactions, considering your location, time of day, and current activities. This will enable it to provide more meaningful and helpful responses.

While still in the early stages of development, Project Astra offers an exciting vision for the future of AI assistants. It envisions a future where AI isn’t just a tool we use, but a partner that understands us, anticipates our needs, and seamlessly integrates into our lives to make them easier, more productive, and more enjoyable.

The New Google Search

Google Search got a major AI-powered makeover with the Gemini-powered Search Generative Experience (SGE), which is rolling out to users under the name AI Overviews:

  • AI-Generated Snapshots: SGE provides AI-generated summaries of relevant information for complex queries, saving users time and effort.
  • Conversational Mode: SGE allows for natural language follow-up questions, making searches more interactive and intuitive.
  • Perspective Filter: Users can filter search results based on different perspectives, helping them explore various viewpoints.

“Let Google do the Googling for you.”

For example, let’s say I’m in the market to purchase my first car. Since I’m not well-versed in car prices, I’m researching the average prices of various car brands to narrow down my options. To assist with this, I can run a Google search for “What is the least expensive Mercedes car model,” and here are the results I get:

You can see that the AI Overview provides the information I need right at the top, eliminating the need to visit multiple websites and dig deeper.

There’s also a new ‘Web’ filter, which shows only links to websites relevant to your search.

Multi-step reasoning takes search to the next level. You can now ask complex questions that require multiple steps to answer, and Google Search will break down the process, find the relevant information, and present it clearly and concisely.

Adding a Pinch of AI to Google’s Products and Services

AI is enhancing Google’s products and services in several ways, such as:

  • Google Photos: You can now ask contextual questions about your photos, and Google Photos will show the most relevant picture. For example, you can ask, “Show me a picture of me hiking in Yosemite last summer,” and it will find the specific image for you.
  • Gmail: AI has significantly improved email management. You can now summarize long email chains with a click, saving time. Additionally, Gmail can generate responses to questions based on the content of your emails, making communication more efficient.
  • Phone: AI is now helping to protect against scams. Google Phone uses AI to detect potential scam calls, providing warnings and helping users avoid unwanted interactions.

Other Miscellaneous Developments

The conference also showcased a range of innovative AI applications that push the boundaries of creativity and productivity:

Veo: This AI-powered video generation model is a game-changer for content creators. With Veo, you can generate videos from multimodal prompts, combining text, images, and even audio to create unique and engaging content. Furthermore, Veo allows you to extend existing videos and generate additional scenes.

NotebookLM: This experimental AI-powered notebook is designed to be your ultimate research companion. It can summarize information from multiple sources, generate ideas, and even help you brainstorm. NotebookLM is still in its early stages, but it has the potential to revolutionize how we learn and create.

Source: Google

Imagen 3: Google’s text-to-image model has been upgraded to Imagen 3. This new iteration boasts even higher quality and more realistic image generation, opening new possibilities for creative expression and visual storytelling.

Source: Google DeepMind

Music AI Sandbox: Unleash your inner musician with Music AI Sandbox. This AI-powered tool allows you to create music from text descriptions or hummed melodies. Whether you’re a seasoned composer or a casual music enthusiast, this tool offers a fun and accessible way to explore the world of music creation.

Conclusion

Google I/O 2024 emphasized that AI is central to Google’s vision for the future. From the revolutionary capabilities of Gemini to the AI-enhanced experiences across Google’s products, the company is pushing the boundaries of what’s possible with artificial intelligence. It’s an exciting time to follow developments in this rapidly evolving field!

You can catch up on the event by watching Google’s recap here:

Thanks for reading! I’m eager to hear your thoughts and insights in the comments.
