Gemini AI Nano on Pixel 8 Pro

TribalScale Inc.
TribalScale
Published in
5 min readDec 12, 2023

Written by: Sebastian Valencia, Engineering Manager, TribalScale

📫 Subscribe to receive our content here.

💬 Have questions about your next digital project, startup or TribalScale? Click here to chat with one of our experts!

Google has ushered in a new era of artificial intelligence with the release of Gemini AI, you can read more about it here. In this article, we aim to provide some initial insights into this groundbreaking technology running on Pixel 8 Pro devices. This large language model (LLM) comes in three variants — Nano, Pro, and Ultra — each tailored to diverse user needs. While Nano powers quick on-device tasks, Pro offers versatility, and Ultra, the most potent, awaits release after safety checks.

Image from Google DeepMind

Already making waves on the Pixel 8 Pro, Gemini Nano introduces enhanced features like summarization in the Recorder app and Smart Reply on Gboard, initially deployed in WhatsApp. Users can now experience a more streamlined interaction with their recorded content, courtesy of the summarization feature, which generates concise bullet points for easy reference. Smart Reply on Gboard takes communication efficiency to the next level, suggesting contextual responses with the precision we’ve come to expect from cutting-edge AI.

Image from Google Store

Gboard Evolution: Unveiling the Next Frontier of Smart Interactions

The Pixel 8 Pro Gemini has already reached impressive heights in realizing its potential. However, it’s essential to note that these advancements currently come with a few limitations. Firstly, the feature is exclusively accessible on WhatsApp, restricting its widespread use. Secondly, the support is limited to US English, underscoring the need for broader language integration.

At the heart of the Pixel 8 Pro Gemini’s capabilities lies Gemini Nano, operating discreetly in the background. This powerful tool enables applications to harness its prowess, extending beyond the confines of Gboard Smart Reply. My perspective emphasizes the potential for widespread adoption as the technology matures and becomes available on more platforms and languages.

Notably, Gemini Nano showcases its proficiency not only in basic language responses but also in more sophisticated tasks like advanced proofreading and grammar correction. This is made possible through the integration of Android AICore, a testament to the device’s cutting-edge capabilities.

Delving deeper into the technical aspects, the Pixel 8 Pro Gemini utilizes Android 14’s innovative system service. This on-device foundation serves as a critical infrastructure for managing model operations, runtimes, and safety features. From my perspective, this signifies a step forward in creating a comprehensive and secure environment for advanced language processing tasks.

As the technology continues to evolve, it is foreseeable that the Pixel 8 Pro Gemini’s capabilities will extend beyond its current confines, reaching a broader audience and supporting additional languages, making it an even more influential player in the realm of language processing.

Unveiling Pixel Voice Recorder’s AI Summarization

The Pixel Voice Recorder, featuring Gemini Nano and AICore, introduces an on-device summarization feature for efficient content extraction. While not revolutionary, this tool proves invaluable for users, enabling convenient main takeaway extraction. The ability to accomplish this seamlessly with a smartphone marks a significant quality-of-life improvement.

Image from Google Store

Summarization in Action

Gemini Nano powers the Summarize feature in Recorder and Gboard Smart Reply on the Pixel 8 Pro. When selecting “Summarize” in the “Transcript” tab, the application generates three bullet points, even offline.

Balancing Act: Limitations

Summarization has constraints; recordings under one minute receive an error, and those exceeding 15 minutes are deemed “too long.” This emphasizes the need for a sweet spot between 1–15 minutes for optimal use.

AI Fussiness and Safety

Safety measures in place for harmful content align with Google’s policy, targeting dangerous activities. This reflects the app’s commitment to ethical standards.

Optimal Conditions for Success

Under ideal conditions (1–15 minutes, no harmful content), the Summarize button triggers a step-by-step process, showcasing the potential of on-device processing and advancing voice recording technology.

Conclusion

In conclusion, the Pixel 8 Pro Gemini, with its Gemini Nano and AICore integration, represents a significant advancement in smart interactions and language processing. While currently limited in platform and language support, its capabilities in language responses, proofreading, and grammar correction demonstrate great potential for wider adoption. The Pixel Voice Recorder’s AI Summarization, powered by Gemini Nano, offers a valuable on-device feature for content extraction, despite some limitations. As technology evolves, the Gemini Nano is poised to expand its influence, providing users with a secure and sophisticated environment for advanced language processing and efficient content extraction.

Resources

Sebastian is an Engineer Manager at TribalScale and a tech enthusiast with a Master’s degree in Computer Science, specializing in Data Science. His passion lies in all things Android and has been working in the field for the past 8+ years. When he’s not coding or researching the latest Android technologies, you can find him hitting the gym to stay physically fit and maintain a balanced lifestyle.

--

--

TribalScale Inc.
TribalScale

A digital innovation firm with a mission to right the future.