Gemini Ai Vs ChatGPT 4 — A clash between two AI giants!

4 min readDec 12, 2023

--

Till now, there hasn’t been a chatbot that has been able to rival ChatGPT, BUT, can Google’s new model stump it now? Google has launched GEMINI, a multimodal AI model that they say is more powerful than any existing model. It recognizes oral prompts, images, and talks back in real-time. Unlike previous models, it is also constantly learning and updating. So — here’s a look at Google’s GEMINI and how it holds up against ChatGPT!

What is ChatGPT 4?

You probably know but let me tell you anyway. Generative Pre-trained Transformer 4 is a multimodal large language model created by OpenAI, and the fourth in its series of GPT foundation models. It was initially released on March 14, 2023, and has been made publicly available via the paid chatbot product ChatGPT Plus, and OpenAI’s API.

GPT-4 on traditional benchmarks designed for machine learning models. GPT-4 considerably outperforms existing large language models, alongside most state-of-the-art (SOTA) models.

What is Gemini AI?

Google just released Gemini, its new generative AI model. Ge mini AI is Google’s latest LLM that has been designed to be more powerful and capable than its predecessor. Gemini is built for multimodality that can reason seamlessly across text, images, video, audio, and code.

Gemini supports interleaved sequences of text, image, audio, and video as inputs (illustrated by tokens of different colors in the input sequence). It can output responses with interleaved images and text.

There’s a saying that “Once people believe what you are saying, they will buy what you are selling”. Google has been saying a lot of things about its new AI model Gemini and we will explore how much of it is actually true.

Gemini is the first model to outperform human experts on MMLU (Massive Multitask Language Understanding), one of the most popular methods to test the knowledge and problem solving abilities of AI models. — Gemini is the first model to outperform human experts on MMLU (Massive Multitask Language Understanding), one of the most popular methods to test the knowledge and problem-solving abilities of AI models.

Gemini Ai Vs ChatGPT 4

Evaluation of these benchmarks is challenging and may be affected by data contamination. Even so, model performance on these benchmarks gives us an indication of the model capabilities and where they may provide an impact on real-world tasks.

Gemini performance on text benchmarks with external comparisons and PaLM 2-L — Academic Benchmarks — Gemini’s performance on text benchmarks with external comparisons and PaLM 2-L (This snippet has been taken from Google's Deepmind official blog.)

So far we have seen that GPT-4 on traditional benchmarks designed for machine learning models. GPT-4 considerably outperforms existing large language models, alongside most state-of-the-art (SOTA) models which may include benchmark-specific crafting or additional training protocols. (This snippet has been taken from the OpenA’s official blog.)

Diversity in Gemini

Trends in Capabilities — Language understanding and generation performance of the Gemini model family across different capabilities.

We can observe consistent quality gains with increased model size, especially in reasoning, math/science, summarization, and long context. Gemini Ultra is the best model across the board for all six capabilities. Gemini Pro, the second-largest model in the Gemini family of models, is also quite competitive while being a lot more efficient to serve.

The Gemini Nano 1 and Nano 2 models are engineered for on-device deployments. These models excel in summarization and reading comprehension tasks with per-task finetuning. Nano-1 and Nano-2 model sizes are only 1.8B and 3.25B parameters respectively. Despite their size, they show exceptionally strong performance on factuality, i.e. retrieval-related tasks, and significant performance on reasoning, STEM, coding, multimodal, and multilingual tasks.

Diversity in ChatGPT

Unlike Gemini’s Ultra, Pro, and Nano, Open AI has GPT-3.5 which is freely available to all, and GPT-4 for premium subscribers.

In a casual conversation, the distinction between GPT-3.5 and GPT-4 can be subtle. The difference comes out when the complexity of the task reaches a sufficient threshold — GPT-4 is more reliable, creative, and able to handle much more nuanced instructions than GPT-3.5.

Gemini vs ChatGPT-4 — In a nutshell

Gemini, Google’s new AI model, compares favorably to ChatGPT-4 in several aspects. The Gemini AI model by Google is a highly capable family of multimodal models exhibiting remarkable capabilities across text, image, audio, and video understanding. It consists of three sizes: Ultra, Pro, and Nano, each tailored for specific applications, from complex reasoning tasks to memory-constrained uses. The Gemini Ultra model has achieved state-of-the-art performance in a broad range of benchmarks, surpassing existing models in many aspects. This advanced performance in multimodal reasoning and language understanding indicates Gemini’s potential to significantly impact various fields and compete effectively with models like ChatGPT-4. However, both models excel in their respective areas, with ChatGPT-4 being a powerful text-based model.

Exploring Google’s Gemini AI: A Hands-On Guide to Leveraging the Latest Large Language Model. Read Here