Rama LLAMA Ding Dong — an overview of four key LLMs: GPT, Llama, Claude and Bard

Published in

the-stepstone-group-tech-blog

2 min readJul 24, 2023

The launch of ChatGPT last November made a big splash. More than half a year later it is no longer the only game in town and many alternative large language models (LLMs) with slightly different strengths and weaknesses have been released. An LLM is a trained deep-learning model that understands and generates text in a human-like fashion. Just this week, Meta launched its Llama2 and made it widely available.

Here’s a high-level summary of four key LLMs, including their associated advantages and challenges.

OpenAI / GPT 4.0 (Microsoft)

Advanced language understanding and generation capabilities for a wide range of topics.
Impressive coding ability and sophisticated word choice
Suitable for content creation and coding tasks.
However, criticized for lack of transparency in model training process and the model is only available by using one of OpenAI’s services like user interfaces or APIs.

Llama 2.0 (Meta)

Exceptional performance in generating helpful responses for single and multi-turn prompts.
Suitable for chat applications and tasks requiring human interaction.
However, comparatively lower coding capability.
Open-source nature allows the model to be run in a company’s environment and larger control to finetune it to the specific domain and use cases.

Claude 2.0 (Anthropic)

Excellent at tasks involving coding, mathematics, logical thinking, and document understanding.
Proficient in processing large documents including PDFs, a challenging task for other models that are more limited regarding the maximum input size.
Impressive Python coding skills, scoring 71.2% on Codex HumanEval.
Similar to GPT 4.0, accessible via an API but not fully open source.

Bard (Google)

Designed for conversational AI applications, capable of generating human-like responses and providing advice, answering questions, and generating content.
Draws on information from the web to provide fresh, high-quality responses.
Limited in comparison to GPT-4, particularly in providing specific and accurate responses for certain tasks like CSS coding.
Currently not recommended to use at StSt because it doesn’t allow to block data sharing with Google.

Sources:

Co-authored by Felicitas von Rauch, Timm Lochmann and Heinke Hihn

Read more about The Stepstone Group, the technologies we use or take an inside look at our organisation & processes. Interested in joining us? Check out our careers page.