Pinned · Lars Wiik · Gemma 2 (9B & 27B) Evaluation — vs. Open/Closed-Source LLMs 💎 · Multilingual Comparison vs. Llama 3, Phi 3, Qwen2, OpenAI, Anthropic, and Google · Jul 3
Pinned · Lars Wiik · Claude 3.5 Sonnet — Better than GPT4o ⭐ · Multilingual Evaluation vs. GPT-4o and Gemini 1.5 · Jun 20
Pinned · Lars Wiik · Best Embedding Model 🌟 — OpenAI / Cohere / Google / E5 / BGE · An In-depth Comparison of Multilingual Embedding Models · Apr 7
Pinned · Lars Wiik · Claude Opus vs. GPT-4o vs. Gemini 1.5 ⭐ — Multilingual Performance · Performance Analysis of Leading LLMs · May 24
Pinned · Lars Wiik · GPT-4o vs. GPT-4 vs. Gemini 1.5 ⭐ — Performance Analysis · Measuring English Language Understanding of OpenAI’s New Flagship Model · May 13
Lars Wiik in Towards Data Science · Voyage Multilingual 2 Embedding Evaluation · Compared to OpenAI, Cohere, Google, and E5 · Jun 20
Lars Wiik · Apple WWDC 2024 Apple Intelligence and More! · A summary of what’s new and what’s to come · Jun 11
Lars Wiik · Top LLM Reasoning Evaluation ⭐ · OpenAI’s GPT-4 Omni vs. Google’s Gemini vs. Anthropic’s Claude Opus · Jun 6
Lars Wiik · LLM Instruction Placement in Prompts — It Matters a Lot! ⭐ · Gemini 1.5 — Evaluation of how instruction placement in large prompts affects quality · May 31
Lars Wiik · How to use LLM APIs ⭐ — OpenAI, Claude, Google · Examples for GPT-4o, Claude 3 Opus, and Gemini 1.5 · May 25