PinnedLars WiikClaude 3.5 Sonnet — Better than GPT4o ⭐Multilingual Evaluation vs. GPT-4o and Gemini 1.56d ago6d ago
PinnedLars WiikBest Embedding Model 🌟 — OpenAI / Cohere / Google / E5 / BGEAn In-depth Comparison of Multilingual Embedding ModelsApr 75Apr 75
PinnedLars WiikClaude Opus vs. GPT-4o vs. Gemini 1.5 ⭐ — Multilingual PerformancePerformance Analysis of Leading LLMsMay 24May 24
PinnedLars WiikGPT-4o vs. GPT-4 vs. Gemini 1.5 ⭐ — Performance AnalysisMeasuring English Language Understanding of OpenAI’s New Flagship ModelMay 1335May 1335
PinnedLars WiikOpenAI’s GPT-4o vs. Gemini 1.5 ⭐ Context Memory EvaluationNeedle in Haystack Evaluation— OpenAI vs. GoogleMay 195May 195
Lars WiikinTowards Data ScienceVoyage Multilingual 2 Embedding EvaluationCompared to OpenAI, Cohere, Google, and E56d ago6d ago
Lars WiikApple WWDC 2024 Apple Intelligence and More!A summary of what’s new and what’s to comeJun 11Jun 11
Lars WiikTop LLM Reasoning Evaluation ⭐OpenAI’s GPT-4 Omni vs. Google’s Gemini vs. Anthropic’s Claude OpusJun 6Jun 6
Lars WiikLLM Instruction Placement in Prompts — It Matters a Lot! ⭐Gemini 1.5 — Evaluation of how instruction placement in large prompts affects qualityMay 311May 311
Lars WiikHow to use LLM APIs ⭐— OpenAI, Claude, GoogleExamples for GPT-4o, Claude 3 Opus, and Gemini 1.5May 251May 251