Peiyuan Chien (Chris), "Here We Give You All the Tips to Run LLM Inference Smoothly" (LLM Inference Optimization), 1d ago
João Paulo Figueira in Towards Data Science, "Map-Matching for Speed Prediction" (How fast will you drive?), Jan 19
Mahernaija, "The Best NVIDIA GPUs for LLM Inference: A Comprehensive Guide" (Large Language Models (LLMs) like GPT-4, BERT, and other transformer-based models have revolutionized the AI landscape. These models demand…), Aug 27
Alon Agmon in Towards Data Science, "Streamlining Serverless ML Inference: Unleashing Candle Framework's Power in Rust" (Building a lean and robust model serving layer for vector embedding and search with Hugging Face's new Candle Framework), Dec 21, 2023
Péter Harang, "Setting up AWS Bedrock for API-based text inference" (Last time I struggled with WatsonX, now let's check whether AWS is better or worse in this regard. Let's do a speed-run!), May 29
Sneha Ghantasala in Thomson Reuters Labs, "Tensor Parallel LLM Inferencing" (As models increase in size, it becomes impossible to fit them in a single GPU for inference. There are different types of model parallelism…), 3d ago
Fireworks.ai, "Fireworks Raises the Quality Bar with Function Calling Model and API Release" (Fireworks conducts alpha launch of our function calling model and API, with quality reaching GPT-4 and surpassing open-source models), Dec 20, 2023
Jingying H, "Beam Search" (Beam search is an advanced decoding algorithm used in natural language processing to generate sequences, such as sentences, from a model…), Jun 8