- Md Monsur ali, "How to Use Ollama with GGUF Models from Hugging Face Hub: A Step-by-Step Guide": Learn how to easily run GGUF quantized models from Hugging Face using Ollama, and customize quantization, chat templates, and sampling… (6h ago)
- D, "The easiest way to convert a model to GGUF and Quantize": docker run ghcr.io/ggerganov/llama.cpp. The models will be in the same folder with .bin extension. That's it! (Jun 18)
- Wei-Meng Lee in AI Advances, "Serving LLMs using LM Studio": Learn how to run and serve an LLM locally on your computer (2d ago)
- Nicolas Moreno, "Local LLMs on iOS": Exploring the implementation of open-source ML and LLM models locally on an iOS mobile device with GGUF formats. (May 20)
- Lada Hang, "Running ComfyUI — Flux-Upscaler-GGUF-Workflow on MimicPC": When I initially created a Flux-dev-Upscaler-workflow on MimicPC, some users reported issues running it on Medium configurations (T4 16GB… (Sep 30)
- Plaban Nayak in The AI Forum, "Instruction Fine-Tuning Gemma-2B on Medical Reasoning and Convert the finetuned model into GGUF…": Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini… (Mar 10)
- Pierre Mesure, "Import embeddings model with Ollama": Are you struggling like I did to import your PyTorch embeddings model to Ollama? Here's how I did it. (Sep 28)
- kirouane Ayoub in GoPenAI, "Exploring Bits-and-Bytes, AWQ, GPTQ, EXL2, and GGUF Quantization Techniques with Practical Examples": 1. Bits-and-Bytes Quantization (Aug 22)