Pinned · Michael Humor in GoPenAI · Mar 24
Build your own AI PC (Part I): setting up LLM daemons on Darwin (macOS)
Towards building your own AI PC, this tutorial shows how to set up LLMs as system-level daemons on Darwin (macOS).

Michael Humor in GoPenAI · Aug 7
Llama 3.1 vs Llama 3 Differences
It seems Llama 3.1 significantly outperforms Llama 3 in math and reasoning capabilities. For instance, according to Meta's…

Michael Humor in GoPenAI · Apr 26
What LLM quantization works best for you? Q4_K_S or Q4_K_M
If you are working with a giant LLM, quantization is your friend for optimizing performance and speed. There are so many different…

Michael Humor in Dev Genius · Apr 26
Llama-3 8B Model Stats
Llama-3 8B with 4-bit quantization needs only around 4 GB of RAM to run on a PC.

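The 4 GB figure above follows from a simple back-of-envelope calculation: weight memory ≈ parameter count × bits per weight. A minimal sketch (the function name is illustrative; this ignores KV cache, activations, and per-block quantization overhead, so real Q4 K-quant files average slightly more than 4 bits per weight):

```python
# Rough estimate of the memory needed just for a model's quantized weights.
# Ignores KV cache, activation buffers, and quantization block overhead.

def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight memory in GB (10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9

# Llama-3 8B at 4-bit quantization:
print(weight_memory_gb(8e9, 4))   # → 4.0 (GB)
```

The same arithmetic shows why the unquantized fp16 model needs roughly 16 GB: the same 8B parameters at 16 bits each.
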
Michael Humor in Dev Genius · Apr 26
A single script to install Docker on a Linux VM (Microsoft Azure)
Here it is:

Michael Humor in GoPenAI · Apr 12
How to build llama.cpp on Windows with an NVIDIA GPU?
If you have an RTX 3090/4090 GPU on your Windows machine and want to build llama.cpp to serve your own local model, this tutorial shows…

Michael Humor · Mar 31
Grok 1.0 Model Stats
xAI's Grok 1.0 model (see GitHub repo) has 64 layers, an 8K context length, and 314B parameters in total.

Michael Humor in Dev Genius · Mar 22
What’s a System Prompt for AI?
In short, a “system prompt” is a specialized type of prompt that sets the context for the AI’s interactions.

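As a concrete illustration of that definition (the message contents here are hypothetical, and the structure follows the common OpenAI-style chat schema rather than any specific article): the system prompt is simply the first message in the conversation, with role "system", and everything after it is interpreted in that context.

```python
# Illustrative chat payload: the "system" message sets the context/persona
# before any user turns. Structure follows the common OpenAI-style schema.
messages = [
    {"role": "system", "content": "You are a concise technical assistant."},
    {"role": "user", "content": "Explain quantization in one sentence."},
]

# The model sees the system message first, so it frames every later reply.
system_prompt = next(m["content"] for m in messages if m["role"] == "system")
print(system_prompt)   # → You are a concise technical assistant.
```
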
Michael Humor in Dev Genius · Mar 21
The TAO of Prompt Engineering (Part-2): writing an email assistant
In the previous article, we introduced TAO (Thought-Action-Observation), a method for LLM prompt engineering. In this article, we focus on…

Michael Humor in Dev Genius · Mar 21
The TAO of Prompt Engineering (Part-1): understanding the ReAct framework
In this article, we introduce a method for prompt engineering called TAO (Thought-Action-Observation), inspired by ReAct (Reason+Act) for…