How to Use Hugging Face to Run Llama 2 on Your Own Machine

It was not hard, just tricky.

Rahul Agarwal

Published in

The Algorithmic Minds

5 min read · Jul 19, 2023

Meta’s newly open-sourced Llama 2 Chat model has been making waves on the Open LLM Leaderboard. This powerful language model is now available to anyone, including for commercial use. Intrigued, I decided to try running Llama 2 myself. While the process was straightforward, it did require a few steps that I had to dig around to figure out.

In this post, I’ll explain how I got Llama 2 up and running. With Meta open-sourcing more of their AI capabilities, it’s an exciting time to experiment with cutting-edge language models!

So without further ado, let's walk through the steps you need to get the Llama 2 Chat model running.

Get Access

To use Llama 2, you’ll need to request access from Meta. You can sign up at https://ai.meta.com/resources/models-and-libraries/llama-downloads/ to get approval to download the model.

Once access is granted, you have two options for getting the Llama 2 files. You can download them directly from Meta’s GitHub repository. However, I found Hugging Face’s copy of the model more convenient, so in addition to the Meta access, I requested access to the Hugging Face repo here: https://huggingface.co/meta-llama/Llama-2-13b-chat-hf.

Getting access from both Meta and Hugging Face ensured I could easily obtain the latest Llama 2 model to try out. Approval can take a few hours, but then you’re ready to start experimenting!

Once I had access, I tried the following:


from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-13b-chat-hf")
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-13b-chat-hf")

This throws authentication errors because the Meta repo on Hugging Face is access-restricted. So we need an auth token, but it was not obvious where to get one.

Get Auth Token

You will need a Hugging Face access token if you run the model on your own GPU machine, whether it is hosted on AWS or GCP or is a local box. You can create one in your Hugging Face account under Settings > Access Tokens.
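Here is a minimal sketch of how such a token could be passed when loading the model. Some assumptions on my part: the token is stored in an environment variable that I have called HF_TOKEN, the helper names read_hf_token and load_llama are made up for illustration, and your transformers release accepts the token= argument (older releases used use_auth_token= instead).

```python
import os

MODEL_ID = "meta-llama/Llama-2-13b-chat-hf"  # the repo used earlier in this post


def read_hf_token(env_var: str = "HF_TOKEN") -> str:
    """Return the access token from an environment variable.

    The variable name is an assumption; use whatever name you exported.
    """
    token = os.environ.get(env_var, "").strip()
    if not token:
        raise RuntimeError(
            f"Set {env_var} to the token from Settings > Access Tokens"
        )
    return token


def load_llama(model_id: str = MODEL_ID):
    """Load the tokenizer and model, authenticating with the token.

    Requires approved access to the repo and enough memory for a 13B model.
    """
    # Imported here so read_hf_token works even without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    token = read_hf_token()
    # Assumption: a transformers release that accepts `token=`;
    # older releases expect `use_auth_token=` instead.
    tokenizer = AutoTokenizer.from_pretrained(model_id, token=token)
    model = AutoModelForCausalLM.from_pretrained(model_id, token=token)
    return tokenizer, model
```

Calling load_llama() after exporting the token should get past the authentication error from the earlier snippet, provided both access requests have been approved. Keeping the token in an environment variable rather than in the script also avoids committing it to version control by accident.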
