Exploring AI’s Next Frontier: Customized Chatbot By Chat With RTX

Chaitanya Wanjari
Published in tech@iiit-gwalior
5 min read · Apr 9, 2024

Everything you need to know about NVIDIA’s Chat with RTX: your personal assistant that works without the internet.

A Beginner’s Guide To NVIDIA:

In the fast-moving world of artificial intelligence, users who value privacy, quick response times, and cost-effectiveness benefit from running generative AI software, similar to the widely used ChatGPT, on personal computers instead of relying on cloud-based services. However, this requires a large base of computers capable of running AI software efficiently, as well as advanced tools that let developers customize and optimize AI models for personal setups. To address this requirement, NVIDIA is introducing innovations across its technologies. These innovations will enable new AI-powered experiences and build on the more than 500 applications and games (yes, you read that right) that already use NVIDIA’s RTX technology to accelerate AI capabilities.

Chat With RTX is one of the technologies that will pave the way for mind-blowing AI experiences that’ll make you question reality itself. So let’s delve deeper into this innovation.

A Glimpse: Watch How It Works

To keep you interested, let’s first take a look at NVIDIA’s Chat With RTX.

Teaser of NVIDIA’s Chat With RTX

So, what exactly is Chat With RTX? It is a tech demo that lets users personalize a chatbot with the files on their device. Cool, isn’t it?

But how exactly does this work? Chat with RTX uses retrieval-augmented generation (RAG), NVIDIA TensorRT-LLM software, and NVIDIA RTX acceleration to bring generative AI capabilities to local, GeForce-powered Windows PCs (sorry, Mac users). Users can easily connect local files on a PC as a dataset to an open-source large language model like Mistral or Llama 2, enabling queries for quick, contextually relevant responses.

Too many technical terms, right? Let’s get to know them.

  1. LLM: A Large Language Model is a deep learning model, trained on large amounts of text, that can perform a variety of natural language processing tasks.
  2. RAG: Retrieval-Augmented Generation retrieves relevant information from the user’s files (an external knowledge base) and supplies it to the Large Language Model (LLM), so that answers can draw on the most recent and current data.
  3. NVIDIA TensorRT-LLM: An open-source library that accelerates the inference performance of the latest large language models (LLMs) on the NVIDIA AI platform.
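To make the RAG idea above a little more concrete, here is a minimal sketch in plain Python. It is not how Chat With RTX is implemented internally: the word-count similarity, the function names (`retrieve`, `build_prompt`), and the two sample "files" are all assumptions made up for illustration. A real system would typically use learned embeddings for retrieval and then pass the final prompt to an LLM.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, documents, top_k=1):
    """Return the names of the top_k documents most similar to the query."""
    query_vec = Counter(tokenize(query))
    scored = [
        (cosine_similarity(query_vec, Counter(tokenize(text))), name)
        for name, text in documents.items()
    ]
    scored.sort(reverse=True)
    # Drop documents that share no words with the query at all.
    return [name for score, name in scored[:top_k] if score > 0]


def build_prompt(query, documents, top_k=1):
    """Put the retrieved passages in front of the question (the 'augmented' step)."""
    names = retrieve(query, documents, top_k)
    context = "\n".join(f"[{name}] {documents[name]}" for name in names)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


# Hypothetical local "files", invented for this example.
documents = {
    "trip_notes.txt": "We booked the hotel in Lisbon for the second week of June.",
    "recipes.txt": "The pasta sauce needs garlic, tomatoes, and fresh basil.",
}

prompt = build_prompt("Which city is the hotel in?", documents)
print(prompt)
```

The printed prompt contains the trip note but not the recipe, so a language model answering it would only see the passage relevant to the question. That is the whole trick behind RAG: retrieve first, then generate.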

Can Your PC Afford This Personal Assistant?

Since we are trying to run generative AI software on our own PC, there are some system requirements for downloading and installing NVIDIA’s Chat With RTX:

System Requirements to use Chat With RTX on your PC

How Can NVIDIA’s Chat With RTX Make Our Lives Easier?

  1. Chat with Documents, Videos and Notes: You can work with several file formats: text, PDF, DOC/DOCX, or XML. Just point the application at the folder containing your files, and it will load them into its library in a matter of seconds. The program also lets you query the content of a YouTube playlist: enter the playlist’s URL, and it loads the transcriptions of the videos.

  2. Chat for Developers: For our inquisitive developers, you can follow the TensorRT-LLM RAG reference project available on GitHub (do check this out) to deploy your own RAG-based applications for RTX.
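For a rough feel of the "point it at a folder" step, the sketch below collects the files under a folder that match the formats mentioned in this article. The extension set, the function name `find_supported_files`, and the sample file names are assumptions for illustration only; the actual application may accept a different set of files and handles the loading itself.

```python
import tempfile
from pathlib import Path

# Formats named in this article; mapping "text" to .txt is an assumption.
SUPPORTED_EXTENSIONS = {".txt", ".pdf", ".doc", ".docx", ".xml"}


def find_supported_files(folder):
    """Recursively list files under `folder` whose extension is supported."""
    root = Path(folder)
    return sorted(
        path
        for path in root.rglob("*")
        if path.is_file() and path.suffix.lower() in SUPPORTED_EXTENSIONS
    )


# Demo on a throwaway folder with hypothetical file names.
with tempfile.TemporaryDirectory() as tmp:
    for name in ["notes.txt", "paper.PDF", "photo.png"]:
        (Path(tmp) / name).write_text("placeholder")
    found = [p.name for p in find_supported_files(tmp)]
    print(found)
```

The extension check is case-insensitive, so `paper.PDF` is picked up while `photo.png` is skipped.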

These are the primary tasks performed by NVIDIA’s Chat With RTX. But what more can we achieve?

Chat With RTX vs Cloud-Based AI:

Chat with RTX definitely has an edge over cloud-based AI services in terms of privacy and security, but there’s more to it:

  1. Researchers can efficiently dig through the documents stored on their PCs and extract the data that matters.
  2. Beyond research papers, it can also help with preparing a thesis or going through a stack of legal documents. It can scan large amounts of data in seconds and surface answers to your questions.
  3. It can skim through videos from YouTube or personal folders and save you time.
  4. You can customize this tech demo with large language models like Mistral and deploy it according to your preferences. For instance, you can check out this project: Cheap meets mistralAI.

You can download and install Chat With RTX using the following link: NVIDIA Chat With RTX, Your Personalized AI Chatbot (www.nvidia.com)

Follow the steps in this video to install Chat With RTX hassle-free: https://youtu.be/O0UNAwT6nrQ?si=EfEEkE3LjaCvoQQV

But there are also some issues that do not trouble users of cloud-based AI:

  1. The system requirements for Chat With RTX are specific, and very few users actually have a PC that meets them.
  2. The download and installation procedure is not simple either, and you may run into various issues along the way. I recommend visiting the comment section of NVIDIA’s blog post about Chat With RTX, where the community has shared many of these issues and their solutions.

Conclusion:

To sum up, NVIDIA has taken a big step in personal AI by giving us a personal chatbot: Chat With RTX. It can answer questions about files saved on the PC as well as queries about YouTube videos. Last but not least, for our inquisitive developers, NVIDIA has provided a GitHub repository that can be modified and used as needed. So that is the gist of NVIDIA’s Chat With RTX.

Do check out: https://blogs.nvidia.com/blog/chat-with-rtx-available-now/

Follow for more updates!!
