RAGstack: Facilitate Private Document Discussions with Your Colleagues

Technology is constantly evolving, and in the world of AI, things are getting more and more interesting. Today, I will talk to you about a project that is not only interesting but also very useful: RAGstack.

The idea is simple: RAGstack lets you deploy a private alternative to ChatGPT inside your own VPC (Virtual Private Cloud) and explore open-source LLMs such as Llama 2, Falcon, and GPT4All.

RAGstack is aimed at companies that want to use Retrieval Augmented Generation (RAG), a technique that improves an LLM's answers by retrieving relevant information from other systems and passing it to the model along with the question. Deployed inside a company, it can save employees time in their daily work. RAG is also less expensive and faster to set up than fine-tuning a model.
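To make the RAG idea concrete, here is a minimal sketch in Python. It is not RAGstack's code: it swaps the vector database and embedding model for a simple bag-of-words cosine similarity, and the function names (`retrieve`, `build_prompt`) and sample documents are made up for illustration. It only shows the two steps: retrieve the most relevant text, then put it into the prompt sent to the LLM.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question, documents, k=2):
    """Return the k documents most similar to the question."""
    q = Counter(tokenize(question))
    scored = [(cosine_similarity(q, Counter(tokenize(d))), d) for d in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in scored[:k]]


def build_prompt(question, documents, k=2):
    """Build the augmented prompt: retrieved context first, then the question."""
    context = "\n".join(f"- {d}" for d in retrieve(question, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


if __name__ == "__main__":
    # Hypothetical internal documents, for illustration only.
    docs = [
        "The vacation policy grants 25 days of paid leave per year.",
        "The office wifi password is rotated every month.",
        "Expense reports must be submitted before the 5th.",
    ]
    print(build_prompt("How many days of paid leave do I get?", docs, k=1))
```

In a real deployment, the similarity search would be handled by a vector database and the prompt would be sent to the LLM, but the flow stays the same.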

The RAGstack project supports several models, including GPT4All, Llama 2, and Falcon-7B. It can be deployed on GCP, AWS, and Azure, but you can also run it on your own server as follows:

Copy ragstack-ui/local.env to ragstack-ui/.env and copy server/example.env to server/.env.

In server/.env, replace YOUR_SUPABASE_URL with the URL of your Supabase project and YOUR_SUPABASE_KEY with your Supabase secret API key.

In ragstack-ui/.env, replace YOUR_SUPABASE_URL with the URL of your Supabase project and YOUR_SUPABASE_PUBLIC_KEY with your Supabase public API key. As the variable name suggests, the UI should get the public key, not the secret one.

You can find these values in your Supabase dashboard under Settings > API.
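If you prefer to script the copy-and-replace steps above, here is a small sketch in Python. The file paths and placeholder names come from the steps above; the helper names (`fill_placeholders`, `create_env`) and the example values are my own and purely illustrative.

```python
from pathlib import Path


def fill_placeholders(text, values):
    """Replace each placeholder in text with its real value."""
    for placeholder, value in values.items():
        text = text.replace(placeholder, value)
    return text


def create_env(template, target, values):
    """Copy a template env file to target, filling in the placeholders."""
    template, target = Path(template), Path(target)
    target.write_text(fill_placeholders(template.read_text(), values))
    return target


# Example usage, run from the root of the RAGstack repository.
# The URL and keys below are dummy values: use the ones from your dashboard.
#
# create_env("server/example.env", "server/.env", {
#     "YOUR_SUPABASE_URL": "https://example.supabase.co",
#     "YOUR_SUPABASE_KEY": "your-secret-key",
# })
# create_env("ragstack-ui/local.env", "ragstack-ui/.env", {
#     "YOUR_SUPABASE_URL": "https://example.supabase.co",
#     "YOUR_SUPABASE_PUBLIC_KEY": "your-public-key",
# })
```

Keep the resulting .env files out of version control, since server/.env contains the secret key.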

Then, in Supabase, create a ragstack_users table with the following columns:

id: uuid
app_id: uuid
secret_key: uuid
email: text
avatar_url: text
full_name: text

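If you enabled row-level security on the table, make sure that the insert and select policies use the expression (auth.uid() = id). In Postgres, insert policies take it as a WITH CHECK expression, while select policies take it as a USING expression.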
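For those who would rather use the Supabase SQL editor than the table UI, here is a sketch of the equivalent SQL, kept in Python strings so you can print it and paste it. The column names and types are the ones listed above. The policy names are hypothetical, and I have deliberately left out keys and constraints because they are not specified here.

```python
# SQL for the ragstack_users table described above (no constraints assumed).
CREATE_TABLE_SQL = """
create table ragstack_users (
    id uuid,
    app_id uuid,
    secret_key uuid,
    email text,
    avatar_url text,
    full_name text
);
"""

# Optional row-level security. The policy names are made up for this example.
RLS_SQL = """
alter table ragstack_users enable row level security;

create policy "ragstack_users_select_own" on ragstack_users
    for select using (auth.uid() = id);

create policy "ragstack_users_insert_own" on ragstack_users
    for insert with check (auth.uid() = id);
"""

if __name__ == "__main__":
    # Print the statements so they can be pasted into the SQL editor.
    print(CREATE_TABLE_SQL + RLS_SQL)
```

Review the statements before running them, and adapt them if your project already has conventions for keys or policies.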

Run scripts/local/run-dev. This will download ggml-gpt4all-j-v1.3-groovy.bin to server/llm/local/ and run the server, LLM, and Qdrant vector database locally.

All services will then be ready when you see the following message:

INFO: Application startup complete.
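If you want to automate this check, for example in a setup script, here is a minimal sketch in Python that watches the output for that message. The function names are my own, and I am assuming the message may be written to stderr, which is why both streams are merged.

```python
import subprocess

READY_MARKER = "Application startup complete."


def wait_until_ready(lines, marker=READY_MARKER):
    """Read lines until one contains the marker; return True if it was found."""
    for line in lines:
        if marker in line:
            return True
    return False


def run_and_wait(command="scripts/local/run-dev"):
    """Start the dev script and block until the ready message appears."""
    proc = subprocess.Popen(
        [command],
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # merge stderr in case logs go there
        text=True,
    )
    ready = wait_until_ready(proc.stdout)
    return proc, ready
```

Call `run_and_wait()` from the root of the repository. It returns the running process together with a flag telling you whether the ready message appeared before the output ended.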

Well done, it's up and running! You can also try an online version.

The source code for the RAGstack project can be found on GitHub.

Mohamed SAKHRI

