Skip to content

Manfredss/RAG_Langchain

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

7 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ¦œπŸ”— RAG Retrieval with Langchain

1. Install dependencies

First install pytorch, then run the following command to install the rest of the dependencies under the same environment:

pip install -r requirements.txt

2. How to use

1. Start the server βœ…

Using Qwen-7B as the model, use up to 80% of the GPU memory

python -m vllm.entrypoints.openai.api_server --model 'Qwen-7B-Chat-Int4' --trust-remote-code -q gptq -dtype float16 --gpu-memory-utilization 0.8

2. Run indexer.py to get the vector embeddings of the documents πŸ“š

python indexer.py

3. Run rag.py to get some taste of the rag-retrieval technique πŸ€–πŸ’¬

python rag.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages