🦜🔗 RAG Retrieval with Langchain

1. Install dependencies

First install pytorch, then run the following command to install the rest of the dependencies under the same environment:

pip install -r requirements.txt

Using Qwen-7B as the model, use up to 80% of the GPU memory

python -m vllm.entrypoints.openai.api_server --model 'Qwen-7B-Chat-Int4' --trust-remote-code -q gptq -dtype float16 --gpu-memory-utilization 0.8

python indexer.py

python rag.py

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
indexer.py		indexer.py
rag.py		rag.py
requirements.txt		requirements.txt
test.pdf		test.pdf