Documents RAG

Overview

This project provides a simple RAG (Retrieval-Augmented Generation) pipeline. It takes a document from the files directory, splits it into chunks, and generates embeddings for each chunk using an external embedding service (for example, the fab-codes/embedding-service).

The embeddings are then stored in a vector database, allowing semantic search or retrieval over the document contents.

How to run with Docker

In the project root, run:

docker-compose up --build -d

This will build and start the service together with its dependencies.

How to run locally

Set the document path in the .env file (for example, PDF_FILE_PATH=files/mydocument.pdf).
Open a shell inside the container and go to the app directory:
```
docker exec -it documents-rag bash
cd app
```
Run the main script:
```
python -m src.main
```

Environment Variables

Main variables to configure in .env:

PDF_FILE_PATH → path to the file you want to process inside the files/ directory.
EMBEDDING_SERVICE_URL → URL of the external embedding service (default: http://embedding-service:8000).
QDRANT_URL, QDRANT_API_KEY, QDRANT_COLLECTION → connection details for the vector database.

How it works

Load a document (just PDF in this moment) from the files folder.
Split the document into smaller text chunks.
Call the external embedding service to generate embeddings for each chunk.
Store the embeddings in Qdrant.
Enable retrieval and semantic search over the indexed document.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
files		files
src		src
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Documents RAG

Overview

How to run with Docker

How to run locally

Environment Variables

How it works

About

Uh oh!

Releases

Packages

Languages

fab-codes/documents-rag

Folders and files

Latest commit

History

Repository files navigation

Documents RAG

Overview

How to run with Docker

How to run locally

Environment Variables

How it works

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages