A robust Retrieval-Augmented Generation (RAG) system template built with LangChain and ChromaDB.
Suitable for local use, internal enterprise deployments, or as a foundation for building scalable, production-ready RAG applications. While not yet fully production-hardened, it demonstrates enterprise-level best practices in code quality, testing, and extensibility.
Demo: Web UI in Action
- Features
- Requirements
- Installation
- Configuration
- Usage
- Project Structure
- Testing
- Development
- Monitoring
- Deployment
- Contributing
- License
- Acknowledgments
- Support
- Consultation
- Multi-format Document Support: PDF, TXT, DOCX, MD, CSV, XLSX
- Advanced Text Processing: Intelligent chunking with configurable overlap
- Vector Storage: ChromaDB integration with persistent storage
- Embedding Generation: Sentence Transformers models via Hugging Face with customizable model selection
- Dual Interface: Command-line and Streamlit web UI
- Structured Logging: structlog-based logging with correlation IDs
- Type Safety: Comprehensive type hints with MyPy integration
- Testing: Comprehensive test suite with enterprise-grade fixtures
- Code Quality: Black, isort, flake8, autoflake, and bandit security scanning
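The configurable chunk-size-plus-overlap idea can be sketched in a few lines of plain Python; `chunk_text` below is an illustrative stand-in, not the project's actual text-processor API:

```python
def chunk_text(text: str, chunk_size: int = 50, overlap: int = 10) -> list[str]:
    """Split text into chunks of at most `chunk_size` characters,
    each sharing `overlap` characters with the previous chunk so
    that context is preserved across chunk boundaries."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

chunks = chunk_text("The quick brown fox jumps over the lazy dog. " * 4)
```

A larger overlap improves retrieval recall at the cost of storing more near-duplicate text; the system exposes both knobs via configuration.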
- Python 3.12+ (required - the project uses Python 3.12 features)
- CUDA-compatible GPU (optional, for faster embedding generation)
# 1. Clone the repository
git clone <repository-url>
cd llm-rag-chroma-demo
# 2. Create virtual environment with Python 3.12
python3.12 -m venv venv312 # For Python 3.12 (required)
# OR
python -m venv venv # Uses your default Python (must be 3.12+)
# 3. Activate the virtual environment you created above
source venv312/bin/activate # On Linux/Mac (if using venv312)
# OR
source venv/bin/activate # On Linux/Mac (if using venv)
# OR
venv312\Scripts\activate # On Windows (if using venv312)
# OR
venv\Scripts\activate # On Windows (if using venv)
# 4. Install dependencies
make install-dev
# 5. Verify installation
make info
# Navigate to your existing project directory
cd llm-rag-chroma-demo
# Activate your existing virtual environment (must be Python 3.12+)
source venv/bin/activate # On Linux/Mac
# OR
venv\Scripts\activate # On Windows
# Install dependencies
make install-dev
The project is designed to work out-of-the-box with sensible defaults. Configuration is managed via environment variables, which you can set up using a .env file.
A template file, .env.default, is provided in the project root. To get started, copy this file to .env:
cp .env.default .env
You can then edit .env to customize your settings as needed.
- The project will run with or without an OpenAI API key.
With an OpenAI API Key:
If you provide an OpenAI API key, the system can combine your enterprise's private or customer documents with powerful LLMs (like OpenAI) to deliver richer, more accurate, and context-aware responses. This enables advanced inference by leveraging both your internal knowledge base and state-of-the-art language models.
Without an OpenAI API Key:
If you do not provide an OpenAI API key, the RAG system will still function as a robust query engine over your embedded document store. In this mode, responses are generated purely from your indexed documents, without LLM-powered augmentation. This is suitable for environments where external API calls are restricted or not desired, but may result in less nuanced or generative answers.
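A minimal sketch of that decision, assuming a helper of this shape (`select_mode` and the mode names are hypothetical, not the project's actual API):

```python
def select_mode(env: dict[str, str]) -> str:
    """Choose the answer-generation mode from the environment
    (hypothetical helper illustrating the two modes above)."""
    if env.get("OPENAI_API_KEY"):
        return "llm-augmented"   # retrieved chunks + LLM synthesis
    return "retrieval-only"      # answers come straight from the indexed documents

print(select_mode({}))  # retrieval-only
```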
To use an OpenAI API key, add it to your .env file:
OPENAI_API_KEY=your-openai-api-key
You can further customize the system by editing other variables in your .env file, such as:
- Logging level
- Supported file types
- Chunk size and overlap
- ChromaDB settings
- Embedding model
All available options are documented in .env.default with comments.
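Reading those variables in code typically follows the pattern below. LOG_LEVEL and CHROMA_PERSIST_DIRECTORY appear elsewhere in this README; CHUNK_SIZE and CHUNK_OVERLAP are illustrative names, and the defaults shown are assumptions:

```python
import os

# Env-driven configuration with fallbacks to sensible defaults.
config = {
    "log_level": os.getenv("LOG_LEVEL", "INFO"),
    "persist_directory": os.getenv("CHROMA_PERSIST_DIRECTORY", "chroma_db"),
    "chunk_size": int(os.getenv("CHUNK_SIZE", "1000")),       # illustrative name/default
    "chunk_overlap": int(os.getenv("CHUNK_OVERLAP", "200")),  # illustrative name/default
}
```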
Summary:
- Copy .env.default to .env and edit as needed.
- An OpenAI API key is optional, but recommended for best inference quality.
- The project works out-of-the-box with default settings.
# Run the demo
make run-demo
# Or start the web interface
make run-ui
The CLI supports the following commands:
# Interactive mode
rag-demo interactive
# Ingest all documents
rag-demo ingest
# Query the system
rag-demo query "What are the HR policies?"
# Get system statistics
rag-demo stats
# Clear the database
rag-demo clear
# Start Streamlit UI
make run-ui
# or
streamlit run rag_web_interface.py
from rag_system import RAGSystem
# Initialize the system
rag = RAGSystem()
# Ingest documents
stats = rag.ingest_documents()
print(f"Processed {stats['documents_stored']} documents")
# Query the system
results = rag.query("What are the vacation policies?")
for doc in results:
    print(f"Source: {doc.metadata['source']}")
    print(f"Content: {doc.page_content[:200]}...")
rag-system/
├── rag_system/                 # Main package
│   ├── core/                   # Core functionality
│   │   ├── config.py           # Configuration management
│   │   └── logging.py          # Structured logging
│   ├── ingestion/              # Document processing
│   │   ├── document_loader.py  # Multi-format document loading
│   │   ├── text_processor.py   # Text chunking and embedding
│   │   └── vector_store.py     # ChromaDB integration
│   ├── ui/                     # User interfaces
│   │   └── streamlit_app.py    # Streamlit web UI
│   ├── cli.py                  # Command-line interface
│   └── rag_system.py           # Main orchestrator
├── tests/                      # Test suite
│   ├── test_core_config.py     # Configuration tests
│   └── test_ingestion.py       # Ingestion component tests
├── data/                       # Document storage
├── rag_demo.py                 # Demo script
├── rag_web_interface.py        # Web interface
├── pyproject.toml              # Project configuration
├── Makefile                    # Development workflow
└── README.md                   # This file
# Run all tests
make test
# Run tests with coverage
make test-cov
# Run tests in watch mode
make test-watch
# Quick test cycle (format + lint + test)
make quick-test
# Run specific test file
pytest tests/test_ingestion.py -v
# Run tests with specific marker
pytest -m "slow" -v
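A marked test might look like the sketch below (the test itself is illustrative, not taken from the project's suite). Custom markers like `slow` are typically registered under `[tool.pytest.ini_options]` so that `pytest -m "slow"` can select them without warnings:

```python
import pytest

@pytest.mark.slow
def test_chunk_overlap_preserved() -> None:
    """Illustrative slow-marked test: adjacent chunks share the overlap."""
    text = "abcdefghij" * 10                 # 100 characters
    chunk_size, overlap = 30, 5
    step = chunk_size - overlap
    chunks = [text[i:i + chunk_size] for i in range(0, len(text), step)]
    assert chunks[0][-overlap:] == chunks[1][:overlap]
```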
This project maintains enterprise-grade code quality with:
- Comprehensive Type Annotations: All functions, methods, and variables have type hints
- Static Type Checking: MyPy integration ensures type safety across the codebase
- Code Formatting: Black and isort ensure consistent code style
- Linting: Flake8 and autoflake maintain code quality
- Security Scanning: Bandit identifies potential security issues
- Testing: Comprehensive test suite with clean fixtures
# Format code (black + isort)
make format
# Run linting (flake8 + autoflake)
make lint
# Type checking (mypy)
make type-check
# Security scanning (bandit)
make security-check
# All quality checks
make check-all
# Clean unused imports and variables
make clean-imports
# Pre-commit validation (runs automatically on staged files)
# The custom pre-commit hook runs the same tools as make check-all
# but only on staged files during git commit
# Complete development setup
make dev-setup
# Clean build artifacts
make clean
# (Recommended) Unset all environment variables from .env in your current shell
source clean-env.sh
# Build package
make build
# System information
make info
The system includes comprehensive logging:
import structlog
# Structured logging with correlation IDs
logger = structlog.get_logger(__name__)
logger.info("Processing document",
document_id="doc_123",
file_type="pdf",
chunk_count=15)
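structlog can bind fields such as a correlation ID once so every subsequent log line carries them. As a dependency-free illustration of the same idea, the sketch below attaches a correlation ID with the stdlib's `logging.LoggerAdapter`; this is a stand-in for the concept, not the project's actual logging module:

```python
import logging
import uuid

def logger_with_correlation_id(name: str = "rag") -> logging.LoggerAdapter:
    """Return a logger whose records all carry the same correlation_id."""
    cid = uuid.uuid4().hex[:8]  # short random ID for this request/task
    return logging.LoggerAdapter(logging.getLogger(name),
                                 extra={"correlation_id": cid})

log = logger_with_correlation_id()
log.info("Processing document doc_123")  # record carries log.extra["correlation_id"]
```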
- Environment Configuration:
  export PRODUCTION=true
  export LOG_LEVEL=WARNING
  export CHROMA_PERSIST_DIRECTORY=chroma_db
- Install Production Dependencies:
  make install
- Initialize System:
  rag-demo ingest
# Build image
make docker-build
# Run container
make docker-run
We welcome contributions! Please see our CONTRIBUTING.md for guidelines and best practices.
- Fork the repository
- Create a feature branch:
git checkout -b feature/amazing-feature
- Make your changes and add tests
- Run quality checks:
make check-all
- Commit your changes:
git commit -m 'Add amazing feature'
- Push to the branch:
git push origin feature/amazing-feature
- Open a Pull Request
- Use snake_case for files and functions
- Use PascalCase for classes
- Write comprehensive docstrings
- Maintain 80%+ test coverage
- Follow comprehensive type hints throughout
- Raise exceptions rather than returning error strings
- Never commit .env files; use .env.default as a template
- Run make check-all before committing
- Pre-commit hooks run automatically on staged files during commit
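Several of these conventions (snake_case names, docstrings, type hints, raising instead of returning error strings) come together in one small example; `load_chunk` is illustrative, not project code:

```python
def load_chunk(chunk_id: str, chunks: dict[str, str]) -> str:
    """Return the text of a chunk by ID.

    Raises:
        KeyError: if `chunk_id` is unknown. We raise rather than
        returning an error string, per the guidelines above.
    """
    if chunk_id not in chunks:
        raise KeyError(f"unknown chunk: {chunk_id}")
    return chunks[chunk_id]
```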
This project is licensed under the GNU General Public License v3 - see the LICENSE file for details.
- LangChain for the RAG framework
- ChromaDB for vector storage
- Sentence Transformers for embeddings via Hugging Face
- Streamlit for the web interface
For support and questions:
- Create an issue
- Check the documentation
- Review the examples
Need help implementing this RAG system in your organization? I offer:
- Custom RAG Solutions: Tailored implementations for your specific use case
- Architecture Review: Optimize your AI/ML infrastructure design
- Production Deployment: From prototype to production-ready systems
- Team Training: Workshops on RAG systems and best practices
- Ongoing Support: Maintenance, optimization, and feature development
Ready to accelerate your AI initiatives? Book a consultation to discuss your project requirements.
Built with ❤️ for production AI applications