Documents Chat(FastAPI + React + RAG + LLM(OpenAI/Local LLM))

A local-first Retrieval-Augmented Generation (RAG) app with a FastAPI backend, React (Vite) frontend, ChromaDB vector store.

🎥 Demo Video: DEMO/DEMO.mp4

DEMO.mp4

System Overview

flowchart LR
    subgraph Client["User Browser"]
        UI["React (Vite app)"]
    end

    subgraph Frontend["Frontend Container (Nginx)"]
        NGINX["Nginx</br>- Serves static SPA</br>- Proxies /v1/* to backend"]
    end

    subgraph Backend["Backend Container (FastAPI)"]
        API["FastAPI /v1 API"]
        PARSE["Ingestion & Parsing</br>- PDF: pypdf</br>- DOCX: docx2txt</br>- CSV: pandas</br>(Optional OCR)"]
        CHUNK["Chunking</br>RecursiveCharacterTextSplitter"]
        EMB["Embeddings</br>SentenceTransformers (BAAI/bge-m3)"]
        VDB["Vector DB</br>Chroma (cosine)</br>Persistent volume"]
        LLM["LLM Router</br>- openai</br>- ollama"]
    end

    subgraph Storage["Docker Volumes"]
        HF["/hf_cache</br>(HF model cache)"]
        CHR["/data/chroma_store</br>(Chroma persistence)"]
    end

    UI -->|"HTTP: / (static)"| NGINX
    UI -->|"HTTP: /v1/*"| NGINX -->|"proxy"| API

    API --> PARSE --> CHUNK --> EMB --> VDB
    API -->|"RAG retrieve"| VDB
    API -->|"Prompt + context"| LLM

    VDB <-.persisted.-> CHR
    EMB <-.downloads models.-> HF

    subgraph LLM_Providers["LLM Backends (choose one)"]
        OAI["OpenAI API</br>(model: e.g., gpt-4o-mini)"]
        OLM["Ollama</br>(model: llama3:8b-instruct)"]
    end

    LLM -->|provider=openai| OAI
    LLM -->|provider=ollama| OLM

Request / Response Flows

Upload & Index (RAG)

sequenceDiagram
    participant B as Browser (React)
    participant F as Nginx (Frontend)
    participant A as FastAPI /v1
    participant P as Parse/Chunk/Embed
    participant C as Chroma (Vectors)

    B->>F: POST /v1/files (multipart, user-id)
    F->>A: /v1/files
    A->>P: Parse (PDF/DOCX/CSV) → Chunk → Embed (bge-m3)
    P->>C: Upsert(chunks, vectors, {user_id, doc_id, source, page})
    A-->>F: { status: "indexed", chunks, doc_id }
    F-->>B: Response

Chat over Documents (RAG + LLM)

sequenceDiagram
    participant B as Browser (React)
    participant F as Nginx (Frontend)
    participant A as FastAPI /v1
    participant C as Chroma (Vectors)
    participant L as LLM Router
    participant P1 as OpenAI
    participant P2 as Ollama

    B->>F: POST /v1/chat {query, user-id}
    F->>A: /v1/chat
    A->>C: query(embed(query), where user_id)
    C-->>A: top-K chunks + metadata
    A->>L: build prompt + context → generate

    alt LLM_PROVIDER=openai
        L->>P1: /v1/chat/ (X-OpenAI-Key)
        P1-->>L: answer
    else LLM_PROVIDER=ollama
        L->>P2: /api/chat (model: llama3:8b-instruct)
    end

    L-->>A: answer
    A-->>F: { response, sources[] }
    F-->>B: Render answer + citations

🎥 Quick Tour Video: Tour/Tour.mp4

Overview.and.Code.Review.of.Documents.Chat.Project.1.mp4

Third-Party Software

This project uses third-party libraries and tools as specified in backend/requirements.txt and the Dockerfiles.
Those components are licensed separately under their respective open-source licenses (MIT, Apache-2.0, BSD, etc.).
If you distribute built artifacts (e.g., Docker images), review and include the relevant third-party notices.

Architecture

Backend (backend/): FastAPI, SQLAlchemy (SQLite), sentence-transformers (e.g., BAAI/bge-m3) for embeddings, ChromaDB for the vector store. File parsing supports PDF, DOCX, CSV.
Frontend (frontend/): React 18 + Vite + Tailwind. Minimal state via context and hooks. Talks to the backend with a user-id header.
Storage (/data): SQLite app DB (app.db) and Chroma collection (/data/chroma_store).

Quick Start (Docker)

# From the repo root
docker compose up --build

Services:

Backend: http://localhost:8000 (health check: /health)
Frontend:
- Dev (Vite): http://localhost:5173 (via npm run dev)
- Container (Nginx): http://localhost:8080 (via frontend/Dockerfile)

Environment

Compose reads .env files from both root and backend. Key variables (see backend/.env.example):

CHROMA_DIR — directory for ChromaDB store (default /data/chroma_store)
DB_URL — SQLAlchemy DB URL (default sqlite:////data/app.db)
LLM_PROVIDER — openai | ollama
OPENAI_API_KEY — required if using OpenAI providers
OLLAMA_BASE_URL — e.g., http://host.docker.internal:11434
EMBED_MODEL — default BAAI/bge-m3

How It Works

Upload → /v1/files (requires user-id). Backend extracts text (OCR if enabled), splits into chunks, embeds, and stores in Chroma with user_id + doc_id.
Chat → /v1/chat with your doc_ids. Answers are grounded only on those documents.
Conversations → persisted in SQLite; list, read, or delete as needed.

API

GET /health
POST /v1/auth/signup, POST /v1/auth/signin
GET /v1/files
POST /v1/files
DELETE /v1/files/{doc_id}
POST /v1/reset
POST /v1/chat
GET /v1/conversations
GET /v1/conversations/{id}
DELETE /v1/conversations/{id}
POST /v1/admin/reset_all

Required Headers

user-id: <uuid-or-string> — identifies the user and scopes files/queries
Optional overrides:
- X-LLM-Provider: openai|ollama
- X-LLM-Model: <model-name>
- X-OpenAI-Key: <key>

Local Development

Backend

cd backend
pip install -e .
uvicorn app.main:app --reload  # http://localhost:8000

Frontend

cd frontend
npm install
npm run dev  # http://localhost:5173

Adjust API base in one of:

.env with VITE_API_BASE=http://localhost:8000
public/app-config.js (window.__APP_CONFIG__.API_BASE)

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
DEMO		DEMO
backend		backend
frontend		frontend
public		public
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Documents Chat(FastAPI + React + RAG + LLM(OpenAI/Local LLM))

System Overview

Request / Response Flows

Upload & Index (RAG)

Chat over Documents (RAG + LLM)

Third-Party Software

Architecture

Quick Start (Docker)

Environment

How It Works

API

Required Headers

Local Development

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Documents Chat(FastAPI + React + RAG + LLM(OpenAI/Local LLM))

System Overview

Request / Response Flows

Upload & Index (RAG)

Chat over Documents (RAG + LLM)

Third-Party Software

Architecture

Quick Start (Docker)

Environment

How It Works

API

Required Headers

Local Development

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages