Chat with your PDFs, Word docs, and text files—powered by Llama 3 on Groq, LangChain, and Streamlit.
FileIQ lets you upload multiple documents (PDF, DOCX, TXT) and query them conversationally. Under the hood it
- splits documents into semantic chunks, embeds them with
sentence‑transformers, - stores them in a FAISS vector index,
- retrieves relevant chunks at question time via LangChain’s Conversational Retrieval Chain, and
- streams answers from blazing‑fast Llama 3 (or DeepSeek) models served by Groq API, complete with inline source citations.
Feel free to clone and run locally. A public demo link will be added soon.
- Multi‑document support – batch‑upload PDFs, TXT, or DOCX files
- Natural‑language Q&A – chat like you would with a human tutor
- Source transparency – expand any answer to see the exact excerpts used
- Model choice – toggle between
llama-3.3‑70b‑versatile,llama-3.1‑8b‑instant, anddeepseek‑r1‑distill‑llama‑70b - Fully local front‑end – quick, light Streamlit UI
| Layer | Tools |
|---|---|
| UI | Streamlit 1.34† |
| LLM | Groq API (Llama 3 & DeepSeek) |
| Retrieval | LangChain v0.3 ConversationalRetrievalChain |
| Embeddings | sentence‑transformers/all‑MiniLM‑L6‑v2 |
| Vector Store | FAISS |
† Any recent Streamlit version ≥1.28 should work.
├── app.py # main Streamlit app
├── requirements.txt # Python deps
├── README.md # you’re here ✨
└── docs/
└── demo.gif # optional demo screencast
# 1️⃣ Clone
$ git clone https://github.com/choudaryhussainali/fileiq_document-intelligence-bot.git
$ cd document-intelligence-bot
# 2️⃣ Install deps (Python ≥ 3.9)
$ pip install -r requirements.txt
# 3️⃣ Set your Groq API key
$ export GROQ_API_KEY="sk‑..." # Linux / macOS
$ setx GROQ_API_KEY "sk‑..." # Windows
# 4️⃣ Run
$ streamlit run app.pyThen open the provided local URL, upload some docs, and start chatting!
| Variable | Purpose |
|---|---|
GROQ_API_KEY |
Required. Your secret key from https://console.groq.com |
You can also enter the key in the sidebar at runtime, but exporting it avoids re‑typing.
- Quickly summarise lengthy research papers or policy docs
- Extract obligations, dates, and parties from contracts
- Create study aids from lecture notes
- Generate FAQ answers from technical manuals
Pull requests are welcome! Open an issue first to discuss major changes.
- Fork → Create branch → Commit → Push → PR.
- Follow PEP‑8 and conventional commit messages.
- Run
pre‑commithooks before pushing.
This project is proprietary and confidential. All rights reserved.
© 2025 HUSSAIN ALI. This code may not be copied, modified, distributed, or used without explicit permission.
For questions or collaboration requests:
- 📧 Email: choudaryhussainali@outlook.com
- 🌐 GitHub: choudaryhussainali
- LangChain for the retrieval framework
- Groq for ultra‑fast inference
- Sentence‑Transformers for lightweight embeddings
✨ “Information is only useful when it can be questioned.” – Adapted from McNamara
Happy chatting!
