The "Crawl, Walk, Run" of Implementing a RAG System

In this GitHub repo, I'm going to be using the same dataset and build a Human Resources RAG Agent three different ways: from the very basics of of RAG running on your local laptop using mainly open-source tooling, to fully managed platform on Google Cloud's Vertex AI. Welcome to my "Crawl, Walk, Run with Retrieval-Augmented Generation"!

I will use the same dataset through the three phases. It's British Columbia government's HR policy PDF documents. I've downloaded them all, bundled it in a tarball, and put them in a publicly readable GCS bucket access.

The Goal

This is mean to be educational and if you don't know how Retrieval-Augmented Generation works, this will hopefully get you a acquainted. If you do already know how it works, then maybe I can offer some new perspectives and tools that you can try out to enhance the results of you existing RAG system.

Crawl

The basics:

Process PDFs
Chunk + create embeddings
Insert into local vector database
Perform semantic search
ADK agent to interact with the user

Walk

Builds on 'Crawl' phase:

Improves document processing
Improves chunking strategy
Perform reranking after semantic search

Run

Run a fully-managed RAG system that applies the concepts covered in "Crawl" and "Walk" phases:

Vertex AI RAG Engine
Model Armor to provide guardrails

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
01_crawl		01_crawl
02_walk		02_walk
03_run		03_run
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The "Crawl, Walk, Run" of Implementing a RAG System

The Goal

Crawl

Walk

Run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

The "Crawl, Walk, Run" of Implementing a RAG System

The Goal

Crawl

Walk

Run

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages