amd · itomek · Apr 6, 2026 · Apr 7, 2026 · Apr 8, 2026 · Apr 8, 2026
@@ -0,0 +1,40 @@
+{
+  "repo_notes": [
+    {
+      "content": "GAIA is AMD's open-source framework for building AI agents in Python and C++ that run entirely on local hardware. No cloud dependency — all processing stays on-device with AMD NPU and GPU acceleration on Ryzen AI processors. The repository has two frameworks: a Python SDK (src/gaia/) and a C++ SDK (cpp/). Documentation lives in docs/ using .mdx format (Mintlify), published at https://amd-gaia.ai. Python agents inherit from the base Agent class in src/gaia/agents/base/agent.py and register tools via the @tool decorator. LLM inference runs locally via Lemonade Server. Default models: Qwen3-0.6B-GGUF (general), Qwen3.5-35B-A3B-GGUF (agents/code), Qwen3-VL-4B-Instruct-GGUF (vision). CLI entry point: src/gaia/cli.py. Development setup: uv pip install -e '.[dev]'."
+    }
+  ],
+  "pages": [
+    {
+      "title": "Architecture Overview",
+      "purpose": "High-level architecture of both Python and C++ frameworks: agent system, LLM backends (Lemonade Server), MCP integration, and how components connect"
+    },
+    {
+      "title": "Python Agent Framework",
+      "purpose": "Base Agent class (src/gaia/agents/base/), tool registration with @tool decorator, mixins (MCPAgent, ApiAgent), and how to create new agents",
+      "parent": "Architecture Overview"
+    },
+    {
+      "title": "C++ Agent Framework",
+      "purpose": "C++ SDK for building native agents (cpp/), gaia::Agent base class, tool registration, health/wifi/process agent examples",
+      "parent": "Architecture Overview"
+    },
+    {
+      "title": "Agent UI",
+      "purpose": "Privacy-first desktop chat with drag-and-drop document Q&A. FastAPI backend (src/gaia/ui/), React/Electron frontend (src/gaia/apps/webui/), launched via gaia --ui"
+    },
+    {
+      "title": "Core Capabilities",
+      "purpose": "Document Q&A with RAG (src/gaia/rag/), speech-to-speech (Whisper ASR + Kokoro TTS in src/gaia/audio/), image generation (Stable Diffusion in src/gaia/agents/sd/), agent routing, and MCP integration"
+    },
+    {
+      "title": "Code Index",
+      "purpose": "Semantic code search over repositories using FAISS + Lemonade embeddings (src/gaia/code_index/), CLI via gaia index, dedicated CodeIndexAgent",
+      "parent": "Core Capabilities"
+    },
+    {
+      "title": "CLI and Configuration",
+      "purpose": "CLI entry point (gaia command) with subcommands: chat, talk, llm, api, mcp, index, sd, blender, jira, docker, eval. Also gaia --ui for Agent UI"
+    }
+  ]
+}
@@ -429,3 +429,22 @@ EOF
             release-assets/*
             dist/**
           body_path: RELEASE_BODY.md
+
+  refresh-context7:
+    runs-on: ubuntu-latest
+    needs: [github-release]
+    steps:
+      - name: Refresh Context7
+        run: |
+          HTTP_STATUS=$(curl -s -o /dev/null -w "%{http_code}" \
+            -X POST https://context7.com/api/v1/refresh \
+            -H "Authorization: Bearer ${{ secrets.CONTEXT7_API_KEY }}" \
+            -H "Content-Type: application/json" \
+            -d '{"libraryName": "/amd/gaia"}' || true)
+          if [ "$HTTP_STATUS" = "200" ] || [ "$HTTP_STATUS" = "202" ]; then
+            echo "Context7 refresh triggered (HTTP $HTTP_STATUS)"
+          elif [ "$HTTP_STATUS" = "429" ]; then
+            echo "::warning::Context7 rate limited — refresh skipped"
+          else
+            echo "::warning::Context7 refresh returned HTTP $HTTP_STATUS"
+          fi
@@ -0,0 +1,25 @@
+{
+  "$schema": "https://context7.com/schema/context7.json",
+  "projectTitle": "GAIA",
+  "description": "AMD's open-source framework for building AI agents in Python and C++ that run entirely on local hardware, with AMD NPU and GPU acceleration on Ryzen AI processors.",
+  "folders": ["docs"],
+  "excludeFolders": ["tests", "scripts", "workshop", "docs/spec"],
+  "excludeFiles": ["CHANGELOG.md"],
+  "rules": [
+    "GAIA has two frameworks: Python (src/gaia/) and C++ (cpp/). Most documentation covers the Python SDK.",
+    "Python agents inherit from the base Agent class in src/gaia/agents/base/agent.py",
+    "Tools are registered using the @tool decorator from gaia.agents.base.tools",
+    "LLM inference runs locally via Lemonade Server on AMD NPU/GPU hardware",
+    "Default models: Qwen3-0.6B-GGUF (general), Qwen3.5-35B-A3B-GGUF (agents/code), Qwen3-VL-4B-Instruct-GGUF (vision)",
+    "Agent UI is the primary user interface — launch with 'gaia --ui' for privacy-first desktop chat with document Q&A",
+    "All new features require tests in tests/ and documentation in docs/ (.mdx format for Mintlify)",
+    "Use 'uv pip install -e \".[dev]\"' for development setup (quote the extras so shells such as zsh do not glob the brackets)",
+    "The code index (gaia.code_index) provides semantic search over repositories using local FAISS + Lemonade embeddings"
+  ],
+  "previousVersions": [
+    {"tag": "v0.17.1"},
+    {"tag": "v0.17.0"},
+    {"tag": "v0.16.0"},
+    {"tag": "v0.15.0"}
+  ]
+}
@@ -64,6 +64,7 @@
                   "guides/chat",
                   "guides/talk",
                   "guides/code",
+                  "guides/code-index",
                   "guides/sd",
                   "guides/emr",
                   "guides/blender",
@@ -135,6 +136,7 @@
                       "sdk/sdks/chat",
                       "sdk/sdks/agent-ui",
                       "sdk/sdks/rag",
+                      "sdk/sdks/code-index",
                       "sdk/sdks/mcp",
                       "sdk/sdks/llm",
                       "sdk/sdks/vlm",

@@ -0,0 +1,180 @@
+---
+title: Code Index
+description: Semantic search over your codebase, git history, and pull requests using local AMD-accelerated embeddings.
+---
+
+The GAIA Code Index enables fast semantic search over large codebases without sending your code to the cloud. It parses source files, generates embeddings via Lemonade Server on AMD NPU/GPU hardware, and stores them in a local FAISS index for sub-second queries.
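
As a rough illustration of the retrieval step described above, the sketch below runs an exhaustive squared-L2 nearest-neighbor search in plain Python, which is the kind of comparison a flat L2 index performs. The vectors, labels, and function names are invented for the example; it does not use FAISS or any GAIA API.

```python
from typing import List, Sequence, Tuple


def squared_l2(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def flat_search(
    index: List[Tuple[str, Sequence[float]]],
    query: Sequence[float],
    top_k: int = 3,
) -> List[Tuple[str, float]]:
    """Score every stored vector against the query and return the top_k closest."""
    scored = [(label, squared_l2(vec, query)) for label, vec in index]
    scored.sort(key=lambda item: item[1])  # smaller distance = more similar
    return scored[:top_k]


# Toy 3-dimensional "embeddings" (hypothetical); real ones come from the embedding model.
toy_index = [
    ("agent.py:handle_error", [0.9, 0.1, 0.0]),
    ("cli.py:main", [0.0, 1.0, 0.0]),
    ("rag.py:retrieve", [0.1, 0.0, 0.9]),
]

hits = flat_search(toy_index, query=[1.0, 0.0, 0.0], top_k=2)
for label, distance in hits:
    print(f"{label}  distance={distance:.2f}")
```

An exhaustive scan like this is exact but linear in the number of chunks, which is usually acceptable for a single repository.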
+
+## Overview
+
+| Feature | Description |
+|---------|-------------|
+| **Languages** | Python (AST), JavaScript, TypeScript, Go, Rust, Java, C, C++ |
+| **Git history** | Optional — index commit messages and file changes |
+| **PR search** | Optional — index closed/merged GitHub PRs via `gh` CLI |
+| **Embeddings** | Local AMD NPU/GPU via Lemonade Server |
+| **Storage** | `~/.gaia/code_index/<repo-hash>/` |
+
+## Setup
+
+Install the required dependency:
+
+```bash
+pip install faiss-cpu
+# or, for CUDA-accelerated FAISS search (the faiss-gpu wheels target NVIDIA GPUs):
+pip install faiss-gpu
+```
+
+Lemonade Server must be running to generate embeddings:
+
+```bash
+lemonade-server serve
+```
+
+## CLI Usage
+
+### Index a repository
+
+```bash
+# Index the current directory
+gaia index
+
+# Index a specific repository
+gaia index --repo /path/to/repo
+
+# Include git history (commit messages and changed files)
+gaia index --repo /path/to/repo --git-history
+
+# Include GitHub pull requests (requires gh CLI and authentication)
+gaia index --repo /path/to/repo --prs
+```
+
+### Search the index
+
+```bash
+# Semantic search across code, commits, and PRs
+gaia index search "how does the agent handle errors"
+
+# Search only source code
+gaia index search "authentication flow" --scope code
+
+# Search commit history
+gaia index search "fix memory leak" --scope commit
+
+# Return more results
+gaia index search "embedding model" --top-k 20
+```
+
+### Manage the index
+
+```bash
+# Show index status
+gaia index status
+
+# Clear and rebuild
+gaia index clear
+gaia index
+```
+
+## Agent Tools
+
+When the code index is wired into an agent (ChatAgent or CodeAgent), five tools become available:
+
+| Tool | Description |
+|------|-------------|
+| `index_codebase` | Index a repository (path optional) |
+| `search_code_index` | Semantic search over indexed chunks |
+| `code_index_status` | Show index statistics |
+| `clear_code_index` | Remove the cached index |
+| `search_git_history` | Text search over commit messages via git |
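
A text search over commit messages can be approximated with `git log --grep`. The sketch below is one possible way to do it from Python; the helper names and defaults are hypothetical, and it is not taken from GAIA's implementation of `search_git_history`.

```python
import subprocess
from typing import List


def build_git_log_command(pattern: str, max_count: int = 20) -> List[str]:
    """Build a `git log` invocation that filters commits by message text."""
    return [
        "git",
        "log",
        f"--grep={pattern}",        # match against commit messages
        "--regexp-ignore-case",     # case-insensitive matching
        f"--max-count={max_count}", # cap the number of commits returned
        "--pretty=format:%h %s",    # short hash + subject line
    ]


def search_commits(repo_path: str, pattern: str, max_count: int = 20) -> List[str]:
    """Run the search inside repo_path and return one line per matching commit."""
    result = subprocess.run(
        build_git_log_command(pattern, max_count),
        cwd=repo_path,
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if git fails (e.g. not a repository)
    )
    return [line for line in result.stdout.splitlines() if line]
```

Passing the command as a list (rather than a shell string) avoids quoting problems when the search pattern contains spaces or shell metacharacters.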
+
+### Example agent interaction
+
+```
+User: Find all places in the codebase where we handle authentication errors
+
+Agent: [calls search_code_index with query="authentication error handling"]
+
+Results:
+- src/gaia/agents/base/agent.py:145 — handle_error() function
+- src/gaia/llm/lemonade_client.py:89 — auth retry logic
+- tests/unit/test_auth.py:23 — test_auth_error_recovery
+```
+
+## Python SDK
+
+```python
+from gaia.code_index.sdk import CodeIndexConfig, CodeIndexSDK
+
+config = CodeIndexConfig(
+    repo_path="/path/to/repo",
+    index_git_history=True,
+    index_prs=False,
+    max_files=5000,
+    embedding_model="nomic-embed-text-v2-moe-GGUF",
+)
+
+sdk = CodeIndexSDK(config)
+
+# Index the repository
+result = sdk.index_repository()
+print(f"Indexed {result.files_indexed} files, {result.chunks_created} chunks")
+
+# Search
+results = sdk.search("how does agent tool registration work", top_k=5)
+for r in results:
+    chunk = r.chunk
+    print(f"{chunk.file_path}:{chunk.start_line} — {chunk.symbol_name} (score: {r.score:.3f})")
+
+# Check status
+status = sdk.get_status()
+print(f"Total chunks: {status['total_chunks']}")
+```
+
+## Configuration
+
+```python
+from gaia.code_index.sdk import CodeIndexConfig
+
+config = CodeIndexConfig(
+    repo_path=".",              # Repository root path (required)
+    max_files=5000,             # Max files to index
+    max_file_size_mb=1.0,       # Skip files larger than this
+    chunk_overlap=50,           # Token overlap between chunks
+    embedding_model="nomic-embed-text-v2-moe-GGUF",  # Lemonade model
+    cache_dir="~/.gaia/code_index",  # Cache location
+    index_git_history=True,     # Include git commits
+    index_prs=False,            # Include GitHub PRs
+    max_commits=1000,           # Max commits to index
+    embedding_base_url=None,    # Custom Lemonade URL (default: localhost)
+)
+```
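
To make `chunk_overlap` concrete, here is a minimal sliding-window sketch. The `chunk_size` parameter and the function name are assumptions for illustration (the configuration above only exposes the overlap), and the real chunker may split on symbol boundaries rather than fixed windows.

```python
from typing import List


def chunk_with_overlap(tokens: List[str], chunk_size: int, overlap: int) -> List[List[str]]:
    """Split tokens into windows of chunk_size that share `overlap` tokens with the previous window."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks: List[List[str]] = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break  # the last window already reached the end of the input
    return chunks


tokens = [f"t{i}" for i in range(10)]
chunks = chunk_with_overlap(tokens, chunk_size=4, overlap=2)
for chunk in chunks:
    print(chunk)
```

Overlap trades index size for recall: text that straddles a window boundary still appears intact in at least one chunk.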
+
+## Supported Languages
+
+| Language | Parser | Symbols extracted |
+|----------|--------|-------------------|
+| Python | AST | Functions, classes, methods |
+| JavaScript/TypeScript | Regex | Functions, classes, interfaces, arrow functions |
+| Go | Regex | Functions, structs, interfaces |
+| Rust | Regex | Functions (`fn`), structs, enums, impl blocks |
+| Java | Regex | Classes, methods |
+| C/C++ | Regex | Functions |
+| Other | Block splitter | Paragraph blocks |
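
For the Python row, symbol extraction with the standard-library `ast` module might look like the sketch below. It only illustrates the technique; the function name and the returned tuple shape are assumptions, not GAIA's parser.

```python
import ast
from typing import List, Optional, Tuple


def extract_symbols(source: str) -> List[Tuple[str, str, int]]:
    """Return (kind, qualified_name, line) for top-level functions, classes, and their methods."""
    symbols: List[Tuple[str, str, int]] = []

    def visit(node: ast.AST, parent_class: Optional[str] = None) -> None:
        for child in ast.iter_child_nodes(node):
            if isinstance(child, ast.ClassDef):
                symbols.append(("class", child.name, child.lineno))
                visit(child, parent_class=child.name)
            elif isinstance(child, (ast.FunctionDef, ast.AsyncFunctionDef)):
                if parent_class:
                    symbols.append(("method", f"{parent_class}.{child.name}", child.lineno))
                else:
                    symbols.append(("function", child.name, child.lineno))
                # Functions nested inside other functions are deliberately not descended into.

    visit(ast.parse(source))
    return symbols


sample = '''
class Agent:
    def run(self):
        pass

def main():
    pass
'''

symbols = extract_symbols(sample)
for kind, name, line in symbols:
    print(f"{kind:8} {name} (line {line})")
```

Parsing with `ast` gives exact line numbers and nesting, which regex-based extraction for the other languages can only approximate.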
+
+## Cache Layout
+
+```
+~/.gaia/code_index/
+└── <repo-hash>/
+    ├── metadata.json    # Chunk metadata, file hashes, model name
+    └── index.faiss      # FAISS IndexFlatL2 embeddings
+```
+
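+The cache is keyed by a hash of the repository root path. The embedding model name is stored in `metadata.json`; if the configured model differs from the stored one, a warning is shown and the repository must be re-indexed, because embeddings produced by different models are not comparable.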
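
The cache keying and model check could be sketched as follows. The hash function (SHA-256), the 16-character truncation, and the `embedding_model` metadata key are assumptions chosen for illustration; the actual scheme may differ.

```python
import hashlib
import json
from pathlib import Path


def cache_dir_for(repo_path: str, cache_root: str = "~/.gaia/code_index") -> Path:
    """Derive a per-repository cache directory from the resolved repository root path."""
    resolved = Path(repo_path).expanduser().resolve()
    # Assumed scheme: truncated SHA-256 of the absolute path.
    digest = hashlib.sha256(str(resolved).encode("utf-8")).hexdigest()[:16]
    return Path(cache_root).expanduser() / digest


def model_changed(metadata_path: Path, current_model: str) -> bool:
    """Return True when an existing index was built with a different embedding model."""
    if not metadata_path.exists():
        return False  # no index yet, so there is nothing to invalidate
    stored = json.loads(metadata_path.read_text(encoding="utf-8"))
    # "embedding_model" is a hypothetical key name.
    return stored.get("embedding_model") != current_model


cache_dir = cache_dir_for(".")
if model_changed(cache_dir / "metadata.json", "nomic-embed-text-v2-moe-GGUF"):
    print("Embedding model changed; re-index required")
```

Hashing the resolved path means the same repository reached through a relative path or a symlink maps to the same cache directory.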
+
+## Privacy
+
+All processing is local. No source code, commit messages, or PR data is sent to external services. Embeddings are generated by your local Lemonade Server instance using AMD NPU/GPU hardware.
+
+Sensitive files are automatically excluded from indexing: `.env`, `.pem`, `.key`, credential files, and files matching common secret patterns.
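
A filename-based exclusion filter along these lines is one way to implement such a rule. The pattern list below is hypothetical and intentionally small; it is not the list GAIA ships.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# Hypothetical patterns, matched against the lowercased file name only.
SENSITIVE_PATTERNS = [".env", ".env.*", "*.pem", "*.key", "*credentials*", "*secret*"]


def is_sensitive(path: str) -> bool:
    """Return True if the file name matches any sensitive pattern (case-insensitive)."""
    name = PurePosixPath(path).name.lower()
    return any(fnmatchcase(name, pattern) for pattern in SENSITIVE_PATTERNS)


candidates = ["config/.env", "certs/server.pem", "deploy/AWS_Credentials.json", "src/gaia/cli.py"]
indexable = [p for p in candidates if not is_sensitive(p)]
print(indexable)
```

Name-based matching errs on the side of excluding too much (for example, any file with "secret" in its name), which is usually the safer default for an indexer.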
@@ -37,7 +37,7 @@ The GAIA Docker Agent provides a natural language interface for containerizing a
 
    Note: The model is over 17GB and can take a while to download depending on your internet connection. It provides excellent results for Dockerfile generation and application analysis.
 
-   **Important**: The Docker agent requires a higher context size (8192) than the default (4096) to handle complex application analysis and Dockerfile generation. For more details on Lemonade Server CLI options, see the [Lemonade Server documentation](https://lemonade-server.ai/docs/server/lemonade-server-cli/#command-line-options-for-serve-and-run).
+   **Important**: The Docker agent requires a higher context size (8192) than the default (4096) to handle complex application analysis and Dockerfile generation. For more details on Lemonade Server CLI options, see the [Lemonade Server documentation](https://lemonade-server.ai/docs/lemonade-cli#options-for-run).
 
 ### Verify Installation