Hokage Vision Agent

An agentic computer vision workbench for anime character detection, powered by YOLO, PySide6, Docker, and tool-calling workflows.

This is a fan-made research and portfolio project and is not affiliated with Naruto, Shueisha, Pierrot, or related copyright holders.

中文 README · Documentation

Project Status

Hokage Vision Agent is positioned as a training-ready and model-ready computer vision workbench. The repository ships the engineering platform: desktop GUI, CLI, API, Agent tool orchestration, dataset validation, annotation assistance, training dry-runs, model registry, evaluation, packaging, CI, and documentation.

It does not publish Naruto/Hokage screenshots, private datasets, or real character-detection weights. Any such artifacts are intentionally external because image sources, redistribution rights, model license, and non-commercial research scope must be reviewed before release. The committed demo uses a deterministic mock backend and a tiny synthetic YOLO smoke dataset so the full project can be tested without copyrighted media, GPU, private data, or API keys.

Features

Shared detection types and inference service for CLI, GUI, API, and Agent workflows.
Deterministic mock backend for CI, demos, and headless GUI tests without GPU or model downloads.
PySide6 desktop GUI with image, video, batch, settings, statistics, and agent assistant panels.
Rule-based Agent with allowlisted project tools and optional LLM provider extension points.
Synthetic smoke dataset, dataset manifest, YOLO validation, annotation assistance, training dry-runs, evaluation, and model comparison foundations.
FastAPI service for health, model listing, mock detection, agent runs, dataset validation, smoke training, and model comparison.
Docker-first development, CI, package build, desktop executable build, and MkDocs documentation.

Screenshots

Portfolio preview assets live under assets/screenshots/. They are generated project visuals, not Naruto screenshots or redistributed anime media.

Docker-first Quick Start

docker compose build
docker compose run --rm test
docker compose run --rm gui-test

Start the API:

docker compose up api

Build docs, Python packages, and the Linux desktop executable:

docker compose run --rm docs
docker compose run --rm package
docker compose run --rm desktop-build

Build the optional training image only when real YOLO training dependencies are needed:

docker compose --profile train build train
docker compose --profile train run --rm train

Docker is the primary workflow. Local Python installation is optional. Docker Compose defaults to a Debian mirror for more stable local builds; override DEBIAN_MIRROR and DEBIAN_SECURITY_MIRROR if another mirror is faster for your network.

Interview Demo Path

This is the shortest reproducible demo path for an interview or portfolio walkthrough:

docker compose run --rm test hokage-vision dataset validate configs/dataset.example.yaml
docker compose run --rm test hokage-vision detect image examples/images/sample.jpg --backend mock
docker compose run --rm test hokage-vision agent run "训练模型"
docker compose run --rm gui-test pytest tests/gui -m gui
docker compose up api

The story to tell is simple: the project is not claiming a public Naruto model release; it demonstrates the production workflow around such a model: safe data governance, model-pluggable inference, repeatable Docker validation, and Agent-controlled training/evaluation orchestration.

Local Optional Install

python -m venv .venv
pip install -e ".[dev,gui,api,train]"

GUI Demo

hokage-vision gui

The GUI defaults to the mock backend. Configure real weights through Settings or YAML config. Docker headless GUI tests are supported; Docker is not advertised as a zero-configuration way to display a real desktop GUI on every host.

CLI Demo

hokage-vision --help
hokage-vision detect image examples/images/sample.jpg --backend mock
hokage-vision detect folder examples/images --backend mock
hokage-vision dataset validate configs/dataset.example.yaml
hokage-vision train yolo --data configs/dataset.example.yaml --epochs 1 --dry-run
hokage-vision model compare --models models/a.pt models/b.pt --mock

Agent Demo

hokage-vision agent run "检测 examples/images 里的图片"
hokage-vision agent run "检查数据集并给出训练建议"

The default agent is rule-based, does not require API keys, and only calls allowlisted project tools. It does not execute arbitrary shell commands or scrape copyrighted images.

API Demo

docker compose up api
curl http://localhost:8000/health

OpenAPI docs are available at http://localhost:8000/docs.

Dataset and Training Workflow

Record image sources and redistribution terms in a dataset manifest.
Validate YOLO dataset structure and labels.
Use annotation assistance only to generate review-required candidates.
Manually review annotations.
Run smoke training or a real training dry-run.
Execute real training only after explicit confirmation.
Register, evaluate, and compare models before release.

The included examples/dataset/ fixture is synthetic and exists only to prove dataset validation and training planning. A real character model requires user-provided or otherwise licensed images, reviewed annotations, and external weight storage.

Adding a new character class requires new images, verified rights, bounding-box annotations, updated class names, dataset YAML changes, retraining or fine-tuning, evaluation, registry updates, and documentation updates.

Project Structure

src/hokage_vision/   Core package for config, vision, data, training, agents, API, and UI
apps/                Thin desktop and API entrypoints
configs/             Default app, model, agent, dataset, and training config
docs/                MkDocs static documentation site
tests/               Unit, integration, GUI, and packaging tests
models/              Local registry metadata and external weight placement notes
data/                Local data workspace with manifest and license guidance
legacy/old_project/  Isolated legacy YOLOv5 + PySide6 tree for audit and compatibility

Architecture

The GUI, CLI, API, and Agent layers all call shared services. YOLO/CV backends perform detection. Agents only plan and orchestrate project-scoped tools.

Roadmap

Add a Docker train profile for CPU-safe real training once training dependencies are separated from the default test image.
Add model cards and release metadata for any reviewed external weights.
Expand evaluation reports with real metrics after data rights are approved.
Harden desktop packaging across Linux, Windows, and macOS.

License

New Hokage Vision Agent code is intended to be Apache-2.0. Legacy YOLOv5-derived code remains governed by the applicable upstream YOLOv5 license. Model weights, datasets, annotations, and documentation may have separate license terms. See LICENSES/README.md and docs/license-audit.md.

Acknowledgements

This project builds on the Python, PySide6/Qt, FastAPI, Ultralytics/YOLO, Docker, MkDocs, and open source testing ecosystems.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.devcontainer		.devcontainer
.github		.github
LICENSES		LICENSES
apps		apps
assets		assets
configs		configs
data		data
docs		docs
examples		examples
legacy		legacy
models		models
reports		reports
scripts		scripts
src/hokage_vision		src/hokage_vision
tests		tests
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
README.zh-CN.md		README.zh-CN.md
SECURITY.md		SECURITY.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
docker-compose.yml		docker-compose.yml
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
requirements-api.txt		requirements-api.txt
requirements-desktop-build.txt		requirements-desktop-build.txt
requirements-docker.txt		requirements-docker.txt
requirements-docs.txt		requirements-docs.txt
requirements-gui.txt		requirements-gui.txt
requirements-train.txt		requirements-train.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hokage Vision Agent

Project Status

Features

Screenshots

Docker-first Quick Start

Interview Demo Path

Local Optional Install

GUI Demo

CLI Demo

Agent Demo

API Demo

Dataset and Training Workflow

Project Structure

Architecture

Roadmap

License

Acknowledgements

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hokage Vision Agent

Project Status

Features

Screenshots

Docker-first Quick Start

Interview Demo Path

Local Optional Install

GUI Demo

CLI Demo

Agent Demo

API Demo

Dataset and Training Workflow

Project Structure

Architecture

Roadmap

License

Acknowledgements

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages