Skip to content

Phoenix0531-sudo/Hokage_Vision_Agent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Hokage Vision Agent

Hokage Vision Agent logo

CI GUI Tests Docker Docs Package Desktop Build Python 3.12 License

An agentic computer vision workbench for anime character detection, powered by YOLO, PySide6, Docker, and tool-calling workflows.

This is a fan-made research and portfolio project and is not affiliated with Naruto, Shueisha, Pierrot, or related copyright holders.

中文 README · Documentation

Project Status

Hokage Vision Agent is positioned as a training-ready and model-ready computer vision workbench. The repository ships the engineering platform: desktop GUI, CLI, API, Agent tool orchestration, dataset validation, annotation assistance, training dry-runs, model registry, evaluation, packaging, CI, and documentation.

It does not publish Naruto/Hokage screenshots, private datasets, or real character-detection weights. Any such artifacts are intentionally external because image sources, redistribution rights, model license, and non-commercial research scope must be reviewed before release. The committed demo uses a deterministic mock backend and a tiny synthetic YOLO smoke dataset so the full project can be tested without copyrighted media, GPU, private data, or API keys.

Features

  • Shared detection types and inference service for CLI, GUI, API, and Agent workflows.
  • Deterministic mock backend for CI, demos, and headless GUI tests without GPU or model downloads.
  • PySide6 desktop GUI with image, video, batch, settings, statistics, and agent assistant panels.
  • Rule-based Agent with allowlisted project tools and optional LLM provider extension points.
  • Synthetic smoke dataset, dataset manifest, YOLO validation, annotation assistance, training dry-runs, evaluation, and model comparison foundations.
  • FastAPI service for health, model listing, mock detection, agent runs, dataset validation, smoke training, and model comparison.
  • Docker-first development, CI, package build, desktop executable build, and MkDocs documentation.

Screenshots

Portfolio preview assets live under assets/screenshots/. They are generated project visuals, not Naruto screenshots or redistributed anime media.

GUI preview

CLI preview

Docker-first Quick Start

docker compose build
docker compose run --rm test
docker compose run --rm gui-test

Start the API:

docker compose up api

Build docs, Python packages, and the Linux desktop executable:

docker compose run --rm docs
docker compose run --rm package
docker compose run --rm desktop-build

Build the optional training image only when real YOLO training dependencies are needed:

docker compose --profile train build train
docker compose --profile train run --rm train

Docker is the primary workflow. Local Python installation is optional. Docker Compose defaults to a Debian mirror for more stable local builds; override DEBIAN_MIRROR and DEBIAN_SECURITY_MIRROR if another mirror is faster for your network.

Interview Demo Path

This is the shortest reproducible demo path for an interview or portfolio walkthrough:

docker compose run --rm test hokage-vision dataset validate configs/dataset.example.yaml
docker compose run --rm test hokage-vision detect image examples/images/sample.jpg --backend mock
docker compose run --rm test hokage-vision agent run "训练模型"
docker compose run --rm gui-test pytest tests/gui -m gui
docker compose up api

The story to tell is simple: the project is not claiming a public Naruto model release; it demonstrates the production workflow around such a model: safe data governance, model-pluggable inference, repeatable Docker validation, and Agent-controlled training/evaluation orchestration.

Local Optional Install

python -m venv .venv
pip install -e ".[dev,gui,api,train]"

GUI Demo

hokage-vision gui

The GUI defaults to the mock backend. Configure real weights through Settings or YAML config. Docker headless GUI tests are supported; Docker is not advertised as a zero-configuration way to display a real desktop GUI on every host.

CLI Demo

hokage-vision --help
hokage-vision detect image examples/images/sample.jpg --backend mock
hokage-vision detect folder examples/images --backend mock
hokage-vision dataset validate configs/dataset.example.yaml
hokage-vision train yolo --data configs/dataset.example.yaml --epochs 1 --dry-run
hokage-vision model compare --models models/a.pt models/b.pt --mock

Agent Demo

hokage-vision agent run "检测 examples/images 里的图片"
hokage-vision agent run "检查数据集并给出训练建议"

The default agent is rule-based, does not require API keys, and only calls allowlisted project tools. It does not execute arbitrary shell commands or scrape copyrighted images.

API Demo

docker compose up api
curl http://localhost:8000/health

OpenAPI docs are available at http://localhost:8000/docs.

Dataset and Training Workflow

  1. Record image sources and redistribution terms in a dataset manifest.
  2. Validate YOLO dataset structure and labels.
  3. Use annotation assistance only to generate review-required candidates.
  4. Manually review annotations.
  5. Run smoke training or a real training dry-run.
  6. Execute real training only after explicit confirmation.
  7. Register, evaluate, and compare models before release.

The included examples/dataset/ fixture is synthetic and exists only to prove dataset validation and training planning. A real character model requires user-provided or otherwise licensed images, reviewed annotations, and external weight storage.

Adding a new character class requires new images, verified rights, bounding-box annotations, updated class names, dataset YAML changes, retraining or fine-tuning, evaluation, registry updates, and documentation updates.

Project Structure

src/hokage_vision/   Core package for config, vision, data, training, agents, API, and UI
apps/                Thin desktop and API entrypoints
configs/             Default app, model, agent, dataset, and training config
docs/                MkDocs static documentation site
tests/               Unit, integration, GUI, and packaging tests
models/              Local registry metadata and external weight placement notes
data/                Local data workspace with manifest and license guidance
legacy/old_project/  Isolated legacy YOLOv5 + PySide6 tree for audit and compatibility

Architecture

The GUI, CLI, API, and Agent layers all call shared services. YOLO/CV backends perform detection. Agents only plan and orchestrate project-scoped tools.

Roadmap

  • Add a Docker train profile for CPU-safe real training once training dependencies are separated from the default test image.
  • Add model cards and release metadata for any reviewed external weights.
  • Expand evaluation reports with real metrics after data rights are approved.
  • Harden desktop packaging across Linux, Windows, and macOS.

License

New Hokage Vision Agent code is intended to be Apache-2.0. Legacy YOLOv5-derived code remains governed by the applicable upstream YOLOv5 license. Model weights, datasets, annotations, and documentation may have separate license terms. See LICENSES/README.md and docs/license-audit.md.

Acknowledgements

This project builds on the Python, PySide6/Qt, FastAPI, Ultralytics/YOLO, Docker, MkDocs, and open source testing ecosystems.

About

An agentic anime character detection workbench powered by YOLO, PySide6, Docker, and tool workflows. 基于 YOLO、PySide6、Docker 与 Agent 工具编排的动漫角色检测工作台。

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors