Skip to content

franklioxygen/agent-workflows

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Agent Workflows

Language: English | 简体中文

Reusable engineering workflows for AI coding agents.

agent-workflows helps agents choose the right process for project initialization, feature work, bug fixes, code review, incident response, refactoring, and tech debt cleanup. The library separates workflow-specific guidance from shared safety, preflight, and validation conventions so the docs stay reusable and easier to maintain, while adding checkpoints that help absorb silent base-model quality drift.

EC6BFAEE-1138-4B0E-B6AB-23A17A72F38A

New to the library? Start with how-to-use-agent-workflows.md.

Quick Start

Choose one workflow and follow it directly:

Manual usage example:

Use the bug-fix workflow in bug-fix-agent-workflow.md for this issue:

<bug report>

Automation usage example:

Use $workflow-automation to select the right workflow and execute it for this task:

<task description>

Why Use Workflows Instead of a Single Prompt?

The difference is usually not raw model capability. It is process discipline: workflows force triage, validation, and handoff steps that ad hoc prompting often skips.

Workflows also help absorb silent base-model quality drift. If the underlying model becomes less careful, less reliable, or less consistent without an obvious product change, the workflow still adds checkpoints that reduce the chance of a major hidden drop in output quality.

The chart below comes from a 6-task exploratory Claude Code study. The full statistical protocol (blinded scoring, bootstrap confidence intervals, paired permutation tests with Holm–Bonferroni correction, inter-rater agreement) is defined in evaluation/README.md.

Measured Outcome Comparison

Scoring note: higher is better for every score.

Metric key:

  • TPR: task pass rate — average task-level pass rate across repeated runs.
  • RP@k: reliable pass at k — share of tasks where every repeated run passes.
  • CPR: clean pass rate — share of runs that pass, validate, introduce no regression, and need no rework.
  • RFR: regression-free rate — share of runs with no unrelated regression and, when declared, passing locked evaluator checks.
  • NRR: no-rework rate — share of runs that need no repair pass.

Interpretation:

  • Task Pass Rate improves because workflows help execution discipline and validation, not because they change the underlying model's raw capability.
  • Reliable Pass@k improves more noticeably because workflows reduce variance by making the agent follow a stable sequence of triage, implementation, and validation steps.
  • Clean Pass Rate is the strictest headline quality metric because a run must pass acceptance criteria, pass validation, avoid unrelated regressions, and need no repair pass.
  • Regression-Free Rate and No-Rework Rate improve because workflows reduce mistakes by enforcing baseline capture, hidden regression/evaluator checks, and post-change revalidation.
  • Workflows are also more resilient to silent base-model regressions, because process checkpoints catch quality drops that a one-shot prompt may otherwise let through unchecked.

Available Workflows

Shared Building Blocks

Bundled Skills

This repository includes Codex skills for using and maintaining the workflow library:

Shared support files for bundled skills live in skills/_shared/. This is not an installable skill; it contains reusable helper scripts and shared operating rules used by the skill folders.

Each installable skill includes one canonical agent metadata file:

  • agents/interface.yaml

Typical setup:

  1. Copy the needed folder from skills/ into your Codex skills directory.
  2. Make sure the skill can find this repository, either by running it from a workspace that contains agent-workflows/ or by setting AGENT_WORKFLOWS_ROOT.
  3. Invoke it with a task such as:
Use $workflow-automation to route and execute the right workflow for this task:

<task description>

Repository Structure

agent-workflows/
|- README.md
|- how-to-use-agent-workflows.md
|- project-initialization-agent-workflow.md
|- feature-development-agent-workflow.md
|- bug-fix-agent-workflow.md
|- code-review-agent-workflow.md
|- incident-debugging-agent-workflow.md
|- refactoring-agent-workflow.md
|- tech-debt-cleanup-agent-workflow.md
|- shared/
|  |- repository-preflight.md
|  |- safety-rules.md
|  |- workflow-conventions.md
|- skills/
   |- _shared/
   |- workflow-automation/
   |- project-initialization/
   |- workflow-maintainer/
   |- release-prep/
   |- security-review/
   |- test-strategy/
   |- migration-planning/
   |- performance-review/
   |- docs-maintenance/

When Not to Use This Library

  • One-line fixes with no ambiguity (typo, constant, import) — just make the change.
  • Greenfield project setup without meaningful decisions — if the project is a single script or throwaway prototype, scaffold it directly. For projects with real tech-stack, structure, or tooling decisions, use the project initialization workflow.
  • Infrastructure-as-code or CI/CD implementation changes — the feature, bug-fix, refactoring, and cleanup workflows are oriented around application code. Code review and incident workflows can still be used to inspect infrastructure-related changes.
  • Pure documentation changes (README updates, runbook creation) — the overhead of a full workflow is not justified.
  • Exploratory prototyping — if the goal is to experiment and throw away code, skip the process.

If you are unsure, the triage gates inside each workflow will tell you to use a lighter process when the task is small enough.

Contributing

Issues and pull requests are welcome.

When contributing:

  • Keep workflow-specific guidance in the relevant workflow file.
  • Move repeated boilerplate into shared/ instead of copying it across multiple files.
  • Keep the automation skill aligned with the workflow library when workflow names, paths, or shared conventions change.

License

MIT

About

Engineering workflows for AI coding agents or flesh engineers. It helps absorb silent base-model quality drift.

Topics

Resources

License

Stars

Watchers

Forks

Contributors

Languages