GitHub - workersio/skills: The only skill you will need to write high quality tests for your software

Agent workflows for high-quality software testing, test strategy, and test-suite reliability

Write better tests, not more tests

Most AI-written tests optimize for coverage. They assert implementation details, mock away the real risk, and pass even when the product breaks.

WIO is one testing workflow skill with five commands: $wio scan, $wio test, $wio workload, $wio review, and $wio doctor.

Install

Direct skill install:

npx skills add workersio/skills

Claude Code marketplace install:

claude plugin marketplace add workersio/skills
claude plugin install wio@workersio-skills

Claude plugin installs include the WIO skill, plugin hooks, and Markdown subagents from plugins/wio/agents/.

Codex marketplace add:

codex plugin marketplace add workersio/skills

For local Codex plugin testing, add this checkout as the marketplace root:

codex plugin marketplace add .

Codex plugin installs include the WIO skill and plugin hook config. Codex custom agents are a separate native surface: they are loaded from .codex/agents/ in a project or ~/.codex/agents/ for the user.

To enable WIO Codex agents globally:

mkdir -p ~/.codex/agents
cp .codex/agents/wio-*.toml ~/.codex/agents/

To enable WIO Codex agents for the current project:

mkdir -p .codex/agents
cp /path/to/wio-skills/.codex/agents/wio-*.toml .codex/agents/

To enable the WIO Codex hook config for the current project:

cp /path/to/wio-skills/.codex/hooks.json .codex/hooks.json

Verify Codex agent files:

find .codex/agents ~/.codex/agents -name 'wio-*.toml' 2>/dev/null

Commands

Command	What it does
`$wio scan [target]`	Maps product behavior, existing tests, CI, and risk areas to find the highest-value tests to add next.
`$wio test [target]`	Runs the full loop: discover a bug-prone candidate, pick strategy, write test, validate, review, then keep only if valuable.
`$wio workload [target]`	Generates or implements realistic, replayable user-session/API/CLI/job/load workloads with controlled variance and assertions.
`$wio review [target]`	Reviews a test for customer value, developer-flow value, signal quality, maintainability, and false confidence.
`$wio doctor [target]`	Audits test-suite health: weak assertions, flakes, excessive mocks, broad snapshots, slow feedback, skipped tests, and missing critical behavior coverage.

Subagents

WIO includes three focused subagents:

Subagent	Role
`wio-candidate-scout`	Read-only discovery of high-value test candidates before implementation.
`wio-strategy-critic`	Read-only challenge of the selected strategy before editing tests.
`wio-test-reviewer`	Read-only post-write review that returns `KEEP`, `REDO`, or `REMOVE`.

The main agent still writes the test. Subagents gather evidence, challenge the strategy, and review value. They do not duplicate reference content and they do not own the workflow.

Claude plugin subagents live in plugins/wio/agents/, which is the official Claude plugin location. Project-local Claude subagents can also be copied to .claude/agents/ when a repo is not using the plugin.

Codex project subagents live in .codex/agents/*.toml, which is the official Codex custom-agent location.

Hooks

WIO hooks only remind the active agent to validate test changes and apply the WIO value gate. The executable hook logic lives in plugins/wio/skills/wio/scripts/test-review-reminder.py.

Hook config exists in the official locations:

Claude plugin hook: plugins/wio/hooks/hooks.json
Claude project hook: .claude/settings.json
Codex plugin hook: plugins/wio/hooks/hooks.json
Codex project hook: .codex/hooks.json

References

Detailed testing guidance lives only in plugins/wio/skills/wio/references/.

Reference topics include:

Area	Covers
Behavior mapping	Turning product behavior, workflows, APIs, and incidents into test candidates.
Risk-based testing	Prioritizing tests by customer impact, likelihood, confidence gap, and cost.
Test level selection	Choosing unit, component, integration, contract, E2E, monitoring, or specialized checks.
Workload modeling	Building realistic session, traffic, stateful, synthetic, or operator workloads with bounded variance and replay.
Oracles and assertions	Designing assertions that fail for real regressions and explain what broke.
Test data and fixtures	Setup, isolation, factories, seeds, cleanup, and state management.
Mocking and doubles	Preserving fidelity while keeping tests deterministic and fast.
Suite health	Finding flakes, weak signal, slow feedback, skipped tests, and CI blind spots.
Advanced strategies	Static analysis, security testing, fuzzing, property-based testing, mutation testing, performance testing, resilience testing, and regression selection.

Usage

$wio scan checkout
$wio test billing eligibility regression
$wio workload onboarding session
$wio review tests/billing_eligibility_test.py
$wio doctor API test suite

Use scan when you do not yet know what to test. Use test when you want the whole candidate-strategy-write-review loop. Use workload when interaction across a realistic session is the risk. Use review when a test already exists or has just been written. Use doctor when an existing suite is hard to trust.

What Good Means

A generated or recommended test should answer:

What user, operator, customer, or API consumer failure does this prevent?
What production, release, support, debugging, or review risk does it reduce?
Would it fail for the regression that matters?
Is the assertion specific enough to diagnose the broken behavior?
Does the setup preserve the important dependency, state, permission, timing, or data risk?
Does this belong in local development, PR CI, nightly, release, or production monitoring?
If this is a workload, is variance bounded, seeded, replayable, and checked by meaningful invariants?

If those answers are weak, the test should be redesigned or removed.

Contributing

Keep the public surface area small: one skill, wio, with command modes scan, test, workload, review, and doctor.

Detailed testing guidance belongs in plugins/wio/skills/wio/references/, not duplicated inside workflow files, cloud folders, subagents, hooks, or extra skill trees. When adding a reference topic, add both overview.md and tools.md, then link it from plugins/wio/skills/wio/references/index.md.

Host-specific files must stay minimal and point back to WIO:

Claude Code plugins: shared install packages live in plugins/wio/ with .claude-plugin/plugin.json, root-level skills/, agents/, and hooks/.
Claude Code marketplace: .claude-plugin/marketplace.json.
Codex plugins: shared install packages live in plugins/wio/ with .codex-plugin/plugin.json, root-level skills/, and hooks/.
Codex marketplace: .agents/plugins/marketplace.json.
Codex subagents: project custom agents live in .codex/agents/*.toml per the official Codex subagents docs.
Codex hooks: project hooks can live in .codex/hooks.json per the official Codex hooks docs.

The quality bar is simple: do not accept tests for coverage alone. A test should reduce real user risk, production risk, support load, debugging time, review time, or release risk.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.agents/plugins		.agents/plugins
.claude-plugin		.claude-plugin
.claude		.claude
.codex		.codex
plugins/wio		plugins/wio
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
README.md		README.md
header.jpg		header.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Write better tests, not more tests

Install

Commands

Subagents

Hooks

References

Usage

What Good Means

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Write better tests, not more tests

Install

Commands

Subagents

Hooks

References

Usage

What Good Means

Contributing

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages