Every PR is a potential crime scene.
A bug is unexpected behavior: the point where the implementation diverges from the intent.
These agents find it, prove it, and file the report - before it ships.
10 specialized QA agents. One question each. One Markdown report each.
AC-1: users can add items to cart from product listing
AC-2: cart count in nav updates immediately
Writing qa-output/functional-review.md
AC-1: COVERED at src/components/quick-add-button.tsx:43
AC-2: AT RISK - no router.refresh() after server action
2 regression risks in unchanged code paths
Verdict: GAPS - needs browser validation on AC-2
10 agents, 3 tiers
Each agent answers one question, runs independently, and writes a Markdown file. Most users only need Tier 1.
- Tier 1: Daily drivers. Each answers one question, produces one report.
- Tier 2: Browser validation. Test the feature in a real browser, not just read the diff.
- Tier 3: Full pipeline and specialized workflows.
Output chaining
Every agent writes structured Markdown to qa-output/. The next agent reads from there. No copy-pasting between agents.
- `functional-review.md` ← Functional Reviewer
- `test-scenarios.md` ← Test Scenario Designer
- `test-selection.md` ← Smart Test Selector
- `bug-reports.md` ← Bug Reporter
- `environment-status.md` ← Environment Manager
- `browser-validation.md` ← Browser Validator
- `validation-report.md` ← Manual Validator
- `release-analysis.md` ← Release Analyzer
- `plan.md` ← Orchestrator
- `automation/` ← Automation Writer
  - `feature.spec.ts`
  - `pages/feature.page.ts`
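Because every report is plain Markdown with a predictable shape, a downstream agent (or your own script) can parse it directly. A minimal TypeScript sketch, assuming the `AC-n: STATUS` line format from the sample report above; `parseFunctionalReview` is a hypothetical helper, not part of QA Orchestra:

```typescript
// Hypothetical parser for qa-output/functional-review.md.
// The "AC-n: STATUS ..." line format is assumed from the sample report above.

interface AcFinding {
  id: string;                       // e.g. "AC-2"
  status: "COVERED" | "AT RISK";
  detail: string;                   // file:line when COVERED, reason when AT RISK
}

function parseFunctionalReview(markdown: string): AcFinding[] {
  const findings: AcFinding[] = [];
  for (const line of markdown.split("\n")) {
    const m = line.match(/^(AC-\d+):\s+(COVERED|AT RISK)\s*(?:at\s+|-\s+)?(.*)$/);
    if (m) {
      findings.push({ id: m[1], status: m[2] as AcFinding["status"], detail: m[3] });
    }
  }
  return findings;
}

// A bug reporter would file one report per AT RISK finding:
const report = [
  "AC-1: COVERED at src/components/quick-add-button.tsx:43",
  "AC-2: AT RISK - no router.refresh() after server action",
].join("\n");

const atRisk = parseFunctionalReview(report).filter(f => f.status === "AT RISK");
console.log(atRisk.map(f => `${f.id}: ${f.detail}`)); // AC-2 is the only AT RISK finding
```

This is what makes the chaining copy-paste-free: the contract between agents is just a file path and a line format.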
Up and running in 30 seconds
Three ways to install. The plugin method is recommended - all 10 agents load automatically.
Pick a recipe, run the command
Find the row that matches your question. Each row is a complete, standalone invocation.
| I want to… | Run |
|---|---|
| Review a PR for AC compliance | `@functional-reviewer Compare this diff against these ACs: …` |
| Generate test scenarios from a ticket | `@test-scenario-designer Generate scenarios for these ACs: …` |
| Find which tests a diff affects | `@smart-test-selector Which existing tests are affected by this diff?` |
| Turn findings into bug reports | `@bug-reporter Read qa-output/functional-review.md and create bug reports` |
| Get scenarios + runnable test code | `@test-scenario-designer` → then `@automation-writer` |
| Validate a feature live in a browser | `@environment-manager` → then `@browser-validator` |
| Analyze a multi-repo release | `@release-analyzer Analyze the diff between v1.0 and HEAD across all repos` |
| Run the full pipeline | `@orchestrator Run full pipeline for PR #42` |
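Conceptually, the full pipeline is the output chaining applied end to end: each stage consumes the previous report and writes its own. A hypothetical TypeScript sketch with stub stages; the runner and its types are illustrative, not QA Orchestra's API:

```typescript
// Illustrative sketch of "run the full pipeline": each stage reads the previous
// stage's report and writes its own file under qa-output/. The stage stubs below
// stand in for the real agents; the runner itself is hypothetical.

type Stage = {
  name: string;
  writes: string;                      // report file under qa-output/
  run: (upstream: string) => string;   // previous report in, new report out
};

const pipeline: Stage[] = [
  {
    name: "functional-reviewer",
    writes: "functional-review.md",
    run: () => "AC-2: AT RISK - no router.refresh() after server action",
  },
  {
    name: "bug-reporter",
    writes: "bug-reports.md",
    run: (upstream) =>
      upstream.includes("AT RISK")
        ? "## BUG-1: cart count does not refresh after add-to-cart"
        : "No bugs found",
  },
];

// Stand-in for the qa-output/ directory: file name -> report contents.
const qaOutput = new Map<string, string>();
let current = "";
for (const stage of pipeline) {
  current = stage.run(current);        // downstream stage consumes upstream output
  qaOutput.set(stage.writes, current);
}
console.log(qaOutput.get("bug-reports.md"));
// → "## BUG-1: cart count does not refresh after add-to-cart"
```

The orchestrator plays the role of the loop: it decides stage order and passes each report forward, while the agents themselves stay independent.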
How AI QA orchestration works
Common questions about QA agents, AI-powered testing, and how QA Orchestra fits into your workflow.
**What is QA orchestration?**
QA orchestration is the coordination of multiple QA activities into a structured pipeline. Instead of running a single test suite, QA Orchestra routes your code change through specialized agents - functional review, test scenario design, browser validation, bug reporting - each handling one phase of the QA lifecycle. The output of one agent feeds into the next through qa-output/ files.

**How do the AI QA agents work?**
Each QA agent is an AI-powered specialist running inside Claude Code. The functional reviewer reads your diff against acceptance criteria and identifies gaps. The test scenario designer generates comprehensive test cases (happy path, negative, boundary, edge). The browser validator navigates your running app via Chrome DevTools MCP and verifies behavior. AI handles the analysis and reasoning; you provide the context in context/CONTEXT.md.

**How is this different from regular AI code review?**
Regular AI code review gives you one opinion about your code. QA Orchestra gives you 10 specialized agents, each answering a different question. The functional reviewer checks AC compliance. The smart test selector maps your diff to existing tests. The browser validator tests the running app, not just the diff. Each agent writes a structured Markdown report you can paste into GitHub or Jira. Different questions, different agents, different outputs.

**Does it work with my stack?**
Yes. QA Orchestra is stack-agnostic. It works with React, Angular, Vue, Next.js, Rails, Django, Spring Boot, or any other web framework. All project-specific details (URLs, commands, repo paths) live in a single context/CONTEXT.md file. The agents read it before every run and adapt to your stack.

**Do I have to run all 10 agents?**
No. Each agent is standalone. Most users only use Tier 1: @functional-reviewer for PR review, @test-scenario-designer for test planning, or @smart-test-selector to find affected tests. Pick the agent that matches your question. The orchestrator exists for full-pipeline runs, but it's optional.

**How does browser validation work?**
The environment manager checks out your PR branch, installs dependencies, and starts the app locally. The browser validator then connects via Chrome DevTools MCP, navigates to pages, clicks elements, fills forms, and verifies expected behavior - just like a manual QA tester, but automated through AI. This catches issues that static diff analysis misses.

**What format do the reports use?**
Every agent writes structured Markdown to the qa-output/ directory. Functional reviews, test scenarios, bug reports, validation results - all in Markdown format you can paste directly into GitHub PRs, Jira tickets, or Linear issues. No proprietary formats, no dashboards to learn.

**What does it cost?**
QA Orchestra is open-source under the MIT license. The agents run inside Claude Code, which requires a Claude subscription. No additional API keys, SaaS accounts, or paid services needed.