e2e-regression-testing

Comprehensive Playwright E2E regression testing methodology. Activate this skill when Flash needs to create a full regression test suite by auditing an entire codebase from scratch — not when testing specific acceptance criteria from a plan. That is testing-methodology's domain. This skill applies when the instruction is "regression test this application", "write a full E2E suite", or "test coverage from scratch". It covers codebase discovery, live app exploration via Playwright MCP tools, Page Object Model architecture, multi-viewport testing (375px / 768px / 1280px), measurable coverage thresholds, and output artifact production.


Install command

npx skills add https://github.com/gulati8/justice-league-factory --skill e2e-regression-testing

Copy and paste this command into Claude Code to install the skill

Source

gulati8/justice-league-factory

Stars: 3

Forks: 0

Updated: April 28, 2026 at 16:45

E2E Regression Testing

This skill guides you through creating a comprehensive Playwright E2E regression test suite from scratch. You are not reading a plan; you are auditing the application itself and turning what you discover into tests. This skill supplements testing-methodology; it does not replace it. When a plan with acceptance criteria exists, use testing-methodology. When you need full regression coverage of an entire codebase, use this skill.

Execute all nine phases in order. Do not skip or merge phases.

Phase 1: Codebase Discovery

Map the entire application surface before writing any tests.

Route and Page Inventory

Find every user-reachable URL:

# React Router: route declarations in source
grep -rnE 'path=|createBrowserRouter|<Route' src/

# Next.js file-based routing: page files under pages/ and app/
find pages app -type f \( -name '*.tsx' -o -name '*.jsx' \) 2>/dev/null

# Express / Fastify / Hono: server-side route registrations
grep -rnE '(app|router)\.(get|post)' src/

Produce a manifest: one row per URL pattern with expected page content.

API Endpoint Inventory

Map every API endpoint the frontend calls:

grep -r "fetch(\|axios\.\|useQuery\|useMutation" src/
grep -r "app\.get\|app\.post\|app\.put\|app\.delete\|app\.patch" src/

For each endpoint, note: HTTP method, path, request shape, response shape.

Workflow Identification

Trace user journeys through the route inventory. Identify at minimum:

Onboarding — signup, email verification, initial setup
Authentication — login, logout, password reset, session management
Core value loop — create, read, update, delete the primary entity
Settings — profile edits, billing, preferences
Admin paths — if an admin role exists
Error and recovery flows — what happens when something goes wrong

Critical Path Prioritization

Priority	Definition	Examples
Critical	App unusable or revenue lost if broken	auth, payment, core value action
High	Major feature broken, affects most users	CRUD on primary entity, navigation
Medium	Secondary feature or edge case	Advanced filters, export, settings
Low	Edge case, rarely used path	Accessibility shortcut, legacy page

Test critical paths first. All paths must still be covered; priority governs authoring order, not inclusion.
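
The rule that priority governs authoring order, not inclusion, can be sketched as a sort over the workflow inventory. This is a minimal illustration: the `WorkflowEntry` shape, its field names, and the sample workflows are assumptions, not a format the skill prescribes.

```typescript
// Hypothetical shape for one workflow in the Phase 1 inventory.
type Priority = "critical" | "high" | "medium" | "low";

interface WorkflowEntry {
  name: string; // human-readable workflow description
  routes: string[]; // URL patterns the workflow touches
  priority: Priority;
}

const PRIORITY_ORDER: Record<Priority, number> = {
  critical: 0,
  high: 1,
  medium: 2,
  low: 3,
};

// Returns a new array sorted by priority. Nothing is filtered out:
// priority decides authoring order, not whether a workflow is tested.
function authoringOrder(workflows: WorkflowEntry[]): WorkflowEntry[] {
  return [...workflows].sort(
    (a, b) => PRIORITY_ORDER[a.priority] - PRIORITY_ORDER[b.priority],
  );
}

// Illustrative inventory; names and routes are made up.
const exampleWorkflows: WorkflowEntry[] = [
  { name: "Export report as CSV", routes: ["/reports/:id"], priority: "medium" },
  { name: "Login with valid credentials", routes: ["/login"], priority: "critical" },
  { name: "Legacy help page renders", routes: ["/help-old"], priority: "low" },
  { name: "Edit primary entity", routes: ["/items/:id/edit"], priority: "high" },
];

const ordered = authoringOrder(exampleWorkflows);
```

The sorted list keeps all four workflows; only their position changes.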

Phase 2: Live App Exploration with Playwright MCP

Use the Playwright MCP server tools to explore the running application. This phase produces the selector knowledge, state dependency map, and visual baselines needed to write accurate tests.

Confirm the app is running before starting: browser_navigate to the base URL and verify a valid page load.

Exploration Protocol

For every discovered route:

browser_navigate to the route
browser_snapshot — read the accessibility tree; record roles, labels, and text content you will use as selectors
browser_network_requests — record which API calls fire on load
browser_console_messages — log any errors (note them, do not block authoring)
browser_click buttons; browser_type into forms; observe state changes
browser_resize to 375px, 768px, 1280px — note layout shifts and hidden elements
browser_take_screenshot at each breakpoint for visual baseline capture

For forms: additionally submit empty, submit with invalid data (one rule at a time), then submit with valid data. Observe every validation message location and text.

For error states: use browser_route to mock 401, 403, 404, and 500 responses. Observe the resulting UI at each status code.

Phase 3: Test Architecture Setup

Every test file imports page objects. No test file uses raw Playwright page locators directly.

Directory Structure

tests/
  e2e/
    auth/           # login.spec.ts, logout.spec.ts, signup.spec.ts, etc.
    navigation/     # routing.spec.ts, deep-linking.spec.ts
    [feature-area]/ # one directory per major feature
    user-journeys/  # cross-feature end-to-end flows
    pages/          # Page Object Model classes
      BasePage.ts
      LoginPage.ts
      [Feature]Page.ts
    fixtures/       # shared auth state, test data factories
    utils/          # API helpers, data seeding
playwright.config.ts

Base Page Class

Every page object extends BasePage with goto(), action helpers, and assertion helpers. See references/test-patterns.md for the full BasePage implementation. Use the Read tool to load it.

Naming: *.spec.ts for test files, *Page.ts (PascalCase) for page objects.
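
As a rough illustration of the page object layering (not the implementation in references/test-patterns.md), a base class and one page object might look like the sketch below. To stay runnable without a browser, it types the page through small stand-in interfaces, `PageLike` and `LocatorLike`, which are assumptions; in a real suite these would be Playwright's own `Page` and `Locator`. The `/login` path and the field labels are also hypothetical.

```typescript
// Minimal structural stand-ins for the parts of Playwright's Page and
// Locator used below. A real suite would import the Playwright types.
interface LocatorLike {
  fill(value: string): Promise<void>;
  click(): Promise<void>;
}

interface PageLike {
  goto(url: string): Promise<unknown>;
  getByLabel(text: string): LocatorLike;
  getByRole(role: string, options?: { name?: string }): LocatorLike;
}

// Hypothetical base class: owns the page handle and the route.
class BasePage {
  protected readonly page: PageLike;
  protected readonly path: string;

  constructor(page: PageLike, path: string) {
    this.page = page;
    this.path = path;
  }

  // Navigates to this page's route, relative to the configured base URL.
  async goto(): Promise<void> {
    await this.page.goto(this.path);
  }
}

// Hypothetical page object: selectors live here, never in spec files.
class LoginPage extends BasePage {
  constructor(page: PageLike) {
    super(page, "/login");
  }

  async login(email: string, password: string): Promise<void> {
    await this.page.getByLabel("Email").fill(email);
    await this.page.getByLabel("Password").fill(password);
    await this.page.getByRole("button", { name: "Log in" }).click();
  }
}
```

A spec file would then call `new LoginPage(page).login(...)` and never touch a locator itself, which is what keeps selector changes confined to one file.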

What Gets Committed

The test suite is the primary deliverable of this skill — not the run report. These files are permanent project assets designed for repeated execution in CI:

Committed (project source)	Ephemeral (gitignored)
tests/e2e/**/*.spec.ts (test files)	tmp/playwright/ (reports, traces, screenshots, videos)
tests/e2e/pages/** (page objects)	.factory-run/test-results.json (factory run report)
tests/e2e/fixtures/** (test data and setup)	
tests/e2e/utils/** (helpers)	
tests/e2e/__snapshots__/** (visual baselines)	
playwright.config.ts (runner configuration)	

Install Playwright as a dev dependency and add test scripts to package.json:

npm install -D @playwright/test @axe-core/playwright
npx playwright install --with-deps

Add to package.json scripts:

{
  "scripts": {
    "test:e2e": "playwright test",
    "test:e2e:ui": "playwright test --ui",
    "test:e2e:desktop": "playwright test --project=desktop-chrome",
    "test:e2e:mobile": "playwright test --project=mobile-chrome --project=mobile-safari"
  }
}

These scripts enable CI pipelines and developers to run the suite independently of the factory.

Phase 4: Playwright Configuration

Create playwright.config.ts at the project root. See references/test-patterns.md for the full config template. Use the Read tool to load it.

Mandatory config requirements — do not remove any of these:

retries: 2 in CI, 0 locally
Both html and json reporters
screenshot: 'only-on-failure'
trace: 'on-first-retry'
Four projects: desktop-chrome (1280px), mobile-chrome, mobile-safari, tablet
snapshotPathTemplate consolidating snapshots under tests/e2e/__snapshots__/
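
A configuration that satisfies the requirements above might look roughly like the following. This is a sketch, not the template from references/test-patterns.md. The base URL, the `BASE_URL` variable, the device presets, the viewport heights, the subdirectory names under tmp/playwright/, and the snapshot path layout are all assumptions.

```typescript
import { defineConfig, devices } from '@playwright/test';

export default defineConfig({
  testDir: './tests/e2e',
  fullyParallel: true,
  // Retry only in CI so local failures surface immediately.
  retries: process.env.CI ? 2 : 0,
  // Transient per-test output (traces, failure screenshots) stays under tmp/.
  outputDir: 'tmp/playwright/test-results',
  reporter: [
    ['html', { outputFolder: 'tmp/playwright/html-report', open: 'never' }],
    ['json', { outputFile: 'tmp/playwright/results.json' }],
  ],
  // Committed visual baselines, grouped per project (layout is an assumption).
  snapshotPathTemplate:
    'tests/e2e/__snapshots__/{projectName}/{testFilePath}/{arg}{ext}',
  use: {
    baseURL: process.env.BASE_URL ?? 'http://localhost:3000', // assumed dev URL
    screenshot: 'only-on-failure',
    trace: 'on-first-retry',
  },
  projects: [
    {
      name: 'desktop-chrome',
      use: { ...devices['Desktop Chrome'], viewport: { width: 1280, height: 720 } },
    },
    {
      name: 'mobile-chrome',
      use: { ...devices['Pixel 5'], viewport: { width: 375, height: 667 } },
    },
    {
      name: 'mobile-safari',
      use: { ...devices['iPhone 12'], viewport: { width: 375, height: 667 } },
    },
    {
      name: 'tablet',
      use: { ...devices['iPad Mini'], viewport: { width: 768, height: 1024 } },
    },
  ],
});
```

The viewport overrides pin each project to the widths this skill names, since the built-in device presets do not all default to 375px or 768px.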

Artifact Hygiene

All non-persisted Playwright output — reports, traces, screenshots, videos, and test runner artifacts — goes into tmp/playwright/ at the project root. This directory must be gitignored. Never write transient test output to the project root or alongside source files.

Before running the suite, verify:

# Ensure tmp/ (which also covers tmp/playwright/) is gitignored
grep -qxE '/?tmp/?' .gitignore 2>/dev/null || echo "tmp/" >> .gitignore

If .gitignore does not already contain tmp/, append it. The tmp/ entry covers tmp/playwright/ and any other ephemeral working directories.

Phase 5: Multi-Viewport Testing

Every critical flow must run at mobile AND desktop. The four Playwright projects enforce this automatically for all *.spec.ts files.

Breakpoints

Breakpoint	Width	Project
Mobile	375px	mobile-chrome, mobile-safari
Tablet	768px	tablet
Desktop	1280px	desktop-chrome

Responsive-Specific Tests

Write dedicated tests for elements that transform across breakpoints. Loop over a viewports array and assert different UI states at each size. See references/test-patterns.md for the pattern.

Test explicitly: hamburger menus (hidden by default, opens on click, closes on ESC and backdrop), stacked layouts (no overflow), touch targets (minimum 44x44px on mobile — verify with getBoundingClientRect via browser_evaluate).
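
The viewport loop and the touch-target check can be sketched in plain TypeScript. The widths come from the breakpoint table; the heights, the `undersizedTargets` helper, and the sample measurements are assumptions for illustration. In a real test the measurements would come from getBoundingClientRect on the live page.

```typescript
// Breakpoint widths from the table above; the heights are assumptions.
const VIEWPORTS = [
  { name: "mobile", width: 375, height: 667 },
  { name: "tablet", width: 768, height: 1024 },
  { name: "desktop", width: 1280, height: 720 },
] as const;

const MIN_TOUCH_TARGET_PX = 44;

// Subset of the DOMRect fields needed for the size check.
interface RectSize {
  width: number;
  height: number;
}

// Returns the labels of elements whose measured box is smaller than
// 44px in either dimension.
function undersizedTargets(measured: Record<string, RectSize>): string[] {
  return Object.entries(measured)
    .filter(
      ([, rect]) =>
        rect.width < MIN_TOUCH_TARGET_PX || rect.height < MIN_TOUCH_TARGET_PX,
    )
    .map(([label]) => label);
}

// Example measurements, as they might come back from the page at 375px.
const measuredAtMobile: Record<string, RectSize> = {
  "Open menu": { width: 48, height: 48 },
  "Close banner": { width: 24, height: 24 },
  "Submit": { width: 120, height: 44 },
};

const tooSmall = undersizedTargets(measuredAtMobile);
```

A responsive test would iterate over `VIEWPORTS`, resize, collect the rectangles, and assert that `tooSmall` is empty at the mobile size.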

Phase 6: Test Authoring Standards

Every test maps to a user workflow, not an implementation detail. Group with test.describe by feature area. Tag critical tests @critical.

Auth Flows

Write one test for each:

Login with valid credentials — verify redirect to authenticated home
Login with wrong password — verify error message, no redirect
Login with unregistered email — verify error message
Logout — verify redirect to login, session cleared
Signup with valid data — verify account created, next step shown
Signup with duplicate email — verify error message
Password reset request — verify confirmation shown
Password reset with invalid/expired token — verify error state
Protected route redirect — navigate while logged out, verify redirect with return URL preserved
Session expiry — mock expired token, verify re-authentication prompt

Forms

For every form:

Valid submission — verify success state and side effects
Empty submission — verify required field errors appear for each required field
Each validation rule individually — one test per rule
Partial data — verify only unfilled required fields show errors
Special characters — apostrophes, ampersands, emoji in text fields

Navigation

Every route reachable via UI (follow the critical path first)
Deep linking — navigate directly to each route by URL, verify correct content
Back and forward browser history — verify state preserved or reset correctly
404 handling — navigate to non-existent URL, verify error page or redirect

Data Display

For every list or data display page: loading state (intercept API with delay), empty state (mock empty response), error state (mock 500), populated state with realistic data, pagination (if present — next/previous work, URL reflects page).

Interactive Elements

Modals — opens on trigger; closes on button click, ESC key, and backdrop click; does not close on modal body click
Dropdowns — opens on trigger; closes on selection and outside click; selected value reflected in label
Tabs — active tab shows content; inactive tabs are hidden; active state is visually indicated
Accordions — toggle behavior correct (single or multi-open per design)
Tooltips — appear on hover and focus with correct text

Cross-Feature User Journeys

Write at minimum one test per major workflow spanning feature areas (e.g., signup → onboarding → core action → verify). See references/test-patterns.md for the pattern.

Phase 7: Coverage Expectations

Cross-reference written tests against the Phase 1 discovery manifest before declaring the suite complete.

Metric	Threshold
Critical user journeys with at least one test	100%
All discovered workflows with at least one test	>= 80%
Routes visited by at least one test	100%
Features with a happy path test	100%
Features with a primary error path test	100%
Form validation rules explicitly tested	100%
API error states tested (401, 403, 404, 500)	All four
Pages with accessibility scan (axe-core)	100%
Key pages with visual regression baseline	100%, at each breakpoint

Accessibility

Install @axe-core/playwright. Add an axe scan to every page-level test. See references/test-patterns.md for the pattern.

Visual Regression

Use toHaveScreenshot() at each breakpoint. Snapshot baselines ARE committed to version control — they are reference images that document expected UI appearance. Only the ephemeral tmp/playwright/ artifacts are excluded via .gitignore. Run with --update-snapshots on first pass to establish baselines. See references/test-patterns.md for the pattern and config fields.

Coverage Audit Command

grep -rhoE "goto\(['\"][^'\"]+" tests/e2e/ | sed -E "s/goto\(['\"]//" | sort -u

Compare extracted URLs against the Phase 1 manifest. Every uncovered route is a gap to document.
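
The comparison against the manifest can be sketched as a small helper. It assumes the manifest stores URL patterns with `:param` placeholders, which the skill does not specify. The function names and sample routes are hypothetical.

```typescript
// Converts a manifest pattern such as "/items/:id" into a regular
// expression that matches concrete URLs such as "/items/42".
function patternToRegExp(pattern: string): RegExp {
  const source = pattern
    .split("/")
    .map((segment) =>
      segment.startsWith(":")
        ? "[^/]+" // a parameter matches any single path segment
        : segment.replace(/[.*+?^${}()|[\]\\]/g, "\\$&"), // escape literals
    )
    .join("/");
  return new RegExp("^" + source + "/?$");
}

// Returns every manifest pattern that no visited URL matches.
function findUncoveredRoutes(manifest: string[], visited: string[]): string[] {
  return manifest.filter((pattern) => {
    const re = patternToRegExp(pattern);
    // Ignore query strings and fragments when matching.
    return !visited.some((url) => re.test(url.replace(/[?#].*$/, "")));
  });
}

// Illustrative data: manifest from Phase 1, URLs extracted from the suite.
const manifestRoutes = ["/", "/login", "/items", "/items/:id", "/settings/billing"];
const visitedUrls = ["/", "/login", "/items", "/items/42?tab=history"];

const gaps = findUncoveredRoutes(manifestRoutes, visitedUrls);
```

Each entry left in `gaps` is a route to document as uncovered.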

Phase 8: Test Quality Rules

These are hard constraints. Fix violations before declaring the suite complete.

No Arbitrary Waits

page.waitForTimeout() is banned. Use web-first assertions instead: expect(locator).toBeVisible(), expect(locator).toHaveText(), expect(page).toHaveURL(). Playwright retries these automatically. For animations, wait for the animated element's final state. See references/test-patterns.md for concrete replacement patterns.

No Hardcoded Selectors

Use getByRole(), getByLabel(), getByTestId() — never CSS class or structural selectors. If an element lacks an accessible label or test ID, add a data-testid attribute to the implementation before writing the test.

Test Independence

Each test creates all required state itself and leaves no side effects. Use beforeEach with direct API calls for setup — never click through the UI to create prerequisite state.

Realistic Test Data

Use plausible names, emails, and values (sarah.chen@example.com, not test@test.com). Realistic data surfaces real rendering issues.

For code examples of all four rules, see references/test-patterns.md.
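
As one illustration of the realistic-data rule, a test data factory under tests/e2e/fixtures/ might look like the sketch below. It is not taken from the repository. The `TestUser` shape, the default values, and the counter-based email suffix are assumptions.

```typescript
// Hypothetical user shape; adjust to the application's signup form.
interface TestUser {
  name: string;
  email: string;
  password: string;
}

let userCounter = 0;

// Builds a plausible user. The counter suffix keeps emails unique within
// one process; a suite running several workers would likely also need a
// worker index or random suffix.
function buildUser(overrides: Partial<TestUser> = {}): TestUser {
  userCounter += 1;
  return {
    name: "Sarah Chen",
    email: "sarah.chen+" + userCounter + "@example.com",
    password: "Correct-Horse-42!",
    ...overrides,
  };
}

const firstUser = buildUser();
// An apostrophe and a non-ASCII letter exercise rendering and escaping.
const secondUser = buildUser({ name: "Miguel O'Brien-Añez" });
```

Passing overrides lets a test state only the field it cares about while every other value stays plausible.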

Flaky Test Policy

Zero tolerance. Options in order:

1. Fix the root cause (race condition, missing await, non-deterministic selector)
2. Add a more specific web-first assertion to wait for correct state
3. Quarantine with test.skip and a linked issue; never leave a flaky test silently masked by retries: 2

Test Run Time

Full suite must complete in under 15 minutes with fullyParallel: true. If it exceeds this: increase workers, move slow setup to globalSetup, replace UI-based setup with API-driven setup.

Phase 9: Output Artifact

The test suite you wrote in Phases 3-8 is the primary deliverable — it lives in the project permanently and runs in CI. This phase produces a secondary artifact: a run report for the factory.

Run the full suite (npx playwright test), then write .factory-run/test-results.json conforming to .claude/schemas/test-results.schema.json.

The HTML report and traces in tmp/playwright/ are useful for local debugging but are ephemeral — do not treat them as factory artifacts. Only .factory-run/test-results.json is consumed by Batman, Wonder Woman, and Oracle.

covers Field Convention

In regression mode, covers maps to a discovered workflow or page — not a plan acceptance criterion. Use the format "[Area]: [workflow description]":

{
  "file": "tests/e2e/auth/login.spec.ts",
  "test_name": "user can log in with valid credentials",
  "covers": "Auth flow: login with valid email and password"
}

summary Field

State: total test count, spec file count, project count and names, coverage percentage against Phase 7 thresholds, number of gaps.

Example: "52 tests across 14 spec files. 4 projects. 100% of critical journeys covered. 85% workflow coverage (11/13). All 7 routes visited. 2 gaps documented."
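
A summary in this style could be assembled from the counts with a helper along these lines. This is only a sketch. The `SuiteStats` shape and its field names are assumptions, and the real report must still conform to the schema file.

```typescript
// Hypothetical input shape holding the counts the summary reports.
interface SuiteStats {
  tests: number;
  specFiles: number;
  projects: number;
  criticalCovered: number;
  criticalTotal: number;
  workflowsCovered: number;
  workflowsTotal: number;
  routesVisited: number;
  routesTotal: number;
  gaps: number;
}

// Whole-number percentage; an empty category counts as fully covered.
function percent(part: number, whole: number): number {
  return whole === 0 ? 100 : Math.round((part / whole) * 100);
}

function buildSummary(s: SuiteStats): string {
  const routes =
    s.routesVisited === s.routesTotal
      ? "All " + s.routesTotal + " routes visited."
      : s.routesVisited + "/" + s.routesTotal + " routes visited.";
  return [
    s.tests + " tests across " + s.specFiles + " spec files.",
    s.projects + " projects.",
    percent(s.criticalCovered, s.criticalTotal) + "% of critical journeys covered.",
    percent(s.workflowsCovered, s.workflowsTotal) +
      "% workflow coverage (" + s.workflowsCovered + "/" + s.workflowsTotal + ").",
    routes,
    s.gaps + " gaps documented.",
  ].join(" ");
}

const summary = buildSummary({
  tests: 52,
  specFiles: 14,
  projects: 4,
  criticalCovered: 5,
  criticalTotal: 5,
  workflowsCovered: 11,
  workflowsTotal: 13,
  routesVisited: 7,
  routesTotal: 7,
  gaps: 2,
});
```

Deriving the percentages from raw counts keeps the summary consistent with the coverage_gaps list rather than relying on hand-typed numbers.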

coverage_gaps Field

List every discovered workflow or route without test coverage, with a brief explanation. Do not leave this array empty if gaps exist. Batman and Wonder Woman read this field to assess risk.

{
  "coverage_gaps": [
    "OAuth login: requires third-party provider not available in test environment",
    "Payment checkout: Stripe test mode not configured in test environment"
  ]
}

Include raw output from the Playwright test runner in test_run.output — evidence, not just numbers.

name	e2e-regression-testing
user-invocable	false
disable-model-invocation	true
last_reviewed	"2026-04-28T00:00:00.000Z"
