
diagnose-pw-failure

Name: Diagnose Pw Failure
Author: appsmithorg

Diagnose a Playwright test failure as a product bug by analyzing error output, screenshots, traces, and server logs. Produces a structured bug report with expected vs actual behavior, reproduction steps, and likely root cause. Use when a Playwright test keeps failing after spec fixes, "is this a product bug", "diagnose this test failure", or "the test is correct but the app is broken".


stars: 39,886

forks: 4,571

updated: 2026-05-12 11:49

SKILL.md


name: diagnose-pw-failure
description: Diagnose a Playwright test failure as a product bug by analyzing error output, screenshots, traces, and server logs. Produces a structured bug report with expected vs actual behavior, reproduction steps, and likely root cause. Use when a Playwright test keeps failing after spec fixes, "is this a product bug", "diagnose this test failure", or "the test is correct but the app is broken".

Diagnose Playwright Failure as Product Bug

When to Use

Use this skill when:

A spec has been fixed 2-3 times and the same assertion keeps failing with the same received value
The spec's selectors and waits are correct but the app renders wrong content
The server returns unexpected status codes (500, 403) on valid requests
The write-and-verify-pw-test or fix-pw-spec skill has exhausted its retry budget

Do not use if the error is clearly a test code issue (import error, wrong selector syntax, TypeScript compilation error). Use fix-pw-spec instead.

Step 1 — Collect Evidence

Gather all available evidence:

a) Playwright error output

Read the terminal output from the failed test run. Note:

The exact assertion that failed
Expected vs received values
Which test step failed (line number in the spec)
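
The three facts listed above can be pulled out of raw terminal output mechanically. The following is a minimal sketch, not part of the skill itself: `parseFailure` and `AssertionFailure` are hypothetical names, and the `Expected ...:` / `Received ...:` line shapes are an assumption about the reporter output that may not hold for every Playwright version or matcher.

```typescript
// Hypothetical helper: extracts assertion facts from raw terminal output.
// Assumption: the output contains lines such as `Expected string: "..."` and
// `Received string: "..."`, plus a stack frame ending in `.spec.ts:<line>:<col>`.
interface AssertionFailure {
  expected: string | null;
  received: string | null;
  specLine: number | null; // line number of the first spec-file stack frame
}

function parseFailure(output: string): AssertionFailure {
  // [ \t]* (not \s*) keeps each match on a single line.
  const expected = output.match(/^[ \t]*Expected[^:\n]*:[ \t]*(.*)$/m);
  const received = output.match(/^[ \t]*Received[^:\n]*:[ \t]*(.*)$/m);
  const frame = output.match(/\.spec\.ts:(\d+):\d+/);
  return {
    expected: expected?.[1]?.trim() ?? null,
    received: received?.[1]?.trim() ?? null,
    specLine: frame ? Number(frame[1]) : null,
  };
}
```

If any field comes back `null`, fall back to reading the output by hand; the sketch only recognizes the line shapes named in its comments.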

b) Screenshots

Check app/client/playwright/results/ for failure screenshots:

ls -la app/client/playwright/results/

If screenshots exist, read them (the Read tool supports images). They show exactly what the browser rendered at failure time.

c) Trace (if available)

Traces are at app/client/playwright/results/<test-path>/trace.zip. Note their location for the bug report — they can be viewed with npx playwright show-trace <path>.

d) Network responses

If the spec uses waitForResponse, check if the API response was captured in the error output. Common patterns:

API returned 200 but with wrong data → backend logic bug
API returned 500 → server crash
API returned 403 → permission/auth regression
API never responded (timeout) → endpoint broken or renamed
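
The four patterns above amount to a small decision function. Here is one way to sketch it; `NetworkObservation` and `triageNetwork` are hypothetical names, and treating every 5xx like the 500 case and 401 like the 403 case is an assumption that goes slightly beyond the list.

```typescript
// Hypothetical model of what the spec observed for one API call.
type NetworkObservation =
  | { kind: "response"; status: number; payloadLooksCorrect: boolean }
  | { kind: "timeout" };

// Maps an observation to the diagnosis hints listed above.
function triageNetwork(obs: NetworkObservation): string {
  if (obs.kind === "timeout") return "endpoint broken or renamed";
  // Assumption: any 5xx is handled like the 500 case.
  if (obs.status >= 500) return "server crash";
  // Assumption: 401 is grouped with 403.
  if (obs.status === 401 || obs.status === 403) return "permission/auth regression";
  if (obs.status === 200 && !obs.payloadLooksCorrect) return "backend logic bug";
  return "no network-level signal";
}
```

A result of "no network-level signal" does not clear the backend; it only means the response itself gave no hint.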

e) Server-side code (optional, for deeper diagnosis)

If the failure involves an API call, trace the server code:

Identify the API endpoint from the spec (e.g., API.actionsExecute → /api/v1/actions/execute)
Find the controller: grep app/server/ for @PostMapping("/api/v1/actions/execute") or similar
Read the service method to understand what could go wrong
Check if recent commits changed this code path

Step 2 — Classify the Bug

Category	Signals	Severity
UI rendering	Wrong text, missing element, broken layout (screenshot shows it)	Medium
Data regression	API returns correct status but wrong payload	High
Server error	500 response, stack trace in logs	Critical
Auth/permission	401/403 on previously working endpoint	High
Feature flag	Feature works with flag on but not off (or vice versa)	Medium
Deployment issue	ECONNREFUSED, DNS failure, unhealthy containers	Blocker
Race condition	Intermittent — passes sometimes, fails others	Medium (flaky)
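
When a failure matches more than one row, it can help to report the most severe category first. The sketch below encodes the table as data. The ordering Medium < High < Critical < Blocker is an assumption (the table does not rank the severities), "Medium (flaky)" is stored as plain "Medium", and `mostSevere` is a hypothetical helper.

```typescript
type Category =
  | "UI rendering"
  | "Data regression"
  | "Server error"
  | "Auth/permission"
  | "Feature flag"
  | "Deployment issue"
  | "Race condition";

type Severity = "Medium" | "High" | "Critical" | "Blocker";

// Severity column of the table above.
const SEVERITY: Record<Category, Severity> = {
  "UI rendering": "Medium",
  "Data regression": "High",
  "Server error": "Critical",
  "Auth/permission": "High",
  "Feature flag": "Medium",
  "Deployment issue": "Blocker",
  "Race condition": "Medium", // listed as "Medium (flaky)" in the table
};

// Assumed ranking; the table itself does not order the severities.
const RANK: Record<Severity, number> = { Medium: 0, High: 1, Critical: 2, Blocker: 3 };

// Returns the candidate with the highest severity, or null for an empty list.
// Ties keep the earliest candidate.
function mostSevere(candidates: Category[]): Category | null {
  let best: Category | null = null;
  for (const c of candidates) {
    if (best === null || RANK[SEVERITY[c]] > RANK[SEVERITY[best]]) best = c;
  }
  return best;
}
```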

Step 3 — Verify It's Not a Spec Bug

Before declaring "product bug", do a sanity check:

Manual verification: Does the spec's assertion make sense? Re-read the test name and expected behavior.
Check the deployment manually: Open the deployment at PLAYWRIGHT_BASE_URL and verify whether the page actually shows what the test expects.
Check if the feature exists on this deployment: The deployment might be on an older version that doesn't have the feature yet.
Check feature flags: If the feature is behind a flag, verify the flag is enabled on the deployment (check PW_FLAG_OVERRIDES or query /api/v1/users/features).
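
One way to keep the sanity check honest is to record each of the four checks explicitly and derive the verdict from them. This is only a sketch under stated assumptions: the field names, the `CheckResult` values, and the "SPEC OR ENVIRONMENT ISSUE" label are all hypothetical; only "PRODUCT BUG" and "INCONCLUSIVE" appear as report titles in this skill.

```typescript
// "pass" means the check supports the product-bug conclusion.
type CheckResult = "pass" | "fail" | "unknown";

interface SanityChecks {
  assertionMakesSense: CheckResult; // the assertion matches the intended behavior
  manualCheckShowsBug: CheckResult; // the deployment misbehaves when checked by hand
  featureDeployed: CheckResult; // the deployment version includes the feature
  flagEnabled: CheckResult; // the flag is on, or the feature is not flagged
}

type Verdict = "PRODUCT BUG" | "INCONCLUSIVE" | "SPEC OR ENVIRONMENT ISSUE";

function verdict(checks: SanityChecks): Verdict {
  const results: CheckResult[] = [
    checks.assertionMakesSense,
    checks.manualCheckShowsBug,
    checks.featureDeployed,
    checks.flagEnabled,
  ];
  // Any failed check means the spec or the environment explains the failure.
  if (results.some((r) => r === "fail")) return "SPEC OR ENVIRONMENT ISSUE";
  // Anything unverified blocks a product-bug claim.
  if (results.some((r) => r === "unknown")) return "INCONCLUSIVE";
  return "PRODUCT BUG";
}
```

The design choice here is that a failed check outranks an unknown one: a single contradicting fact is enough to stop, while missing facts only delay the conclusion.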

Step 4 — Produce Bug Report

Output a structured diagnosis:

## Playwright Failure Diagnosis: PRODUCT BUG

**Spec**: playwright/tests/sanity/widgets/table-filter.spec.ts
**Test**: "filters table by country"
**Deployment**: https://my-dp.appsmith.com
**Category**: Data regression

### Expected behavior
Filtering the table by Country "starts with Ba" should show rows including "Bangladesh".

### Actual behavior
Filter returns 0 rows. The table is empty after applying the filter.

### Evidence
- **Assertion**: `expect(table.cell(2, 0)).toContainText("Bangladesh")` — timed out, cell doesn't exist
- **Screenshot**: playwright/results/sanity-widgets-table-filter/test-failed-1.png
  - Shows table with "No data" message after filter is applied
- **API response**: POST /api/v1/actions/execute returned 200 with `{ data: [] }`
- **Spec verified correct**: Selector targets the right table, filter UI interaction works (filter chip appears)

### Likely root cause
The execute API returns empty results for the MySQL "starts with" filter. Possible causes:
- Query generation bug in the server's filter-to-SQL translation
- Datasource connection issue (DATASOURCE_HOST may be unreachable from this deployment)

### Reproduction steps
1. Open the app in deployed mode
2. Click "Add Filter" on the data_table widget
3. Set Column: Country, Condition: starts with, Value: Ba
4. Observe: table shows "No data" instead of filtered results

### Suggested investigation
- Check server logs for the execute query
- Verify DATASOURCE_HOST is reachable from the deployment
- Test the same filter on dev.appsmith.com to compare
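
If the diagnosis is collected as structured data, the report above can be rendered from it so that every report has the same sections in the same order. The sketch below is illustrative only: the `Diagnosis` interface and `renderReport` are hypothetical and simply mirror the headings of the example report.

```typescript
// Hypothetical shape of a finished diagnosis; fields mirror the report sections.
interface Diagnosis {
  spec: string;
  test: string;
  deployment: string;
  category: string;
  expected: string;
  actual: string;
  evidence: string[];
  rootCauseSummary: string;
  possibleCauses: string[];
  reproSteps: string[];
  investigation: string[];
}

const bullets = (items: string[]): string =>
  items.map((item) => `- ${item}`).join("\n");

const numbered = (items: string[]): string =>
  items.map((item, i) => `${i + 1}. ${item}`).join("\n");

// Renders the PRODUCT BUG report in the section order used above.
function renderReport(d: Diagnosis): string {
  return [
    "## Playwright Failure Diagnosis: PRODUCT BUG",
    "",
    `**Spec**: ${d.spec}`,
    `**Test**: "${d.test}"`,
    `**Deployment**: ${d.deployment}`,
    `**Category**: ${d.category}`,
    "",
    "### Expected behavior",
    d.expected,
    "",
    "### Actual behavior",
    d.actual,
    "",
    "### Evidence",
    bullets(d.evidence),
    "",
    "### Likely root cause",
    d.rootCauseSummary,
    bullets(d.possibleCauses),
    "",
    "### Reproduction steps",
    numbered(d.reproSteps),
    "",
    "### Suggested investigation",
    bullets(d.investigation),
  ].join("\n");
}
```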

When Diagnosis Is Inconclusive

If you can't determine whether it's a spec bug or product bug:

## Playwright Failure Diagnosis: INCONCLUSIVE

**Spec**: <path>
**Test**: <name>

### What we know
- [facts from error output]

### What we don't know
- [ambiguities]

### Recommended next steps
1. Run the test with the `--debug` flag for step-by-step execution
2. Check server logs on the deployment
3. Try reproducing manually in the browser

Related skills (same repository)

fix-pw-spec.md

from "appsmithorg/appsmith"

Fix a failing Playwright spec by analyzing error output, identifying the root cause in the test code, and applying corrections. Use when a Playwright test fails due to wrong selectors, bad waits, incorrect assertions, or code errors. Trigger phrases: "fix this playwright test", "this spec is failing", "playwright test broken", "fix the flaky test".

2026-05-12 · 39.9k

write-and-verify-pw-test.md

from "appsmithorg/appsmith"

Write a Playwright E2E test from a prompt and verify it passes against a live Appsmith deployment. Configures environment variables, writes the spec following project conventions, runs it, and auto-fixes up to 3 times. Use when asked to "write a playwright test", "test this feature on a dp", "create and run an e2e test", or "verify this flow with playwright".

2026-05-12 · 39.9k


