dyad:deflake-e2e

Stars: 20,719

Forks: 2,479

Last updated: 12 February 2026 at 21:05

Identify and fix flaky E2E tests by running them repeatedly and investigating failures.

Source: dyad-sh/dyad

SKILL.md

name	dyad:deflake-e2e
description	Identify and fix flaky E2E tests by running them repeatedly and investigating failures.

Deflake E2E Tests

Identify and fix flaky E2E tests by running them repeatedly and investigating failures.

Arguments

$ARGUMENTS: (Optional) Specific E2E test file(s) to deflake (e.g., main.spec.ts or e2e-tests/main.spec.ts). If omitted, the skill asks whether to deflake the entire test suite.

Instructions

Check if specific tests are provided:

If $ARGUMENTS is empty or not provided, ask the user:

"No specific tests provided. Do you want to deflake the entire E2E test suite? This can take a very long time as each test will be run 10 times."

Wait for user confirmation before proceeding. If they decline, ask them to provide specific test files.
Install dependencies:
```
npm install
```
Build the app binary:
```
npm run build
```
IMPORTANT: This step is required before running E2E tests. E2E tests run against the built binary. If you make any changes to application code (anything outside of e2e-tests/), you MUST re-run npm run build before running E2E tests again, otherwise you'll be testing the old version.
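
The rebuild rule above can be checked mechanically. The following is a minimal sketch, not part of the skill: needs_rebuild is a hypothetical helper that reads changed paths on stdin and succeeds when any of them lies outside e2e-tests/. Feeding it from git diff is also an assumption about how changes are tracked.

```shell
# Hypothetical helper (not part of the skill): reads changed file paths,
# one per line, on stdin. Exits 0 if at least one path is outside
# e2e-tests/, meaning the built binary is stale; exits 1 otherwise.
needs_rebuild() {
  while IFS= read -r changed; do
    case "$changed" in
      ""|e2e-tests/*) ;;   # blank lines and test-only changes: no rebuild
      *) return 0 ;;       # application code changed: rebuild needed
    esac
  done
  return 1
}

# Possible usage (assumes a git checkout; untracked files are not listed
# by git diff and would need separate handling):
#   if git diff --name-only HEAD | needs_rebuild; then npm run build; fi
```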
Run tests repeatedly to detect flakiness:

For each test file, run it 10 times:
```
PLAYWRIGHT_RETRIES=0 PLAYWRIGHT_HTML_OPEN=never npm run e2e -- e2e-tests/<testfile>.spec.ts --repeat-each=10
```
IMPORTANT: PLAYWRIGHT_RETRIES=0 is required to disable automatic retries. Without it, CI environments (where CI=true) default to 2 retries, causing flaky tests to pass on retry and be incorrectly skipped as "not flaky."

Notes:
- If $ARGUMENTS is provided without the e2e-tests/ prefix, add it
- If $ARGUMENTS is provided without the .spec.ts suffix, add it
- A test is considered flaky if it fails at least once out of 10 runs
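
The two path rules in the notes above can be sketched as a small shell function. normalize_spec is a hypothetical name introduced here for illustration; the skill itself does not define it.

```shell
# Hypothetical helper: turn a user-supplied test name into the
# e2e-tests/<name>.spec.ts form expected by the commands in this skill.
normalize_spec() {
  spec="$1"
  # Add the .spec.ts suffix if it is missing.
  case "$spec" in
    *.spec.ts) ;;
    *) spec="${spec}.spec.ts" ;;
  esac
  # Add the e2e-tests/ prefix if it is missing.
  case "$spec" in
    e2e-tests/*) ;;
    *) spec="e2e-tests/${spec}" ;;
  esac
  printf '%s\n' "$spec"
}

# Possible usage:
#   normalize_spec main                     # prints e2e-tests/main.spec.ts
#   normalize_spec e2e-tests/main.spec.ts   # prints e2e-tests/main.spec.ts
```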
For each flaky test, investigate with debug logs:

Run the failing test with Playwright browser debugging enabled:
```
DEBUG=pw:browser PLAYWRIGHT_RETRIES=0 PLAYWRIGHT_HTML_OPEN=never npm run e2e -- e2e-tests/<testfile>.spec.ts
```
Analyze the debug output to understand:
- Timing issues (race conditions, elements not ready)
- Animation/transition interference
- Network timing variability
- State leaking between tests
- Snapshot comparison differences
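
Debug output is easier to analyze from a file than from a scrolling terminal. The sketch below assumes nothing beyond a POSIX shell. run_with_log is a hypothetical wrapper, not part of the skill, that captures stdout and stderr (where DEBUG logging is written) while preserving the exit status of the wrapped command.

```shell
# Hypothetical wrapper: run a command, write its stdout and stderr to the
# given log file, and return the command's own exit status.
run_with_log() {
  log_file="$1"
  shift
  "$@" >"$log_file" 2>&1
}

# Possible usage (the log file name is an arbitrary choice):
#   run_with_log deflake-debug.log \
#     env DEBUG=pw:browser PLAYWRIGHT_RETRIES=0 PLAYWRIGHT_HTML_OPEN=never \
#     npm run e2e -- "e2e-tests/<testfile>.spec.ts"
#   grep -n -i 'timeout' deflake-debug.log
```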
Fix the flaky test:

Common fixes following Playwright best practices:
- Use await expect(locator).toBeVisible() before interacting with elements
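- Use await page.waitForLoadState('networkidle') for network-dependent tests only as a last resort; the Playwright documentation discourages networkidle in favor of web-first assertions on the UI state the test is waiting for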
- Use stable selectors (data-testid, role, text) instead of fragile CSS selectors
- Add explicit waits for animations: await page.waitForTimeout(300) (use sparingly)
- Use await expect(locator).toHaveScreenshot() options like maxDiffPixelRatio for visual tests
- Ensure proper test isolation (clean state before/after tests)
IMPORTANT: Do NOT change any application code. Assume the application code is correct. Only modify test files and snapshot baselines.

Update snapshot baselines if needed:

If the flakiness is due to legitimate visual differences:

```
PLAYWRIGHT_RETRIES=0 PLAYWRIGHT_HTML_OPEN=never npm run e2e -- e2e-tests/<testfile>.spec.ts --update-snapshots
```

Verify the fix:

Re-run the test 10 times to confirm it's no longer flaky:
```
PLAYWRIGHT_RETRIES=0 PLAYWRIGHT_HTML_OPEN=never npm run e2e -- e2e-tests/<testfile>.spec.ts --repeat-each=10
```
The test should pass all 10 runs consistently.
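
When several files are being verified, the pass/fail outcome per file can be collected in one loop. This is a sketch under assumptions: run_repeated and classify_specs are hypothetical names, and the classification is per file, so a file reported as flaky may contain both stable and unstable tests. A non-zero exit can also mean a test fails on every run rather than intermittently, so the Playwright output still needs to be read.

```shell
# Hypothetical runner: the repeat command from this skill for one spec file.
# With retries disabled, a non-zero exit means at least one run failed.
run_repeated() {
  PLAYWRIGHT_RETRIES=0 PLAYWRIGHT_HTML_OPEN=never \
    npm run e2e -- "$1" --repeat-each=10
}

# Run "<runner> <spec>" for every spec and print one line per file:
# "stable <spec>" or "flaky <spec>". Returns 1 if any file was flaky.
classify_specs() {
  runner="$1"
  shift
  any_flaky=0
  for spec in "$@"; do
    if "$runner" "$spec" >/dev/null 2>&1; then
      printf 'stable %s\n' "$spec"
    else
      printf 'flaky %s\n' "$spec"
      any_flaky=1
    fi
  done
  return "$any_flaky"
}

# Possible usage:
#   classify_specs run_repeated e2e-tests/main.spec.ts
```

The printed lines can feed directly into the final report to the user.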
Summarize results:

Report to the user:
- Which tests were identified as flaky
- What was causing the flakiness
- What fixes were applied
- Verification results (all 10 runs passing)
- Any tests that could not be fixed and need further investigation

More from this repository

dyad-pr-push

dyad-sh/dyad

Commit any uncommitted changes, run lint checks, fix any issues, and push the current branch.

2026-06-15 · 20.7k stars

dyad-deflake-e2e-from-run

dyad-sh/dyad

Root-cause flaky or failing E2E tests from a specific CI run by downloading and analyzing the Playwright HTML report (traces, screenshots, errors). Use this when given a GitHub Actions run URL and asked to investigate failures. Diagnose from report artifacts first, then rebuild and rerun the affected E2E tests locally after making fixes.

2026-05-06 · 20.7k stars

dyad-pr-fix-actions

dyad-sh/dyad

Fix failing CI checks and GitHub Actions on a Pull Request.

2026-04-08 · 20.7k stars

dyad-pr-fix-comments

dyad-sh/dyad

Read all unresolved GitHub PR comments from trusted authors and address or resolve them appropriately.

2026-04-08 · 20.7k stars

dyad-deflake-e2e-recent-commits

dyad-sh/dyad

Automatically gather flaky E2E tests from recent CI runs on the main branch and from recent PRs by wwwillchen/keppo-bot/dyad-assistant, then deflake them.

2026-04-01 · 20.7k stars

dyadpromote-beta-to-stable

dyad-sh/dyad

Promote the latest pre-release to a stable release by creating a release branch, bumping the version, and pushing.

2026-03-09 · 20.7k stars
