تشغيل أي مهارة في Manus بنقرة واحدة

ابدأ الآن

evaluation

النجوم٠

التفرعات٠

آخر تحديث١٧ فبراير ٢٠٢٦ في ١٥:٥٩

Reference templates for Codex evaluation. Used by build/improve orchestrators — not executed directly.

التثبيت

التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.

تشغيل في Manus

المصدر

Objective-Arts

Objective-Arts/lens-dist

فتح مستودع GitHub عرض مستودعات المنشئ

تنزيل

تشغيل في Manus

المهن ذات الصلةSOC

استنادا إلى تصنيف SOC المهني

محللو ضمان جودة البرمجيات والمختبرونمهن الحاسوب والرياضيات·SOC 15-1253

SKILL.md

readonly

name	evaluation
description	Reference templates for Codex evaluation. Used by build/improve orchestrators — not executed directly.

Evaluation Reference

Templates for the Phase 8 evaluation loop. The orchestrator in /build and /improve reads these templates and runs scoring via Bash.

This file is NOT executed directly. The orchestrator owns the score-fix loop. Scoring runs via codex exec in Bash — never delegated to an agent (agents fabricate scores).

Rubric Loading

Read .claude/rubric/AUTO-DETECT.md for the detection table
Always load: .claude/rubric/base.md and .claude/rubric/product-quality.md
Auto-detect domains: check target files against the detection table, load matching domain rubrics
Combine into {RUBRIC_CRITERIA}

If a rubric file doesn't exist, skip it and continue.

Scorecard Prompt

The orchestrator runs this directly via Bash:

cd {TARGET} && codex exec -s read-only -o /tmp/lens-eval-scores.md "CODE QUALITY REVIEW

Rate this codebase on a scale of 1-100. Evaluate everything: code quality, security, error handling, naming, structure, test coverage, CI/CD, documentation, and project hygiene.

Also check against these criteria:
{RUBRIC_CRITERIA}

Every issue you report will be sent to an agent for fixing. Be specific — cite the exact file and line, and say exactly what needs to change.

OUTPUT FORMAT (strict — no prose, no strengths, no explanation):

ISSUE: {file:line} — {description}
ISSUE: {file:line} — {description}
...

SCORE: NN/100" 2>&1

Rescore Prompt

After fixes are applied, the orchestrator runs this to get the final score:

cd {TARGET} && codex exec -s read-only -o /tmp/lens-eval-scores.md "CODE QUALITY RE-SCORE

Previous score: {PREVIOUS_SCORE}/100

Fixes applied since last scoring:

{FIX_APPLIED_LINES}

Re-read the codebase and re-score 1-100. Every issue you report will be sent to an agent for fixing. Be specific.

OUTPUT FORMAT (strict — no prose, no strengths, no explanation):

ISSUE: {file:line} — {description}
...

SCORE: NN/100" 2>&1

Classification Tree

For each fix applied, the LESSON agent classifies:

Code pattern that should be avoided in future code?
  YES -> General rule?
    YES -> LESSON -> both lessons files (deduped)
    NO  -> LESSON -> .claude/lessons.md only
  NO  -> Suggests pipeline/tool/config change?
    YES -> PROPOSAL -> .claude/eval-proposals.md
    NO  -> eval-report.md only

Each LESSON gets a category: LOGIC, DESIGN, CODE_QUALITY, DUPLICATION, or AI_SMELL.

Report Template

The LESSON agent replaces .claude/eval-report.md with:

# Eval Report — {TARGET}

**Date:** {ISO date}
**Evaluator:** Codex
**Score:** {initial}/100 → {final}/100

## Issues Found ({count})

| # | File | Issue |
|---|------|-------|
| 1 | {file:line} | {description} |

## Fixes Applied ({count})

| # | File | Fix |
|---|------|-----|
| 1 | {file:line} | {what was fixed} |

## Remaining Issues ({count})

| # | File | Issue |
|---|------|-------|
| 1 | {file:line} | {description} |

## Lessons ({count})

| # | Category | Description |
|---|----------|-------------|
| 1 | {cat} | {desc} |

## Proposals ({count})

| # | Type | Description | Action |
|---|------|-------------|--------|
| 1 | {type} | {desc} | {action} |

Lesson Files

.claude/lessons.md — append new lessons under appropriate category sections
.claude/universal-lessons.md — append only general patterns (not project-specific), deduplicate against existing
.claude/eval-proposals.md — append new proposals with PENDING status

المزيد من هذا المستودع

نفس المستودع

code-scan

Objective-Arts/lens-dist

Read-only quality scan of components. Reports problems without making changes. Uses software-base + domain profile skills.

2026-02-170

refactoring

Objective-Arts/lens-dist

Refactoring patterns - improving code design without changing behavior

2026-02-170

code-scan

Objective-Arts/lens-dist

Read-only quality scan of components. Reports problems without making changes. Uses software-base + domain profile skills.

2026-02-170

codex-review

Objective-Arts/lens-dist

Internal phase: independent Codex review + targeted fixes. Not user-facing.

2026-02-170

deduplication

Objective-Arts/lens-dist

Find duplicated code and consolidate into shared utilities. Fixes all duplicates.

2026-02-170

gemini-review

Objective-Arts/lens-dist

Hard-ass code review via Gemini. ALL issues must be fixed. No exceptions.

2026-02-170

name	evaluation
description	Reference templates for Codex evaluation. Used by build/improve orchestrators — not executed directly.

Evaluation Reference

Templates for the Phase 8 evaluation loop. The orchestrator in /build and /improve reads these templates and runs scoring via Bash.

This file is NOT executed directly. The orchestrator owns the score-fix loop. Scoring runs via codex exec in Bash — never delegated to an agent (agents fabricate scores).

Rubric Loading

Read .claude/rubric/AUTO-DETECT.md for the detection table
Always load: .claude/rubric/base.md and .claude/rubric/product-quality.md
Auto-detect domains: check target files against the detection table, load matching domain rubrics
Combine into {RUBRIC_CRITERIA}

If a rubric file doesn't exist, skip it and continue.

Scorecard Prompt

The orchestrator runs this directly via Bash:

cd {TARGET} && codex exec -s read-only -o /tmp/lens-eval-scores.md "CODE QUALITY REVIEW

Rate this codebase on a scale of 1-100. Evaluate everything: code quality, security, error handling, naming, structure, test coverage, CI/CD, documentation, and project hygiene.

Also check against these criteria:
{RUBRIC_CRITERIA}

Every issue you report will be sent to an agent for fixing. Be specific — cite the exact file and line, and say exactly what needs to change.

OUTPUT FORMAT (strict — no prose, no strengths, no explanation):

ISSUE: {file:line} — {description}
ISSUE: {file:line} — {description}
...

SCORE: NN/100" 2>&1

Rescore Prompt

After fixes are applied, the orchestrator runs this to get the final score:

cd {TARGET} && codex exec -s read-only -o /tmp/lens-eval-scores.md "CODE QUALITY RE-SCORE

Previous score: {PREVIOUS_SCORE}/100

Fixes applied since last scoring:

{FIX_APPLIED_LINES}

Re-read the codebase and re-score 1-100. Every issue you report will be sent to an agent for fixing. Be specific.

OUTPUT FORMAT (strict — no prose, no strengths, no explanation):

ISSUE: {file:line} — {description}
...

SCORE: NN/100" 2>&1

Classification Tree

For each fix applied, the LESSON agent classifies:

Code pattern that should be avoided in future code?
  YES -> General rule?
    YES -> LESSON -> both lessons files (deduped)
    NO  -> LESSON -> .claude/lessons.md only
  NO  -> Suggests pipeline/tool/config change?
    YES -> PROPOSAL -> .claude/eval-proposals.md
    NO  -> eval-report.md only

Each LESSON gets a category: LOGIC, DESIGN, CODE_QUALITY, DUPLICATION, or AI_SMELL.

Report Template

The LESSON agent replaces .claude/eval-report.md with:

# Eval Report — {TARGET}

**Date:** {ISO date}
**Evaluator:** Codex
**Score:** {initial}/100 → {final}/100

## Issues Found ({count})

| # | File | Issue |
|---|------|-------|
| 1 | {file:line} | {description} |

## Fixes Applied ({count})

| # | File | Fix |
|---|------|-----|
| 1 | {file:line} | {what was fixed} |

## Remaining Issues ({count})

| # | File | Issue |
|---|------|-------|
| 1 | {file:line} | {description} |

## Lessons ({count})

| # | Category | Description |
|---|----------|-------------|
| 1 | {cat} | {desc} |

## Proposals ({count})

| # | Type | Description | Action |
|---|------|-------------|--------|
| 1 | {type} | {desc} | {action} |

Lesson Files

.claude/lessons.md — append new lessons under appropriate category sections
.claude/universal-lessons.md — append only general patterns (not project-specific), deduplicate against existing
.claude/eval-proposals.md — append new proposals with PENDING status