Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

rank-llm-verify

Name: Rank Llm Verify
Author: castorini

// Use when validating rank_llm artifacts after rerank, retrieve-cache, or related CLI workflows. Checks JSONL integrity, TREC formatting, candidate shape, invocation-history structure, and duplicate query identifiers. Wraps rank-llm view plus custom assertions.

Ejecutar en Manus

$ git log --oneline --stat

stars:598

forks:92

updated:27 de marzo de 2026, 14:53

Explorador de archivos

2 archivos

SKILL.md

readonly

name	rank-llm-verify
description	Use when validating rank_llm artifacts after rerank, retrieve-cache, or related CLI workflows. Checks JSONL integrity, TREC formatting, candidate shape, invocation-history structure, and duplicate query identifiers. Wraps rank-llm view plus custom assertions.

rank_llm Verify

Validates stored RankLLM artifacts for correctness and structural consistency.

When to Use

After rank-llm rerank
After rank-llm retrieve-cache
Before using rerank outputs for evaluate or analyze
When comparing artifacts across models or prompt templates

What It Checks

JSONL Integrity

Every line is valid JSON
No empty files
No truncated or malformed records

Request Input

Every record has query and candidates
Every record has at least one candidate
Candidate entries expose either doc or text

Rerank Output

Every record has query and candidates
Every candidate has docid, score, and doc
No duplicate query identifiers when qid values are present

Invocation History

Top-level file is a JSON list
Each record has an invocations_history list
Every invocation entry is object-shaped

TREC Output

Every non-empty line has 6 columns
Rank field is an integer
Score field is numeric

Usage

Run the verification script:

bash .claude/skills/rank-llm-verify/scripts/verify.sh <artifact-path> [artifact-type]

Supported artifact types:

request-input
rerank-output
invocations-history
trec-output

If no artifact type is provided, the script attempts to auto-detect it with rank-llm view.

Verification Script

See scripts/verify.sh for the runnable verification wrapper.

Gotchas

rank-llm validate rerank checks input contracts before execution. The verify script checks stored output artifacts after execution.
rank-llm view only detects supported artifact families. If a file is not one of those shapes, pass the artifact type manually or inspect the file directly.
A rerank JSONL file can be structurally valid and still be low quality. Use evaluate and analyze for score and response diagnostics.

related-skills.json

mismo repositorio

rank-llm-install.md

from "castorini/rank_llm"

Set up a rank_llm development environment. Use when someone is onboarding, setting up a fresh clone, choosing extras such as cloud, api, local, or pyserini, or troubleshooting whether the packaged rank-llm CLI is ready.

2026-05-30598

rank-llm-quickstart.md

from "castorini/rank_llm"

Use when working with the rank-llm CLI: rerank, evaluate, analyze, retrieve-cache, serve, validate, prompt, view, describe, schema, or doctor. Covers entry points, common flags, JSONL and TREC artifacts, and end-to-end retrieval plus reranking workflows.

2026-05-30598

rank-llm-eval.md

from "castorini/rank_llm"

Use when analyzing rank_llm evaluation outputs across runs or models. Covers aggregated trec_eval JSONL files, response-analysis metrics, retrieval-cache handoff files, and side-by-side comparison of stored evaluation artifacts.

2026-03-27598

package.json

"author": "castorini"

"repository": "castorini/rank_llm"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas15-1252L4

name	rank-llm-verify
description	Use when validating rank_llm artifacts after rerank, retrieve-cache, or related CLI workflows. Checks JSONL integrity, TREC formatting, candidate shape, invocation-history structure, and duplicate query identifiers. Wraps rank-llm view plus custom assertions.

rank_llm Verify

Validates stored RankLLM artifacts for correctness and structural consistency.

When to Use

After rank-llm rerank
After rank-llm retrieve-cache
Before using rerank outputs for evaluate or analyze
When comparing artifacts across models or prompt templates

What It Checks

JSONL Integrity

Every line is valid JSON
No empty files
No truncated or malformed records

Request Input

Every record has query and candidates
Every record has at least one candidate
Candidate entries expose either doc or text

Rerank Output

Every record has query and candidates
Every candidate has docid, score, and doc
No duplicate query identifiers when qid values are present

Invocation History

Top-level file is a JSON list
Each record has an invocations_history list
Every invocation entry is object-shaped

TREC Output

Every non-empty line has 6 columns
Rank field is an integer
Score field is numeric

Usage

Run the verification script:

bash .claude/skills/rank-llm-verify/scripts/verify.sh <artifact-path> [artifact-type]

Supported artifact types:

request-input
rerank-output
invocations-history
trec-output

If no artifact type is provided, the script attempts to auto-detect it with rank-llm view.

Verification Script

See scripts/verify.sh for the runnable verification wrapper.

Gotchas

rank-llm validate rerank checks input contracts before execution. The verify script checks stored output artifacts after execution.
rank-llm view only detects supported artifact families. If a file is not one of those shapes, pass the artifact type manually or inspect the file directly.
A rerank JSONL file can be structurally valid and still be low quality. Use evaluate and analyze for score and response diagnostics.

rank-llm-verify

rank_llm Verify

When to Use

What It Checks

JSONL Integrity

Request Input

Rerank Output

Invocation History

TREC Output

Usage

Verification Script

Gotchas

Más de este repositorio

Más de este repositorio

rank_llm Verify

When to Use

What It Checks

JSONL Integrity

Request Input

Rerank Output

Invocation History

TREC Output

Usage

Verification Script

Gotchas