원클릭으로
원클릭으로
| name | citorigin |
| description | Run, inspect, and explain CitOrigin evidence-to-claim audit workflows. |
| context | inline |
| allowed-tools | Bash, Read, Grep, Glob |
| arguments | task |
User request:
${task}
Execute the user request now. This is not a request to describe the skill.
Use this skill for CitOrigin run, inspect, debug, visualize, and explain workflows.
CitOrigin is an audit tool, not a generation tool. The main workflow is:
shapley: exact full-subset attributiondrop_hold_per_unit: SelfCite-style diagnostics with drop, hold, and selfcite_rewardshapley,drop_hold_per_unit: output bothDo not answer with a generic readiness message. If the request already names a workflow, file, example id, or output path, complete that task in this turn.
Assume the current working directory is the CitOrigin project root.
Use the active Python environment. Prefer:
PYTHONPATH=src python -m citorigin.cli ...
If the project is already installed in the environment, python -m citorigin.cli ...
also works.
Use relative paths from the repository root. Do not assume any machine-specific absolute paths.
Read $ARGUMENTS and choose the closest workflow:
input-path, documents, claims, evidence blocks.documents.json plus a claims file.生成可视化 html.If the requested workflow would overwrite an existing output file, choose a new timestamped or descriptive output path unless the user explicitly asked to overwrite.
When the user asks for the main CitOrigin workflow, interpret it as:
Use one of these three inputs:
score-claimscore-claim-from-filesscore-claims-from-exampleshapley
attribution.shapley_values, support_strength, weakly_groundeddrop_hold_per_unit
drop_from_full, hold_vs_empty, selfcite_rewardshapley,drop_hold_per_unit
If the user does not specify, default to:
shapley
After scoring, use:
scripts/build_attribution_reader_demo.py
This HTML builder supports:
score-claimscore-claims-from-exampleaudit_outputs.drop_hold_per_unit is present:
ShapleySelfCite-styleHoldDropUse this when the user provides a JSON payload or wants to score externally provided evidence blocks and claims.
Expected payload shape:
{
"question": "optional string",
"claim_text": "required string",
"documents": [
{
"doc_id": "d1",
"title": "optional string",
"content": "required string"
}
]
}
Run:
PYTHONPATH=src python \
-m citorigin.cli score-claim \
--provider <local_or_api> \
--audit-methods <audit_methods> \
--input-path <payload_json_path> \
--output-path <result_json_path>
If --provider local, include:
--model-path <model_path> --generator-backend transformers --device-map auto --torch-dtype bfloat16
If --provider api, include:
--api-config-path ./api_config.env
Use this when the user gives one claim text plus evidence files.
Supported evidence inputs:
--pdf-path--txt-path--doc-json-pathRun:
PYTHONPATH=src python \
-m citorigin.cli score-claim-from-files \
--provider <local_or_api> \
--audit-methods <audit_methods> \
--claim-text "<claim_text>" \
--question "<optional question>" \
--pdf-path <pdf1> \
--pdf-path <pdf2> \
--doc-json-path <json1> \
--output-path <result_json_path>
Use this workflow when the user talks about:
Use this when the user has multiple claims plus documents packaged under one folder.
Expected example directory contents:
*.pdf, ordocuments.jsonClaims input:
claims.json, orRun:
PYTHONPATH=src python \
-m citorigin.cli score-claims-from-example \
--provider <local_or_api> \
--audit-methods <audit_methods> \
--example-dir <example_dir> \
--claims-path <claims_json_path> \
--output-path <result_json_path>
This is the best workflow when the user says:
Use this whenever the user asks for:
生成可视化 htmlRun:
python scripts/build_attribution_reader_demo.py \
--result-path <result_json_path> \
--claims-path <claims_json_path_if_needed> \
--output-html <output_html_path>
If the result JSON is from:
score-claim
claims-path is optionalscore-claims-from-example
After generation, report both paths:
Use this when the user asks to explain or compare results.
If the result contains attribution, summarize:
shapley_valuesnormalized_shapley_valuessupport_strengthweakly_groundeddominant_segmentsIf the result contains audit_outputs.drop_hold_per_unit, summarize:
drop_from_fullhold_vs_emptyselfcite_rewardWhen comparing methods:
selfcite_reward = drop + holdUse this when $ARGUMENTS includes project status, scope, handoff, or what is this project.
Read if present and relevant:
PROJECT_STATUS.md
README.md
README_EN.md
docs/project_framing_and_evaluation_cn.md
Summarize:
api_config.env.