Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

document-memory-builder

Read a specified set of documents and distill them into a reusable, source-grounded memory pack. Use when Codex is asked to ingest project documents, papers, specs, notes, policies, or mixed corpora and create durable memory artifacts such as canonical facts, terminology, workflows, constraints, entity maps, and open questions for later reuse. Also use for long-form or chaptered material such as books, long reports, multi-section drafts, or large corpus updates that require section-aware extraction and synthesis.

In Manus ausführen

Überblick

Installationsbefehl

npx skills add https://github.com/yananlong/codex-skills --skill document-memory-builder

Kopieren Sie diesen Befehl und fügen Sie ihn in Claude Code ein, um den Skill zu installieren

Quelle

yananlong/codex-skills

Sterne0

Forks0

Aktualisiert12. Mai 2026 um 23:06

Datei-Explorer

5 Dateien

SKILL.md

readonly

name

document-memory-builder

description

Document Memory Builder

Quick start

Require the document set and intended output location before doing substantive work.
Ask for the downstream purpose only if it materially changes what should be remembered; otherwise default to general reusable project memory.
Inventory the sources, identify canonical documents, and note recency/version signals.
Run python3 scripts/init_memory_pack.py <memory-name> --path <output-dir> to create the memory pack scaffold.
Read the corpus in priority order, extract durable information, and keep every memory item traceable to one or more sources.
Write the pack, explicitly separating stable memory from volatile items and unresolved conflicts.

Inputs and defaults

Required

documents: explicit file paths, URLs, or a clearly bounded document set.
output_dir: where the memory pack should be written.

Optional

memory_name: default to a hyphen-case name inferred from the document set or project folder.
purpose: default to general reusable memory.
overwrite: default to false. Refuse to overwrite an existing pack unless the user explicitly requests it.

Output contract

Create one memory directory named after memory_name inside output_dir:

<memory_name>.memory.md: canonical reusable memory.
<memory_name>.source-map.md: source inventory, authority, and coverage notes.
<memory_name>.open-questions.md: contradictions, gaps, and follow-up questions.

Use the section structure from references/memory-pack-template.md.

Workflow

1) Bound the corpus

Refuse vague scopes such as "read the repo" unless the user explicitly wants that breadth.
Convert the request into a concrete source list before reading in depth.
Record document type, path/URL, title if obvious, and any version/date signal.

2) Rank sources by authority

Use this default precedence unless the user specifies otherwise:

Normative specs, finalized design docs, and authoritative policy documents.
Maintained project docs, READMEs, architecture notes, and recent revision plans.
Working notes, issue threads, and exploratory drafts.

When two sources disagree, prefer the higher-authority source. If authority is equal, prefer the more recent source. If the conflict remains unresolved, keep both claims in open-questions instead of collapsing them.

3) Initialize the memory pack

Run python3 scripts/init_memory_pack.py.
Do not improvise new file names unless the user asks for a different structure.
If the pack already exists and overwrite is not allowed, update it in place rather than replacing it.

4) Read in passes

First pass: skim every source to understand scope, terminology, and duplication.
Second pass: read canonical sources closely and extract the durable backbone.
Third pass: read supporting sources to fill gaps, add examples only when they teach a reusable pattern, and surface conflicts.

Prefer compression over exhaustiveness. The goal is reusable memory, not a complete summary of every paragraph.

4b) Long materials and subagents

For long-form material, build a section or chapter map before extraction.
Split by meaningful units such as chapters, sections, appendices, scenes, or argument arcs, not by arbitrary size.
If one unit is too large, split at subheadings or clear conceptual turns, not mid-argument.
When runtime and user permissions explicitly allow subagents, delegate independent chunks in parallel.
If subagents are unavailable or not allowed, run the same chunk workflow serially in the main agent.
Give every chunk pass a shared brief: user goal, source-authority rules, terminology constraints, and locked facts.
For subagent runs, require each subagent to return extracted memory candidates for the assigned chunk with source pointers.
Also require brief notes on preserved facts, unresolved continuity or conflict issues, and cross-chunk transition needs.
Integrate all chunk outputs in the main agent: normalize voice and terminology, resolve or surface conflicts, remove duplicates, smooth cross-section continuity, and run a whole-corpus final pass before writing the pack.

5) Extract only memory-worthy content

Use references/extraction-rules.md when deciding what to keep. By default, preserve:

Stable facts, definitions, and terminology.
Enduring project goals, constraints, assumptions, and non-goals.
Reusable workflows, decision rules, and evaluation criteria.
Important entities and relationships.
Recurring pitfalls, caveats, and known failure modes.

By default, do not promote the following into canonical memory unless the user explicitly wants historical detail:

Transient status updates.
One-off examples that do not generalize.
Ephemeral deadlines, owners, or temporary plans.
Raw quotations without synthesis.

6) Keep source traceability

Every substantive memory item must cite at least one source.
For local files, prefer path:line pointers when feasible.
For PDFs or documents without stable line numbers, use page/section references.
If a memory item is inferred from multiple sources, label it as an inference and cite all relevant sources.

7) Separate stable memory from volatile memory

In <memory_name>.memory.md, keep volatile or likely-to-change items in a short Change watchlist section instead of mixing them into stable facts. Examples:

active milestones
provisional decisions
fast-moving metrics
still-debated terminology

8) Write open questions aggressively

Use <memory_name>.open-questions.md for:

unresolved contradictions
missing definitions
ambiguous ownership or process
references to documents that were not provided
claims that appear important but weakly supported

Do not silently guess when the corpus is incomplete.

9) Update behavior

When asked to refresh memory from new documents:

retain stable sections that still hold
add new source entries
revise or retire contradicted items with explicit notes
preserve prior unresolved questions unless the new corpus resolves them

Quality bar

Keep the main memory concise enough to reread quickly.
Prefer normalized terminology over source-specific phrasing.
Collapse duplicates across documents.
Mark uncertainty explicitly.
Optimize for future reuse by another agent or by the same agent in a later session.

Resources

scripts/init_memory_pack.py: initialize the standard memory pack directory and files.
references/memory-pack-template.md: required output structure.
references/extraction-rules.md: rules for deciding what belongs in durable memory, plus long-form chunking and integration guidance.

Mehr aus diesem Repository

gleiches Repository

paper-prose-polisher

yananlong/codex-skills

Revise existing academic drafts into clear, audience-facing paper prose while preserving evidence, technical meaning, uncertainty, and scope. Use when Codex needs to make a draft read like a paper rather than a lab report, memo, or internal note; convert project nicknames or internal nomenclature into reader-facing terminology; sharpen contribution framing without inventing novelty; or rewrite methods, results, limitations, and discussion prose for an academic audience. Do not use when the main task is paper planning, citation work, novelty review, or experiment design.

2026-05-270

research-review-loop

yananlong/codex-skills

Run iterative adversarial review over research plans, experiment outputs, and drafts with claim ledgers, issue tracking, evidence checks, and explicit closure criteria. Use when asked to red-team a research artifact across multiple rounds, maintain issue state across revisions, or pressure-test whether revised results and prose actually support a claim. Prefer `research-paper-review` for an initial single-paper critique or OCR/extraction workflow, and prefer `research-rebuttal` when concrete external reviewer comments already exist and the task is to draft a venue response.

2026-05-270

commercialize-academic-research

yananlong/codex-skills

Run evidence-gated commercialization analysis for academic research by translating a bounded research asset into workflow pain, current workarounds, buyer and budget logic, commercialization paths, risk-retirement tests, and kill/pivot/continue decisions. Use when Codex needs to evaluate market pull for a paper, prototype, dataset, device, algorithm, lab result, or platform technology; distinguish user, buyer, operator, procurement, and budget roles; compare startup, licensing, partnership, service, or component routes; identify beachhead wedges; build validation sprints; or red-team unsupported commercialization claims.

2026-05-210

expand-prose-reduce-bullets

yananlong/codex-skills

Rewrite or draft responses, documents, reviews, explanations, and notes that are too list-heavy, outline-like, or terse. Use when the user asks for fewer bullet points, more natural prose, fuller paragraphs, smoother narrative flow, or when a draft feels choppy, compressed, or over-structured. Typical triggers include turning notes into paragraphs, softening bullet-heavy summaries, expanding terse explanations, and preserving substance while making writing read like connected prose.

2026-05-100

research-experiment-plan

yananlong/codex-skills

Convert a concrete research claim into a tracked, decisive experiment plan that works either as a standalone planning artifact or as the experiment stage inside a coordinated research workflow. Use when asked to design experiments, define baselines or ablations, decide run order, separate must-run from nice-to-have evidence, or turn a claim plus evaluation goal into a validation plan.

2026-04-070

research-novelty-review

yananlong/codex-skills

Run a stringent, adversarial novelty review over a concrete research idea, method, protocol, artifact, or claimed finding. Use when asked to assess novelty, position a paper or project, build a prior-art matrix, decide whether something is incremental, or pressure-test whether the right move is to proceed, reframe, or abandon. Prefer `research-paper-review` for first-pass technical critique of a single paper artifact and `research-rebuttal` when the task is to answer concrete reviewer comments rather than establish novelty positioning from scratch.

2026-04-070

Quelle

yananlong

yananlong/codex-skills

GitHub-Repository öffnen Creator-Repositorys ansehen

Installationsbefehl

Download

In Manus ausführen

Nützlich fürSOC

SoftwareentwicklerInformatik- und Mathematikberufe15-1252L4

name

document-memory-builder

description

Document Memory Builder

Quick start

Require the document set and intended output location before doing substantive work.
Ask for the downstream purpose only if it materially changes what should be remembered; otherwise default to general reusable project memory.
Inventory the sources, identify canonical documents, and note recency/version signals.
Run python3 scripts/init_memory_pack.py <memory-name> --path <output-dir> to create the memory pack scaffold.
Read the corpus in priority order, extract durable information, and keep every memory item traceable to one or more sources.
Write the pack, explicitly separating stable memory from volatile items and unresolved conflicts.

Inputs and defaults

Required

documents: explicit file paths, URLs, or a clearly bounded document set.
output_dir: where the memory pack should be written.

Optional

memory_name: default to a hyphen-case name inferred from the document set or project folder.
purpose: default to general reusable memory.
overwrite: default to false. Refuse to overwrite an existing pack unless the user explicitly requests it.

Output contract

Create one memory directory named after memory_name inside output_dir:

<memory_name>.memory.md: canonical reusable memory.
<memory_name>.source-map.md: source inventory, authority, and coverage notes.
<memory_name>.open-questions.md: contradictions, gaps, and follow-up questions.

Use the section structure from references/memory-pack-template.md.

Workflow

1) Bound the corpus

Refuse vague scopes such as "read the repo" unless the user explicitly wants that breadth.
Convert the request into a concrete source list before reading in depth.
Record document type, path/URL, title if obvious, and any version/date signal.

2) Rank sources by authority

Use this default precedence unless the user specifies otherwise:

Normative specs, finalized design docs, and authoritative policy documents.
Maintained project docs, READMEs, architecture notes, and recent revision plans.
Working notes, issue threads, and exploratory drafts.

3) Initialize the memory pack

Run python3 scripts/init_memory_pack.py.
Do not improvise new file names unless the user asks for a different structure.
If the pack already exists and overwrite is not allowed, update it in place rather than replacing it.

4) Read in passes

First pass: skim every source to understand scope, terminology, and duplication.
Second pass: read canonical sources closely and extract the durable backbone.
Third pass: read supporting sources to fill gaps, add examples only when they teach a reusable pattern, and surface conflicts.

Prefer compression over exhaustiveness. The goal is reusable memory, not a complete summary of every paragraph.

4b) Long materials and subagents

For long-form material, build a section or chapter map before extraction.
Split by meaningful units such as chapters, sections, appendices, scenes, or argument arcs, not by arbitrary size.
If one unit is too large, split at subheadings or clear conceptual turns, not mid-argument.
When runtime and user permissions explicitly allow subagents, delegate independent chunks in parallel.
If subagents are unavailable or not allowed, run the same chunk workflow serially in the main agent.
Give every chunk pass a shared brief: user goal, source-authority rules, terminology constraints, and locked facts.
For subagent runs, require each subagent to return extracted memory candidates for the assigned chunk with source pointers.
Also require brief notes on preserved facts, unresolved continuity or conflict issues, and cross-chunk transition needs.
Integrate all chunk outputs in the main agent: normalize voice and terminology, resolve or surface conflicts, remove duplicates, smooth cross-section continuity, and run a whole-corpus final pass before writing the pack.

5) Extract only memory-worthy content

Use references/extraction-rules.md when deciding what to keep. By default, preserve:

Stable facts, definitions, and terminology.
Enduring project goals, constraints, assumptions, and non-goals.
Reusable workflows, decision rules, and evaluation criteria.
Important entities and relationships.
Recurring pitfalls, caveats, and known failure modes.

By default, do not promote the following into canonical memory unless the user explicitly wants historical detail:

Transient status updates.
One-off examples that do not generalize.
Ephemeral deadlines, owners, or temporary plans.
Raw quotations without synthesis.

6) Keep source traceability

Every substantive memory item must cite at least one source.
For local files, prefer path:line pointers when feasible.
For PDFs or documents without stable line numbers, use page/section references.
If a memory item is inferred from multiple sources, label it as an inference and cite all relevant sources.

7) Separate stable memory from volatile memory

In <memory_name>.memory.md, keep volatile or likely-to-change items in a short Change watchlist section instead of mixing them into stable facts. Examples:

active milestones
provisional decisions
fast-moving metrics
still-debated terminology

8) Write open questions aggressively

Use <memory_name>.open-questions.md for:

unresolved contradictions
missing definitions
ambiguous ownership or process
references to documents that were not provided
claims that appear important but weakly supported

Do not silently guess when the corpus is incomplete.

9) Update behavior

When asked to refresh memory from new documents:

retain stable sections that still hold
add new source entries
revise or retire contradicted items with explicit notes
preserve prior unresolved questions unless the new corpus resolves them

Quality bar

Keep the main memory concise enough to reread quickly.
Prefer normalized terminology over source-specific phrasing.
Collapse duplicates across documents.
Mark uncertainty explicitly.
Optimize for future reuse by another agent or by the same agent in a later session.

Resources

scripts/init_memory_pack.py: initialize the standard memory pack directory and files.
references/memory-pack-template.md: required output structure.
references/extraction-rules.md: rules for deciding what belongs in durable memory, plus long-form chunking and integration guidance.