Run any Skill in Manus with one click

$pwd:

create-generation-plugin

Name: Create Generation Plugin
Author: NomaDamas

// Guide developers through creating a custom generation pipeline plugin for AutoRAG-Research. Walks through scaffolding, implementing BaseGenerationPipeline methods, composing with retrieval pipelines, writing YAML configs, testing, and installing. Use when building a new RAG generation strategy (e.g., chain-of-thought RAG, multi-hop RAG).

Run Skill in Manus

$ git log --oneline --stat

stars:139

forks:22

updated:March 28, 2026 at 08:36

SKILL.md

readonly

name	create-generation-plugin
description	Guide developers through creating a custom generation pipeline plugin for AutoRAG-Research. Walks through scaffolding, implementing BaseGenerationPipeline methods, composing with retrieval pipelines, writing YAML configs, testing, and installing. Use when building a new RAG generation strategy (e.g., chain-of-thought RAG, multi-hop RAG).
allowed-tools	["Bash","Read","Write","Edit"]

Create Generation Plugin

Workflow

1. Scaffold

autorag-research plugin create my_rag --type=generation

Read the generated pipeline.py, pyproject.toml, YAML config, and test file to understand the structure.

2. Implement

For the shared pipeline implementation and testing rules, read:

ai_instructions/pipeline_implementer.md
ai_instructions/pipeline_test_writer.md
ai_instructions/pipeline_architecture_mapper.md

Implement the _generate(query_id, top_k) method. This is where your RAG strategy lives.

Available attributes inside the pipeline:

self._llm — LangChain BaseLanguageModel (use await self._llm.ainvoke(prompt))
self._retrieval_pipeline — composed retrieval pipeline (use await self._retrieval_pipeline._retrieve_by_id(query_id, top_k))
self._service — GenerationPipelineService (use self._service.get_chunk_contents(chunk_ids), self._get_query_text(query_id))

Must return a GenerationResult(text=...) (from autorag_research.orm.service.generation_pipeline).

DO NOT add your own asyncio.gather, asyncio.Semaphore, or any concurrency control. The base pipeline's run() already handles parallel execution of all queries via run_with_concurrency_limit() (semaphore + gather), controlled by the max_concurrency config parameter. Your _generate method is called once per single query — just implement the retrieve-and-generate logic for that one query.

Custom parameters: Add fields to your config class and pass them via get_pipeline_kwargs() → accept them in the pipeline constructor.

Inherited config fields (from BaseGenerationPipelineConfig):

llm — LLM model string (auto-converted to LangChain model instance)
retrieval_pipeline_name — name of the retrieval pipeline to compose with (Executor injects it)

3. Write tests and install

Use langchain_core.language_models.FakeListLLM to mock the LLM in tests.

cd my_rag_plugin
pip install -e .   # or: uv pip install -e .
cd .. && autorag-research plugin sync

Verify: ls configs/pipelines/generation/my_rag.yaml

Key Files

Purpose	Path
Base config class	`autorag_research/config.py` → `BaseGenerationPipelineConfig`
Base pipeline class	`autorag_research/pipelines/generation/base.py` → `BaseGenerationPipeline`
Service + GenerationResult	`autorag_research/orm/service/generation_pipeline.py`
Plugin entry point discovery	`autorag_research/plugin_registry.py`

Examples

Study these existing implementations for patterns:

autorag_research/pipelines/generation/basic_rag.py — Simple retrieve-then-generate (start here)
autorag_research/pipelines/generation/ircot.py — Interleaving retrieval with chain-of-thought
autorag_research/pipelines/generation/et2rag.py — Entity-aware RAG
autorag_research/pipelines/generation/main_rag.py — Main RAG pipeline
YAML configs: configs/pipelines/generation/basic_rag.yaml, configs/pipelines/generation/ircot.yaml

related-skills.json

same repository

create-ingestor-plugin.md

from "NomaDamas/AutoRAG-Research"

Guide developers through creating a custom data ingestor plugin for AutoRAG-Research. Ingestors load external datasets (HuggingFace, local files, APIs) into the database. Uses @register_ingestor decorator for automatic CLI parameter extraction. Use when ingesting a new dataset format into AutoRAG-Research.

2026-03-28139

create-retrieval-plugin.md

from "NomaDamas/AutoRAG-Research"

Guide developers through creating a custom retrieval pipeline plugin for AutoRAG-Research. Walks through scaffolding, implementing BaseRetrievalPipeline methods, writing YAML configs, testing, and installing. Use when building a new search/retrieval strategy (e.g., Elasticsearch, ColBERT, custom vector search).

2026-03-28139

create-metric-plugin.md

from "NomaDamas/AutoRAG-Research"

Guide developers through creating a custom evaluation metric plugin for AutoRAG-Research. Covers both retrieval metrics (recall, precision, etc.) and generation metrics (BLEU, ROUGE, etc.). Walks through scaffolding, implementing metric functions with @metric decorators, writing configs, testing, and installing. Use when building a new evaluation metric.

2026-02-21139

autorag-query.md

from "NomaDamas/AutoRAG-Research"

Query AutoRAG-Research pipeline results using natural language. Converts questions to SQL, executes safely (SELECT-only), returns formatted results. Auto-detects DB connection from configs/db.yaml or env vars. Use for pipeline comparison, metrics analysis, token usage.

2026-02-20139

resolve-conversation.md

from "NomaDamas/AutoRAG-Research"

Process [APPROVE] and [IGNORE] replies on /refactor review threads. Applies approved fixes to the codebase, resolves all responded threads on GitHub, commits and pushes changes. Sequential single-agent workflow. All output is in English.

2026-02-10139

refactor.md

from "NomaDamas/AutoRAG-Research"

Orchestrate a 3-agent PR code review debate using Claude Code Teams. Spawns Devil's Advocate, Neutral Judge, and Approval Advocate reviewers who analyze the current PR diff in parallel. Synthesizes findings, auto-fixes unanimous issues, and posts inline PR comments for disagreements. All output is in English.

2026-02-10139

package.json

"author": "NomaDamas"

"repository": "NomaDamas/AutoRAG-Research"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software DevelopersComputer and Mathematical Occupations15-1252L4

name	create-generation-plugin
description	Guide developers through creating a custom generation pipeline plugin for AutoRAG-Research. Walks through scaffolding, implementing BaseGenerationPipeline methods, composing with retrieval pipelines, writing YAML configs, testing, and installing. Use when building a new RAG generation strategy (e.g., chain-of-thought RAG, multi-hop RAG).
allowed-tools	["Bash","Read","Write","Edit"]

Create Generation Plugin

Workflow

1. Scaffold

autorag-research plugin create my_rag --type=generation

Read the generated pipeline.py, pyproject.toml, YAML config, and test file to understand the structure.

2. Implement

For the shared pipeline implementation and testing rules, read:

ai_instructions/pipeline_implementer.md
ai_instructions/pipeline_test_writer.md
ai_instructions/pipeline_architecture_mapper.md

Implement the _generate(query_id, top_k) method. This is where your RAG strategy lives.

Available attributes inside the pipeline:

self._llm — LangChain BaseLanguageModel (use await self._llm.ainvoke(prompt))
self._retrieval_pipeline — composed retrieval pipeline (use await self._retrieval_pipeline._retrieve_by_id(query_id, top_k))
self._service — GenerationPipelineService (use self._service.get_chunk_contents(chunk_ids), self._get_query_text(query_id))

Must return a GenerationResult(text=...) (from autorag_research.orm.service.generation_pipeline).

DO NOT add your own asyncio.gather, asyncio.Semaphore, or any concurrency control. The base pipeline's run() already handles parallel execution of all queries via run_with_concurrency_limit() (semaphore + gather), controlled by the max_concurrency config parameter. Your _generate method is called once per single query — just implement the retrieve-and-generate logic for that one query.

Custom parameters: Add fields to your config class and pass them via get_pipeline_kwargs() → accept them in the pipeline constructor.

Inherited config fields (from BaseGenerationPipelineConfig):

llm — LLM model string (auto-converted to LangChain model instance)
retrieval_pipeline_name — name of the retrieval pipeline to compose with (Executor injects it)

3. Write tests and install

Use langchain_core.language_models.FakeListLLM to mock the LLM in tests.

cd my_rag_plugin
pip install -e .   # or: uv pip install -e .
cd .. && autorag-research plugin sync

Verify: ls configs/pipelines/generation/my_rag.yaml

Key Files

Purpose	Path
Base config class	`autorag_research/config.py` → `BaseGenerationPipelineConfig`
Base pipeline class	`autorag_research/pipelines/generation/base.py` → `BaseGenerationPipeline`
Service + GenerationResult	`autorag_research/orm/service/generation_pipeline.py`
Plugin entry point discovery	`autorag_research/plugin_registry.py`

Examples

Study these existing implementations for patterns:

autorag_research/pipelines/generation/basic_rag.py — Simple retrieve-then-generate (start here)
autorag_research/pipelines/generation/ircot.py — Interleaving retrieval with chain-of-thought
autorag_research/pipelines/generation/et2rag.py — Entity-aware RAG
autorag_research/pipelines/generation/main_rag.py — Main RAG pipeline
YAML configs: configs/pipelines/generation/basic_rag.yaml, configs/pipelines/generation/ircot.yaml

create-generation-plugin

Create Generation Plugin

Workflow

1. Scaffold

2. Implement

3. Write tests and install

Key Files

Examples

More from this repository

Create Generation Plugin

Workflow

1. Scaffold

2. Implement

3. Write tests and install

Key Files

Examples

More from this repository