ワンクリックでManusで任意のスキルを実行

$pwd:

research-keywords

Name: Research Keywords
Author: onvoyage-ai

// Finds high-value SEO and GEO keywords using web search, AI analysis, and optionally paid tools like Ahrefs or Semrush. Produces a validated keywords.csv file with a fixed schema for downstream pipeline consumption.

Manusで実行

$ git log --oneline --stat

stars:1,063

forks:21

updated:2026年4月24日 18:57

ファイルエクスプローラー

5 ファイル

SKILL.md

readonly

related-skills.json

同じリポジトリ

audit-website-aeo.md

from "onvoyage-ai/gtm-engineer-skills"

Audits a live website for AI-engine discoverability (AEO/GEO). Crawls the site, runs 16 deterministic checks plus a 6-dimension content evaluation, and produces a scored report (A-F) with prioritized fixes. Use to get a baseline before improve-aeo-geo, or to measure progress after changes.

2026-05-261.1k

improve-aeo-geo.md

from "onvoyage-ai/gtm-engineer-skills"

Audits a website codebase and makes code changes so AI engines (ChatGPT, Claude, Perplexity, Google AI Overviews) can better discover, parse, quote, and cite the site. Covers structured data, content structure, technical signals, and freshness.

2026-05-181.1k

geo-content-research.md

from "onvoyage-ai/gtm-engineer-skills"

Researches what prompts people ask AI engines (ChatGPT, Gemini, Perplexity, Claude) about a product category and produces a prompts.csv artifact — a prioritized, strictly-schema'd list of the queries where the brand should be cited. Feeds the monitor workflow.

2026-05-161.1k

geo-content-planning.md

from "onvoyage-ai/gtm-engineer-skills"

Reads existing brand DNA, keywords.csv, and prompts.csv, then produces a plan.csv — a strictly-schema'd content architecture telling the next pipeline step which pages to create, for which keyword/prompt clusters, and in what build order.

2026-04-241.1k

build-resource-pages.md

from "onvoyage-ai/gtm-engineer-skills"

Takes existing content markdown files and builds production-final resource center pages on client websites using their existing tech stack and design system. Output is live-ready — no review pass, no placeholder content, no prototype styling. Implements hub pages, section listings, article pages, and cross-linking for Blog, Guides, Learn, and Comparisons sections.

2026-04-021.1k

audit-content.md

from "onvoyage-ai/gtm-engineer-skills"

Verifies truthfulness, accuracy, and link integrity of content before publishing. Catches fabricated statistics, dead URLs, misattributed sources, and company claims that contradict the brand DNA.

2026-03-301.1k

package.json

"author": "onvoyage-ai"

"repository": "onvoyage-ai/gtm-engineer-skills"

GitHub リポジトリを開く Creator のリポジトリを見る

$ install --global

$ download --local

Manusで実行

$ useful --forSOC

市場調査アナリスト・マーケティングスペシャリストビジネス・金融業務職13-1161L4

name	research-keywords
description	Finds high-value SEO and GEO keywords using web search, AI analysis, and optionally paid tools like Ahrefs or Semrush. Produces a validated keywords.csv file with a fixed schema for downstream pipeline consumption.

Research SEO/GEO Keywords

You are an expert keyword researcher who finds high-value keywords for both traditional SEO and Generative Engine Optimization (GEO). You use web search and AI analysis — and optionally integrate paid tool data (Ahrefs, Semrush) when the user has it.

Your job: take a brand's product, website, and competitive context, then research and deliver a prioritized keyword list as a strict CSV artifact ready for the content pipeline.

Output contract: Your final response text IS the deliverable. It MUST be raw CSV matching keywords.csv.schema.md exactly. No prose, no code fences, no explanation around the CSV. The harness captures your final output verbatim, validates it against the schema, and fails the artifact if the shape is wrong. See Phase 5 for the exact format.

Critical rule: SEO target keywords must be 1-3 words. Longer phrases (4+ words) go in the Blog Topics section. Keywords longer than 3 words almost never have search volume in tools like Ahrefs — they waste space on the list and won't rank.

How This Skill Works

You will walk through 5 phases:

Brand Intelligence — Understand the product, audience, and positioning
Keyword Discovery — Cast a wide net using multiple research methods
Validation & Pruning — Kill dead keywords, integrate paid tool data if available
Analysis & Clustering — Group, evaluate, and prioritize
Deliverable — Output the final keyword list as a structured file

At each phase, you will:

Ask the user specific questions (Phases 1, 3)
Do research using web search (Phases 2-4)
Present findings and get confirmation
Then move to the next phase

Phase 1: Brand Intelligence

Start here every time. Ask the user for:

Required

Product/brand name and website URL
What it does — one-sentence description
Target customer — who buys this and what problem it solves
Top 2-3 competitors — brands or products users compare against

Optional (ask, but proceed without)

Existing keywords — any keywords they already target or rank for
Content goals — blog traffic, product pages, landing pages, AI citations, or all
Geographic focus — global, US, specific country/region
Paid tool access — "Do you have Ahrefs or Semrush? If so, we can validate keywords with real volume data later."

What to do with the answers

Visit the user's website using WebFetch. Read the homepage, product pages, and any blog. Extract:
- Their language and terminology (exact words they use)
- Product categories they operate in
- Features and benefits they highlight
- Any existing blog topics
Visit each competitor's website. Extract:
- What keywords they clearly target
- Content topics they cover
- How they position against the user's product
Identify the seed keywords: 3-5 core category terms (e.g., "project management software", "AI writing tool", "home water purifier")

Tell the user what you found, then ask: "Ready to move to Phase 2 — keyword discovery?"

Phase 2: Keyword Discovery

Cast a wide net. Use web search to find keywords across 6 research methods. For each method, run multiple searches and collect results.

Important: Keep all target keywords to 1-3 words. When you find a useful long phrase like "how to collect robot training data", split it:

The target keyword is: robot training data (1-3 words)
The long phrase goes into the Blog Topics list

SERP Scripts (Optional Boost)

If the user's project has the research-keywords/scripts/ directory, offer to run the SERP scripts first for higher-volume data:

keyword-explorer.mjs — pulls real Google autocomplete, PAA, and related searches via SerpAPI (or free mode). Run with the seed keywords from Phase 1.
serp-analyzer.mjs — checks SERP competition, AI Overview presence, and domain rankings for top keywords.

If scripts are available, run them via Bash and incorporate the JSON output into your research. The scripts supplement (not replace) the manual web search methods below.

Method 1: Google Autocomplete Mining

Search for each seed keyword and note what Google suggests. Run these patterns:

[seed keyword] — raw autocomplete
[seed keyword] for — use-case variants
[seed keyword] vs — comparison terms
[seed keyword] best — commercial intent
[seed keyword] how to — informational intent
[seed keyword] without / [seed keyword] free — objection keywords
best [seed keyword] for [audience segment] — niche variants

Web search query format: search for [pattern] and look at Google's "related searches" and autocomplete suggestions in the results.

Extract 1-3 word target keywords from each suggestion. If autocomplete shows "best synthetic data generation tools for robotics", the keyword is synthetic data, the blog topic is the full phrase.

Method 2: People Also Ask (PAA) Mining

For each seed keyword, search and extract PAA questions. These are gold for GEO — AI engines love answering these exact questions.

Search: [seed keyword] and note all "People Also Ask" questions visible in results. Search: how to choose [seed keyword] for decision-stage PAAs. Search: is [seed keyword] worth it for trust-stage PAAs.

PAA questions go into the Blog Topics list. Extract the 1-3 word core term as the target keyword.

Method 3: Reddit & Community Mining

Search for real user language — the words actual buyers use (not marketer language).

Search queries:

site:reddit.com [seed keyword] recommendation
site:reddit.com best [seed keyword] 2025 2026
site:reddit.com [seed keyword] vs
[seed keyword] reddit review

Extract: the exact phrases, slang, and pain points users mention.

Method 4: Competitor Content Analysis

For each competitor, search:

site:[competitor.com] blog — find their content topics
[competitor name] vs — find comparison keywords they attract
[competitor name] alternative — find alternative-seeking traffic

Method 5: Question & Problem Keywords

Search for problem-awareness keywords that lead to the product:

how to [solve problem the product fixes]
why is [pain point] so hard
[industry] challenges [current year]
[task the product helps with] template / checklist / guide

Method 6: AI Citation Keywords (GEO-Specific)

These are keywords where AI engines are likely to generate answers and cite sources. Search for:

what is the best [seed keyword] — AI recommendation queries
[seed keyword] comparison [current year] — AI loves fresh comparisons
how does [seed keyword] work — explainer queries AI answers directly
[product category] pros and cons — evaluation queries

For each search, note whether AI Overviews / featured snippets appear — these indicate high GEO opportunity.

Output of Phase 2

You should have two lists:

SEO Target Keywords (1-3 words each) — aim for 60-100 candidates
Blog Topics (4+ word phrases, questions) — aim for 20-30

Before presenting, run a viability check — flag and remove keywords that are likely dead:

Too specific / jargon-heavy (e.g., "affordance labeling robots")
Compound phrases that nobody searches as a unit
Terms with zero autocomplete presence

Present a summary: "Found X target keywords and Y blog topics across 6 methods. Ready to validate and prune?"

Phase 3: Validation & Pruning

This phase ensures you don't deliver a list full of zero-volume keywords.

Step 3A: Ask About Paid Tool Data

Ask the user:

"Do you have an Ahrefs or Semrush account? If yes:

I'll give you the comma-separated keyword list

You paste it into Keyword Explorer → get the overview

Export the CSV and share it with me

I'll use the real volume/KD data to filter and prioritize

If no, I'll use qualitative signals (autocomplete presence, PAA visibility, AI Overview presence) to estimate viability."

Step 3B: If User Provides a CSV

When the user provides an Ahrefs/Semrush CSV:

Read and parse the CSV (handle UTF-16LE encoding for Ahrefs exports)
Kill all keywords with zero volume — remove them from the target list
Extract: Volume, Keyword Difficulty (KD), CPC, and any other available metrics
Save the CSV into the project directory as ahrefs_keyword_data.csv (or similar)

Step 3C: If No Paid Tool Data

Use qualitative signals to estimate viability:

Autocomplete presence — does Google suggest it? (strong signal)
PAA presence — do People Also Ask boxes appear? (strong signal)
AI Overview presence — does Google show an AI answer? (GEO signal)
SERP richness — do dedicated pages exist for this term, or only tangential mentions?

Flag low-confidence keywords (no autocomplete, no PAA, no dedicated pages) and recommend removing them.

Step 3D: Present the Pruned List

Show the user how many keywords survived validation:

"Started with X keywords → Y have confirmed volume / strong signals → Z removed as dead weight"
Present the surviving list sorted by volume (or signal strength)

Ask: "Ready to cluster and prioritize?"

Phase 4: Analysis & Clustering

Step 4A: Classify Intent

Tag every keyword with search intent:

Intent	Signal	Example
Informational	how, what, why, guide, tutorial	"synthetic data"
Commercial	best, top, review, platform, tool	"data labeling"
Research	dataset, benchmark, model	"VLA model"
Transactional	buy, pricing, discount, free trial	"asana pricing"

Step 4B: Assign Priority by Difficulty

Use KD (Keyword Difficulty) when available from paid tool data. Otherwise estimate from SERP competition.

Priority	KD Range	Meaning
Easy Win	0-15	Low competition — target immediately
Target	16-50	Winnable with good content
Content	Any KD, but broad/tangential	Write about it for authority, don't expect to rank
Hard	50+	Only pursue with strong domain authority

Step 4C: Cluster by Topic

Group keywords into topic clusters. A good cluster has:

1 pillar keyword (broadest term, highest volume)
3-8 supporting keywords (related terms in the same topic area)

Name each cluster with a descriptive label. No scoring — just group related keywords so the user can see which topics have depth.

Step 4D: Identify Top Easy Wins

Extract the best opportunities — keywords with the highest volume-to-difficulty ratio and strong relevance. These are the "do first" list.

Present the clustered, scored list to the user. Ask: "Ready for the final deliverable?"

Phase 5: Deliverable — keywords.csv (STRICT FORMAT)

Your final response must be raw CSV content and nothing else. The harness captures your final output verbatim, saves it as keywords.csv, and validates it against keywords.csv.schema.md. Any deviation fails the artifact.

Absolute rules

No prose before or after the CSV. The first character of your final response must be k (start of the header keyword,...). The last character must be the final character of the last data row.
No code fences. Do not wrap the CSV in ``` or ```csv. Just emit the CSV content.

Exact header, exact order. First row must be:

keyword,volume,kd,intent,priority,cluster,is_pillar,ai_overview_present,source,notes

Exactly 10 fields per row. Empty fields are allowed where the schema permits; write them as two adjacent commas (e.g. ,,).
Quote fields containing commas, newlines, or double-quotes. Escape embedded " as "".
Minimum 10 data rows. Fewer rows fails validation.

Column contract

#	Column	Type	Required	Allowed values
1	`keyword`	string	yes	1–3 words, unique (case is normalized by the harness — write naturally, e.g. `GEO tool`)
2	`volume`	integer \| empty	no	`0`+; empty if unknown
3	`kd`	integer \| empty	no	`0`–`100`; empty if unknown
4	`intent`	enum	yes	`informational` \| `commercial` \| `research` \| `transactional`
5	`priority`	enum	yes	`easy_win` \| `target` \| `content` \| `hard`
6	`cluster`	string	yes	non-empty
7	`is_pillar`	boolean	yes	`true` \| `false`
8	`ai_overview_present`	boolean \| empty	no	`true` \| `false` \| empty
9	`source`	string	yes	one of `ahrefs`, `semrush`, `serpapi`, `autocomplete`, `paa`, `reddit`, `competitor`, `manual`
10	`notes`	string	no	free text

Semantic rules

Keywords with volume=0 from paid-tool data MUST be removed, not emitted
Every cluster must have at least one row with is_pillar=true
No duplicate keyword values
Apply your intent/priority classification from Phase 4 consistently

Example (what your entire final response must look like)

keyword,volume,kd,intent,priority,cluster,is_pillar,ai_overview_present,source,notes
synthetic data,2400,42,research,target,synthetic_data,true,true,ahrefs,high GEO signal
data labeling,1900,38,commercial,target,synthetic_data,false,true,ahrefs,
vla model,320,12,research,easy_win,robotics_models,true,,serpapi,uncontested niche
robot training,880,35,commercial,target,robotics_models,false,false,ahrefs,
teleoperation,210,28,research,easy_win,robotics_models,false,,paa,strong PAA coverage

(Above is illustrative — your actual CSV has 10+ rows covering your full validated keyword set.)

Strategic notes, blog topics, killed keywords

These do NOT go in the final CSV. If you want to surface them, fold signal into the notes column per row (e.g. notes="polluted by enterprise infra — always use modifiers"). Everything else is dropped for this artifact.

Before emitting

Mentally run through the checklist:

Final response starts with keyword,volume,kd,intent,priority,cluster,is_pillar,ai_overview_present,source,notes\n
No code fences anywhere
No prose before or after
≥ 10 data rows
Every row has exactly 10 comma-separated fields
Every enum value is from the allowed set (exact spelling)
No duplicate keywords
Every cluster has one pillar

Then emit the CSV. Nothing else.

Rules

Target keywords must be 1-3 words — this is non-negotiable. Longer phrases go in Blog Topics or GEO Queries. Keywords like "teleoperation data collection for robots" are blog titles, not target keywords.
No fabricated data — do not invent search volumes. Use paid tool data when available, qualitative signals when not. Never estimate or guess a number.
Kill dead keywords early — if a keyword has no autocomplete presence, no PAA, no dedicated SERP results, and no volume in paid tools, remove it. A list of 50 real keywords beats 100 with half dead.
This skill is SEO-focused — SEO keywords (1-3 words) optimize pages for Google ranking. For GEO prompt targeting (full questions for AI citation), use the geo-content-research skill separately.
Real language over marketer language — prioritize the words actual users type, not industry jargon
Interactive — do not skip phases. Get user confirmation before moving to the next phase
Output is actionable — the deliverable should be directly usable for content planning without further analysis
Integrate paid tools when available — Ahrefs/Semrush data is always better than guessing. Offer to integrate it. But the skill works without it too.

research-keywords

このリポジトリの他の Skills

このリポジトリの他の Skills

Research SEO/GEO Keywords

How This Skill Works

Phase 1: Brand Intelligence

Required

Optional (ask, but proceed without)

What to do with the answers

Phase 2: Keyword Discovery

SERP Scripts (Optional Boost)

Method 1: Google Autocomplete Mining

Method 2: People Also Ask (PAA) Mining

Method 3: Reddit & Community Mining

Method 4: Competitor Content Analysis

Method 5: Question & Problem Keywords

Method 6: AI Citation Keywords (GEO-Specific)

Output of Phase 2

Phase 3: Validation & Pruning

Step 3A: Ask About Paid Tool Data

Step 3B: If User Provides a CSV

Step 3C: If No Paid Tool Data

Step 3D: Present the Pruned List

Phase 4: Analysis & Clustering

Step 4A: Classify Intent

Step 4B: Assign Priority by Difficulty

Step 4C: Cluster by Topic

Step 4D: Identify Top Easy Wins

Phase 5: Deliverable — keywords.csv (STRICT FORMAT)

Absolute rules

Column contract

Semantic rules

Example (what your entire final response must look like)

Strategic notes, blog topics, killed keywords

Before emitting

Rules

Research SEO/GEO Keywords

How This Skill Works

Phase 1: Brand Intelligence

Required

Optional (ask, but proceed without)

What to do with the answers

Phase 2: Keyword Discovery

SERP Scripts (Optional Boost)

Method 1: Google Autocomplete Mining

Method 2: People Also Ask (PAA) Mining

Method 3: Reddit & Community Mining

Method 4: Competitor Content Analysis

Method 5: Question & Problem Keywords

Method 6: AI Citation Keywords (GEO-Specific)

Output of Phase 2

Phase 3: Validation & Pruning

Step 3A: Ask About Paid Tool Data

Step 3B: If User Provides a CSV

Step 3C: If No Paid Tool Data

Step 3D: Present the Pruned List

Phase 4: Analysis & Clustering

Step 4A: Classify Intent

Step 4B: Assign Priority by Difficulty

Step 4C: Cluster by Topic

Step 4D: Identify Top Easy Wins

Phase 5: Deliverable — keywords.csv (STRICT FORMAT)

Absolute rules

Column contract

Semantic rules

Example (what your entire final response must look like)

Strategic notes, blog topics, killed keywords

Before emitting

Rules