Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

thumbnail-creator

Name: Thumbnail Creator
Author: mohitagw15856

// Generate article or newsletter thumbnail candidates using the Gemini API from inside Claude Code. Claude reads article copy, proposes composition concepts, writes image generation prompts incorporating brand specs, calls Gemini to generate the images, evaluates the results via computer vision, and returns ranked candidates with rationale. Use when asked to create thumbnails, generate cover images, or produce visual candidates for an article or newsletter.

Ejecutar en Manus

$ git log --oneline --stat

stars:915

forks:165

updated:27 de mayo de 2026, 12:29

SKILL.md

readonly

related-skills.json

mismo repositorio

aeo-optimizer.md

from "mohitagw15856/pm-claude-skills"

Optimize an article for Answer Engine Optimization (AEO) — restructuring content so AI engines like ChatGPT, Perplexity, and Claude can extract, quote, and cite it. Rewrites headings as questions, drops 50-80 word answer capsules, audits paragraph length, and flags trust signals. Use when asked to AEO-optimize, make content AI-readable, improve AI citation chances, or adapt an article for answer engines.

2026-05-27915

claude-superpowers.md

from "mohitagw15856/pm-claude-skills"

Force Claude Code to work like a senior developer: plan before coding, work in isolation, write tests first, and review its own output twice before presenting it. Use when asked to enable superpowers mode, activate the superpowers framework, or turn on superpowers for this session. Installs a 4-stage framework — Plan, Isolate, Test First, Double Review — that prevents Claude from sprinting into broken code.

2026-05-27915

context-mode.md

from "mohitagw15856/pm-claude-skills"

Activate output filtering, session logging, and auto-resume to keep Claude Code sessions running for hours without context bloat or memory loss. Use when asked to enable context mode, turn on long session mode, or activate session persistence. Installs a session log at project root, filters verbose command output before it enters context, and auto-resumes after Claude resets.

2026-05-27915

email-triage.md

from "mohitagw15856/pm-claude-skills"

Reads your Gmail inbox for a configurable window (default: last 8 hours) and surfaces only what needs action — replies, decisions, or follow-up. Filters out receipts, notifications, newsletters, and anything that doesn't need you.

2026-05-27915

morning-intelligence.md

from "mohitagw15856/pm-claude-skills"

Run a 15-question interview to capture your role, topics, sources, exclusions, and format preferences — then write a master prompt you can drop into a scheduled task or Claude Code Routine to get a personalised news brief every morning. Use when asked to set up a morning intelligence brief, build a morning news prompt, or create a personalised news briefing.

2026-05-27915

notebooklm-connector.md

from "mohitagw15856/pm-claude-skills"

Automates NotebookLM from Claude Code — creates notebooks, adds sources, and triggers outputs (mindmaps, audio overviews, slide decks) without manual clicking via the Claude Chrome extension.

2026-05-27915

package.json

"author": "mohitagw15856"

"repository": "mohitagw15856/pm-claude-skills"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas15-1252L4

name

thumbnail-creator

description

Generate article or newsletter thumbnail candidates using the Gemini API from inside Claude Code. Claude reads article copy, proposes composition concepts, writes image generation prompts incorporating brand specs, calls Gemini to generate the images, evaluates the results via computer vision, and returns ranked candidates with rationale. Use when asked to create thumbnails, generate cover images, or produce visual candidates for an article or newsletter.

Thumbnail Creator Skill (via Gemini)

Generates article and newsletter thumbnail candidates by acting as an image-generation agent inside Claude Code. Instead of switching between tools and prompting Gemini's web UI one image at a time, this skill makes Claude do the full loop: read the copy, propose compositions, write tailored prompts, call the Gemini API, evaluate the outputs, and return ranked results with brief rationale.

The output is production-ready thumbnail candidates you can drop directly into your CMS, newsletter tool, or social scheduler.

Prerequisites

Both of these must be in place before the skill can generate images:

1. Gemini API Key

Get a free key from Google AI Studio.

Set it as an environment variable:

export GEMINI_API_KEY="your-key-here"

To persist it across sessions, add to your shell profile (~/.zshrc or ~/.bashrc):

echo 'export GEMINI_API_KEY="your-key-here"' >> ~/.zshrc
source ~/.zshrc

Verify it is set:

echo $GEMINI_API_KEY

2. generate_image.py Script

This script must exist at ./generate_image.py in the project root. The full template is provided in the Script Template section below. Claude will check for it and offer to create it if missing.

Python dependencies:

pip install google-generativeai Pillow requests

Or with uv:

uv pip install google-generativeai Pillow requests

Required Inputs

Claude will ask for these if not provided:

Input	Required	Notes
Article copy or URL	Yes	Paste the full article text, or provide a URL to fetch. Used to extract themes, hooks, and key claims for composition.
Brand colours	Recommended	Hex codes or descriptive names. E.g. `#1A1A2E` (navy), `#E94560` (coral). If not provided, Claude uses clean neutral defaults.
Fonts / type style	Recommended	E.g. "bold sans-serif", "editorial serif", "Neue Haas Grotesk". Used in prompt to guide text treatment.
Style reference description	Recommended	E.g. "flat illustration, minimal, like Stripe's marketing site" or "photorealistic, dark background, high contrast". A style image URL can also be provided.
Output dimensions	No	Defaults to `1792x1024` (landscape, standard article thumbnail). Options: `1024x1024` (square), `1024x1792` (portrait/mobile).
Number of candidates	No	Defaults to 4. Min 1, max 8 (API limits and cost).
Article title (if different from H1)	No	Used as the primary text element in image prompts.
Candidate selection	No	After proposing compositions, Claude asks which to generate. User can say "all" or pick by number.

Output Structure

Phase 1 — Composition Proposals (text, before any API calls)

Claude presents 3-4 composition concepts for user approval. Format:

Composition Concepts for: "[Article Title]"

1. BOLD CLAIM
   Layout:    Full-bleed dark background, large white headline centred, 
              single accent data point (e.g. "3x faster") in brand colour below
   Mood:      High authority, newsletter-style
   Best for:  LinkedIn, Substack headers
   Rationale: The article's central claim ("X outperforms Y by 3x") is specific 
              enough to anchor the visual — readers stop on data.

2. CONCEPTUAL OBJECT
   Layout:    Central object illustration (e.g. a broken clock for a time-waste article), 
              title in upper third, minimal texture background
   Mood:      Editorial, Medium-style
   Best for:  Blog header, Medium cover, email preheader
   Rationale: Gives art directors visual metaphor flexibility; works across sizes.

3. CONTRAST SPLIT
   Layout:    Left half brand colour, right half white or image, 
              title on colour side, supporting subtext on white side
   Mood:      Clean, professional, startup-brand feel
   Best for:  Newsletter, LinkedIn carousel first slide
   Rationale: Split layout performs consistently in newsletter A/B tests; 
              text is readable at small sizes.

4. TYPOGRAPHIC ONLY
   Layout:    No illustration, oversized title treatment, 
              author name in small caps at bottom, thin rule separator
   Mood:      Premium, confident, editorial
   Best for:  Substack, Ghost, high-density email lists
   Rationale: Works when the brand has strong type identity. Fastest to produce.

Which compositions do you want generated? (Reply with numbers, e.g. "1, 3" or "all")

Phase 2 — Generated Image Files

After generation, Claude saves files to ./thumbnails/[article-slug]/:

thumbnails/
└── article-slug-from-title/
    ├── candidate_01_bold_claim.png
    ├── candidate_02_conceptual_object.png
    ├── candidate_03_contrast_split.png
    ├── candidate_04_typographic.png
    └── evaluation_report.md

Phase 3 — Evaluation Summary Table

Claude evaluates each returned image via computer vision and produces:

Thumbnail Evaluation — "[Article Title]"
Generated: 2026-05-27  |  Model: Gemini Imagen  |  Dimensions: 1792x1024

| # | Candidate | Composition | Brand Fit /10 | Text Legibility /10 | Recommendation |
|---|---|---|---|---|---|
| 1 | candidate_01_bold_claim.png | Bold Claim | 9 | 8 | ★ Top pick — strong data anchor, brand colours correct, title readable at 200px width |
| 2 | candidate_02_conceptual_object.png | Conceptual Object | 7 | 9 | Good fallback — legible, clean, but illustration style drifted slightly from brand |
| 3 | candidate_03_contrast_split.png | Contrast Split | 8 | 7 | Works well at full size; test at thumbnail size before publishing — right side text tightens |
| 4 | candidate_04_typographic.png | Typographic | 9 | 10 | Strongest for email — zero brand drift risk, completely text-based |

Recommended for web:          candidate_01_bold_claim.png
Recommended for email/mobile: candidate_04_typographic.png
Recommended for social:       candidate_03_contrast_split.png

Files saved to: ./thumbnails/article-slug-from-title/

How Claude Should Execute This Skill

Step 1 — Ingest and analyse the article

Accept article copy as pasted text or a URL.

If a URL is provided, fetch the page and extract:

The H1 title
The first 3-5 paragraphs (the hook, central claim, and key points)
Any notable statistics or named frameworks mentioned
The author name (for typographic compositions)

If text is pasted, read it directly. Focus on:

The hook: What is the opening claim or tension?
The central thesis: What is the one thing the article argues or teaches?
Key specifics: Any numbers, named frameworks, or concrete examples that could anchor a visual
Tone: Is this formal/authoritative, conversational/accessible, provocative/challenge-based?

Summarise these findings internally before proposing compositions — the proposals should feel tailored to this specific article, not generic.

Step 2 — Collect brand specs

Ask the user for brand specs if not provided:

To generate on-brand thumbnails, I need a few details:

1. Brand colours (hex codes or descriptions) — e.g. #1A1A2E, #E94560
2. Font style preference — e.g. "bold sans-serif", "editorial serif", "geometric"
3. Visual style — e.g. "flat minimal", "photorealistic", "illustrated", "typographic only"
4. Any style references — describe a brand or publication whose aesthetic you want to match, 
   or share an image URL

If you don't have brand specs yet, say "use clean defaults" and I'll use a professional 
dark-on-white editorial style.

If the user says "use clean defaults", apply:

Background: #FFFFFF or #0F0F0F (dark mode default)
Accent: #2563EB (blue)
Font style: bold geometric sans-serif
Style: minimal flat, no textures, high contrast

Step 3 — Propose composition concepts

Write 3-4 composition concepts tailored to the article's tone and content. Each concept must:

Have a name (short, memorable label)
Describe the layout precisely (where title goes, what visual element anchors it, background treatment)
Note the mood and the use case it's best suited for
Include a rationale sentence explaining why this composition fits this specific article

After presenting the concepts, ask which to generate. Wait for user confirmation before making any API calls.

Step 4 — Write Gemini image generation prompts

For each selected composition, write a detailed image generation prompt. Image generation prompts follow a different grammar than text prompts — they are descriptive, not instructional.

Prompt structure:

[Subject/composition] + [Style] + [Colour palette] + [Mood/lighting] + 
[Text treatment if any] + [What to avoid]

Example prompt for Bold Claim composition:

Article thumbnail image. Large bold white sans-serif headline text reading "3x Faster Than 
Traditional Methods" centred on a deep navy blue background (#1A1A2E). Small coral accent 
text (#E94560) below reading the subtitle. Minimal flat design, no gradients, no stock photo 
elements, no people. Clean professional editorial style, high contrast, newsletter header 
format, 16:9 landscape orientation. The composition is typographic — text is the hero, 
no illustration required. Avoid: clip art, drop shadows, low contrast, crowded layout.

Prompt rules:

Include exact hex colours when brand colours are provided
Specify the exact headline text to appear in the image
Name the style explicitly ("flat design", "editorial", "photorealistic") — Gemini responds well to style category names
Add a negative prompt ("Avoid: ...") at the end to reduce drift from brand style
Keep prompts under 300 words — longer prompts do not reliably produce better outputs

Step 5 — Check prerequisites and run the generation script

Before calling the API, verify:

# Check API key is set
echo $GEMINI_API_KEY

# Check script exists
ls -la ./generate_image.py

# Check dependencies
python3 -c "import google.generativeai, PIL, requests; print('Dependencies OK')"

If the script is missing, offer to create it using the template in the Script Template section below.

Run the generation script for each prompt:

python3 generate_image.py \
  --prompt "your full prompt here" \
  --output "./thumbnails/article-slug/candidate_01_bold_claim.png" \
  --width 1792 \
  --height 1024

Or pass all prompts in a batch config file:

python3 generate_image.py --config ./thumbnails/article-slug/prompts.json

Step 6 — Evaluate generated images

After each image is saved, examine it using computer vision. Evaluate on two dimensions:

Brand Fit (score /10):

Are the brand colours correct? (1-2 points each)
Does the style match the requested aesthetic? (2 points)
Is the layout consistent with the composition brief? (2 points)
Are there any AI artefacts, distorted text, or unintended elements? (-1 per issue)

Text Legibility (score /10):

Is the headline text readable at full resolution? (3 points)
Is the headline text readable when the image is scaled to 300px wide (thumbnail size)? (3 points)
Is there sufficient contrast between text and background? (2 points)
Is the text placement within safe zones (not cut off at edges)? (2 points)

Note: Gemini Imagen sometimes renders text with spelling errors or distorted letterforms. If this happens, note it in the evaluation and suggest the user add the text overlay manually in Canva or Figma.

Step 7 — Produce the evaluation report

Write the evaluation summary table (format shown in Output Structure section) and save it as evaluation_report.md in the output folder.

Include:

One-line rationale for each score
A top pick recommendation per use case (web, email/mobile, social)
Any production notes (e.g. "text rendering is imperfect on candidate_02 — overlay text manually")
The full prompts used, so the user can iterate directly in AI Studio if needed

Step 8 — Offer iteration

After delivering the candidates, offer one iteration pass:

Want me to iterate on any of these?

Options:
- Adjust colours or style on a specific candidate
- Try a different composition concept
- Change the headline text
- Rerun with different Gemini parameters (different temperature/seed)
- Generate additional variants of the top pick

Just tell me what to change.

Script Template

Claude should offer to write this file if generate_image.py is not present. This is the canonical template to use.

#!/usr/bin/env python3
"""
generate_image.py — Gemini Imagen wrapper for Thumbnail Creator skill.

Usage:
    python3 generate_image.py --prompt "..." --output "./out.png" [--width 1792] [--height 1024]
    python3 generate_image.py --config ./prompts.json

Config JSON format:
    [
      {
        "prompt": "...",
        "output": "./thumbnails/slug/candidate_01.png",
        "width": 1792,
        "height": 1024
      }
    ]

Requirements:
    pip install google-generativeai Pillow
"""

import os
import sys
import json
import argparse
import base64
from pathlib import Path

try:
    import google.generativeai as genai
    from google.generativeai import types as genai_types
except ImportError:
    print("ERROR: google-generativeai not installed. Run: pip install google-generativeai")
    sys.exit(1)

try:
    from PIL import Image
    import io
except ImportError:
    print("ERROR: Pillow not installed. Run: pip install Pillow")
    sys.exit(1)


def get_api_key() -> str:
    key = os.environ.get("GEMINI_API_KEY", "")
    if not key:
        print("ERROR: GEMINI_API_KEY environment variable is not set.")
        print("Get a key at: https://aistudio.google.com/app/apikey")
        print("Then run: export GEMINI_API_KEY='your-key-here'")
        sys.exit(1)
    return key


def generate_image(
    prompt: str,
    output_path: str,
    width: int = 1792,
    height: int = 1024,
) -> bool:
    """
    Call Gemini Imagen to generate a single image and save it to output_path.
    Returns True on success, False on failure.
    """
    api_key = get_api_key()
    genai.configure(api_key=api_key)

    # Determine aspect ratio from dimensions
    ratio = width / height
    if abs(ratio - 16/9) < 0.1:
        aspect_ratio = "16:9"
    elif abs(ratio - 1.0) < 0.1:
        aspect_ratio = "1:1"
    elif abs(ratio - 9/16) < 0.1:
        aspect_ratio = "9:16"
    else:
        aspect_ratio = "16:9"  # default fallback

    try:
        imagen_model = genai.ImageGenerationModel("imagen-3.0-generate-002")

        result = imagen_model.generate_images(
            prompt=prompt,
            number_of_images=1,
            aspect_ratio=aspect_ratio,
            safety_filter_level="block_only_high",
            person_generation="allow_adult",
        )

        if not result.images:
            print(f"  No images returned for: {output_path}")
            return False

        image_data = result.images[0]

        # Ensure output directory exists
        Path(output_path).parent.mkdir(parents=True, exist_ok=True)

        # Save the image
        if hasattr(image_data, '_image_bytes'):
            img_bytes = image_data._image_bytes
        elif hasattr(image_data, 'image'):
            img_bytes = image_data.image
        else:
            # Fallback: try to access raw data
            img_bytes = bytes(image_data)

        img = Image.open(io.BytesIO(img_bytes))

        # Resize to exact dimensions if needed
        if img.size != (width, height):
            img = img.resize((width, height), Image.LANCZOS)

        img.save(output_path, format="PNG", optimize=True)
        print(f"  Saved: {output_path} ({img.size[0]}x{img.size[1]})")
        return True

    except Exception as e:
        print(f"  ERROR generating image: {e}")
        return False


def run_from_args():
    parser = argparse.ArgumentParser(description="Gemini Imagen wrapper for thumbnail generation")
    parser.add_argument("--prompt", type=str, help="Image generation prompt")
    parser.add_argument("--output", type=str, help="Output file path (.png)")
    parser.add_argument("--width", type=int, default=1792, help="Image width in pixels")
    parser.add_argument("--height", type=int, default=1024, help="Image height in pixels")
    parser.add_argument("--config", type=str, help="JSON config file with batch of prompts")
    args = parser.parse_args()

    if args.config:
        # Batch mode
        with open(args.config, "r") as f:
            items = json.load(f)
        print(f"Batch mode: {len(items)} image(s) to generate")
        results = []
        for i, item in enumerate(items, start=1):
            print(f"\n[{i}/{len(items)}] Generating: {item['output']}")
            ok = generate_image(
                prompt=item["prompt"],
                output_path=item["output"],
                width=item.get("width", 1792),
                height=item.get("height", 1024),
            )
            results.append({"output": item["output"], "ok": ok})

        print(f"\nBatch complete: {sum(r['ok'] for r in results)}/{len(results)} succeeded")
        for r in results:
            status = "OK " if r["ok"] else "ERR"
            print(f"  {status}  {r['output']}")

    elif args.prompt and args.output:
        # Single image mode
        print(f"Generating: {args.output}")
        ok = generate_image(
            prompt=args.prompt,
            output_path=args.output,
            width=args.width,
            height=args.height,
        )
        if ok:
            print("Done.")
        else:
            print("Failed.")
            sys.exit(1)

    else:
        parser.print_help()
        sys.exit(1)


if __name__ == "__main__":
    run_from_args()

To create this file from inside Claude Code:

# Claude will write this file if it doesn't exist:
ls ./generate_image.py || echo "Script missing — Claude will create it"

Prompt Writing Reference

Claude should use this reference when writing image generation prompts. These patterns produce the most consistent results with Gemini Imagen.

Composition patterns

Composition type	Prompt anchor phrase
Text-led, dark background	"Bold white sans-serif headline text on deep [colour] background, minimal flat design"
Text-led, light background	"High-contrast black headline text on clean white background, editorial layout"
Object/illustration centred	"Centred [object] illustration, [style], [colour] background, title text in upper third"
Split layout	"Vertical split: left half [colour], right half white. Headline on left side, supporting text on right"
Photography style	"Photorealistic [scene description], [mood] lighting, [colour] colour grade, text overlay area at [position]"

Style modifiers that work well with Gemini

flat design, no gradients — clean vector-style outputs
editorial magazine style — sophisticated, typographic
minimal, lots of whitespace — reduces visual noise
high contrast, bold typography — strong thumbnail legibility
Bauhaus-inspired — geometric, structured
dark mode aesthetic — dark backgrounds with light text
startup marketing style — clean, optimistic, sans-serif

Negative prompts (always include)

Append to every prompt:

Avoid: stock photography clichés, clipart, excessive gradients, drop shadows, 
cluttered layout, lens flares, watermarks, low contrast text, AI artefacts.

Text rendering note

Gemini Imagen sometimes renders short text phrases accurately and longer headlines poorly. If the article headline is longer than 6 words, consider splitting it in the prompt:

Primary headline: "[First 4-5 words]"
Secondary text:   "[Remaining words]"

Or instruct the user to add text overlay manually in Canva after generation if legibility is critical.

Troubleshooting

Issue	Cause	Fix
`GEMINI_API_KEY not set`	Environment variable missing	Run `export GEMINI_API_KEY="your-key"` and retry
`ModuleNotFoundError: google.generativeai`	Dependency missing	Run `pip install google-generativeai`
`No images returned`	Safety filter triggered	Revise prompt to remove any ambiguous language; check that the prompt doesn't describe faces, violence, or brand logos
Generated image has garbled text	Imagen text rendering limitation	Use shorter headline in prompt, or plan to add text overlay in Canva/Figma post-generation
Image is the wrong size	Aspect ratio mismatch	Confirm `--width` and `--height` args match one of the supported ratios (16:9, 1:1, 9:16)
`generate_image.py not found`	Script not created yet	Ask Claude to create it using the template above
API quota exceeded	Free tier limit	Wait or upgrade to Gemini API paid tier
Style drift from brand	Prompt not specific enough	Add exact hex codes and specific style descriptors; add stronger negative prompt

Quality Checks

Before marking the task complete, verify each item:

Example Trigger Phrases

"Create thumbnails for this article"
"Generate cover image candidates for my newsletter"
"Make me 4 thumbnail options for this post"
"Can you generate some thumbnail ideas using Gemini?"
"I need a featured image for this article — use my brand colours"
"Create a thumbnail for this piece using Gemini" [followed by article text or URL]
"Generate article cover images for these brand specs: [colours, style]"
"Make thumbnail candidates and rank them"
"I need newsletter header images — here's the copy"
"Generate and evaluate thumbnail options for this draft"
"Use Gemini to create cover image options"
"Thumbnail this article" [followed by article text]
"Create 3 thumbnail compositions and pick the best one"

Cost and Rate Limits

Gemini AI Studio free tier (as of early 2026):

Imagen 3: 10 images per day (free)
Rate limit: varies by region and account tier

Paid tier:

Imagen 3 pricing: approximately $0.03-0.05 per image (check current Google Cloud pricing)
For a typical session generating 4-8 candidates, total cost is under $0.40

Recommendation:

Use the free tier for exploration and iteration
Generate final production candidates on paid tier for higher daily limits
For newsletter teams generating thumbnails weekly, the paid tier is more practical

Originally created by Karen Spinner (Wondering About AI) — adapted and extended for this library.

thumbnail-creator

Más de este repositorio

Más de este repositorio

Thumbnail Creator Skill (via Gemini)

Prerequisites

1. Gemini API Key

2. generate_image.py Script

Required Inputs

Output Structure

Phase 1 — Composition Proposals (text, before any API calls)

Phase 2 — Generated Image Files

Phase 3 — Evaluation Summary Table

How Claude Should Execute This Skill

Step 1 — Ingest and analyse the article

Step 2 — Collect brand specs

Step 3 — Propose composition concepts

Step 4 — Write Gemini image generation prompts

Step 5 — Check prerequisites and run the generation script

Step 6 — Evaluate generated images

Step 7 — Produce the evaluation report

Step 8 — Offer iteration

Script Template

Prompt Writing Reference

Composition patterns

Style modifiers that work well with Gemini

Negative prompts (always include)

Text rendering note

Troubleshooting

Quality Checks

Example Trigger Phrases

Cost and Rate Limits

Thumbnail Creator Skill (via Gemini)

Prerequisites

1. Gemini API Key

2. generate_image.py Script

Required Inputs

Output Structure

Phase 1 — Composition Proposals (text, before any API calls)

Phase 2 — Generated Image Files

Phase 3 — Evaluation Summary Table

How Claude Should Execute This Skill

Step 1 — Ingest and analyse the article

Step 2 — Collect brand specs

Step 3 — Propose composition concepts

Step 4 — Write Gemini image generation prompts

Step 5 — Check prerequisites and run the generation script

Step 6 — Evaluate generated images

Step 7 — Produce the evaluation report

Step 8 — Offer iteration

Script Template

Prompt Writing Reference

Composition patterns

Style modifiers that work well with Gemini

Negative prompts (always include)

Text rendering note

Troubleshooting

Quality Checks

Example Trigger Phrases

Cost and Rate Limits