Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

image-gen

Name: Image Gen
Author: peterkrueck

// Generate character art and image variations using AI image generation (Google Gemini) with reference images for style and character consistency. Use this skill when the user asks to generate new character poses, mascot variations, art assets, illustrations, or any AI-generated images — especially when maintaining consistency with an existing character or style.

Ejecutar en Manus

$ git log --oneline --stat

stars:1354

forks:154

updated:28 de marzo de 2026, 13:14

Explorador de archivos

2 archivos

SKILL.md

readonly

name	image-gen
description	Generate character art and image variations using AI image generation (Google Gemini) with reference images for style and character consistency. Use this skill when the user asks to generate new character poses, mascot variations, art assets, illustrations, or any AI-generated images — especially when maintaining consistency with an existing character or style.
user_invocable	true

AI Image Generation (Gemini)

Generate image variations using Google's Gemini image generation model with reference images for style and character consistency. The model supports up to 14 reference images per request and can maintain consistency across multiple characters.

Prerequisites

GEMINI_API_KEY environment variable must be set
- Get a key at https://aistudio.google.com/apikey
- The key needs billing enabled for image generation (~$0.067/image at 1K resolution)
Deno runtime installed (for the generation script)

Workflow

Step 1 — Understand what the user wants

Clarify the subject, pose, expression, context, and where the asset will be used (app screen, social media, website, etc.). This context helps craft the right prompt and choose the right aspect ratio.

Step 2 — Select reference images

Always use 1-2 reference images for consistency:

Primary reference (always first): The most canonical image of the character/subject. This anchors identity — face shape, color palette, defining features.
Style/pose reference (second, optional): Pick the closest existing approved asset to the target pose. This anchors proportions and art style.

The primary reference anchors identity; the style reference anchors proportions. Both together produce the most consistent results.

Step 3 — Craft the prompt

Write a detailed prompt that describes the exact pose, expression, and style:

Character/subject description — physical traits that define the character (so the model doesn't drift)
Pose and expression — what the character is doing
Style directives — art style, line style, shading approach
Background — color, scene, or transparent
Framing — full body, bust, three-quarter view, etc.

Prompt template:

[CHARACTER_DESCRIPTION]. [POSE_AND_EXPRESSION]. [STYLE_DIRECTIVES]. [BACKGROUND]. [VIEW/FRAMING].

Tips:

Be specific about what each hand/arm is doing — vague descriptions lead to random poses
Always specify the background explicitly
Include style keywords consistently (e.g., "flat color fills", "3D render", "watercolor")

Step 4 — Generate variations

Run the bundled generation script:

deno run --allow-env --allow-read --allow-write --allow-net \
  .claude/skills/image-gen/scripts/generate.ts \
  --prompt "your prompt here" \
  --ref path/to/primary-reference.png \
  --ref path/to/style-reference.png \
  --output-dir /tmp/image-gen \
  --variants 4 \
  --aspect "<choose based on use case>" \
  --size "2K"

Parameters:

Flag	Default	Options
`--variants`	4	1-8 (each is a separate API call)
`--aspect`	1:1	1:1, 3:4, 4:3, 9:16, 16:9, 2:3, 3:2
`--size`	1K	512, 1K, 2K, 4K

Always default to 2K for size — higher resolution gives better quality and can always be downscaled.

Choose aspect ratio based on use case:

Use Case	Aspect Ratio
Full-body character poses	`3:4`
App icons, avatars, social profiles	`1:1`
Mobile screens, in-app cards	`9:16` or `3:4`
Banner/header images, OG images	`16:9` or `4:3`
Bust/upper-body portraits	`1:1` or `4:3`

Cost: ~$0.10/image at 2K = ~$0.40 for 4 variants.

Step 5 — Pick the best variant

Use the Read tool to visually inspect all generated images. Score each on:

Consistency (most important):

Does it match the reference images — face, proportions, colors, style?
Is the art style consistent (not drifting to photorealistic, 3D, etc.)?

Quality (tiebreaker):

Does the image have personality and visual appeal?
Would this work well as a production asset?

Pick the single best variant and copy it to the project's assets directory with a descriptive name. Briefly explain why you picked it.

If none are good enough, explain what went wrong and offer to regenerate with prompt adjustments.

Step 6 — Post-process

After picking the best variant:

Copy the chosen file to the appropriate assets directory
Clean up: delete the rejected variants and the temp output directory
Use the image-edit skill if the user needs a different crop or size

Rate Limits

If some variants fail with 429 errors: wait 60 seconds, then rerun with only the missing number of variants. Don't retry all — just fill in the gaps.

If all fail with 429: wait 60 seconds and try again. If it keeps failing, the daily quota may be exhausted — try later or enable billing for higher limits.

Troubleshooting

"GEMINI_API_KEY not set" — Get a key at https://aistudio.google.com/apikey
"Billing not enabled" or 403 — Enable billing in Google AI Studio for image generation
429 rate limit — Wait 60 seconds and retry
Character looks wrong — Be more specific about physical traits, ensure both reference images are included
Style drifted — Reinforce style keywords more strongly in the prompt
Pose is wrong — Be extremely specific about what each arm/hand is doing

related-skills.json

mismo repositorio

deploy.md

from "peterkrueck/Claude-Code-Development-Kit"

Test and deploy changes safely. Runs tests as a pre-deploy gate, deploys, then runs post-deploy verification. This is a TEMPLATE — customize the commands and checks for your specific deployment pipeline.

2026-05-181.4k

second-opinion.md

from "peterkrueck/Claude-Code-Development-Kit"

Get a second opinion from Google's Gemini Pro via the locally installed Gemini CLI (defaults to gemini-3.1-pro-preview; override with the CLAUDE_SECOND_OPINION_MODEL env var). Use this skill when in Plan Mode for large or critical tasks, when stuck on a debugging dead end, when facing architecture trade-offs, for subtle edge cases in code review, or any situation where an independent perspective would add value. Also use when the user explicitly asks for a "second opinion", "ask Gemini", "another perspective", or "cross-check this". Reports unavailability rather than falling back to a weaker model.

2026-05-181.4k

update-docs.md

from "peterkrueck/Claude-Code-Development-Kit"

Update project documentation after code changes. Maintains the 4 core ai-context files (spec, project-structure, progress, deployment-infrastructure) and CLAUDE.md. Use after completing features, refactors, or any changes that affect project structure, capabilities, or status. Also creates initial documentation if files don't exist yet.

2026-04-071.4k

review-work.md

from "peterkrueck/Claude-Code-Development-Kit"

Review uncommitted code changes using parallel Claude sub-agents with specialized roles (Bug Hunter, Rules Auditor). Spawns multiple focused reviewers for large diffs (50+ lines), single reviewer for smaller changes. Checks for bugs, security issues, CLAUDE.md compliance, and test coverage gaps. Use after completing substantial implementation work, or when the Stop hook requests it. Also invocable manually with /review-work.

2026-04-061.4k

bg-remove.md

from "peterkrueck/Claude-Code-Development-Kit"

Remove backgrounds from images using local AI (rembg). Use when removing backgrounds from character art, mascot images, photos, or any image that needs a transparent background.

2026-03-281.4k

image-edit.md

from "peterkrueck/Claude-Code-Development-Kit"

Edit images with precision — crop, resize, mirror, rotate, trim, and reframe. Use this skill whenever the user asks to crop, resize, trim, mirror, flip, rotate, reframe, or otherwise manipulate an image. Also use for creating square crops, portraits/headshots from full-body images, icon sizes, or any image transformation. Even if the request sounds simple, this skill prevents common pitfalls and ensures correct results on the first try.

2026-03-281.4k

package.json

"author": "peterkrueck"

"repository": "peterkrueck/Claude-Code-Development-Kit"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Artistas de efectos especiales y animadoresArtes, diseño, entretenimiento, deportes y medios27-1014L4

name	image-gen
description	Generate character art and image variations using AI image generation (Google Gemini) with reference images for style and character consistency. Use this skill when the user asks to generate new character poses, mascot variations, art assets, illustrations, or any AI-generated images — especially when maintaining consistency with an existing character or style.
user_invocable	true

AI Image Generation (Gemini)

Prerequisites

GEMINI_API_KEY environment variable must be set
- Get a key at https://aistudio.google.com/apikey
- The key needs billing enabled for image generation (~$0.067/image at 1K resolution)
Deno runtime installed (for the generation script)

Workflow

Step 1 — Understand what the user wants

Step 2 — Select reference images

Always use 1-2 reference images for consistency:

Primary reference (always first): The most canonical image of the character/subject. This anchors identity — face shape, color palette, defining features.
Style/pose reference (second, optional): Pick the closest existing approved asset to the target pose. This anchors proportions and art style.

The primary reference anchors identity; the style reference anchors proportions. Both together produce the most consistent results.

Step 3 — Craft the prompt

Write a detailed prompt that describes the exact pose, expression, and style:

Character/subject description — physical traits that define the character (so the model doesn't drift)
Pose and expression — what the character is doing
Style directives — art style, line style, shading approach
Background — color, scene, or transparent
Framing — full body, bust, three-quarter view, etc.

Prompt template:

[CHARACTER_DESCRIPTION]. [POSE_AND_EXPRESSION]. [STYLE_DIRECTIVES]. [BACKGROUND]. [VIEW/FRAMING].

Tips:

Be specific about what each hand/arm is doing — vague descriptions lead to random poses
Always specify the background explicitly
Include style keywords consistently (e.g., "flat color fills", "3D render", "watercolor")

Step 4 — Generate variations

Run the bundled generation script:

deno run --allow-env --allow-read --allow-write --allow-net \
  .claude/skills/image-gen/scripts/generate.ts \
  --prompt "your prompt here" \
  --ref path/to/primary-reference.png \
  --ref path/to/style-reference.png \
  --output-dir /tmp/image-gen \
  --variants 4 \
  --aspect "<choose based on use case>" \
  --size "2K"

Parameters:

Flag	Default	Options
`--variants`	4	1-8 (each is a separate API call)
`--aspect`	1:1	1:1, 3:4, 4:3, 9:16, 16:9, 2:3, 3:2
`--size`	1K	512, 1K, 2K, 4K

Always default to 2K for size — higher resolution gives better quality and can always be downscaled.

Choose aspect ratio based on use case:

Use Case	Aspect Ratio
Full-body character poses	`3:4`
App icons, avatars, social profiles	`1:1`
Mobile screens, in-app cards	`9:16` or `3:4`
Banner/header images, OG images	`16:9` or `4:3`
Bust/upper-body portraits	`1:1` or `4:3`

Cost: ~$0.10/image at 2K = ~$0.40 for 4 variants.

Step 5 — Pick the best variant

Use the Read tool to visually inspect all generated images. Score each on:

Consistency (most important):

Does it match the reference images — face, proportions, colors, style?
Is the art style consistent (not drifting to photorealistic, 3D, etc.)?

Quality (tiebreaker):

Does the image have personality and visual appeal?
Would this work well as a production asset?

Pick the single best variant and copy it to the project's assets directory with a descriptive name. Briefly explain why you picked it.

If none are good enough, explain what went wrong and offer to regenerate with prompt adjustments.

Step 6 — Post-process

After picking the best variant:

Copy the chosen file to the appropriate assets directory
Clean up: delete the rejected variants and the temp output directory
Use the image-edit skill if the user needs a different crop or size

Rate Limits

If some variants fail with 429 errors: wait 60 seconds, then rerun with only the missing number of variants. Don't retry all — just fill in the gaps.

If all fail with 429: wait 60 seconds and try again. If it keeps failing, the daily quota may be exhausted — try later or enable billing for higher limits.

Troubleshooting

"GEMINI_API_KEY not set" — Get a key at https://aistudio.google.com/apikey
"Billing not enabled" or 403 — Enable billing in Google AI Studio for image generation
429 rate limit — Wait 60 seconds and retry
Character looks wrong — Be more specific about physical traits, ensure both reference images are included
Style drifted — Reinforce style keywords more strongly in the prompt
Pose is wrong — Be extremely specific about what each arm/hand is doing

image-gen

AI Image Generation (Gemini)

Prerequisites

Workflow

Step 1 — Understand what the user wants

Step 2 — Select reference images

Step 3 — Craft the prompt

Step 4 — Generate variations

Step 5 — Pick the best variant

Step 6 — Post-process

Rate Limits

Troubleshooting

Más de este repositorio

AI Image Generation (Gemini)

Prerequisites

Workflow

Step 1 — Understand what the user wants

Step 2 — Select reference images

Step 3 — Craft the prompt

Step 4 — Generate variations

Step 5 — Pick the best variant

Step 6 — Post-process

Rate Limits

Troubleshooting

Más de este repositorio