Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

Loslegen

avatar-video

Sterne1

Forks0

Aktualisiert3. Februar 2026 um 14:16

Generate lip-synced avatar video from text using OmniHuman v1.5. Use when creating talking-head or avatar videos.

Installation

Mit Codex oder Claude installieren Kopieren Sie diesen Prompt, fügen Sie ihn in Codex, Claude oder einen anderen Assistant ein und lassen Sie die Skill-Seite prüfen und installieren.

In Manus ausführen

Quelle

az9713

az9713/whatsapp-claude

GitHub-Repository öffnen Creator-Repositorys ansehen

Download

In Manus ausführen

Verwandte BerufeSOC

Basierend auf der SOC-Berufsklassifikation

SoftwareentwicklerInformatik- und Mathematikberufe·SOC 15-1252

SKILL.md

readonly

name	avatar-video
description	Generate lip-synced avatar video from text using OmniHuman v1.5. Use when creating talking-head or avatar videos.
allowed-tools	["Bash","Read","Write"]

Avatar Video Generation

Generate lip-synced avatar videos from text using the OmniHuman v1.5 pipeline.

Prerequisites

Python 3.10+
OpenAI API key (for TTS)
fal.ai API key (for OmniHuman)
Avatar image (portrait, clear face)

Pipeline Overview

Text → TTS → Audio → CDN Upload → OmniHuman → Video → Post-Processing → Final Video

Step-by-Step Commands

1. Generate Voice-Over

python video-pipeline/voice_generator.py "Your script text here" output/voice.mp3

Options:

--voice nova (default: nova)
Available: alloy, echo, fable, onyx, nova, shimmer

2. Upload to CDN

python video-pipeline/storage_manager.py output/voice.mp3 assets/avatar.png

Returns:

Audio URL
Image URL

3. Generate Avatar Video

python video-pipeline/video_generator.py <audio_url> <image_url>

Returns:

Job ID
Video URL (when complete)

OmniHuman v1.5 settings:

Resolution: 720p
FPS: 25
Face enhancement: enabled

4. Download Generated Video

python video-pipeline/video_retriever.py <video_url> output/avatar.mp4

5. Post-Processing (Optional)

Add background music:

python video-pipeline/post_processor.py output/avatar.mp4 assets/music/bg.mp3 output/final.mp4

Options:

--music-volume 0.15 (15% volume, default)
--trim-start 0 (trim seconds from start)
--trim-end 0 (trim seconds from end)

Full Pipeline Example

# Step 1: Generate voice
python video-pipeline/voice_generator.py "Hello! Welcome to my channel. Today I'm going to show you something amazing." output/voice.mp3 --voice nova

# Step 2: Upload to CDN (capture URLs)
URLS=$(python video-pipeline/storage_manager.py output/voice.mp3 assets/avatar.png)
AUDIO_URL=$(echo "$URLS" | grep audio | cut -d' ' -f2)
IMAGE_URL=$(echo "$URLS" | grep image | cut -d' ' -f2)

# Step 3: Generate video
VIDEO_URL=$(python video-pipeline/video_generator.py "$AUDIO_URL" "$IMAGE_URL")

# Step 4: Download
python video-pipeline/video_retriever.py "$VIDEO_URL" output/avatar.mp4

# Step 5: Add music
python video-pipeline/post_processor.py output/avatar.mp4 assets/music/background.mp3 output/final.mp4

Avatar Image Guidelines

For best results:

Portrait orientation preferred
Clear, well-lit face
Neutral expression
Front-facing
Minimum 512x512 pixels
Supported formats: PNG, JPG

Output Quality

Setting	Value
Resolution	720p (1280x720)
FPS	25
Codec	H.264
Audio	AAC 128kbps

Cost Estimate

Component	Cost
OpenAI TTS (1000 chars)	~$0.015
fal.ai OmniHuman (per video)	~$0.10-0.50

Tips

Keep scripts under 2 minutes for best quality
Use clear, well-paced speech in script
Test with short clips first
Store avatar images in assets/ for reuse
Check fal.ai queue status for busy times

Mehr aus diesem Repository

gleiches Repository

gmail

az9713/whatsapp-claude

Send and read emails via Gmail browser automation. Use when asked to send email or check inbox.

2026-02-031

schedule-job

az9713/whatsapp-claude

Schedule tasks using natural language time expressions. Use when asked to schedule a recurring or timed task.

2026-02-031

tts

az9713/whatsapp-claude

Generate voice-over audio using OpenAI TTS. Use when creating narration or voice for videos.

2026-02-031

video-render

az9713/whatsapp-claude

Render videos using Remotion compositions. Use when creating or generating videos.

2026-02-031

video-research

az9713/whatsapp-claude

Research topics for video content creation. Use when researching ideas for videos.

2026-02-031

video-script

az9713/whatsapp-claude

Write video scripts with hooks, structure, and timing. Use when creating scripts for videos.

2026-02-031

name	avatar-video
description	Generate lip-synced avatar video from text using OmniHuman v1.5. Use when creating talking-head or avatar videos.
allowed-tools	["Bash","Read","Write"]

Avatar Video Generation

Generate lip-synced avatar videos from text using the OmniHuman v1.5 pipeline.

Prerequisites

Python 3.10+
OpenAI API key (for TTS)
fal.ai API key (for OmniHuman)
Avatar image (portrait, clear face)

Pipeline Overview

Text → TTS → Audio → CDN Upload → OmniHuman → Video → Post-Processing → Final Video

Step-by-Step Commands

1. Generate Voice-Over

python video-pipeline/voice_generator.py "Your script text here" output/voice.mp3

Options:

--voice nova (default: nova)
Available: alloy, echo, fable, onyx, nova, shimmer

2. Upload to CDN

python video-pipeline/storage_manager.py output/voice.mp3 assets/avatar.png

Returns:

Audio URL
Image URL

3. Generate Avatar Video

python video-pipeline/video_generator.py <audio_url> <image_url>

Returns:

Job ID
Video URL (when complete)

OmniHuman v1.5 settings:

Resolution: 720p
FPS: 25
Face enhancement: enabled

4. Download Generated Video

python video-pipeline/video_retriever.py <video_url> output/avatar.mp4

5. Post-Processing (Optional)

Add background music:

python video-pipeline/post_processor.py output/avatar.mp4 assets/music/bg.mp3 output/final.mp4

Options:

--music-volume 0.15 (15% volume, default)
--trim-start 0 (trim seconds from start)
--trim-end 0 (trim seconds from end)

Full Pipeline Example

# Step 1: Generate voice
python video-pipeline/voice_generator.py "Hello! Welcome to my channel. Today I'm going to show you something amazing." output/voice.mp3 --voice nova

# Step 2: Upload to CDN (capture URLs)
URLS=$(python video-pipeline/storage_manager.py output/voice.mp3 assets/avatar.png)
AUDIO_URL=$(echo "$URLS" | grep audio | cut -d' ' -f2)
IMAGE_URL=$(echo "$URLS" | grep image | cut -d' ' -f2)

# Step 3: Generate video
VIDEO_URL=$(python video-pipeline/video_generator.py "$AUDIO_URL" "$IMAGE_URL")

# Step 4: Download
python video-pipeline/video_retriever.py "$VIDEO_URL" output/avatar.mp4

# Step 5: Add music
python video-pipeline/post_processor.py output/avatar.mp4 assets/music/background.mp3 output/final.mp4

Avatar Image Guidelines

For best results:

Portrait orientation preferred
Clear, well-lit face
Neutral expression
Front-facing
Minimum 512x512 pixels
Supported formats: PNG, JPG

Output Quality

Setting	Value
Resolution	720p (1280x720)
FPS	25
Codec	H.264
Audio	AAC 128kbps

Cost Estimate

Component	Cost
OpenAI TTS (1000 chars)	~$0.015
fal.ai OmniHuman (per video)	~$0.10-0.50

Tips

Keep scripts under 2 minutes for best quality
Use clear, well-paced speech in script
Test with short clips first
Store avatar images in assets/ for reuse
Check fal.ai queue status for busy times