Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

mkt-full-video-with-11-hyperframe-heygen

End-to-end short-video pipeline — từ kịch bản (Việt/Anh) ra MP4 TikTok/Reels 9:16 hoàn chỉnh. Orchestrator 3 phase ghép 3 skill có sẵn — (1) `mkt-elevenlabs-tts-to-mp3` đọc script bằng voice của Hoàng, (2) checkpoint user duyệt MP3, (3) `heygen-mp3-to-mp4` lip-sync avatar HeyGen, (4) delegate Phase 3 packaging cho sub-agent `mkt-full-video-phase3-packager` (transcribe + scene outline + checkpoint + fan-out N scene writers parallel + scaffold + preview Studio). USE WHEN user nói "tạo full video từ script", "script to tiktok video", "pipeline full video heygen + hyperframe", "tạo video từ kịch bản đến mp4", "elevenlabs heygen hyperframe full pipeline", "kịch bản ra video tiktok", hoặc có sẵn 1 script + (optional) ảnh b-roll và muốn ra MP4 9:16 đóng gói có captions, SFX, b-roll.

Ejecutar en Manus

Resumen

Comando de instalación

npx skills add https://github.com/hoanghd218/hoang-ai-marketing --skill mkt-full-video-with-11-hyperframe-heygen

Copia y pega este comando en Claude Code para instalar la habilidad

Fuente

hoanghd218/hoang-ai-marketing

Estrellas1

Forks2

Actualizado3 de mayo de 2026, 22:10

Explorador de archivos

2 archivos

SKILL.md

readonly

Más de este repositorio

mismo repositorio

heygen-mp3-to-mp4

hoanghd218/hoang-ai-marketing

Convert a single MP3 voiceover file into a single HeyGen avatar lip-sync MP4 video. Single-purpose — no planning, no SRT, no chunking, no Remotion compositing. Uses HeyGen MCP tools exclusively (no direct REST API calls). Locks avatar look and voice ID to a fixed allowlist. USE WHEN user says "tạo video heygen từ mp3", "mp3 to heygen", "heygen mp4 từ audio", "convert mp3 sang heygen video", "tạo avatar video từ file mp3", "lip sync mp3 heygen", "biến mp3 thành video heygen", or any time the user has exactly one MP3 file and wants exactly one HeyGen avatar MP4 out.

2026-05-031

heygen-script-to-mp4

hoanghd218/hoang-ai-marketing

Convert a script (Vietnamese or English text) directly into a single HeyGen avatar video — HeyGen handles TTS using the locked ElevenLabs voice, then lip-syncs the avatar. Single-purpose — no MP3 generation, no SRT, no chunking, no Remotion. Uses HeyGen MCP tools exclusively (no direct REST API calls). Avatar look and voice ID are fixed allowlists. USE WHEN user says "tạo video heygen từ script", "script to heygen", "heygen mp4 từ text", "convert script sang heygen video", "tạo avatar video từ script", "heygen text to video", "biến script thành video heygen", or any time the user has script text (not an MP3) and wants exactly one HeyGen avatar MP4 out.

2026-05-031

mkt-elevenlabs-tts-to-mp3

hoanghd218/hoang-ai-marketing

Convert Vietnamese/English script text to MP3 voiceover using ElevenLabs TTS API. Calls POST /v1/text-to-speech/{voice_id}, streams audio bytes, writes MP3 directly. Locked to Hoang's brand voice ID by default. USE WHEN user says 'tạo mp3 elevenlabs', 'elevenlabs tts', 'eleven labs voice', 'text to speech elevenlabs', 'tạo voiceover elevenlabs', 'đọc text bằng elevenlabs', 'tts elevenlabs to mp3', 'eleven labs script to mp3', 'voiceover bằng elevenlabs', 'giọng elevenlabs'.

2026-05-031

mkt-kane-anti-pattern-auditor

hoanghd218/hoang-ai-marketing

Audit bài Facebook / Reels / YouTube để phát hiện 4 downward drivers (over-branding, over-production, stock imagery, standardized aesthetic) + frequency-over-quality. Mỗi anti-pattern bị drop ~75% performance nếu hiện diện. USE WHEN user says 'audit content', 'check anti pattern', 'content có lỗi gì không', 'kiểm tra over branding', 'vì sao video bị chết', 'vì sao post không ai xem', 'content audit', 'kiểm tra chất lượng content'.

2026-05-031

mkt-kane-cross-industry-viral-scout

hoanghd218/hoang-ai-marketing

Tìm viral pattern ở ngành khác (bác sĩ, luật sư, tài chính, bất động sản, thủ công) có thể apply cho niche AI/automation của Hoang. Ngách AI educator VN còn ít format được khai thác — cross-industry adaptation là blue ocean. USE WHEN user says 'tìm format ngành khác', 'cross industry research', 'học format từ ngành khác', 'blue ocean format', 'tìm format chưa ai làm', 'adapt format từ niche khác', 'cross industry viral'.

2026-05-031

mkt-kane-cta-non-autocratic-rewriter

hoanghd218/hoang-ai-marketing

Rewrite CTA kiểu autocratic (Mua ngay! Follow ngay! Đăng ký liền!) sang 3 variants Democratic / Benevolent / Laissez-faire — reach 85% dân số thay vì chỉ 5% (action-based). Áp dụng cho landing page, caption FB, CTA video, email. USE WHEN user says 'rewrite cta', 'sửa cta', 'cta không autocratic', 'viết lại kêu gọi hành động', 'cta cho landing page', 'cta nhẹ nhàng hơn', 'inclusive cta', 'cta democratic benevolent'.

2026-05-031

Fuente

hoanghd218

hoanghd218/hoang-ai-marketing

Abrir repositorio de GitHub Ver repositorios del creador

Comando de instalación

Descarga

Ejecutar en Manus

Útil paraSOC

Editores de cine y videoArtes, diseño, entretenimiento, deportes y medios27-4032L4

name

mkt-full-video-with-11-hyperframe-heygen

description

mkt-full-video-with-11-hyperframe-heygen

End-to-end orchestrator: script → final TikTok/Reels MP4 9:16.

Pipeline có 2 user checkpoints:

MP3 checkpoint (orchestrator) — sau Phase 1, user duyệt voiceover.
Scenes-outline checkpoint (Phase 3 sub-agent) — user duyệt scene structure + variant trước khi fan-out content writers. Skip được nếu user pass auto_scenes=true.

Khi nào dùng

User có script Việt/Anh ≤ 5000 ký tự muốn ra video TikTok hoàn chỉnh
Có sẵn (optional) ảnh hoặc video b-roll kèm mục đích sử dụng
Muốn đi 1 mạch từ kịch bản đến preview Studio

Không dùng skill này nếu:

User đã có MP3 sẵn → dùng thẳng heygen-mp3-to-mp4
User đã có MP4 talking-head sẵn → dùng thẳng mkt-hyperframe-talking-head-video
User cần HeyGen tự đọc text (không qua ElevenLabs) → dùng heygen-script-to-mp4
Script > 5000 ký tự → split semantic rồi gọi pipeline cho từng segment

Pipeline overview

Script (text + optional b-roll)
    │
    ▼
Phase 1 ── mkt-elevenlabs-tts-to-mp3 ───► voiceover.mp3
    │                                         │
    │                                         ▼
    │                              CHECKPOINT #1 — user nghe + duyệt MP3
    │                                         │
    │                                         ▼ (OK)
Phase 2 ── heygen-mp3-to-mp4 ──────────► source.mp4 (9:16 lip-sync)
    │
    ▼
Phase 3 ── spawn agent mkt-full-video-phase3-packager (isolated context)
            │
            ├─ transcribe + clean + group captions
            ├─ detect scene boundaries + classify mockup_variant
            ├─ CHECKPOINT #2 — user duyệt scenes outline
            ├─ FAN-OUT N general-purpose sub-agents (1 per scene) parallel
            │   → mỗi sub-agent build content JSON theo variant schema
            ├─ merge → scenes.json
            ├─ parallel: scaffold sub-comps + copy SFX + inject captions
            ├─ generate root index.html
            └─ lint + preview Studio
    │
    ▼
User duyệt preview → "render" → final MP4

Checkpoint rule: Orchestrator quản checkpoint #1 (MP3). Phase 2 → 3 chạy auto. Checkpoint #2 (scenes outline) do Phase 3 sub-agent quản. Render gate cuối ở Studio.

Inputs

Input	Required	Format / ví dụ
Script text	Yes	File path (`.txt`/`.md`) hoặc inline string. ≤ 5000 ký tự.
Slug project	No	Auto-derive từ 5 từ đầu của script. Lowercase, ASCII, dash.
B-roll list	No	Array `[{path: "...", purpose: "Bài học 1 — minh họa X"}, ...]`.
Voice settings override	No	`{stability, similarity_boost, style}` cho ElevenLabs.
Avatar look	No	1 trong 2 ID allowlist HeyGen (`ff800d7f...` / `66e75e22...`). Random nếu không chọn.
`auto_scenes`	No	Default `false`. `true` để skip scenes-outline checkpoint trong Phase 3.

Workspace layout

workspace/content/YYYY-MM-DD/<slug>/
├── script.txt              # Phase 0
├── voiceover.mp3           # Phase 1
├── source.mp4              # Phase 2
├── broll/                  # User-provided b-roll (copy)
├── transcript.json         # Phase 3
├── caption-groups.json     # Phase 3
├── scenes-outline.json     # Phase 3 — pre-checkpoint outline
├── scenes/                 # Phase 3 — fan-out per-scene content writes here
│   ├── scene-1.json
│   └── ...
├── scenes.json             # Phase 3 — merged final
├── compositions/           # Phase 3 — HF sub-comps
├── sfx/                    # Phase 3 — 6 SFX
└── index.html              # Phase 3 — root composition

YYYY-MM-DD = ngày hôm nay (UTC+7).

Workflow

Step 0 — Setup

Validate len(script_text) <= 5000. Vượt → stop, yêu cầu user split semantic.
Derive slug nếu thiếu: 5 từ đầu → lowercase → bỏ dấu → space→dash.
Tạo workspace/content/YYYY-MM-DD/<slug>/. Save script.txt.
Nếu user có b-roll: tạo <folder>/broll/, copy file giữ tên gốc.
Báo user: "Workspace tạo tại <folder>. Bắt đầu Phase 1 — ElevenLabs TTS."

Step 1 — Phase 1: Script → MP3 (ElevenLabs)

uv run .claude/skills/mkt-elevenlabs-tts-to-mp3/scripts/text_to_mp3.py \
  --file workspace/content/YYYY-MM-DD/<slug>/script.txt \
  -o workspace/content/YYYY-MM-DD/<slug>/voiceover.mp3

Voice settings overrides → thêm --stability / --similarity_boost / --style.

Sau khi xong, check duration:

uv run .claude/skills/heygen-mp3-to-mp4/scripts/check_duration.py \
  workspace/content/YYYY-MM-DD/<slug>/voiceover.mp3

TOO_LONG → stop, báo user MP3 > 5 phút HeyGen, yêu cầu rút script.

Step 2 — CHECKPOINT #1: user nghe MP3

Điểm dừng duy nhất do orchestrator quản. Báo user format đúng như sau:

## Voiceover ready — duyệt giúp mình

**File:** `workspace/content/YYYY-MM-DD/<slug>/voiceover.mp3`
**Duration:** <X.X>s
**Size:** <Y.Y> MB
**Voice:** ElevenLabs Brand Voice của Hoàng (`K7ewtjKRNtwwt3lKQ6M0`)

Mở file nghe thử. Reply 1 trong:
- **`OK`** / **`tiếp`** → mình chạy Phase 2 (HeyGen avatar lip-sync)
- **`regen`** + (optional) lý do → mình tweak voice settings và regen MP3
- **`sửa script`** + nội dung mới → mình save script mới và rerun Phase 1

Stop tool calls. Đợi user reply rõ ràng.

Khi user OK → Phase 2. Khi user regen → rerun text_to_mp3.py với settings mới, quay lại checkpoint. Khi user sửa script → overwrite script.txt, rerun Phase 1 từ đầu.

Step 3 — Phase 2: MP3 → HeyGen MP4 (auto)

Theo sub-skill heygen-mp3-to-mp4:

Pick avatar ID — random từ allowlist nếu user không chỉ định:

import random
AVATAR_LOOKS = ["ff800d7f76aa48f5a23eb6a742ed5365", "66e75e22e6584bbdaa56a19088286dc8"]
avatar_id = random.choice(AVATAR_LOOKS)

Báo user pick nào trước khi gọi MCP.

Upload MP3 lên HeyGen qua MCP (upload_asset, asset type audio). KHÔNG curl https://upload.heygen.com/....

Tạo avatar video qua MCP (generate_avatar_video):

character: { type: avatar, avatar_id: <picked>, scale: 1.0 }
voice:     { type: audio, audio_asset_id: <from upload> }
dimension: { width: 720, height: 1280 }   # 9:16
title:     "<slug>-<timestamp>"

Poll status mỗi ~10s → completed (cap 10 min). failed → stop, show error.

Download MP4 → workspace/content/YYYY-MM-DD/<slug>/source.mp4 (filename inviolable):

uv run .claude/skills/heygen-mp3-to-mp4/scripts/download_video.py \
  "<video_url>" "workspace/content/YYYY-MM-DD/<slug>/source.mp4"

Báo user 1 dòng: "Phase 2 done — <path> (avatar <id>, s, MB). Spawning Phase 3 packager…"

Không stop ở đây. Auto-flow sang Phase 3.

Step 4 — Phase 3: Spawn packager sub-agent

Use the Task tool with subagent_type: "mkt-full-video-phase3-packager". Pass a self-contained prompt with the inputs the agent needs. Example:

Workspace: workspace/content/2026-05-03/hom-nay-minh-chia-se/
Slug: hom-nay-minh-chia-se
Script: <full script text>
B-roll: [
  {"path": "workspace/content/2026-05-03/hom-nay-minh-chia-se/broll/alphabet.png", "purpose": "Bài học 1"},
  ...
]
auto_scenes: false
header_label: "3 BÀI HỌC AI"
footer_handle: "@tranvanhoang.com"

Run the full Phase 3 packaging pipeline per your agent definition. Return Studio URL when preview is open.

The sub-agent:

Transcribes source.mp4, builds scene outline
Stops to ask user to approve outline (CHECKPOINT #2) unless auto_scenes: true
Fans out N scene-content writers in parallel
Merges + scaffolds + lints + opens preview
Returns Studio URL

While the sub-agent is running, the orchestrator's only job is to relay user replies to the sub-agent (the user reply at the scenes-outline checkpoint will arrive in this conversation; route it to the sub-agent if it's still active, otherwise treat it as a follow-up instruction).

Step 5 — Hand off

When the sub-agent returns the Studio URL, format the final report:

## Full video pipeline DONE — preview ready

**Workspace:** `workspace/content/YYYY-MM-DD/<slug>/`

**Phase 1 (ElevenLabs):** voiceover.mp3 — <D1>s, <S1>MB
**Phase 2 (HeyGen):** source.mp4 — avatar `<avatar_id>`, <D2>s, <S2>MB
**Phase 3 (HyperFrames):** <N> scenes (<list variants>), <K> caption groups, 6 SFX

**Studio URL:** http://localhost:3002

Mở browser scrub timeline. Nói **`render`** khi OK → mình chạy `npx hyperframes render` xuất MP4 1080×1920 30fps.

Stop here. Không auto-render. User confirm rồi mới gọi npx hyperframes render.

Critical orchestration rules

2 user checkpoints, 1 orchestrator gate — Orchestrator chỉ stop ở MP3 (Step 2). Scenes-outline checkpoint do Phase 3 sub-agent quản. Render gate ở Studio do user.
Path conventions inviolable — voiceover phải là voiceover.mp3, talking-head phải là source.mp4. HF sub-skill expect tên source.mp4.
HeyGen MCP only — không bao giờ curl https://api.heygen.com/.... Hard constraint của heygen-mp3-to-mp4.
Voice ID lock — ElevenLabs default K7ewtjKRNtwwt3lKQ6M0 (Hoàng's brand voice). Override qua --voice_id nhưng pipeline báo rõ pick nào.
Script length hard cap 5000 ký tự — fail fast ở Step 0.1.
MP3 duration ≤ 300s — HeyGen single-video cap. Check ngay sau Phase 1.
Preview-first — Phase 3 KHÔNG auto-render. Memory feedback_hyperframes_workflow.md.
Phase 3 isolation — Phase 3 chạy trong sub-agent context riêng. HF skill body + 4 reference docs (~40KB) load vào sub-agent thay vì main orchestrator.

Failure modes & fallback

Symptom	Hành động
Script > 5000 ký tự	Stop, yêu cầu user split semantic
ElevenLabs API fail	Báo error, suggest check `ELEVENLABS_API_KEY` trong `.env`
MP3 > 300s sau Phase 1	Stop pipeline, suggest `heygen-short-video` (chunking)
HeyGen MCP not connected	Stop, báo `claude mcp list` để verify
HeyGen render failed	Show error, gợi ý check credits qua `mcp__heygen__get_current_user`
Phase 3 sub-agent fail	Đọc error trace, gợi ý user re-run Phase 3 standalone bằng `mkt-hyperframe-talking-head-video` skill
Scene writer returns malformed JSON	Sub-agent tự re-spawn cho scene đó (không phải orchestrator's concern)
User reject MP3 voice	Quay lại Phase 1 với voice settings tweak

Example end-to-end

User:

Có script đây, chạy full pipeline ra video TikTok. Kèm 2 ảnh b-roll cho lesson 1 và lesson 2. Script: "Hôm nay mình chia sẻ 3 bài học từ Anthropic. Bài học đầu tiên là Alphabet ship beta. Bài học thứ 2 là 24h…" B-roll: alphabet.png (lesson 1), clock.png (lesson 2)

Pipeline:

Step 0 — slug hom-nay-minh-chia-se. Folder workspace/content/2026-05-03/hom-nay-minh-chia-se/. Save script.txt, copy b-roll.
Step 1 — text_to_mp3.py → voiceover.mp3 (45s, 0.7MB).
Step 2 — CHECKPOINT #1 — báo user path + duration. Đợi reply.
User: OK.
Step 3 — Pick avatar 66e75e22…. Upload MP3 → asset_id. Generate video → poll → download source.mp4 (45s, 7.5MB).
Step 4 — Spawn mkt-full-video-phase3-packager sub-agent.
Sub-agent transcribes (35s), builds outline (3 lessons + recap + cta), shows outline → CHECKPOINT #2.
User: OK.
Sub-agent fan-out 5 scene writers parallel (~8s wall-clock vs ~40s serial).
Sub-agent merges → scenes.json → scaffold + SFX + captions parallel → root → lint → preview.
Sub-agent returns Studio URL.
Step 5 — Orchestrator báo user format chốt. Đợi render.

Total wall-clock: ~4–6 phút (vs 5–8 phút ở pipeline serial cũ).

What this skill does NOT do

KHÔNG viết script (dùng mkt-create-script-short-video hoặc mkt-create-script-storytelling-video trước).
KHÔNG handle script > 5000 ký tự (fail fast — user split semantic).
KHÔNG chunk MP3 (single-clip pipeline; > 300s dùng heygen-short-video).
KHÔNG auto-render MP4 cuối (user gate ở preview Studio).
KHÔNG handle Phase 3 internals — delegate cho sub-agent mkt-full-video-phase3-packager.
KHÔNG override hard constraint của sub-skill (avatar allowlist, MCP-only, font Be Vietnam Pro, etc.).

References

Sub-skill mkt-elevenlabs-tts-to-mp3 — .claude/skills/mkt-elevenlabs-tts-to-mp3/SKILL.md
Sub-skill heygen-mp3-to-mp4 — .claude/skills/heygen-mp3-to-mp4/SKILL.md
Sub-skill mkt-hyperframe-talking-head-video — .claude/skills/mkt-hyperframe-talking-head-video/SKILL.md (loaded by Phase 3 sub-agent)
Sub-agent mkt-full-video-phase3-packager — .claude/agents/mkt-full-video-phase3-packager.md
Reference HyperFrames project — workspace/video-projects/3-bai-hoc/