원클릭으로 Manus에서 모든 스킬 실행

$pwd:

transcribe-audio

Name: Transcribe Audio
Author: barefootford

// Transcribes video audio using WhisperX, preserving original timestamps. Creates JSON transcript with word-level timing. Use when you need to generate audio transcripts for videos.

Manus에서 실행

$ git log --oneline --stat

stars:509

forks:80

updated:2026년 5월 25일 05:32

파일 탐색기

3 개 파일

SKILL.md

readonly

name	transcribe-audio
description	Transcribes video audio using WhisperX, preserving original timestamps. Creates JSON transcript with word-level timing. Use when you need to generate audio transcripts for videos.

Skill: Transcribe Audio (parent brief)

Transcribes video audio using WhisperX and produces a clean JSON transcript with word-level timing.

SKILL.md is the parent's dispatch brief. The sub-agent's working prompt lives in agent_prompt.md — inline its contents when launching the Task agent. Don't pass SKILL.md.

Parallelism

Launch at most 2 in parallel. WhisperX is already multithreaded internally (~4 CPU threads via CTranslate2); 2 processes is the throughput-vs-RAM sweet spot on a 16GB Mac.

Inputs to gather and pass inline

The parent reads library.yaml and settings.yaml and passes these values inline in each agent's prompt:

video_path — absolute path to the video file
transcript_output_dir — where to write the transcript JSON (e.g. libraries/<library>/transcripts)
language_code — ISO 639-1 code (e.g. en, es) — parent maps from library.yaml's language name
whisper_model — model size from settings.yaml (e.g. small, medium, turbo)
transcript_refinement — boolean from library.yaml. If true, also pass:
- user_context (may be empty string)
- footage_summary (may be empty string)

After the agent returns, update library.yaml with transcript: <filename>.json.

Next step

Once all videos have audio transcripts, dispatch analyze-video for visual descriptions.

Dependencies

WhisperX must be installed. Use the setup skill to verify.

related-skills.json

같은 저장소

process-library.md

from "barefootford/buttercut"

Skill for processing footage (video clips, sounds, photos, etc). Use this when creating a new library, adding new footage (videos) to an existing library, or resuming processing on an existing library.

2026-05-25509

cut.md

from "barefootford/buttercut"

Build a cut from a library — scene, selects, roughcut, or custom task. Starts by asking what kind of cut the user wants, then works with them to determine what they want to create. Always exports a file for Final Cut, Premiere, or Resolve at the end. Use when the user asks for a "roughcut", "sequence", "scene", "selects", or any other cut-shaped output.

2026-05-25509

analyze-video.md

from "barefootford/buttercut"

Full footage analysis pipeline — audio transcripts, contact sheets, and Sonnet-written summaries. Produces every artifact the cut skill reads. Orchestrated from the main thread.

2026-05-25509

backup-library.md

from "barefootford/buttercut"

Backs up user libraries and all their contents (external video excluded). This skill can also be useful when you need to restore a library.

2026-05-25509

contact-sheet.md

from "barefootford/buttercut"

Builds a contact sheet from a video clip — evenly spaced frames laid out in a single grid image, each with its hh:mm:ss timestamp burned in. Use when the user asks for a "contact sheet", "grid", "film strip", or wants a one-image overview of part of a clip.

2026-05-25509

full-transcript.md

from "barefootford/buttercut"

Exports all dialogue from every clip in a library into a single text file. One clip per block — filename, then its spoken words. Use when the user asks for a "full transcript", "full script", or wants all the dialogue from a library in one place.

2026-05-25509

package.json

"author": "barefootford"

"repository": "barefootford/buttercut"

GitHub 저장소 열기 Creator 저장소 보기

$ install --global

$ download --local

Manus에서 실행

$ useful --forSOC

소프트웨어 개발자컴퓨터 및 수학직15-1252L4

name	transcribe-audio
description	Transcribes video audio using WhisperX, preserving original timestamps. Creates JSON transcript with word-level timing. Use when you need to generate audio transcripts for videos.

Skill: Transcribe Audio (parent brief)

Transcribes video audio using WhisperX and produces a clean JSON transcript with word-level timing.

SKILL.md is the parent's dispatch brief. The sub-agent's working prompt lives in agent_prompt.md — inline its contents when launching the Task agent. Don't pass SKILL.md.

Parallelism

Launch at most 2 in parallel. WhisperX is already multithreaded internally (~4 CPU threads via CTranslate2); 2 processes is the throughput-vs-RAM sweet spot on a 16GB Mac.

Inputs to gather and pass inline

The parent reads library.yaml and settings.yaml and passes these values inline in each agent's prompt:

video_path — absolute path to the video file
transcript_output_dir — where to write the transcript JSON (e.g. libraries/<library>/transcripts)
language_code — ISO 639-1 code (e.g. en, es) — parent maps from library.yaml's language name
whisper_model — model size from settings.yaml (e.g. small, medium, turbo)
transcript_refinement — boolean from library.yaml. If true, also pass:
- user_context (may be empty string)
- footage_summary (may be empty string)

After the agent returns, update library.yaml with transcript: <filename>.json.

Next step

Once all videos have audio transcripts, dispatch analyze-video for visual descriptions.

Dependencies

WhisperX must be installed. Use the setup skill to verify.

transcribe-audio

Skill: Transcribe Audio (parent brief)

Parallelism

Inputs to gather and pass inline

Next step

Dependencies

이 저장소의 다른 Skills

이 저장소의 다른 Skills

Skill: Transcribe Audio (parent brief)

Parallelism

Inputs to gather and pass inline

Next step

Dependencies