تشغيل أي مهارة في Manus بنقرة واحدة

$pwd:

workflow-ai-enhancement

Name: Workflow Ai Enhancement
Author: damionrashford

// Restore, upscale, and enhance existing footage using 2026 open-source AI models — Real-ESRGAN/SwinIR/HAT super-resolution, RIFE/FILM interpolation, DeepFilterNet/RNNoise audio denoise, rembg/BiRefNet/RVM matting, Depth-Anything v2 depth — with strict OSI-open commercial-safe license filter. Use when the user says "upscale old footage", "remaster", "enhance quality", "30 to 60fps", "AI denoise", "restore VHS", "remove background from video", or anything about AI-driven footage restoration.

تشغيل في Manus

$ git log --oneline --stat

stars:٧

forks:٢

updated:١٨ أبريل ٢٠٢٦ في ٠٥:٥٥

SKILL.md

readonly

related-skills.json

نفس المستودع

ffmpeg-cut-concat.md

from "damionrashford/media-os"

Trim, cut, split, segment, and concatenate media with ffmpeg (stream copy when possible, re-encode across cut boundaries). Use when the user asks to trim a video, cut a clip, extract a segment by timestamps, remove a section, split into parts, join/merge videos, concatenate files, or build a segmented HLS-style playlist.

2026-05-177

obs-config.md

from "damionrashford/media-os"

Install and configure OBS Studio programmatically: install via brew cask / winget / Flatpak / apt, author profiles (basic.ini, streamEncoder.json, recordEncoder.json), author scene collections (scenes JSON), manage global.ini, set defaults for encoder / output / audio / hotkeys, cross-platform config paths. Use when the user asks to install OBS, set up an OBS profile, create a scene collection from code, configure OBS defaults without the GUI, edit basic.ini, manage multiple OBS profiles, or script a fresh OBS install with known-good settings.

2026-05-177

workflow-analysis-quality.md

from "damionrashford/media-os"

Deep media inspection + automated QC — ffprobe stream details, MediaInfo diagnostics, VMAF/PSNR/SSIM quality metrics, PySceneDetect scene cuts, crop/silence/black-frame/interlacing detection, ffplay scope debugging, NAL/SEI bitstream forensics, metadata audits, loudness compliance against Spotify/Apple/ATSC/EBU specs, and automated CI QC gates. Use when the user says "QC this file", "run VMAF", "compare encoders", "detect scene cuts", "check loudness compliance", "validate delivery spec", "automated QC pipeline", or any deep inspection / quality gating.

2026-05-177

media-pipeline-router.md

from "damionrashford/media-os"

Routes media production requests to the right Media OS specialist subagent. ALWAYS use this skill when the user expresses ANY media production intent — "go live", "start streaming", "OBS broadcast", "wire up live rig", "NDI to stream", "PTZ camera setup", "DeckLink capture stream", "HLS deliver", "DASH package", "encode for streaming", "CDN upload", "make a HLS manifest", "multi-bitrate ladder", "CMAF package", "LL-HLS", "low-latency stream", "Widevine package", "DRM package", "package for streaming", "broadcast deliver", "MXF master", "IMF package", "ProRes master", "DPP deliver", "AS-11 deliver", "Netflix deliver", "broadcast spec deliver", "deliver for air", "create IMF", "Premiere to Resolve", "round-trip", "OTIO export", "editorial conform", "XML round-trip", "FCPXML", "EDL export", "AAF export", "convert timeline", "Avid to Premiere", "Resolve to Premiere", "upscale this", "interpolate frames", "denoise AI", "remove background", "rotoscope", "matte", "depth estimate", "AI upscale", "RIFE", "Real-ESRGAN"

2026-05-177

workflow-acquisition-archive.md

from "damionrashford/media-os"

Ingest from every source — web (yt-dlp), screen / webcam / mic capture, SDI (DeckLink), DSLR tether (gphoto2), NDI network sources, RTSP / IP cameras via MediaMTX, PTZ — then verify integrity (SHA-256 + full-decode pass), preserve metadata (EXIF / XMP / IPTC), normalize to MKV or FFV1 / J2K / ProRes archival containers, and push to cold-storage cloud (Glacier / B2 / Archive.org). Use when the user says "download YouTube playlist", "capture SDI for 24 hours", "archive IP cameras", "preserve VHS rips", "tether DSLR timelapse", "cold-storage upload", or any ingest-to-archive workflow.

2026-04-187

workflow-ai-generation.md

from "damionrashford/media-os"

Generate media from scratch with 2026 open-source AI — TTS voiceover (Kokoro / OpenVoice / Piper), image gen (FLUX-schnell / Kolors / Sana / ComfyUI), video gen (LTX-Video / CogVideoX / Mochi / Wan), music (Riffusion / YuE), lipsync talking heads (LivePortrait / LatentSync), OCR (PaddleOCR / Tesseract 5 / TrOCR), zero-shot tagging (CLIP / SigLIP / BLIP-2 / LLaVA). Strict commercial-safe license filter. Use when the user says "generate a video", "TTS voiceover", "AI explainer video", "clone my voice", "generate music", "AI image", "digital human", or anything about from-scratch AI media.

2026-04-187

package.json

"author": "damionrashford"

"repository": "damionrashford/media-os"

فتح مستودع GitHub عرض مستودعات المنشئ

$ install --global

$ download --local

تشغيل في Manus

$ useful --forSOC

محررو الأفلام والفيديوالفنون والتصميم والترفيه والرياضة والإعلام27-4032L4

فنيو هندسة الصوتL4

name	workflow-ai-enhancement
description	Restore, upscale, and enhance existing footage using 2026 open-source AI models — Real-ESRGAN/SwinIR/HAT super-resolution, RIFE/FILM interpolation, DeepFilterNet/RNNoise audio denoise, rembg/BiRefNet/RVM matting, Depth-Anything v2 depth — with strict OSI-open commercial-safe license filter. Use when the user says "upscale old footage", "remaster", "enhance quality", "30 to 60fps", "AI denoise", "restore VHS", "remove background from video", or anything about AI-driven footage restoration.
argument-hint	["source"]

Workflow — AI Enhancement

What: Take existing footage and make it visually/sonically better using open-source AI. Strict license discipline: Apache-2 / MIT / BSD / GPL only. NC / research-only models are documented-and-dropped.

Skills used

media-upscale, media-interpolate, media-matte, media-depth, media-denoise-ai, media-demucs, ffmpeg-denoise-restore, ffmpeg-stabilize, ffmpeg-ivtc, ffmpeg-hdr-color, ffmpeg-lut-grade, ffmpeg-ocio-colorpro, ffmpeg-transcode, ffmpeg-hwaccel, ffmpeg-quality.

Pipeline

Step 1 — Probe source

ffmpeg-probe. Capture resolution, frame rate, codec artifacts, audio noise floor, color space (BT.601 / BT.709 / SDR), bit depth, chroma.

Step 2 — Undo legacy artifacts BEFORE AI

Telecined 29.97i → 23.976p — ffmpeg-ivtc (fieldmatch → decimate).
True interlaced — yadif via ffmpeg-video-filter.
Compression noise — ffmpeg-denoise-restore (nlmeans, hqdn3d).
Unstable handheld — ffmpeg-stabilize two-pass (vidstabdetect → vidstabtransform).

Step 3 — AI super-resolution

Use media-upscale:

Real-ESRGAN x4plus — live-action default.
Real-ESRGAN anime6b — animation.
SwinIR — graphics / text.
HAT — best-quality slowest.
Face-enhance — for close-ups; never use CodeFormer (NC).

Step 4 — Frame interpolation

media-interpolate:

RIFE v4.6 (MIT) — handles scene cuts.
FILM (Apache-2) — slightly smoother, drops on scene cuts.

Target integer multiples for clean math (23.976 → 47.952 exact 2×). 23.976 → 60 is 2.504× — quality suffers on fractional positions.

Step 5 — AI audio denoise

media-denoise-ai:

DeepFilterNet — general, 16 kHz or 48 kHz MONO only.
RNNoise — lightweight, hardcoded 48 kHz mono 16-bit.
Resemble Enhance — speech super-res.

Isolate stems first with media-demucs if source is mixed.

Step 6 — Background removal (optional)

media-matte:

rembg (MIT) — fastest.
BiRefNet / RMBG-2.0 — stills only; per-frame for video produces temporal flicker.
RVM (RobustVideoMatting) — temporally coherent, GPL-3 (propagates if shipped embedded; commercial OK via dynamic linking).

Step 7 — Depth estimation (optional)

media-depth:

Depth-Anything v2 (Apache-2) — fastest, relative depth.
MiDaS — for relighting / 3D reprojection.

Step 8 — Color + HDR finish

LUT (ffmpeg-lut-grade), OCIO ACES (ffmpeg-ocio-colorpro), or SDR→HLG tone-map (ffmpeg-hdr-color).

Step 9 — QC + final encode

ffmpeg-quality VMAF vs source (80–95 expected). ffmpeg-transcode to delivery: H.264 10-bit yuv420p10le for broad compat, AV1 for bandwidth-constrained.

Variants

Animation — Real-ESRGAN anime variant + RIFE with smooth motion.
Face-focused — GFPGAN (MIT) for talking heads. NEVER CodeFormer (NC).
VHS → 4K archive — detect cadence → yadif → heavy hqdn3d → upscale 480p → 1440p → color correct.
Game capture 30 → 120 fps — RIFE with higher factors.
Handheld + upscale — stabilize FIRST (crop introduced), THEN upscale.

Gotchas

License landmines — always-drop list: CodeFormer (NC), DAIN (research), XTTS-v2 / F5-TTS (CPML NC), Stable Video Diffusion (NC), Wav2Lip / SadTalker (research), MusicGen Meta (CC-BY-NC), Surya (commercial restriction). Each AI skill's references/LICENSES.md pins the allow-list.
Upscaling amplifies noise. ALWAYS denoise first — upscaler + noisy source = HD noise.
RIFE handles scene cuts; FILM doesn't. Watch for ghost frames on FILM output.
Real-ESRGAN default tile size 400 — drop to 200 if VRAM-bound. Model doesn't "see" the whole frame; artifacts at tile borders on wild tile sizes.
After AI upscale, force -pix_fmt yuv420p (or yuv420p10le for 10-bit deliver). AI models often output RGB or YUV 4:4:4; consumer players need 4:2:0.
10-bit upscale into 8-bit codec = banding. If upscale to 10-bit, deliver in 10-bit: libx264 -pix_fmt yuv420p10le or equivalent HEVC.
DeepFilterNet expects 16 kHz or 48 kHz MONO. Feed right format or it downsamples badly.
RNNoise hardcoded 48 kHz mono 16-bit. Resample first or it fails silently.
Demucs expects 44.1 kHz or 48 kHz STEREO. Mono input → degraded stems.
RVM is GPL-3. If shipped embedded in proprietary code, GPL-3 propagates. Commercial OK via dynamic linking.
rembg default u2net is fine; isnet-general-use is better but heavier.
BiRefNet is image-only. Per-frame video = temporal flicker. Use RVM for temporal coherence.
Depth-Anything v2 outputs RELATIVE depth. Good for VFX; NOT for measurement.
Hugging Face model licenses can change without notice. Pin exact commit hash in references/LICENSES.md.
AI inference benefits 10× from GPU. On CPU, Real-ESRGAN 1080p30s = hours.
ffmpeg -hwaccel is decode-only. AI models run on their own CUDA / Metal / ROCm paths.
Apple Silicon (M1–M4) MPS: set PYTORCH_ENABLE_MPS_FALLBACK=1 for unported ops.

Example — Upscale + interpolate old 720p30 → 4K60

Probe source → denoise with hqdn3d → Real-ESRGAN x4plus to 2880×? tile=400 → RIFE v4.6 ×2 → LUT grade → ffmpeg-transcode to HEVC 10-bit HDR-ready yuv420p10le. VMAF check against source at 720p (upscale reference). Deliver.

workflow-ai-generation — pure AI-generated media (not enhancement of existing).
workflow-vod-post-production — traditional color/stabilize/denoise path without AI.
workflow-hdr — if AI output needs HDR mastering.

workflow-ai-enhancement

Workflow — AI Enhancement

Skills used

Pipeline

Step 1 — Probe source

Step 2 — Undo legacy artifacts BEFORE AI

Step 3 — AI super-resolution

Step 4 — Frame interpolation

Step 5 — AI audio denoise

Step 6 — Background removal (optional)

Step 7 — Depth estimation (optional)

Step 8 — Color + HDR finish

Step 9 — QC + final encode

Variants

Gotchas

Example — Upscale + interpolate old 720p30 → 4K60

Related

Workflow — AI Enhancement

Skills used

Pipeline

Step 1 — Probe source

Step 2 — Undo legacy artifacts BEFORE AI

Step 3 — AI super-resolution

Step 4 — Frame interpolation

Step 5 — AI audio denoise

Step 6 — Background removal (optional)

Step 7 — Depth estimation (optional)

Step 8 — Color + HDR finish

Step 9 — QC + final encode

Variants

Gotchas

Example — Upscale + interpolate old 720p30 → 4K60

Related

workflow-ai-enhancement

المزيد من هذا المستودع

المزيد من هذا المستودع

Workflow — AI Enhancement

Skills used

Pipeline

Step 1 — Probe source

Step 2 — Undo legacy artifacts BEFORE AI

Step 3 — AI super-resolution

Step 4 — Frame interpolation

Step 5 — AI audio denoise

Step 6 — Background removal (optional)

Step 7 — Depth estimation (optional)

Step 8 — Color + HDR finish

Step 9 — QC + final encode

Variants

Gotchas

Example — Upscale + interpolate old 720p30 → 4K60

Related

Workflow — AI Enhancement

Skills used

Pipeline

Step 1 — Probe source

Step 2 — Undo legacy artifacts BEFORE AI

Step 3 — AI super-resolution

Step 4 — Frame interpolation

Step 5 — AI audio denoise

Step 6 — Background removal (optional)

Step 7 — Depth estimation (optional)

Step 8 — Color + HDR finish

Step 9 — QC + final encode

Variants

Gotchas

Example — Upscale + interpolate old 720p30 → 4K60

Related