Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

$pwd:

geek-skills-podcast-generator

Name: Geek Skills Podcast Generator
Author: staruhub

// Generate AI podcasts using Volcano Engine's Podcast AI Model. Use when user wants to create podcast audio from text input, generate conversational audio content, or transform written content into multi-speaker podcast format. Supports Chinese dual-speaker podcasts with customizable voice options.

In Manus ausführen

$ git log --oneline --stat

stars:443

forks:77

updated:22. April 2026 um 10:58

Datei-Explorer

3 Dateien

SKILL.md

readonly

name	Geek-skills-podcast-generator
version	1.0.0
description	Generate AI podcasts using Volcano Engine's Podcast AI Model. Use when user wants to create podcast audio from text input, generate conversational audio content, or transform written content into multi-speaker podcast format. Supports Chinese dual-speaker podcasts with customizable voice options.

Podcast Generator

Overview

Generate professional AI-powered podcasts using Volcano Engine's Podcast AI Model. This skill transforms text input into engaging dual-speaker podcast audio with natural conversation flow, supporting multiple audio formats and voice customization.

Quick Start

To generate a podcast:

Ensure Volcano Engine credentials are available (APP_ID and ACCESS_KEY)
Prepare the podcast topic/content text (up to 25,000 characters)
Run the generation script with required parameters
Receive the output audio file in your preferred format

Core Workflow

Step 1: Prepare Input

Required information:

Podcast topic or content text (Chinese, up to 25k characters)
Volcano Engine APP ID
Volcano Engine Access Key

Optional customization:

Audio format (mp3, ogg_opus, pcm, aac)
Sample rate (default: 24000 Hz)
Speech rate (-50 to 100, where 100 = 2.0x speed)
Speaker voices (default: male + female duo)
Opening music (default: disabled)

Step 2: Generate Podcast

Run the generation script:

python scripts/generate_podcast.py \
  --text "Your podcast topic or content" \
  --output "/path/to/output.mp3" \
  --app-id "YOUR_APP_ID" \
  --access-key "YOUR_ACCESS_KEY" \
  --format mp3 \
  --sample-rate 24000 \
  --speech-rate 0

Alternative: Use as Python module

import asyncio
from scripts.generate_podcast import PodcastGenerator

async def create_podcast():
    generator = PodcastGenerator(
        app_id="YOUR_APP_ID",
        access_key="YOUR_ACCESS_KEY"
    )
    
    result = await generator.generate_podcast(
        input_text="分析下当前的大模型发展",
        output_path="podcast.mp3",
        audio_format="mp3",
        sample_rate=24000,
        speech_rate=0,
        use_head_music=False
    )
    
    if result['success']:
        print(f"✅ Podcast generated: {result['output_path']}")
    else:
        print(f"❌ Failed: {result['error']}")

asyncio.run(create_podcast())

Step 3: Handle Output

The script will:

Stream audio data in real-time
Display progress for each speaking round
Save the complete audio file to the specified path
Return generation statistics (file size, round count, etc.)

Advanced Features

Resume from Interruption

If generation is interrupted, use the resume capability:

result = await generator.generate_podcast(
    input_text="Your topic",
    output_path="podcast.mp3",
    retry_info={
        "retry_task_id": "previous_task_id",
        "last_finished_round_id": 5
    }
)

The system will continue from the last completed round instead of starting over.

Custom Speaker Configuration

Specify different speaker voices:

result = await generator.generate_podcast(
    input_text="Your topic",
    output_path="podcast.mp3",
    speakers=[
        "zh_male_dayixiansheng_v2_saturn_bigtts",
        "zh_female_mizaitongxue_v2_saturn_bigtts"
    ]
)

Audio Format Options

Supported formats and use cases:

mp3: Best for general distribution (compressed, widely supported)
ogg_opus: High quality with good compression
pcm: Uncompressed raw audio (largest file size, highest quality)
aac: Modern compressed format with good quality

Speech Rate Adjustment

Control speaking speed:

speech_rate=0: Normal speed (1.0x)
speech_rate=100: 2x speed (fast)
speech_rate=-50: 0.5x speed (slow)

Common Usage Patterns

Pattern 1: Quick Blog Post to Podcast

blog_text = """
[Your blog post content here - can be long form]
"""

result = await generator.generate_podcast(
    input_text=blog_text,
    output_path="blog_podcast.mp3"
)

Pattern 2: Research Paper Summary

paper_summary = "Summarize the key findings of the latest AI research..."

result = await generator.generate_podcast(
    input_text=paper_summary,
    output_path="research_podcast.mp3",
    use_head_music=True  # Add opening music for professional touch
)

Pattern 3: Educational Content

lesson_topic = "Explain quantum computing concepts for beginners"

result = await generator.generate_podcast(
    input_text=lesson_topic,
    output_path="lesson.mp3",
    speech_rate=-20  # Slightly slower for educational content
)

Error Handling

Common issues and solutions:

Connection Errors:

Verify APP_ID and ACCESS_KEY are correct
Check network connectivity
Ensure firewall allows WebSocket connections

Text Too Long:

The model truncates at 25,000 characters
Split long content into multiple podcasts

Audio Not Generated:

Check output path is writable
Verify sufficient disk space
Review error messages for specific issues

Incomplete Generation:

Use retry_info to resume from last completed round
Check logs for the task_id and last_finished_round_id

Resource Usage

scripts/generate_podcast.py

Complete WebSocket client implementation for Volcano Engine's Podcast API:

Handles binary protocol communication
Manages streaming audio reception
Implements automatic retry logic
Provides both CLI and programmatic interfaces

Key features:

Async/await pattern for efficient I/O
Progress tracking with emoji indicators
Comprehensive error handling
Flexible parameter configuration

references/api_reference.md

Detailed API documentation including:

Complete parameter specifications
WebSocket protocol details
Event type reference
Error code explanations

Consult this file for:

Advanced API usage
Protocol-level debugging
Custom implementation needs

Requirements

Python dependencies:

pip install websockets

Credentials:

Volcano Engine APP ID (obtain from console: https://console.volcengine.com/speech/service/10028)
Volcano Engine Access Key

Best Practices

Input Text Quality: Use clear, well-structured Chinese text for best results
Length Optimization: Aim for 500-3000 characters for optimal podcast length
Format Selection: Use MP3 for distribution, PCM for further processing
Error Handling: Always check the success field in results
Resource Management: Close connections properly to avoid quota issues

Limitations

Maximum text length: 25,000 characters (model truncates longer input)
Language support: Primarily optimized for Chinese
Concurrent requests: Subject to your account's quota limits
Audio quality: Determined by model capabilities, not controllable via parameters

related-skills.json

gleiches Repository

geek-skills-ai-sales-champion.md

from "staruhub/ClaudeSkills"

AI咨询/销售的对话策略助手。当用户需要准备AI方案沟通、跟业务部门聊AI落地、写AI提案、应对客户异议、做AI培训破冰时使用。触发场景："怎么跟老板聊AI"、"客户说AI不靠谱"、"准备一个AI方案汇报"、"帮我想想怎么推AI"、"业务部门不配合"、"AI项目怎么卖"、"demo之后怎么跟进"。也适用于AI咨询师、技术合伙人、CTO做内部AI推广。

2026-04-22443

geek-skills-keqian-method.md

from "staruhub/ClaudeSkills"

胥克谦式AI-Native产品开发方法论。适用于：(1) 使用AI Agent（Claude Code、Codex、Cursor等）进行产品级软件开发，(2) 设计和优化Harness/Skill体系，(3) 文档驱动开发(SDD)流程，(4) 构建自动化质量门禁和eval机制，(5) Token成本优化与缓存策略，(6) 产品人转型开发者的AI编程实践。触发场景包括"帮我设计开发流程"、"怎么降低token成本"、"怎么提高AI编码质量"、"文档驱动"、"质量门禁"、"harness设计"、"单agent vs multi-agent"、"自动化迭代"、"AI产品开发"、"SDD"、"eval机制"等。即使用户只是说"帮我用AI写代码"或"怎么让agent干活更靠谱"也应触发。

2026-04-22443

geek-skills-notion-infographic.md

from "staruhub/ClaudeSkills"

基于大纲自动研究并生成高质量可视化内容的 Agent Pipeline。支持两种输出模式：(A) PPTX 演示文稿（PptxGenJS 编程生成，含完整设计系统）； (B) 信息图提示词组图（Notion 手绘风 / 多风格可选，可直接用于 imageGen / DALL·E）。用户只需提供主题大纲或关键词，skill 自动启动专家子 Agent 并行抓取信息，主 Agent 负责规划、设计决策和验收，最终输出风格统一的高质量视觉内容。触发场景："帮我做一组信息图"、"生成 Notion 风格图片"、"做个PPT"、"做个演示文稿"、 "把这个大纲做成图"、"infographic"、"信息图"、"手绘信息图"、"图解"、 "把这篇文章可视化"、"做成社交媒体传播图"、"小红书图文"、"slides"、 "presentation"、"deck"、"pptx"、"演示文稿"、"汇报PPT"。即使用户没有明确说"信息图"或"PPT"，但在提供大纲/要点并要求可视化传播时也应触发。当用户上传文章/文稿并要求"做成图"、"可视化"、"做成演示"时，同样触发此 skill。

2026-04-22443

geek-skills-xuefeng-method.md

from "staruhub/ClaudeSkills"

雪峰式AI-Native产品开发方法论。适用于：(1) 用户行为开放、不可穷举的AI-native产品（AI日历、AI助手、AI推荐、对话式产品等），(2) 强模型依赖型场景，AI驱动核心决策而非仅辅助，(3) 多专精Agent架构设计与分工，(4) 上线后快速校准、行为审计与漂移检测，(5) 模型选择和智能路由策略，(6) 概率性输出的质量评估。触发场景包括"AI-native产品怎么做"、"用户行为不可预测怎么办"、"多agent怎么分工"、"模型漂移怎么处理"、"校准到95%太难了"、"唯快不破"、"怎么选模型"、"agent并行分工"、"AI产品上线后怎么迭代"。注意：如果产品是场景明确、边界可定义的+AI类型，请改用 keqian-method skill。即使用户没有明确说"AI-native"，但在讨论AI驱动决策、用户行为不可预测、概率性输出等话题时也应触发。

2026-04-22443

deep-research.md

from "staruhub/ClaudeSkills"

Use this skill when the user wants an evidence-based research memo, literature review, market/policy/technical landscape, or a multi-source decision brief with citations, trade-offs, and a clear conclusion. Best for tasks that need synthesis across multiple external sources, iterative follow-up research, or a reusable written artifact. Do not use for quick factual lookups, single-source summaries, simple Q&A, or when the user clearly wants a short answer instead of a report. Chinese trigger examples: "帮我调研", "深度研究", "综述报告", "技术选型分析", "竞品研究", "政策分析". Success = scoped plan, grounded notes, verified citations, explicit limitations, and a final brief/report that clearly separates evidence from analysis.

2026-04-21443

geek-skills-product-manager.md

from "staruhub/ClaudeSkills"

资深产品经理助手,提供PRD文档创作与评审、产品策略咨询、留存增长分析、竞品研究、功能优先级排序等全方位产品管理支持。适用于创作或评审PRD/MRD/BRD/用户故事等产品文档；诊断产品问题（留存低、转化差、增长瓶颈）并给出可执行策略；进行竞品分析和市场研究；设计功能方案和用户体验优化。当用户提到"PRD"、"需求文档"、"产品规划"、"用户留存"、"功能设计"、"竞品分析"、"产品指标"、"增长策略"、"用户体验优化"、"功能优先级"等产品管理相关话题时，使用此skill。即使用户没有明确说"产品"，但在讨论App功能设计、用户增长、商业模式、需求分析等话题时也应触发。

2026-04-21443

package.json

"author": "staruhub"

"repository": "staruhub/ClaudeSkills"

GitHub-Repository öffnen Creator-Repositorys ansehen

$ install --global

$ download --local

In Manus ausführen

$ useful --forSOC

SoftwareentwicklerInformatik- und Mathematikberufe15-1252L4

name	Geek-skills-podcast-generator
version	1.0.0
description	Generate AI podcasts using Volcano Engine's Podcast AI Model. Use when user wants to create podcast audio from text input, generate conversational audio content, or transform written content into multi-speaker podcast format. Supports Chinese dual-speaker podcasts with customizable voice options.

Podcast Generator

Overview

Quick Start

To generate a podcast:

Ensure Volcano Engine credentials are available (APP_ID and ACCESS_KEY)
Prepare the podcast topic/content text (up to 25,000 characters)
Run the generation script with required parameters
Receive the output audio file in your preferred format

Core Workflow

Step 1: Prepare Input

Required information:

Podcast topic or content text (Chinese, up to 25k characters)
Volcano Engine APP ID
Volcano Engine Access Key

Optional customization:

Audio format (mp3, ogg_opus, pcm, aac)
Sample rate (default: 24000 Hz)
Speech rate (-50 to 100, where 100 = 2.0x speed)
Speaker voices (default: male + female duo)
Opening music (default: disabled)

Step 2: Generate Podcast

Run the generation script:

python scripts/generate_podcast.py \
  --text "Your podcast topic or content" \
  --output "/path/to/output.mp3" \
  --app-id "YOUR_APP_ID" \
  --access-key "YOUR_ACCESS_KEY" \
  --format mp3 \
  --sample-rate 24000 \
  --speech-rate 0

Alternative: Use as Python module

import asyncio
from scripts.generate_podcast import PodcastGenerator

async def create_podcast():
    generator = PodcastGenerator(
        app_id="YOUR_APP_ID",
        access_key="YOUR_ACCESS_KEY"
    )
    
    result = await generator.generate_podcast(
        input_text="分析下当前的大模型发展",
        output_path="podcast.mp3",
        audio_format="mp3",
        sample_rate=24000,
        speech_rate=0,
        use_head_music=False
    )
    
    if result['success']:
        print(f"✅ Podcast generated: {result['output_path']}")
    else:
        print(f"❌ Failed: {result['error']}")

asyncio.run(create_podcast())

Step 3: Handle Output

The script will:

Stream audio data in real-time
Display progress for each speaking round
Save the complete audio file to the specified path
Return generation statistics (file size, round count, etc.)

Advanced Features

Resume from Interruption

If generation is interrupted, use the resume capability:

result = await generator.generate_podcast(
    input_text="Your topic",
    output_path="podcast.mp3",
    retry_info={
        "retry_task_id": "previous_task_id",
        "last_finished_round_id": 5
    }
)

The system will continue from the last completed round instead of starting over.

Custom Speaker Configuration

Specify different speaker voices:

result = await generator.generate_podcast(
    input_text="Your topic",
    output_path="podcast.mp3",
    speakers=[
        "zh_male_dayixiansheng_v2_saturn_bigtts",
        "zh_female_mizaitongxue_v2_saturn_bigtts"
    ]
)

Audio Format Options

Supported formats and use cases:

mp3: Best for general distribution (compressed, widely supported)
ogg_opus: High quality with good compression
pcm: Uncompressed raw audio (largest file size, highest quality)
aac: Modern compressed format with good quality

Speech Rate Adjustment

Control speaking speed:

speech_rate=0: Normal speed (1.0x)
speech_rate=100: 2x speed (fast)
speech_rate=-50: 0.5x speed (slow)

Common Usage Patterns

Pattern 1: Quick Blog Post to Podcast

blog_text = """
[Your blog post content here - can be long form]
"""

result = await generator.generate_podcast(
    input_text=blog_text,
    output_path="blog_podcast.mp3"
)

Pattern 2: Research Paper Summary

paper_summary = "Summarize the key findings of the latest AI research..."

result = await generator.generate_podcast(
    input_text=paper_summary,
    output_path="research_podcast.mp3",
    use_head_music=True  # Add opening music for professional touch
)

Pattern 3: Educational Content

lesson_topic = "Explain quantum computing concepts for beginners"

result = await generator.generate_podcast(
    input_text=lesson_topic,
    output_path="lesson.mp3",
    speech_rate=-20  # Slightly slower for educational content
)

Error Handling

Common issues and solutions:

Connection Errors:

Verify APP_ID and ACCESS_KEY are correct
Check network connectivity
Ensure firewall allows WebSocket connections

Text Too Long:

The model truncates at 25,000 characters
Split long content into multiple podcasts

Audio Not Generated:

Check output path is writable
Verify sufficient disk space
Review error messages for specific issues

Incomplete Generation:

Use retry_info to resume from last completed round
Check logs for the task_id and last_finished_round_id

Resource Usage

scripts/generate_podcast.py

Complete WebSocket client implementation for Volcano Engine's Podcast API:

Handles binary protocol communication
Manages streaming audio reception
Implements automatic retry logic
Provides both CLI and programmatic interfaces

Key features:

Async/await pattern for efficient I/O
Progress tracking with emoji indicators
Comprehensive error handling
Flexible parameter configuration

references/api_reference.md

Detailed API documentation including:

Complete parameter specifications
WebSocket protocol details
Event type reference
Error code explanations

Consult this file for:

Advanced API usage
Protocol-level debugging
Custom implementation needs

Requirements

Python dependencies:

pip install websockets

Credentials:

Volcano Engine APP ID (obtain from console: https://console.volcengine.com/speech/service/10028)
Volcano Engine Access Key

Best Practices

Input Text Quality: Use clear, well-structured Chinese text for best results
Length Optimization: Aim for 500-3000 characters for optimal podcast length
Format Selection: Use MP3 for distribution, PCM for further processing
Error Handling: Always check the success field in results
Resource Management: Close connections properly to avoid quota issues

Limitations

Maximum text length: 25,000 characters (model truncates longer input)
Language support: Primarily optimized for Chinese
Concurrent requests: Subject to your account's quota limits
Audio quality: Determined by model capabilities, not controllable via parameters

geek-skills-podcast-generator

Podcast Generator

Overview

Quick Start

Core Workflow

Step 1: Prepare Input

Step 2: Generate Podcast

Step 3: Handle Output

Advanced Features

Resume from Interruption

Custom Speaker Configuration

Audio Format Options

Speech Rate Adjustment

Common Usage Patterns

Pattern 1: Quick Blog Post to Podcast

Pattern 2: Research Paper Summary

Pattern 3: Educational Content

Error Handling

Resource Usage

scripts/generate_podcast.py

references/api_reference.md

Requirements

Best Practices

Limitations

Mehr aus diesem Repository

Mehr aus diesem Repository

Podcast Generator

Overview

Quick Start

Core Workflow

Step 1: Prepare Input

Step 2: Generate Podcast

Step 3: Handle Output

Advanced Features

Resume from Interruption

Custom Speaker Configuration

Audio Format Options

Speech Rate Adjustment

Common Usage Patterns

Pattern 1: Quick Blog Post to Podcast

Pattern 2: Research Paper Summary

Pattern 3: Educational Content

Error Handling

Resource Usage

scripts/generate_podcast.py

references/api_reference.md

Requirements

Best Practices

Limitations