ワンクリックでManusで任意のスキルを実行

$pwd:

aliyun-qwen-tts-realtime

Name: Aliyun Qwen Tts Realtime
Author: cinience

// Use when real-time speech synthesis is needed with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instruction-controlled realtime synthesis.

Manusで実行

$ git log --oneline --stat

stars:391

forks:34

updated:2026年4月27日 22:35

ファイルエクスプローラー

4 ファイル

SKILL.md

readonly

name	aliyun-qwen-tts-realtime
description	Use when real-time speech synthesis is needed with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instruction-controlled realtime synthesis.
version	1.0.0

Category: provider

Model Studio Qwen TTS Realtime

Use realtime TTS models for low-latency streaming speech output.

Critical model names

Use one of these exact model strings:

qwen3-tts-flash-realtime
qwen3-tts-instruct-flash-realtime
qwen3-tts-instruct-flash-realtime-2026-01-22
qwen3-tts-vd-realtime-2026-01-15
qwen3-tts-vc-realtime-2026-01-15

Prerequisites

Install SDK in a virtual environment:

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.

Normalized interface (tts.realtime)

Request

text (string, required)
voice (string, required)
instruction (string, optional)
sample_rate (int, optional)

Response

audio_base64_pcm_chunks (array)
sample_rate (int)
finish_reason (string)

Operational guidance

Use websocket or streaming endpoint for realtime mode.
Keep each utterance short for lower latency.
For instruction models, keep instruction explicit and concise.
Some SDK/runtime combinations may reject realtime model calls over MultiModalConversation; use the probe script below to verify compatibility.

Local demo script

Use the probe script to verify realtime compatibility in your current SDK/runtime, and optionally fallback to a non-realtime model for immediate output:

.venv/bin/python skills/ai/audio/aliyun-qwen-tts-realtime/scripts/realtime_tts_demo.py \
  --text "This is a realtime speech demo." \
  --fallback \
  --output output/ai-audio-tts-realtime/audio/fallback-demo.wav

Strict mode (for CI / gating):

.venv/bin/python skills/ai/audio/aliyun-qwen-tts-realtime/scripts/realtime_tts_demo.py \
  --text "realtime health check" \
  --strict

Output location

Default output: output/ai-audio-tts-realtime/audio/
Override base dir with OUTPUT_DIR.

Validation

mkdir -p output/aliyun-qwen-tts-realtime
for f in skills/ai/audio/aliyun-qwen-tts-realtime/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/aliyun-qwen-tts-realtime/validate.txt

Pass criteria: command exits 0 and output/aliyun-qwen-tts-realtime/validate.txt is generated.

Output And Evidence

Save artifacts, command outputs, and API response summaries under output/aliyun-qwen-tts-realtime/.
Include key parameters (region/resource id/time range) in evidence files for reproducibility.

Workflow

Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
Run one minimal read-only query first to verify connectivity and permissions.
Execute the target operation with explicit parameters and bounded scope.
Verify results and save output/evidence files.

References

references/sources.md

related-skills.json

同じリポジトリ

aliyun-arms-query.md

from "cinience/alicloud-skills"

Use when querying distributed traces or application metrics in Alibaba Cloud ARMS (Application Real-Time Monitoring Service). Use for trace search by service/duration/tags, trace detail and method stack retrieval, application listing, and performance metrics queries.

2026-05-30391

aliyun-arms-query-test.md

from "cinience/alicloud-skills"

Smoke test for aliyun-arms-query skill. Validates script compilation and basic SDK client initialization.

2026-05-30391

aliyun-cosyvoice-voice-clone.md

from "cinience/alicloud-skills"

Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from reference audio and then reusing the returned voice_id in later TTS calls.

2026-04-27391

aliyun-cosyvoice-voice-design.md

from "cinience/alicloud-skills"

Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from a voice prompt plus preview text before using the returned voice_id in TTS.

2026-04-27391

aliyun-qwen-asr-realtime.md

from "cinience/alicloud-skills"

Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, live captions, or duplex voice agents.

2026-04-27391

aliyun-qwen-asr.md

from "cinience/alicloud-skills"

Use when transcribing non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when converting recorded audio files to text, generating transcripts with timestamps, or documenting DashScope/OpenAI-compatible ASR request and response fields.

2026-04-27391

package.json

"author": "cinience"

"repository": "cinience/alicloud-skills"

GitHub リポジトリを開く Creator のリポジトリを見る

$ install --global

$ download --local

Manusで実行

$ useful --forSOC

ソフトウェア開発者コンピュータ・数学職15-1252L4

name	aliyun-qwen-tts-realtime
description	Use when real-time speech synthesis is needed with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instruction-controlled realtime synthesis.
version	1.0.0

Category: provider

Model Studio Qwen TTS Realtime

Use realtime TTS models for low-latency streaming speech output.

Critical model names

Use one of these exact model strings:

qwen3-tts-flash-realtime
qwen3-tts-instruct-flash-realtime
qwen3-tts-instruct-flash-realtime-2026-01-22
qwen3-tts-vd-realtime-2026-01-15
qwen3-tts-vc-realtime-2026-01-15

Prerequisites

Install SDK in a virtual environment:

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.

Normalized interface (tts.realtime)

Request

text (string, required)
voice (string, required)
instruction (string, optional)
sample_rate (int, optional)

Response

audio_base64_pcm_chunks (array)
sample_rate (int)
finish_reason (string)

Operational guidance

Use websocket or streaming endpoint for realtime mode.
Keep each utterance short for lower latency.
For instruction models, keep instruction explicit and concise.
Some SDK/runtime combinations may reject realtime model calls over MultiModalConversation; use the probe script below to verify compatibility.

Local demo script

Use the probe script to verify realtime compatibility in your current SDK/runtime, and optionally fallback to a non-realtime model for immediate output:

.venv/bin/python skills/ai/audio/aliyun-qwen-tts-realtime/scripts/realtime_tts_demo.py \
  --text "This is a realtime speech demo." \
  --fallback \
  --output output/ai-audio-tts-realtime/audio/fallback-demo.wav

Strict mode (for CI / gating):

.venv/bin/python skills/ai/audio/aliyun-qwen-tts-realtime/scripts/realtime_tts_demo.py \
  --text "realtime health check" \
  --strict

Output location

Default output: output/ai-audio-tts-realtime/audio/
Override base dir with OUTPUT_DIR.

Validation

mkdir -p output/aliyun-qwen-tts-realtime
for f in skills/ai/audio/aliyun-qwen-tts-realtime/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/aliyun-qwen-tts-realtime/validate.txt

Pass criteria: command exits 0 and output/aliyun-qwen-tts-realtime/validate.txt is generated.

Output And Evidence

Save artifacts, command outputs, and API response summaries under output/aliyun-qwen-tts-realtime/.
Include key parameters (region/resource id/time range) in evidence files for reproducibility.

Workflow

Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
Run one minimal read-only query first to verify connectivity and permissions.
Execute the target operation with explicit parameters and bounded scope.
Verify results and save output/evidence files.

References

references/sources.md

aliyun-qwen-tts-realtime

Model Studio Qwen TTS Realtime

Critical model names

Prerequisites

Normalized interface (tts.realtime)

Request

Response

Operational guidance

Local demo script

Output location

Validation

Output And Evidence

Workflow

References

このリポジトリの他の Skills

このリポジトリの他の Skills

Model Studio Qwen TTS Realtime

Critical model names

Prerequisites

Normalized interface (tts.realtime)

Request

Response

Operational guidance

Local demo script

Output location

Validation

Output And Evidence

Workflow

References