with one click
aliyun-qwen-asr-realtime
// Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, live captions, or duplex voice agents.
// Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, live captions, or duplex voice agents.
Use when querying distributed traces or application metrics in Alibaba Cloud ARMS (Application Real-Time Monitoring Service). Use for trace search by service/duration/tags, trace detail and method stack retrieval, application listing, and performance metrics queries.
Smoke test for aliyun-arms-query skill. Validates script compilation and basic SDK client initialization.
Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from reference audio and then reusing the returned voice_id in later TTS calls.
Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from a voice prompt plus preview text before using the returned voice_id in TTS.
Use when transcribing non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when converting recorded audio files to text, generating transcripts with timestamps, or documenting DashScope/OpenAI-compatible ASR request and response fields.
Use when live speech translation is needed with Alibaba Cloud Model Studio Qwen LiveTranslate models, including bilingual meetings, realtime interpretation, and speech-to-speech or speech-to-text translation flows.
| name | aliyun-qwen-asr-realtime |
| description | Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, live captions, or duplex voice agents. |
| version | 1.0.0 |
Category: provider
mkdir -p output/aliyun-qwen-asr-realtime
python -m py_compile skills/ai/audio/aliyun-qwen-asr-realtime/scripts/prepare_realtime_asr_request.py && echo "py_compile_ok" > output/aliyun-qwen-asr-realtime/validate.txt
Pass criteria: command exits 0 and output/aliyun-qwen-asr-realtime/validate.txt is generated.
output/aliyun-qwen-asr-realtime/.Use one of these exact model strings:
qwen3-asr-flash-realtimeqwen3-asr-flash-realtime-2026-02-10DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.model (string, optional): default qwen3-asr-flash-realtimelanguage_hints (array, optional)format (string, optional): e.g. pcm, wavsample_rate (int, optional): e.g. 16000chunk_ms (int, optional): frame size in millisecondstext (string): recognized transcript fragmentis_final (bool): finalization markerusage (object, optional)Generate a request template:
python skills/ai/audio/aliyun-qwen-asr-realtime/scripts/prepare_realtime_asr_request.py \
--output output/aliyun-qwen-asr-realtime/request.json
skills/ai/audio/aliyun-qwen-asr/ instead.references/sources.md