aliyun-qwen-asr-realtime
// Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, live captions, or duplex voice agents.
| name | aliyun-qwen-asr-realtime |
| description | Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, live captions, or duplex voice agents. |
| version | 1.0.0 |
Category: provider
mkdir -p output/aliyun-qwen-asr-realtime
python -m py_compile skills/ai/audio/aliyun-qwen-asr-realtime/scripts/prepare_realtime_asr_request.py && echo "py_compile_ok" > output/aliyun-qwen-asr-realtime/validate.txt
Pass criteria: command exits 0 and output/aliyun-qwen-asr-realtime/validate.txt is generated.
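The same check can be run from Python instead of the shell. The sketch below is only an illustration of the validation step above: the helper name `validate_script` is hypothetical and not part of the skill, and it assumes the marker file should be written only when compilation succeeds, as the `&&` in the shell command implies.

```python
import py_compile
from pathlib import Path


def validate_script(script: Path, out_dir: Path) -> bool:
    """Byte-compile `script`; on success write validate.txt and return True.

    Hypothetical helper mirroring the shell validation step, not the skill's own code.
    """
    out_dir.mkdir(parents=True, exist_ok=True)  # same effect as `mkdir -p`
    try:
        # doraise=True turns a syntax error into PyCompileError instead of a printed message
        py_compile.compile(str(script), doraise=True)
    except (py_compile.PyCompileError, OSError):
        return False  # syntax error or unreadable file: no marker is written
    (out_dir / "validate.txt").write_text("py_compile_ok\n")
    return True
```

Calling `validate_script(Path("skills/ai/audio/aliyun-qwen-asr-realtime/scripts/prepare_realtime_asr_request.py"), Path("output/aliyun-qwen-asr-realtime"))` and mapping the result to an exit code of 0 or 1 would reproduce the pass criteria.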
output/aliyun-qwen-asr-realtime/.

Use one of these exact model strings:

- `qwen3-asr-flash-realtime`
- `qwen3-asr-flash-realtime-2026-02-10`

`DASHSCOPE_API_KEY` in your environment, or add `dashscope_api_key` to `~/.alibabacloud/credentials`.

- `model` (string, optional): default `qwen3-asr-flash-realtime`
- `language_hints` (array, optional)
- `format` (string, optional): e.g. `pcm`, `wav`
- `sample_rate` (int, optional): e.g. `16000`
- `chunk_ms` (int, optional): frame size in milliseconds

- `text` (string): recognized transcript fragment
- `is_final` (bool): finalization marker
- `usage` (object, optional)

Generate a request template:
python skills/ai/audio/aliyun-qwen-asr-realtime/scripts/prepare_realtime_asr_request.py \
--output output/aliyun-qwen-asr-realtime/request.json
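To make the request and response fields listed above concrete, here is a minimal sketch of how they might be used together. It is not the content of `prepare_realtime_asr_request.py`, which is not shown here. The function names, the `chunk_ms=100` default, the 16-bit mono PCM assumption, and the choice to keep only fragments with `is_final` set are all illustrative assumptions.

```python
# The two model strings listed above.
ALLOWED_MODELS = ("qwen3-asr-flash-realtime", "qwen3-asr-flash-realtime-2026-02-10")


def build_request(model=ALLOWED_MODELS[0], language_hints=None,
                  audio_format="pcm", sample_rate=16000, chunk_ms=100):
    """Return a request dict keyed by the field names listed above.

    Defaults other than `model` are assumptions chosen for illustration.
    """
    if model not in ALLOWED_MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    if sample_rate <= 0 or chunk_ms <= 0:
        raise ValueError("sample_rate and chunk_ms must be positive")
    request = {"model": model, "format": audio_format,
               "sample_rate": sample_rate, "chunk_ms": chunk_ms}
    if language_hints:  # optional field: omitted when empty
        request["language_hints"] = list(language_hints)
    return request


def bytes_per_chunk(sample_rate, chunk_ms, bytes_per_sample=2, channels=1):
    """Size in bytes of one audio frame, assuming 16-bit mono PCM by default."""
    return sample_rate * chunk_ms // 1000 * bytes_per_sample * channels


def merge_fragments(events):
    """Join the text of fragments flagged is_final; interim fragments are dropped.

    Assumes each event is a dict with `text` and `is_final` keys.
    """
    return " ".join(e["text"] for e in events if e.get("is_final"))
```

With these defaults, one frame of 16 kHz audio at 100 ms is 3200 bytes. Joining final fragments with a space suits space-delimited languages; for Chinese or Japanese transcripts an empty separator would likely be more appropriate.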
`skills/ai/audio/aliyun-qwen-asr/` instead.

`references/sources.md`