| name | xiaoyuzhou-download |
| description | Download Xiaoyuzhou FM podcast episode audio files from xiaoyuzhoufm.com/episode links. Use when the user provides a Xiaoyuzhou episode URL or says in Chinese or English that they want to download this podcast, download this Xiaoyuzhou podcast, download this episode, download the audio, save the podcast audio, get the .m4a file, extract the audio URL, or verify a Xiaoyuzhou audio download. Chinese trigger phrases include 帮我下载这个播客, 下载这个播客, 下载这个小宇宙播客, 下载这期小宇宙, 把这期播客音频下下来, 把这个音频保存下来. |
Xiaoyuzhou Audio Download
Overview
Use the bundled script when the user asks to download a Xiaoyuzhou podcast episode in natural language, even if they do not explicitly say audio or .m4a. Fetch the episode page, extract the first .m4a URL from the HTML or embedded JSON, normalize escaped ampersands, and download the audio file.
Workflow
- Run
scripts/download_xiaoyuzhou_audio.py with the episode URL.
- Save into the user's current working directory unless they ask for another folder.
- Prefer a readable filename from the page title. Fall back to the episode id.
- After download, verify file size and type with
ls -lh and file.
- Report the absolute path to the downloaded
.m4a file.
Commands
Download directly:
python3 /Users/gai/.codex/skills/xiaoyuzhou-download/scripts/download_xiaoyuzhou_audio.py \
'https://www.xiaoyuzhoufm.com/episode/...'
Download to a specific directory:
python3 /Users/gai/.codex/skills/xiaoyuzhou-download/scripts/download_xiaoyuzhou_audio.py \
'https://www.xiaoyuzhoufm.com/episode/...' \
--output-dir /path/to/output
Only extract the audio URL:
python3 /Users/gai/.codex/skills/xiaoyuzhou-download/scripts/download_xiaoyuzhou_audio.py \
'https://www.xiaoyuzhoufm.com/episode/...' \
--dry-run
Notes
The audio URL usually appears in the server returned page source as a .m4a URL. It may be HTML escaped as & or JSON escaped as \u0026. The script handles both forms.
If the script cannot find a .m4a URL, fetch the page manually with curl -L and inspect whether Xiaoyuzhou changed the page structure or requires browser rendering.