Agent setup: If your agent doesn't auto-load skills (e.g. Claude Code),
see agent-compatibility.md once per session.

Qwen Audio TTS (Text-to-Speech)

Synthesize natural speech from text using Qwen TTS models.
This skill is part of qwencloud/qwencloud-ai.

Skill directory

Use this skill's internal files to execute and learn. Load reference files on demand when the default path fails or you need details.

Location	Purpose
INLINECODE0	Qwen TTS (HTTP API) — qwen3-tts-flash, qwen3-tts-instruct-flash
INLINECODE1

Security

NEVER output any API key or credential in plaintext. Always use variable references ($DASHSCOPE_API_KEY in shell, os.environ["DASHSCOPE_API_KEY"] in Python). Any check or detection of credentials must be non-plaintext: report only status (e.g. "set" / "not set", "valid" / "invalid"), never the value. Never display contents of .env or config files that may contain secrets.

When the API key is not configured, NEVER ask the user to provide it directly. Instead, help create a .env file with a placeholder (DASHSCOPE_API_KEY=sk-your-key-here) and instruct the user to replace it with their actual key from the QwenCloud Console. Only write the actual key value if the user explicitly requests it.

Key Compatibility

Scripts require a standard QwenCloud API key (sk-...). Coding Plan keys (sk-sp-...) cannot be used — TTS models are not available on Coding Plan, and Coding Plan does not support the native QwenCloud API. The script detects sk-sp- keys at startup and prints a warning. If qwencloud-ops-auth is installed, see its references/codingplan.md for full details.

Model Selection

Qwen TTS (HTTP API) — use `scripts/tts.py`

Model	Use Case	Notes
INLINECODE19	Recommended (standard) — fast, multi-language	Cost-effective, widely available
INLINECODE20

Instruction-guided style control | Tone/emotion via instructions |

CosyVoice (WebSocket API) — use `scripts/tts_cosyvoice.py`

Model	Use Case
INLINECODE22	High quality, fast
INLINECODE23

Highest quality |

Note: CosyVoice requires dashscope SDK and uses different voices. See cosyvoice-guide.md.

1. User specified a model → use the appropriate script:

- qwen3-tts-* → scripts/tts.py - cosyvoice-* → scripts/tts_cosyvoice.py

2. Consult the qwencloud-model-selector skill when model choice depends on capability, scenario, or pricing.
No signal, clear task → qwen3-tts-flash via tts.py (default for standard tasks).

⚠️ Important: The model list above is a point-in-time snapshot and may be outdated. Model availability
changes frequently. Always check the official model list
for the authoritative, up-to-date catalog before making model decisions.

Available Voices

Voice	Description	Script
Cherry, Ethan, Serena	Qwen TTS system voices	INLINECODE31
longanyang, longanhuan, longhuhu_v3

CosyVoice verified voices | tts_cosyvoice.py |

Full lists: api-guide.md (Qwen TTS) · cosyvoice-guide.md (CosyVoice)

Execution

⚠️ Multiple artifacts: When generating multiple files in a single session, you MUST append a numeric suffix to each filename (e.g. out_1.wav, out_2.wav) to prevent overwrites.

Qwen TTS (HTTP API) — `tts.py`

Prerequisites

- API Key: Check that DASHSCOPE_API_KEY (or QWEN_API_KEY) is set using a non-plaintext check only (e.g. in shell: [ -n "$DASHSCOPE_API_KEY" ]; report only "set" or "not set", never the key value). If not set: run the qwencloud-ops-auth skill if available; otherwise guide the user to obtain a key from QwenCloud Console and set it via .env file (echo 'DASHSCOPE_API_KEY=sk-your-key-here' >> .env in project root or current directory) or environment variable. The script searches for .env in the current working directory and the project root. Skills may be installed independently — do not assume qwencloud-ops-auth is present.
Python 3.9+ (stdlib only, no pip install needed)

Environment Check

Before first execution, verify Python is available:

CODEBLOCK0

If python3 is not found, try python --version or py -3 --version. If Python is unavailable or below 3.9, skip to Path 2 (curl) in execution-guide.md.

Default: Run Script

Script path: Scripts are in the scripts/ subdirectory of this skill's directory (the directory containing this SKILL.md). You MUST first locate this skill's installation directory, then ALWAYS use the full absolute path to execute scripts. Do NOT assume scripts are in the current working directory. Do NOT use cd to switch directories before execution.

Execution note: Run all scripts in the foreground — wait for stdout; do not background.

Discovery: Run python3 <this-skill-dir>/scripts/tts.py --help first to see all available arguments.

CODEBLOCK1

Argument	Description
INLINECODE48	JSON request body
INLINECODE49

Verify Result

- Exit code 0 + output contains valid JSON with output.audio field → success
Non-zero exit, HTTP error, empty response, or error JSON → fail
Post-execution check: Verify the output audio file exists and has non-zero size (ls -la <output_dir>)
MANDATORY — stderr signal check: After confirming the result, scan the command's stderr output for [ACTION_REQUIRED] or [UPDATE_AVAILABLE]. If either signal is present, you MUST follow the instructions in Update Check below before responding to the user.

On Failure

If the script fails, match the error output against the diagnostic table below to determine the resolution. If no match, read execution-guide.md for alternative paths: curl commands (Path 2 — standard, instruct, streaming), code generation (Path 3), and autonomous resolution (Path 5).

If Python is not available at all → skip directly to Path 2 (curl) in execution-guide.md.

CosyVoice — `tts_cosyvoice.py`

CosyVoice requires dashscope SDK. Quick start:

CODEBLOCK2

Full guide: cosyvoice-guide.md (setup, voices, examples, errors)

Error Pattern	Diagnosis	Resolution
INLINECODE61	Python not on PATH	Try `python` or `py -3`; install Python 3.9+ if missing
INLINECODE64

Quick Reference

Request Fields

Field	Type	Description
INLINECODE80	string	Required — text to synthesize (max 600 chars)
INLINECODE81

Response Fields

Field	Description
INLINECODE96	URL of generated audio (valid 24h)
INLINECODE97

Important Notes

- text: Max 600 characters per request.
instructions: Only works with qwen3-tts-instruct-flash.
languagetype: Auto for mixed language; specify for better pronunciation.
audiourl: Valid for 24 hours — download promptly.
Real-time/streaming TTS: For WebSocket-based real-time TTS (CosyVoice, qwen3-tts-flash-realtime), a WebSocket client is required. This skill covers the HTTP-based non-real-time API. For real-time streaming use cases, refer to the official docs in sources.md.

Cross-Skill Chaining

When passing generated audio to another skill (e.g., video-gen audio overlay):

- Pass audio_url directly — scripts detect URL prefixes and pass through without re-upload
Use audio_file only for local playback or non-API operations

Error Handling

Error	Cause	Action
INLINECODE104	Invalid or missing API key	Run qwencloud-ops-auth if available; else prompt user to set key (non-plaintext check only)
INLINECODE105

Output Location

Prefer the current working directory. Default subdirectory: ./output/qwencloud-audio-tts/.

Write prohibition: NEVER write output files into this skill's installation directory or any skills/ hierarchy. All generated content must go to output/ under the current working directory or a user-specified path.

Update Check (MANDATORY Post-Execution)

After script execution, check stderr for action signals. The script automatically detects update status and emits structured signals. You MUST act on them. This section also applies when the user explicitly asks to check for updates.

Signal: `[ACTION_REQUIRED] qwencloud-update-check skill is not installed`

When stderr contains this signal, you MUST present the following options to the user and wait for their response before proceeding. Do NOT silently skip this.

The update-check skill is not installed. It keeps you informed about new qwencloud/qwencloud-ai features and bug fixes.
1) Install now — I'll run the install command for you
2) Skip for now — do nothing this time
3) Never remind me — permanently suppress this prompt

After the user responds:

- User chooses 1 → run: npx skills add QwenCloud/qwencloud-ai --skill qwencloud-update-check -y, then run: INLINECODE113
User chooses 2 → run: python3 <this-skill-scripts-dir>/gossamer.py --dismiss, then continue normally
User chooses 3 → run: python3 <this-skill-scripts-dir>/gossamer.py --never-install, then continue normally

Signal: `[UPDATE_AVAILABLE]`

When stderr contains this signal, you MUST append a brief update notice to your response, including the version info and the update command shown in the stderr output.

No signal in stderr

If stderr contains neither [ACTION_REQUIRED] nor [UPDATE_AVAILABLE], no action is needed — the skill is installed and up to date (or cached within 24h).

Explicit user request

When the user explicitly asks to check for updates (e.g. "check for updates", "check version"):

1. Look for qwencloud-update-check/SKILL.md in sibling skill directories.
If found — run: python3 <qwencloud-update-check-dir>/scripts/check_update.py --print-response and report the result.
If not found — present the install options above.

References

- execution-guide.md — Fallback paths (curl standard/instruct/streaming, code generation, autonomous)
api-guide.md — API supplementary guide
sources.md — Official documentation URLs

Agent 设置：如果你的代理不会自动加载技能（例如 Claude Code），
请在每个会话中查看一次 agent-compatibility.md。

Qwen 音频 TTS（文本转语音）

使用 Qwen TTS 模型从文本合成自然语音。
此技能是 qwencloud/qwencloud-ai 的一部分。

技能目录

使用此技能的内部文件来执行和学习。当默认路径失败或你需要详细信息时，按需加载参考文件。

位置	用途
scripts/tts.py	Qwen TTS（HTTP API）— qwen3-tts-flash, qwen3-tts-instruct-flash
scripts/tts_cosyvoice.py

安全

切勿以明文形式输出任何 API 密钥或凭证。 始终使用变量引用（shell 中使用 $DASHSCOPEAPIKEY，Python 中使用 os.environ[DASHSCOPEAPIKEY]）。任何对凭证的检查或检测必须非明文：仅报告状态（例如“已设置”/“未设置”、“有效”/“无效”），切勿输出值。切勿显示可能包含密钥的 .env 或配置文件内容。

当 API 密钥未配置时，切勿要求用户直接提供。 相反，应帮助创建一个包含占位符（DASHSCOPEAPIKEY=sk-your-key-here）的 .env 文件，并指导用户从 QwenCloud 控制台替换为他们的实际密钥。仅当用户明确请求时才写入实际的密钥值。

密钥兼容性

脚本需要标准的 QwenCloud API 密钥（sk-...）。编程计划密钥（sk-sp-...）无法使用 — TTS 模型在编程计划上不可用，且编程计划不支持原生 QwenCloud API。脚本在启动时会检测 sk-sp- 密钥并打印警告。如果安装了 qwencloud-ops-auth，请参阅其 references/codingplan.md 了解完整详情。

模型选择

Qwen TTS（HTTP API）— 使用 scripts/tts.py

模型	用例	备注
qwen3-tts-flash	推荐（标准）— 快速、多语言	性价比高，广泛可用
qwen3-tts-instruct-flash

指令引导的风格控制 | 通过指令控制语气/情感 |

CosyVoice（WebSocket API）— 使用 scripts/tts_cosyvoice.py

模型	用例
cosyvoice-v3-flash	高质量，快速
cosyvoice-v3-plus

最高质量 |

注意：CosyVoice 需要 dashscope SDK 并使用不同的音色。请参阅 cosyvoice-guide.md。

1. 用户指定了模型 → 使用相应的脚本：

- qwen3-tts-* → scripts/tts.py - cosyvoice-* → scripts/tts_cosyvoice.py

2. 当模型选择取决于能力、场景或定价时，请咨询 qwencloud-model-selector 技能。
无信号，任务明确 → 通过 tts.py 使用 qwen3-tts-flash（标准任务的默认选项）。

⚠️ 重要：上述模型列表是某个时间点的快照，可能已过时。模型可用性
频繁变化。在做出模型决策之前，请始终查看官方模型列表
以获取权威、最新的目录。

可用音色

音色	描述	脚本
Cherry, Ethan, Serena	Qwen TTS 系统音色	tts.py
longanyang, longanhuan, longhuhuv3

CosyVoice 已验证音色 | ttscosyvoice.py |

完整列表：api-guide.md（Qwen TTS）· cosyvoice-guide.md（CosyVoice）

执行

⚠️ 多个文件：在单次会话中生成多个文件时，你必须在每个文件名后附加数字后缀（例如 out1.wav、out2.wav）以防止覆盖。

Qwen TTS（HTTP API）— tts.py

前提条件

- API 密钥：使用非明文检查确认 DASHSCOPEAPIKEY（或 QWENAPIKEY）已设置（例如在 shell 中：[ -n $DASHSCOPEAPIKEY ]；仅报告“已设置”或“未设置”，切勿输出密钥值）。如果未设置：如果可用，运行 qwencloud-ops-auth 技能；否则指导用户从 QwenCloud 控制台获取密钥，并通过 .env 文件（在项目根目录或当前目录执行 echo DASHSCOPEAPI_KEY=sk-your-key-here >> .env）或环境变量进行设置。脚本会在当前工作目录和项目根目录中搜索 .env。技能可能独立安装 — 不要假设 qwencloud-ops-auth 存在。
Python 3.9+（仅标准库，无需 pip 安装）

环境检查

在首次执行前，确认 Python 可用：

bash
python3 --version # 必须为 3.9+

如果找不到 python3，请尝试 python --version 或 py -3 --version。如果 Python 不可用或版本低于 3.9，请跳转到 execution-guide.md 中的路径 2（curl）。

默认：运行脚本

脚本路径：脚本位于此技能目录（包含此 SKILL.md 的目录）的 scripts/ 子目录中。你必须首先找到此技能的安装目录，然后始终使用完整的绝对路径来执行脚本。 不要假设脚本在当前工作目录中。不要在执行前使用 cd 切换目录。

执行说明：在前台运行所有脚本 — 等待标准输出；不要后台运行。

发现：首先运行 python3 <此技能目录>/scripts/tts.py --help 查看所有可用参数。

bash
python3 <此技能目录>/scripts/tts.py \
--request {text:Hello, this is a test.,voice:Cherry} \
--output output/qwencloud-audio-tts/ \
--print-response

参数	描述
--request {...}	JSON 请求体
--file path.json

验证结果

- 退出码 0 + 输出包含带有 output.audio 字段的有效 JSON → 成功
非零退出、HTTP 错误、空响应或错误 JSON → 失败
执行后检查：确认输出音频文件存在且大小非零（ls -la <输出目录>）
强制 — 标准错误信号检查：确认结果后，扫描命令的标准错误输出中是否包含 [ACTIONREQUIRED] 或 [UPDATEAVAILABLE]。如果存在任一信号，你必须在回复用户之前按照下方更新检查中的说明进行操作。

失败时

如果脚本失败，将错误输出与下面的诊断表进行匹配以确定解决方案。如果没有匹配项，请阅读 execution-guide.md 了解替代路径：curl

qwencloud-audio-ttsQwen语音合成

qwencloud-audio-tts

Qwen Audio TTS (Text-to-Speech)

Skill directory

Security

Key Compatibility

Model Selection

Qwen TTS (HTTP API) — use scripts/tts.py

CosyVoice (WebSocket API) — use scripts/tts_cosyvoice.py

Available Voices

Execution

Qwen TTS (HTTP API) — tts.py

Prerequisites

Environment Check

Default: Run Script

Verify Result

On Failure

CosyVoice — tts_cosyvoice.py

Quick Reference

Request Fields

Response Fields

Important Notes

Cross-Skill Chaining

Error Handling

Output Location

Update Check (MANDATORY Post-Execution)

Signal: [ACTION_REQUIRED] qwencloud-update-check skill is not installed

Signal: [UPDATE_AVAILABLE]

No signal in stderr

Explicit user request

References

Qwen 音频 TTS（文本转语音）

技能目录

安全

密钥兼容性

模型选择

Qwen TTS（HTTP API）— 使用 scripts/tts.py

CosyVoice（WebSocket API）— 使用 scripts/tts_cosyvoice.py

可用音色

执行

Qwen TTS（HTTP API）— tts.py

前提条件

环境检查

默认：运行脚本

验证结果

失败时

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement

Qwen TTS (HTTP API) — use `scripts/tts.py`

CosyVoice (WebSocket API) — use `scripts/tts_cosyvoice.py`

Qwen TTS (HTTP API) — `tts.py`

CosyVoice — `tts_cosyvoice.py`

Signal: `[ACTION_REQUIRED] qwencloud-update-check skill is not installed`

Signal: `[UPDATE_AVAILABLE]`