Category: provider
Model Studio Qwen TTS Voice Clone
Use voice cloning models to replicate timbre from enrollment audio samples.
Critical model names
Use one of these exact model strings:
- qwen3-tts-vc-2026-01-22
- qwen3-tts-vc-realtime-2026-01-15
Prerequisites
- Install the SDK in a virtual environment:
    python3 -m venv .venv
    . .venv/bin/activate
    python -m pip install dashscope
- Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
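The lookup order described above (environment variable first, credentials file second) could be sketched as follows. The layout of ~/.alibabacloud/credentials is not specified here, so the INI layout with a dashscope_api_key entry under some section is an assumption, and resolve_api_key is a hypothetical helper name.

```python
import configparser
import os
from pathlib import Path
from typing import Optional


def resolve_api_key(credentials_path: Optional[Path] = None) -> Optional[str]:
    """Return the key from DASHSCOPE_API_KEY, else from the credentials file."""
    key = os.environ.get("DASHSCOPE_API_KEY")
    if key and key.strip():
        return key.strip()

    path = credentials_path or Path.home() / ".alibabacloud" / "credentials"
    if not path.is_file():
        return None

    # Assumption: INI layout such as "[default]\ndashscope_api_key = sk-...".
    parser = configparser.ConfigParser(interpolation=None)
    try:
        parser.read(path, encoding="utf-8")
    except configparser.Error:
        return None  # not an INI file, so nothing can be read from it

    for section in parser.sections():
        value = parser.get(section, "dashscope_api_key", fallback=None)
        if value and value.strip():
            return value.strip()
    return None
```

Returning None instead of raising lets the caller decide whether a missing key is fatal or whether to prompt for one.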
Normalized interface (tts.voice_clone)
Request
- text (string, required)
- voice_sample (string | bytes, required) enrollment sample
- voice_name (string, optional)
- stream (bool, optional)
Response
- audio_url (string) or streaming PCM chunks
- voice_id (string)
- request_id (string)
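As a rough illustration of the normalized interface, the request fields (text, voice_sample, voice_name, stream) and response fields (audio_url, voice_id, request_id) can be modelled with a dataclass and a small validator. This is a sketch, not the helper script's actual schema: the class and function names are hypothetical, the default of stream=False is an assumption, and so is treating voice_id and request_id as always present.

```python
from dataclasses import dataclass
from typing import Optional, Union


@dataclass
class VoiceCloneRequest:
    text: str
    voice_sample: Union[str, bytes]   # URL/path string or raw audio bytes
    voice_name: Optional[str] = None
    stream: bool = False              # assumption: non-streaming by default

    def validate(self) -> None:
        """Raise ValueError if a field violates the documented types."""
        if not isinstance(self.text, str) or not self.text.strip():
            raise ValueError("text is required and must be a non-empty string")
        if not isinstance(self.voice_sample, (str, bytes)) or not self.voice_sample:
            raise ValueError("voice_sample is required (string or bytes)")
        if self.voice_name is not None and not isinstance(self.voice_name, str):
            raise ValueError("voice_name must be a string when given")
        if not isinstance(self.stream, bool):
            raise ValueError("stream must be a bool")


def validate_response(payload: dict) -> None:
    """Check a response dict against the documented field types."""
    # Assumption: voice_id and request_id are present on every response.
    for field in ("voice_id", "request_id"):
        if not isinstance(payload.get(field), str):
            raise ValueError(f"{field} must be a string")
    # audio_url may be absent when audio arrives as streamed PCM chunks.
    if "audio_url" in payload and not isinstance(payload["audio_url"], str):
        raise ValueError("audio_url must be a string when present")
```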
Operational guidance
- Use clean speech samples with low background noise.
- Respect consent and policy requirements for cloned voices.
- Persist the generated voice_id and reuse it for future synthesis requests.
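One way to persist and reuse a voice_id is a small JSON file keyed by voice name, so enrollment runs once per voice. The sketch below is illustrative only: the function names, the cache file location, and keying by voice_name are assumptions rather than part of the skill.

```python
import json
from pathlib import Path
from typing import Optional


def load_voice_id(cache_file: Path, voice_name: str) -> Optional[str]:
    """Return the stored voice_id for voice_name, or None if not enrolled yet."""
    if not cache_file.is_file():
        return None
    voices = json.loads(cache_file.read_text(encoding="utf-8"))
    return voices.get(voice_name)


def save_voice_id(cache_file: Path, voice_name: str, voice_id: str) -> None:
    """Record voice_id under voice_name, keeping any existing entries."""
    voices = {}
    if cache_file.is_file():
        voices = json.loads(cache_file.read_text(encoding="utf-8"))
    voices[voice_name] = voice_id
    cache_file.parent.mkdir(parents=True, exist_ok=True)
    cache_file.write_text(
        json.dumps(voices, indent=2, sort_keys=True), encoding="utf-8"
    )
```

A caller would check load_voice_id first and only send the enrollment sample when it returns None.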
Local helper script
Prepare a normalized request JSON and validate response schema:
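    .venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/prepare_voice_clone_request.py \
      --text "Welcome to this voice cloning demo" \
      --voice-sample https://example.com/voice-sample.wav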
Output location
- Default output: output/ai-audio-tts-voice-clone/audio/
- Override the base directory with OUTPUT_DIR.
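A minimal sketch of how the output directory might be resolved, with the default being output/ai-audio-tts-voice-clone/audio/. It assumes that OUTPUT_DIR replaces only the leading output component of that path. The helper script may interpret the variable differently, and resolve_output_dir is a hypothetical name.

```python
import os
from pathlib import Path

# Sub-path below the base directory, taken from the documented default.
DEFAULT_SUBDIR = Path("ai-audio-tts-voice-clone") / "audio"


def resolve_output_dir() -> Path:
    """Return <base>/ai-audio-tts-voice-clone/audio, where base defaults to 'output'."""
    # Assumption: an unset or empty OUTPUT_DIR falls back to "output".
    base = Path(os.environ.get("OUTPUT_DIR") or "output")
    return base / DEFAULT_SUBDIR
```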
Validation
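    mkdir -p output/alicloud-ai-audio-tts-voice-clone
    for f in skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/*.py; do
      python3 -m py_compile "$f"
    done
    echo "py_compile_ok" > output/alicloud-ai-audio-tts-voice-clone/validate.txt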
Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-voice-clone/validate.txt is generated.
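Where a POSIX shell is not available, the same check can be approximated in Python with the standard py_compile module. This is only a sketch of an equivalent: compile_scripts is a hypothetical helper, and the marker text written to validate.txt is arbitrary.

```python
import py_compile
from pathlib import Path


def compile_scripts(scripts_dir: Path, marker: Path) -> int:
    """Byte-compile every *.py in scripts_dir, then write the marker file.

    Raises py_compile.PyCompileError on the first script that fails, in which
    case the marker file is not written. Returns the number of scripts checked.
    """
    count = 0
    for script in sorted(scripts_dir.glob("*.py")):
        py_compile.compile(str(script), doraise=True)
        count += 1
    marker.parent.mkdir(parents=True, exist_ok=True)
    marker.write_text("py_compile_ok\n", encoding="utf-8")
    return count
```

For this skill it would be called with skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts as scripts_dir and output/alicloud-ai-audio-tts-voice-clone/validate.txt as marker.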
Output And Evidence
- Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-voice-clone/.
- Include key parameters (region/resource id/time range) in evidence files for reproducibility.
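An evidence file that records the key parameters next to a response summary could look like the sketch below. The JSON layout, the field names (recorded_at, parameters, response_summary), and the write_evidence helper are all illustrative assumptions. The skill does not prescribe a format.

```python
import json
from datetime import datetime, timezone
from pathlib import Path


def write_evidence(out_dir: Path, name: str, params: dict, summary: dict) -> Path:
    """Write <out_dir>/<name>.json holding parameters and a response summary."""
    out_dir.mkdir(parents=True, exist_ok=True)
    record = {
        "recorded_at": datetime.now(timezone.utc).isoformat(),
        "parameters": params,          # e.g. region, resource id, time range
        "response_summary": summary,   # e.g. voice_id, request_id
    }
    path = out_dir / f"{name}.json"
    path.write_text(
        json.dumps(record, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return path
```

Secrets such as the API key should be left out of params so that evidence files remain safe to share.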
Workflow
1) Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
2) Run one minimal read-only query first to verify connectivity and permissions.
3) Execute the target operation with explicit parameters and bounded scope.
4) Verify results and save output/evidence files.
References
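Skill name: alicloud-ai-audio-tts-voice-clone
Detailed description:
Category: provider
Model Studio Qwen TTS Voice Clone
Use voice cloning models to replicate timbre from enrollment audio samples.
Critical model names
Use one of these exact model strings:
- qwen3-tts-vc-2026-01-22
- qwen3-tts-vc-realtime-2026-01-15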
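Because the model strings must match exactly, a small guard can reject typos before any request is sent. The sketch below hard-codes the two strings listed above (qwen3-tts-vc-2026-01-22 and qwen3-tts-vc-realtime-2026-01-15). The require_model helper is a hypothetical name, not part of the dashscope SDK.

```python
ALLOWED_MODELS = (
    "qwen3-tts-vc-2026-01-22",
    "qwen3-tts-vc-realtime-2026-01-15",
)


def require_model(name: str) -> str:
    """Return name unchanged if it is an allowed model string, else raise."""
    if name not in ALLOWED_MODELS:
        raise ValueError(
            f"unknown model {name!r}; expected one of {ALLOWED_MODELS}"
        )
    return name
```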
Prerequisites
- Install the SDK in a virtual environment:
    python3 -m venv .venv
    . .venv/bin/activate
    python -m pip install dashscope
- Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
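Normalized interface (tts.voice_clone)
Request
- text (string, required)
- voice_sample (string | bytes, required) enrollment sample
- voice_name (string, optional)
- stream (bool, optional)
Response
- audio_url (string) or streaming PCM chunks
- voice_id (string)
- request_id (string)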
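Operational guidance
- Use clean speech samples with low background noise.
- Respect consent and policy requirements for cloned voices.
- Persist the generated voice_id and reuse it for future synthesis requests.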
Local helper script
Prepare a normalized request JSON and validate response schema:
    .venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/prepare_voice_clone_request.py \
      --text "Welcome to this voice cloning demo" \
      --voice-sample https://example.com/voice-sample.wav
Output location
- Default output: output/ai-audio-tts-voice-clone/audio/
- Override the base directory with OUTPUT_DIR.
Validation
    mkdir -p output/alicloud-ai-audio-tts-voice-clone
    for f in skills/ai/audio/alicloud-ai-audio-tts-voice-clone/scripts/*.py; do
      python3 -m py_compile "$f"
    done
    echo "py_compile_ok" > output/alicloud-ai-audio-tts-voice-clone/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-voice-clone/validate.txt is generated.
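Output And Evidence
- Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-voice-clone/.
- Include key parameters (region/resource id/time range) in evidence files for reproducibility.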
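Workflow
1) Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
2) Run one minimal read-only query first to verify connectivity and permissions.
3) Execute the target operation with explicit parameters and bounded scope.
4) Verify results and save output/evidence files.
References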