Edge-TTS Skill

Overview

Generate high-quality text-to-speech audio using Microsoft Edge's neural TTS service via Python edge-tts. Supports multiple languages, voices, adjustable speed/pitch, and subtitle generation (SRT/VTT).

Quick Start

When you detect TTS intent from triggers or user request:

1. Call the tts tool (Clawdbot built-in) to convert text to speech
The tool returns a MEDIA: path
Clawdbot routes the audio to the current channel

Trigger Detection

Recognize "tts" keyword as TTS requests. The skill automatically filters out TTS-related keywords from text before conversion.

Advanced Customization

Using the Python Scripts

For more control, use the bundled scripts directly:

TTS Converter

CODEBLOCK0

Options:

- --voice, -v: Voice name (default: en-US-MichelleNeural)
INLINECODE1: Language code (e.g., en-US, zh-CN)
INLINECODE2: Rate adjustment (e.g., +10%, -20%)
INLINECODE3: Volume adjustment (e.g., +0%, -50%)
INLINECODE4: Pitch adjustment (e.g., +0Hz, -10Hz)
INLINECODE5: Output file path (default: temp file)
INLINECODE6: Save subtitles to file (.vtt or .srt)
INLINECODE7: Read text from file
INLINECODE8: Proxy URL
INLINECODE9: Receive timeout in seconds (default: 60)
INLINECODE10: List available voices
INLINECODE11: Filter voices by language (used with --list-voices)

Configuration Manager

CODEBLOCK1

Voice Selection

Common voices (use --list-voices for full list):

English:

- en-US-MichelleNeural (female, natural, default)
INLINECODE14 (female, natural)
INLINECODE15 (male, natural)
INLINECODE16 (female, British)
INLINECODE17 (male, British)

Chinese:

- zh-CN-XiaoxiaoNeural (female)
INLINECODE19 (male, news style)
INLINECODE20 (male, natural)

Other Languages:

- es-ES-ElviraNeural (Spanish)
INLINECODE22 (French)
INLINECODE23 (German)
INLINECODE24 (Japanese)
INLINECODE25 (Arabic)

Rate Guidelines

Rate values use percentage format:

- "+0%": Normal speed (default)
INLINECODE27 to "-10%": Slow, clear (tutorials, stories, accessibility)
INLINECODE29 to "+20%": Slightly fast (summaries)
INLINECODE31 to "+50%": Fast (news, efficiency)

Resources

scripts/tts_converter.py

Main TTS conversion script using edge-tts. Generates audio files with customizable voice, rate, volume, pitch. Supports subtitle generation (VTT/SRT) and voice listing.

scripts/config_manager.py

Manages persistent user preferences for TTS settings. Stores config in ~/.tts-config.json.

Voice Testing

Test different voices and preview audio quality at: https://tts.travisvn.com/

Installation

CODEBLOCK2

Workflow

1. Detect intent: Check for "tts" trigger or keyword in user message
Choose method: Use built-in tts tool for simple requests, or scripts/tts_converter.py for customization
Generate audio: Convert the target text
Return to user: The tts tool returns a MEDIA: path; Clawdbot handles delivery

Testing

Basic Test

CODEBLOCK3

Chinese Test

CODEBLOCK4

List Voices

CODEBLOCK5

Configuration Test

CODEBLOCK6

Notes

- edge-tts uses Microsoft Edge's online TTS service
No API key needed (free service)
Output is MP3 format by default
Requires internet connection
Supports subtitle generation (standard VTT/SRT format)
Temporary File Handling: By default, audio files are saved to the system's temporary directory with unique filenames. Specify a custom output path with --output for permanent storage.
TTS keyword filtering: Automatically filters out TTS-related keywords from text before conversion
Neural voices (ending in Neural) provide higher quality

Edge-TTS 技能

概述

通过 Python edge-tts 使用微软 Edge 的神经 TTS 服务，生成高质量文本转语音音频。支持多种语言、声音、可调节语速/音调，以及字幕生成（SRT/VTT）。

快速开始

当您从触发器或用户请求中检测到 TTS 意图时：

1. 调用 tts 工具（Clawdbot 内置）将文本转换为语音
该工具返回一个 MEDIA: 路径
Clawdbot 将音频路由到当前频道

触发器检测

将tts关键词识别为 TTS 请求。该技能在转换前会自动从文本中过滤掉与 TTS 相关的关键词。

高级自定义

使用 Python 脚本

如需更多控制，可直接使用捆绑脚本：

TTS 转换器

bash cd scripts python3 tts_converter.py 您的文本 --voice en-US-AriaNeural --rate +10% -o output.mp3 python3 tts_converter.py -f input.txt --voice zh-CN-XiaoxiaoNeural -o output.mp3 python3 tts_converter.py -f input.txt -v zh-CN-YunxiNeural -r +10% -o output.mp3 -s output.vtt

选项：

- --voice, -v：声音名称（默认：en-US-MichelleNeural）
--lang, -l：语言代码（例如 en-US、zh-CN）
--rate, -r：语速调整（例如 +10%、-20%）
--volume：音量调整（例如 +0%、-50%）
--pitch：音调调整（例如 +0Hz、-10Hz）
--output, -o：输出文件路径（默认：临时文件）
--subtitles, -s：保存字幕到文件（.vtt 或 .srt）
--file, -f：从文件读取文本
--proxy, -p：代理 URL
--timeout：接收超时时间（秒，默认：60）
--list-voices, -L：列出可用声音
--lang-filter：按语言过滤声音（与 --list-voices 一起使用）

配置管理器

bash cd scripts python3 config_manager.py --set voice zh-CN-XiaoxiaoNeural python3 config_manager.py --set rate +10% python3 config_manager.py --get python3 config_manager.py --reset

声音选择

常见声音（使用 --list-voices 查看完整列表）：

英语：

- en-US-MichelleNeural（女声，自然，默认）
en-US-AriaNeural（女声，自然）
en-US-GuyNeural（男声，自然）
en-GB-SoniaNeural（女声，英式）
en-GB-RyanNeural（男声，英式）

中文：

- zh-CN-XiaoxiaoNeural（女声）
zh-CN-YunyangNeural（男声，新闻风格）
zh-CN-YunxiNeural（男声，自然）

其他语言：

- es-ES-ElviraNeural（西班牙语）
fr-FR-DeniseNeural（法语）
de-DE-KatjaNeural（德语）
ja-JP-NanamiNeural（日语）
ar-SA-ZariyahNeural（阿拉伯语）

语速指南

语速值使用百分比格式：

- +0%：正常速度（默认）
-20% 到 -10%：慢速、清晰（教程、故事、无障碍）
+10% 到 +20%：稍快（摘要）
+30% 到 +50%：快速（新闻、高效）

资源

scripts/tts_converter.py

使用 edge-tts 的主要 TTS 转换脚本。生成可自定义声音、语速、音量、音调的音频文件。支持字幕生成（VTT/SRT）和声音列表。

scripts/config_manager.py

管理 TTS 设置的持久化用户偏好。将配置存储在 ~/.tts-config.json 中。

声音测试

在 https://tts.travisvn.com/ 测试不同声音并预览音频质量。

安装

bash
pip install edge-tts

工作流程

1. 检测意图：检查用户消息中是否有tts触发器或关键词
选择方法：简单请求使用内置的 tts 工具，自定义需求使用 scripts/tts_converter.py
生成音频：转换目标文本
返回给用户：tts 工具返回 MEDIA: 路径；Clawdbot 负责投递

测试

基础测试

bash cd scripts python3 tts_converter.py 你好，这是一个测试。 -o test-output.mp3

中文测试

bash python3 tts_converter.py 这是一个测试 -v zh-CN-XiaoxiaoNeural -o test-zh.mp3

列出声音

bash python3 tts_converter.py --list-voices --lang-filter zh

配置测试

bash python3 config_manager.py --get python3 config_manager.py --set voice en-US-GuyNeural python3 config_manager.py --get voice

注意事项

- edge-tts 使用微软 Edge 的在线 TTS 服务
无需 API 密钥（免费服务）
默认输出为 MP3 格式
需要网络连接
支持字幕生成（标准 VTT/SRT 格式）
临时文件处理：默认情况下，音频文件以唯一文件名保存到系统临时目录。使用 --output 指定自定义输出路径以永久保存。
TTS 关键词过滤：在转换前自动从文本中过滤掉与 TTS 相关的关键词
神经声音（以 Neural 结尾）提供更高质量

lh-edge-ttslh边缘语音合成

lh-edge-tts

Edge-TTS Skill

Overview

Quick Start

Trigger Detection

Advanced Customization

Using the Python Scripts

TTS Converter

Configuration Manager

Voice Selection

Rate Guidelines

Resources

scripts/tts_converter.py

scripts/config_manager.py

Voice Testing

Installation

Workflow

Testing

Basic Test

Chinese Test

List Voices

Configuration Test

Notes

Edge-TTS 技能

概述

快速开始

触发器检测

高级自定义

使用 Python 脚本

TTS 转换器

配置管理器

声音选择

语速指南

资源

scripts/tts_converter.py

scripts/config_manager.py

声音测试

安装

工作流程

测试

基础测试

中文测试

列出声音

配置测试

注意事项

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement