openclaw-whisper-voice本地语音转文字

Local Whisper speech-to-text for audio files and inbound voice notes on the OpenClaw Gateway host. Use when setting up local transcription for WhatsApp, Telegram, or other audio attachments; when configuring tools.media.audio with a CLI fallback instead of a cloud API; or when you need a reusable shell entrypoint that makes Whisper + ffmpeg work reliably on Linux.

作者: admin | 来源: ClawHub

OpenClaw Whisper 语音

使用此技能可使本地 Whisper 转录功能依赖于 OpenClaw 网关主机。

在主机上安装

运行：

bash
{baseDir}/scripts/installlocalwhisper.sh

安装程序将：

- 将 Python 包安装到 ~/.local
安装 CPU 兼容的 PyTorch 构建
安装 openai-whisper
安装 imageio-ffmpeg
创建稳定的 ~/.local/bin/whisper 和 ~/.local/bin/ffmpeg 启动器

手动转录文件

当可靠性至关重要时，请使用包装器而非原始 whisper：

bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --model tiny --stdout-only
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --task translate --format srt

配置入站 WhatsApp 和 Telegram 语音消息

修改 OpenClaw 配置，使入站音频使用包装器：

json5
{
tools: {
media: {
audio: {
enabled: true,
maxBytes: 20971520,
timeoutSeconds: 120,
models: [
{
type: cli,
command: {baseDir}/scripts/transcribe.sh,
args: [{{MediaPath}}, --model, base, --stdout-only],
timeoutSeconds: 120
}
]
}
}
}
}

模型选择

- tiny：最快，准确度最低
base：聊天语音消息的最佳默认选择
small 或更大：准确度更高，CPU 和内存占用更大

输出规则

- 对于 tools.media.audio，使用 --stdout-only，使标准输出仅为转录文本。
对于独立文件转录，使用 --format txt|srt|vtt|json。
首次模型下载将保存到 ~/.cache/whisper。

openclaw-whisper-voice本地语音转文字

openclaw-whisper-voice

OpenClaw Whisper Voice

Install on the host

Transcribe a file manually

Configure inbound WhatsApp and Telegram voice notes

Model choices

Output rules

OpenClaw Whisper 语音

在主机上安装

手动转录文件

配置入站 WhatsApp 和 Telegram 语音消息

模型选择

输出规则

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

openclaw-whisper-voice本地语音转文字

openclaw-whisper-voice

OpenClaw Whisper Voice

Install on the host

Transcribe a file manually

Configure inbound WhatsApp and Telegram voice notes

Model choices

Output rules

OpenClaw Whisper 语音

在主机上安装

手动转录文件

配置入站 WhatsApp 和 Telegram 语音消息

模型选择

输出规则

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement