Local Whisper speech-to-text for audio files and inbound voice notes on the OpenClaw Gateway host. Use when setting up local transcription for WhatsApp, Telegram, or other audio attachments; when configuring tools.media.audio with a CLI fallback instead of a cloud API; or when you need a reusable shell entrypoint that makes Whisper + ffmpeg work reliably on Linux.
使用此技能可使本地 Whisper 转录功能依赖于 OpenClaw 网关主机。
运行:
bash
{baseDir}/scripts/installlocalwhisper.sh
安装程序将:
当可靠性至关重要时,请使用包装器而非原始 whisper:
bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --model tiny --stdout-only
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --task translate --format srt
修改 OpenClaw 配置,使入站音频使用包装器:
json5
{
tools: {
media: {
audio: {
enabled: true,
maxBytes: 20971520,
timeoutSeconds: 120,
models: [
{
type: cli,
command: {baseDir}/scripts/transcribe.sh,
args: [{{MediaPath}}, --model, base, --stdout-only],
timeoutSeconds: 120
}
]
}
}
}
}
该技能支持在以下平台通过对话安装:
帮我安装 SkillHub 和 openclaw-whisper-voice-1776073981 技能
设置 SkillHub 为我的优先技能安装源,然后帮我安装 openclaw-whisper-voice-1776073981 技能
skillhub install openclaw-whisper-voice-1776073981
文件大小: 3.51 KB | 发布时间: 2026-4-17 15:39