MoodCast
Transform any text into emotionally expressive audio with ambient soundscapes. MoodCast analyzes your content, adds expressive delivery using ElevenLabs v3 audio tags, and layers matching ambient soundscapes.
When to Use This Skill
Use MoodCast when the user wants to:
- - Hear text read with natural emotional expression
- Create audio versions of articles, stories, or scripts
- Generate expressive voiceovers with ambient atmosphere
- Listen to morning briefings that actually sound engaging
- Transform boring text into captivating audio content
Trigger phrases: "read this dramatically", "make this sound good", "create audio for", "moodcast this", "read with emotion", "narrate this"
Slash command: INLINECODE0
Core Capabilities
1. Emotion-Aware Text Enhancement
Automatically analyzes text and inserts appropriate v3 audio tags:
- - Emotions:
[excited], [nervous], [angry], [sorrowful], [calm], INLINECODE6 - Delivery:
[whispers], [shouts], [rushed], INLINECODE10 - Reactions:
[laughs], [sighs], [gasps], [giggles], INLINECODE15 - Pacing:
[pause], [breathes], [stammers], INLINECODE19
2. Ambient Soundscape Generation
Creates matching background audio using Sound Effects API:
- - News → subtle office ambiance
- Story → atmospheric soundscape matching mood
- Motivational → uplifting background
- Scary → tense, eerie atmosphere
3. Multi-Voice Dialogue
For conversations/scripts, assigns different voices to speakers with appropriate emotional delivery.
Instructions
Quick Read (Single Command)
CODEBLOCK0
With Ambient Sound
CODEBLOCK1
Save to File
CODEBLOCK2
Different Moods
CODEBLOCK3
List Available Voices
CODEBLOCK4
Custom Configuration
CODEBLOCK5
Emotion Detection Rules
The skill automatically detects and enhances:
| Text Pattern | Audio Tag Added |
|---|
| "amazing", "incredible", "wow" | INLINECODE20 |
| "scared", "afraid", "terrified" |
[nervous] |
| "angry", "furious", "hate" |
[angry] |
| "sad", "sorry", "unfortunately" |
[sorrowful] |
| "secret", "quiet", "between us" |
[whispers] |
| "!" exclamations |
[excited] |
| "..." trailing off |
[pause] |
| "haha", "lol" |
[laughs] |
| Questions | Natural rising intonation |
Example Transformations
Input:
CODEBLOCK6
Enhanced Output:
CODEBLOCK7
Input:
CODEBLOCK8
Enhanced Output:
CODEBLOCK9
Environment Variables
- -
ELEVENLABS_API_KEY (required) - Your ElevenLabs API key - INLINECODE29 (optional) - Default voice ID (defaults to
CwhRBWXzGAHq8TQ4Fs17) - INLINECODE31 (optional) - Default model ID (defaults to
eleven_v3) - INLINECODE33 (optional) - Default output format (defaults to
mp3_44100_128) - INLINECODE35 (optional) - Set to
"true" for automatic ambient sounds when using INLINECODE37
Configuration Priority: CLI arguments override environment variables, which override hardcoded defaults.
Technical Notes
- - Uses ElevenLabs Eleven v3 model for audio tag support
- Sound Effects API for ambient generation (up to 30 seconds)
- Free tier: 10,000 credits/month (~10 min audio)
- Max 2,400 characters per chunk (v3 supports 5,000, but we split conservatively for reliability)
- Audio tags must be lowercase:
[whispers] not INLINECODE39
Tips for Best Results
- 1. Dramatic content works best - stories, news, scripts
- Shorter segments (under 500 chars) sound more natural
- Combine with ambient for immersive experience
- Roger and Rachel voices are most expressive with v3
Credits
Built by ashutosh887
Using ElevenLabs Text-to-Speech v3 + Sound Effects API
Created for #ClawdEleven Hackathon
MoodCast
将任何文本转化为带有情感表达和环境音景的音频。MoodCast会分析您的内容,使用ElevenLabs v3音频标签添加富有表现力的朗读,并叠加匹配的环境音景。
何时使用此技能
当用户想要以下功能时,请使用MoodCast:
- - 以自然的情感表达聆听文本朗读
- 创建文章、故事或剧本的音频版本
- 生成带有环境氛围的富有表现力的配音
- 收听真正引人入胜的晨间简报
- 将枯燥文本转化为引人入胜的音频内容
触发短语: 戏剧化地朗读这个、让这个听起来更好、为这个创建音频、moodcast这个、带着感情朗读、叙述这个
斜杠命令: /moodcast
核心能力
1. 情感感知文本增强
自动分析文本并插入适当的v3音频标签:
- - 情感: [兴奋]、[紧张]、[愤怒]、[悲伤]、[平静]、[开心]
- 表达方式: [低语]、[喊叫]、[急促]、[放慢]
- 反应: [笑]、[叹气]、[倒吸一口气]、[咯咯笑]、[哭泣]
- 节奏: [停顿]、[呼吸]、[结巴]、[犹豫]
2. 环境音景生成
使用音效API创建匹配的背景音频:
- - 新闻 → 微妙的办公室氛围
- 故事 → 与情绪匹配的氛围音景
- 励志 → 振奋人心的背景
- 恐怖 → 紧张、诡异的氛围
3. 多角色对话
对于对话/剧本,为不同说话者分配不同的声音,并配以适当的情感表达。
使用说明
快速朗读(单条命令)
bash
python3 {baseDir}/scripts/moodcast.py --text 您的文本
带环境音效
bash
python3 {baseDir}/scripts/moodcast.py --text 您的文本 --ambient 咖啡店背景噪音
保存到文件
bash
python3 {baseDir}/scripts/moodcast.py --text 您的文本 --output story.mp3
不同情绪
bash
python3 {baseDir}/scripts/moodcast.py --text 您的文本 --mood 戏剧化
python3 {baseDir}/scripts/moodcast.py --text 您的文本 --mood 平静
python3 {baseDir}/scripts/moodcast.py --text 您的文本 --mood 兴奋
python3 {baseDir}/scripts/moodcast.py --text 您的文本 --mood 恐怖
列出可用声音
bash
python3 {baseDir}/scripts/moodcast.py --list-voices
自定义配置
bash
python3 {baseDir}/scripts/moodcast.py --text 您的文本 --voice VOICE
ID --model elevenv3 --output-format mp3
44100128
情感检测规则
该技能会自动检测并增强以下内容:
| 文本模式 | 添加的音频标签 |
|---|
| 太棒了、难以置信、哇 | [兴奋] |
| 害怕、恐惧、惊恐 |
[紧张] |
| 生气、愤怒、讨厌 | [愤怒] |
| 难过、抱歉、不幸的是 | [悲伤] |
| 秘密、安静、我们之间 | [低语] |
| !感叹号 | [兴奋] |
| ...省略号 | [停顿] |
| 哈哈、lol | [笑] |
| 疑问句 | 自然的升调 |
示例转换
输入:
突发新闻!科学家们发现了一些不可思议的东西。
这可能改变我们对宇宙的一切认知……
我简直不敢相信。
增强输出:
[兴奋] 突发新闻!科学家们发现了一些不可思议的东西。
[停顿] 这可能改变我们对宇宙的一切认知……
[倒吸一口气] [低语] 我简直不敢相信。
输入:
那是一个漆黑的夜晚。老房子吱吱作响。
阴影中有东西在移动……
谁在那里?她低声问道。
增强输出:
[放慢] 那是一个漆黑的夜晚。[停顿] 老房子吱吱作响。
[紧张] 阴影中有东西在移动……
[低语] 谁在那里?她低声问道。
环境变量
- - ELEVENLABSAPIKEY(必需)- 您的ElevenLabs API密钥
- MOODCASTDEFAULTVOICE(可选)- 默认声音ID(默认为CwhRBWXzGAHq8TQ4Fs17)
- MOODCASTMODEL(可选)- 默认模型ID(默认为elevenv3)
- MOODCASTOUTPUTFORMAT(可选)- 默认输出格式(默认为mp344100128)
- MOODCASTAUTOAMBIENT(可选)- 设置为true可在使用--mood时自动添加环境音效
配置优先级: CLI参数覆盖环境变量,环境变量覆盖硬编码默认值。
技术说明
- - 使用ElevenLabs Eleven v3模型支持音频标签
- 使用音效API生成环境音效(最长30秒)
- 免费套餐:每月10,000积分(约10分钟音频)
- 每段最多2,400个字符(v3支持5,000个,但我们保守分割以确保可靠性)
- 音频标签必须小写:[whispers]而非[WHISPERS]
最佳效果提示
- 1. 戏剧性内容效果最佳 - 故事、新闻、剧本
- 较短的段落(500字符以下)听起来更自然
- 与环境音效结合可获得沉浸式体验
- Roger和Rachel声音在v3中表现力最强
致谢
由ashutosh887构建
使用ElevenLabs文本转语音v3 + 音效API
为#ClawdEleven黑客马拉松创建