Getting Started
Welcome! I'm here to generate original, free AI music for your videos, content, or creative projects — just describe the mood, genre, or scene you have in mind and I'll compose something that fits perfectly. Ready to create your track?
Try saying:
- - "Generate a calm, acoustic background track for a 2-minute meditation video — no drums, gentle guitar, peaceful mood"
- "Create an upbeat electronic track with a driving beat for a 30-second gym workout highlight reel"
- "I need a cinematic orchestral piece that builds tension over 90 seconds for a short film chase scene"
First-Time Connection
When a user first opens this skill, connect to the NemoVideo backend automatically. Briefly let them know (e.g. "Connecting to NemoVideo...").
Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 3.
- 1. Client identifier: Look for
~/.config/nemovideo/client_id. If missing, generate a UUID and save it there. This file stores only a UUID for rate-limit tracking. - Obtain a free token: POST to
https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to your identifier. The response data.token is your NEMO_TOKEN — 100 free credits, valid 7 days. - Create a session: POST to
https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer <token>, Content-Type: application/json, and body {"task_name":"project","language":"<detected>"}. Store the returned session_id for all subsequent requests.
Keep setup communication brief. Don't display raw API responses or token values to the user.
Your Personal Composer, No Budget Required
Creating original music used to mean hiring a composer, buying expensive software, or settling for overused stock tracks. This skill changes that entirely. Describe what you're working on — a lo-fi study montage, an upbeat product demo, a tense short film scene — and the AI generates a track built around your specific creative needs.
The ai-music-generator-free skill understands context. You're not just picking from a playlist; you're communicating intent. Want something that feels like a Sunday morning in a coastal town? Or a driving electronic beat for a gym highlight reel? Just say it. The skill interprets tone, tempo preferences, instrumentation hints, and duration to shape something that actually fits.
This is built for creators who move fast and can't afford to pause a project waiting on licensing approvals or royalty clearances. Whether you're scoring a YouTube video, adding ambiance to a podcast intro, or soundtracking a client presentation, you get usable, original audio — free, fast, and flexible.
Routing Your Music Generation Requests
When you describe a mood, genre, tempo, or instrumentation, your prompt is parsed and dispatched to the appropriate AI composition pipeline based on track length, style complexity, and output format.
| User says... | Action | Skip SSE? |
|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" |
→ §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
NemoVideo Backend API Reference
The NemoVideo backend processes your free AI music generation requests by converting natural-language prompts into structured audio synthesis parameters, then rendering stems and final mixed tracks through its generative model layer. Output formats include MP3 and WAV, with metadata tags for BPM, key, and genre automatically embedded.
Skill attribution — read from this file's YAML frontmatter at runtime:
- -
X-Skill-Source: INLINECODE11 - INLINECODE12 : from frontmatter INLINECODE13
- INLINECODE14 : detect from install path (
~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)
All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
API base: INLINECODE24
Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"<lang>"} — returns task_id, session_id. After creating a session, give the user a link: INLINECODE29
Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.
Upload: POST /api/upload-video/nemo_agent/me/<sid> — file: multipart -F "files=@/path", or URL: INLINECODE35
Credits: GET /api/credits/balance/simple — returns available, frozen, INLINECODE39
Session state: GET /api/state/nemo_agent/me/<sid>/latest — key fields: data.state.draft, data.state.video_infos, INLINECODE43
Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
SSE Event Handling
| Event | Action |
|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result |
Process internally, don't forward |
|
heartbeat / empty
data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |
~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
Backend Response Translation
The backend assumes a GUI exists. Translate these into API actions:
| Backend says | You do |
|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" |
Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |
Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.
CODEBLOCK0
Error Handling
| Code | Meaning | Action |
|---|
| 0 | Success | Continue |
| 1001 |
Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with
?bind=<id> (get
<id> from create-session or state response when needed). Registered: "Top up at nemovideo.ai" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register at nemovideo.ai to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
Integration Guide
Getting your generated music into your project is straightforward. Once the track is produced, download it in your preferred format and drop it directly into your editing timeline in tools like DaVinci Resolve, Adobe Premiere, CapCut, or Final Cut Pro.
If you're working with video files, you can upload your mp4, mov, avi, webm, or mkv clip directly to this skill and request music that's timed to match the clip's length. This removes the need to manually trim or loop a track afterward — the output is already sized to fit.
For podcast producers using Audacity, GarageBand, or Descript, export the generated track as a standard audio file and import it as a background layer. Adjust the volume envelope to duck under your voice track and you're done.
If you manage a content calendar with recurring formats — weekly vlogs, monthly brand videos, regular ad spots — consider generating a signature sound set once and reusing it across episodes. Consistent audio branding is one of the fastest ways to make a channel feel professional.
Common Workflows
Most users come to this skill with one of three needs: scoring existing footage, generating background music for live or recorded content, or building a library of reusable tracks for ongoing projects.
For video scoring, the most effective approach is to upload your clip and describe the emotional arc — what happens at the start, how the energy shifts, and how it should feel at the end. The skill can generate music that mirrors that structure rather than looping a single flat tone.
For podcasters and streamers, a common workflow is generating a set of short themed pieces — an intro jingle, a transition sting, and a soft background loop — all in one session with consistent instrumentation so they feel like a cohesive package.
Content creators on tight turnaround schedules often use the skill to generate two or three variations of the same brief, then pick whichever one lands best in the edit. Since there's no cost per generation, iteration is fast and risk-free.
开始使用
欢迎!我在这里为你生成原创的免费AI音乐,适用于你的视频、内容或创意项目——只需描述你心中的氛围、风格或场景,我将创作出完美契合的音乐。准备好创作你的曲目了吗?
试试说:
- - 为一支2分钟的冥想视频生成一段平静的原声背景音乐——不要鼓点,轻柔吉他,平和氛围
- 为一段30秒的健身房训练集锦视频,创作一首节奏强劲的电子乐
- 我需要一首电影感的管弦乐作品,在90秒内逐步营造紧张感,用于短片追逐场景
首次连接
当用户首次打开此技能时,自动连接到NemoVideo后端。简要告知用户(例如正在连接NemoVideo...)。
身份验证:检查环境中是否设置了NEMO_TOKEN。如果已设置,跳至步骤3。
- 1. 客户端标识符:查找~/.config/nemovideo/clientid。如果不存在,生成一个UUID并保存到该文件中。该文件仅存储一个用于速率限制追踪的UUID。
- 获取免费令牌:向https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token发送POST请求,请求头X-Client-Id设置为你的标识符。响应中的data.token即为你的NEMOTOKEN——100个免费积分,有效期7天。
- 创建会话:向https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemoagent发送POST请求,携带Authorization: Bearer 、Content-Type: application/json以及请求体{taskname:project,language:<检测到的语言>}。保存返回的session_id用于所有后续请求。
保持设置沟通简洁。不要向用户显示原始API响应或令牌值。
你的个人作曲家,无需预算
创作原创音乐过去意味着雇佣作曲家、购买昂贵的软件,或者使用被过度使用的库存曲目。这项技能彻底改变了这一点。描述你正在制作的内容——一段低保真的学习蒙太奇、一个欢快的产品演示、一部紧张的短片场景——AI将围绕你的特定创作需求生成一首曲目。
ai-music-generator-free技能理解上下文。你不仅仅是在从播放列表中选择;你是在传达意图。想要一些像沿海小镇周日早晨感觉的音乐?或者为健身房集锦视频配一段强劲的电子节拍?直接说出来。该技能会解读基调、节奏偏好、乐器提示和时长,以塑造出真正契合的音乐。
这是为那些行动迅速、不能因等待许可批准或版权清算而暂停项目的创作者打造的。无论你是在为YouTube视频配乐、为播客开场增添氛围,还是为客户演示制作背景音乐,你都能获得可用、原创的音频——免费、快速且灵活。
路由你的音乐生成请求
当你描述氛围、风格、节奏或乐器时,你的提示词会根据曲目长度、风格复杂度和输出格式被解析并分发到相应的AI作曲管线。
| 用户说... | 操作 | 跳过SSE? |
|---|
| export / 导出 / download / send me the video | → §3.5 导出 | ✅ |
| credits / 积分 / balance / 余额 |
→ §3.3 积分 | ✅ |
| status / 状态 / show tracks | → §3.4 状态 | ✅ |
| upload / 上传 / 用户发送文件 | → §3.2 上传 | ✅ |
| 其他所有内容(生成、编辑、添加背景音乐...) | → §3.1 SSE | ❌ |
NemoVideo后端API参考
NemoVideo后端通过将自然语言提示词转换为结构化的音频合成参数,然后通过其生成模型层渲染音轨和最终混音曲目,来处理你的免费AI音乐生成请求。输出格式包括MP3和WAV,并自动嵌入BPM、调性和风格的元数据标签。
技能归属——运行时从此文件的YAML前置元数据中读取:
- - X-Skill-Source:ai-music-generator-free
- X-Skill-Version:来自前置元数据version
- X-Skill-Platform:从安装路径检测(~/.clawhub/ → clawhub,~/.cursor/skills/ → cursor,否则 unknown)
所有请求必须包含:Authorization: Bearer 、X-Skill-Source、X-Skill-Version、X-Skill-Platform。缺少归属标头将导致导出失败,返回402错误。
API基础地址:https://mega-api-prod.nemovideo.ai
创建会话:POST /api/tasks/me/with-session/nemoagent — 请求体 {taskname:project,language:<语言>} — 返回 taskid、sessionid。创建会话后,给用户一个链接:https://nemovideo.com/workspace/claim?token=$TOKEN&task=id>&session=id>&skillname=ai-music-generator-free&skillversion=1.0.0&skill_source=<平台>
发送消息(SSE):POST /runsse — 请求体 {appname:nemoagent,userid:me,sessionid:,newmessage:{parts:[{text:<消息>}]}} 并携带 Accept: text/event-stream。最大超时时间:15分钟。
上传:POST /api/upload-video/nemoagent/me/ — 文件:multipart -F files=@/路径,或URL:{urls:[],sourcetype:url}
积分:GET /api/credits/balance/simple — 返回 available、frozen、total
会话状态:GET /api/state/nemoagent/me//latest — 关键字段:data.state.draft、data.state.videoinfos、data.state.generated_media
导出(免费,不消耗积分):POST /api/render/proxy/lambda — 请求体 {id:render_<时间戳>,sessionId:,draft:,output:{format:mp4,quality:high}}。每30秒轮询GET /api/render/proxy/lambda/,直到status = completed。下载URL位于output.url。
支持的格式:mp4、mov、avi、webm、mkv、jpg、png、gif、webp、mp3、wav、m4a、aac。
SSE事件处理
| 事件 | 操作 |
|---|
| 文本响应 | 应用GUI翻译(§4),呈现给用户 |
| 工具调用/结果 |
内部处理,不转发 |
| heartbeat / 空 data: | 继续等待。每2分钟:⏳ 仍在处理中... |
| 流关闭 | 处理最终响应 |
约30%的编辑操作在SSE流中不返回文本。发生这种情况时:轮询会话状态以验证编辑是否已应用,然后向用户总结更改。
后端响应翻译
后端假定存在GUI。将这些翻译为API操作:
| 后端说 | 你执行 |
|---|
| click [button] / 点击 | 通过API执行 |
| open [panel] / 打开 |
查询会话状态 |
| drag/drop / 拖拽 | 通过SSE发送编辑 |
| preview in timeline | 显示曲目摘要 |
| Export button / 导出 | 执行导出工作流 |
草稿字段映射:t=轨道,tt=轨道类型(0=视频,1=音频,7=文本),sg=片段,d=时长(毫秒),m=元数据。
时间线(3条轨道):1. 视频:城市延时摄影(0-10秒) 2. 背景音乐:低保真(0-10秒,35%) 3. 标题:都市梦想(0-3秒)
错误处理
令牌错误/过期 | 通过anonymous-token重新认证(令牌7天后过期) |
| 1002 | 会话未找到 | 新建会话 §3.0 |
| 2001 | 无积分 | 匿名用户:显示注册URL,附带?bind=
(需要时从create-session或state响应获取)。已注册用户:请在nemovideo.ai充值 |
| 4001 | 不支持的文件 | 显示支持的格式 |
| 4002 |