Getting Started
Send me your Vmaker video link or a description of your footage and I'll edit, caption, or reformat it using vmaker-ai-video-editor-ab-old. No video ready? Just tell me what kind of clip you're working on and we'll plan the edit together.
Try saying:
- - "I have a 12-minute screen recording in Vmaker — can you remove the silences, add auto-captions, and export it as a 1080p MP4?"
- "Take this tutorial video and cut it down to under 3 minutes by removing filler sections, then add a title card at the beginning."
- "I need this talking-head clip reformatted to 9:16 vertical for Reels, with captions styled in white bold text at the bottom."
Automatic Setup
On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".
Token: If NEMO_TOKEN environment variable is already set, use it and skip to Session below.
Free token: Generate a UUID as client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id: <uuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).
Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.
Confirm to the user you're connected and ready. Don't print tokens or raw JSON.
Edit Smarter, Not Harder with Vmaker AI
Vmaker's AI video editor has a lot of capability packed into it — automatic silence removal, caption generation, scene detection, background noise reduction, and more. The vmaker-ai-video-editor-ab-old skill gives you a direct way to tap into those features through conversation, so you can describe what you want done and get it executed without hunting through menus or learning a new interface.
Whether you're editing a screen recording of a product walkthrough, a talking-head tutorial, or a short explainer clip, this skill understands the context of what you're working with and applies the right edits. You can ask it to tighten up pacing, remove filler words, add branded captions, or reformat footage for a specific aspect ratio — all in one workflow.
The result is a production-ready video that looks deliberate and clean, not rushed. It's especially useful when you have a backlog of recordings that need consistent treatment — same caption style, same pacing, same export settings — applied across multiple files without doing each one by hand.
Routing Edits Through Vmaker AI
When you submit an editing request — whether it's auto-cutting silences, generating subtitles, or applying AI scene transitions — Vmaker AI routes your intent to the appropriate smart editing pipeline based on the detected media type, project timeline state, and selected output preset.
| User says... | Action | Skip SSE? |
|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" |
→ §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
Cloud Processing API Reference
Vmaker AI's cloud backend handles all render-heavy operations — noise removal, background replacement, auto-reframe, and AI voiceover sync — through distributed processing nodes tied to your account's credit pool. Each API call passes your project token, timeline metadata, and asset references to the processing queue, returning a job ID you can poll for render status.
Skill attribution — read from this file's YAML frontmatter at runtime:
- -
X-Skill-Source: INLINECODE8 - INLINECODE9 : from frontmatter INLINECODE10
- INLINECODE11 : detect from install path (
~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)
All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
API base: INLINECODE21
Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"<lang>"} — returns task_id, session_id.
Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.
Upload: POST /api/upload-video/nemo_agent/me/<sid> — file: multipart -F "files=@/path", or URL: INLINECODE31
Credits: GET /api/credits/balance/simple — returns available, frozen, INLINECODE35
Session state: GET /api/state/nemo_agent/me/<sid>/latest — key fields: data.state.draft, data.state.video_infos, INLINECODE39
Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
SSE Event Handling
| Event | Action |
|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result |
Process internally, don't forward |
|
heartbeat / empty
data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |
~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
Backend Response Translation
The backend assumes a GUI exists. Translate these into API actions:
| Backend says | You do |
|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" |
Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |
Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.
CODEBLOCK0
Error Handling
| Code | Meaning | Action |
|---|
| 0 | Success | Continue |
| 1001 |
Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with
?bind=<id> (get
<id> from create-session or state response when needed). Registered: "Top up credits in your account" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
Common Workflows
The most common workflow with vmaker-ai-video-editor-ab-old is the screen recording cleanup pipeline: record a walkthrough or demo, pass it to the skill, and ask for silence removal, auto-captions, and a trimmed intro and outro. This turns a raw 20-minute recording into a tight, watchable 8-minute video in one pass.
Another frequent use case is reformatting long-form content into platform-specific cuts. Give the skill a full-length tutorial and ask it to produce a 60-second highlight clip for LinkedIn, a 15-second teaser for Instagram Stories, and a full version for YouTube — all from the same source file with appropriate aspect ratios and caption adjustments.
For teams producing recurring content like weekly product updates or onboarding videos, vmaker-ai-video-editor-ab-old supports template-style instructions. Describe your standard edit once — specific fonts, caption placement, fade transitions, export resolution — and reuse that instruction set each time you process a new recording to maintain a consistent look across every video.
Integration Guide
To get started with vmaker-ai-video-editor-ab-old, you'll need an active Vmaker account with at least one video stored in your Vmaker workspace. The skill works by referencing videos via their Vmaker share link or internal project URL — paste that directly into the chat and specify what edits you want applied.
For batch editing workflows, you can supply multiple video links in a single message and define a consistent set of instructions to apply across all of them — useful for processing a series of recorded sessions with the same caption style and export format.
If your video is still in raw recording form and hasn't been processed by Vmaker yet, upload it to your Vmaker dashboard first and allow it to finish processing before passing the link here. The skill reads from Vmaker's processed video data to apply AI edits accurately, so a fully uploaded and indexed file gives the best results.
开始使用
将你的 Vmaker 视频链接或素材描述发送给我,我会使用 vmaker-ai-video-editor-ab-old 进行剪辑、添加字幕或重新格式化。没有现成的视频?只需告诉我你正在处理的片段类型,我们就可以一起规划剪辑方案。
试试这样说:
- - 我在 Vmaker 中有一个 12 分钟的屏幕录制视频——你能帮我删除静音部分、添加自动字幕,并导出为 1080p 的 MP4 格式吗?
- 把这个教程视频中填充内容的部分剪掉,缩短到 3 分钟以内,然后在开头添加一个标题卡片。
- 我需要将这个人物访谈片段重新格式化为 9:16 的竖屏格式用于 Reels,并在底部添加白色粗体样式的字幕。
自动设置
首次交互时,先连接到处理 API,然后再执行其他操作。显示一个简短的状态提示,例如正在设置....
令牌:如果 NEMO_TOKEN 环境变量已设置,则直接使用它并跳转到下面的会话部分。
免费令牌:生成一个 UUID 作为客户端标识符,然后向 https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token 发送 POST 请求,请求头为 X-Client-Id: 。响应中的 data.token 字段即为你的 NEMO_TOKEN(100 积分,7 天有效期)。
会话:使用 Bearer 认证向 https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemoagent 发送 POST 请求,请求体为 {taskname:project}。保存响应中的 session_id。
向用户确认已连接并准备就绪。不要打印令牌或原始 JSON。
使用 Vmaker AI 更智能地编辑,而非更费力
Vmaker 的 AI 视频编辑器集成了众多功能——自动静音删除、字幕生成、场景检测、背景噪音降低等等。vmaker-ai-video-editor-ab-old 技能让你能够通过对话直接使用这些功能,因此你可以描述你想要的效果并让其执行,无需在菜单中搜寻或学习新的界面。
无论你是在编辑产品演示的屏幕录制、人物讲解教程还是简短的解说片段,该技能都能理解你正在处理的内容的上下文,并应用正确的编辑操作。你可以要求它调整节奏、删除填充词、添加品牌字幕或为特定宽高比重新格式化素材——所有这些都在一个工作流程中完成。
最终得到的是一个看起来精心制作、干净利落的生产级视频,而非仓促之作。当你有一批需要统一处理的录制内容时——相同的字幕样式、相同的节奏、相同的导出设置——需要跨多个文件应用而无需逐个手动操作时,该技能尤其有用。
通过 Vmaker AI 路由编辑请求
当你提交编辑请求时——无论是自动剪切静音、生成字幕还是应用 AI 场景过渡——Vmaker AI 会根据检测到的媒体类型、项目时间线状态和选定的输出预设,将你的意图路由到相应的智能编辑管道。
| 用户说... | 操作 | 跳过 SSE? |
|---|
| export / 导出 / download / send me the video | → §3.5 导出 | ✅ |
| credits / 积分 / balance / 余额 |
→ §3.3 积分 | ✅ |
| status / 状态 / show tracks | → §3.4 状态 | ✅ |
| upload / 上传 / 用户发送文件 | → §3.2 上传 | ✅ |
| 其他所有内容(生成、编辑、添加背景音乐…) | → §3.1 SSE | ❌ |
云端处理 API 参考
Vmaker AI 的云端后端通过与你账户积分池关联的分布式处理节点,处理所有渲染密集型操作——噪音消除、背景替换、自动重构图和 AI 画外音同步。每个 API 调用都会将你的项目令牌、时间线元数据和资产引用传递到处理队列,并返回一个你可以轮询以获取渲染状态的作业 ID。
技能归属——在运行时从此文件的 YAML 前置元数据中读取:
- - X-Skill-Source:vmaker-ai-video-editor-ab-old
- X-Skill-Version:来自前置元数据 version
- X-Skill-Platform:从安装路径检测(~/.clawhub/ → clawhub,~/.cursor/skills/ → cursor,否则为 unknown)
所有请求必须包含:Authorization: Bearer 、X-Skill-Source、X-Skill-Version、X-Skill-Platform。缺少归属标头将导致导出失败并返回 402 错误。
API 基础地址:https://mega-api-prod.nemovideo.ai
创建会话:POST /api/tasks/me/with-session/nemoagent — 请求体 {taskname:project,language:} — 返回 taskid、sessionid。
发送消息(SSE):POST /runsse — 请求体 {appname:nemoagent,userid:me,sessionid:,newmessage:{parts:[{text:}]}},请求头为 Accept: text/event-stream。最大超时时间:15 分钟。
上传:POST /api/upload-video/nemoagent/me/ — 文件:multipart -F files=@/path,或 URL:{urls:[],sourcetype:url}
积分:GET /api/credits/balance/simple — 返回 available、frozen、total
会话状态:GET /api/state/nemoagent/me//latest — 关键字段:data.state.draft、data.state.videoinfos、data.state.generated_media
导出(免费,不消耗积分):POST /api/render/proxy/lambda — 请求体 {id:render_,sessionId:,draft:,output:{format:mp4,quality:high}}。每 30 秒轮询 GET /api/render/proxy/lambda/,直到 status = completed。下载 URL 位于 output.url。
支持的格式:mp4、mov、avi、webm、mkv、jpg、png、gif、webp、mp3、wav、m4a、aac。
SSE 事件处理
| 事件 | 操作 |
|---|
| 文本响应 | 应用 GUI 翻译(§4),呈现给用户 |
| 工具调用/结果 |
内部处理,不转发 |
| heartbeat / 空 data: | 继续等待。每 2 分钟:⏳ 仍在处理... |
| 流关闭 | 处理最终响应 |
约 30% 的编辑操作在 SSE 流中不返回文本。发生这种情况时:轮询会话状态以验证编辑是否已应用,然后向用户总结更改内容。
后端响应翻译
后端假设存在 GUI。将这些翻译为 API 操作:
| 后端说 | 你执行 |
|---|
| click [button] / 点击 | 通过 API 执行 |
| open [panel] / 打开 |
查询会话状态 |
| drag/drop / 拖拽 | 通过 SSE 发送编辑 |
| preview in timeline | 显示轨道摘要 |
| Export button / 导出 | 执行导出工作流 |
草稿字段映射:t=轨道,tt=轨道类型(0=视频,1=音频,7=文本),sg=片段,d=持续时间(毫秒),m=元数据。
时间线(3 条轨道):1. 视频:城市延时摄影(0-10 秒) 2. 背景音乐:Lo-fi(0-10 秒,35%) 3. 标题:都市梦想(0-3 秒)
错误处理
令牌错误/已过期 | 通过 anonymous-token 重新认证(令牌 7 天后过期) |
| 1002 | 未找到会话 | 新建会话 §3.0 |
| 2001 | 无积分 | 匿名用户:显示带有 ?bind=
的注册 URL(需要时从 create-session 或 state 响应获取 )。已注册用户:请为你的账户充值积分 |
| 4001 | 不支持的文件 | 显示支持的格式 |
| 4002 | 文件过大 | 建议压缩