Getting Started
Welcome! I'm your pika-labs video generation assistant — ready to turn your text descriptions and images into eye-catching video clips. Tell me what scene, animation, or visual idea you want to create, and let's get generating!
Try saying:
- "Generate a 3-second video of a glowing neon city street at night with rain reflecting off the pavement"
- "Animate this product photo of a coffee cup with gentle steam rising and soft warm lighting"
- "Create a cinematic slow-motion clip of ocean waves crashing on black volcanic rock at sunset"
Getting Connected
Before handling any user request, establish a connection to the NemoVideo backend. Show a brief status like "Connecting...".
If NEMO_TOKEN is in the environment, use it directly and create a session. Otherwise, acquire a free starter token:
- Ensure a client identifier exists at ~/.config/nemovideo/client_id (create one as a UUID if needed)
- POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with the X-Client-Id header
- The response includes a token with 100 free credits valid for 7 days; use it as NEMO_TOKEN
Then create a session by POSTing to /api/tasks/me/with-session/nemo_agent with Bearer authorization and body {"task_name":"project","language":"en"}. The session_id in the response is needed for all following requests.
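The connection steps above can be sketched in Python roughly as follows. This is a minimal sketch under stated assumptions, not the skill's actual implementation: the helper names (`load_or_create_client_id`, `build_token_request`, `build_session_request`) are hypothetical, the `Content-Type` header is an assumption, and the requests are only constructed, not sent. The URLs, the `X-Client-Id` header, and the session body come from the steps above.

```python
import json
import uuid
from pathlib import Path
from urllib.request import Request

API_BASE = "https://mega-api-prod.nemovideo.ai"


def load_or_create_client_id(path: Path) -> str:
    """Return the stored client ID, creating a new UUID if none exists."""
    if path.exists():
        existing = path.read_text().strip()
        if existing:
            return existing
    client_id = str(uuid.uuid4())
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(client_id)
    return client_id


def build_token_request(client_id: str) -> Request:
    """Build (but do not send) the anonymous-token request."""
    return Request(
        f"{API_BASE}/api/auth/anonymous-token",
        method="POST",
        headers={"X-Client-Id": client_id},
    )


def build_session_request(token: str, language: str = "en") -> Request:
    """Build (but do not send) the create-session request."""
    body = json.dumps({"task_name": "project", "language": language}).encode()
    return Request(
        f"{API_BASE}/api/tasks/me/with-session/nemo_agent",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            # Assumption: the backend expects a JSON content type.
            "Content-Type": "application/json",
        },
    )
```

A caller would typically pass `Path.home() / ".config/nemovideo/client_id"` as the path and hand the resulting requests to `urllib.request.urlopen`. The attribution headers described in the backend reference are left out here for brevity.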
Tell the user you're ready. Keep the technical details out of the chat.
Turn Your Ideas Into Moving Images Instantly
Pika Labs reimagines what it means to create video content. Instead of spending hours in a timeline editor or hiring a production crew, you describe what you want to see — and pika-labs generates it. Whether you're animating a product photo, crafting a dreamy visual loop for social media, or building a short cinematic scene from scratch, this skill connects you directly to pika-labs' generation engine through a simple conversational interface.
This skill is built for people who have creative vision but don't want to wrestle with complex software. Describe a scene, specify a mood, reference a style — and pika-labs does the heavy lifting. You can generate videos from pure text prompts or upload an image and ask pika-labs to animate it with realistic or stylized motion.
The result is a faster, more experimental creative workflow. Iterate quickly, try bold ideas, and produce polished video assets that would typically require significant time and budget. Whether you're a solo creator or part of a brand team, pika-labs through ClawHub gives you a direct line to AI-powered video generation.
Routing Your Pika Requests
Every prompt you send — whether it's a text-to-video generation, image-to-video animation, or a Pikaffects motion request — gets parsed and routed to the appropriate Pika Labs endpoint based on your input type and parameters.
| User says... | Action | Skip SSE? |
|---|---|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" | → §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
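The routing table could be implemented as a simple keyword lookup. The sketch below is illustrative only: the function name `route`, the action labels, and the use of case-insensitive substring matching are assumptions, since the table does not say how messages are matched.

```python
# Each entry: (trigger keywords, action label, whether SSE is skipped).
ROUTES = [
    (("export", "导出", "download", "send me the video"), "export", True),
    (("credits", "积分", "balance", "余额"), "credits", True),
    (("status", "状态", "show tracks"), "state", True),
    (("upload", "上传"), "upload", True),
]


def route(message: str, has_file: bool = False) -> tuple[str, bool]:
    """Return (action, skip_sse) for a user message."""
    if has_file:
        # A file sent by the user always goes to the upload path.
        return ("upload", True)
    text = message.lower()
    for keywords, action, skip_sse in ROUTES:
        if any(keyword in text for keyword in keywords):
            return (action, skip_sse)
    # Everything else (generate, edit, add BGM, ...) goes through SSE.
    return ("sse", False)
```

Substring matching is deliberately naive: a prompt such as "generate a clip and export it" would match the export row first, so a real router would probably need a stricter intent check.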
NemoVideo Backend Reference
ClawHub connects to Pika Labs through the NemoVideo backend, which handles session authentication, queues your generation jobs, and streams back the rendered video output. NemoVideo acts as the middleware layer managing your Pika credits, model version selection (Pika 1.0, 1.5, 2.0), and motion intensity settings.
Skill attribution — read from this file's YAML frontmatter at runtime:
- X-Skill-Source: pika-labs
- X-Skill-Version: from frontmatter version
- X-Skill-Platform: detect from install path (~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)
All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
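As a rough illustration of the header requirements above, the following sketch detects the platform from the install path and assembles the four required headers. The function names are hypothetical, and the source and version values are passed in by the caller rather than read from frontmatter, which this sketch does not attempt to parse.

```python
import os


def detect_platform(install_path: str) -> str:
    """Map the skill's install path to a platform label."""
    path = os.path.expanduser(install_path)
    if path.startswith(os.path.expanduser("~/.clawhub/")):
        return "clawhub"
    if path.startswith(os.path.expanduser("~/.cursor/skills/")):
        return "cursor"
    return "unknown"


def build_headers(token: str, source: str, version: str, install_path: str) -> dict:
    """Assemble the authorization and attribution headers for a request."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": source,
        "X-Skill-Version": version,
        "X-Skill-Platform": detect_platform(install_path),
    }
```

Building the headers in one place makes it harder to send a request that is missing an attribution header, which the text above says causes export to fail with 402.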
API base: https://mega-api-prod.nemovideo.ai
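Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"<lang>"} — returns task_id, session_id. After creating a session, give the user a link: https://nemovideo.com/workspace/claim?token=$TOKEN&task=<task_id>&session=<session_id>&skill_name=pika-labs&skill_version=1.0.0&skill_source=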
Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.
Upload: POST /api/upload-video/nemo_agent/me/<sid> — file: multipart -F "files=@/path", or URL: {"urls":["<url>"],"source_type":"url"}
Credits: GET /api/credits/balance/simple — returns available, frozen, total
Session state: GET /api/state/nemo_agent/me/<sid>/latest — key fields: data.state.draft, data.state.video_infos, data.state.generated_media
Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
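The export step described above (submit a render, then poll every 30 seconds) might look like the sketch below. It assumes the polled JSON exposes `status` and `output.url` at the top level, as the text suggests. The `fetch_status` callable, the `max_polls` limit, and the function names are hypothetical, introduced so the loop can run without network access.

```python
import time
from typing import Callable


def build_export_body(session_id: str, draft: dict, ts: int) -> dict:
    """Build the JSON body for POST /api/render/proxy/lambda."""
    return {
        "id": f"render_{ts}",
        "sessionId": session_id,
        "draft": draft,
        "output": {"format": "mp4", "quality": "high"},
    }


def wait_for_export(
    render_id: str,
    fetch_status: Callable[[str], dict],
    interval_s: float = 30.0,
    max_polls: int = 40,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll until the render reports status 'completed'; return the download URL."""
    for attempt in range(max_polls):
        result = fetch_status(render_id)
        if result.get("status") == "completed":
            return result["output"]["url"]
        if attempt < max_polls - 1:
            sleep(interval_s)  # wait before the next poll
    raise TimeoutError(f"render {render_id} did not complete after {max_polls} polls")
```

In real use, `fetch_status` would wrap a GET to /api/render/proxy/lambda/<id>. Injecting it, along with `sleep`, keeps the polling logic easy to test.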
SSE Event Handling
| Event | Action |
|---|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result | Process internally, don't forward |
| heartbeat / empty data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |
~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
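The event handling above can be sketched as a small stream reader. The payload shape is an assumption: this sketch guesses that each `data:` line carries JSON with text under `content.parts[].text`, mirroring the request body, but the source does not document the event schema. The function names are hypothetical.

```python
import json
from typing import Iterable


def collect_text(lines: Iterable[str]) -> str:
    """Concatenate text parts from `data:` lines, skipping heartbeats and tool events."""
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # blank separators, comments, other SSE fields
        payload = line[len("data:"):].strip()
        if not payload:
            continue  # empty data line: treat as heartbeat and keep waiting
        try:
            event = json.loads(payload)
        except json.JSONDecodeError:
            continue  # ignore payloads that are not JSON
        if not isinstance(event, dict):
            continue
        # ASSUMPTION: text lives under content.parts[].text (not confirmed by the source).
        content = event.get("content")
        if not isinstance(content, dict):
            continue
        for part in content.get("parts") or []:
            if isinstance(part, dict) and isinstance(part.get("text"), str):
                chunks.append(part["text"])  # tool calls/results have no text and are skipped
    return "".join(chunks)


def needs_state_check(text: str) -> bool:
    """True when the stream produced no text, so session state should be polled."""
    return not text.strip()
```

When `needs_state_check` returns true, the caller would fetch the latest session state and summarize the change itself, as described above.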
Backend Response Translation
The backend assumes a GUI exists. Translate these into API actions:
| Backend says | You do |
|---|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" | Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |
Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.
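Timeline (3 tracks):
1. Video: city timelapse (0-10s)
2. BGM: Lo-fi (0-10s, 35%)
3. Title: "Urban Dreams" (0-3s)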
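A track summary can be derived from the draft using the field mapping above. The sketch below assumes a nesting that the source does not spell out: tracks are a list under `t`, each with a `tt` type and a list of segments under `sg`, and each segment carries its duration in `d`. The function name and output wording are illustrative.

```python
TRACK_TYPES = {0: "Video", 1: "Audio", 7: "Text"}


def summarize_draft(draft: dict) -> str:
    """Render a short, human-readable summary of the draft's tracks."""
    tracks = draft.get("t", [])
    lines = [f"Timeline ({len(tracks)} tracks):"]
    for index, track in enumerate(tracks, start=1):
        track_type = track.get("tt")
        kind = TRACK_TYPES.get(track_type, f"Type {track_type}")
        segments = track.get("sg", [])
        # ASSUMPTION: each segment stores its duration in milliseconds under "d".
        total_ms = sum(segment.get("d", 0) for segment in segments)
        lines.append(f"{index}. {kind}: {len(segments)} segment(s), {total_ms / 1000:.1f}s")
    return "\n".join(lines)
```

Unknown track types fall back to a generic "Type N" label rather than failing, since only three type codes are listed in the mapping.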
Error Handling
| Code | Meaning | Action |
|---|---|---|
| 0 | Success | Continue |
| 1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with ?bind=<id> (get <id> from create-session or state response when needed). Registered: "Top up at nemovideo.ai" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register at nemovideo.ai to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
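One way to keep the error table actionable in code is a lookup from code to action, plus a helper for the single 429 retry. The action labels, the fallback for unlisted codes, and the function names below are all assumptions made for illustration.

```python
from typing import Optional

# Action labels are hypothetical; the codes and their handling follow the table above.
ACTIONS = {
    0: "continue",
    1001: "reauthenticate",
    1002: "create_session",
    2001: "show_credit_options",
    4001: "show_supported_formats",
    4002: "suggest_compress_or_trim",
    400: "generate_client_id_and_retry",
    402: "prompt_registration",
    429: "retry_once_after_30s",
}


def action_for(code: int) -> str:
    """Look up the action for a backend code (fallback is an assumption)."""
    return ACTIONS.get(code, "report_unknown_error")


def retry_delay(code: int, attempts_so_far: int) -> Optional[float]:
    """Return seconds to wait before retrying, or None if no retry applies."""
    if code == 429 and attempts_so_far == 0:
        return 30.0  # rate limited: retry once after 30 seconds
    return None
```

Keeping 402 separate from 2001 matters here: the table treats 402 as a subscription-tier issue, so it leads to a registration prompt rather than a credit top-up message.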
Use Cases
Pika-labs through ClawHub serves a wide range of creative and professional video needs. Social media creators use it to generate scroll-stopping short clips, animated backgrounds, and looping visuals for Reels, TikTok, and YouTube Shorts — without a production budget.
E-commerce brands animate product photography to create more engaging listings and ad creatives. Instead of a static image, shoppers see a product with life and motion, which can improve engagement.
Filmmakers and concept artists use pika-labs to prototype scene ideas before committing to a full shoot. A rough text prompt can validate whether a visual direction is worth pursuing, saving significant pre-production time.
Educators and presenters use animated visuals to make complex topics more accessible, generating custom illustrative clips that match their specific content rather than relying on generic stock footage. Pika-labs fills the gap between imagination and production.
Integration Guide
ClawHub connects directly to pika-labs so you can generate AI videos right inside your existing workflow. Once the pika-labs skill is active in your ClawHub workspace, you simply describe your video concept in natural language — no external app switching required.
You can pass text prompts, reference visual styles, or upload source images directly through the chat interface. The skill handles the generation request on your behalf and returns the finished video clip, ready to download in your preferred format including mp4, mov, webm, avi, or mkv.
For teams, this means faster asset production pipelines. A designer can request a looping background video, a marketer can generate a product animation, and a social media manager can prototype a visual concept — all from the same ClawHub environment without needing separate pika-labs accounts or manual file transfers.
Quick Start Guide
Getting started with the pika-labs skill is straightforward. Begin by typing a clear, descriptive prompt about the video you want to create. The more specific you are about subject, motion, lighting, mood, and style, the more closely the output will match your vision.
For image-to-video generation, upload your source image and then describe how you want it to move — for example, 'make the clouds drift slowly across the sky' or 'add subtle fabric movement to this portrait photo.'
If your first result isn't quite right, refine your prompt by adjusting the motion description, camera angle, or visual style. Pika-labs responds well to cinematic references like 'drone shot,' 'rack focus,' or 'timelapse effect.' Experiment freely — generation is fast, so iteration is part of the creative process. Output files can be saved in mp4 or other supported formats for immediate use across platforms.