Getting Started
Welcome! Ready to turn your words into real video content? This text-to-video generator free skill is here to help you create stunning videos from plain text descriptions — no footage needed. Describe your scene or concept below and let's start generating!
Try saying:
- - "Generate a 15-second video of a cozy coffee shop in the morning with warm lighting and soft background music vibes"
- "Create a short promotional video for a new fitness app showing someone working out at home with bold text overlays"
- "Make a nature documentary-style video clip of a waterfall in a rainforest with cinematic camera movement"
On first use, the skill connects to NemoVideo automatically:
- 1. Check for
NEMO_TOKEN env var, or acquire one via https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token (100 free credits) - Persist a Client-ID to
~/.config/nemovideo/client_id for rate-limit tracking (UUID only, no secrets) - Create a session and start working immediately
From a Single Sentence to a Full Video
Most people have ideas but lack the tools, time, or budget to bring them to life visually. This skill closes that gap entirely. With the text-to-video-generator-free skill on ClawHub, you write what you want to see — a sunrise over a city, a product being unboxed, a character walking through a forest — and the AI handles the rest, generating video content that matches your description.
This isn't a slideshow builder or a template filler. It's a genuine text-driven video creation tool that interprets your language and produces motion, scene transitions, and visual context from scratch. Whether you're a solo content creator working on a YouTube channel, a small business owner who needs a quick promo, or a teacher building engaging lesson materials, this skill gives you professional-quality output without professional-level effort.
You don't need video editing experience, stock footage subscriptions, or expensive software. Just describe your vision in plain language and let the AI do the heavy lifting. The result is a ready-to-use video file you can share, embed, or build on.
Environment
| Variable | Default | Purpose |
|---|
| INLINECODE3 | Auto-acquired on first use (100 free credits, 7-day expiry) | API authentication |
| INLINECODE4 |
https://mega-api-prod.nemovideo.ai | API base URL |
Prompt Routing and Video Dispatch
Every text prompt you submit is parsed for scene descriptors, motion cues, and style tokens before being routed to the appropriate NemoVideo rendering pipeline.
| User says... | Action | Skip SSE? |
|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" |
→ §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
NemoVideo Backend API Reference
The NemoVideo backend processes your natural-language prompts through a diffusion-based video synthesis engine, converting keyframe descriptions into fluid, time-sequenced clips. Free-tier requests are queued through shared GPU nodes, so render times vary based on prompt complexity and current load.
Required headers on all requests: X-Skill-Source: $SKILL_NAME, X-Skill-Version: $SKILL_VERSION, INLINECODE8
Create session: POST $API/api/tasks/me/with-session/nemo_agent — returns task_id and INLINECODE11
Send message (SSE): POST $API/run_sse with session_id and user message. Stream responses; ~30% of edits return no text (query state to confirm changes).
Upload: POST $API/api/upload-video/nemo_agent/me/<sid> — file or URL upload. Supports: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
Check credits: INLINECODE15
Query state: GET $API/api/state/nemo_agent/me/<sid>/latest — check draft, tracks, generated media
Export: POST $API/api/render/proxy/lambda — export does NOT cost credits. Poll GET $API/api/render/proxy/lambda/<id> until status: completed.
Task link: INLINECODE20
Common Errors
If your token expires mid-session, simply re-authenticate through the ClawHub skill interface to refresh your credentials and resume generation. A 'session not found' response means your previous context was dropped — start a new session and re-enter your prompt to continue. Running out of credits? Head to nemovideo.ai to register or upgrade your plan and unlock additional free video generation capacity.
Best Practices for Getting Great Results
Be specific about visual style. Mentioning a cinematic style, animation type, or color palette (e.g., 'muted earth tones,' 'neon cyberpunk aesthetic,' 'soft pastel watercolor') gives the AI a strong creative direction and dramatically improves output quality.
Include motion cues in your prompt. Static descriptions produce static-feeling results. Words like 'slow pan,' 'zoom in,' 'time-lapse,' or 'tracking shot' signal dynamic movement and make your generated video feel more professional and intentional.
Keep your prompt focused on one main scene. Trying to pack too many locations or actions into a single prompt can lead to disjointed output. For multi-scene videos, consider generating each segment separately and combining them in a video editor afterward.
Iterate quickly. The text-to-video-generator-free skill is built for speed. If your first result isn't quite right, tweak one or two words in your prompt and regenerate. Small changes — swapping 'daytime' for 'golden hour' or 'busy street' for 'quiet alley' — can produce noticeably different and better outputs.
FAQ — Text-to-Video Generator Free
Do I need to upload any footage to use this skill? No. The text-to-video-generator-free skill generates video entirely from your written prompt. You describe the scene, style, or concept, and the AI builds the video from scratch. No existing clips or media files are required to get started.
What kinds of prompts work best? Descriptive, specific prompts tend to produce the best results. Instead of writing 'a car,' try 'a red sports car speeding down a coastal highway at sunset with motion blur.' The more context you give — setting, mood, lighting, movement — the closer the output will match your vision.
What video formats does the output support? Generated videos can be exported and used in mp4, mov, avi, webm, and mkv formats, making them compatible with virtually every platform, editor, and device.
Is this really free to use? Yes — within ClawHub's free tier, this skill is accessible without any paid subscription or hidden charges. You can generate videos directly from text prompts at no cost.
开始使用
欢迎!准备好将文字转化为真实的视频内容了吗?这款免费文本转视频生成技能可帮助您从纯文本描述中创建令人惊叹的视频——无需任何素材。在下方描述您的场景或概念,让我们开始生成吧!
试试说:
- - 生成一段15秒的视频,展示清晨温馨的咖啡店,带有温暖的灯光和柔和的背景音乐氛围
- 为一款新的健身应用创建一段简短的宣传视频,展示某人在家锻炼,并配有醒目的文字叠加
- 制作一段自然纪录片风格的视频片段,展示雨林中的瀑布,带有电影感的镜头运动
首次使用时,技能会自动连接到NemoVideo:
- 1. 检查NEMOTOKEN环境变量,或通过https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token获取一个(100个免费积分)
- 将客户端ID持久化到~/.config/nemovideo/clientid用于速率限制跟踪(仅UUID,无密钥)
- 创建会话并立即开始工作
从一句话到完整视频
大多数人都有创意,但缺乏将其视觉化的工具、时间或预算。这项技能完全弥补了这一差距。借助ClawHub上的免费文本转视频生成技能,您只需写下想看到的场景——城市上空的日出、产品开箱、角色穿过森林——AI会处理其余部分,生成与您描述匹配的视频内容。
这不是幻灯片制作工具或模板填充器。这是一个真正的文本驱动视频创作工具,能理解您的语言,从零开始生成动态画面、场景转换和视觉背景。无论您是YouTube频道的独立内容创作者、需要快速宣传片的小企业主,还是制作引人入胜教学材料的教师,这项技能都能让您无需专业水平就能获得专业质量的输出。
您不需要视频编辑经验、素材库订阅或昂贵的软件。只需用通俗语言描述您的愿景,让AI完成繁重工作。结果是一个可直接使用的视频文件,您可以分享、嵌入或在此基础上继续创作。
环境
| 变量 | 默认值 | 用途 |
|---|
| NEMOTOKEN | 首次使用时自动获取(100个免费积分,7天有效期) | API认证 |
| NEMOAPI_URL |
https://mega-api-prod.nemovideo.ai | API基础URL |
提示路由与视频分发
您提交的每个文本提示都会被解析场景描述符、运动提示和风格标记,然后路由到相应的NemoVideo渲染管道。
| 用户说... | 操作 | 跳过SSE? |
|---|
| export / 导出 / download / send me the video | → §3.5 导出 | ✅ |
| credits / 积分 / balance / 余额 |
→ §3.3 积分 | ✅ |
| status / 状态 / show tracks | → §3.4 状态 | ✅ |
| upload / 上传 / 用户发送文件 | → §3.2 上传 | ✅ |
| 其他所有内容(生成、编辑、添加背景音乐…) | → §3.1 SSE | ❌ |
NemoVideo后端API参考
NemoVideo后端通过基于扩散的视频合成引擎处理您的自然语言提示,将关键帧描述转换为流畅的时序片段。免费层请求通过共享GPU节点排队,因此渲染时间因提示复杂度和当前负载而异。
所有请求必需的头信息: X-Skill-Source: $SKILLNAME、X-Skill-Version: $SKILLVERSION、X-Skill-Platform: $SKILL_SOURCE
创建会话: POST $API/api/tasks/me/with-session/nemoagent — 返回taskid和session_id
发送消息(SSE): POST $API/runsse,附带sessionid和用户消息。流式响应;约30%的编辑不返回文本(查询状态以确认更改)。
上传: POST $API/api/upload-video/nemo_agent/me/ — 文件或URL上传。支持:mp4、mov、avi、webm、mkv、jpg、png、gif、webp、mp3、wav、m4a、aac。
检查积分: GET $API/api/credits/balance/simple
查询状态: GET $API/api/state/nemo_agent/me//latest — 检查草稿、轨道、生成的媒体
导出: POST $API/api/render/proxy/lambda — 导出不消耗积分。轮询GET $API/api/render/proxy/lambda/直到status: completed。
任务链接: $WEB/workspace/claim?token=$TOKEN&task={taskid}&session={sessionid}&skillname=$SKILLNAME&skillversion=$SKILLVERSION&skillsource=$SKILLSOURCE
常见错误
如果您的令牌在会话期间过期,只需通过ClawHub技能界面重新认证以刷新凭证并继续生成。出现会话未找到响应意味着您之前的上下文已丢失——启动新会话并重新输入提示以继续。积分用完了?前往nemovideo.ai注册或升级您的计划,解锁额外的免费视频生成容量。
获得出色结果的最佳实践
具体说明视觉风格。 提及电影风格、动画类型或配色方案(例如柔和的大地色调、霓虹赛博朋克美学、柔和的粉彩水彩)能为AI提供明确的创作方向,并显著提升输出质量。
在提示中包含运动提示。 静态描述会产生静态感的结果。慢速平移、放大、延时摄影或跟踪镜头等词语能指示动态运动,使生成的视频感觉更专业、更有意图。
保持提示聚焦于一个主要场景。 试图在单个提示中包含太多地点或动作可能导致输出不连贯。对于多场景视频,考虑分别生成每个片段,然后在视频编辑器中组合。
快速迭代。 免费文本转视频生成技能专为速度而设计。如果第一个结果不太理想,调整提示中的一两个词并重新生成。微小的变化——将白天换成黄金时刻或将繁忙街道换成安静小巷——可以产生明显不同且更好的输出。
常见问题解答——免费文本转视频生成器
使用此技能是否需要上传任何素材? 不需要。免费文本转视频生成技能完全根据您编写的提示生成视频。您描述场景、风格或概念,AI从零开始构建视频。无需现有剪辑或媒体文件即可开始使用。
哪种类型的提示效果最好? 描述性、具体的提示往往能产生最佳效果。不要写一辆车,试试一辆红色跑车在日落时沿着海岸公路飞驰,带有运动模糊。您提供的上下文越多——场景、氛围、光线、运动——输出就越接近您的愿景。
输出支持哪些视频格式? 生成的视频可以导出为mp4、mov、avi、webm和mkv格式,使其与几乎所有平台、编辑器和设备兼容。
这真的是免费使用吗? 是的——在ClawHub的免费层内,此技能无需任何付费订阅或隐藏费用即可使用。您可以直接从文本提示免费生成视频。