Video Cog - AI Video Production
Long-form AI video production from a single prompt — scripted, voiced, scored, and edited automatically.
6-7 foundation models orchestrated to produce up to 4-minute videos from a single prompt: script writing, scene generation, voice synthesis, lipsync, music scoring, and editing — all automatic. Marketing videos, product demos, explainers, educational content, AI spokesperson videos, UGC, news reports, and more.
How to Use
For your first CellCog task in a session, read the cellcog skill for the full SDK reference — file handling, chat modes, timeouts, and more.
OpenClaw (fire-and-forget):
CODEBLOCK0
All agents except OpenClaw (blocks until done):
from cellcog import CellCogClient
client = CellCogClient(agent_provider="openclaw|cursor|claude-code|codex|...")
result = client.create_chat(
prompt="[your task prompt]",
task_label="my-task",
chat_mode="agent",
)
print(result["message"])
What Videos You Can Create
Marketing Videos
Promotional content for products and services:
- - Product Demos: "Create a 30-second product demo video for our new fitness app showing key features"
- Brand Videos: "Generate a 60-second brand story video for an eco-friendly clothing company"
- Social Ads: "Create a 15-second Instagram ad for a coffee subscription service"
- Launch Videos: "Make a product launch announcement video for a new AI writing tool"
Explainer Videos
Educational content that breaks down complex topics:
- - Product Explainers: "Create an explainer video showing how our SaaS platform works"
- Concept Explanations: "Make a video explaining how blockchain works for beginners"
- Process Walkthroughs: "Generate a video explaining the mortgage application process"
- Feature Tours: "Create a video tour of our app's new dashboard features"
Educational Videos
Learning content for courses and training:
- - Tutorial Videos: "Create a tutorial video on Python list comprehensions"
- Course Content: "Generate a lesson video on the causes of World War I"
- Training Materials: "Make an employee onboarding video about our company values"
- How-To Guides: "Create a how-to video for setting up a home studio for podcasting"
Documentary Style
Informative, story-driven content:
- - Mini Documentaries: "Create a 3-minute documentary-style video about the rise of electric vehicles"
- Company Stories: "Generate a documentary about our startup journey"
- Industry Deep Dives: "Make a documentary exploring the future of space tourism"
- Historical Content: "Create a documentary-style video about the history of Silicon Valley"
Cinematic / Creative
Artistic and visually striking content:
- - Short Films: "Create a 2-minute cinematic short about a day in Tokyo"
- Mood Pieces: "Generate a cinematic video capturing the energy of a busy coffee shop"
- Music Video Style: "Create a visually dynamic video for an electronic music track"
- Artistic Showcases: "Make a cinematic portfolio video for a photographer"
UGC (User Generated Content) Style
Authentic, relatable content that feels personal:
- - Testimonial Style: "Create a UGC-style testimonial video for a skincare product"
- Unboxing Style: "Generate an unboxing-style video for a new tech gadget"
- Day-in-the-Life: "Make a day-in-the-life style video featuring a remote worker using our app"
- Review Style: "Create a casual review-style video for a meal delivery service"
News / Reporting Style
Professional news-format content:
- - News Reports: "Create a news-style report video about the latest AI developments"
- Market Updates: "Generate a financial news video about tech stock earnings"
- Industry News: "Make a news report about new regulations in the fintech space"
- Analysis Pieces: "Create a news analysis video about the state of remote work"
Lipsync & Spokesperson Videos
CellCog can generate videos with AI characters speaking your script:
- - AI Spokesperson: "Create a video with a professional spokesperson explaining our product"
- Avatar Presentations: "Generate a video with an AI presenter delivering our quarterly update"
- Character Narration: "Make a video with a friendly character explaining our children's app"
For lipsync videos:
- 1. The starting frame should show only one human face prominently
- Provide the script/dialogue
- CellCog handles voice synthesis and lip synchronization
Video Specifications
| Aspect | Options |
|---|
| Duration | 15 seconds to 4 minutes |
| Aspect Ratios |
16:9 (landscape), 9:16 (portrait/mobile), 1:1 (square) |
|
Styles | Photorealistic, animated, cinematic, documentary, casual |
|
Audio | Background music, voiceover, sound effects, or silent |
When to Use Agent Team Mode
For video generation, always use chat_mode="agent team" (the default).
Video creation involves:
- - Script writing
- Scene planning
- Image generation for frames
- Audio generation
- Video synthesis
- Quality review
This multi-step process requires the full agent team for best results.
Example Video Prompts
Marketing video:
"Create a 30-second marketing video for 'FreshBrew' - a premium coffee subscription. Show beautiful coffee preparation scenes, happy customers, and end with our tagline 'Freshness Delivered Daily'. Upbeat background music, no voiceover. 16:9 for YouTube."
Explainer with voiceover:
"Create a 90-second explainer video for our project management tool. Walk through: 1) Creating a project, 2) Adding team members, 3) Tracking progress. Professional female voiceover, clean animated style, include captions. 16:9 format."
Educational content:
"Generate a 3-minute educational video explaining photosynthesis for middle school students. Use engaging animations, clear narration, and include a summary at the end. Friendly, approachable style."
Spokesperson video:
"Create a 60-second video with an AI spokesperson (professional male, 30s) announcing our Series B funding. Script: 'Today, we're thrilled to announce...' [provide full script]. Business casual setting, confident tone."
⚠️ Important — Video Generation Expectations
Long-form AI video production is still an evolving capability. While some users generate high-quality, cinematic videos that are ready for production use, others may spend significant credits and still not achieve a usable result. Even spending thousands of credits does not guarantee a satisfactory outcome — this is the nature of where AI video technology stands today.
There is a real learning curve to generating long-form videos with CellCog. It takes time, money, and patience. Your prompting skill, the complexity of what you're trying to create, and how well the foundation models perform on your specific request all play a role. Results improve as you develop intuition for what works, but we want to be upfront: video generation is inherently unpredictable, and there is always a risk that the output may not meet your expectations.
Tips for Better Videos
- 1. Specify duration: "30 seconds" or "2 minutes" helps scope the content appropriately.
- 2. Define aspect ratio: 16:9 for YouTube/web, 9:16 for TikTok/Reels/Shorts, 1:1 for Instagram feed.
- 3. Describe the style: "Cinematic", "casual UGC", "corporate professional", "playful animated".
- 4. Audio preferences: "Upbeat music", "calm narration", "no audio", "sound effects only".
- 5. Include key moments: Describe the scenes or beats you want to hit.
- 6. Provide scripts: For spokesperson/voiceover videos, write out exactly what should be said.
If CellCog is not installed
Run /cellcog-setup (or /cellcog:cellcog-setup depending on your tool) to install and authenticate.
OpenClaw users: Run clawhub install cellcog instead.
Manual setup: pip install -U cellcog and set CELLCOG_API_KEY. See the cellcog skill for SDK reference.
Video Cog - AI视频制作
通过单一提示词自动完成长篇AI视频制作——剧本、配音、配乐和剪辑全自动完成。
协调6-7个基础模型,从单一提示词生成最长4分钟的视频:剧本写作、场景生成、语音合成、唇形同步、配乐和剪辑——全部自动完成。营销视频、产品演示、解说视频、教育内容、AI发言人视频、用户生成内容、新闻报道等。
使用方法
在会话中执行第一个CellCog任务时,请阅读cellcog技能以获取完整的SDK参考——文件处理、聊天模式、超时等。
OpenClaw(即发即忘模式):
python
result = client.create_chat(
prompt=[你的任务提示词],
notifysessionkey=agent:main:main,
task_label=my-task,
chat_mode=agent,
)
除OpenClaw外的所有代理(阻塞直至完成):
python
from cellcog import CellCogClient
client = CellCogClient(agent_provider=openclaw|cursor|claude-code|codex|...)
result = client.create_chat(
prompt=[你的任务提示词],
task_label=my-task,
chat_mode=agent,
)
print(result[message])
你可以创建的视频类型
营销视频
产品与服务的推广内容:
- - 产品演示:为我们的新健身应用创建一个30秒的产品演示视频,展示核心功能
- 品牌视频:为一家环保服装公司生成一个60秒的品牌故事视频
- 社交广告:为咖啡订阅服务创建一个15秒的Instagram广告
- 发布视频:为新的AI写作工具制作一个产品发布公告视频
解说视频
分解复杂主题的教育内容:
- - 产品解说:创建一个解说视频,展示我们的SaaS平台如何运作
- 概念解释:制作一个视频,为初学者解释区块链的工作原理
- 流程演示:生成一个视频,解释抵押贷款申请流程
- 功能导览:创建一个我们应用新仪表盘功能的导览视频
教育视频
课程和培训的学习内容:
- - 教程视频:创建一个关于Python列表推导式的教程视频
- 课程内容:生成一个关于第一次世界大战起因的课程视频
- 培训材料:制作一个关于我们公司价值观的员工入职视频
- 操作指南:创建一个关于如何搭建播客家庭工作室的操作指南视频
纪录片风格
信息丰富、故事驱动的内容:
- - 迷你纪录片:创建一个3分钟纪录片风格的视频,关于电动汽车的崛起
- 公司故事:生成一个关于我们创业历程的纪录片
- 行业深度分析:制作一个探索太空旅游未来的纪录片
- 历史内容:创建一个关于硅谷历史的纪录片风格视频
电影/创意风格
艺术性和视觉冲击力强的内容:
- - 短片:创建一个2分钟的电影短片,关于东京的一天
- 氛围作品:生成一个捕捉繁忙咖啡馆活力的电影感视频
- 音乐视频风格:为一首电子音乐曲目创建一个视觉动感视频
- 艺术展示:为摄影师制作一个电影感作品集视频
用户生成内容风格
真实、有共鸣、感觉个人化的内容:
- - 推荐风格:为护肤品创建一个用户生成内容风格的推荐视频
- 开箱风格:为新的科技小工具生成一个开箱风格视频
- 日常生活:制作一个日常生活风格视频,展示远程工作者使用我们的应用
- 评测风格:为送餐服务创建一个休闲评测风格视频
新闻/报道风格
专业新闻格式的内容:
- - 新闻报道:创建一个新闻风格报道视频,关于最新的AI发展
- 市场更新:生成一个关于科技股收益的财经新闻视频
- 行业新闻:制作一个关于金融科技领域新规的新闻报道
- 分析内容:创建一个关于远程工作现状的新闻分析视频
唇形同步与发言人视频
CellCog可以生成AI角色朗读你剧本的视频:
- - AI发言人:创建一个专业发言人解释我们产品的视频
- 虚拟形象演示:生成一个AI主持人发布我们季度更新的视频
- 角色旁白:制作一个友好角色解释我们儿童应用的视频
唇形同步视频的要求:
- 1. 起始帧应只突出显示一张人脸
- 提供剧本/对话内容
- CellCog处理语音合成和唇形同步
视频规格
16:9(横屏)、9:16(竖屏/手机)、1:1(方形) |
|
风格 | 照片级真实、动画、电影、纪录片、休闲 |
|
音频 | 背景音乐、画外音、音效或静音 |
何时使用代理团队模式
对于视频生成,始终使用chat_mode=agent team(默认模式)。
视频创作涉及:
- - 剧本写作
- 场景规划
- 帧图像生成
- 音频生成
- 视频合成
- 质量审核
这个多步骤流程需要完整的代理团队才能获得最佳效果。
视频提示词示例
营销视频:
为FreshBrew创建一个30秒的营销视频——这是一个高端咖啡订阅服务。展示精美的咖啡制作场景、满意的顾客,最后以我们的标语每日新鲜送达结束。使用欢快的背景音乐,无需画外音。16:9格式,用于YouTube。
带画外音的解说视频:
为我们的项目管理工具创建一个90秒的解说视频。逐步展示:1)创建项目,2)添加团队成员,3)跟踪进度。专业女声画外音,简洁动画风格,包含字幕。16:9格式。
教育内容:
为中学生生成一个3分钟的教育视频,解释光合作用。使用引人入胜的动画、清晰的旁白,并在结尾包含总结。友好、亲切的风格。
发言人视频:
创建一个60秒的视频,使用AI发言人(专业男性,30多岁)宣布我们的B轮融资。剧本:今天,我们激动地宣布...[提供完整剧本]。商务休闲着装,自信的语气。
⚠️ 重要提示——视频生成预期
长篇AI视频制作仍是一项不断发展的能力。虽然有些用户能生成高质量、电影感十足且可直接用于生产的视频,但其他用户可能花费大量积分仍无法获得可用结果。即使花费数千积分也不能保证满意的结果——这就是当前AI视频技术的现状。
使用CellCog生成长篇视频确实存在学习曲线。这需要时间、金钱和耐心。你的提示词技巧、尝试创建内容的复杂程度,以及基础模型对你特定请求的表现,都起着重要作用。随着你对什么有效形成直觉,结果会有所改善,但我们要坦诚相告:视频生成本质上具有不可预测性,输出结果可能始终无法达到你的预期。
制作更好视频的技巧
- 1. 指定时长:30秒或2分钟有助于适当限定内容范围。
- 2. 定义宽高比:YouTube/网页使用16:9,TikTok/Reels/Shorts使用9:16,Instagram信息流使用1:1。
- 3. 描述风格:电影感、休闲用户生成内容、企业专业、趣味动画。
- 4. 音频偏好:欢快音乐、平静旁白、无音频、仅音效。
- 5. 包含关键场景:描述你想要呈现的场景或节奏点。
- 6. 提供剧本:对于发言人/画外音视频,准确写出应该说的内容。
如果未安装CellCog
运行/cellcog-setup(或根据你的工具使用/cellcog:cellcog-setup)进行安装和认证。
OpenClaw用户: 请运行clawhub install cellcog。
手动安装: pip install -U cellcog并设置CELLCOGAPIKEY。请参阅cellcog技能获取SDK参考。