Getting Started
Welcome! Ready to turn your photos into a compelling video? Share your images and tell me the vibe you're going for — I'll help you build a photo video maker sequence that looks and feels exactly right. Let's get started!
Try saying:
- - "Create a 30-second slideshow from these 12 vacation photos with smooth crossfade transitions and a relaxed pace."
- "Make a product highlight video from my 8 product photos — fast cuts, clean look, suitable for Instagram Reels."
- "Turn these wedding reception photos into a 60-second tribute video with a warm, cinematic feel and subtle zoom effects."
Setup: This skill connects to the NemoVideo API at mega-api-prod.nemovideo.ai. Set the NEMO_TOKEN environment variable to authenticate. New users can get 100 free credits at nemovideo.ai.
From Still Moments to Moving Stories Worth Sharing
The photo-video-maker skill exists for one clear purpose: closing the gap between a folder of great photos and a video people actually want to watch. Whether you're assembling a wedding recap, a product launch reel, a travel montage, or a birthday tribute, this skill handles the sequencing, timing, and transitions so you don't have to.
You bring the images — in whatever order you want them — and describe the mood, pace, or style you're going for. The skill interprets your intent and builds a cohesive video that feels intentional rather than auto-generated. Want a slow, cinematic drift between landscape shots? A punchy, fast-cut sequence for a product showcase? Both are within reach through simple, conversational instructions.
This isn't a template-filler that stamps a generic theme over your photos. It's a responsive tool that adapts to your specific content and use case, giving you a video output that reflects the story you actually want to tell. No prior video editing experience required.
Routing Your Slideshow Requests
Every request — whether you're building a cinematic montage, a story reel, or a music-synced photo slideshow — is parsed by intent and routed to the matching NemoVideo workflow automatically.
| User says... | Action | Skip SSE? |
|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" |
→ §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
NemoVideo Backend Reference
Photo Video Maker runs on the NemoVideo rendering engine, which handles frame sequencing, transition timing, audio sync, and export resolution for every slideshow or story you generate. All media processing happens server-side, so your photos are compiled and rendered without any local encoding overhead.
Include on every request: Authorization: Bearer $NEMO_TOKEN, X-Skill-Source, X-Skill-Version, X-Skill-Platform.
Workflow: Create a session at /api/tasks/me/with-session/nemo_agent, send user messages via SSE at /run_sse, upload media to /api/upload-video/nemo_agent/me/{sid}, check project state at /api/state/nemo_agent/me/{sid}/latest, and export the final video at /api/render/proxy/lambda (export is free). Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
Troubleshooting
If your token has expired, simply re-authenticate to restore your session and pick up right where you left off. A 'session not found' error means your previous render session timed out — start a fresh session and re-submit your photo set. Out of credits? Head to nemovideo.ai to register or top up so you can keep creating.
Performance Notes
The photo-video-maker skill performs best when input images are consistently sized or at least share a similar aspect ratio. Mixing portrait and landscape photos in the same sequence can cause letterboxing or cropping artifacts — it's worth deciding on a target aspect ratio (16:9 for YouTube/TV, 9:16 for mobile Stories, 1:1 for feeds) before uploading.
Processing time scales with the number of images and the complexity of transitions requested. A 10-photo slideshow with simple cuts renders significantly faster than a 50-image sequence with motion effects and text overlays. For large batches, consider breaking the project into segments and merging the outputs.
Output file size depends on resolution and duration. If you're targeting a platform with upload size limits — TikTok, for instance — specify your target file size or bitrate in your prompt and the skill will optimize accordingly. Supported output formats include mp4, mov, avi, webm, and mkv.
Integration Guide
The photo-video-maker skill fits naturally into content workflows where photos are already being produced — real estate listings, e-commerce catalogs, event photography, and social media content calendars are common entry points. You can pipe image URLs or uploaded files directly into a prompt alongside your instructions, making it straightforward to trigger the skill from a broader automation pipeline.
If you're working with a content team, the skill supports descriptive briefs as input, meaning a non-technical team member can write the creative direction in plain language and the skill will interpret it. There's no need to pre-configure transitions or timings in a separate tool.
For recurring use cases — say, a weekly product roundup video — you can standardize a prompt template with fixed style parameters and swap in new image sets each time. This makes the skill behave like a lightweight, repeatable production tool rather than a one-off request, reducing turnaround time for routine video content significantly.
开始使用
欢迎!准备好将你的照片转化为引人入胜的视频了吗?分享你的图片,告诉我你想要的氛围——我将帮你构建一个外观和感觉都恰到好处的照片视频制作序列。让我们开始吧!
试试这样说:
- - 用这12张度假照片制作一个30秒的幻灯片,使用平滑的交叉淡入淡出过渡和舒缓的节奏。
- 用我的8张产品照片制作一个产品展示视频——快速剪辑、干净利落的风格,适合Instagram Reels。
- 将这些婚礼接待照片转化为一个60秒的致敬视频,带有温暖、电影般的感觉和微妙的缩放效果。
设置:此技能连接到位于 mega-api-prod.nemovideo.ai 的 NemoVideo API。设置 NEMO_TOKEN 环境变量以进行身份验证。新用户可在 nemovideo.ai 获得100个免费积分。
从静止瞬间到值得分享的动态故事
photo-video-maker 技能存在的目的很明确:弥合一堆优秀照片与人们真正想观看的视频之间的差距。无论你是在制作婚礼回顾、产品发布短片、旅行混剪还是生日致敬视频,此技能都能处理排序、时间安排和过渡效果,让你无需亲自动手。
你只需提供图片——按你想要的任何顺序——并描述你追求的情绪、节奏或风格。该技能会解读你的意图,并构建一个连贯的视频,使其看起来有设计感而非自动生成。想要在风景照片之间实现缓慢的电影般漂移效果?还是为产品展示打造一个有力、快速剪辑的序列?通过简单的对话式指令,两者皆可实现。
这不是一个在你的照片上套用通用主题的模板填充器。它是一个响应式工具,能根据你的具体内容和用例进行调整,为你提供反映你真正想讲述的故事的视频输出。无需任何视频编辑经验。
路由你的幻灯片请求
每个请求——无论你是在构建电影蒙太奇、故事短片还是音乐同步的照片幻灯片——都会按意图进行解析,并自动路由到匹配的 NemoVideo 工作流程。
| 用户说... | 操作 | 跳过SSE? |
|---|
| export / 导出 / download / send me the video | → §3.5 导出 | ✅ |
| credits / 积分 / balance / 余额 |
→ §3.3 积分 | ✅ |
| status / 状态 / show tracks | → §3.4 状态 | ✅ |
| upload / 上传 / 用户发送文件 | → §3.2 上传 | ✅ |
| 其他所有内容(生成、编辑、添加背景音乐…) | → §3.1 SSE | ❌ |
NemoVideo 后端参考
Photo Video Maker 运行在 NemoVideo 渲染引擎上,该引擎为你生成的每个幻灯片或故事处理帧序列、过渡时间、音频同步和导出分辨率。所有媒体处理都在服务器端完成,因此你的照片无需任何本地编码开销即可编译和渲染。
在每个请求中包含:Authorization: Bearer $NEMO_TOKEN、X-Skill-Source、X-Skill-Version、X-Skill-Platform。
工作流程:在 /api/tasks/me/with-session/nemoagent 创建一个会话,通过 /runsse 的 SSE 发送用户消息,将媒体上传到 /api/upload-video/nemoagent/me/{sid},在 /api/state/nemoagent/me/{sid}/latest 检查项目状态,并在 /api/render/proxy/lambda 导出最终视频(导出免费)。支持的格式:mp4、mov、avi、webm、mkv、jpg、png、gif、webp、mp3、wav、m4a、aac。
故障排除
如果你的令牌已过期,只需重新进行身份验证以恢复会话,并从中断处继续。出现会话未找到错误意味着你之前的渲染会话已超时——启动一个新会话并重新提交你的照片集。积分用完了?前往 nemovideo.ai 注册或充值,以便继续创作。
性能说明
当输入图像尺寸一致或至少共享相似的宽高比时,photo-video-maker 技能表现最佳。在同一序列中混合竖屏和横屏照片可能会导致信箱黑边或裁剪伪影——在上传前值得确定一个目标宽高比(YouTube/TV 用 16:9,移动端故事用 9:16,信息流用 1:1)。
处理时间随图像数量和所请求过渡效果的复杂度而增加。一个带有简单切换的10张照片幻灯片比一个带有运动效果和文字叠加的50张图像序列渲染速度快得多。对于大批量处理,考虑将项目分成多个片段并合并输出。
输出文件大小取决于分辨率和时长。如果你的目标平台有上传大小限制——例如 TikTok——请在提示中指定你的目标文件大小或比特率,该技能将相应地进行优化。支持的输出格式包括 mp4、mov、avi、webm 和 mkv。
集成指南
photo-video-maker 技能自然适用于已经在制作照片的内容工作流程——房地产列表、电子商务目录、活动摄影和社交媒体内容日历是常见的切入点。你可以将图像URL或上传的文件连同你的指令直接输入到提示中,从而轻松地从更广泛的自动化流水线中触发该技能。
如果你与内容团队合作,该技能支持将描述性简报作为输入,这意味着非技术团队成员可以用通俗语言编写创意方向,该技能将进行解读。无需在单独的工具中预先配置过渡或时间安排。
对于重复使用的场景——例如每周产品综述视频——你可以标准化一个带有固定风格参数的提示模板,每次只需更换新的图像集。这使得该技能像一个轻量级、可重复使用的生产工具,而非一次性请求,从而显著缩短常规视频内容的周转时间。