Agent setup: If your agent doesn't auto-load skills (e.g. Claude Code),
see agent-compatibility.md once per session.

Qwen Video Generation

Generate videos using Wan models. All tasks are asynchronous — submit, then poll until
completion.
This skill is part of qwencloud/qwencloud-ai.

⚠️ Critical Parameter Differences by Mode:

- kf2v (First+Last Frame): Duration is fixed at 5 seconds — other values will fail. Output is silent only.
Resolution parameter varies: t2v/r2v/vace use size (e.g. "1280*720"); i2v/kf2v use resolution (e.g. "720P").

Skill directory

Use this skill's internal files to execute and learn. Load reference files on demand when the default path fails or you need details.

Location	Purpose
INLINECODE4	Default execution — mode auto-detect, submit, poll, download
INLINECODE5

Security

NEVER output any API key or credential in plaintext. Always use variable references ($DASHSCOPE_API_KEY in shell, os.environ["DASHSCOPE_API_KEY"] in Python). Any check or detection of credentials must be non-plaintext: report only status (e.g. "set" / "not set", "valid" / "invalid"), never the value. Never display contents of .env or config files that may contain secrets.

When the API key is not configured, NEVER ask the user to provide it directly. Instead, help create a .env file with a placeholder (DASHSCOPE_API_KEY=sk-your-key-here) and instruct the user to replace it with their actual key from the QwenCloud Console. Only write the actual key value if the user explicitly requests it.

Key Compatibility

Scripts require a standard QwenCloud API key (sk-...). Coding Plan keys (sk-sp-...) cannot be used — video generation models are not available on Coding Plan, and Coding Plan does not support the native QwenCloud API. Video generation incurs per-second charges on standard keys. The script detects sk-sp- keys at startup and prints a warning. If qwencloud-ops-auth is installed, see its references/codingplan.md for full details.

Mode Selection Guide

User Want	Mode	Key Field
Generate video from text description only	t2v	INLINECODE23 only
Animate a single image

Model Selection

1. User specified a model → use directly.
Consult the qwencloud-model-selector skill when model choice depends on capability, scenario, or pricing.
No signal, clear task → per-mode defaults: t2v → wan2.6-t2v, i2v → wan2.6-i2v-flash, kf2v → wan2.2-kf2v-flash, r2v → wan2.6-r2v-flash, vace → wan2.1-vace-plus.

Models

t2v (Text-to-Video)

Model	Features
INLINECODE35 recommended	Audio, multi-shot, 2–15s, 720P/1080P
INLINECODE36

Audio, 5s/10s, 480P/720P/1080P | | wan2.2-t2v-plus | Silent, 5s, 480P/1080P |

i2v (Image-to-Video)

Model	Features
INLINECODE38 recommended	Audio/silent, multi-shot, 2–15s, 720P/1080P
INLINECODE39

Audio, multi-shot, 2–15s, 720P/1080P | | wan2.5-i2v-preview | Audio, 5s/10s, 480P/720P/1080P |

kf2v / r2v / vace

Model	Features
INLINECODE41 (kf2v default)	Silent, 5s, 480P/720P/1080P
INLINECODE42

Audio, single/multi character, 2–10s, 720P/1080P | | wan2.6-r2v-flash (r2v default) | Audio/silent, multi-character, 2–10s, 720P/1080P | | wan2.1-vace-plus (vace) | Multi-image ref, repainting, local edit, ≤5s, 720P |

⚠️ Important: The model list above is a point-in-time snapshot and may be outdated. Model availability
changes frequently. Always check the official model list
for the authoritative, up-to-date catalog before making model decisions.

Execution

⚠️ Multiple artifacts: When generating multiple files in a single session, you MUST append a numeric suffix to each filename (e.g. out_1.mp4, out_2.mp4) to prevent overwrites.

Prerequisites

- API Key: Check that DASHSCOPE_API_KEY (or QWEN_API_KEY) is set using a non-plaintext check only (e.g. in shell:

[ -n "$DASHSCOPE_API_KEY" ]; report only "set" or "not set", never the key value). If not set: run the * qwencloud-ops-auth* skill if available; otherwise guide the user to obtain a key from QwenCloud Console and set it via .env file ( echo 'DASHSCOPE_API_KEY=sk-your-key-here' >> .env in project root or current directory) or environment variable. The script searches for .env in the current working directory and the project root. Skills may be installed independently — do not assume qwencloud-ops-auth is present.

- Python 3.9+ (stdlib only, no pip install needed)
For media merging (concat, trim, audio overlay): see merge-media.md for ffmpeg/moviepy recipes suited to the user's environment

Environment Check

Before first execution, verify Python is available:

CODEBLOCK0

If python3 is not found, try python --version or py -3 --version. If Python is unavailable or below 3.9, skip to Path 2 (curl) in execution-guide.md.

Default: Run Script

Script path: Scripts are in the scripts/ subdirectory of this skill's directory (the directory containing this SKILL.md). You MUST first locate this skill's installation directory, then ALWAYS use the full absolute path to execute scripts. Do NOT assume scripts are in the current working directory. Do NOT use cd to switch directories before execution.

Execution note: Run all scripts in the foreground — wait for stdout; do not background.

Discovery: Run python3 <this-skill-dir>/scripts/video.py --help first to see all available arguments.

CODEBLOCK1

Argument	Description
INLINECODE59	JSON request body
INLINECODE60

Verify Result

- Exit code 0 + response has output.task_id → submission success
Poll reaches task_status: SUCCEEDED → generation complete
Non-zero exit, HTTP error, or FAILED status → fail
Post-execution check: Verify the output video file exists and has non-zero size (ls -la <output_dir>)
MANDATORY — stderr signal check: After confirming the result, scan the command's stderr output for [ACTION_REQUIRED] or [UPDATE_AVAILABLE]. If either signal is present, you MUST follow the instructions in Update Check below before responding to the user.

On Failure

If the script fails, match the error output against the diagnostic table below to determine the resolution. If no match, read execution-guide.md for alternative paths: curl commands (Path 2 — all 5 modes), code generation (Path 3), and autonomous resolution (Path 5).

If Python is not available at all → skip directly to Path 2 (curl) in execution-guide.md.

Error Pattern	Diagnosis	Resolution
INLINECODE76	Python not on PATH	Try `python` or `py -3`; install Python 3.9+ if missing
INLINECODE79

Request Fields Summary

All modes require prompt. See request-fields.md for full field tables per mode.

⚠️ Resolution Parameter by Mode (Critical)

Mode	Parameter	Format	Example
t2v	INLINECODE98	INLINECODE99	INLINECODE100, INLINECODE101
r2v

size | "WxH" | "1280*720", "1920*1080" | | vace | size | "WxH" | "1280*720" | | i2v | resolution | "xxxP" | "720P", "1080P" | | kf2v | resolution | "xxxP" | "480P", "720P", "1080P" |

Using the wrong parameter name will cause the API call to fail.

Mode-Specific Required Fields

- i2v needs img_url/reference_image. kf2v needs first_frame_url + last_frame_url. r2v needs reference_urls. vace needs function.

Cost Estimation

🚨 NEVER guess or fabricate any price figure. Always direct the user to the
official pricing page for exact rates.

Cost is billed per second of generated video. Price varies by model and resolution. For the latest rates, check
the official pricing page.

Model	720P (USD)	1080P (USD)
wan2.6-t2v	per-second billing	per-second billing
wan2.6-i2v-flash

Quick example: wan2.6-t2v 5s 720P — check
the official pricing page for current per-second
rates. Some models may offer a limited free quota — verify availability in the
user's QwenCloud console before assuming any call
is free.

Local File Handling

When the user provides local file paths (images, videos, audio), pass them directly to the script. The script automatically uploads local files to DashScope temporary storage (oss:// URL, 48h TTL) and injects the X-DashScope-OssResourceResolve: enable header. No manual upload step is needed.

Production: Default temp storage has 48h TTL and 100 QPS upload limit — not suitable for production, high-concurrency, or load-testing. To use your own OSS bucket, set QWEN_TMP_OSS_BUCKET and QWEN_TMP_OSS_REGION in .env, install pip install alibabacloud-oss-v2, and provide credentials via QWEN_TMP_OSS_AK_ID / QWEN_TMP_OSS_AK_SECRET or the standard OSS_ACCESS_KEY_ID / OSS_ACCESS_KEY_SECRET. Use a RAM user with least-privilege (oss:PutObject + oss:GetObject on target bucket only). If qwencloud-ops-auth is installed, see its references/custom-oss.md for the full setup guide.

Cross-Skill Chaining

When using output from another skill as input (e.g., image-gen → i2v, audio-tts → audio overlay):

- Pass the URL directly (e.g., "img_url": "<image_url from image-gen>") — do NOT download and re-pass as local path
The script detects URL prefixes (https://, oss://) and passes them through without re-upload
Use local_path from the response only for user preview or non-API operations

When passing this skill's output to another skill (e.g., vace edit, vision analyze):

- Pass video_url from the response — do NOT download and re-pass as local path

Scenario	Use
Feed to another skill	INLINECODE142 / `image_url` (URL)
Show to user / local playback

local_path (local file) |

Important Notes

- Async only: All video APIs require X-DashScope-Async: enable header.
kf2v: Uses a different API endpoint. Duration fixed at 5s, silent only.
r2v: Use character1/character2/... in prompt. Up to 5 references (max 3 videos).
vace: Must specify function. Silent only, output ≤5s.
Multi-shot: Set shot_type: "multi" AND prompt_extend: true.
Video URL expires in 24h — the script auto-downloads to --output dir. When chaining to another skill (e.g., vace edit), pass video_url directly — do NOT re-download.
For advanced workflows → see workflows.md.

Error Handling

Error	Cause	Action
INLINECODE153	Invalid or missing API key	Run qwencloud-ops-auth if available; else prompt user to set key (non-plaintext check only)
INLINECODE154

Output Location

Prefer the current working directory. Default subdirectory: ./output/qwencloud-video-generation/.

Write prohibition: NEVER write output files into this skill's installation directory or any skills/ hierarchy. All generated content must go to output/ under the current working directory or a user-specified path.

Update Check (MANDATORY Post-Execution)

After script execution, check stderr for action signals. The script automatically detects update status and emits structured signals. You MUST act on them. This section also applies when the user explicitly asks to check for updates.

Signal: `[ACTION_REQUIRED] qwencloud-update-check skill is not installed`

When stderr contains this signal, you MUST present the following options to the user and wait for their response before proceeding. Do NOT silently skip this.

The update-check skill is not installed. It keeps you informed about new qwencloud/qwencloud-ai features and bug fixes.
1) Install now — I'll run the install command for you
2) Skip for now — do nothing this time
3) Never remind me — permanently suppress this prompt

After the user responds:

- User chooses 1 → run: npx skills add QwenCloud/qwencloud-ai --skill qwencloud-update-check -y, then run: INLINECODE165
User chooses 2 → run: python3 <this-skill-scripts-dir>/gossamer.py --dismiss, then continue normally
User chooses 3 → run: python3 <this-skill-scripts-dir>/gossamer.py --never-install, then continue normally

Signal: `[UPDATE_AVAILABLE]`

When stderr contains this signal, you MUST append a brief update notice to your response, including the version info and the update command shown in the stderr output.

No signal in stderr

If stderr contains neither [ACTION_REQUIRED] nor [UPDATE_AVAILABLE], no action is needed — the skill is installed and up to date (or cached within 24h).

Explicit user request

When the user explicitly asks to check for updates (e.g. "check for updates", "check version"):

1. Look for qwencloud-update-check/SKILL.md in sibling skill directories.
If found — run: python3 <qwencloud-update-check-dir>/scripts/check_update.py --print-response and report the result.
If not found — present the install options above.

References

- execution-guide.md — Fallback paths (curl for all modes, code generation, autonomous)
request-fields.md — Detailed field tables by mode + audio handling
workflows.md — Duration extensions, audio workarounds, multi-shot, VACE pipelines
polling-guide.md — Polling patterns and timing recommendations
merge-media.md — Guide for generating merge/trim/audio-overlay code
examples.md — Full script execution examples for all modes
sources.md — Official documentation URLs

Agent 设置：如果你的智能体不会自动加载技能（例如 Claude Code），请在每个会话中查看一次 agent-compatibility.md。

Qwen 视频生成

使用 Wan 模型生成视频。所有任务均为异步——提交，然后轮询直至完成。
此技能属于 qwencloud/qwencloud-ai 的一部分。

⚠️ 不同模式的关键参数差异：

- kf2v（首帧+尾帧）：时长固定为 5 秒——其他值将导致失败。输出仅为无声视频。
分辨率参数因模式而异：t2v/r2v/vace 使用 size（例如 1280*720）；i2v/kf2v 使用 resolution（例如 720P）。

技能目录

使用此技能的内部文件来执行和学习。当默认路径失败或需要详细信息时，按需加载参考文件。

位置	用途
scripts/video.py	默认执行——模式自动检测、提交、轮询、下载
references/execution-guide.md

安全性

切勿以明文形式输出任何 API 密钥或凭据。 始终使用变量引用（Shell 中使用 $DASHSCOPEAPIKEY，Python 中使用 os.environ[DASHSCOPEAPIKEY]）。任何对凭据的检查或检测都必须是非明文的：仅报告状态（例如“已设置”/“未设置”、“有效”/“无效”），切勿输出值。切勿显示可能包含密钥的 .env 或配置文件的内容。

当 API 密钥未配置时，切勿要求用户直接提供。 相反，应帮助创建一个包含占位符（DASHSCOPEAPIKEY=sk-your-key-here）的 .env 文件，并指导用户从 QwenCloud 控制台获取实际密钥进行替换。仅当用户明确要求时才写入实际的密钥值。

密钥兼容性

脚本需要标准的 QwenCloud API 密钥（sk-...）。编程计划密钥（sk-sp-...）不能使用——视频生成模型在编程计划上不可用，且编程计划不支持原生 QwenCloud API。视频生成对标准密钥按秒计费。脚本在启动时会检测 sk-sp- 密钥并打印警告。如果已安装 qwencloud-ops-auth，请参阅其 references/codingplan.md 获取完整详情。

模式选择指南

用户需求	模式	关键字段
仅根据文本描述生成视频	t2v	仅 prompt
将单张图片动画化

模型选择

1. 用户指定了模型 → 直接使用。
当模型选择取决于能力、场景或价格时，请咨询 qwencloud-model-selector 技能。
无指示，任务明确 → 各模式默认值：t2v → wan2.6-t2v，i2v → wan2.6-i2v-flash，kf2v → wan2.2-kf2v-flash，r2v → wan2.6-r2v-flash，vace → wan2.1-vace-plus。

模型

t2v（文本到视频）

模型	特性
wan2.6-t2v 推荐	音频、多镜头、2–15 秒、720P/1080P
wan2.5-t2v-preview

音频、5 秒/10 秒、480P/720P/1080P | | wan2.2-t2v-plus | 无声、5 秒、480P/1080P |

i2v（图片到视频）

模型	特性
wan2.6-i2v-flash 推荐	音频/无声、多镜头、2–15 秒、720P/1080P
wan2.6-i2v

音频、多镜头、2–15 秒、720P/1080P | | wan2.5-i2v-preview | 音频、5 秒/10 秒、480P/720P/1080P |

kf2v / r2v / vace

模型	特性
wan2.2-kf2v-flash （kf2v 默认）	无声、5 秒、480P/720P/1080P
wan2.6-r2v

⚠️ 重要提示：上述模型列表是某个时间点的快照，可能已过时。模型可用性会频繁变化。在做出模型决策之前，请始终查看官方模型列表以获取权威的最新目录。

执行

⚠️ 多个输出文件：在单次会话中生成多个文件时，你必须在每个文件名后附加数字后缀（例如 out1.mp4、out2.mp4）以防止覆盖。

前提条件

- API 密钥：仅使用非明文检查来确认 DASHSCOPEAPIKEY（或 QWENAPIKEY）是否已设置（例如在 Shell 中：[ -n $DASHSCOPEAPIKEY ]；仅报告“已设置”或“未设置”，切勿输出密钥值）。如果未设置：如果可用，运行 qwencloud-ops-auth 技能；否则指导用户从 QwenCloud 控制台获取密钥，并通过 .env 文件（在项目根目录或当前目录执行 echo DASHSCOPEAPIKEY=sk-your-key-here >> .env）或环境变量进行设置。脚本会在当前工作目录和项目根目录中搜索 .env。技能可能独立安装——不要假设 qwencloud-ops-auth 存在。
Python 3.9+（仅使用标准库，无需 pip install）
对于媒体合并（拼接、裁剪、音频叠加）：请参阅 merge-media.md 获取适合用户环境的 ffmpeg/moviepy 方案

环境检查

首次执行前，确认 Python 可用：

bash
python3 --version # 必须为 3.9+

如果找不到 python3，请尝试 python --version 或 py -3 --version。如果 Python 不可用或版本低于 3.9，请跳转到 execution-guide.md 中的路径 2（curl）。

默认：运行脚本

脚本路径：脚本位于此技能目录（包含此 SKILL.md 的目录）

qwencloud-video-generation通义万相视频生成

qwencloud-video-generation

Qwen Video Generation

Skill directory

Security

Key Compatibility

Mode Selection Guide

Model Selection

Models

t2v (Text-to-Video)

i2v (Image-to-Video)

kf2v / r2v / vace

Execution

Prerequisites

Environment Check

Default: Run Script

Verify Result

On Failure

Request Fields Summary

⚠️ Resolution Parameter by Mode (Critical)

Mode-Specific Required Fields

Cost Estimation

Local File Handling

Cross-Skill Chaining

Important Notes

Error Handling

Output Location

Update Check (MANDATORY Post-Execution)

Signal: [ACTION_REQUIRED] qwencloud-update-check skill is not installed

Signal: [UPDATE_AVAILABLE]

No signal in stderr

Explicit user request

References

Qwen 视频生成

技能目录

安全性

密钥兼容性

模式选择指南

模型选择

模型

t2v（文本到视频）

i2v（图片到视频）

kf2v / r2v / vace

执行

前提条件

环境检查

默认：运行脚本

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement

Signal: `[ACTION_REQUIRED] qwencloud-update-check skill is not installed`

Signal: `[UPDATE_AVAILABLE]`