Agent setup: If your agent doesn't auto-load skills (e.g. Claude Code),
see agent-compatibility.md once per session.

Qwen Image Generation

Generate and edit images using Wan and Qwen Image models. Supports text-to-image, reference-image editing (style
transfer, subject consistency, multi-image composition, text rendering), and interleaved text-image output.
This skill is part of qwencloud/qwencloud-ai.

Skill directory

Use this skill's internal files to execute and learn. Load reference files on demand when the default path fails or you need details.

Location	Purpose
scripts/image.py	Default execution: sync/async, upload, download
references/execution-guide.md	Fallback paths (curl sync/async, code generation, autonomous)

Security

NEVER output any API key or credential in plaintext. Always use variable references ($DASHSCOPE_API_KEY in shell, os.environ["DASHSCOPE_API_KEY"] in Python). Any check or detection of credentials must be non-plaintext: report only status (e.g. "set" / "not set", "valid" / "invalid"), never the value. Never display contents of .env or config files that may contain secrets.

When the API key is not configured, NEVER ask the user to provide it directly. Instead, help create a .env file with a placeholder (DASHSCOPE_API_KEY=sk-your-key-here) and instruct the user to replace it with their actual key from the QwenCloud Console. Only write the actual key value if the user explicitly requests it.
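A minimal sketch of a non-plaintext credential check, assuming the key lives in one of the two environment variables this skill mentions (DASHSCOPE_API_KEY or QWEN_API_KEY). The function name `credential_status` is hypothetical and is not part of the skill's scripts.

```python
import os


def credential_status(var_names=("DASHSCOPE_API_KEY", "QWEN_API_KEY")):
    """Return 'set' or 'not set' without ever exposing the key value."""
    for name in var_names:
        # Only test for presence; the value is never returned, logged, or printed.
        if os.environ.get(name, "").strip():
            return "set"
    return "not set"
```

Calling `print(f"API key: {credential_status()}")` reports only the status string, which is the behavior the rule above asks for.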

Key Compatibility

Scripts require a standard QwenCloud API key (sk-...). Coding Plan keys (sk-sp-...) cannot be used — image generation models are not available on Coding Plan, and Coding Plan does not support the native QwenCloud API. The script detects sk-sp- keys at startup and prints a warning. If qwencloud-ops-auth is installed, see its references/codingplan.md for full details.
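The prefix check described above could look roughly like the following. This is an illustrative sketch, not the script's actual implementation; `key_kind` and `startup_warning` are hypothetical names, and only the sk- and sk-sp- prefixes come from the text.

```python
from typing import Optional


def key_kind(api_key: str) -> str:
    """Classify a key by prefix only; the key itself is never echoed."""
    # Check the more specific prefix first: "sk-sp-" also starts with "sk-".
    if api_key.startswith("sk-sp-"):
        return "coding-plan"
    if api_key.startswith("sk-"):
        return "standard"
    return "unknown"


def startup_warning(api_key: str) -> Optional[str]:
    """Return a warning for Coding Plan keys, otherwise None."""
    if key_kind(api_key) == "coding-plan":
        return "Warning: Coding Plan keys (sk-sp-...) cannot be used for image generation."
    return None
```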

Mode Selection Guide

User Want	Mode	Model
Generate image from text only	t2i	wan2.6-t2i (default)
Edit image / apply style transfer based on 1–4 reference images

Model Selection

Wan Series (default)

Model	Use Case
wan2.6-t2i	Recommended for text-to-image: sync + async, best quality
wan2.6-image	Image editing ONLY (NOT for pure text-to-image): requires reference_images or enable_interleave: true. Style transfer, subject consistency (1–4 images), interleaved text-image output, 2K
wan2.5-i2i-preview	Image editing: single-image editing with subject consistency, multi-image fusion (up to 3 images), async-only
wan2.5-t2i-preview	Preview: free size within constraints
wan2.2-t2i-flash	Fast: lower latency
wan2.2-t2i-plus	Professional: improved stability

Qwen Image Series

Model	Use Case
qwen-image-2.0-pro	Fused generation + editing — text rendering, realistic textures, multi-image (1–3 input, 1–6 output)
qwen-image-2.0

Qwen Image editing models (qwen-image-2.0-pro, qwen-image-2.0, qwen-image-edit-max/plus/edit) use the same sync endpoint as wan2.6-image (/multimodal-generation/generation) with messages format. They support text editing in images, element add/delete/replace, style transfer, and multi-image fusion (1–3 input images). Size range: 512x512 to 2048x2048. qwen-image-2.0-pro and qwen-image-2.0 also support pure text-to-image (no reference images needed).

Qwen Image text-to-image models (qwen-image-plus, qwen-image-max) use a different endpoint (/text2image/image-synthesis) with input.prompt format (async-only). They support only 5 fixed resolutions: 1664*928, 1472*1104, 1328*1328, 1104*1472, 928*1664.
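Because qwen-image-plus and qwen-image-max accept only five sizes, a request can be checked locally before submission. The sketch below assumes sizes are written as width*height strings; the helper name `is_fixed_size` is hypothetical, and accepting an "x" separator is a convenience added here.

```python
# The five fixed sizes quoted above for qwen-image-plus / qwen-image-max.
QWEN_T2I_FIXED_SIZES = frozenset(
    {"1664*928", "1472*1104", "1328*1328", "1104*1472", "928*1664"}
)


def is_fixed_size(size: str) -> bool:
    """Accept 'W*H' or 'WxH' and report whether it is one of the five fixed sizes."""
    normalized = size.strip().lower().replace("x", "*")
    return normalized in QWEN_T2I_FIXED_SIZES
```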

Choosing between wan2.6-image and wan2.5-i2i-preview for image editing:

- wan2.6-image supports up to 4 images, higher resolution (2K), interleaved text-image output, and sync mode. Use it for multi-image style composition and interleaved tutorials.
- wan2.5-i2i-preview uses a simpler prompt-only editing interface (no messages format), supports up to 3 images, and is async-only. Use it for straightforward single-image edits and multi-image object fusion.

1. User specified a model → use directly.
2. Consult the qwencloud-model-selector skill when model choice depends on requirement, scenario, or pricing.
3. Text-to-image (prompt only, no reference images) → always use wan2.6-t2i (default). NEVER use wan2.6-image for pure text-to-image: it will error without reference images or enable_interleave: true.
4. Reference images / image editing / interleaved output → wan2.6-image (preferred) or wan2.5-i2i-preview.
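The mechanical part of these rules can be sketched as a small function. This is only an illustration: `choose_model` and its parameters are hypothetical, and the rule about consulting qwencloud-model-selector is a judgment call that is not encoded here.

```python
from typing import Optional


def choose_model(
    user_model: Optional[str] = None,
    has_reference_images: bool = False,
    enable_interleave: bool = False,
) -> str:
    """Pick a default model following the selection rules above."""
    if user_model:
        # An explicit user choice always wins.
        return user_model
    if has_reference_images or enable_interleave:
        # Editing and interleaved output need wan2.6-image.
        return "wan2.6-image"
    # Prompt-only requests must not use wan2.6-image.
    return "wan2.6-t2i"
```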

⚠️ Important: The model list above is a point-in-time snapshot and may be outdated. Model availability
changes frequently. Always check the official model list
for the authoritative, up-to-date catalog before making model decisions.

Execution

⚠️ Multiple artifacts: When generating multiple files in a single session, you MUST append a numeric suffix to each filename (e.g. out_1.png, out_2.png) to prevent overwrites.
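One way to produce such suffixed names, shown as a sketch with a hypothetical helper `numbered_paths`; the skill's script may name files differently.

```python
from pathlib import Path


def numbered_paths(base: str, count: int) -> list[str]:
    """Return count file names derived from base, suffixed _1, _2, ..."""
    p = Path(base)
    return [str(p.with_name(f"{p.stem}_{i}{p.suffix}")) for i in range(1, count + 1)]
```

For example, `numbered_paths("out.png", 2)` yields `out_1.png` and `out_2.png`, so a second generation never overwrites the first.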

Prerequisites

- API Key: Check that DASHSCOPE_API_KEY (or QWEN_API_KEY) is set using a non-plaintext check only (e.g. in shell: [ -n "$DASHSCOPE_API_KEY" ]; report only "set" or "not set", never the key value). If not set: run the qwencloud-ops-auth skill if available; otherwise guide the user to obtain a key from QwenCloud Console and set it via .env file (echo 'DASHSCOPE_API_KEY=sk-your-key-here' >> .env in project root or current directory) or environment variable. The script searches for .env in the current working directory and the project root. Skills may be installed independently — do not assume qwencloud-ops-auth is present.
- Python 3.9+ (stdlib only, no pip install needed)

Environment Check

Before first execution, verify Python is available:

    python3 --version

If python3 is not found, try python --version or py -3 --version. If Python is unavailable or below 3.9, skip to Path 2 (curl) in execution-guide.md.

Default: Run Script

Script path: Scripts are in the scripts/ subdirectory of this skill's directory (the directory containing this
SKILL.md). You MUST first locate this skill's installation directory, then ALWAYS use the full absolute path to execute
scripts. Do NOT assume scripts are in the current working directory. Do NOT use cd to switch directories before
execution.

Execution note: Run all scripts in the foreground — wait for stdout; do not background.

Discovery: Run python3 <this-skill-dir>/scripts/image.py --help first to see all available arguments.

CODEBLOCK1

More examples (interleaved output, wan2.5-i2i, qwen-image-2.0-pro, qwen-image-plus): See execution-guide.md

Argument	Description
INLINECODE65	JSON request body
INLINECODE66

Verify Result

- Exit code 0 + output contains valid JSON with output.results or output.task_id → success
- Non-zero exit, HTTP error, empty response, or error JSON → fail
- Async: submission must return output.task_id; poll must reach INLINECODE76
- Post-execution check: Verify the output file exists and has non-zero size (ls -la <output_path>)
- MANDATORY stderr signal check: After confirming the result, scan the command's stderr output for [ACTION_REQUIRED] or [UPDATE_AVAILABLE]. If either signal is present, you MUST follow the instructions in Update Check below before responding to the user.
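The success and failure criteria above can be sketched in Python as follows. The sketch assumes the script's stdout is a single JSON document; `verify_result` and `output_file_ok` are hypothetical helper names, not functions shipped with the skill.

```python
import json
import os


def verify_result(exit_code: int, stdout: str) -> str:
    """Return 'success' or 'fail' from the exit code and the JSON on stdout."""
    if exit_code != 0 or not stdout.strip():
        return "fail"
    try:
        payload = json.loads(stdout)
    except json.JSONDecodeError:
        return "fail"
    output = payload.get("output") if isinstance(payload, dict) else None
    # Success needs output.results (sync) or output.task_id (async submission).
    if isinstance(output, dict) and (output.get("results") or output.get("task_id")):
        return "success"
    return "fail"


def output_file_ok(path: str) -> bool:
    """Post-execution check: the file exists and is not empty."""
    return os.path.isfile(path) and os.path.getsize(path) > 0
```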

On Failure

If the script fails, match the error output against the diagnostic table below to determine the resolution. If no match, read execution-guide.md for alternative paths: curl commands (Path 2 — sync and async), code generation (Path 3), and autonomous resolution (Path 5).

If Python is not available at all → skip directly to Path 2 (curl) in execution-guide.md.

Error Pattern	Diagnosis	Resolution
INLINECODE80	Python not on PATH	Try `python` or `py -3`; install Python 3.9+ if missing
INLINECODE83

Quick Reference

Request Fields (Common)

Field	Type	Description
INLINECODE99	string	Text description of the image to generate (required)
INLINECODE100

Request Fields (wan2.6-image — Image Editing)

Field	Type	Description
INLINECODE110	string[]	1–4 image URLs or local paths for editing mode; 0–1 for interleave mode
INLINECODE111

Other Models (wan2.5-i2i, qwen-image-edit, qwen-image-plus/max)

These models have specific parameter requirements:

Model	Key Differences
wan2.5-i2i-preview	async-only, 1–3 images, `prompt+images[]` format (not messages)
INLINECODE121	1–3 images, n=1–6 (except qwen-image-edit: n=1 only), no interleave
qwen-image-plus/max	async-only, n fixed at 1, 5 fixed resolutions only

Full parameter tables: See api-guide.md for detailed parameters.

Size Reference (wan2.6-image)

- Editing mode: 1K (default, ~1280×1280) or 2K (~2048×2048)
- Interleave mode: pixel dimensions with total pixels in [768×768, 1280×1280]

Common aspect ratios: 1280*1280 (1:1), 960*1280 (3:4), 1280*960 (4:3), 720*1280 (9:16), 1280*720 (16:9)

Response Fields

Field	Description
image_url	URL of generated image (24h validity). Use this when chaining to another skill.
INLINECODE132	Array of all image URLs (multi-image output, wan2.6-image, qwen-image-edit)
image_count	Number of generated images
local_path	Local file path of the downloaded image. Use this for user preview or non-API operations.
local_paths	Array of local file paths (multi-image output)
interleaved_content	Array of {type, text/image} objects (interleave mode)
width / height	Image dimensions
seed	Seed used

API Details

- Sync endpoint (wan2.6-t2i, wan2.6-image editing, qwen-image-edit series): INLINECODE141
- Async endpoint (wan2.6 and older t2i): POST /api/v1/services/aigc/image-generation/generation with INLINECODE143
- Async endpoint (wan2.5-i2i-preview): POST /api/v1/services/aigc/image2image/image-synthesis with INLINECODE145
- Async endpoint (qwen-image-plus, qwen-image-max): POST /api/v1/services/aigc/text2image/image-synthesis with INLINECODE147
- wan2.6-t2i resolution: Total pixels in [1280x1280, 1440x1440], aspect ratio [1:4, 4:1]
- wan2.6-image resolution: Editing mode [768x768, 2048x2048]; Interleave mode [768x768, 1280x1280]; aspect ratio [1:4, 4:1]
- Input images (wan2.6-image): JPEG/JPG/PNG/BMP/WEBP, 240–8000px per dimension, ≤10MB
- Local files: Script auto-uploads to DashScope temp storage (oss:// URL, 48h TTL). Pass local paths directly; no manual upload step is needed.
- Production: Default temp storage has a 48h TTL and a 100 QPS upload limit, so it is not suitable for production, high-concurrency, or load-testing use. To use your own OSS bucket, set QWEN_TMP_OSS_BUCKET and QWEN_TMP_OSS_REGION in .env, run pip install alibabacloud-oss-v2, and provide credentials via QWEN_TMP_OSS_AK_ID / QWEN_TMP_OSS_AK_SECRET or the standard OSS_ACCESS_KEY_ID / OSS_ACCESS_KEY_SECRET. Use a RAM user with least privilege (oss:PutObject + oss:GetObject on the target bucket only). If qwencloud-ops-auth is installed, see its references/custom-oss.md for the full setup guide.
- Interleaved sync: Requires streaming (X-DashScope-Sse: enable + stream: true); use async mode via this script instead
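The resolution limits above lend themselves to a local pre-check. The following is a rough sketch that treats every bracketed range as a bound on total pixels (the text states this explicitly for wan2.6-t2i; applying it to wan2.6-image editing mode is an assumption). The mode labels and the name `size_ok` are hypothetical.

```python
# (min_side, max_side): total pixels must lie in [min_side**2, max_side**2].
PIXEL_LIMITS = {
    "wan2.6-t2i": (1280, 1440),
    "wan2.6-image/editing": (768, 2048),
    "wan2.6-image/interleave": (768, 1280),
}


def size_ok(mode: str, width: int, height: int) -> bool:
    """Check total pixels and the [1:4, 4:1] aspect-ratio bound for a mode."""
    min_side, max_side = PIXEL_LIMITS[mode]
    pixels = width * height
    # Integer comparison avoids floating-point error at the 1:4 and 4:1 edges.
    ratio_ok = width <= 4 * height and height <= 4 * width
    return min_side**2 <= pixels <= max_side**2 and ratio_ok
```

Under these assumptions, 1280 by 720 passes for interleave mode but is too small for wan2.6-t2i, whose lower bound is 1280 by 1280 total pixels.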

Cross-Skill Chaining

When using generated images as input for another skill (e.g., video-gen i2v, vision analyze):

- Pass image_url directly — do NOT download and re-pass as local path
- All downstream scripts detect URL prefixes (https://, oss://) and pass them through without re-upload
- Use local_path only for user preview or non-API operations (e.g., opening in editor)

Scenario	Use
Feed to another skill (video-gen, vision, image-edit)	image_url (URL)
Show to user / open in editor	local_path (local file)
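The chaining rule can be illustrated with a short sketch. It assumes the result is available as a dict carrying the image_url and local_path fields named in this section; `is_remote` and `input_for_next_skill` are hypothetical helpers, not the downstream scripts' real code.

```python
REMOTE_PREFIXES = ("https://", "oss://")


def is_remote(ref: str) -> bool:
    """True when ref is already a URL that can be passed through without re-upload."""
    return ref.startswith(REMOTE_PREFIXES)


def input_for_next_skill(result: dict) -> str:
    """Pick the value to hand to a downstream skill: the URL, never the local copy."""
    url = result.get("image_url", "")
    if not is_remote(url):
        raise ValueError("result has no usable image_url")
    return url
```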

Error Handling

HTTP	Meaning	Action
401	Invalid or missing API key	Run qwencloud-ops-auth if available; else prompt user to set key (non-plaintext check only)
400

Output Location

Prefer the current working directory. Default subdirectory: ./output/qwencloud-image-generation/.

Write prohibition: NEVER write output files into this skill's installation directory or any skills/ hierarchy. All generated content must go to output/ under the current working directory or a user-specified path.
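A sketch of how the default location and the write prohibition might be enforced together. Everything here is illustrative: `resolve_output_dir` is a hypothetical helper, and treating any path component named "skills" as forbidden is a rough heuristic for "any skills/ hierarchy", not a documented rule.

```python
from pathlib import Path
from typing import Optional


def resolve_output_dir(cwd: str, skill_dir: str, user_dir: Optional[str] = None) -> Path:
    """Return the output directory, refusing locations inside the skill tree."""
    if user_dir:
        target = Path(user_dir)
    else:
        target = Path(cwd) / "output" / "qwencloud-image-generation"
    target = target.resolve()
    skill_root = Path(skill_dir).resolve()
    # Reject the skill's own installation directory and anything beneath it.
    if target == skill_root or skill_root in target.parents:
        raise ValueError("refusing to write inside the skill installation directory")
    # Heuristic (assumption): any 'skills' path component marks a skills hierarchy.
    if "skills" in target.parts:
        raise ValueError("refusing to write inside a skills/ hierarchy")
    return target
```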

Update Check (MANDATORY Post-Execution)

After script execution, check stderr for action signals. The script automatically detects update status and emits structured signals. You MUST act on them. This section also applies when the user explicitly asks to check for updates.

Signal: `[ACTION_REQUIRED] qwencloud-update-check skill is not installed`

When stderr contains this signal, you MUST present the following options to the user and wait for their response before proceeding. Do NOT silently skip this.

The update-check skill is not installed. It keeps you informed about new qwencloud/qwencloud-ai features and bug fixes.
1) Install now — I'll run the install command for you
2) Skip for now — do nothing this time
3) Never remind me — permanently suppress this prompt

After the user responds:

- User chooses 1 → run: npx skills add QwenCloud/qwencloud-ai --skill qwencloud-update-check -y, then run: INLINECODE173
- User chooses 2 → run: python3 <this-skill-scripts-dir>/gossamer.py --dismiss, then continue normally
- User chooses 3 → run: python3 <this-skill-scripts-dir>/gossamer.py --never-install, then continue normally

Signal: `[UPDATE_AVAILABLE]`

When stderr contains this signal, you MUST append a brief update notice to your response, including the version info and the update command shown in the stderr output.

No signal in stderr

If stderr contains neither [ACTION_REQUIRED] nor [UPDATE_AVAILABLE], no action is needed — the skill is installed and up to date (or cached within 24h).
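Scanning stderr for the two signals can be sketched as below. The signal strings come from this section; the function name `find_signals` is hypothetical.

```python
SIGNALS = ("[ACTION_REQUIRED]", "[UPDATE_AVAILABLE]")


def find_signals(stderr: str) -> list[str]:
    """Return the update-check signals present in stderr, in a fixed order."""
    return [signal for signal in SIGNALS if signal in stderr]
```

An empty list corresponds to the "no signal" case, where no further action is needed.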

Explicit user request

When the user explicitly asks to check for updates (e.g. "check for updates", "check version"):

1. Look for qwencloud-update-check/SKILL.md in sibling skill directories.
2. If found, run: python3 <qwencloud-update-check-dir>/scripts/check_update.py --print-response and report the result.
3. If not found, present the install options above.

References

- execution-guide.md: Fallback paths (curl sync/async, code generation, autonomous)
- api-guide.md: API supplementary guide
- sources.md: Official documentation URLs

