Chanjing Avatar (Lip-Syncing)

功能说明

调用蝉镜 Avatar Open API：上传音视频素材、创建对口型任务、轮询与获取结果链接。脚本为 Python HTTP/上传客户端，不依赖 ffmpeg/ffprobe。

运行依赖

- python3 与同仓库 INLINECODE0
无 ffmpeg/ffprobe 门控

环境变量与机器可读声明

- 环境变量键名与说明：manifest.yaml（environment 段）及本文
变量、凭据、合规 permissions、clientPermissions、agentPolicy：manifest.yaml

使用命令

- ClawHub（slug 以注册表为准）：INLINECODE7
本仓库：python skills/chanjing-avatar/scripts/create_task.py …（流程见正文 How to Use）

登记与审稿（单一事实来源）

主凭据、上传/下载边界、浏览器引导等：以 manifest.yaml 为准。本篇 How to Use 起为 API 步骤说明。

When to Use This Skill

Use this skill when the user needs to create lip-syncing videos (digital avatar videos) with synchronized mouth movements.

Chanjing Avatar supports:

* Text-driven or audio-driven lip-syncing
Multiple system voices for TTS
Video resolution customization
Task status polling and callback

How to Use This Skill

前置条件：执行本 Skill 前，必须先通过 chanjing-credentials-guard 完成 AK/SK 与 Token 校验。凭据与审稿对表见 manifest.yaml。

Security & credentials（引用）

详见 manifest.yaml 中 credentials 与 clientPermissions（含本地上传、结果 URL、浏览器行为；合规见顶层 permissions）。

Multiple APIs need to be invoked. All share the domain: "https://open-api.chanjing.cc".
All requests communicate using json.
You should use utf-8 to encode and decode text throughout this task.

1. Obtain an access_token, which is required for all subsequent API calls
Upload your video/audio files using the File Management API to get INLINECODE16
Create a lip-syncing task with video and audio/text using these file_id values
Poll the Query Task Detail API or use Task List API to check status
Download the generated video using the url in response when status is completed

Obtain AccessToken

从 ~/.chanjing/credentials.json 读取 app_id 和 secret_key，若无有效 Token 则调用：

CODEBLOCK0

请求体（使用本地配置的 appid、secretkey）：

CODEBLOCK1

Response example:

CODEBLOCK2

Response field description:

First-level Field	Second-level Field	Description
code		Response status code
msg

Response Status Code Description

Code	Description
0	Success
400

Upload Media Files (File Management)

Before creating a lip-syncing task, you must upload your video (and optional audio) files using the File Management API to obtain file_id values.

The full documentation is here: [File Management](https://doc.chanjing.cc/api/file/file-management.html).

Step 1: Get upload URL

CODEBLOCK3

Query params:

Key	Example	Description
service	lipsyncvideo / lipsyncaudio	File usage purpose. Use `lip_sync_video` for driving video, `lip_sync_audio` for audio (if audio-driven).
name

1.mp4 | Original file name including extension |

You will get a response containing sign_url, mime_type, and file_id. Use the sign_url with HTTP PUT to upload the file, setting Content-Type to the returned mime_type. After the PUT completes, poll the file detail API until the file is ready (do not assume a fixed wait). Keep the returned file_id for video_file_id / audio_file_id below.

Polling: Call GET /open/v1/common/file_detail?id={{file_id}} with access_token until the response data.status indicates success (e.g. status === 2). Only then use the file_id for the create task API.

Create Lip-Syncing Task

Submit a lip-syncing video creation task, which returns a video ID for polling later.

CODEBLOCK4

Request body example (TTS-driven):

CODEBLOCK5

Request body example (Audio-driven):

CODEBLOCK6

Request field description:

Parameter Name	Type	Required	Description
videofileid	string	Yes	Video file ID from File Management (`data.file_id`). Supports mp4, mov, webm
screen_width

Response example:

CODEBLOCK7

Response field description:

Field	Description
code	Response status code
msg

Response message |
| data | Video ID for subsequent polling |

Query Task List

Get a list of lip-syncing tasks.

CODEBLOCK8

Request body:

CODEBLOCK9

Request field description:

Parameter	Type	Required	Description
page	int	No	Page number, default 1
page_size

int | No | Page size, default 10 |

Response example:

CODEBLOCK10

Response field description:

First-level Field	Second-level Field	Description
code		Response status code
msg

Query Task Detail

Poll the following API to check task status until completed.

CODEBLOCK11

Query params:

Parameter	Description
id	Video ID

Example: INLINECODE43

Response example:

CODEBLOCK12

Response field description:

First-level Field	Second-level Field	Description
code		Response status code
msg

Callback Notification

When a callback URL is provided, the system will send a POST request when the task completes:

CODEBLOCK13

Scripts

本 Skill 提供脚本（skills/chanjing-avatar/scripts/），与 chanjing-credentials-guard 使用同一配置文件（~/.chanjing/credentials.json）获取 Token。

脚本	说明
INLINECODE46	获取上传链接，输出 `sign_url`、`mime_type`、INLINECODE49
INLINECODE50

示例（在项目根或 skill 目录下执行）：

CODEBLOCK14

音频驱动时：先上传音频得到 audio_file_id，再 create_task.py --video-file-id <id> --audio-file-id <audio_file_id>。

Response Status Code Description

Code	Description
0	Response successful
10400

AccessToken verification failed | | 40000 | Parameter error | | 40001 | Exceeds RPM/QPS limit | | 50000 | System internal error |

Chanjing Avatar（对口型）

功能说明

调用蝉镜 Avatar Open API：上传音视频素材、创建对口型任务、轮询与获取结果链接。脚本为 Python HTTP/上传客户端，不依赖 ffmpeg/ffprobe。

运行依赖

- python3 与同仓库 scripts/*.py
无 ffmpeg/ffprobe 门控

环境变量与机器可读声明

- 环境变量键名与说明：manifest.yaml（environment 段）及本文
变量、凭据、合规 permissions、clientPermissions、agentPolicy：manifest.yaml

使用命令

- ClawHub（slug 以注册表为准）：clawhub run chanjing-avatar
本仓库：python skills/chanjing-avatar/scripts/create_task.py …（流程见正文 如何使用）

登记与审稿（单一事实来源）

主凭据、上传/下载边界、浏览器引导等：以 manifest.yaml 为准。本篇 如何使用 起为 API 步骤说明。

何时使用此技能

当用户需要创建口型同步视频（数字人视频）时使用此技能。

蝉镜 Avatar 支持：

* 文本驱动或音频驱动的口型同步
多种系统语音用于 TTS
视频分辨率自定义
任务状态轮询和回调

如何使用此技能

前置条件：执行本技能前，必须先通过 chanjing-credentials-guard 完成 AK/SK 与 Token 校验。凭据与审稿对表见 manifest.yaml。

安全与凭据（引用）

详见 manifest.yaml 中 credentials 与 clientPermissions（含本地上传、结果 URL、浏览器行为；合规见顶层 permissions）。

需要调用多个 API。所有 API 共享域名：https://open-api.chanjing.cc。
所有请求使用 json 通信。
在整个任务中应使用 utf-8 编码和解码文本。

1. 获取 accesstoken，后续所有 API 调用都需要
使用文件管理 API 上传视频/音频文件，获取 fileid
使用这些 file_id 值创建对口型任务（视频+音频/文本）
轮询查询任务详情 API 或使用任务列表 API 检查状态
状态完成后，使用响应中的 url 下载生成的视频

获取 AccessToken

从 ~/.chanjing/credentials.json 读取 appid 和 secretkey，若无有效 Token 则调用：

http
POST /open/v1/access_token
Content-Type: application/json

请求体（使用本地配置的 appid、secretkey）：

json
{
app_id: <从 credentials.json 读取>,
secret_key: <从 credentials.json 读取>
}

响应示例：

json
{
trace_id: 8ff3fcd57b33566048ef28568c6cee96,
code: 0,
msg: success,
data: {
access_token: 1208CuZcV1Vlzj8MxqbO0kd1Wcl4yxwoHl6pYIzvAGoP3DpwmCCa73zmgR5NCrNu,
expire_in: 1721289220
}
}

响应字段说明：

一级字段	二级字段	说明
code		响应状态码
msg

响应状态码说明

状态码	说明
0	成功
400

上传媒体文件（文件管理）

在创建对口型任务之前，必须使用文件管理 API 上传视频（和可选的音频）文件以获取 file_id 值。

完整文档在此：文件管理。

步骤 1：获取上传 URL

http
GET /open/v1/common/createuploadurl
accesstoken: {{accesstoken}}

查询参数：

键	示例	说明
service	lipsyncvideo / lipsyncaudio	文件用途。驱动视频使用 lipsyncvideo，音频（若音频驱动）使用 lipsyncaudio。
name

1.mp4 | 包含扩展名的原始文件名 |

你将收到包含 signurl、mimetype 和 fileid 的响应。使用 signurl 通过 HTTP PUT 上传文件，设置 Content-Type 为返回的 mimetype。PUT 完成后，轮询文件详情 API 直到文件就绪（不要假设固定等待时间）。保留返回的 fileid 用于下面的 videofileid / audiofileid。

轮询： 使用 accesstoken 调用 GET /open/v1/common/filedetail?id={{fileid}}，直到响应 data.status 指示成功（例如 status === 2）。然后才能使用该 fileid 创建任务 API。

创建对口型任务

提交对口型视频创建任务，返回用于后续轮询的视频 ID。

http
POST /open/v1/videolipsync/create
accesstoken: {{accesstoken}}
Content-Type: application/json

请求体示例（TTS 驱动）：

json
{
videofileid: e284db4d95de4220afe78132158156b5,
screen_width: 1080,
screen_height: 1920,
callback: https://example.com/openapi/callback,
model: 0,
audio_type: tts,
tts_config: {
text: 君不见黄河之水天上来，奔流到海不复回。,
audiomanid: C-f2429d07554749839849497589199916,
speed: 1,
pitch: 1
}
}

请求体示例（音频驱动）：

json
{
videofileid: e284db4d95de4220afe78132158156b5,
screen_width: 1080,
screen_height: 1920,
model: 0,
audio_type: audio,
audiofileid: audiofileidfromfile_management
}

请求字段说明：

参数名	类型	必填	说明
videofileid	string	是	文件管理返回的视频文件 ID（data.fileid）。支持 mp4、mov、webm
screenwidth

响应示例：

json
{
trace_id: 8d10659438827bd4d59eaa2696f9d391,
code: 0,
msg: success,
data: 9499ed79995

chanjing-avatar幻境化身

chanjing-avatar

Chanjing Avatar (Lip-Syncing)

功能说明

运行依赖

环境变量与机器可读声明

使用命令

登记与审稿（单一事实来源）

When to Use This Skill