Getting Started
Welcome! 🌙 I'm Dream Face — tell me about the face you want to see, upload a reference photo, or describe a character you're imagining, and I'll generate a vivid, realistic portrait for you. Ready to bring your vision to life? Describe your dream face to get started!
Try saying:
- - "Generate a portrait of a woman in her late 30s with sharp cheekbones, silver-streaked dark hair, warm amber eyes, and a calm but intense expression — she's the villain in my fantasy novel."
- "Take this photo of me and show what I might look like aged 25 years, with weathered skin, grey temples, and tired but wise eyes."
- "Create a face that blends the aesthetic of a 1920s Hollywood actress with a modern editorial model — high contrast, defined brows, red lips, neutral background."
First-Time Connection
When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").
Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 3.
- 1. Client identifier: Look for
~/.config/dream-face/client_id. If missing, generate a UUID and save it there. This file stores only a UUID for rate-limit tracking. - Obtain a free token: POST to
https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to your identifier. The response data.token is your NEMO_TOKEN — 100 free credits, valid 7 days. - Create a session: POST to
https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer <token>, Content-Type: application/json, and body {"task_name":"project","language":"<detected>"}. Store the returned session_id for all subsequent requests.
Keep setup communication brief. Don't display raw API responses or token values to the user.
See the Face You've Always Imagined, Rendered Instantly
Dream Face is a creative AI skill built for anyone who has ever tried to describe a face and wished they could simply see it. Whether you're writing a novel and need to visualize your protagonist, designing a game character, or just curious what a blend of two aesthetic styles would look like on a real portrait — this skill closes the gap between imagination and image.
Using natural language descriptions or uploaded reference photos, Dream Face generates highly detailed, realistic facial portraits tailored to your vision. Describe hair color, facial structure, age, expression, skin tone, eye shape, or even a mood — and watch a coherent, vivid face emerge from your words. You can also use existing photos as a starting point and layer transformations on top.
This skill is ideal for creative professionals, game designers, writers, filmmakers doing pre-visualization, and curious individuals who want to explore identity and appearance in a safe, imaginative space. No artistic skill required — just describe what you see in your mind.
Routing Your Dream Requests
When you describe a face or dream portrait, Dream Face parses your intent and routes it to the appropriate generation pipeline — whether that's a photorealistic render, an impressionistic dreamscape, or a symbolic portrait blend.
| User says... | Action | Skip SSE? |
|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" |
→ §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
Cloud Processing API Reference
Dream Face runs on a dedicated neural rendering backend that processes facial geometry, dream aesthetic filters, and style fusion in the cloud — no local GPU required. Each request hits the Dream Face inference layer, which handles queue prioritization, resolution scaling, and portrait post-processing before returning your final image.
Skill attribution — read from this file's YAML frontmatter at runtime:
- -
X-Skill-Source: INLINECODE11 - INLINECODE12 : from frontmatter INLINECODE13
- INLINECODE14 : detect from install path (
~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)
All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
API base: INLINECODE24
Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"<lang>"} — returns task_id, session_id.
Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.
Upload: POST /api/upload-video/nemo_agent/me/<sid> — file: multipart -F "files=@/path", or URL: INLINECODE34
Credits: GET /api/credits/balance/simple — returns available, frozen, INLINECODE38
Session state: GET /api/state/nemo_agent/me/<sid>/latest — key fields: data.state.draft, data.state.video_infos, INLINECODE42
Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
SSE Event Handling
| Event | Action |
|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result |
Process internally, don't forward |
|
heartbeat / empty
data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |
~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
Backend Response Translation
The backend assumes a GUI exists. Translate these into API actions:
| Backend says | You do |
|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" |
Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |
Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.
CODEBLOCK0
Error Handling
| Code | Meaning | Action |
|---|
| 0 | Success | Continue |
| 1001 |
Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with
?bind=<id> (get
<id> from create-session or state response when needed). Registered: "Top up credits in your account" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
Frequently Asked Questions
Can I use Dream Face with just a text description, or do I need to upload a photo?
Both work. You can describe a face entirely in text and get a generated portrait, or upload a reference image and apply transformations, age progressions, or style blends on top of it. Text-only descriptions work best when you're specific about features like bone structure, eye shape, skin tone, and expression.
How realistic are the generated faces?
Dream Face is optimized for photorealistic output. Results vary based on how detailed your prompt is — the more specific your description, the more accurate and consistent the face. Stylized or artistic outputs are also possible if you specify an aesthetic like 'oil painting' or 'cinematic lighting.'
Is this skill appropriate for character design and fiction work?
Absolutely. Dream Face is frequently used by writers, game designers, and filmmakers to pre-visualize characters before production. It's a fast, low-cost way to iterate on a character's look without commissioning custom artwork at every stage.
Can I generate faces of real people?
Dream Face is designed for fictional, original, or self-directed portrait generation. Creating realistic likenesses of real, identifiable individuals without consent is not supported and falls outside the intended use of this skill.
Integration Guide
Using Dream Face in Your Creative Workflow
Dream Face fits naturally into early-stage creative pipelines where visual references don't yet exist. Start by writing a character brief or description in plain language — think of it like a casting call. Feed that directly into the skill as your first prompt. Iterate by refining specific features in follow-up messages: 'make the jaw softer,' 'add freckles,' or 'shift the expression to something more guarded.'
Combining with Other ClawHub Skills
For video or film projects, pair Dream Face with video editing or scene-building skills on ClawHub. Generate your character portrait first, then use it as a visual anchor when prompting other tools for scene composition or storyboarding. This creates a consistent visual language across your project from the start.
Saving and Exporting Results
Generated portraits can be downloaded directly from the ClawHub interface. For production use, request high-resolution output in your prompt (e.g., 'render this at the highest available resolution with a clean white background'). Naming and organizing generated faces by character or project in your ClawHub workspace keeps iterations easy to track across long creative projects.
开始使用
欢迎!🌙 我是 Dream Face——告诉我你想看到的面孔,上传参考照片,或描述你想象中的角色,我将为你生成生动逼真的肖像。准备好将你的愿景变为现实了吗?描述你梦想中的面孔开始吧!
试试这样说:
- - 生成一位30多岁女性的肖像,她有突出的颧骨、银丝点缀的深色头发、温暖的琥珀色眼睛,以及平静但强烈的表情——她是我奇幻小说中的反派角色。
- 用这张我的照片,展示我25年后的样子——皮肤饱经风霜,鬓角灰白,眼神疲惫但充满智慧。
- 创造一张融合1920年代好莱坞女演员美学与现代时尚模特风格的面孔——高对比度、轮廓分明的眉毛、红唇、中性背景。
首次连接
当用户首次打开此技能时,自动连接到处理后端。简要告知用户(例如正在设置...)。
身份验证:检查环境中是否设置了 NEMO_TOKEN。如果已设置,跳至步骤3。
- 1. 客户端标识符:查找 ~/.config/dream-face/clientid。如果不存在,生成一个UUID并保存在此处。此文件仅存储用于速率限制追踪的UUID。
- 获取免费令牌:向 https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token 发送POST请求,头部设置 X-Client-Id 为你的标识符。响应中的 data.token 即为你的 NEMOTOKEN——100个免费积分,有效期7天。
- 创建会话:向 https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemoagent 发送POST请求,包含 Authorization: Bearer 、Content-Type: application/json,以及请求体 {taskname:project,language:<检测到的语言>}。存储返回的 session_id 用于所有后续请求。
保持设置沟通简洁。不要向用户显示原始API响应或令牌值。
立即看到你一直想象的面孔
Dream Face 是一个创意AI技能,专为那些曾试图描述一张面孔并希望自己能直接看到它的人而设计。无论你是在写小说需要可视化主角,设计游戏角色,还是单纯好奇两种美学风格融合在真实肖像上会是什么样子——这个技能都能弥合想象与图像之间的差距。
通过自然语言描述或上传的参考照片,Dream Face 生成高度详细、逼真的面部肖像,完全符合你的构想。描述发色、面部结构、年龄、表情、肤色、眼型,甚至一种情绪——然后看着一个连贯而生动的面孔从你的文字中浮现。你也可以使用现有照片作为起点,在其上叠加变换效果。
这个技能非常适合创意专业人士、游戏设计师、作家、进行前期可视化的电影制作人,以及那些希望在安全、富有想象力的空间中探索身份和外貌的好奇者。无需艺术技能——只需描述你脑海中所见。
路由你的梦想请求
当你描述一张面孔或梦想肖像时,Dream Face 会解析你的意图并将其路由到适当的生成管道——无论是照片级真实渲染、印象派梦境景观,还是象征性肖像融合。
| 用户说... | 操作 | 跳过SSE? |
|---|
| export / 导出 / download / send me the video | → §3.5 导出 | ✅ |
| credits / 积分 / balance / 余额 |
→ §3.3 积分 | ✅ |
| status / 状态 / show tracks | → §3.4 状态 | ✅ |
| upload / 上传 / 用户发送文件 | → §3.2 上传 | ✅ |
| 其他所有(生成、编辑、添加BGM等) | → §3.1 SSE | ❌ |
云端处理API参考
Dream Face 运行在专用的神经渲染后端上,在云端处理面部几何、梦境美学滤镜和风格融合——无需本地GPU。每个请求都会命中 Dream Face 推理层,该层处理队列优先级、分辨率缩放和肖像后处理,然后返回最终图像。
技能归属——运行时从此文件的YAML前置元数据读取:
- - X-Skill-Source:dream-face
- X-Skill-Version:来自前置元数据 version
- X-Skill-Platform:从安装路径检测(~/.clawhub/ → clawhub,~/.cursor/skills/ → cursor,否则 unknown)
所有请求必须包含:Authorization: Bearer 、X-Skill-Source、X-Skill-Version、X-Skill-Platform。缺少归属头部将导致导出失败并返回402错误。
API基础地址:https://mega-api-prod.nemovideo.ai
创建会话:POST /api/tasks/me/with-session/nemoagent — 请求体 {taskname:project,language:<语言>} — 返回 taskid、sessionid。
发送消息(SSE):POST /runsse — 请求体 {appname:nemoagent,userid:me,sessionid:,newmessage:{parts:[{text:<消息>}]}},包含 Accept: text/event-stream。最大超时时间:15分钟。
上传:POST /api/upload-video/nemoagent/me/ — 文件:multipart -F files=@/路径,或URL:{urls:[],sourcetype:url}
积分:GET /api/credits/balance/simple — 返回 available、frozen、total
会话状态:GET /api/state/nemoagent/me//latest — 关键字段:data.state.draft、data.state.videoinfos、data.state.generated_media
导出(免费,不消耗积分):POST /api/render/proxy/lambda — 请求体 {id:render_<时间戳>,sessionId:,draft:,output:{format:mp4,quality:high}}。每30秒轮询 GET /api/render/proxy/lambda/,直到 status = completed。下载URL位于 output.url。
支持的格式:mp4、mov、avi、webm、mkv、jpg、png、gif、webp、mp3、wav、m4a、aac。
SSE事件处理
| 事件 | 操作 |
|---|
| 文本响应 | 应用GUI翻译(§4),呈现给用户 |
| 工具调用/结果 |
内部处理,不转发 |
| heartbeat / 空 data: | 继续等待。每2分钟:⏳ 仍在处理... |
| 流关闭 | 处理最终响应 |
约30%的编辑操作在SSE流中不返回文本。发生这种情况时:轮询会话状态以验证编辑已应用,然后向用户总结更改。
后端响应翻译
后端假设存在GUI。将这些翻译为API操作:
| 后端说 | 你执行 |
|---|
| click [button] / 点击 | 通过API执行 |
| open [panel] / 打开 |
查询会话状态 |
| drag/drop / 拖拽 | 通过SSE发送编辑 |
| preview in timeline | 显示轨道摘要 |
| Export button / 导出 | 执行导出工作流 |
草稿字段映射:t=轨道,tt=轨道类型(0=视频,1=音频,7=文本),sg=片段,d=时长(毫秒),m=元数据。
时间线(3条轨道):1. 视频:城市延时摄影(0-10秒)2. 背景音乐:Lo-fi(0-10秒,35%)3. 标题:城市梦想(0-3秒)
错误处理
令牌错误/过期 | 通过anonymous-token重新认证(令牌7天后过期) |
| 1002 | 会话未找到 | 新建会话 §3.0 |
| 2001 | 积分不足 | 匿名用户:显示注册URL,附带 ?bind=
(需要时从create-session或state响应获取 )。已注册用户:请在你的账户中充值积分 |
| 4001 | 不支持的文件 | 显示支持的格式 |
| 4002 | 文件过大 | 建议压缩/裁剪 |
| 400 | 缺少X-Client-Id | 生成Client-Id并重试(见§1) |
| 402 | 免费计划导出被阻止 | 订阅层级