YouTube Thumbnail Design
Create high-CTR YouTube thumbnails with AI image generation via inference.sh CLI.
Quick Start
CODEBLOCK0
Install note: The install script only detects your OS/architecture, downloads the matching binary from dist.inference.sh, and verifies its SHA-256 checksum. No elevated permissions or background processes. Manual install & verification available.
Specifications
| Spec | Value |
|---|
| Dimensions | 1280 x 720 px (minimum) |
| Recommended |
1920 x 1080 px |
| Aspect ratio | 16:9 |
| Max file size | 2 MB |
| Formats | JPG, GIF, PNG |
The 120px Test
Your thumbnail appears at roughly 120px wide on mobile — that's how most viewers first see it.
At 120px, viewers must be able to identify:
- 1. The mood/emotion (from colors and expression)
- The general subject (from composition)
- The text (if any — only if large enough)
Test: view your thumbnail at 120px width. If it's a muddy blur, redesign.
Safe Zones
CODEBLOCK1
Avoid placing critical elements in:
- - Bottom-right corner (video duration timestamp)
- Bottom-left corner (chapter markers, progress bar)
- Extreme edges (cropping varies by device)
Color Strategy
High-Contrast Pairs That Work
| Combination | Mood | Best For |
|---|
| Yellow + Black | Urgency, attention | Tech, business, lists |
| Red + White |
Energy, excitement | Entertainment, reactions |
| Blue + Orange | Professional contrast | Education, tutorials |
| Green + White | Growth, money | Finance, success stories |
| Purple + Yellow | Premium, creative | Design, art, creativity |
| White + Dark | Clean, minimal | Luxury, minimalist channels |
Color Rules
- - Background and text/subject should be complementary or high-contrast
- Avoid same-temperature colors touching (red on orange = mud)
- Use 3 colors maximum per thumbnail
- Saturate more than real life — thumbnails compete with bright UI
Text on Thumbnails
When to Use Text
- - Lists/numbers: "7 Tips", "Top 10"
- Strong opinions: "STOP Doing This"
- Results: "$10K in 30 Days"
- Comparisons: "vs" between two things
When NOT to Use Text
- - The video title already says it (redundant)
- The emotion/visual tells the story
- You can't make it large enough to read at 120px
Text Rules
| Rule | Reason |
|---|
| Max 6 words | Readability at thumbnail size |
| Min 60pt equivalent |
Must be legible at 120px width |
| Bold sans-serif font | Thin fonts disappear at small sizes |
| Contrast stroke/shadow | Ensures readability on any background |
| No small text | If it's not readable small, cut it |
Face Expression Psychology
Thumbnails with faces get higher CTR than faceless thumbnails. Expression matters:
| Expression | CTR Impact | Best For |
|---|
| Surprise/shock | Highest | Reaction, reveal, discovery content |
| Curiosity |
High | Tutorial, how-to, tips |
|
Excitement | High | Unboxing, reviews, announcements |
|
Concern/worry | Medium-high | Warning, mistake, problem content |
|
Confidence | Medium | Expert advice, authority content |
|
Neutral | Lowest | Avoid unless your brand is minimalist |
Face Composition Rules
- - Face should fill 30-50% of the thumbnail
- Eyes looking toward the text or subject (directs viewer attention)
- Eyes looking at camera = connection. Eyes looking at object = curiosity.
- Place face on one side (usually left), text or subject on the other
CODEBLOCK2
Thumbnail Patterns by Content Type
Tutorial / How-To
CODEBLOCK3
Before/After
CODEBLOCK4
Product Review / Comparison
CODEBLOCK5
Listicle / Number
CODEBLOCK6
A/B Testing
Test one variable at a time:
| Variable | Test A vs B |
|---|
| Face vs No face | Same composition, with/without person |
| Expression |
Surprise vs curiosity |
| Color scheme | Warm vs cool palette |
| Text vs No text | With/without text overlay |
| Background | Bright vs dark |
| Composition | Left-facing vs right-facing subject |
CODEBLOCK7
Thumbnail Checklist
- - [ ] 1280x720 minimum (1920x1080 preferred)
- [ ] Under 2MB file size
- [ ] Passes the 120px squint test
- [ ] No critical elements in bottom-right (timestamp) or bottom-left (chapter)
- [ ] Max 3 colors, high contrast
- [ ] Text (if any) is max 6 words, bold, with contrast stroke
- [ ] Face expression matches content energy (if applicable)
- [ ] Doesn't duplicate the video title
- [ ] Stands out from surrounding thumbnails (check your niche)
- [ ] Works on both light and dark YouTube backgrounds
Common Mistakes
| Mistake | Problem | Fix |
|---|
| Too much text | Unreadable at thumbnail size | Max 6 words or no text |
| Low contrast |
Disappears in the feed | Use complementary colors |
| Cluttered composition | Eye doesn't know where to look | One focal point |
| Generic stock photo feel | No personality, gets skipped | Authentic expressions, unique angles |
| Tiny details | Lost at 120px | Bold, simple shapes |
| Same style every video | Viewer fatigue | Vary within brand guidelines |
| Misleading thumbnail | Kills trust, hurts retention | Match the actual content |
Related Skills
CODEBLOCK8
Browse all apps: INLINECODE1
YouTube 缩略图设计
通过 inference.sh CLI 的 AI 图像生成功能,创建高点击率的 YouTube 缩略图。
快速开始
bash
curl -fsSL https://cli.inference.sh | sh && infsh login
生成缩略图
infsh app run falai/flux-dev-lora --input {
prompt: YouTube缩略图风格,人物特写,面对发光的笔记本电脑屏幕露出惊讶兴奋的表情,鲜艳的蓝橙色配色方案,戏剧性影棚灯光,浅景深,高对比度,电影感,
width: 1280,
height: 720
}
安装说明: 安装脚本仅检测您的操作系统/架构,从 dist.inference.sh 下载匹配的二进制文件,并验证其 SHA-256 校验和。无需提升权限或后台进程。也可手动安装和验证。
规格参数
| 规格 | 数值 |
|---|
| 尺寸 | 1280 x 720 像素(最低) |
| 推荐尺寸 |
1920 x 1080 像素 |
| 宽高比 | 16:9 |
| 最大文件大小 | 2 MB |
| 格式 | JPG、GIF、PNG |
120像素测试
您的缩略图在移动设备上大约显示为 120像素宽——这是大多数观众第一次看到它的方式。
在120像素下,观众必须能够识别:
- 1. 情绪/情感(通过颜色和表情)
- 大致主题(通过构图)
- 文字(如果有——仅当足够大时)
测试: 以120像素宽度查看您的缩略图。如果模糊不清,请重新设计。
安全区域
┌─────────────────────────────────────────────┐
│ │
│ ✅ 文字和关键元素安全区域 │
│ │
│ │
│ │
│ │
│ ┌───┐ │
│ │ ⏱ │ │ ← 时间戳叠加
│ ┌────────┴───┘ │ (右下角)
│ ┌────┐ │ 时长 │
│ │ 章节│ 章节标记 └──────────────│
└───┴────┴────────────────────────────────────┘
↑ 左下角:章节/进度标记
避免将关键元素放置在:
- - 右下角(视频时长时间戳)
- 左下角(章节标记、进度条)
- 极端边缘(不同设备的裁剪方式不同)
色彩策略
有效的高对比度组合
| 组合 | 情绪 | 最适合 |
|---|
| 黄色 + 黑色 | 紧迫感、注意力 | 科技、商业、列表 |
| 红色 + 白色 |
能量、兴奋 | 娱乐、反应 |
| 蓝色 + 橙色 | 专业对比 | 教育、教程 |
| 绿色 + 白色 | 成长、金钱 | 金融、成功故事 |
| 紫色 + 黄色 | 高端、创意 | 设计、艺术、创意 |
| 白色 + 深色 | 干净、简约 | 奢侈品、极简频道 |
色彩规则
- - 背景和文字/主体应为互补色或高对比度
- 避免同色系颜色相邻(红色在橙色上会显得浑浊)
- 每个缩略图最多使用 3种颜色
- 饱和度高于现实生活——缩略图需要与明亮的UI竞争
缩略图上的文字
何时使用文字
- - 列表/数字:7个技巧、Top 10
- 强烈观点:别再这样做
- 结果:30天赚1万美元
- 对比:两个事物之间的vs
何时不使用文字
- - 视频标题已经说明了一切(冗余)
- 情感/视觉已经讲述了故事
- 无法使其大到在120像素下可读
文字规则
| 规则 | 原因 |
|---|
| 最多6个词 | 缩略图尺寸下的可读性 |
| 最小相当于60pt |
在120像素宽度下必须清晰可辨 |
| 粗体无衬线字体 | 细字体在小尺寸下消失 |
| 对比描边/阴影 | 确保在任何背景上的可读性 |
| 无小文字 | 如果小到不可读,就删掉 |
面部表情心理学
带有人脸的缩略图比无人脸的缩略图获得更高的点击率。表情很重要:
| 表情 | 点击率影响 | 最适合 |
|---|
| 惊讶/震惊 | 最高 | 反应、揭秘、发现类内容 |
| 好奇 |
高 | 教程、操作指南、技巧 |
|
兴奋 | 高 | 开箱、评测、公告 |
|
担忧/担心 | 中高 | 警告、错误、问题类内容 |
|
自信 | 中等 | 专家建议、权威内容 |
|
中性 | 最低 | 除非您的品牌是极简风格,否则避免 |
面部构图规则
- - 面部应占据缩略图的 30-50%
- 眼睛看向文字或主体(引导观众注意力)
- 眼睛看向镜头 = 连接感。眼睛看向物体 = 好奇心。
- 将面部放在一侧(通常是左侧),文字或主体放在另一侧
bash
生成面向镜头的缩略图
infsh app run falai/flux-dev-lora --input {
prompt: 男性特写肖像,表情真诚惊讶,嘴巴微张,眉毛上扬,看向镜头,画面左侧,鲜艳的青色背景,戏剧性边缘光,YouTube缩略图风格,高对比度,电影感,
width: 1280,
height: 720
}
生成面部看向主体的缩略图
infsh app run bytedance/seedream-4-5 --input {
prompt: 人物惊讶地看着发光的全息图表显示上升增长趋势,戏剧性的蓝绿色灯光,右侧面轮廓视图,深色背景,科技美学,高能量,
size: 2K
}
按内容类型的缩略图模式
教程 / 操作指南
bash
infsh app run falai/flux-dev-lora --input {
prompt: 俯视平铺布局,有序的工作空间,笔记本电脑显示代码编辑器,彩色便利贴,咖啡杯,干净明亮的背景,专业设置,教程风格构图,温暖灯光,
width: 1280,
height: 720
}
前后对比
bash
infsh app run falai/flux-dev-lora --input {
prompt: 分割构图,左侧黑暗杂乱的办公桌,右侧明亮干净整洁的极简工作空间,混乱与秩序之间的戏剧性对比,中心清晰的分割线,高对比度,
width: 1280,
height: 720
}
产品评测 / 对比
bash
infsh app run falai/flux-dev-lora --input {
prompt: 两个产品面对面,戏剧性灯光和火花在它们之间,竞争对决概念,深色背景配彩色边缘光,对比风格,高能量,产品摄影,
width: 1280,
height: 720
}
列表 / 数字
bash
infsh app run falai/flux-dev-lora --input {
prompt: 7个不同彩色物体在空间中漂浮的动态排列,深色渐变背景,每个物品独特且清晰分离,充满活力的构图,鲜艳饱和的色彩,影棚灯光,
width: 1280,
height: 720
}
A/B 测试
一次测试一个变量:
| 变量 | 测试A vs B |
|---|
| 有脸 vs 无脸 | 相同构图,有/无人 |
| 表情 |
惊讶 vs 好奇 |
| 配色方案 | 暖色 vs 冷色调 |
| 有文字 vs 无文字 | 有/无文字叠加 |
| 背景 | 明亮 vs 深色 |
| 构图 | 左侧朝向 vs 右侧朝向的主体 |
bash
生成变体A
infsh app run falai/flux-dev-lora --input {
prompt: ..., 亮黄色背景, ...,
width: 1280, height: 720
} --no-wait
生成变体B(相同提示,不同背景)
infsh app run falai/flux-dev-lora --input {
prompt: ..., 深海军蓝背景, ...,
width: 1280, height: 720
} --no-wait
缩略图检查清单