Tinker SDK Skill

Tinker is an ML platform SDK by Thinking Machines AI for
managing training runs, model checkpoints, and fine-tuning
workflows. This skill covers the CLI, Python SDK, and the
tinker-cookbook training recipes.

Source repos:

- SDK: https://github.com/thinking-machines-lab/tinker
Cookbook: https://github.com/thinking-machines-lab/tinker-cookbook
This skill: https://github.com/zjrwtx/max_skills

IMPORTANT — Always use the latest version:
Before running any Tinker command or cookbook recipe,
ensure the latest version is installed:

uv pip install --upgrade tinker
# For cookbook, pull latest and reinstall:
cd <cookbook-dir> && git pull && uv pip install -e .

When you need more detailed information about API
internals, recipe implementations, or SDK source code,
always check the latest code from these repos — do NOT
rely on cached or outdated knowledge. Clone or browse
the repos directly to get up-to-date APIs and options.

Quick Start

Authentication

CODEBLOCK1

Verify Installation

CODEBLOCK2

Tinker Path Format

All checkpoint operations use tinker paths:

CODEBLOCK3

- TYPE: weights (training) or sampler_weights (sampler)
Example: INLINECODE3

CLI Commands

Global Options

- --format [table|json] or -f — output format

(default: table)

- -h / --help — help on any command

Run Commands

CODEBLOCK4

Checkpoint Commands

CODEBLOCK5

For full flag details and output format examples,
read references/cli-reference.md.

Common Workflows

1. Find and Download a Checkpoint

CODEBLOCK6

2. Push a Checkpoint to HuggingFace

CODEBLOCK7

3. Clean Up Old Checkpoints

CODEBLOCK8

4. Scripting with JSON Output

CODEBLOCK9

Cookbook Recipes

The tinker-cookbook provides ready-to-use training recipes.
Repo: https://github.com/thinking-machines-lab/tinker-cookbook

Recipe Architecture

Every recipe follows the same pattern:

CODEBLOCK10

Override any config field from the command line:
CODEBLOCK11

Running SFT (Supervised Fine-Tuning)

CODEBLOCK12

Running RL Training

CODEBLOCK13

Available Recipes

Recipe	Type	Use Case
INLINECODE9	SFT	Minimal SFT template
INLINECODE10

Key Utilities

CODEBLOCK14

Supported Models

Llama 3.x, Qwen 3/3.5, DeepSeek V3, Nemotron 3,
Kimi K2/K2.5, GPT-OSS, and 30+ more. Each model has
a recommended renderer in model_info.py.

For recipe deep-dives, renderer details, dataset
builder patterns, and RL environment setup, read
references/cookbook-recipes.md.

Quick Troubleshooting

Problem	Fix
Auth failure	Check `TINKER_API_KEY` or INLINECODE26
Checkpoint not found

For the full error catalog, read
references/troubleshooting.md.

Detailed References

When the SKILL.md cheat sheet is not enough:

- references/cli-reference.md — Every flag, output

format example (table + JSON), exit codes, date format rules, bulk delete filter logic

- references/cookbook-recipes.md — Per-recipe config

fields, renderer selection, dataset builder interface, RL environment pattern, hyperparameter guidance

- references/troubleshooting.md — Extended error

catalog with 15+ error-to-fix mappings, network/proxy issues, W&B integration, checkpoint corruption

技能名称: tinker
详细描述:

Tinker SDK 技能

Tinker 是 Thinking Machines AI 开发的一个机器学习平台 SDK，用于管理训练运行、模型检查点和微调工作流。本技能涵盖 CLI、Python SDK 以及 tinker-cookbook 训练配方。

源代码仓库:

- SDK: https://github.com/thinking-machines-lab/tinker
Cookbook: https://github.com/thinking-machines-lab/tinker-cookbook
本技能: https://github.com/zjrwtx/max_skills

重要 — 始终使用最新版本:
在运行任何 Tinker 命令或 cookbook 配方之前，请确保已安装最新版本:
bash
uv pip install --upgrade tinker

对于 cookbook，拉取最新版本并重新安装:

cd && git pull && uv pip install -e .

当您需要关于 API 内部实现、配方实现或 SDK 源代码的更详细信息时，请始终从这些仓库查看最新代码 — 不要依赖缓存或过时的知识。直接克隆或浏览这些仓库以获取最新的 API 和选项。

快速开始

身份验证

bash

选项 1: 环境变量（推荐）

export TINKERAPIKEY=your-api-key

选项 2: 配置文件 (~/.tinker/config.json)

mkdir -p ~/.tinker echo {api_key: your-api-key} > ~/.tinker/config.json

验证安装

bash
tinker version
tinker run list --limit 3

Tinker 路径格式

所有检查点操作都使用 tinker 路径:

tinker:////

- TYPE: weights（训练）或 sampler_weights（采样器）
示例: tinker://run-abc123/weights/00040

CLI 命令

全局选项

- --format [table|json] 或 -f — 输出格式（默认: table）
-h / --help — 任何命令的帮助信息

运行命令

bash

列出训练运行（默认: 20，使用 --limit=0 获取全部）

tinker run list [--limit N] [-c COLUMNS]

可用列:

id, model, owner, lora, updated, status,

checkpoint, checkpoint_time

默认列: id, model, lora, updated, status

显示特定运行的详细信息

tinker run info

检查点命令

bash

列出检查点（所有运行，或通过 --run-id 过滤）

tinker checkpoint list [--run-id ID] [--limit N]

显示检查点详细信息

tinker checkpoint info

下载并本地解压检查点

tinker checkpoint download \ [-o OUTPUT_DIR] [--force]

切换公共访问权限

tinker checkpoint publish tinker checkpoint unpublish

设置或移除过期时间（TTL，单位秒）

tinker checkpoint set-ttl --ttl 604800 tinker checkpoint set-ttl --remove

删除检查点（按路径或按过滤条件）

tinker checkpoint delete [PATH2 ...] [-y] tinker checkpoint delete --run-id \ [--type weights|sampler_weights] \ [--before DATE] [--after DATE] [-y]

推送检查点到 HuggingFace Hub

tinker checkpoint push-hf \ [-r REPO_ID] [--public] [--revision REV] \ [--commit-message MSG] [--create-pr] \ [--allow-pattern PAT] [--ignore-pattern PAT] \ [--no-model-card]

有关完整的标志详细信息和输出格式示例，请阅读 references/cli-reference.md。

常见工作流

1. 查找并下载检查点

bash

步骤 1: 查找您的训练运行

tinker run list

步骤 2: 检查运行详情

tinker run info

步骤 3: 列出可用的检查点

tinker checkpoint list --run-id

步骤 4: 下载

tinker checkpoint download \ tinker:///weights/ \ -o ./models/ --force

2. 推送检查点到 HuggingFace

bash

先决条件: 使用 HF 进行身份验证

pip install huggingface_hub && hf auth login

作为公共 PEFT 适配器推送

tinker checkpoint push-hf \ tinker://ID>/samplerweights/ \ -r myorg/my-lora --public

或者创建 PR 而不是直接推送

tinker checkpoint push-hf \ tinker://ID>/samplerweights/ \ -r myorg/my-lora --create-pr

3. 清理旧检查点

bash

删除某个日期之前的检查点

tinker checkpoint delete --run-id \
--type weights --before 2025-01-01 -y

删除特定检查点

tinker checkpoint delete \ tinker:///weights/0001 \ tinker:///weights/0002 -y

4. 使用 JSON 输出编写脚本

bash

将所有运行导出为 JSON

tinker --format json run list --limit=0 > runs.json

使用 jq 解析

jq .runs[].trainingrunid runs.json

批量列出每个运行的检查点

for rid in $(jq -r .runs[].trainingrunid runs.json) do tinker --format json checkpoint list --run-id $rid done

Cookbook 配方

tinker-cookbook 提供即用型训练配方。仓库: https://github.com/thinking-machines-lab/tinker-cookbook

配方架构

每个配方都遵循相同的模式:

python
import chz
from tinker_cookbook.rl import train # 或 supervised

1. 通过 chz.Blueprint 构建类型化配置

def buildconfigblueprint() -> chz.Blueprint[train.Config]: return chz.Blueprint(train.Config).apply({ model_name: meta-llama/Llama-3.1-8B, learning_rate: 2e-4, ... })

2. 运行训练循环

def main(config): asyncio.run(train.main(config))

3. 带有 chz 覆盖的 CLI 入口点

if name == main: bp = buildconfigblueprint() bp.makefromargv(sys.argv[1:]) main(bp.make())

从命令行覆盖任何配置字段:
bash
python -m tinkercookbook.recipes.slbasic \
--model_name Qwen/Qwen3-8B \
--learning_rate 1e-4 \
--log_path /tmp/my-run

运行 SFT（监督微调）

bash

在 NoRobots 数据集上进行最小化 SFT

python -m tinkercookbook.recipes.slbasic

使用自定义数据集（对话的 JSONL 格式）

编辑 sl_basic.py 以使用 FromConversationFileBuilder:

file_path=/path/to/conversations.jsonl

格式: 与 example_data/conversations.jsonl 相同

运行 RL 训练

bash

在 GSM8K 上进行数学 RL

python -m tinkercookbook.recipes.rlbasic

覆盖超参数

python -m tinkercookbook.recipes.rlbasic \ --learning_rate 4e-5 \ --max_tokens 256

可用配方

配方	类型	用例
slbasic	SFT	最小化 SFT 模板
rlbasic

tinker修补匠

tinker

Tinker SDK Skill

Quick Start

Authentication

Verify Installation

Tinker Path Format

CLI Commands

Global Options

Run Commands

Checkpoint Commands

Common Workflows

1. Find and Download a Checkpoint

2. Push a Checkpoint to HuggingFace

3. Clean Up Old Checkpoints

4. Scripting with JSON Output

Cookbook Recipes

Recipe Architecture

Running SFT (Supervised Fine-Tuning)

Running RL Training

Available Recipes

Key Utilities

Supported Models

Quick Troubleshooting

Detailed References

Tinker SDK 技能

对于 cookbook，拉取最新版本并重新安装:

快速开始

身份验证

选项 1: 环境变量（推荐）

选项 2: 配置文件 (~/.tinker/config.json)

验证安装

Tinker 路径格式

CLI 命令

全局选项

运行命令

列出训练运行（默认: 20，使用 --limit=0 获取全部）

可用列:

id, model, owner, lora, updated, status,

checkpoint, checkpoint_time

默认列: id, model, lora, updated, status

显示特定运行的详细信息

检查点命令

列出检查点（所有运行，或通过 --run-id 过滤）

显示检查点详细信息

下载并本地解压检查点

切换公共访问权限

设置或移除过期时间（TTL，单位秒）

删除检查点（按路径或按过滤条件）

推送检查点到 HuggingFace Hub

常见工作流

1. 查找并下载检查点

步骤 1: 查找您的训练运行

步骤 2: 检查运行详情

步骤 3: 列出可用的检查点

步骤 4: 下载

2. 推送检查点到 HuggingFace

先决条件: 使用 HF 进行身份验证

pip install huggingface_hub && hf auth login

作为公共 PEFT 适配器推送

或者创建 PR 而不是直接推送

3. 清理旧检查点

删除某个日期之前的检查点

删除特定检查点

4. 使用 JSON 输出编写脚本

将所有运行导出为 JSON

使用 jq 解析

批量列出每个运行的检查点

Cookbook 配方

配方架构

1. 通过 chz.Blueprint 构建类型化配置

2. 运行训练循环

3. 带有 chz 覆盖的 CLI 入口点

运行 SFT（监督微调）

在 NoRobots 数据集上进行最小化 SFT

使用自定义数据集（对话的 JSONL 格式）

编辑 sl_basic.py 以使用 FromConversationFileBuilder:

file_path=/path/to/conversations.jsonl

格式: 与 example_data/conversations.jsonl 相同

运行 RL 训练