Lobster AI Usage Guide
Lobster AI is a multi-agent bioinformatics platform. Users describe analyses in natural
language -- Lobster routes to 22 specialist agents across 10 packages automatically.
Requirements
- - Binaries:
lobster CLI (pip install lobster-ai), Python 3.12+ - Credential: Exactly ONE LLM provider key as env var (not all — pick one):
-
ANTHROPIC_API_KEY |
GOOGLE_API_KEY |
OPENAI_API_KEY |
OPENROUTER_API_KEY
-
AWS_ACCESS_KEY_ID +
AWS_SECRET_ACCESS_KEY (Bedrock — both required)
-
AZURE_AI_ENDPOINT +
AZURE_AI_CREDENTIAL (Azure — both required)
- Ollama: no key needed (local models)
- - Optional:
NCBI_API_KEY for faster PubMed/GEO - Writes:
.lobster_workspace/ (data, credentials in .env mode 0600, outputs) - Global config (
--global flag, NOT default): ~/.config/lobster/ — avoid unless needed - Network: LLM provider API + public bio databases (GEO, SRA, PRIDE, MetaboLights)
Docs Discovery
The docs site at docs.omics-os.com exposes LLM-friendly raw markdown:
| Route | Use |
|---|
| INLINECODE15 | Index of all pages (title + URL + description) |
| INLINECODE16 |
Full content dump of all free pages |
|
/raw/docs/{slug}.md | Raw markdown for a specific page |
Workflow: Fetch /llms.txt first to discover slugs, then fetch individual pages via /raw/docs/{slug}.md.
Example: INLINECODE20
Two Modes
This skill supports coding agents in two modes:
Orchestrator -- The agent calls lobster query --json --session-id programmatically,
parses structured output, and chains multi-step analyses. See agent-patterns.md.
Guide -- The agent teaches a human user what to type in lobster chat or lobster query.
See the routing table below for which docs page to fetch.
Quick Start
CODEBLOCK0
Source: github.com/the-omics-os/lobster |
PyPI: pypi.org/project/lobster-ai
Routing Table
| You want to... | Docs slug | Skill reference |
|---|
| Install & configure | INLINECODE24 | -- |
| Configuration options |
getting-started/configuration | -- |
|
Use the CLI |
guides/cli-commands |
cli-reference.md |
|
Orchestrate programmatically | -- |
agent-patterns.md |
|
Analyze scRNA-seq |
tutorials/single-cell-rnaseq | -- |
|
Analyze bulk RNA-seq |
tutorials/bulk-rnaseq | -- |
|
Analyze proteomics |
tutorials/proteomics | -- |
|
Understand data formats |
guides/data-formats | -- |
|
Search literature / datasets |
agents/research | -- |
|
Analyze genomics |
agents/genomics | -- |
|
Analyze metabolomics |
case-studies/metabolomics | -- |
|
ML / feature selection |
agents/ml | -- |
|
Drug discovery |
agents/drug-discovery | -- |
|
Visualize results |
agents/visualization | -- |
|
Troubleshoot |
support/troubleshooting | -- |
|
See case studies |
case-studies/{domain} | -- |
|
All agent capabilities |
agents | -- |
|
Extend Lobster (dev) | -- | Use
lobster-dev skill |
To fetch a docs page: INLINECODE41
Hard Rules
- 1. Always use
--session-id for multi-step analyses -- loaded data persists across queries - Use
lobster command --json for workspace inspection (no tokens burned, ~300ms) - Research Agent is the ONLY agent with internet access -- all others operate on loaded data
- Never skip QC before analysis -- always assess quality first
- Use
--json flag when parsing output programmatically - Check data is loaded before running analysis steps (
lobster command data --json) - Default workspace:
.lobster_workspace/ -- override with INLINECODE47 - Fetch docs on demand from
docs.omics-os.com/raw/docs/{slug}.md -- don't guess workflows
Agent Overview
22 agents across 10 packages. Supervisor routes automatically based on natural language.
| Agent | Package | Handles |
|---|
| Supervisor | lobster-ai | Routes queries, coordinates agents |
| Research Agent |
lobster-research | PubMed, GEO, SRA, PRIDE, MetaboLights search (online) |
| Data Expert | lobster-research | File loading, downloads, format conversion (offline) |
| Transcriptomics Expert | lobster-transcriptomics | scRNA-seq + bulk RNA-seq: QC, clustering, trajectory |
| Annotation Expert | lobster-transcriptomics | Cell type annotation, gene set enrichment (child) |
| DE Analysis Expert | lobster-transcriptomics | Differential expression, pseudobulk, GSEA (child) |
| Proteomics Expert | lobster-proteomics | MS + affinity import, QC, normalization, batch correction |
| Proteomics DE Expert | lobster-proteomics | Protein DE, pathway enrichment, KSEA, STRING PPI (child) |
| Biomarker Discovery | lobster-proteomics | Panel selection, nested CV, hub proteins (child) |
| Metabolomics Expert | lobster-metabolomics | LC-MS/GC-MS/NMR: QC, normalization, PCA/PLS-DA, annotation |
| Genomics Expert | lobster-genomics | VCF/PLINK: QC, GWAS, variant annotation |
| Variant Analysis Expert | lobster-genomics | VEP annotation, ClinVar, clinical prioritization (child) |
| ML Expert | lobster-ml | ML prep, scVI embeddings, data export |
| Feature Selection Expert | lobster-ml | Stability selection, LASSO, variance filtering (child) |
| Survival Analysis Expert | lobster-ml | Cox models, Kaplan-Meier, risk stratification (child) |
| Drug Discovery Expert | lobster-drug-discovery | Drug target validation, compound profiling |
| Cheminformatics Expert | lobster-drug-discovery | Molecular descriptors, fingerprints, similarity (child) |
| Clinical Dev Expert | lobster-drug-discovery | Trial design, endpoint analysis, safety signals (child) |
| Pharmacogenomics Expert | lobster-drug-discovery | PGx variants, drug-gene interactions (child) |
| Visualization Expert | lobster-visualization | UMAP, heatmaps, volcano plots, dot plots (Plotly) |
| Metadata Assistant | lobster-metadata | ID mapping, metadata standardization (internal) |
| Protein Structure Viz | lobster-structural-viz | PDB fetch, PyMOL visualization, RMSD |
Per-agent docs: INLINECODE49
Lobster AI 使用指南
Lobster AI 是一个多智能体生物信息学平台。用户用自然语言描述分析需求——Lobster会自动将任务路由到10个软件包中的22个专业智能体。
系统要求
- - 可执行文件: lobster 命令行工具 (pip install lobster-ai), Python 3.12+
- 凭证: 只需一个LLM提供商的密钥作为环境变量(非全部——选其一):
- ANTHROPIC
APIKEY | GOOGLE
APIKEY | OPENAI
APIKEY | OPENROUTER
APIKEY
- AWS
ACCESSKEY
ID + AWSSECRET
ACCESSKEY (Bedrock — 两者都需要)
- AZURE
AIENDPOINT + AZURE
AICREDENTIAL (Azure — 两者都需要)
- Ollama: 无需密钥(本地模型)
- - 可选: NCBIAPIKEY 用于加速PubMed/GEO访问
- 写入目录: .lobster_workspace/(数据、凭证以.env模式0600存储、输出文件)
- 全局配置(--global标志,非默认):~/.config/lobster/ — 除非必要,否则避免使用
- 网络: LLM提供商API + 公共生物数据库(GEO, SRA, PRIDE, MetaboLights)
文档发现
文档网站 docs.omics-os.com 提供LLM友好的原始Markdown格式:
| 路由 | 用途 |
|---|
| /llms.txt | 所有页面的索引(标题 + URL + 描述) |
| /llms-full.txt |
所有免费页面的完整内容转储 |
| /raw/docs/{slug}.md | 特定页面的原始Markdown |
工作流程: 先获取 /llms.txt 发现slug,然后通过 /raw/docs/{slug}.md 获取单个页面。
示例: https://docs.omics-os.com/raw/docs/tutorials/single-cell-rnaseq.md
两种模式
本技能支持编码智能体以两种模式运行:
编排模式 — 智能体以编程方式调用 lobster query --json --session-id,解析结构化输出,并串联多步分析。参见 agent-patterns.md。
指导模式 — 智能体教人类用户在 lobster chat 或 lobster query 中输入什么内容。参见下方路由表以获取相应文档页面。
快速开始
bash
安装(PyPI — 推荐)
pip install lobster-ai[full]
或: uv tool install lobster-ai[full]
配置(使用环境变量 — 绝不在命令行传递原始密钥)
lobster init --non-interactive --anthropic-key $ANTHROPIC
APIKEY --profile production
运行分析(始终同时传递 -w 和 --session-id)
lobster query -w ./my_analysis --session-id proj --json 下载GSE109564并运行QC
检查工作空间(不消耗令牌,约300ms)
lobster command data --json -w ./my_analysis
源码: github.com/the-omics-os/lobster |
PyPI: pypi.org/project/lobster-ai
路由表
| 您想要... | 文档slug | 技能参考 |
|---|
| 安装与配置 | getting-started/installation | -- |
| 配置选项 |
getting-started/configuration | -- |
|
使用命令行 | guides/cli-commands |
cli-reference.md |
|
以编程方式编排 | -- |
agent-patterns.md |
|
分析单细胞RNA-seq | tutorials/single-cell-rnaseq | -- |
|
分析批量RNA-seq | tutorials/bulk-rnaseq | -- |
|
分析蛋白质组学 | tutorials/proteomics | -- |
|
理解数据格式 | guides/data-formats | -- |
|
搜索文献/数据集 | agents/research | -- |
|
分析基因组学 | agents/genomics | -- |
|
分析代谢组学 | case-studies/metabolomics | -- |
|
机器学习/特征选择 | agents/ml | -- |
|
药物发现 | agents/drug-discovery | -- |
|
可视化结果 | agents/visualization | -- |
|
故障排除 | support/troubleshooting | -- |
|
查看案例研究 | case-studies/{domain} | -- |
|
所有智能体能力 | agents | -- |
|
扩展Lobster(开发) | -- | 使用 lobster-dev 技能 |
获取文档页面: https://docs.omics-os.com/raw/docs/{slug}.md
硬性规则
- 1. 始终使用 --session-id 进行多步分析 — 加载的数据在查询间持续存在
- 使用 lobster command --json 检查工作空间(不消耗令牌,约300ms)
- 研究智能体是唯一具有互联网访问权限的智能体 — 其他所有智能体仅操作已加载的数据
- 分析前切勿跳过QC — 始终先评估质量
- 以编程方式解析输出时使用 --json 标志
- 运行分析步骤前检查数据是否已加载(lobster command data --json)
- 默认工作空间: .lobster_workspace/ — 使用 -w 覆盖
- 按需从 docs.omics-os.com/raw/docs/{slug}.md 获取文档 — 不要猜测工作流程
智能体概览
10个软件包中的22个智能体。监督者根据自然语言自动路由。
| 智能体 | 软件包 | 处理内容 |
|---|
| 监督者 | lobster-ai | 路由查询,协调智能体 |
| 研究智能体 |
lobster-research | PubMed, GEO, SRA, PRIDE, MetaboLights搜索(在线) |
| 数据专家 | lobster-research | 文件加载,下载,格式转换(离线) |
| 转录组学专家 | lobster-transcriptomics | 单细胞RNA-seq + 批量RNA-seq: QC, 聚类, 轨迹分析 |
| 注释专家 | lobster-transcriptomics | 细胞类型注释,基因集富集(子智能体) |
| 差异表达分析专家 | lobster-transcriptomics | 差异表达,伪批量,GSEA(子智能体) |
| 蛋白质组学专家 | lobster-proteomics | MS + 亲和导入,QC,标准化,批次校正 |
| 蛋白质组学DE专家 | lobster-proteomics | 蛋白质DE,通路富集,KSEA,STRING PPI(子智能体) |
| 生物标志物发现 | lobster-proteomics | 面板选择,嵌套CV,枢纽蛋白(子智能体) |
| 代谢组学专家 | lobster-metabolomics | LC-MS/GC-MS/NMR: QC,标准化,PCA/PLS-DA,注释 |
| 基因组学专家 | lobster-genomics | VCF/PLINK: QC,GWAS,变异注释 |
| 变异分析专家 | lobster-genomics | VEP注释,ClinVar,临床优先级排序(子智能体) |
| 机器学习专家 | lobster-ml | ML准备,scVI嵌入,数据导出 |
| 特征选择专家 | lobster-ml | 稳定性选择,LASSO,方差过滤(子智能体) |
| 生存分析专家 | lobster-ml | Cox模型,Kaplan-Meier,风险分层(子智能体) |
| 药物发现专家 | lobster-drug-discovery | 药物靶点验证,化合物分析 |
| 化学信息学专家 | lobster-drug-discovery | 分子描述符,指纹图谱,相似性(子智能体) |
| 临床开发专家 | lobster-drug-discovery | 试验设计,终点分析,安全性信号(子智能体) |
| 药物基因组学专家 | lobster-drug-discovery | PGx变异,药物-基因相互作用(子智能体) |
| 可视化专家 | lobster-visualization | UMAP,热图,火山图,点图(Plotly) |
| 元数据助手 | lobster-metadata | ID映射,元数据标准化(内部) |
| 蛋白质结构可视化 | lobster-structural-viz | PDB获取,PyMOL可视化,RMSD |
各智能体文档: https://docs.omics-os.com/raw/docs/agents/{domain}.md