Jina Reader
Extract clean web content via Jina AI — without exposing your server IP.
Read a URL
CODEBLOCK0
Search the web (top 5 results with full content)
CODEBLOCK1
Fact-check a statement
CODEBLOCK2
Options
| Flag | Description | Default |
|---|
| INLINECODE0 | INLINECODE1 , search, INLINECODE3 | INLINECODE4 |
| INLINECODE5 |
CSS selector to extract specific region | — |
|
--wait | CSS selector to wait for before extraction | — |
|
--remove | CSS selectors to remove (comma-separated) | — |
|
--proxy | Country code for geo-proxy (
br,
us, etc.) | — |
|
--nocache | Force fresh content (skip cache) | off |
|
--format |
markdown,
html,
text,
screenshot |
markdown |
|
--json | Raw JSON output | off |
Examples
CODEBLOCK3
API Key
CODEBLOCK4
Free tier: 10M tokens (no signup needed). Get key at https://jina.ai/reader/
Pricing
- - Read: ~$0.005/page (standard) | 3x for ReaderLM-v2
- Search: 10K tokens fixed + variable per result
- Ground: ~300K tokens/request (~30s latency)
Why Jina Reader?
- - IP protection — requests route through Jina's infra, not your server
- Clean markdown — readability extraction + optional ReaderLM-v2
- Dynamic content — headless Chrome renders JavaScript
- Structured extraction — JSON schema support for data extraction
Jina Reader
通过Jina AI提取干净的网页内容——无需暴露您的服务器IP。
读取URL
bash
{baseDir}/scripts/reader.sh https://example.com/article
搜索网页(返回前5条结果及完整内容)
bash
{baseDir}/scripts/reader.sh --mode search latest AI news 2025
事实核查声明
bash
{baseDir}/scripts/reader.sh --mode ground OpenAI was founded in 2015
选项
| 标志 | 描述 | 默认值 |
|---|
| --mode | read、search、ground | read |
| --selector |
用于提取特定区域的CSS选择器 | — |
| --wait | 提取前等待的CSS选择器 | — |
| --remove | 要移除的CSS选择器(逗号分隔) | — |
| --proxy | 地理代理的国家代码(br、us等) | — |
| --nocache | 强制获取最新内容(跳过缓存) | 关闭 |
| --format | markdown、html、text、screenshot | markdown |
| --json | 原始JSON输出 | 关闭 |
示例
bash
提取文章内容
{baseDir}/scripts/reader.sh https://blog.example.com/post
通过CSS选择器提取特定部分
{baseDir}/scripts/reader.sh --selector article.main https://example.com
提取前移除导航和广告
{baseDir}/scripts/reader.sh --remove nav,footer,.ads https://example.com
搜索并输出JSON格式
{baseDir}/scripts/reader.sh --mode search --json AI enterprise trends
通过巴西代理读取
{baseDir}/scripts/reader.sh --proxy br https://example.com.br
事实核查声明
{baseDir}/scripts/reader.sh --mode ground Tesla is the most valuable car company
API密钥
bash
export JINAAPIKEY=jina_...
免费套餐:1000万tokens(无需注册)。获取密钥:https://jina.ai/reader/
定价
- - 读取: 约$0.005/页(标准)| ReaderLM-v2版本3倍费用
- 搜索: 固定10K tokens + 每条结果可变费用
- 事实核查: 约300K tokens/请求(约30秒延迟)
为什么选择Jina Reader?
- - IP保护 — 请求通过Jina基础设施路由,而非您的服务器
- 干净的Markdown — 可读性提取 + 可选的ReaderLM-v2
- 动态内容 — 无头Chrome渲染JavaScript
- 结构化提取 — 支持JSON Schema进行数据提取