Academic Reader - Scientific Paper Parser
Convert PDF files to clean Markdown using MinerU Open API. No API key required.
Quick Start
CODEBLOCK0
Language Rule
You MUST reply to the user in the SAME language they use. This is non-negotiable.
Capabilities
- - Extracts text, tables, and formulas from PDF
- Supports both local files and URLs directly
- Page range selection with INLINECODE0
- Language hint with
--language (default: ch, use en for English) - No API key, no signup, no authentication
- Max 10MB / 20 pages per document
When to Use
- - User asks to "read", "extract", "convert", or "parse" a PDF
- User shares a PDF file or PDF link and asks for its content
- User wants to summarize or analyze a PDF document
- User needs PDF content in Markdown format
CLI Reference
Run mineru-open-api flash-extract --help for all available options.
Data Flow
INLINECODE5 sends the document to the MinerU API (mineru.net) for processing and returns Markdown. This is a stateless API call — no account, no persistent storage. MinerU is an open-source project by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
Notes
- - Output is Markdown only; images/tables/formulas may be replaced with placeholders
- For larger files (up to 200MB/600 pages) or precision extraction with full assets, use
mineru-open-api extract (requires auth via mineru-open-api auth) - If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli
Academic Reader - 学术论文解析器
使用MinerU开放API将PDF文件转换为干净的Markdown格式。无需API密钥。
快速开始
bash
Academic Reader - 学术论文解析器
mineru-open-api flash-extract report.pdf
Academic Reader - 学术论文解析器
mineru-open-api flash-extract https://cdn-mineru.openxlab.org.cn/demo/example.pdf
Academic Reader - 学术论文解析器
mineru-open-api flash-extract report.pdf -o ./output/
Academic Reader - 学术论文解析器
mineru-open-api flash-extract report.pdf --pages 1-10
语言规则
您必须使用用户使用的相同语言回复。这是不可协商的。
功能
- - 从PDF中提取文本、表格和公式
- 直接支持本地文件和URL
- 使用--pages选择页面范围
- 使用--language设置语言提示(默认:ch,英文使用en)
- 无需API密钥、无需注册、无需身份验证
- 每个文档最大10MB/20页
使用场景
- - 用户要求读取、提取、转换或解析PDF
- 用户分享PDF文件或PDF链接并询问其内容
- 用户想要总结或分析PDF文档
- 用户需要Markdown格式的PDF内容
CLI参考
运行mineru-open-api flash-extract --help查看所有可用选项。
数据流
flash-extract将文档发送到MinerU API(mineru.net)进行处理并返回Markdown。这是一个无状态的API调用——无需账户,无持久存储。MinerU是OpenDataLab(上海人工智能实验室)的开源项目:https://github.com/opendatalab/MinerU
注意事项
- - 输出仅为Markdown;图片/表格/公式可能被替换为占位符
- 对于更大的文件(最大200MB/600页)或需要完整资源的精确提取,请使用mineru-open-api extract(需要通过mineru-open-api auth进行身份验证)
- 如果无法通过npm/uv/go安装CLI,请从https://mineru.net/ecosystem?tab=cli下载