Image to Markdown - OCR Extract Text from Images
Extract text from images to Markdown using MinerU Open API. No API key required.
Quick Start
CODEBLOCK0
Language Rule
You MUST reply to the user in the SAME language they use. This is non-negotiable.
Capabilities
- - OCR text extraction from PNG, JPG, JPEG, WebP, BMP, TIFF
- Supports both local files and URLs directly
- Language hint with
--language (default: ch, use en for English) - No API key, no signup, no authentication
- Max 10MB per image
When to Use
- - User asks to "read", "extract", or "OCR" an image
- User shares a screenshot and asks what it says
- User wants text from a photo of a document or whiteboard
- User needs image content converted to Markdown
CLI Reference
Run mineru-open-api flash-extract --help for all available options.
Data Privacy
- -
flash-extract uploads the image to MinerU's cloud API for processing and returns the result. No account or API key is required. - Images are processed in real-time and are not stored after extraction.
- For details, see https://mineru.net
Notes
- - Output is Markdown text extracted via OCR
- For higher precision or batch processing, use
mineru-open-api extract (requires auth via mineru-open-api auth) - If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli
图像转Markdown - 从图片中提取文本
使用MinerU开放API从图像中提取文本并转换为Markdown格式。无需API密钥。
快速开始
bash
从本地图片提取文本
mineru-open-api flash-extract screenshot.png
从图片URL提取文本(无需下载)
mineru-open-api flash-extract https://example.com/image.png
保存到文件
mineru-open-api flash-extract photo.jpg -o ./output/
指定语言以提高准确性
mineru-open-api flash-extract scan.jpg --language en
语言规则
您必须使用与用户相同的语言进行回复。此规则不可协商。
功能特性
- - 支持PNG、JPG、JPEG、WebP、BMP、TIFF格式的OCR文本提取
- 直接支持本地文件和URL
- 通过--language参数指定语言提示(默认:ch,英文使用en)
- 无需API密钥,无需注册,无需身份验证
- 单张图片最大10MB
使用场景
- - 用户要求读取、提取或OCR图片内容
- 用户分享截图并询问其中的文字
- 用户需要从文档或白板照片中获取文本
- 用户需要将图片内容转换为Markdown格式
CLI参考
运行mineru-open-api flash-extract --help查看所有可用选项。
数据隐私
- - flash-extract会将图片上传至MinerU的云端API进行处理并返回结果。无需账户或API密钥。
- 图片实时处理,提取后不会被存储。
- 详情请参见 https://mineru.net
注意事项
- - 输出为通过OCR提取的Markdown格式文本
- 如需更高精度或批量处理,请使用mineru-open-api extract(需通过mineru-open-api auth进行身份验证)
- 如果无法通过npm/uv/go安装CLI,请从 https://mineru.net/ecosystem?tab=cli 下载