Img2MD - Quick Image OCR to Markdown
Extract text from images to Markdown using MinerU Open API. No API key required.
Quick Start
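```bash
# Extract text from a local image
mineru-open-api flash-extract screenshot.png

# Extract text from an image URL
mineru-open-api flash-extract https://example.com/image.png

# Save the output under ./output/
mineru-open-api flash-extract photo.jpg -o ./output/

# Pass an English language hint
mineru-open-api flash-extract scan.jpg --language en
```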
Language Rule
You MUST reply to the user in the SAME language they use. This is non-negotiable.
Capabilities
- OCR text extraction from PNG, JPG, JPEG, WebP, BMP, TIFF
- Supports both local files and URLs directly
- Language hint with --language (default: ch; use en for English)
- No API key, no signup, no authentication
- Max 10MB per image
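The format and size limits above can be checked locally before an image is uploaded. The following is a minimal sketch, not part of the MinerU CLI: img2md_precheck is a hypothetical helper name, and it assumes that "10MB" means 10 × 1024 × 1024 bytes and that only the extensions listed above are accepted.

```bash
# Hypothetical pre-flight check; not provided by mineru-open-api.
img2md_precheck() {
    input=$1
    case "$input" in
        http://*|https://*)
            # URLs are passed through; their size cannot be checked locally.
            return 0 ;;
    esac
    # Lowercase the extension so "PHOTO.JPG" is accepted too.
    ext=$(printf '%s' "${input##*.}" | tr '[:upper:]' '[:lower:]')
    case "$ext" in
        png|jpg|jpeg|webp|bmp|tiff) ;;
        *) echo "unsupported format: $input" >&2; return 1 ;;
    esac
    [ -f "$input" ] || { echo "file not found: $input" >&2; return 1; }
    # Assumption: the 10MB limit is 10 * 1024 * 1024 bytes.
    size=$(wc -c < "$input" | tr -d ' ')
    if [ "$size" -gt 10485760 ]; then
        echo "file exceeds 10MB: $input" >&2
        return 1
    fi
    return 0
}
```

A possible use is to chain it in front of the real command, for example img2md_precheck screenshot.png && mineru-open-api flash-extract screenshot.png, so that an oversized or unsupported file fails fast without a network round trip.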
When to Use
- User asks to "read", "extract", or "OCR" an image
- User shares a screenshot and asks what it says
- User wants text from a photo of a document or whiteboard
- User needs image content converted to Markdown
CLI Reference
Run mineru-open-api flash-extract --help for all available options.
Data Privacy
- flash-extract uploads the image to MinerU's cloud API for processing and returns the result. No account or API key is required.
- Images are processed in real time and are not stored after extraction.
- For details, see https://mineru.net
Notes
- Output is Markdown text extracted via OCR
- For higher precision or batch processing, use mineru-open-api extract (requires auth via mineru-open-api auth)
- If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli
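Before calling the CLI, a script can confirm that it is actually installed and point to the download page otherwise. This is a minimal sketch under stated assumptions: require_cli is a hypothetical helper name, and it only checks that the command is on PATH, not that it works or which version it is.

```bash
# Hypothetical helper; checks PATH only, not version or functionality.
require_cli() {
    cmd=${1:-mineru-open-api}
    if command -v "$cmd" >/dev/null 2>&1; then
        return 0
    fi
    echo "$cmd not found; install it via npm/uv/go or download it from https://mineru.net/ecosystem?tab=cli" >&2
    return 1
}
```

For example, require_cli && mineru-open-api flash-extract --help prints the option list only when the command is available.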
Img2MD - Quick Image OCR to Markdown
Extract text from images and convert it to Markdown using the MinerU Open API. No API key required.
Quick Start
```bash
# Extract text from a local image
mineru-open-api flash-extract screenshot.png

# Extract text from an image URL
mineru-open-api flash-extract https://example.com/image.png

# Save the output under ./output/
mineru-open-api flash-extract photo.jpg -o ./output/

# Pass an English language hint
mineru-open-api flash-extract scan.jpg --language en
```
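Language Rule
You MUST reply to the user in the same language they use. This is a hard requirement.
Capabilities
- OCR text extraction from PNG, JPG, JPEG, WebP, BMP, and TIFF images
- Handles local files and URLs directly
- Language hint via --language (default: ch; use en for English)
- No API key, no signup, no authentication
- Max 10MB per image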
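When to Use
- User asks to read, extract, or OCR an image
- User shares a screenshot and asks what it says
- User wants text from a photo of a document or whiteboard
- User needs image content converted to Markdown
CLI Reference
Run mineru-open-api flash-extract --help to see all available options.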
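Data Privacy
- flash-extract uploads the image to MinerU's cloud API for processing and returns the result. No account or API key is required.
- Images are processed in real time and are not stored after extraction.
- For details, see https://mineru.net
Notes
- Output is Markdown text extracted via OCR
- For higher precision or batch processing, use mineru-open-api extract (requires authentication via mineru-open-api auth)
- If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli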