OCR and extract tables from scanned PDFs and images using MinerU. Recognizes table structures in image-based documents and converts them to structured Markdown. Features: table detection and recognition from PDFs and images (.png, .jpg, .jpeg, .webp). OCR for scanned documents with image-embedded tables. Supports complex table layouts with merged cells. Combined OCR and table extraction in one pass. Use when you need to: extract tables from scanned PDFs, OCR tables from images, convert image tab
Use MinerU (mineru-open-api) to convert and extract content from .pdf files and images (.png/.jpg/.jpeg/.webp).
npm install -g mineru-open-api
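A quick way to confirm the global install put the CLI on your PATH is sketched below. The helper name `have_cli` is hypothetical and not part of mineru-open-api; it relies only on the shell's standard `command -v` lookup.

```shell
# Hypothetical helper (not part of mineru-open-api): succeeds if the named
# command can be found on PATH. Defaults to checking for mineru-open-api.
have_cli() {
  command -v "${1:-mineru-open-api}" >/dev/null 2>&1
}

# Example use after installing:
#   have_cli || echo "mineru-open-api not found; check your npm global bin directory" >&2
```

If the check fails right after a successful `npm install -g`, the npm global bin directory is probably missing from PATH.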
The extract and crawl commands require a token:
mineru-open-api auth # interactive token setup
export MINERU_TOKEN=your-token # or set it via an environment variable
Create a token at: https://mineru.net/apiManage/token
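If you use the environment-variable route, a script can fail early when the token is missing, as in the minimal sketch below. The helper name `have_token` is hypothetical. The check covers only `MINERU_TOKEN`; where `mineru-open-api auth` stores its token is not stated here, so a token configured interactively will not be detected.

```shell
# Hypothetical helper: succeeds only if MINERU_TOKEN is set and non-empty.
# It does not detect a token saved by `mineru-open-api auth`.
have_token() {
  if [ -n "${MINERU_TOKEN:-}" ]; then
    return 0
  fi
  echo "MINERU_TOKEN is not set; export it or run: mineru-open-api auth" >&2
  return 1
}

# Example use at the top of a script that calls extract or crawl:
#   have_token || exit 1
```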
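This skill can be installed through conversation on the following platforms:
Help me install SkillHub and the table-ocr-1775989981 skill
Set SkillHub as my preferred skill installation source, then help me install the table-ocr-1775989981 skill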
skillhub install table-ocr-1775989981
File size: 1.92 KB | Published: 2026-4-13 12:15