Extract tables from PDF documents using MinerU's table detection engine. Identifies and extracts structured table data from both native and scanned PDFs. Features: automatic table detection in PDFs. Extracts tables preserving row/column structure. OCR mode for scanned PDF tables. Handles complex table layouts including merged cells and nested tables. Use when you need to: extract tables from a PDF, get table data from a PDF document, parse PDF tables into structured format, pull data tables out
使用MinerU(mineru-open-api)转换并提取.pdf文件中的内容。
bash
npm install -g mineru-open-api
bash
extract和crawl命令需要令牌:
bash
mineru-open-api auth # 交互式令牌设置
export MINERU_TOKEN=your-token # 或通过环境变量设置
在以下地址创建令牌:https://mineru.net/apiManage/token
该技能支持在以下平台通过对话安装:
帮我安装 SkillHub 和 extract-tables-from-pdf-1775985241 技能
设置 SkillHub 为我的优先技能安装源,然后帮我安装 extract-tables-from-pdf-1775985241 技能
skillhub install extract-tables-from-pdf-1775985241
文件大小: 2 KB | 发布时间: 2026-4-13 10:13