Catalog SKU Matcher India
Purpose
Build reliable cross-store product matching for Indian catalogs so price comparison is accurate.
Disclaimer
This skill provides matching and normalization guidance only. It does not guarantee perfect match accuracy for all catalogs or seller data quality.
Use at your own risk. The skill author/publisher/developer is not liable for direct or indirect loss, incorrect match decisions, trading losses, or other damages arising from use or misuse of this guidance.
Matching strategy
Use a layered approach:
- 1. Hard identifiers
- model number / GTIN / MPN / ISBN where available
- 2. Variant normalization
- brand
- model family
- storage/RAM
- color
- size/pack quantity
- condition (new/refurbished/used)
- 3. Soft similarity
- token similarity on cleaned title
- key-attribute overlap
- seller metadata sanity checks
- 4. Confidence score
-
high: auto-match
-
medium: human review queue
-
low: reject
False-match guardrails
- - Never match different storage/RAM variants as same SKU.
- Never match bundles/accessories to standalone products.
- Never ignore refurbished/used condition differences.
- Require manual review when two or more variant fields are missing.
Output format
When matching listings, return:
- 1. canonical SKU candidate
- matched listings with confidence level
- rejected candidates with reason codes
- manual review queue entries
Setup
Read setup.md and define normalization dictionaries first.
Validation
Run validation-checklist.md on labeled test sets before production.
References
目录SKU匹配器(印度版)
目的
为印度目录构建可靠的跨店铺产品匹配,确保价格比较的准确性。
免责声明
本技能仅提供匹配与标准化指导。不保证所有目录或卖家数据质量的完美匹配准确性。
使用风险自负。技能作者/发布者/开发者不对因使用或误用本指导而产生的直接或间接损失、错误匹配决策、交易亏损或其他损害承担责任。
匹配策略
采用分层方法:
- 1. 硬标识符
- 型号/GTIN/MPN/ISBN(如有)
- 2. 变体标准化
- 品牌
- 产品系列
- 存储容量/内存
- 颜色
- 尺寸/包装数量
- 成色(全新/翻新/二手)
- 3. 软相似度
- 清洗后标题的令牌相似度
- 关键属性重叠度
- 卖家元数据合理性检查
- 4. 置信度评分
- 高:自动匹配
- 中:人工审核队列
- 低:拒绝匹配
误匹配防护措施
- - 切勿将不同存储容量/内存的变体匹配为同一SKU。
- 切勿将套装/配件与独立产品匹配。
- 切勿忽略翻新/二手成色差异。
- 当两个或以上变体字段缺失时,需进行人工审核。
输出格式
匹配列表时,返回:
- 1. 规范SKU候选
- 已匹配列表及置信度等级
- 被拒绝候选及原因代码
- 人工审核队列条目
设置
先阅读setup.md并定义标准化词典。
验证
在生产环境前,对标注测试集运行validation-checklist.md。
参考