Acta Electronica Sinica ›› 2019, Vol. 47 ›› Issue (5): 1023-1028. DOI: 10.3969/j.issn.0372-2112.2019.05.007

• Research Article •

A BOM Similarity Measurement Method Based on an Ensemble Model

WU Wen-li1, FAN Xiao-peng2, ZHOU Geng-shen1, HUANG Yi1, CAO Yang1, LIN Gui-chan1

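  1. China Greatwall Technology Group Co., Ltd., Shenzhen, Guangdong 518000;
    2. Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518000
  • Received: 2018-08-22 Revised: 2019-01-02 Online: 2019-05-25 Published: 2019-05-25
  • Corresponding author: FAN Xiao-peng
  • About the author: WU Wen-li, female, born in 1986. Postdoctoral researcher jointly recruited by the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, and the postdoctoral practice base of China Greatwall Technology Group Co., Ltd. Her main research interests are machine learning and data mining. E-mail: huihuigou@whu.edu.cn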

A Novel BOM Similarity Metric Method Based on Ensemble Model

WU Wen-li1, FAN Xiao-peng2, ZHOU Geng-shen1, HUANG Yi1, CAO Yang1, LIN Gui-chan1   

  1. China Greatwall Technology Group Co., Ltd., Shenzhen, Guangdong 518000, China;
    2. Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518000, China
  • Received: 2018-08-22 Revised: 2019-01-02 Online: 2019-05-25 Published: 2019-05-25

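Abstract: To meet the need for effectively partitioning product families under multi-variety, small-batch production and mass customization, this paper comprehensively analyzes the features contained in a BOM (Bill of Materials), summarizes existing structural similarity methods, proposes a content similarity metric model, and on that basis proposes an ensemble model that combines the two. For the structural similarity model, each BOM is represented by an adjacency matrix that encodes the BOM's hierarchical structure and material quantities, and orthogonal Procrustes analysis is used to compute the degree of similarity between BOMs. For the content similarity model, effective features are extracted from the BOM text, inverse-frequency term weighting is introduced to convert the text features into machine-readable vectors, and the cosine similarity formula is used to compute the similarity between vectors. For the ensemble model, a weight allocation method based on the Gini coefficient is proposed to combine the structural and content models. Finally, a test framework is provided, and experiments evaluate how the ensemble model compares with existing methods in model performance and training time.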
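The structural model summarized above can be illustrated with a minimal sketch. It assumes NumPy is available, that all BOMs are indexed over one shared item list, and that entry `[i][j]` of the adjacency matrix holds the quantity of item `j` used directly by item `i`. The function name `procrustes_distance` and the sample matrices are hypothetical, and the final mapping from distance to a similarity score is an illustrative assumption rather than the paper's formula.

```python
import numpy as np


def procrustes_distance(a: np.ndarray, b: np.ndarray) -> float:
    """Return min over orthogonal R of the Frobenius norm ||a @ R - b||."""
    # The optimal orthogonal matrix is R = U @ Vt, where U, Vt come from
    # the singular value decomposition of a.T @ b.
    u, _, vt = np.linalg.svd(a.T @ b)
    r = u @ vt
    return float(np.linalg.norm(a @ r - b))


# Hypothetical BOMs over the shared item list [bike, frame, wheel, bolt].
# Row = parent item, column = child item, value = quantity per parent.
bom_a = np.array([
    [0.0, 1.0, 2.0, 0.0],   # bike  -> 1 frame, 2 wheels
    [0.0, 0.0, 0.0, 0.0],   # frame -> (leaf)
    [0.0, 0.0, 0.0, 4.0],   # wheel -> 4 bolts
    [0.0, 0.0, 0.0, 0.0],   # bolt  -> (leaf)
])
bom_b = bom_a.copy()
bom_b[2, 3] = 6.0           # same structure, different bolt quantity

distance = procrustes_distance(bom_a, bom_b)
# Assumption: one simple way to turn a distance into a score in (0, 1].
similarity = 1.0 / (1.0 + distance)
print(f"distance={distance:.4f} similarity={similarity:.4f}")
```

A distance of zero means one matrix can be mapped onto the other by an orthogonal transformation; larger residuals indicate greater differences in structure or quantity.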

Keywords: similarity measure, bill of materials, product family, ensemble model

Abstract: To meet the need for grouping product families in advanced manufacturing modes such as mass customization, the features in a BOM (Bill of Materials) are comprehensively analyzed, and a structure-based BOM similarity metric model, a content-based similarity metric model, and an ensemble model combining the two are proposed. In the structure-based model, BOMs are represented by adjacency matrices that encode the relationships between materials and the quantities of materials, and orthogonal Procrustes analysis is used to measure the similarity among BOMs. In the content-based model, effective text features are extracted from BOMs, transformed into vectors by TF-IDF (Term Frequency-Inverse Document Frequency), and passed to the cosine similarity formula to obtain a similarity value. To improve accuracy and performance, a weight distribution method based on the Gini coefficient is proposed for the ensemble model. Finally, a test framework is provided, and all models are evaluated experimentally for accuracy and performance.
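The content-based model described above can be sketched in plain Python. This is a minimal illustration, assuming each BOM's text has already been tokenized into a list of terms. The smoothed IDF variant, the helper names `tfidf_vectors` and `cosine`, and the sample tokens are all assumptions made for illustration, not details taken from the paper.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Map each tokenized document to a sparse {term: tf-idf weight} dict."""
    n = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({
            # Assumed weighting: relative term frequency times a smoothed IDF.
            term: (count / len(doc)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Hypothetical token lists extracted from three BOM descriptions.
bom_texts = [
    ["steel", "bolt", "m6", "steel", "frame"],
    ["steel", "bolt", "m8", "aluminum", "frame"],
    ["lcd", "panel", "power", "cable"],
]
vecs = tfidf_vectors(bom_texts)
print(cosine(vecs[0], vecs[1]), cosine(vecs[0], vecs[2]))
```

The first two BOMs share several terms and score above zero, while the third shares none with the first and scores exactly zero. How such a content score is weighted against the structural score is the job of the Gini-coefficient-based method, which is not reproduced here.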

Key words: similarity metric, BOM (Bill of Materials), product family, ensemble model

CLC Number: