电子学报 ›› 2019, Vol. 47 ›› Issue (11): 2284-2291.DOI: 10.3969/j.issn.0372-2112.2019.11.007

• 学术论文 • 上一篇    下一篇

一种基于伪标签的半监督少样本学习模型

余游, 冯林, 王格格, 徐其凤   

  1. 四川师范大学计算机科学学院, 四川成都 610101
  • 收稿日期:2019-05-06 修回日期:2019-07-01 出版日期:2019-11-25 发布日期:2019-11-25
  • 通讯作者: 冯林
  • 作者简介:余游 男,1993年出生于四川遂宁,现为四川师范大学计算机科学学院硕士研究生,主要研究方向为深度学习与数据挖掘.E-mail:454665275@qq.com;王格格 女,1995年出生于重庆市,现为四川师范大学计算机科学学院硕士研究生,主要研究方向为人工智能与数据挖掘.E-mail:347673996@qq.com;徐其凤 女,1993年出生于四川宜宾,现为四川师范大学计算机科学学院硕士研究生,主要研究方向为深度学习.E-mail:616876864@qq.com
  • 基金资助:
    国家科技支撑计划项目(No.2014BAH11F01,No.2014BAH11F02);四川师范大学2019年研究生优秀论文培育基金(No.川师研201903-36)

A Few-Shot Learning Model Based on Semi-Supervised with Pseudo Label

YU You, FENG Lin, WANG Ge-ge, XU Qi-feng   

  1. Department of Computer Science, Sichuan Normal University, Chengdu, Sichuan 610101, China
  • Received:2019-05-06 Revised:2019-07-01 Online:2019-11-25 Published:2019-11-25

摘要: 如何将带有大量标记数据的源域知识模型迁移至带有少量标记数据的目标域是少样本学习研究领域的热点问题.针对现有的少样本学习算法在源域数据与目标域数据的特征分布差异较大时存在的泛化能力较弱的问题,提出一种基于伪标签的半监督少样本学习模型FSLSS(Few-Shot Learning based on Semi-Supervised).首先,利用pytorch深度学习框架建立一个关系型深度学习网络,并使用源域数据对网络进行预训练;然后,使用此网络对目标域数据进行分类预测,将分类概率最大的类标签作为数据的伪标签;最后,利用目标域的伪标签数据和源域的真实标签数据对网络进行混合训练,并重复伪标签标记与混合训练过程.实验结果表明,相对于现有主流少样本学习算法,FSLSS模型有更好的泛化能力及知识迁移效果.

关键词: 少样本学习, 半监督学习, 伪标签, 迁移学习

Abstract: How to migrate a source domain knowledge model with a large amount of tagged data to a target domain with a small amount of tagged data is a hot issue in few-shot learning.For the problems that the existing few-shot learning algorithm have weak generalization ability when the difference between the feature distribution of the source domain data and the target domain data is large,a few-shot learning model based on semi-supervised FSLSS(Few-Shot Learning based on Semi-Supervised) is proposed.Firstly,a relational deep learning network is established by using the pytorch framework,and the network is pre-trained by the source domain data.Then,the network is used to predict the target domain data,and the label with the highest classification probability is used as the data's pseudo label.Finally,the network is hybrid trained using the pseudo label data of the target domain and the real label data of the source domain,then repeating the pseudo-labeled and hybrid trained process.The experimental results show that the FSLSS model has better generalization ability and knowledge transfer effect than the existing few-shot learning algorithms.

Key words: few-shot learning, semi-supervised learning, pseudo label, transfer learning

中图分类号: