1. 常州大学信息科学与工程学院,江苏,常州,213164
2. 浙江树人大学信息科技学院,浙江,杭州,310015
3. 常州大学信息科学与工程学院,江苏,常州,213164
4. 浙江树人大学信息科技学院,浙江,杭州,310015
网络出版:2020-08-25,
纸质出版:2020
移动端阅览
郑启航, 王章权, 刘半藤, 等. 基于加性间距胶囊网络的家庭活动识别方法研究[J]. 电子学报, 2020,48(8):1580-1586.
ZHENG Qi-hang, WANG Zhang-quan, LIU Ban-teng, et al. Research on Family Activity Recognition Method Based on Additive Margin Capsule Network[J]. Acta Electronica Sinica, 2020, 48(8): 1580-1586.
郑启航, 王章权, 刘半藤, 等. 基于加性间距胶囊网络的家庭活动识别方法研究[J]. 电子学报, 2020,48(8):1580-1586. DOI: 10.3969/j.issn.0372-2112.2020.08.017.
ZHENG Qi-hang, WANG Zhang-quan, LIU Ban-teng, et al. Research on Family Activity Recognition Method Based on Additive Margin Capsule Network[J]. Acta Electronica Sinica, 2020, 48(8): 1580-1586. DOI: 10.3969/j.issn.0372-2112.2020.08.017.
本文研究基于音频的家庭活动识别方法,提出了一种基于加性间距胶囊神经网络识别模型,针对传统胶囊神经网络目标函数仅以输出胶囊模长作为约束的弊端,本文以几何学的视角,在胶囊神经网络结构中加入Transition层,使用Transition层对胶囊单元空间关系进行变基至一维空间,再使用加性间距Softmax作为目标函数,以同类特征变化小,非同类特征差异大作为优化策略构建基于胶囊向量空间关系的目标函数以提高模型分类能力,最后对方法进行试验,采用音频事件对家庭活动进行分类识别.选择声学场景和事件检测与分类(Detection and Classification of Acoustic Scenes and Events,DCASE)2018挑战任务5作为数据集,进行分类器构建和测试,最终平均F1分数达到92.3%,优于其他主流方法.
We study the method of family activity recognition based on audio and propose a capsule neural network recognition model based on additive margin. In view of the drawbacks of the traditional capsule neural network objective function only with the output capsule mode length as the constraint
this paper adds a Transition layer to the capsule neural network structure from the perspective of geometry and uses the Transition layer to rebase the capsule unit spatial relationship to the one-dimensional. Then
using the additive margin Softmax as the objective function
the change of similar features is small
and the difference of non-similar features is used as the optimization strategy to construct the objective function based on the capsule vector space relationship to improve model classification ability. Finally
test this method by classified identified for audio events for family activities. Selecting Detection and Classification of Acoustic Scenes and Events (DCASE) 2018 Challenge Task 5 as a dataset for classifier construction and testing
with a final average F1 score of 92.3%
which is superior to other mainstream methods.
0
浏览量
38
下载量
2
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621