基于深度学习的表情动作单元识别综述

QIU

Y

, ZHAO

J Y

, WANG

Y F

.

Facial expression recognition using temporal relations among facial movements

[J]. Acta Electronica Sinica, 2016, 44(6): 1307-1313. (in Chinese)

[2]

孙晓, 潘汀.

基于兴趣区域深度神经网络的静态面部表情识别

[J]. 电子学报, 2017, 45(5): 1189-1197.

SUN

X

, PAN

T

.

Static facial expression recognition system using roi deep neural networks

[J]. Acta Electronica Sinica, 2017, 45(5): 1189-1197. (in Chinese)

[3]

张瑞, 蒋晨之, 苏剑波.

基于稀疏特征挑选和概率线性判别分析的表情识别研究

[J]. 电子学报, 2018, 46(7): 1710-1718.

ZHANG

R

, JIANG

C Z

, SU

J B

.

Expression recognition based on sparse selection and plda

[J]. Acta Electronica Sinica, 2018, 46(7): 1710-1718. (in Chinese)

[4]

孔德壮, 朱梦宇, 于江坤.

人脸表情识别在辅助医疗中的应用及方法研究

[J]. 生命科学仪器, 2019, 18(2): 43-48.

KONG

D Z

, ZHU

M Y

, YU

J K

.

Research on the application and method of facial expression recognition in assistive medical care

[J]. Life Science Instruments, 2019, 18(2): 43-48. (in Chinese)

[5]

FRANK

M G

, EKMAN

P

.

The ability to detect deceit generalizes across different types of high-stake lies

[J]. Journal of Personality and Social Psychology, 1997, 72(6): 1429-1439.

[6]

EKMAN

P

, FRIESEN

W V

. Facial Action Coding System: A Technique for the Measurement of Facial Movement[M]. Palo Alto: Consulting Psychologists Press, 1978.

[7]

EKMAN

P

, FRIESEN

W V

, HAGER

J C

. Facial Action Coding System[M]. Salt Lake City: Research Nexus, 2002.

[8]

SCHERER

K R

, EKMAN

P

. Handbook of Methods in Nonverbal Behavior Research[M]. Cambridge: Cambridge University Press, 1982.

[9]

FASEL

B

, LUETTIN

J

.

Automatic facial expression analysis: Survey

[J]. Pattern Recognition, 2003, 36(1): 259-275.

[10]

刘晓旻, 谭华春, 章毓晋.

人脸表情识别研究的新进展

[J]. 中国图象图形学报, 2006, 11(10): 1359-1368.

LIU

X M

, TAN

H C

, ZHANG

Y J

.

New research advances in facial expression recognition

[J]. Journal of Image and Graphics, 2006, 11(10): 1359-1368. (in Chinese)

[11]

KUMARI

J

, RAJESH

R

, POOJA

K M

.

Facial expression recognition: A survey

[J]. Procedia Computer Science, 2015, 58(1): 486-491.

[12]

CORNEANU

C A

, SIMÓN

M O

, COHN

J F

, et al.

Survey on rgb, 3d, thermal, and multimodal approaches for facial expression recognition: History, trends, and affect-related applications

[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(8): 1548-1568.

[13]

LI

S

, DENG

W

.

Deep facial expression recognition: A survey

[EB/OL]. (2020-03-17)[2022-05-09]. .

[14]

贲晛烨, 杨明强, 张鹏, 等.

微表情自动识别综述

[J]. 计算机辅助设计与图形学学报, 2014, 26(9): 1385-1395.

BEN

X Y

, YANG

M Q

, ZHANG

P

, et al.

Survey on automatic micro expression recognition methods

[J]. Journal of Computer-Aided Design and Computer Graphics, 2014, 26(9): 1385-1395. (in Chinese)

[15]

徐峰, 张军平.

人脸微表情识别综述

[J]. 自动化学报, 2017, 43(3): 333-348.

XU

F

, ZHANG

J P

.

Facial microexpression recognition: A survey

[J]. Acta Automatica Sinica, 2017, 43(3): 333-348. (in Chinese)

[16]

MARTINEZ

B

, VALSTAR

M F

, JIANG

B

, et al.

Automatic analysis of facial actions: A survey

[J]. IEEE Transactions on Affective Computing, 2019, 10(3): 325-347.

[17]

ZHI

R

, LIU

M

, ZHANG

D

.

A comprehensive survey on automatic facial action unit analysis

[J]. The Visual Computer, 2020, 36(5): 1067-1093.

[18]

PANTIC

M

, VALSTAR

M

, RADEMAKER

R

, et al.

Web-based database for facial expression analysis

[C]//Proceedings of the IEEE International Conference on Multimedia and Expo. Amsterdam: IEEE, 2005: 1-5.

[19]

MATSUMOTO

D

, YOO

S H

, S.Culture

NAKAGAWA

, regulation

emotion

, and adjustment [J]. Journal of Personality and Social Psychology, 2008, 94(6): 925-937.

[20]

YAN

W J

, WU

Q

, LIANG

J

, et al.

How fast are the leaked facial expressions: The duration of micro-expressions

[J]. Journal of Nonverbal Behavior, 2013, 37(4): 217-230.

[21]

SHEN

X

, WU

Q

, FU

X

.

Effects of the duration of expressions on the recognition of microexpressions

[J]. Journal of Zhejiang University-Science B, 2012, 13(3): 221-230.

[22]

DAVISON

A K

, MERGHANI

W

, YAP

M H

.

Objective classes for micro-facial expression recognition

[J]. Journal of Imaging, 2018, 4(10): 119.

[23]

GUDI

A

, TASLI

H E

, DENUYL

T M

, et al.

Deep learning based facs action unit occurrence and intensity estimation

[C]//Proceedings of the IEEE International Conference and Workshops on Automatic Face and Gesture Recognition. Ljubljana: IEEE, 2015: 1-5.

[24]

SHAO

Z

, LIU

Z

, CAI

J

, et al.

Facial action unit detection using attention and relation learning

[EB/OL]. (2019-10-23)[2022-05-09]. .

[本文引用: 7]

[25]

KANADE

T

, COHN

J F

, TIAN

Y

.

Comprehensive database for facial expression analysis

[C]//Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition. Grenoble: IEEE, 2000: 46-53.

[26]

ZHANG

X

, YIN

L

, COHN

J F

, et al.

Bp4d-spontaneous: A high-resolution spontaneous 3d dynamic facial expression database

[J]. Image and Vision Computing, 2014, 32(10): 692-706.

[27]

MAVADATI

S M

, MAHOOR

M H

, BARTLETT

K

, et al.

Disfa: A spontaneous facial action intensity database

[J]. IEEE Transactions on Affective Computing, 2013, 4(2): 151-160.

[28]

BENITEZ-QUIROZ

C F

, SRINIVASAN

R

, MARTINEZ

A M

.

Emotionet: An accurate, real-time algorithm for the automatic annotation of a million facial expressions in the wild

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 5562-5570.

[29]

KOLLIAS

D

, ZAFEIRIOU

S.

, affect

Expression

,

action unit recognition: Aff-wild2, multi-task learning and arcface

[C]//Proceedings of the British Machine Vision Conference. Cardiff: BMVA Press, 2019: 297.

[30]

LUCEY

P

, COHN

J F

, KANADE

T

, et al.

The extended cohn-kanade dataset (ck+): A complete dataset for action unit and emotion-specified expression

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. San Francisco: IEEE, 2010: 94-101.

[31]

VALSTAR

M

, PANTIC

M

.

Induced disgust, happiness and surprise: An addition to the mmi facial expression database

[C]//Proceedings of the International Conference on Language Resources and Evaluation Workshops. Valletta: ELRA2010: 65-70.

[32]

SAVRAN

A

, ALYÜZ

N

, DIBEKLIOĞLU

H

, et al.

Bosphorus database for 3D face analysis

[C]//Proceedings of the European Workshop on Biometrics and Identity Management. Roskilde: Springer, 2008: 47-56.

[33]

STRATOU

G

, GHOSH

A

, DEBEVEC

P

, et al.

Effect of illumination on automatic expression recognition: A novel 3D relightable facial database

[C]//Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition. Santa Barbara: IEEE, 2011: 611-618.

[34]

COSKER

D

, KRUMHUBER

E

, HILTON

A

.

A facs valid 3D dynamic action unit database with applications to 3D dynamic morphable facial modeling

[C]//Proceedings of the IEEE International Conference on Computer Vision. Barcelona: IEEE, 2011: 2296-2303.

[35]

ZHANG

Z

, GIRARD

J M

, WU

Y

, et al.

Multimodal spontaneous emotion corpus for human behavior analysis

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 3438-3446.

[36]

GIRARD

J M

, CHU

W S

, JENI

L A

,et al.

Sayette group formation task (gft) spontaneous facial expression database

[C]//Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition. Washington: IEEE, 2017: 581-588.

[37]

YAN

W J

, LI

X

, WANG

S J

, ZHAO

G

, et al.

Casme ii: An improved spontaneous micro-expression database and the baseline evaluation

[J]. PloS One, 2014, 9(1): e86041.

[38]

DAVISON

A K

, LANSLEY

C

, COSTEN

N

, et al.

Samm: A spontaneous micro-facial movement dataset

[J]. IEEE Transactions on Affective Computing, 2018, 9(1): 116-129.

[39]

BEN X, REN

Y

, ZHANG

J

, et al.

Video-based facial micro-expression analysis: A survey of datasets, features and algorithms

[EB/OL]. (2021-03-19)[2022-05-09]. .

[40]

ZHOU

Y

, PI

J

, SHI

B E

.

Pose-independent facial action unit intensity regression based on multi-task deep transfer learning

[C]/Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition. Washington: IEEE, 2017: 872-877.

[41]

RUSSAKOVSKY

O

, DENG

J

, SU

H

, et al.

Imagenet large scale visual recognition challenge

[J]. International Journal of Computer Vision, 2015, 115(3): 211-252.

[42]

SIMONYAN

K

, ZISSERMAN

A

.

Very deep convolutional networks for large-scale image recognition

[C]//Proceedings of the International Conference on Learning Representations. San Diego: OpenReview, 2015: 1-14.

[43]

JI

S

, WANG

K

, PENG

X

, et al.

Multiple transfer learning and multi-label balanced training strategies for facial AU detection in the wild

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. Seattle: IEEE, 2020: 1657-1661.

[44]

HE

K

, ZHANG

X

, REN

S

, et al.

Deep residual learning for image recognition

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 770-778.

[45]

WERNER

P

, SAXEN

F

, AL-HAMADI

A

.

Facial action unit recognition in the wild with multi-task cnn self-training for the emotionet challenge

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. Seattle: IEEE, 2020: 410-411.

[46]

PENG

G

, WANG

S

.

Weakly supervised facial action unit recognition through adversarial training

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 2188-2196.

[47]

PENG

G

, WANG

S

.

Dual semi-supervised learning for facial action unit recognition

[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Honolulu: AAAI Press, 2019: 8827-8834.

[48]

GOODFELLOW

I

, POUGET-ABADIE

J

, MIRZA

M

, et al.

Generative adversarial nets

[C]//Proceedings of the Advances in Neural Information Processing Systems. Montreal: MIT Press, 2014: 2672-2680.

[49]

ZHANG

Y

, DONG

W

, HU

B G

, et al.

Classifier learning with prior probabilities for facial action unit recognition

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 5108-5116.

[50]

WU

S

, WANG

S

, PAN

B

, et al.

Deep facial action unit recognition from partially labeled data

[C]//Proceedings of the IEEE International Conference on Computer Vision. Venice: IEEE, 2017: 3951-3959.

[51]

ZHANG

Y

, DONG

W

, HU

B G

, et al.

Weakly-supervised deep convolutional neural network learning for facial action unit intensity estimation

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 2314-2323.

[52]

SHAO

Z

, LIU

Z

, CAI

J

, et al.

Jâa-net: Joint facial action unit detection and face alignment via adaptive attention

[J]. International Journal of Computer Vision, 2021, 129(2): 321-340.

[本文引用: 7]

[53]

JYOTI

S

, SHARMA

G

, DHALL

A

.

Expression empowered residen network for facial action unit detection

[C]//Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition. Lille: IEEE, 2019: 1-8.

[54]

TU

C H

, YANG

C Y

, HSU

J Y

.

Idennet: Identity-aware facial action unit detection

[C]//Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition. Lille: IEEE, 2019: 1-8.

[55]

LIU

Z

, SONG

G

, CAI

J

, et al.

Conditional adversarial synthesis of 3D facial action units

[J]. Neurocomputing, 2019, 355: 200-208.

[56]

MIRZA

M

, OSINDERO

S

.

Conditional generative adversarial nets

[EB/OL]. (2014-11-06)[2021-05-18]. .

[57]

BLANZ

V

, VETTER

T

.

A morphable model for the synthesis of 3D faces

[C]//Proceedings of the Annual Conference on Computer Graphics and Interactive Techniques of SIGGRAPH. Los Angeles: ACM, 1999: 187-194.

[58]

WANG

C

, WANG

S

.

Personalized multiple facial action unit recognition through generative adversarial recognition network

[C]//Proceedings of the ACM International Conference on Multimedia. Seoul: ACM, 2018: 302-310.

[59]

WILES

O

, KOEPKE

A S

, ZISSERMAN

A

.

Self-supervised learning of a facial attribute embedding from video

[C]//Proceedings of the British Machine Vision Conference. Newcastle: BMVA Press, 2018: 302.

[60]

LI

Y

, ZENG

J

, SHAN

S

.

Learning representations for facial actions from unlabeled videos

[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(1): 302-317.

[61]

LI

W

, ABTAHI

F

, ZHU

Z

, et al.

Eac-net: Deep nets with enhancing and cropping for facial action unit detection

[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(11): 2583-2596.

[62]

JAISWAL

S

, VALSTAR

M

.

Deep learning the dynamic appearance and shape of facial action units

[C]//Proceedings of the IEEE Winter Conference on Applications of Computer Vision. Lake Placid: IEEE, 2016: 1-8.

[63]

ALI

A M

, ALKABBANY

I

, FARAG

A

, et al.

Facial action units detection under pose variations using deep regions learning

[C]//Proceedings of the International Conference on Affective Computing and Intelligent Interaction. San Antonio: IEEE, 2017: 395-400.

[64]

MA

C

, CHEN

L

, YONG

J

.

AU R-CNN: Encoding expert prior knowledge into R-CNN for action unit detection

[J]. Neurocomputing, 2019, 355: 35-47.

[65]

LI

W

, ABTAHI

F

, ZHU

Z

.

Action unit detection with region adaptation, multi-labeling learning and optimal temporal fusing

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 6766-6775.

[66]

SANCHEZ

E

, TZIMIROPOULOS

G

, VALSTAR

M

.

Joint action unit localisation and intensity estimation through heatmap regression

[C]//Proceedings of the British Machine Vision Conference. Newcastle: BMVA Press, 2018: 233.

[67]

LIU

M

, LI

S

, SHAN

S

, et al.

AU-aware deep Networks for facial expression recognition

[C]//Proceedings of the IEEE International Conference and Workshops on Automatic Face and Gesture Recognition. Shanghai: IEEE, 2013: 1-6.

[68]

ZHAO

K

, CHU

W S

, ZHANG

H

.

Deep region and multi-label learning for facial action unit detection

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016: 3391-3399.

[69]

HAN

S

, MENG

Z

, LI

Z

, et al.

Optimizing filter size in convolutional neural networks for facial action unit recognition

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 5070-5078.

[70]

ERTUGRUL

I O

, JENI

L A

, COHN

J F

.

Pattnet: Patch-attentive deep network for action unit detection

[C]//Proceedings of the British Machine Vision Conference. Cardiff: BMVA Press, 2019: 114.1-114.13.

[71]

ERTUGRUL

I O

, YANG

L

, JENI

L A

, et al.

D-pattnet: Dynamic patch-attentive deep network for action unit detection

[J]. Frontiers in Computer Science, 2019, 1(11): 1-13.

[72]

NIU

X

, HAN

H

, YANG

S

, et al.

Local relationship learning with person-specific shape regularization for facial action unit detection

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019: 11917-11926.

[73]

FAN

Y

, LIN

Z

.

G2rl: Geometry-guided representation learning for facial action unit intensity estimation

[C]//Proceedings of the International Joint Conference on Artificial Intelligence. Virtual Conference: IJCAI, 2020: 731-737.

[74]

TRAN

D L

, WALECKI

R

, RUDOVIC

O

, et al.

Deepcoder: Semi-parametric variational autoencoders for automatic facial action coding

[C]//Proceedings of the IEEE International Conference on Computer Vision. Venice: IEEE, 2017: 3190-3199.

[75]

BENITEZ-QUIROZ

C F

, WANG

Y

, MARTINEZ

A M

.

Recognition of action units in the wild with deep nets and a new global-local Loss

[C]//Proceedings of the IEEE International Conference on Computer Vision. Venice: IEEE, 2017: 3970-3979.

[76]

WALECKI

R

, RUDOVIC

O

, PAVLOVIC

V

, et al.

Deep structured learning for facial action unit intensity estimation

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 3405-3414.

[77]

CORNEANU

C A

, MADADI

M

, ESCALERA

S

.

Deep structure inference network for facial action unit recognition

[C]//Proceedings of the European Conference on Computer Vision. Munich: Springer, 2018: 309-324.

[78]

JACOB

G M

, STENGER

B

.

Facial action unit detection with transformers

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Virtual Conference: IEEE, 2021: 7680-7689.

[79]

LI

G

, ZHU

X

, ZENG

Y

, et al.

Semantic relationships guided representation learning for facial action unit recognition

[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Honolulu: AAAI, 2019: 8594-8601.

[80]

LI

Y

, TARLOW

D

, BROCKSCHMIDT

M

, et al.

Gated graph sequence neural networks

[C]//Proceedings of the International Conference on Learning Representations. San Juan: OpenReview, 2016: 1-16.

[81]

LIU

Z

, DONG

J

, ZHANG

C

, et al.

Relation modeling with graph convolutional networks for facial action unit detection

[C]//Proceedings of the International Conference on Multimedia Modeling. Daejeon: Springer, 2020: 489-501.

[82]

NIU

X

, HAN

H

, SHAN

S

, et al.

Multi-label co-regularization for semi-supervised facial action unit recognition

[C]//Proceedings of the Advances in Neural Information Processing Systems. Vancouver: Curran Associates2019: 909-919.

[83]

FAN

Y

, LAM

J C K

, LI

V O K

.

Facial action unit intensity estimation via semantic correspondence learning with dynamic graph convolution

[C]//Proceedings of the AAAI Conference on Artificial Intelligence. New York: AAAI, 2020: 12701-12708.

[84]

SONG

T

, CHEN

L

, ZHENG

W

, et al.

Uncertain graph neural networks for facial action unit detection

[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Virtual Conference: AAAI, 2021: 5993-6001.

[85]

SONG

T

, CUI

Z

, ZHENG

W

, et al.

Hybrid message passing with performance-driven structures for facial action unit detection

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Virtual Conference: IEEE, 2021: 6267-6276.

[86]

CHU

W S

, DE LA TORRE

F

, COHN

J F

.

Learning spatial and temporal cues for multi-label facial action unit detection

[C]//Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition. Washington: IEEE, 2017: 25-32.

[87]

BISHAY

M

, PATRAS

I

.

Fusing multilabel deep networks for facial action unit detection

[C]//Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition. Washington: IEEE, 2017: 681-688.

[88]

HE

J

, LI

D

, YANG

B

, et al.

Multi view facial action unit detection based on CNN and BLSTM-RNN

[C]//Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition. Washington: IEEE, 2017: 848-853.

[89]

SONG

T

, CUI

Z

, WANG

Y

, et al.

Dynamic probabilistic graph convolution for facial action unit intensity estimation

[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Virtual Conference: IEEE, 2021: 4845-4854.

[90]

YANG

L

, ERTUGRUL

I O

, COHN

J F

, et al.

FACS3D-NET: 3D convolution based spatiotemporal representation for action unit detection

[C]//Proceedings of the International Conference on Affective Computing and Intelligent Interaction. Cambridge: IEEE, 2019: 538-544.

[91]

YANG

H

, YIN

L

.

Learning temporal information from a single image for au detection

[C]//Proceedings of the IEEE International Conference on Automatic Face & Gesture Recognition. Lille: IEEE, 2019: 1-8.

[92]

ZHANG

Y

, JIANG

H

, WU

B

, et al.

Context-aware feature and label fusion for facial action unit intensity estimation with partially labeled data

[C]//Proceedings of the IEEE International Conference on Computer Vision. Seoul: IEEE, 2019: 733-742.

[93]

SHROUT

P E

, FLEISS

J L

.

Intraclass correlations: Uses in assessing rater reliability

[J]. Psychological Bulletin, 1979, 86(2): 420-428.

[94]

LIN

L

, WANG

K

, MENG

D

, et al.

Active self-paced learning for cost-effective and progressive face identification

[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 40(1): 7-19.

[95]

胡小娟, 刘磊, 邱宁佳.

基于主动学习和否定选择的垃圾邮件分类算法

[J]. 电子学报, 2018, 46(1): 203-209.

HU

X J

, LIU

L

, QIU

N J

.

A novel spam categorization algorithm based on active learning method and negative selection algorithm

[J]. Acta Electronica Sinica, 2018, 46(1): 203-209. (in Chinese)

[96]

姚拓中, 安鹏, 宋加涛.

基于历史分类加权和分级竞争采样的多视角主动学习

[J]. 电子学报, 2017, 45(1): 46-53.

YAO

T Z

, AN

P

, SONG

J T

.

Multi-view active learning based on weighted hypothesis boosting and hierarchical competition sampling

[J]. Acta Electronica Sinica, 2017, 45(1): 46-53. (in Chinese)