Classifying Facial Actions
Gianluca Donato, Marian Stewart Bartlett, Joseph C. Hager, Paul Ekman, and
Terrence J. Sejnowski
IEEE Transactions on Pattern Analysis and Machine Intelligence
21(10) p. 974-989, 1999.
Abstract
The Facial Action Coding System (FACS) (Ekman & Friesen, 1978) is an
objective method for quantifying facial movement in terms of component
actions. This system is widely used in behavioral investigations of
emotion, cognitive processes, and social interaction. The coding is
presently performed by highly trained human experts. This paper explores
and compares techniques for automatically recognizing facial actions in
sequences of images. These techniques include analysis of facial motion
through estimation of optical flow; holistic spatial analysis such as
principal component analysis, independent component analysis, local
feature analysis, and linear discriminant analysis; and methods based on
the outputs of local filters, such as Gabor wavelet representations, and
local principal components. Performance of these systems is compared to
naive and expert human subjects. Best performances were obtained using
the Gabor wavelet representation and the independent component
representation, both of which achieved 96% accuracy for classifying
twelve facial actions of the upper and lower face. The results provide
converging evidence for the importance of using local filters, high
spatial frequencies, and statistical independence for classifying facial
actions.