Home  |  Browse  |  Advanced Search  |  Help
Login | Create Account

Kroos, Christian (2004): A system for video-based analysis of face motion during speech. Dissertation, LMU München: Faculty for Languages and Literatures

[img]
Preview
PDF - Klicken Sie auf das PDF-Icon, um die Dissertation im Volltext herunterzuladen.
4Mb

Abstract

During face-to-face interaction, facial motion conveys information at various levels. These include a person's emotional condition, position in a discourse, and, while speaking, phonetic details about the speech sounds being produced. Trivially, the measurement of face motion is a prerequisite for any further analysis of its functional characteristics or information content. It is possible to make precise measures of locations on the face using systems that track the motion by means of active or passive markers placed directly on the face. Such systems, however, have the disadvantages of requiring specialised equipment, thus restricting the use outside the lab, and being invasive in the sense that the markers have to be attached to the subject's face. To overcome these limitations we developed a video-based system to measure face motion from standard video recordings by deforming the surface of an ellipsoidal mesh fit to the face. The mesh is initialised manually for a reference frame and then projected onto subsequent video frames. Location changes (between successive frames) for each mesh node are determined adaptively within a well-defined area around each mesh node, using a two-dimensional cross-correlation analysis on a two-dimensional wavelet transform of the frames. Position parameters are propagated in three steps from a coarser mesh and a correspondingly higher scale of the wavelet transform to the final fine mesh and lower scale of the wavelet transform. The sequential changes in position of the mesh nodes represent the facial motion. The method takes advantage of inherent constraints of the facial surfaces which distinguishes it from more general image motion estimation methods and it returns measurement points globally distributed over the facial surface contrary to feature-based methods.

Item Type:Thesis (Dissertation, LMU Munich)
Keywords:face, motion tracking, auditory-visual speech, wavelets, video
Dewey Decimal Classification:400 Language
400 Language > 410 Linguistics
Faculties:Faculty for Languages and Literatures
Language:English
Date Accepted:16. February 2004
Persistent Identifier (URN):urn:nbn:de:bvb:19-21453
MD5 Checksum of the PDF-file:fed51a13feba05b4c1d2c498555026c2
Signature of the printed copy:0001/UMC 13698
ID Code:2145
Deposited By:Christian Kroos
Deposited On:03. Jun 2004
Last Modified:22. Oct 2008 15:01

Repository Staff Only: item control page

Digitale Hochschulschriften is powered by EPrints 3 which is developed by the School of Electronics and Computer Science at the University of Southampton. More information and software creditsAbout