Text Driven Face-Video Synthesis Using GMM and Spatial Correlation
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS).
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS).
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS). ORCID iD: 0000-0002-4929-1262
2007 (English). In: Image analysis: 15th Scandinavian Conference, SCIA 2007, Aalborg, Denmark, June 10-14, 2007; proceedings / [ed] Ersboll, B K, Pedersen, K S, Berlin: Springer Berlin/Heidelberg, 2007, p. 572-580. Conference paper, Published paper (Refereed)
Abstract [en]

Liveness detection is increasingly planned for incorporation into biometric systems to reduce the risk of spoofing and impersonation. The techniques used include detection of head motion while posing or speaking, iris size under varying illumination, fingerprint sweat, text-prompted speech, and speech-to-lip motion synchronization. In this paper, we propose to build a biometric signal to test the attack resilience of biometric systems by creating a text-driven video synthesis of faces. We synthesize new, realistic-looking video sequences from real image sequences representing utterances of digits. We determine the image sequence for each digit by using a GMM-based speech recognizer. Then, depending on the system prompt (a sequence of digits), our method regenerates a video signal to test the attack resilience of a biometric system that asks for random digit utterances to prevent playback of pre-recorded data representing both audio and images. The discontinuities in the new image sequence, created where consecutive digits are joined, are removed by a frame prediction algorithm that makes use of the well-known block matching algorithm. Other uses of our results include web-based video communication for electronic commerce and frame interpolation for low-frame-rate video.
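The abstract does not describe the frame prediction algorithm itself, so the following is only a minimal sketch of the basic block matching step it builds on, assuming an exhaustive search with a sum-of-absolute-differences cost. The names `sad`, `get_block`, `match_block` and `make_frame`, the block size, and the search radius are all hypothetical choices for illustration, not taken from the paper.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(abs(a - b)
               for row_a, row_b in zip(block_a, block_b)
               for a, b in zip(row_a, row_b))


def get_block(frame, top, left, size):
    """Cut a size x size block out of a frame stored as a list of rows."""
    return [row[left:left + size] for row in frame[top:top + size]]


def match_block(prev, curr, top, left, size=2, radius=2):
    """Find where the block at (top, left) in `prev` moved to in `curr`.

    Tries every displacement within `radius` pixels and returns
    ((dy, dx), cost) for the displacement with the lowest SAD cost.
    """
    height, width = len(curr), len(curr[0])
    ref = get_block(prev, top, left, size)
    best = (0, 0)
    best_cost = sad(ref, get_block(curr, top, left, size))
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = top + dy, left + dx
            # Skip candidate blocks that fall outside the frame.
            if y < 0 or x < 0 or y + size > height or x + size > width:
                continue
            cost = sad(ref, get_block(curr, y, x, size))
            if cost < best_cost:
                best_cost, best = cost, (dy, dx)
    return best, best_cost


def make_frame(top, left, size=6):
    """Toy frame (hypothetical test data): a 2x2 patch on a black background."""
    frame = [[0] * size for _ in range(size)]
    patch = [[9, 8], [7, 6]]
    for i in range(2):
        for j in range(2):
            frame[top + i][left + j] = patch[i][j]
    return frame


# The patch moves one row down and two columns right between frames.
motion, cost = match_block(make_frame(1, 1), make_frame(2, 3), top=1, left=1)
print(motion, cost)  # (1, 2) 0
```

A motion vector found this way could, for example, be halved to place the block in an intermediate frame; whether the paper does this is not stated in the abstract.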

Place, publisher, year, edition, pages
Berlin: Springer Berlin/Heidelberg, 2007. p. 572-580
Series
Lecture Notes in Computer Science, ISSN 0302-9743 ; 4522
Keywords [en]
Image analysis
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:hh:diva-2130; ISI: 000247364000058; Scopus ID: 2-s2.0-38049080023; Local ID: 2082/2525; ISBN: 978-3-540-73039-2 (print); OAI: oai:DiVA.org:hh-2130; DiVA, id: diva2:239348
Conference
15th Scandinavian Conference on Image Analysis, Aalborg, Denmark, June 10-14, 2007
Available from: 2008-11-12 Created: 2008-11-12 Last updated: 2018-03-23. Bibliographically approved
In thesis
1. Audio-video synthesis methods for improving performance of biometric systems
2007 (English). Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

System security is important for any automation. It is even more so for biometric systems because of the sensitive nature of the data they use for enrollment and authentication: the subject's physical or biological traits. The performance quantification of biometric systems, such as face tracking and recognition, depends heavily on the database used for testing. Systems trained and tested on realistic and representative databases evidently perform better. In fact, the main reason for evaluating any system on test data is that these data sets represent problems the system might face in the real world. However, building biometric databases that represent the real world is expensive because of the high demands placed on the participants. This becomes even more difficult and unrealistic if the data is to be collected in a natural environment such as supermarkets, offices, streets, etc.

This thesis presents a procedure to build a synthetic biometric database by damascening images from a studio-recorded database with realistic scenery. To this end, we developed an image segmentation procedure to separate the background of a video recorded in studio conditions, with the purpose of replacing it with an arbitrary complex background. Furthermore, we present how several degradations such as affine transformation, imaging noise, and motion blur can be incorporated into the production of the new database to simulate natural recording environments. The system is applied to the entire XM2VTS database, which already consists of several terabytes of data, to produce the DXM2VTS - Damascened XM2VTS database.
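As a rough illustration of two of the degradations named above, the sketch below applies a horizontal motion blur and additive Gaussian noise to a grayscale image stored as a list of rows. It is not the thesis's implementation: the function names, the box-shaped blur kernel, the border handling, and the parameter defaults are all assumptions made for this example, and the affine transformation is left out.

```python
import random


def motion_blur_horizontal(image, length=3):
    """Simulate horizontal motion blur with a centred box kernel.

    Assumes an odd `length`; pixels beyond the border are replaced by
    the nearest border pixel.
    """
    half = length // 2
    blurred = []
    for row in image:
        width = len(row)
        new_row = []
        for x in range(width):
            window = [row[min(max(x + k, 0), width - 1)]
                      for k in range(-half, half + 1)]
            new_row.append(sum(window) / len(window))
        blurred.append(new_row)
    return blurred


def add_gaussian_noise(image, sigma=5.0, seed=0):
    """Add zero-mean Gaussian noise and clip the result to [0, 255]."""
    rng = random.Random(seed)  # fixed seed so the example is repeatable
    return [[min(255.0, max(0.0, pixel + rng.gauss(0.0, sigma)))
             for pixel in row]
            for row in image]


# A single bright pixel is smeared across its horizontal neighbours.
print(motion_blur_horizontal([[0, 0, 9, 0, 0]]))  # [[0.0, 3.0, 3.0, 3.0, 0.0]]

# Blur first, then add sensor-like noise on top.
degraded = add_gaussian_noise(motion_blur_horizontal([[0, 0, 90, 0, 0]]))
```

Applying the blur before the noise mirrors the order in which a camera would introduce them, although the thesis abstract does not say in which order the degradations are combined.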

Moreover, the thesis presents a method to segment a video sequence in the time domain based on its audio content. The video is then reshuffled and used for testing the resilience of text-prompted biometric systems against playback attacks. The playback is supported by a pyramid-based frame interpolation method to reduce the discontinuities created at the digit boundaries in time.
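The reshuffling step can be pictured with a small sketch. It assumes the audio-based segmentation has already produced, for each digit, a start and end frame index; the function name `reshuffle`, the segment table, and the prompt are hypothetical. The sketch only reorders frame indices and reports where digit boundaries fall; it does not implement the pyramid-based interpolation.

```python
def reshuffle(segments, prompt):
    """Reorder per-digit video segments to follow a prompted digit string.

    segments: dict mapping a digit to (start, end) frame indices, with
              `end` exclusive (assumed output of the audio segmentation).
    prompt:   string of digits requested by the system.

    Returns (frames, boundaries): the source frame indices in playback
    order, and the positions in that list where a new digit begins,
    which is where discontinuities would need to be smoothed.
    """
    frames = []
    boundaries = []
    for digit in prompt:
        start, end = segments[digit]
        if frames:
            boundaries.append(len(frames))
        frames.extend(range(start, end))
    return frames, boundaries


# Hypothetical segmentation of a recording of "0 1 2".
segments = {"0": (0, 3), "1": (3, 5), "2": (5, 8)}
frames, boundaries = reshuffle(segments, "201")
print(frames)      # [5, 6, 7, 0, 1, 2, 3, 4]
print(boundaries)  # [3, 6]
```

Each boundary position marks a jump between frames that were not adjacent in the original recording, so these are the places an interpolation method would target.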

Place, publisher, year, edition, pages
Gothenburg: Department of Signals and Systems, Chalmers University of Technology, 2007. p. 31
Series
Technical report R, ISSN 1403-266X ; 2007:6
Keywords
Biometrics, Audio-Video synthesis, Image segmentation, XM2VTS, DXM2VTS
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-1976 (URN); 2082/2371 (Local ID); 2082/2371 (Archive number); 2082/2371 (OAI)
Presentation
2007-04-24, R1107, Halmstad, 13:15
Supervisors
Available from: 2008-09-29 Created: 2008-09-29 Last updated: 2018-03-23. Bibliographically approved

Authors
Teferi, Dereje; Faraj, Maycel Isaac; Bigun, Josef
