hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Lip-motion events analysis and lip segmentation using optical flow
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS).
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS).ORCID iD: 0000-0002-4929-1262
2012 (English)Conference paper, Published paper (Refereed)
Abstract [en]

We propose an algorithm for detecting the mouth events of opening and closing. Our method is translation and ro- tation invariant, works at very fast speeds, and does not re- quire segmented lips. The approach is based on a recently developed optical flow algorithm that handles the motion of linear structure in a stable and consistent way.Furthermore, we provide a semi-automatic tool for gen- erating groundtruth segmentation of video data, also based on the optical flow algorithm used for tracking keypoints at faster than 200 frames/second. We provide groundtruth for 50 sessions of speech of the XM2VTS database [16] avail- able for download, and the means to segment further ses- sions at a relatively small amount of user interaction.We use the generated groundtruth to test the proposed al- gorithm for detecting events, and show it to yield promising result. The semi-automatic tool will be a useful resource for researchers in need of groundtruth segmentation from video for the XM2VTS database and others.

Place, publisher, year, edition, pages
Piscataway, N.J.: IEEE Press, 2012. p. 138-145, article id 6239228
Series
IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshops. Proceedings, ISSN 2160-7508
Keywords [en]
Keypoints, Linear structures, Lip segmentation, Optical flow algorithm, Rotation invariant, Semi-automatic tools, User interaction, Video data
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:hh:diva-19645DOI: 10.1109/CVPRW.2012.6239228Scopus ID: 2-s2.0-84864974582ISBN: 978-1-4673-1612-5 (print)OAI: oai:DiVA.org:hh-19645DiVA, id: diva2:552889
Conference
2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2012, 16-21 June, 2012, Rhode Island, USA
Funder
Swedish Research Council
Note

©2012 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Available from: 2012-10-04 Created: 2012-09-17 Last updated: 2018-03-22Bibliographically approved

Open Access in DiVA

fulltext(1775 kB)593 downloads
File information
File name FULLTEXT02.pdfFile size 1775 kBChecksum SHA-512
1d95144ceaa156e1cf9aaf567824634125461b864d777d9c019257ba878704200d919c0ac460bd8544cb114133be41b5171b8646e6c040cff8ddfc74a27a014b
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records BETA

Karlsson, Stefan M.Bigun, Josef

Search in DiVA

By author/editor
Karlsson, Stefan M.Bigun, Josef
By organisation
Halmstad Embedded and Intelligent Systems Research (EIS)
Signal Processing

Search outside of DiVA

GoogleGoogle Scholar
Total: 593 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 498 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf