The Potential of Visual Features: to Improve Voice Recognition Systems in Vehicles Noisy Environment
2014 (English)
Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Abstract [en]
Multimodal biometric systems have been a subject of study in recent decades. Their unique characteristics of anti-spoofing and liveness detection, plus their ability to deal with audio noise, have made them candidate technologies for improving current systems such as voice recognition, verification and identification systems.
In this work we studied the feasibility of incorporating an audio-visual voice recognition system to deal with audio noise in the truck cab environment. Speech recognition systems suffer from excessive noise from the engine, road traffic and the car stereo system. To deal with this noise, different techniques, including active and passive noise cancelling, have been studied.
Our results showed that although audio-only systems perform better in a noise-free environment, their performance drops significantly as the level of noise in truck cabins increases, whereas the performance of visual features is not affected by that noise. The final fused system, comprising both visual and audio cues, proved to be superior to both audio-only and video-only systems.
Place, publisher, year, edition, pages
2014, p. 49
Keywords [en]
voice recognition, lip motion, optical flow
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-27273
Local ID: IDE1310
OAI: oai:DiVA.org:hh-27273
DiVA, id: diva2:771771
Subject / course
Computer science and engineering
Supervisors
Examiners
2014-12-15, 2014-12-15, 2025-10-01. Bibliographically approved