hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Understanding and Improving CNNs with Complex Structure Tensor: A Biometrics Study
Halmstad University, School of Information Technology.ORCID iD: 0000-0002-9696-7843
Halmstad University, School of Information Technology.ORCID iD: 0000-0002-4929-1262
Halmstad University, School of Information Technology.ORCID iD: 0000-0002-1400-346X
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Our study provides evidence that CNNs struggle to effectively extract orientation features. We show that the use of Complex Structure Tensor, which contains compact orientation features with certainties, as input to CNNs consistently improves identification accuracy compared to using grayscale inputs alone. Experiments also demonstrated that our inputs, which were provided by mini complex conv-nets, combined with reduced CNN sizes, outperformed full-fledged, prevailing CNN architectures. This suggests that the upfront use of orientation features in CNNs, a strategy seen in mammalian vision, not only mitigates their limitations but also enhances their explainability and relevance to thin-clients. Experiments were done on publicly available data sets comprising periocular images for biometric identification and verification (Close and Open World) using 6 State of the Art CNN architectures. We reduced SOA Equal Error Rate (EER) on the PolyU dataset by 5-26 % depending on data and scenario.

National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:hh:diva-53249DOI: 10.48550/arXiv.2404.15608OAI: oai:DiVA.org:hh-53249DiVA, id: diva2:1853554
Funder
Vinnova, 2022-00919Swedish Research Council, 2016-03497Swedish Research Council, 2021-05110
Note

Som manuscript i avhandling/As manuscript in thesis

Available from: 2024-04-22 Created: 2024-04-22 Last updated: 2025-10-01Bibliographically approved
In thesis
1. Ocular Recognition in Unconstrained Sensing Environments
Open this publication in new window or tab >>Ocular Recognition in Unconstrained Sensing Environments
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

This thesis focuses on the problem of increasing flexibility in the acquisition and application of biometric recognition systems based on the ocular region. While the ocular area is one of the oldest and most widely studied biometric regions thanks to its rich and discriminative elements and characteristics, most modalities such as retina, iris, eye movements, or oculomotor plant have limitations regarding data acquisition. Some require a specific type of illumination like the iris, a limited distance range like eye movements, or specific sensors and user collaboration like the retina. In this context, this thesis focuses on the periocular region, which stands out as the ocular modality with the fewest acquisition constraints. 

The first part focuses on using middle-layers' deep representation of pre-trained CNNs as a one-shot learning method, along with simple distance-based metrics and similarity scores for periocular recognition. This approach tackles the issue of limited data availability and collection for biometric recognition systems by eliminating the need to train the models for the target data. Furthermore, it allows seamless transitions between identification and verification scenarios with a single model, and tackles the problem of the open-world setting and training bias of CNNs. We demonstrate that off-the-shelf features from middle-layers can outperform CNNs trained for the target domain that followed a more extensive training strategy when target data is limited.

The second part of the thesis analyzes traditional methods for biometric systems in the context of periocular recognition. Nowadays, these methods are often overlooked in favor of deep learning solutions. However, we show that they can still outperform heavily trained CNNs in closed-world and open-world settings and can be used in conjunction with CNNs to further improve recognition performance. Moreover, we investigate the use of the complex structure tensor as a handcrafted texture extractor at the input of CNNs. We show that CNNs can benefit from this explicit textural information in terms of performance and convergence, offering the potential for network compression and explainability of the features used. We demonstrate that CNNs may not easily access the orientation information present in the images that are exploited in some more traditional approaches.

The final part of the thesis addresses the analysis of periocular recognition under different light spectra and the cross-spectral scenario. More specifically, we analyze the performance of the proposed methods under different light spectra. We also investigate the cross-spectral scenario for one-shot learning with middle-layers' deep representations and explore the possibility of bridging the domain gap in the cross-spectral scenario by training generative networks. This allows using simpler models and algorithms trained on a single spectrum.

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2024. p. 49
Series
Halmstad University Dissertations ; 114
Keywords
Biometrics, Computer Vision, Pattern Recognition, Periocular Recognition
National Category
Signal Processing Computer graphics and computer vision
Identifiers
urn:nbn:se:hh:diva-53257 (URN)978-91-89587-43-4 (ISBN)978-91-89587-42-7 (ISBN)
Public defence
2024-05-28, S3030, Kristian IV:s väg 3, 08:00 (English)
Opponent
Supervisors
Available from: 2024-04-24 Created: 2024-04-24 Last updated: 2025-10-01Bibliographically approved

Open Access in DiVA

fulltext(10031 kB)114 downloads
File information
File name FULLTEXT01.pdfFile size 10031 kBChecksum SHA-512
47d9f558220186f66b3a935dacc0b7e4e0d93b267c85b79e23846eaa63b5c6c1d5336853d30dbedf83a5ae9da0fda6a7eedf9a5316462317a8b231be83f66fd8
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Hernandez-Diaz, KevinBigun, JosefAlonso-Fernandez, Fernando

Search in DiVA

By author/editor
Hernandez-Diaz, KevinBigun, JosefAlonso-Fernandez, Fernando
By organisation
School of Information Technology
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar
Total: 117 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 416 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf