hh.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Understanding and Improving CNNs with Complex Structure Tensor: A Biometrics Study
Högskolan i Halmstad, Akademin för informationsteknologi.ORCID-id: 0000-0002-9696-7843
Högskolan i Halmstad, Akademin för informationsteknologi.ORCID-id: 0000-0002-4929-1262
Högskolan i Halmstad, Akademin för informationsteknologi.ORCID-id: 0000-0002-1400-346X
(Engelska)Manuskript (preprint) (Övrigt vetenskapligt)
Abstract [en]

Our study provides evidence that CNNs struggle to effectively extract orientation features. We show that the use of Complex Structure Tensor, which contains compact orientation features with certainties, as input to CNNs consistently improves identification accuracy compared to using grayscale inputs alone. Experiments also demonstrated that our inputs, which were provided by mini complex conv-nets, combined with reduced CNN sizes, outperformed full-fledged, prevailing CNN architectures. This suggests that the upfront use of orientation features in CNNs, a strategy seen in mammalian vision, not only mitigates their limitations but also enhances their explainability and relevance to thin-clients. Experiments were done on publicly available data sets comprising periocular images for biometric identification and verification (Close and Open World) using 6 State of the Art CNN architectures. We reduced SOA Equal Error Rate (EER) on the PolyU dataset by 5-26 % depending on data and scenario.

Nationell ämneskategori
Datorgrafik och datorseende
Identifikatorer
URN: urn:nbn:se:hh:diva-53249DOI: 10.48550/arXiv.2404.15608OAI: oai:DiVA.org:hh-53249DiVA, id: diva2:1853554
Forskningsfinansiär
Vinnova, 2022-00919Vetenskapsrådet, 2016-03497Vetenskapsrådet, 2021-05110
Anmärkning

Som manuscript i avhandling/As manuscript in thesis

Tillgänglig från: 2024-04-22 Skapad: 2024-04-22 Senast uppdaterad: 2025-10-01Bibliografiskt granskad
Ingår i avhandling
1. Ocular Recognition in Unconstrained Sensing Environments
Öppna denna publikation i ny flik eller fönster >>Ocular Recognition in Unconstrained Sensing Environments
2024 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

This thesis focuses on the problem of increasing flexibility in the acquisition and application of biometric recognition systems based on the ocular region. While the ocular area is one of the oldest and most widely studied biometric regions thanks to its rich and discriminative elements and characteristics, most modalities such as retina, iris, eye movements, or oculomotor plant have limitations regarding data acquisition. Some require a specific type of illumination like the iris, a limited distance range like eye movements, or specific sensors and user collaboration like the retina. In this context, this thesis focuses on the periocular region, which stands out as the ocular modality with the fewest acquisition constraints. 

The first part focuses on using middle-layers' deep representation of pre-trained CNNs as a one-shot learning method, along with simple distance-based metrics and similarity scores for periocular recognition. This approach tackles the issue of limited data availability and collection for biometric recognition systems by eliminating the need to train the models for the target data. Furthermore, it allows seamless transitions between identification and verification scenarios with a single model, and tackles the problem of the open-world setting and training bias of CNNs. We demonstrate that off-the-shelf features from middle-layers can outperform CNNs trained for the target domain that followed a more extensive training strategy when target data is limited.

The second part of the thesis analyzes traditional methods for biometric systems in the context of periocular recognition. Nowadays, these methods are often overlooked in favor of deep learning solutions. However, we show that they can still outperform heavily trained CNNs in closed-world and open-world settings and can be used in conjunction with CNNs to further improve recognition performance. Moreover, we investigate the use of the complex structure tensor as a handcrafted texture extractor at the input of CNNs. We show that CNNs can benefit from this explicit textural information in terms of performance and convergence, offering the potential for network compression and explainability of the features used. We demonstrate that CNNs may not easily access the orientation information present in the images that are exploited in some more traditional approaches.

The final part of the thesis addresses the analysis of periocular recognition under different light spectra and the cross-spectral scenario. More specifically, we analyze the performance of the proposed methods under different light spectra. We also investigate the cross-spectral scenario for one-shot learning with middle-layers' deep representations and explore the possibility of bridging the domain gap in the cross-spectral scenario by training generative networks. This allows using simpler models and algorithms trained on a single spectrum.

Ort, förlag, år, upplaga, sidor
Halmstad: Halmstad University Press, 2024. s. 49
Serie
Halmstad University Dissertations ; 114
Nyckelord
Biometrics, Computer Vision, Pattern Recognition, Periocular Recognition
Nationell ämneskategori
Signalbehandling Datorgrafik och datorseende
Identifikatorer
urn:nbn:se:hh:diva-53257 (URN)978-91-89587-43-4 (ISBN)978-91-89587-42-7 (ISBN)
Disputation
2024-05-28, S3030, Kristian IV:s väg 3, 08:00 (Engelska)
Opponent
Handledare
Tillgänglig från: 2024-04-24 Skapad: 2024-04-24 Senast uppdaterad: 2025-10-01Bibliografiskt granskad

Open Access i DiVA

fulltext(10031 kB)121 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 10031 kBChecksumma SHA-512
47d9f558220186f66b3a935dacc0b7e4e0d93b267c85b79e23846eaa63b5c6c1d5336853d30dbedf83a5ae9da0fda6a7eedf9a5316462317a8b231be83f66fd8
Typ fulltextMimetyp application/pdf

Övriga länkar

Förlagets fulltext

Person

Hernandez-Diaz, KevinBigun, JosefAlonso-Fernandez, Fernando

Sök vidare i DiVA

Av författaren/redaktören
Hernandez-Diaz, KevinBigun, JosefAlonso-Fernandez, Fernando
Av organisationen
Akademin för informationsteknologi
Datorgrafik och datorseende

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 124 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 421 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf