Understanding and Improving CNNs with Complex Structure Tensor: A Biometrics Study
Högskolan i Halmstad, Akademin för informationsteknologi. ORCID iD: 0000-0002-9696-7843
Högskolan i Halmstad, Akademin för informationsteknologi. ORCID iD: 0000-0002-4929-1262
Högskolan i Halmstad, Akademin för informationsteknologi. ORCID iD: 0000-0002-1400-346X
(English) Manuscript (preprint) (Other academic)
Abstract [en]

Our study provides evidence that CNNs struggle to effectively extract orientation features. We show that the use of Complex Structure Tensor, which contains compact orientation features with certainties, as input to CNNs consistently improves identification accuracy compared to using grayscale inputs alone. Experiments also demonstrated that our inputs, which were provided by mini complex conv-nets, combined with reduced CNN sizes, outperformed full-fledged, prevailing CNN architectures. This suggests that the upfront use of orientation features in CNNs, a strategy seen in mammalian vision, not only mitigates their limitations but also enhances their explainability and relevance to thin-clients. Experiments were done on publicly available data sets comprising periocular images for biometric identification and verification (Close and Open World) using 6 State of the Art CNN architectures. We reduced SOA Equal Error Rate (EER) on the PolyU dataset by 5-26 % depending on data and scenario.
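
As a rough illustration of what a complex structure tensor input could look like, the sketch below uses the common textbook formulation: the squared complex gradient (f_x + i f_y)^2, averaged over a neighbourhood (often written I20), encodes the dominant orientation in double-angle form, and the averaged gradient energy (I11) bounds its magnitude, so their ratio can be read as a certainty. The function names, the Gaussian scales `sigma_grad` and `sigma_int`, and the NumPy-only filtering are assumptions made for this example. The manuscript obtains its inputs from mini complex conv-nets, which this sketch does not attempt to reproduce.

```python
import numpy as np

def _gauss_pair(sigma):
    """Return a 1-D Gaussian and its derivative, sampled on about [-3*sigma, 3*sigma]."""
    radius = max(1, int(np.ceil(3 * sigma)))
    x = np.arange(-radius, radius + 1, dtype=float)
    g = np.exp(-x**2 / (2 * sigma**2))
    g /= g.sum()
    dg = -x / sigma**2 * g  # derivative of the Gaussian
    return g, dg

def _separable(img, k_x, k_y):
    """Convolve with k_x along x (axis 1), then with k_y along y (axis 0).

    Borders are zero-padded; the image must be at least as large as the kernels.
    """
    out = np.apply_along_axis(np.convolve, 1, img, k_x, mode="same")
    return np.apply_along_axis(np.convolve, 0, out, k_y, mode="same")

def complex_structure_tensor(img, sigma_grad=1.0, sigma_int=2.0):
    """Return (I20, I11) for a 2-D grayscale image (hypothetical helper).

    I20 is complex: its argument is twice the dominant gradient angle.
    I11 is real and non-negative, with |I20| <= I11.
    """
    img = np.asarray(img, dtype=float)
    g, dg = _gauss_pair(sigma_grad)
    fx = _separable(img, dg, g)   # derivative along x
    fy = _separable(img, g, dg)   # derivative along y
    z = fx + 1j * fy              # complex gradient
    w, _ = _gauss_pair(sigma_int)
    i20 = _separable(z**2, w, w)
    i11 = _separable(np.abs(z)**2, w, w)
    return i20, i11

# Example: vertical stripes, intensity varies only along x.
yy, xx = np.mgrid[0:64, 0:64]
stripes = np.cos(0.5 * xx)
i20, i11 = complex_structure_tensor(stripes)
angle = 0.5 * np.angle(i20[32, 32])  # dominant gradient direction at an interior pixel
```

For the stripe image the gradient points along x, so `angle` at an interior pixel is close to 0 and `|I20|` matches `I11` there. A network input in this spirit could stack the real part of I20, the imaginary part of I20, and I11 as three channels, although that particular layout is an assumption rather than something stated in the abstract.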

National Category
Identifiers
URN: urn:nbn:se:hh:diva-53249
DOI: 10.48550/arXiv.2404.15608
OAI: oai:DiVA.org:hh-53249
DiVA, id: diva2:1853554
Funder
Vinnova, 2022-00919; Swedish Research Council, 2016-03497; Swedish Research Council, 2021-05110
Note

As manuscript in thesis

Available from: 2024-04-22 Created: 2024-04-22 Last updated: 2025-10-01 Bibliographically approved
In thesis
1. Ocular Recognition in Unconstrained Sensing Environments
2024 (English) Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

This thesis focuses on the problem of increasing flexibility in the acquisition and application of biometric recognition systems based on the ocular region. While the ocular area is one of the oldest and most widely studied biometric regions thanks to its rich and discriminative elements and characteristics, most modalities such as retina, iris, eye movements, or oculomotor plant have limitations regarding data acquisition. Some require a specific type of illumination like the iris, a limited distance range like eye movements, or specific sensors and user collaboration like the retina. In this context, this thesis focuses on the periocular region, which stands out as the ocular modality with the fewest acquisition constraints. 

The first part focuses on using middle-layers' deep representation of pre-trained CNNs as a one-shot learning method, along with simple distance-based metrics and similarity scores for periocular recognition. This approach tackles the issue of limited data availability and collection for biometric recognition systems by eliminating the need to train the models for the target data. Furthermore, it allows seamless transitions between identification and verification scenarios with a single model, and tackles the problem of the open-world setting and training bias of CNNs. We demonstrate that off-the-shelf features from middle-layers can outperform CNNs trained for the target domain that followed a more extensive training strategy when target data is limited.
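
The matching step described above can be sketched as follows, assuming feature vectors have already been extracted from a middle layer of a pre-trained CNN. Cosine similarity, the `threshold` value, and the helper names `identify` and `verify` are assumptions chosen for illustration; the abstract only states that simple distance-based metrics and similarity scores are used.

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def identify(probe, gallery):
    """Identification: return the enrolled identity most similar to the probe.

    `gallery` maps an identity label to one enrolled feature vector (one shot).
    """
    return max(gallery, key=lambda name: cosine_similarity(probe, gallery[name]))

def verify(probe, template, threshold=0.5):
    """Verification: accept the claimed identity if similarity reaches the threshold."""
    return cosine_similarity(probe, template) >= threshold

# Toy vectors standing in for CNN features (hypothetical data).
gallery = {"subject_a": [1.0, 0.0, 0.0], "subject_b": [0.0, 1.0, 0.0]}
probe = [0.9, 0.1, 0.0]
best_match = identify(probe, gallery)
accepted = verify(probe, gallery["subject_a"])
```

Because enrolment only stores a feature vector per identity, the same extractor serves both identification and verification, and new identities can be added without retraining, which is the flexibility the abstract points to.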

The second part of the thesis analyzes traditional methods for biometric systems in the context of periocular recognition. Nowadays, these methods are often overlooked in favor of deep learning solutions. However, we show that they can still outperform heavily trained CNNs in closed-world and open-world settings and can be used in conjunction with CNNs to further improve recognition performance. Moreover, we investigate the use of the complex structure tensor as a handcrafted texture extractor at the input of CNNs. We show that CNNs can benefit from this explicit textural information in terms of performance and convergence, offering the potential for network compression and explainability of the features used. We demonstrate that CNNs may not easily access the orientation information present in the images that are exploited in some more traditional approaches.

The final part of the thesis addresses the analysis of periocular recognition under different light spectra and the cross-spectral scenario. More specifically, we analyze the performance of the proposed methods under different light spectra. We also investigate the cross-spectral scenario for one-shot learning with middle-layers' deep representations and explore the possibility of bridging the domain gap in the cross-spectral scenario by training generative networks. This allows using simpler models and algorithms trained on a single spectrum.

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2024. p. 49
Series
Halmstad University Dissertations ; 114
Keywords
Biometrics, Computer Vision, Pattern Recognition, Periocular Recognition
National Category
Identifiers
URN: urn:nbn:se:hh:diva-53257
ISBN: 978-91-89587-43-4
ISBN: 978-91-89587-42-7
Public defence
2024-05-28, S3030, Kristian IV:s väg 3, 08:00 (English)
Opponent
Supervisors
Available from: 2024-04-24 Created: 2024-04-24 Last updated: 2025-10-01 Bibliographically approved

Open Access in DiVA

fulltext (10031 kB), 120 downloads
File information
File name: FULLTEXT01.pdf
File size: 10031 kB
Checksum SHA-512:
47d9f558220186f66b3a935dacc0b7e4e0d93b267c85b79e23846eaa63b5c6c1d5336853d30dbedf83a5ae9da0fda6a7eedf9a5316462317a8b231be83f66fd8
Type: fulltext
Mimetype: application/pdf

Other links

Publisher's full text

Authors

Hernandez-Diaz, Kevin; Bigun, Josef; Alonso-Fernandez, Fernando

Total: 123 downloads
The number of downloads is the sum of all downloads of all full texts. It may include, for example, earlier versions that are no longer available.

Total: 421 hits