hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Combined CNN and ViT features off-the-shelf: Another astounding baseline for recognition
Halmstad University, School of Information Technology, Center for Applied Intelligent Systems Research (CAISR).ORCID iD: 0000-0002-1400-346X
Halmstad University, School of Information Technology, Center for Applied Intelligent Systems Research (CAISR).
Halmstad University, School of Information Technology, Center for Applied Intelligent Systems Research (CAISR).ORCID iD: 0000-0002-2851-4260
Halmstad University, School of Information Technology, Center for Applied Intelligent Systems Research (CAISR).ORCID iD: 0000-0002-4929-1262
2024 (English)In: Proceedings - 16th IEEE International Workshop on Information Forensics and Security, WIFS 2024, IEEE, 2024, p. 1-6Conference paper, Published paper (Refereed)
Abstract [en]

We apply pre-trained architectures, originally developed for the ImageNet Large Scale Visual Recognition Challenge, for periocular recognition. These architectures have demon-strated significant success in various computer vision tasks beyond the ones for which they were designed. This work builds on our previous study using off-the-shelf Convolutional Neural Network (CNN) and extends it to include the more recently proposed Vision Transformers (ViT). Despite being trained for generic object classification, middle-layer features from CNNs and ViTs are a suitable way to recognize individuals based on periocular images. We also demonstrate that CNNs and ViTs are highly complementary since their combination results in boosted accuracy. In addition, we show that a small portion of these pre-trained models can achieve good accuracy, resulting in thinner models with fewer parameters, suitable for resource-limited environments such as mobiles. This efficiency improves if traditional handcrafted features are added as well. ©2024 IEEE.

Place, publisher, year, edition, pages
IEEE, 2024. p. 1-6
Series
IEEE International Workshop on Information Forensics and Security, ISSN 2157-4766, E-ISSN 2157-4774
Keywords [en]
Periocular recognition, deep representation, biometrics, transfer learning, one-shot learning, Convolutional Neural Network, Vision Transformers
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:hh:diva-54713DOI: 10.1109/WIFS61860.2024.10810712ISI: 001422478100039Scopus ID: 2-s2.0-85215518296ISBN: 979-8-3503-6442-2 (electronic)ISBN: 979-8-3503-6443-9 (print)OAI: oai:DiVA.org:hh-54713DiVA, id: diva2:1903681
Conference
16th IEEE International Workshop on Information Forensics and Security, WIFS 2024, Rome, Italy, December 2-5, 2024
Part of project
Facial Analysis in the Era of Mobile Devices and Face Masks, Swedish Research Council
Funder
Swedish Research CouncilVinnovaAvailable from: 2024-10-06 Created: 2024-10-06 Last updated: 2025-10-01Bibliographically approved

Open Access in DiVA

fulltext(5007 kB)90 downloads
File information
File name FULLTEXT01.pdfFile size 5007 kBChecksum SHA-512
637a4c6a9a3d5ba17fedbd9b716ba968b6800acc1f8af90b4682d55fd177f42743c18e3aa2b63948229eb775c5a80b70c53346e0166ae24dc13f474208f0797a
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Alonso-Fernandez, FernandoHernandez-Diaz, KevinTiwari, PrayagBigun, Josef

Search in DiVA

By author/editor
Alonso-Fernandez, FernandoHernandez-Diaz, KevinTiwari, PrayagBigun, Josef
By organisation
Center for Applied Intelligent Systems Research (CAISR)
Signal Processing

Search outside of DiVA

GoogleGoogle Scholar
Total: 90 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 935 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf