Real Time Gym Activity Detection using Monocular RGB Camera
Halmstad University, School of Information Technology.
2020 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

Action detection is an attractive area for researchers in computer vision, healthcare, physiotherapy, psychology, and other fields. Intensive work has been done in this area because of its wide range of applications, such as security surveillance, video tagging, Human-Computer Interaction (HCI), robotics, medical diagnosis, sports analysis, and interactive gaming. Following the success of deep learning in computer vision tasks such as image classification, many researchers have tried to extend deep learning models to video classification and activity recognition. The research question of this thesis is whether 2D human poses, extracted by a DNN-based model from RGB frames only, can be used for the online activity detection task, and how this approach compares with state-of-the-art solutions that take as input 3D human skeletal data extracted by a depth sensor. This work also shows the importance of input pre-processing and filtering for improving the performance of the online human activity detector. Detecting gym exercises and counting repetitions in real time, using human skeletal data versus 2D poses, is studied in depth. The contributions of this work are as follows: 1) generating an RGB-D dataset for a set of gym exercises, 2) proposing a novel real-time skeleton-based Double Representational RNN (DR-RNN) network architecture for online action detection, 3) demonstrating that the proposed model can achieve satisfactory results using pose estimation models applied to RGB frames, and 4) introducing a novel learnable exponential filter for online low-latency filtering applications.
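To illustrate the kind of online, low-latency filtering the abstract refers to, the following is a minimal sketch of a plain exponential filter applied to a stream of flattened 2D keypoint coordinates. It is not the learnable filter proposed in the thesis: the smoothing factor `alpha` is a fixed, hand-chosen value here, and the names `ExponentialFilter` and `smooth_stream` are hypothetical.

```python
from typing import Iterable, List, Optional, Sequence


class ExponentialFilter:
    """Fixed-alpha exponential smoothing for a stream of keypoint vectors.

    Assumption: each sample is a flat sequence of coordinates
    (x0, y0, x1, y1, ...) for one frame. This is an illustrative
    stand-in, not the thesis's learnable filter.
    """

    def __init__(self, alpha: float) -> None:
        if not 0.0 < alpha <= 1.0:
            raise ValueError("alpha must be in (0, 1]")
        self.alpha = alpha
        self.state: Optional[List[float]] = None

    def update(self, sample: Sequence[float]) -> List[float]:
        """Blend the new sample with the previous smoothed state."""
        if self.state is None:
            # The first frame initialises the state unchanged.
            self.state = list(sample)
        else:
            a = self.alpha
            self.state = [a * x + (1.0 - a) * s
                          for x, s in zip(sample, self.state)]
        return list(self.state)


def smooth_stream(frames: Iterable[Sequence[float]],
                  alpha: float) -> List[List[float]]:
    """Filter frames one at a time, as an online detector would see them."""
    filt = ExponentialFilter(alpha)
    return [filt.update(frame) for frame in frames]
```

A smaller `alpha` suppresses more jitter in the estimated poses but makes the output lag further behind the input, which is the trade-off a low-latency filter has to balance.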

Place, publisher, year, edition, pages
2020. , p. 38
Keywords [en]
ActivityDetection, RNN, GymActionDetection, PoseBasedActionDetection, DeepLearning, SkeletonBasedOnlineActionDetection
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:hh:diva-41440
OAI: oai:DiVA.org:hh-41440
DiVA, id: diva2:1390044
Subject / course
Computer science and engineering
Educational program
Master's Programme in Embedded and Intelligent Systems, 120 credits
Presentation
2019-05-24, Halmstad, 00:47 (English)
Supervisors
Examiners
Available from: 2020-01-31 Created: 2020-01-31 Last updated: 2020-01-31 Bibliographically approved

Open Access in DiVA

fulltext (8129 kB), 39 downloads
File information
File name: FULLTEXT02.pdf. File size: 8129 kB. Checksum (SHA-512):
d56252fcc15e1e0c1ed871e507ad1fd7e4695851a0996d1a3a218f6d13c2923cb4cf597e9c8b23ed8bb7661ab22dc6808a2d86db4109c39fad77af6f99f1d35f
Type: fulltext. Mimetype: application/pdf

Search in DiVA

By author/editor
Alshatta, Mohammad Samer
By organisation
School of Information Technology
Engineering and Technology
