hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Material handling machine activity recognition by context ensemble with gated recurrent units
Halmstad University, School of Information Technology.ORCID iD: 0000-0002-6420-8316
Halmstad University, School of Information Technology.ORCID iD: 0000-0001-5163-2997
Halmstad University, School of Information Technology.ORCID iD: 0000-0002-7796-5201
Halmstad University, School of Information Technology.ORCID iD: 0000-0003-3272-4145
Show others and affiliations
2023 (English)In: Engineering applications of artificial intelligence, ISSN 0952-1976, E-ISSN 1873-6769, Vol. 126, no Part C, article id 106992Article in journal (Refereed) Published
Abstract [en]

Research on machine activity recognition (MAR) is drawing more attention because MAR can provide productivity monitoring for efficiency optimization, better maintenance scheduling, product design improvement, and potential material savings. A particular challenge of MAR for human-operated machines is the overlap when transiting from one activity to another: during transitions, operators often perform two activities simultaneously, e.g., lifting the fork already while approaching a rack, so the exact time when one activity ends and another begins is uncertain. Machine learning models are often uncertain during such activity transitions, and we propose a novel ensemble-based method adapted to fuzzy transitions in a forklift MAR problem. Unlike traditional ensembles, where models in the ensemble are trained on different subsets of data, or with costs that force them to be diverse in their responses, our approach is to train a single model that predicts several activity labels, each under a different context. These individual predictions are not made by independent networks but are made using a structure that allows for sharing important features, i.e., a context ensemble. The results show that the gated recurrent unit network can provide medium or strong confident context ensembles for 95% of the cases in the test set, and the final forklift MAR result achieves accuracies of 97% for driving and 90% for load-handling activities. This study is the first to highlight the overlapping activity issue in MAR problems and to demonstrate that the recognition results can be significantly improved by designing a machine learning framework that addresses this issue. © 2023 The Author(s)

Place, publisher, year, edition, pages
Oxford: Elsevier, 2023. Vol. 126, no Part C, article id 106992
Keywords [en]
Context ensemble, Gated recurrent unit, Machine activity recognition, Material handling, Productivity monitoring
National Category
Computer Sciences Production Engineering, Human Work Science and Ergonomics
Research subject
Smart Cities and Communities
Identifiers
URN: urn:nbn:se:hh:diva-48552DOI: 10.1016/j.engappai.2023.106992ISI: 001070748600001Scopus ID: 2-s2.0-85169031390OAI: oai:DiVA.org:hh-48552DiVA, id: diva2:1707472
Funder
Knowledge Foundation, 20200001
Note

Funding agency: Toyota Material Handling Manufacturing Sweden AB

Available from: 2022-10-31 Created: 2022-10-31 Last updated: 2024-07-01Bibliographically approved
In thesis
1. Learning Representations for Machine Activity Recognition
Open this publication in new window or tab >>Learning Representations for Machine Activity Recognition
2022 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Machine activity recognition (MAR) is an essential and effective approach for equipment productivity monitoring. Developing MAR methods for forklift trucks, a vital piece of the industry, can benefit productivity efficiency, maintenance service, product design, and potential savings. With the growth of the Internet of Things, a large amount of sensory data has become accessible. Conventional MAR methods that have been developed primarily focus on data collected from external sensors, such as inertial measurement units (IMUs) and cameras. However, they are not effective for forklift applications: the IMU data does not reflect kinematic patterns due to a lack of large articulated parts, while the vision-based data collection requires many cameras to create sufficient coverage of an indoor environment, which, in result, risks the privacy and is less economical. Moreover, typical objectives in the existing MAR works are heavy equipment in construction sites where the working environment and tasks differ from the logistics sector. Therefore, it is necessary to develop intelligent and innovative approaches that are more suitable for forklift trucks.

This thesis demonstrates developing and utilizing representation learning methods to solve forklift MAR problems, based on the assumption that forklift activities are formed by a series of basic movements that can be detected from the onboard communication, i.e., signals in a Controller Area Network (CAN). Most of the methods proposed in this thesis incorporate semi-supervised techniques to deal with the limited amount of labeled data and to capitalize on a large amount of unlabeled data in our experiments. Deep neural networks are implemented to overcome different challenges of recognizing forklift activities and learn various representations of the data: i) learning invariant features to reconstruct input CAN signals by applying autoencoders, ii) learning discriminative features to recognize forklift activities by fine-tuning pre-training networks, and iii) learning temporal coherence to capture activity transitions by implementing gated recurrent units. Apart from achieving promising classification performance for forklift MAR problems, the representations obtained also support visualization and interpretability of the data as they are three-dimensional. Our ongoing works are new experiments about learning domain-invariant features, where domain adaptation methods are implemented to recognize activities performed by forklift trucks from different sites.

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2022. p. 67
Series
Halmstad University Dissertations ; 94
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-48549 (URN)978-91-89587-01-4 (ISBN)978-91-89587-00-7 (ISBN)
Presentation
2022-12-13, Wigforssalen in Hus J, Halmstad University, Kristian IV:s väg 3, Halmstad, 14:00 (English)
Opponent
Supervisors
Funder
Knowledge Foundation, 20200001
Available from: 2022-10-31 Created: 2022-10-31 Last updated: 2022-11-01Bibliographically approved
2. Learning Representations for Forklift Activity Recognition
Open this publication in new window or tab >>Learning Representations for Forklift Activity Recognition
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Machine Activity Recognition (MAR) is a research topic that focuses on the development of data-driven methods to improve productivity monitoring. The application and the perspective of MAR research jointly influence the diffi- culty of a MAR problem. Unlike previous MAR works, which have studied construction machinery from the viewpoint of the user, this project focuses on logistics equipment from the viewpoint of the original equipment manufac- turer. In terms of the application, forklift trucks have flexible functions and complex usage. The former is an intrinsic characteristic, as forklifts are me- chanically agile, and the latter is an extrinsic factor, as forklift usage can vary greatly with different drivers, loads, work shifts, and warehouse environments. As for the standpoint, manufacturers have customers who use their products all over the world. Studying a single machine or machines in a single site, i.e. the conventional MAR setting, cannot provide a general understanding of the equipment usage. Therefore, existing MAR methods with external sensory data and only supervised learning techniques are impractical in this case.

This thesis investigates learning representation-based methods for recog- nizing forklift routine activities using on-board sensory signals. Three methods are developed to capture important data features to overcome the challenges of forklift MAR. First, by pre-training autoencoders with unlabeled data and then fine-tuning them with pseudo-labeled data, discriminative features can be ex- tracted. Classifiers built on these features can outperform conventional MAR solutions that use only the labeled data. Second, training gated recurrent unit networks to recognize activities in different contexts can help to learn a repre- sentation that captures activities and their transitions, which further improves the MAR result. Third, implementing domain adversarial-training neural net- works with pseudo-labeled data can essentially compensate for the limited la- beled data from source domains, leading to representations that are informative for more than one domain. In addition, testing the full method on a real truck has demonstrated the applicability of the proposed method and the feasibility of an online MAR solution.

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2024. p. 33
Series
Halmstad University Dissertations ; 119
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-54111 (URN)978-91-89587-55-7 (ISBN)978-91-89587-54-0 (ISBN)
Public defence
2024-08-30, S3030, Kristian IV:s väg 3, Halmstad, 13:00 (English)
Opponent
Supervisors
Available from: 2024-07-12 Created: 2024-06-26 Last updated: 2024-07-31Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Chen, KunruRögnvaldsson, ThorsteinnNowaczyk, SławomirPashami, Sepideh

Search in DiVA

By author/editor
Chen, KunruRögnvaldsson, ThorsteinnNowaczyk, SławomirPashami, Sepideh
By organisation
School of Information Technology
In the same journal
Engineering applications of artificial intelligence
Computer SciencesProduction Engineering, Human Work Science and Ergonomics

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 246 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf