Meta-Learning from Multimodal Task Distributions Using Multiple Sets of Meta-Parameters
Anna Vettoruzzo, Halmstad University, School of Information Technology. ORCID iD: 0000-0003-0185-5038
Mohamed-Rafik Bouguelia, Halmstad University, School of Information Technology. ORCID iD: 0000-0002-2859-6155
Thorsteinn Rögnvaldsson, Halmstad University, School of Information Technology. ORCID iD: 0000-0001-5163-2997
2023 (English). In: 2023 International Joint Conference on Neural Networks (IJCNN), Piscataway, NJ: IEEE, 2023, p. 1-8. Conference paper, Published paper (Refereed)
Abstract [en]

Meta-learning, or learning to learn, involves training a model on various learning tasks in a way that allows it to quickly learn new tasks from the same distribution using only a small amount of training data (i.e., few-shot learning). Current meta-learning methods implicitly assume that the distribution over tasks is unimodal and consists of tasks belonging to a common domain, which significantly reduces the variety of task distributions they can handle. However, in real-world applications, tasks are often very diverse and come from multiple different domains, making it challenging to meta-learn common knowledge shared across the entire task distribution. In this paper, we propose a method for meta-learning from a multimodal task distribution. The proposed method learns multiple sets of meta-parameters (acting as different initializations of a neural network model) and uses a task encoder to select the best initialization to fine-tune for a new task. More specifically, with a few training examples from a task sampled from an unknown mode, the proposed method predicts which set of meta-parameters (i.e., model initialization) would lead to fast adaptation and good post-adaptation performance on that task. We evaluate the proposed method on a diverse set of few-shot regression and image classification tasks. The results demonstrate the superiority of the proposed method compared to other state-of-the-art meta-learning methods and the benefit of learning multiple model initializations when tasks are sampled from a multimodal task distribution. © 2023 IEEE.
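
To make the selection mechanism described in the abstract more concrete, below is a minimal, hypothetical PyTorch-style sketch of the general idea: keep several sets of meta-parameters, encode the support set of a new task, and fine-tune only the initialization the encoder scores highest. All names (TaskEncoder, adapt, meta_param_sets, K) and the regression loss are illustrative assumptions, not the authors' implementation.

    # Hedged sketch of "multiple initializations + task encoder" selection (not the paper's code).
    import copy
    import torch
    import torch.nn as nn

    K = 4  # assumed number of meta-parameter sets (one per presumed mode of the task distribution)

    class TaskEncoder(nn.Module):
        # Maps a small support set {(x, y)} to logits over the K candidate initializations.
        def __init__(self, pair_dim, hidden=64, num_inits=K):
            super().__init__()
            self.net = nn.Sequential(nn.Linear(pair_dim, hidden), nn.ReLU(),
                                     nn.Linear(hidden, num_inits))

        def forward(self, support_x, support_y):
            pairs = torch.cat([support_x, support_y], dim=-1)  # (N, pair_dim) input-target pairs
            return self.net(pairs).mean(dim=0)                 # average over the support set -> (K,) logits

    def adapt(base_model, init_state, support_x, support_y, steps=5, lr=1e-2):
        # Fine-tune a copy of the chosen initialization on the few support examples (few-shot regression here).
        model = copy.deepcopy(base_model)
        model.load_state_dict(init_state)
        opt = torch.optim.SGD(model.parameters(), lr=lr)
        for _ in range(steps):
            opt.zero_grad()
            loss = nn.functional.mse_loss(model(support_x), support_y)
            loss.backward()
            opt.step()
        return model

    # At meta-test time, for a new task with a handful of labelled examples:
    #   logits = encoder(support_x, support_y)      # which initialization looks best for this task?
    #   chosen = int(torch.argmax(logits))
    #   adapted = adapt(base_model, meta_param_sets[chosen], support_x, support_y)
    # where meta_param_sets is a list of K state_dicts learned during meta-training.

The point illustrated here, as stated in the abstract, is that only the single initialization predicted by the task encoder is fine-tuned for a new task, so that prediction governs both adaptation speed and post-adaptation performance.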

Place, publisher, year, edition, pages
Piscataway, NJ: IEEE, 2023. p. 1-8
Keywords [en]
Meta-Learning, Few-Shot Learning, Transfer Learning, Task Representation, Multimodal Distribution
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-51352
DOI: 10.1109/IJCNN54540.2023.10191944
ISI: 001046198707013
Scopus ID: 2-s2.0-85169561819
ISBN: 978-1-6654-8867-9 (electronic)
OAI: oai:DiVA.org:hh-51352
DiVA id: diva2:1786779
Conference
International Joint Conference on Neural Networks (IJCNN 2023), Gold Coast, Australia, 18-23 June, 2023
Available from: 2023-08-10. Created: 2023-08-10. Last updated: 2025-10-01. Bibliographically approved.
In thesis
1. Advancing Meta-Learning for Enhanced Generalization Across Diverse Tasks
2025 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Meta-learning, or learning to learn, is a rapidly evolving area in machine learning that aims to enhance the adaptability and efficiency of learning algorithms. Inspired by the human ability to learn new concepts from limited examples and quickly adapt to unforeseen situations, meta-learning leverages prior experience to prepare models for fast adaptation to new tasks. Unlike traditional machine learning systems, where models are trained for specific tasks, meta-learning frameworks enable models to acquire generalized knowledge during training and efficiently learn new tasks during inference. This ability to generalize from past experiences to new tasks makes meta-learning a key focus in advancing artificial intelligence, offering the potential to create more flexible and efficient AI systems capable of performing well with minimal data.

In this thesis, we begin by formally defining the meta-learning framework, establishing clear terminology, and synthesizing existing work in a comprehensive survey paper. Building on this foundation, we demonstrate how meta-learning can be integrated into various fields to enhance model performance and extend capabilities to few-shot learning scenarios. We show how meta-learning can significantly improve the accuracy and efficiency of transferring knowledge across domains in domain adaptation. In scenarios involving a multimodal distribution of tasks, we develop methods that efficiently learn from and adapt to a wide variety of tasks drawn from different modes within the distribution, ensuring effective adaptation across diverse domains. Our work on personalized federated learning highlights meta-learning's potential to tailor federated learning processes to individual user needs while maintaining privacy and data security. Additionally, we address the challenges of continual learning by developing models that continuously integrate new information without forgetting previously acquired knowledge. For time series data analysis, we present meta-learning strategies that automatically learn optimal augmentation techniques, enhancing model predictions and offering robust solutions for real-world applications. Lastly, our pioneering research on unsupervised meta-learning via in-context learning explores innovative approaches for constructing tasks and learning effectively from unlabeled data.

Overall, the contributions of this thesis emphasize the potential of meta-learning techniques to improve performance across diverse research areas and demonstrate how advancements in one area can benefit the field as a whole.

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2025. p. 46
Series
Halmstad University Dissertations ; 127
Keywords
Meta-learning, Few-shot learning, Domain adaptation, Federated learning, Continual learning, Unsupervised learning, In-context learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-55147 (URN)
978-91-89587-71-7 (ISBN)
978-91-89587-70-0 (ISBN)
Public defence
2025-02-03, S1022, Kristian IV:s väg 3, 30118 Halmstad, 13:00 (English)
Available from: 2025-01-08. Created: 2025-01-07. Last updated: 2025-10-01. Bibliographically approved.

Open Access in DiVA

Meta-Learning from Multimodal Task Distributions Using Multiple Sets of Meta-Parameters (458 kB)
File information
File name: FULLTEXT01.pdf
File size: 458 kB
Checksum (SHA-512): 5aa0837650160cd770ba825e8c202c4ea3d7259b57f767d8bc660896c976148853f370f301657785cc12e6b4a2c90b126b5307e98d35586875a5db71fba2ad90
Type: fulltext
Mimetype: application/pdf

Other links

Publisher's full text
Scopus
