hh.sePublications
Change search
Link to record
Permanent link

Direct link
Publications (10 of 138) Show all publications
Wang, M., Zhang, Y., Ren, C., Li, Q., Tiwari, P., Wang, B. & Qin, J. (2026). Adaptive Boosting LLMs for Text Classification. IEEE Transactions on Neural Networks and Learning Systems, 1-12
Open this publication in new window or tab >>Adaptive Boosting LLMs for Text Classification
Show others...
2026 (English)In: IEEE Transactions on Neural Networks and Learning Systems, ISSN 2162-237X, E-ISSN 2162-2388, p. 1-12Article in journal (Refereed) Epub ahead of print
Abstract [en]

With large-scale language models demonstrating superior capabilities in a wide range of downstream natural language processing tasks, the future trajectory of research in the field of text categorization faces increasing uncertainty. In this evolving paradigm of open-ended language modeling, where task delimitations are increasingly blurred, a pressing question arises: to what extent has text classification advanced under the full potential of large language model (LLM)? To address this pivotal inquiry, we introduce recurrent generative pre-trained transformer (RGPT), an adaptive boosting framework meticulously designed to craft a dedicated LLM for text classification. RGPT constructs a sequence of base learners by dynamically modulating the training data distribution and iteratively fine-tuning LLMs. These base learners are then progressively integrated, leveraging historical prediction trajectories to form a highly specialized text classification model. Extensive empirical evaluations demonstrate that RGPT surpasses eight state-of-the-art pretrained language models and seven cutting-edge LLMs across four benchmark datasets, achieving an average performance gain of 2.90%. © 2012 IEEE.

Place, publisher, year, edition, pages
Piscataway, NJ: IEEE, 2026
Keywords
Boosting, large language model, natural language processing, text classification
National Category
Natural Language Processing
Identifiers
urn:nbn:se:hh:diva-58202 (URN)10.1109/TNNLS.2025.3639613 (DOI)001663478400001 ()41525530 (PubMedID)2-s2.0-105027441383 (Scopus ID)
Available from: 2026-01-30 Created: 2026-01-30 Last updated: 2026-01-30Bibliographically approved
Qu, Z., Li, Y., Liu, B., Gupta, D. & Tiwari, P. (2026). DTQFL: A Digital Twin-Assisted Quantum Federated Learning Algorithm for Intelligent Diagnosis in 5G Mobile Network. IEEE journal of biomedical and health informatics, 30(1), 17-26
Open this publication in new window or tab >>DTQFL: A Digital Twin-Assisted Quantum Federated Learning Algorithm for Intelligent Diagnosis in 5G Mobile Network
Show others...
2026 (English)In: IEEE journal of biomedical and health informatics, ISSN 2168-2194, E-ISSN 2168-2208, Vol. 30, no 1, p. 17-26Article in journal (Refereed) Published
Abstract [en]

Smart healthcare aims to revolutionize med-ical services by integrating artificial intelligence (AI). The limitations of classical machine learning include privacy concerns that prevent direct data sharing among medical institutions, untimely updates, and long training times. To address these issues, this study proposes a digital twin-assisted quantum federated learning algorithm (DTQFL). By leveraging the 5G mobile network, digital twins (DT) of patients can be created instantly using data from various Internet of Medical Things (IoMT) devices and simultane-ously reduce communication time in federated learning (FL) at the same time. DTQFL generates DT for patients with specific diseases, allowing for synchronous training and updating of the variational quantum neural network (VQNN) without disrupting the VQNN in the real world. This study utilized DTQFL to train its own personalized VQNN for each hospital, considering privacy security and training speed. Simultaneously, the personalized VQNN of each hospital was obtained through further local iterations of the final global parameters. The results indicate that DTQFL can train a good VQNN without collecting local data while achieving accuracy comparable to that of data-centralized algorithms. In addition, after personalized train-ing, the VQNN can achieve higher accuracy than that with-out personalized training. © 2026 IEEE.

Place, publisher, year, edition, pages
Piscataway, NJ: Institute of Electrical and Electronics Engineers (IEEE), 2026
Keywords
digital twin, federated learning, Federated learning, Hospitals, Medical services, mobile network, Privacy, Quantum cascade lasers, quantum neural network, Servers, Smart healthcare, Training
National Category
Other Computer and Information Science
Identifiers
urn:nbn:se:hh:diva-51549 (URN)10.1109/JBHI.2023.3303401 (DOI)001662927800002 ()37552590 (PubMedID)2-s2.0-85167839904 (Scopus ID)
Available from: 2023-08-31 Created: 2023-08-31 Last updated: 2026-02-06Bibliographically approved
Sun, L., Chen, Q., Zheng, M., Ning, X., Gupta, D. & Tiwari, P. (2026). Energy-Efficient Online Continual Learning for Time Series Classification in Nanorobot-Based Smart Health. IEEE journal of biomedical and health informatics, 30(1), 81-89
Open this publication in new window or tab >>Energy-Efficient Online Continual Learning for Time Series Classification in Nanorobot-Based Smart Health
Show others...
2026 (English)In: IEEE journal of biomedical and health informatics, ISSN 2168-2194, E-ISSN 2168-2208, Vol. 30, no 1, p. 81-89Article in journal (Refereed) Published
Abstract [en]

Nanorobots have been used in smart health to collect time series data such as electrocardiograms and electroencephalograms. Real-time classification of dynamic time series signals in nanorobots is a challenging task. Nanorobots in the nanoscale range require a classification algorithm with low computational complexity. First, the classification algorithm should be able to dynamically analyze time series signals and update itself to process the concept drifts (CD). Second, the classification algorithm should have the ability to handle catastrophic forgetting (CF) and classify historical data. Most importantly, the classification algorithm should be energy-efficient to use less computing power and memory to classify signals in real-time on a smart nanorobot. To solve these challenges, we design an algorithm that can Prevent Concept Drift in Online continual Learning for time series classification (PCDOL). The prototype suppression item in PCDOL can reduce the impact caused by CD. It also solves the CF problem through the replay feature. The computation per second and the memory consumed by PCDOL are only 3.572M and 1KB, respectively. The experimental results show that PCDOL is better than several state-of-the-art methods for dealing with CD and CF in energy-efficient nanorobots. © 2026 IEEE.

Place, publisher, year, edition, pages
Piscataway, NJ: Institute of Electrical and Electronics Engineers (IEEE), 2026
Keywords
Classification algorithms, concept drift, Feature extraction, Nanobioscience, nanorobot, online continual learning, Prototypes, sensor time series classification, smart health, Task analysis, Time series analysis, Training
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-51429 (URN)10.1109/JBHI.2023.3289992 (DOI)001662927800003 ()37368802 (PubMedID)2-s2.0-85163564312 (Scopus ID)
Note

Funding: National Natural Science Foundation of China (Grant Number: 61702274) and Major Key Project of PCL (Grant Number: PCL2022A03, PCL2021A02 and PCL2021A09)

Available from: 2023-08-17 Created: 2023-08-17 Last updated: 2026-02-06Bibliographically approved
Li, B., Zhang, X., Huang, Z., Tiwari, P., Zou, Q., Ding, Y. & Guo, X. (2026). Enhancing anticancer peptide discovery: A Fusion-Centric Framework With Conditional Diffusion For Prediction And Generation. PloS Computational Biology, 22(3), 1-29, Article ID e1014098.
Open this publication in new window or tab >>Enhancing anticancer peptide discovery: A Fusion-Centric Framework With Conditional Diffusion For Prediction And Generation
Show others...
2026 (English)In: PloS Computational Biology, ISSN 1553-734X, E-ISSN 1553-7358, Vol. 22, no 3, p. 1-29, article id e1014098Article in journal (Refereed) Published
Abstract [en]

Anticancer peptides (ACPs) are short bioactive sequences that selectively target tumor cells with minimal toxicity, positioning them as promising candidates for next-generation cancer therapies. However, existing computational models face limitations in sequence representation and class imbalance. To address these challenges, we propose UACD-ACPs, a unified fusion-driven framework that integrates a diffusion-inspired noise-conditioned classifier for ACP prediction and a diffusion-based peptide generation module with cancer-type-aware organization for targeted downstream screening. The classification module integrates ProtBERT-based semantic embeddings with physicochemical descriptors via the Multiscale Embedding Compression Strategy (MECS) and a diffusion-inspired noise-conditioned encoder, substantially enhancing predictive robustness and accuracy, particularly under challenging imbalanced multi-class settings. In the generative pipeline, we introduce a denoising diffusion-based generative framework augmented by two novel fusion modules: the Bitemporal Fusion Module (BFM) and the Temporal Feature Attention Module (TFAM). These modules perform multi-scale temporal and semantic fusion to promote the generation of structurally coherent and functionally relevant peptide candidates. Experimental results demonstrate that UACD-ACPs outperforms state-of-the-art methods in terms of accuracy, F1-score, and AUC-ROC. The generated peptides exhibit favorable physicochemical properties, diverse secondary structures, and strong structural stability, as validated by molecular dynamics simulations and membrane-binding analyses. Overall, this study highlights the potential of fusion-driven diffusion-based frameworks for alleviating class imbalance and data heterogeneity in anticancer peptide modeling, paving the way for scalable and biologically grounded ACP discovery. © 2026 Li et al.

Place, publisher, year, edition, pages
San Francisco: Public Library of Science (PLoS), 2026
Keywords
identification, inhibitor, language
National Category
Bioinformatics (Computational Biology) Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:hh:diva-58706 (URN)10.1371/journal.pcbi.1014098 (DOI)001724448300001 ()41886705 (PubMedID)2-s2.0-105034373461 (Scopus ID)
Available from: 2026-04-07 Created: 2026-04-07 Last updated: 2026-04-27Bibliographically approved
Jankowska, J., Kostek, B., Alonso-Fernandez, F. & Tiwari, P. (2026). Exploring the correlation between the type of music and the emotions evoked: A study using subjective questionnaires and EEG. In: Progress in Artificial Intelligence and Pattern Recognition: 9th International Congress, IWAIPR 2025, Varadero, Cuba, October 14–17, 2025, Proceedings. Paper presented at 9th International Congress, IWAIPR: International Congress on Artificial Intelligence and Pattern Recognition, October 14–17, 2025, Varadero, Cuba (pp. 395-406). Heidelberg: Springer
Open this publication in new window or tab >>Exploring the correlation between the type of music and the emotions evoked: A study using subjective questionnaires and EEG
2026 (English)In: Progress in Artificial Intelligence and Pattern Recognition: 9th International Congress, IWAIPR 2025, Varadero, Cuba, October 14–17, 2025, Proceedings, Heidelberg: Springer, 2026, p. 395-406Conference paper, Published paper (Refereed)
Abstract [en]

The subject of this work is to check how different types of music affect human emotions. While listening to music, a subjective survey and brain activity measurements were carried out using an EEG helmet. The aim is to demonstrate the impact of different music genres on emotions. The research involved a diverse group of participants of different gender and musical preferences. This had the effect of capturing a wide range of emotional responses to music. After the experiment, a relationship analysis of the respondents’ questionnaires with EEG signals was performed. The analysis revealed connections between emotions and observed brain activity. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.

Place, publisher, year, edition, pages
Heidelberg: Springer, 2026
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 16328
Keywords
Music and Emotion, EEG-Based Emotion Recognition, Brain-Computer Interface (BCI)
National Category
Signal Processing
Identifiers
urn:nbn:se:hh:diva-57694 (URN)10.1007/978-3-032-11358-0_33 (DOI)2-s2.0-105029910265 (Scopus ID)978-3-032-11357-3 (ISBN)978-3-032-11358-0 (ISBN)
Conference
9th International Congress, IWAIPR: International Congress on Artificial Intelligence and Pattern Recognition, October 14–17, 2025, Varadero, Cuba
Funder
Swedish Research Council
Available from: 2025-10-30 Created: 2025-10-30 Last updated: 2026-04-22Bibliographically approved
He, L., Zhao, C., Wang, Y. & Tiwari, P. (2026). FDA-CAPMA: Federated domain adaptation with co-activation pattern and multimodal mamba for fMRI depression detection. Information Fusion, 132, Article ID 104213.
Open this publication in new window or tab >>FDA-CAPMA: Federated domain adaptation with co-activation pattern and multimodal mamba for fMRI depression detection
2026 (English)In: Information Fusion, ISSN 1566-2535, E-ISSN 1872-6305, Vol. 132, article id 104213Article in journal (Refereed) Published
Abstract [en]

Major depressive disorder is projected to become the leading contributor to mental illness by 2030. While resting-state functional magnetic resonance imaging (rs-fMRI) has emerged as a non-invasive solution for depression detection, two significant challenges remain. First, due to medical data privacy regulations and the high costs associated with acquiring the necessary equipment, individual medical institutions struggle to obtain sufficient annotated data. Second, domain shifts, caused by discrepancies in scanner parameters and acquisition protocols across multi-center datasets, significantly hinder model generalization. To address these challenges, we propose a federated domain adaptation (FDA) method that integrates co-activation patterns and a multimodal Mamba network, termed FDA-CAPMA, for fMRI-based depression detection. Specifically, a federated learning architecture ensures both physical data isolation and patient privacy through parameter aggregation. A state-space model-based Mamba network captures cross-modal correlations between fMRI time-series features and non-imaging features. Additionally, a local maximum mean discrepancy (LMMD) module aligns source and target domain distributions in both feature and prediction spaces. Extensive experiments on the largest multi-center depression dataset (Rest-meta-MDD, 1813 participants) and ABIDE dataset, our method achieves an accuracy of 67.16%, and 65.72%, respectively. This work establishes a new paradigm for privacy-preserving depression recognition. Code will be available at: https://github.com/helang818/FDA-CAPMA/ © 2026 Elsevier B.V.

Place, publisher, year, edition, pages
Amsterdam: Elsevier, 2026
Keywords
Depression, Co-activation pattern, Federated domain adaptation, Mamba
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-58537 (URN)10.1016/j.inffus.2026.104213 (DOI)001697335500001 ()2-s2.0-105030338679 (Scopus ID)
Note

Funding information: This work is supported by National Natural Science Foundation of China (grant 62376215, 62236006, 62276210, 62306172, 82330043, 82471543, 62402386, 62206219, 82474666, 82105042), the Shanghai Key Laboratory of Tuina Techniques on Musculoskeletal Disorders (24dz2260200), the Three Year Action Plan for Shanghai to Further Accelerate the Inheritance, Innovation and Development of Traditional Chinese Medicine (ZY(2025–2027)-3-1-1), the Open Fund of National Engineering Laboratory for Big Data...

Available from: 2026-04-02 Created: 2026-04-02 Last updated: 2026-04-23Bibliographically approved
He, L., Chen, K., Zhao, J., Wang, Y., Pei, E., Chen, H., . . . Tiwari, P. (2026). LMVD: A large-scale multimodal vlog dataset for depression detection in the wild. Information Fusion, 126(Part: B), Article ID 103632.
Open this publication in new window or tab >>LMVD: A large-scale multimodal vlog dataset for depression detection in the wild
Show others...
2026 (English)In: Information Fusion, ISSN 1566-2535, E-ISSN 1872-6305, Vol. 126, no Part: B, article id 103632Article in journal (Refereed) Published
Abstract [en]

Depression profoundly impacts multiple dimensions of an individual's life, including personal and social functioning, academic achievement, occupational productivity, and overall quality of life. With recent advancements in affective computing, deep learning technologies have been increasingly adopted to identify patterns indicative of depression. However, due to concerns over participant privacy, data in this domain remain scarce, posing significant challenges for the development of robust discriminative models for depression detection. To address this limitation, we build a Large-scale Multimodal Vlog Dataset (LMVD) for depression recognition in real-world settings. The LMVD dataset comprises 1,823 video samples, totaling approximately 214 h of content, collected from 1,475 participants across four major multimedia platforms: Sina Weibo, Bilibili, TikTok, and YouTube. In addition, we introduce a novel architecture, MDDformer, specifically designed to capture non-verbal behavioral cues associated with depressive states. Extensive experimental evaluations conducted on LMVD demonstrate the superior performance of MDDformer in depression detection tasks. We anticipate that LMVD will become a valuable benchmark resource for the research community, facilitating progress in multimodal, real-world depression recognition. The dataset and source code will be made publicly available at: https://github.com/helang818/LMVD. © 2025 Elsevier B.V., All rights reserved.

Place, publisher, year, edition, pages
Amsterdam: Elsevier, 2026
Keywords
Deep Learning, Depression Detection, Multimodal, Transformer, Vlog, Behavioral Research, Data Privacy, Human Computer Interaction, Interactive Computer Systems, Large Datasets, Learning Systems, Multimedia Systems, Academic Achievements, Deep Learning, Depression Detection, Large-scales, Multi-modal, Multiple Dimensions, Overall Quality, Quality Of Life, Transformer, Vlog
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-57350 (URN)10.1016/j.inffus.2025.103632 (DOI)001619106900001 ()2-s2.0-105014021546 (Scopus ID)
Available from: 2025-09-18 Created: 2025-09-18 Last updated: 2026-02-06Bibliographically approved
Song, Y., Xu, W., Liu, L., Tiwari, P., Tian, G., Xie, Q. & Peng, M. (2026). MFC4POI: Multi-factor collaboration for next point-of-interest recommendation using large language models. Information Processing & Management, 63(7, Part B), Article ID 104824.
Open this publication in new window or tab >>MFC4POI: Multi-factor collaboration for next point-of-interest recommendation using large language models
Show others...
2026 (English)In: Information Processing & Management, ISSN 0306-4573, E-ISSN 1873-5371, Vol. 63, no 7, Part B, article id 104824Article in journal (Refereed) Published
Abstract [en]

Next point-of-interest (POI) recommendation aims to predict the next location a user will visit based on historical behavioral data. Given their strong natural language understanding and reasoning capabilities, large language models (LLMs) have been increasingly introduced into next POI recommendation, as conventional methods are limited by their numerical representations, which often obscure the inherent semantic information embedded in contextual features However, LLM-based approaches struggle to effectively comprehend multiple factors holistically and express them in natural language. To address this issue, we propose Multi-Factor Collaboration for Next POI Recommendation (MFC4POI), a pretraining and fine-tuning framework that reformulates the next POI recommendation task as a question-answering problem. It leverages the strong comprehension and reasoning capabilities of LLMs to analyze user’s current behavior based on historical trajectories, and real time on-site situation, including the dynamic preferences and nearby geographic information in natural language. Due to the inherent limitations of LLMs in accurately modeling temporal and geographical information, MFC4POI introduces an intent identification agent to capture users’ dynamic preferences based on Spatio-temporal Intent-based Knowledge Graph (STIKG). It alleviates the weak spatial reasoning capability of LLMs while providing clear reasoning paths that make the recommendations more intuitive and coherent. To further improve understanding of users’s real-time on-site situation, MFC4POI integrates geographic information with dynamic preferences by providing nearby POIs that align with these dynamic preferences, thereby further enhancing LLMs’ performance. Experiments on three real-world datasets, containing up to 4000 users and 405000 check-ins, show that MFC4POI consistently outperforms strong baselines. Specifically, it achieves better performance than most baselines on Acc@1 and surpasses all baselines on Acc@5 and Acc@10 across all datasets. © 2026 Published by Elsevier Ltd.

Place, publisher, year, edition, pages
London: Elsevier, 2026
Keywords
Knowledge graph, Large language models, Point-of-interest recommendation
National Category
Computer Sciences Artificial Intelligence
Identifiers
urn:nbn:se:hh:diva-58938 (URN)10.1016/j.ipm.2026.104824 (DOI)001758388700001 ()2-s2.0-105036859276 (Scopus ID)
Note

This work is supported by the Key Project of the National Natural Science Foundation of China (U23A20316), CCF-Tencent Rhino-Bird Open Research Fund (CCF-Tencent RAGR20250115), Wuhan Natural Science Foundation Exploratory Program (Morning Light Program) Project (2026040301020029) and the Intelligent Computing Center of the National Cybersecurity Talent and Innovation Base, Wuhan.

Available from: 2026-05-27 Created: 2026-05-27 Last updated: 2026-05-27Bibliographically approved
Zhang, Y., Yu, Y., Wang, X., Li, X., Liang, H. & Tiwari, P. (2026). Multi-Affection Prompt Learning for Sentiment, Emotion, and Sarcasm Joint Detection in Conversations. Tsinghua Science and Technology, 31(3), 1819-1837
Open this publication in new window or tab >>Multi-Affection Prompt Learning for Sentiment, Emotion, and Sarcasm Joint Detection in Conversations
Show others...
2026 (English)In: Tsinghua Science and Technology, ISSN 1007-0214, E-ISSN 1878-7606, Vol. 31, no 3, p. 1819-1837Article in journal (Refereed) Published
Abstract [en]

As a new trend in Natural Language Processing (NLP), prompt tuning has been explored to provide a reliable answer without requiring massive labeled samples and training learning in sentiment and emotion detection tasks. However, how to effectively encode the commonness and uniquesness across difference affections into prompts sets a limit to the potential of multi-affection joint detection. To fill this gap, we propose a multi-affection prompt (MAP) learning framework that takes both the commonness of multiple affections and the uniqueness of specific affection into consideration. More specifically, two different prompt encoders are first proposed to elaborate the multi-task shared prompt and the task-specific prompt, respectively. Second, a multi-task prompt interaction learning layer is proposed to capture the correlation between the multi-task and task-specific prompts. MAP adopts separate multi-task and task-specific prompts to learn different vectors for different affection tasks, thus mitigating the affection discrepancy of the [MASK] token in the masked language modeling task. Extensive experiments on two benchmark datasets show that our proposed method can significantly improve the multi-task generalization capability of PLMs, and yield better results than other state-of-the-art (SOTA) baselines, by the margin of 2.7% and 3.4%. © The author(s) 2026. The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Place, publisher, year, edition, pages
Beijing: Tsinghua University Press, 2026
Keywords
sentiment analysis, affection detection, prompt learning, multi-task learning
National Category
Computer and Information Sciences Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
urn:nbn:se:hh:diva-58484 (URN)10.26599/TST.2024.9010196 (DOI)001687114600001 ()
Note

Funding information: This paper was partly supported by Hong Kong RGC Theme-based Research Scheme (No. T45-401/22-N), the National Natural Science Foundation of China (No. 62006212), Fellowship from the China Postdoctoral Science Foundation (No. 2023M733907), Natural Science Foundation of Hunan Province of China (No. 242300421412), Foundation of Key Laboratory of Dependable Service Computing in Cyber-Physical-Society (Ministry of Education), and Chongqing University (No. CPSDSC202103).

Available from: 2026-04-02 Created: 2026-04-02 Last updated: 2026-04-22Bibliographically approved
Ning, X., Li, Q., Huang, X., Chen, Q., He, F., Li, W., . . . Liu, X. (2026). Neural Network Optimization Reimagined: Decoupled Techniques for Scratch and Fine-Tuning. IEEE Transactions on Pattern Analysis and Machine Intelligence
Open this publication in new window or tab >>Neural Network Optimization Reimagined: Decoupled Techniques for Scratch and Fine-Tuning
Show others...
2026 (English)In: IEEE Transactions on Pattern Analysis and Machine Intelligence, ISSN 0162-8828, E-ISSN 1939-3539Article in journal (Refereed) Epub ahead of print
Abstract [en]

With the accumulation of resources in the era of big data and the rise of pre-trained models in deep learning, optimizing neural networks for various tasks often involves different strategies for fine-tuning pre-trained models versus training from scratch. However, existing optimizers primarily focus on reducing the loss function by updating model parameters, without fully addressing the unique demands of these two major paradigms. In this paper, we propose DualOpt, a novel approach that decouples optimization techniques specifically tailored for these distinct training scenarios. For training from scratch, we introduce real-time layer-wise weight decay, designed to enhance both convergence and generalization by aligning with the characteristics of weight updates and network architecture. For more importantly fine-tuning, we integrate weight rollback with the optimizer, incorporating a rollback term into each weight update step. This ensures consistency in the weight distribution between upstream and downstream models, effectively mitigating knowledge forgetting and improving fine-tuning performance. Additionally, we extend the layer-wise weight decay to dynamically adjust the rollback levels across layers, adapting to the varying demands of different downstream tasks. Extensive experiments across diverse tasks, including image classification, object detection, semantic segmentation, and instance segmentation, demonstrate the broad applicability and state-of-the-art performance of DualOpt. © 2026 IEEE.

Place, publisher, year, edition, pages
Piscataway: IEEE, 2026
Keywords
fine-tuning, layer-wise penalty, Neural network optimization, training from scratch, weight rollback
National Category
Computer graphics and computer vision Artificial Intelligence Security, Privacy and Cryptography
Identifiers
urn:nbn:se:hh:diva-58936 (URN)10.1109/TPAMI.2026.3683792 (DOI)41979964 (PubMedID)2-s2.0-105036738768 (Scopus ID)
Note

Funding information: This work is funded by Beijing Natural Science Foundation (No. L233036) and National Natural Science Foundation of China (No. 62373343).

Available from: 2026-05-13 Created: 2026-05-13 Last updated: 2026-05-13Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-2851-4260

Search in DiVA

Show all publications