Gradient boosted bagging for evolving data stream regression
2025 (English)In: Data mining and knowledge discovery, ISSN 1384-5810, E-ISSN 1573-756X, Vol. 39, no 5, p. 1-37, article id 65Article in journal (Refereed) Published
Abstract [en]
Gradient boosting has been extensively studied in batch learning. Recently, its streaming adaptation, Streaming Gradient Boosted Trees (Sgbt), has surpassed existing state-of-the-art random subspace and random patches methods for streaming classification under various drift scenarios. However, its application in streaming regression remains unexplored. Vanilla Sgbt with squared loss exhibits high variance when applied to streaming regression problems. To address this, we utilize bagging streaming regressors in this work to create Streaming Gradient Boosted Regression (Sgbr). Bagging streaming regressors are employed in two ways: first, as base learners within the existing Sgbt framework, and second, as an ensemble method that aggregates multiple Sgbts. Our extensive experiments on 11 streaming regression datasets, encompassing multiple drift scenarios, demonstrate that the Sgb(Oza), a variant of the first Sgbr category, significantly outperforms current state-of-the-art streaming regression methods in terms of both predictive power and computational cost. © The Author(s) 2025.
Place, publisher, year, edition, pages
New York: Springer, 2025. Vol. 39, no 5, p. 1-37, article id 65
Keywords [en]
Stream learning, Gradient boosting, Gradient boosted bagging, Gradient boosted regression, Data stream boosting, Streaming regression, Streaming gradient boosting, Streaming gradient boosted bagging, Streaming gradient boosted regression
National Category
Computer Vision and Learning Systems
Identifiers
URN: urn:nbn:se:hh:diva-57235DOI: 10.1007/s10618-025-01147-xISI: 001545424200003Scopus ID: 2-s2.0-105012753249OAI: oai:DiVA.org:hh-57235DiVA, id: diva2:1991818
Funder
Halmstad UniversityKnowledge FoundationVinnova
Note
Nuwan Gunasekara would like to acknowledge the support from the Knowledge Foundation (KKs) and Vinnova (Sweden's innovation agency).Heitor Murilo Gomes acknowledges the financial support of the Marsden Fund under award number VUW2213. Bernhard Pfahringer and Albert Bifet would like to acknowledge the support from the Time-Evolving Data Science / Artificial Intelligence for Advanced Open Environmental Science (TAIAO) project.
Open access funding provided by Halmstad University.
2025-08-252025-08-252025-10-28Bibliographically approved