hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Agreeing to disagree: active learning with noisy labels without crowdsourcing
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research.ORCID iD: 0000-0002-2859-6155
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research.ORCID iD: 0000-0002-7796-5201
The University of South Dakota, Vermillion, South Dakota, USA.ORCID iD: 0000-0003-4176-0236
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research.ORCID iD: 0000-0003-2185-8973
2018 (English)In: International Journal of Machine Learning and Cybernetics, ISSN 1868-8071, E-ISSN 1868-808X, Vol. 9, no 8, p. 1307-1319Article in journal (Refereed) Published
Abstract [en]

We propose a new active learning method for classification, which handles label noise without relying on multiple oracles (i.e., crowdsourcing). We propose a strategy that selects (for labeling) instances with a high influence on the learned model. An instance x is said to have a high influence on the model h, if training h on x (with label y = h(x)) would result in a model that greatly disagrees with h on labeling other instances. Then, we propose another strategy that selects (for labeling) instances that are highly influenced by changes in the learned model. An instance x is said to be highly influenced, if training h with a set of instances would result in a committee of models that agree on a common label for x but disagree with h(x). We compare the two strategies and we show, on different publicly available datasets, that selecting instances according to the first strategy while eliminating noisy labels according to the second strategy, greatly improves the accuracy compared to several benchmarking methods, even when a significant amount of instances are mislabeled. © Springer-Verlag Berlin Heidelberg 2017

Place, publisher, year, edition, pages
Heidelberg: Springer, 2018. Vol. 9, no 8, p. 1307-1319
Keywords [en]
Active learning, Classification, Label noise, Mislabeling, Interactive learning, Machine learning, Data mining
National Category
Signal Processing Computer Systems Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-33365DOI: 10.1007/s13042-017-0645-0ISI: 000438855100006Scopus ID: 2-s2.0-85050140726OAI: oai:DiVA.org:hh-33365DiVA, id: diva2:1077485
Available from: 2017-02-27 Created: 2017-02-27 Last updated: 2020-02-03Bibliographically approved

Open Access in DiVA

BougueliaAL(4055 kB)1400 downloads
File information
File name FULLTEXT01.pdfFile size 4055 kBChecksum SHA-512
9aacfa1f3fce5e3aba1715874af2f2e26181ee5424f36380b92bba06b1f3dd54506be4ca4350256b59d04dc4bb38c2860e71c537f631c47f7ed9a1ce18d1a5d8
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Bouguelia, Mohamed-RafikNowaczyk, SławomirSantosh, K. C.Verikas, Antanas

Search in DiVA

By author/editor
Bouguelia, Mohamed-RafikNowaczyk, SławomirSantosh, K. C.Verikas, Antanas
By organisation
CAISR - Center for Applied Intelligent Systems Research
In the same journal
International Journal of Machine Learning and Cybernetics
Signal ProcessingComputer SystemsComputer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 1400 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 1394 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf