Explain your clusters with words: The role of metadata in interactive clustering
2022 (English)In: Proceedings of the Workshop on Semantic Techniques for Narrative-Based Understanding co-located with 31st International Joint Conference on Artificial Intelligence and the 25th European Conference on Artificial Intelligence (IJCAI-ECAI 2022) / [ed] Lise Stork; Katrien Beuls; Luc Steels, Aachen: Rheinisch-Westfaelische Technische Hochschule Aachen , 2022, Vol. 3322, p. 53-59Conference paper, Published paper (Refereed)
Abstract [en]
In this preliminary work, we present an approach for the augmentation of clustering with natural language explanations. In clustering there are 2 main challenges: a) choice of a proper, reasonable number of clusters, and b) cluster analysis and profiling. There is a plethora of technics for a) but not so much for b), which is in general a laborious task of explaining obtained clusters. We propose a method that aids experts in cluster analysis by providing an iterative, human-in-the-loop methodology of generating cluster explanations. In an illustrative example, we show how the process of clustering on a set of objective variables could be facilitated with textual metadata. In our case, images of products from online fashion store are used for clustering. Then, product descriptions are used for profiling clusters. © 2022 Copyright for this paper by its authors.
Place, publisher, year, edition, pages
Aachen: Rheinisch-Westfaelische Technische Hochschule Aachen , 2022. Vol. 3322, p. 53-59
Series
CEUR Workshop Proceedings, ISSN 1613-0073 ; 3322
Keywords [en]
Iterative methods, Metadata, Clusterings, Human-in-the-loop, Natural language explanations, Number of clusters, On-line fashion, Product descriptions, Textual metadata
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-49832Scopus ID: 2-s2.0-85146122393OAI: oai:DiVA.org:hh-49832DiVA, id: diva2:1727552
Conference
IJCAI-ECAI 2022, Workshop on semantic techniques for narrative-based understanding, Vienna, Austria, July 24, 2022
Funder
Swedish Research Council, CHIST-ERA-19-XAI-012
Note
This paper is funded from the XPM (ExplainablePredictive Maintenance) project funded by the National Science Center, Poland under CHIST-ERAprogramme Grant Agreement No. 857925 (NCNUMO-2020/02/Y/ST6/00070).
The work of Szymon Bobek has been additionallysupported by a HuLCKA grant from the PriorityResearch Area (Digiworld) under the Strategic Programme Excellence Initiative at the JagiellonianUniversity (U1U/P06/NO/02.16).
The work of Samaneh Jamshidi was supportedby CHIST-ERA grant CHIST-ERA-19-XAI-012funded by Swedish Research Council.
2023-01-162023-01-162023-02-15Bibliographically approved