Unsupervised anomaly detection for structured data - Finding similarities between retail products
Halmstad University, School of Information Technology.
Halmstad University, School of Information Technology.
2021 (English). Independent thesis, Advanced level (professional degree), 20 credits / 30 HE credits. Student thesis
Abstract [en]

Data is one of the most important factors in modern business operations. Bad data can therefore lead to tremendous losses, both financially and in customer experience. This thesis seeks to find anomalies in real-world, complex, structured data that cause an international enterprise to miss out on income and risk losing customers. Using graph theory and similarity analysis, the findings suggest that certain countries contribute to the discrepancies more than others. This is believed to be an effect of countries customizing their products to match the market’s needs. This thesis only scratches the surface of the analysis of the data, and there are therefore many opportunities for future work.
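As a rough illustration of how graph theory and similarity analysis can be combined to surface discrepancies, the following is a minimal sketch and not the thesis's actual method. It assumes each country's listing of a product can be reduced to a set of attribute strings, uses Jaccard similarity as a stand-in similarity measure, and builds the graph as a plain adjacency dictionary rather than with NetworkX so that it runs without dependencies. The country codes, attributes, and the 0.5 threshold are all hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two attribute sets (1.0 means identical)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def similarity_graph(products, threshold=0.5):
    """Link every pair of listings whose similarity reaches the threshold."""
    graph = {name: set() for name in products}
    for u, v in combinations(products, 2):
        if jaccard(products[u], products[v]) >= threshold:
            graph[u].add(v)
            graph[v].add(u)
    return graph


def isolated_nodes(graph):
    """Listings with no similar neighbour are candidate anomalies."""
    return sorted(node for node, neighbours in graph.items() if not neighbours)


# Hypothetical data: one product as listed in four countries.
products = {
    "SE": {"color:red", "size:M", "material:cotton"},
    "NO": {"color:red", "size:M", "material:cotton"},
    "DE": {"color:red", "size:M", "material:wool"},
    "FI": {"color:blue", "size:XL", "material:nylon"},
}

graph = similarity_graph(products, threshold=0.5)
anomalies = isolated_nodes(graph)
print(anomalies)
```

In this toy data the Finnish listing shares no attributes with the others and ends up disconnected, which is the kind of country-level discrepancy the abstract describes. A real pipeline would need a similarity measure and threshold chosen for the data at hand.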

Place, publisher, year, edition, pages
2021. p. 82
Keywords [en]
relational data, similarity analysis, data analysis, SQL, NetworkX, graph theory, anomaly detection, unsupervised, retail products, real-world data, AWS, amazon web services, similarity learning, data statistics, data preprocessing, similarity analysis algorithm, data validation
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-44756
OAI: oai:DiVA.org:hh-44756
DiVA, id: diva2:1567459
External cooperation
Jayway
Subject / course
Computer science and engineering
Educational program
Computer Science and Engineering, 300 credits
Available from: 2021-06-02. Created: 2021-06-16. Last updated: 2021-06-17. Bibliographically approved.

Open Access in DiVA

fulltext (8411 kB), 520 downloads
File information
File name: FULLTEXT02.pdf
File size: 8411 kB
Checksum (SHA-512): b2da053af9aadb187d2ccbb94ede028d459dd4a37322f9dd2adcf1b91fc027a981e32813f380e3d9b5d7847636ab9804d80a3f80e21eefe38a90cc7381287283
Type: fulltext
Mimetype: application/pdf
