hh.sePublications
89101112131411 of 45
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Forensic Authorship Analysis of Microblogging Texts Using -Grams and Stylometric Features
Halmstad University, School of Information Technology.
Institute of Computer Science, University of Tartu, Tartu, Estonia.
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research.ORCID iD: 0000-0002-1400-346X
2020 (English)Conference paper, Published paper (Refereed)
Abstract [en]

In recent years, messages and text posted on the Internet are used in criminal investigations. Unfortunately, the authorship of many of them remains unknown. In some channels, the problem of establishing authorship may be even harder, since the length of digital texts is limited to a certain number of characters. In this work, we aim at identifying authors of tweet messages, which are limited to 280 characters. We evaluate popular features employed traditionally in authorship attribution which capture properties of the writing style at different levels. We use for our experiments a self-captured database of 40 users, with 120 to 200 tweets per user. Results using this small set are promising, with the different features providing a classification accuracy between 92% and 98.5%. These results are competitive in comparison to existing studies which employ short texts such as tweets or SMS. ©2020 IEEE 

Place, publisher, year, edition, pages
IEEE, 2020.
Keywords [en]
Authorship Identification, Authorship Attribution, Stylometry, N-Grams, Microblogging, Forensics
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:hh:diva-41798OAI: oai:DiVA.org:hh-41798DiVA, id: diva2:1416649
Conference
8th International Workshop on Biometrics and Forensics (IWBF 2020), Porto, Portugal, April 29-30, 2020
Funder
Swedish Research CouncilKnowledge Foundation
Note

Other funding: European Social Fund via IT Academy programme.

Available from: 2020-03-24 Created: 2020-03-24 Last updated: 2020-03-26

Open Access in DiVA

fulltext(565 kB)3 downloads
File information
File name FULLTEXT01.pdfFile size 565 kBChecksum SHA-512
c2e477c8ce76163e1612be0a69cfa44037170e0db3afdee740109095078cb744d356ce2ede898cfa1626c7f56555be7aac65f7b2552412163ec6d44e46c35364
Type fulltextMimetype application/pdf

Authority records BETA

Alonso-Fernandez, Fernando

Search in DiVA

By author/editor
Alonso-Fernandez, Fernando
By organisation
School of Information TechnologyCAISR - Center for Applied Intelligent Systems Research
Signal Processing

Search outside of DiVA

GoogleGoogle Scholar
Total: 3 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 16 hits
89101112131411 of 45
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf