A base-line character recognition for Syriac-Aramaic
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS).
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS). ORCID iD: 0000-0002-4929-1262
2007 (English). In: IEEE International Conference on Systems Man and Cybernetics Conference Proceedings, Piscataway, N.J.: IEEE Press, 2007, Vol. 1-8, p. 1048-1055. Conference paper, Published paper (Other academic)
Abstract [en]

Serto is the cursive alphabet of Syriac-Aramaic, and it is the script used by the largest corpus of Aramaic documents held in libraries. A lingua franca, and often a source language, Aramaic has influenced major Judaic, Christian and Islamic thought as well as the development of science. The script is cursive, like Arabic, and consequently has a handwritten appearance compared to Latin. Serto, and in practice Aramaic as a whole, has no automatic character recognition (OCR) system. Most library documents are reproductions using printed characters. Readers would strongly benefit from an OCR, as these reproductions are predominantly books printed in the pre-computer era. We propose a segmentation-free OCR using linear symmetry features with an individual threshold for the tensors of the characters, and an ordered search sequence. It yields about 90% correctly identified characters on average. As a first recognition scheme for Serto, it represents a base-line OCR for Syriac-Aramaic.

Place, publisher, year, edition, pages
Piscataway, N.J.: IEEE Press, 2007. Vol. 1-8, p. 1048-1055
Series
IEEE International Conference on Systems Man and Cybernetics Conference Proceedings, ISSN 1062-922X
National Category
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:hh:diva-14932
DOI: 10.1109/ICSMC.2007.4414012
ISI: 000255016302096
Scopus ID: 2-s2.0-40949122349
ISBN: 978-1-4244-0991-4 (print)
OAI: oai:DiVA.org:hh-14932
DiVA, id: diva2:408392
Conference
IEEE International Conference on Systems, Man and Cybernetics, 7-10 Oct. 2007, Montreal, Que.
Available from: 2011-04-04 Created: 2011-04-04 Last updated: 2020-05-12 Bibliographically approved

Authority records

Tse, Elizabeth; Bigun, Josef
