hh.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Structural and Syntactic Techniques for Recognition of Ethiopic Characters
Addis Ababa University, Department of Computer Science, Addis Ababa, Ethiopia .
Högskolan i Halmstad, Akademin för informationsteknologi, Halmstad Embedded and Intelligent Systems Research (EIS), Intelligenta system (IS-lab).ORCID-id: 0000-0002-4929-1262
2006 (Engelska)Ingår i: Structural, syntactic, and statistical pattern recognition joint IAPR international workshops SSPR 2006 and SPR 2006, Hong Kong, China, August 17-19, 2006 : proceedings: Lecture Notes in Computer Sciences (Volume 4109/2006), Berlin: Springer Berlin/Heidelberg, 2006, s. 118-126Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

OCR technology of Latin scripts is well advanced in comparison to other scripts. However, the available results from Latin are not always sufficient to directly adopt them for other scripts such as the Ethiopic script. In this paper, we propose a novel approach that uses structural and syntactic techniques for recognition of Ethiopic characters. We reveal that primitive structures and their spatial relationships form a unique set of patterns for each character. The relationships of primitives are represented by a special tree structure, which is also used to generate a pattern. A knowledge base of the alphabet that stores possibly occurring patterns for each character is built. Recognition is then achieved by matching the generated pattern against each pattern in the knowledge base. Structural features are extracted using direction field tensor. Experimental results are reported, and the recognition system is insensitive to variations on font types, sizes and styles.

Ort, förlag, år, upplaga, sidor
Berlin: Springer Berlin/Heidelberg, 2006. s. 118-126
Serie
Lecture Notes in Computer Science, ISSN 0302-9743 ; 4109
Nyckelord [en]
Pattern recognition, Image analysis, OCR
Nationell ämneskategori
Teknik och teknologier
Identifikatorer
URN: urn:nbn:se:hh:diva-2166DOI: 10.1007/11815921ISI: 000240075100012Scopus ID: 2-s2.0-33749587617Lokalt ID: 2082/2563ISBN: 978-3-540-37236-3 (tryckt)OAI: oai:DiVA.org:hh-2166DiVA, id: diva2:239384
Konferens
Joint IAPR International Workshops, SSPR 2006 and SPR 2006, Hong Kong, China, August 17-19, 2006
Tillgänglig från: 2008-11-27 Skapad: 2008-11-27 Senast uppdaterad: 2018-03-23Bibliografiskt granskad
Ingår i avhandling
1. Multifont recognition System for Ethiopic Script
Öppna denna publikation i ny flik eller fönster >>Multifont recognition System for Ethiopic Script
2006 (Engelska)Licentiatavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

In this thesis, we present a general framework for multi-font, multi-size and multi-style Ethiopic character recognition system. We propose structural and syntactic techniques for recognition of Ethiopic characters where the graphically comnplex characters are represented by less complex primitive structures and their spatial interrelationships. For each Ethiopic character, the primitive structures and their spatial interrelationships form a unique set of patterns.

The interrelationships of primitives are represented by a special tree structure which resembles a binary search tree in the sense that it groups child nodes as left and right, and keeps the spatial position of primitives in orderly manner. For a better computational efficiency, the primitive tree is converted into string pattern using in-order traversal, which generates a base of the alphabet that stores possibly occuring string patterns for each character. The recognition of characters is then achieved by matching the generated patterns with each pattern in a stored knowledge base of characters.

Structural features are extracted using direction field tensor, which is also used for character segmentation. In general, the recognition system does not need size normalization, thinning or other preprocessing procedures. The only parameter that needs to be adjusted during the recognition process is the size of Gaussian window which should be chosen optimally in relation to font sizes. We also constructed an Ethiopic Document Image Database (EDIDB) from real life documents and the recognition system is tested with respect to variations in font type, size, style, document skewness and document type. Experimental results are reported.

Ort, förlag, år, upplaga, sidor
Göteborg: Department of Signals and Systems, Chalmers University of Technology, 2006. s. 46
Serie
Technical report ; 2006:21
Nyckelord
Ethiopic character recognition, OCR, Multifont recognition, Amharic, Direction fields, Structural and syntactic pattern recognition
Nationell ämneskategori
Datorseende och robotik (autonoma system)
Identifikatorer
urn:nbn:se:hh:diva-1978 (URN)2082/2373 (Lokalt ID)2082/2373 (Arkivnummer)2082/2373 (OAI)
Presentation
(Engelska)
Handledare
Tillgänglig från: 2008-09-29 Skapad: 2008-09-29 Senast uppdaterad: 2018-03-23Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Personposter BETA

Assabie, YaregalBigun, Josef

Sök vidare i DiVA

Av författaren/redaktören
Assabie, YaregalBigun, Josef
Av organisationen
Intelligenta system (IS-lab)
Teknik och teknologier

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetricpoäng

doi
isbn
urn-nbn
Totalt: 420 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf