Independent Component Analysis Segmentation Algorithm

Title
Independent Component Analysis Segmentation Algorithm
Publication Date
2005
Author(s)
Chen, Yan
Leedham, Graham
Type of document
Conference Publication
Language
en
Entity Type
Publication
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Place of publication
Los Alamitos, United States of America
DOI
10.1109/ICDAR.2005.140
UNE publication id
une:6825
Abstract
In this paper we propose and investigate a new segmentation algorithm called the ICA (independent component analysis) segmentation algorithm and compare it against other existing overlapping strokes segmentation algorithms. The ICA segmentation algorithm converts the original touching or overlapping word components into a blind source matrix and then calculates the weighted value matrix before the values are re-evaluated using a fast ICA model. The readjusted weighted value matrix is applied to the blind source matrix to separate the word components. The algorithm has been evaluated on 30 overlapped document images from the CEDAR letter database and another 30 degraded historical document images, which contain many different kinds of overlapping and touching words in adjacent lines. Quantitative analysis of the results by measuring text recall and qualitative assessment of processed document image quality are reported. The ICA segmentation algorithm is demonstrated to be effective at resolving the problem in varying forms of overlapping or touching text lines.
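The abstract names a fast ICA model but does not say how the blind source matrix is built from the word images. As a rough illustration of the general technique only, the sketch below runs a textbook symmetric FastICA (tanh nonlinearity) on two synthetic 1-D signals that have been linearly mixed. It is not the authors' implementation: NumPy, the synthetic sources, the mixing matrix `A`, and all function names are assumptions made for this example.

```python
import numpy as np

def whiten(X):
    """Center each row of X, then decorrelate the rows to unit variance."""
    X = X - X.mean(axis=1, keepdims=True)
    eigvals, eigvecs = np.linalg.eigh(np.cov(X))
    K = eigvecs @ np.diag(1.0 / np.sqrt(eigvals)) @ eigvecs.T
    return K @ X

def sym_decorrelate(W):
    """Make the rows of W orthonormal: W <- (W W^T)^(-1/2) W."""
    s, u = np.linalg.eigh(W @ W.T)
    return u @ np.diag(1.0 / np.sqrt(s)) @ u.T @ W

def fast_ica(X, n_iter=200, tol=1e-6, seed=0):
    """Symmetric FastICA; X has shape (n_signals, n_samples)."""
    Z = whiten(X)
    n, m = Z.shape
    rng = np.random.default_rng(seed)
    W = sym_decorrelate(rng.standard_normal((n, n)))
    for _ in range(n_iter):
        G = np.tanh(W @ Z)                 # g(w_i^T z) for every row w_i
        G_prime = 1.0 - G ** 2             # derivative of tanh
        # Fixed-point update: E[z g(w^T z)] - E[g'(w^T z)] w, for all rows at once
        W_new = (G @ Z.T) / m - np.diag(G_prime.mean(axis=1)) @ W
        W_new = sym_decorrelate(W_new)
        # Converged when every row is (up to sign) unchanged
        converged = np.max(np.abs(np.abs(np.diag(W_new @ W.T)) - 1.0)) < tol
        W = W_new
        if converged:
            break
    return W @ Z, W

# Hypothetical demo data: two independent non-Gaussian sources, linearly mixed.
rng = np.random.default_rng(42)
S = np.vstack([rng.uniform(-1.0, 1.0, 5000), rng.laplace(0.0, 1.0, 5000)])
A = np.array([[1.0, 0.5], [0.5, 1.0]])     # assumed mixing matrix
X = A @ S                                   # observed mixtures
S_est, W = fast_ica(X)                      # estimated sources and unmixing matrix
```

ICA recovers sources only up to order, sign, and scale, so the rows of `S_est` should be compared with the rows of `S` by absolute correlation rather than by direct equality.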
Citation
Proceedings of the 2005 Eighth International Conference on Document Analysis and Recognition (ICDAR'05), v.2, p. 680-684
ISSN
1520-5263
ISBN
0769524206
Start page
680
End page
684
