Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/19179
Title: Towards a segmentation and recognition-free approach for content-based document image retrieval of handwritten queries
Contributor(s): Chatbri, Houssem (author); Kameyama, Keisuke (author); Kwan, Paul H  (author)
Publication Date: 2015
DOI: 10.1109/ACPR.2015.7486483
Handle Link: https://hdl.handle.net/1959.11/19179
Abstract: We introduce a method for content-based document image retrieval (CBDIR) of handwritten queries that is both segmentation and recognition-free. We first demonstrate that our method is underpinned by a theoretical model that exploits the Bayes' rule. Next, we present an algorithmic implementation that takes into account real world retrieval challenges caused by handwriting fluctuations and style variations. Our algorithm operates as follows: First, a number of connected components of the query are matched against the connected components of the document image using shape features. A similarity threshold is used to select the connected components of the document image that are most similar to the query components. Then, the selected components are used to detect candidate occurrences of the query in the document image by using size-adaptive bounding boxes. Finally, a score is calculated for each candidate occurrence and used for ranking. We conduct a comparative evaluation of our method on a dataset of 200 printed document images, by executing 40 printed and 200 handwritten queries of mathematical expressions. Experimental results demonstrate competitive performances expressed by P-Recall = 100%, A-Recall = 99.95% for printed queries, and P-Recall = 73.5%, A-Recall = 57.92% for handwritten queries, outperforming a state-of-the-art CBDIR algorithm.
Publication Type: Conference Publication
Conference Details: ACPR 2015: 3rd Asian Conference on Pattern Recognition, Kuala Lumpur, Malaysia, 3rd - 6th November, 2015
Source of Publication: Proceedings of the Third IAPR Asian Conference on Pattern Recognition (ACPR 2015), p. 146-150
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Place of Publication: Los Alamitos, United States of America
Fields of Research (FoR) 2008: 080106 Image Processing
080109 Pattern Recognition and Data Mining
080104 Computer Vision
Fields of Research (FoR) 2020: 460306 Image processing
461199 Machine learning not elsewhere classified
460301 Active sensing
Socio-Economic Objective (SEO) 2008: 890404 Publishing and Print Services (incl. Internet Publishing)
970108 Expanding Knowledge in the Information and Computing Sciences
890201 Application Software Packages (excl. Computer Games)
Socio-Economic Objective (SEO) 2020: 220503 Publishing and print services
280115 Expanding knowledge in the information and computing sciences
220401 Application software packages
Peer Reviewed: Yes
HERDC Category Description: E1 Refereed Scholarly Conference Publication
Appears in Collections:Conference Publication

Files in This Item:
3 files
File Description SizeFormat 
Show full item record

SCOPUSTM   
Citations

7
checked on Dec 7, 2024

Page view(s)

1,348
checked on Apr 7, 2024
Google Media

Google ScholarTM

Check

Altmetric


Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.