Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/16424
Title: An Application-Independent and Segmentation-Free Approach for Spotting Queries in Document Images
Contributor(s): Chatbri, Houssem (author); Kwan, Paul H  (author); Kameyama, Keisuke (author)
Publication Date: 2014
DOI: 10.1109/ICPR.2014.498
Handle Link: https://hdl.handle.net/1959.11/16424
Abstract: We report our ongoing research on an application-independent and segmentation-free approach for spotting queries in document images. Built on our earlier work reported in [1][2], this paper introduces an image processing approach that finds occurrences of a query, which is a multi-part object, in a document image, through 5 steps: (1) Preprocessing for image normalization and connected components extraction. (2) Feature Extraction from connected components. (3) Matching of the query and document image connected components' feature vectors. (4) Voting for determining candidate occurrences in the document image that are similar to the query. (5) Candidate Filtering for detecting relevant occurrences and filtering out irrelevant patterns. Compared to existing methods, our contributions are twofold: Our approach is designed to deal with any type of queries, without restriction to a particular class such as words or mathematical expressions. Second, it does not apply a domain-specific segmentation to extract regions of interest from the document image, such as text paragraphs or mathematical calculations. Instead, it considers all the image information. Experimental evaluation using scanned journal images show promising performances and possibility of further improvement.
Publication Type: Conference Publication
Conference Details: ICPR 2014: 22nd International Conference on Pattern Recognition, Stockholm, Sweden, 24th - 28th August, 2014
Source of Publication: Proceedings of the 22nd International Conference on Pattern Recognition (ICPR), p. 2891-2896
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Place of Publication: Los Alamitos, United States of America
ISSN: 1051-4651
Fields of Research (FoR) 2008: 080109 Pattern Recognition and Data Mining
080106 Image Processing
080104 Computer Vision
Fields of Research (FoR) 2020: 461199 Machine learning not elsewhere classified
460306 Image processing
460301 Active sensing
Socio-Economic Objective (SEO) 2008: 970110 Expanding Knowledge in Technology
890201 Application Software Packages (excl. Computer Games)
970108 Expanding Knowledge in the Information and Computing Sciences
Socio-Economic Objective (SEO) 2020: 220401 Application software packages
280115 Expanding knowledge in the information and computing sciences
Peer Reviewed: Yes
HERDC Category Description: E1 Refereed Scholarly Conference Publication
Appears in Collections:Conference Publication

Files in This Item:
3 files
File Description SizeFormat 
Show full item record

SCOPUSTM   
Citations

6
checked on Jul 6, 2024

Page view(s)

1,336
checked on Jul 7, 2024
Google Media

Google ScholarTM

Check

Altmetric


Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.