Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

Title
Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection
Publication Date
2023-05-05
Author(s)
Shepley, Andrew J
( author )
OrcID: https://orcid.org/0000-0001-7511-4967
Email: asheple2@une.edu.au
UNE Id une-id:asheple2
Falzon, Gregory
( author )
OrcID: https://orcid.org/0000-0002-1989-9357
Email: gfalzon2@une.edu.au
UNE Id une-id:gfalzon2
Kwan, Paul
Brankovic, Ljiljana
( author )
OrcID: https://orcid.org/0000-0002-5056-4627
Email: lbrankov@une.edu.au
UNE Id une-id:lbrankov
Type of document
Journal Article
Language
en
Entity Type
Publication
Publisher
IEEE
Place of publication
United State of America
DOI
10.1109/TPAMI.2023.3273210
UNE publication id
une:1959.11/61991
Abstract

Confluence is a novel non-Intersection over Union (IoU) alternative to Non-Maxima Suppression (NMS) in bounding box post-processing in object detection. It overcomes the inherent limitations of IoU-based NMS variants to provide a more stable, consistent predictor of bounding box clustering by using a normalized Manhattan Distance inspired proximity metric to represent bounding box clustering. Unlike Greedy and Soft NMS, it does not rely solely on classification confidence scores to select optimal bounding boxes, instead selecting the box which is closest to every other box within a given cluster and removing highly confluent neighboring boxes. Confluence is experimentally validated on the MS COCO and CrowdHuman benchmarks, improving Average Precision by 0.2--2.7% and 1--3.8% respectively and Average Recall by 1.3--9.3 and 2.4--7.3% when compared against Greedy and Soft-NMS variants. Quantitative results are supported by extensive qualitative analysis and threshold sensitivity analysis experiments support the conclusion that Confluence is more robust than NMS variants. Confluence represents a paradigm shift in bounding box processing, with potential to replace IoU in bounding box regression processes.

Link
Citation
IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(10), p. 11561-11574
ISSN
0162-8828
Start page
11561
End page
11574

Files:

NameSizeformatDescriptionLink