Localization Recall Precision (LRP): A New Performance Metric for Object Detection

Published 4 Jul 2018 in cs.CV | (1807.01696v2)

Abstract: Average precision (AP), the area under the recall-precision (RP) curve, is the standard performance measure for object detection. Despite its wide acceptance, it has a number of shortcomings, the most important of which are (i) the inability to distinguish very different RP curves, and (ii) the lack of directly measuring bounding box localization accuracy. In this paper, we propose 'Localization Recall Precision (LRP) Error', a new metric which we specifically designed for object detection. LRP Error is composed of three components related to localization, false negative (FN) rate and false positive (FP) rate. Based on LRP, we introduce the 'Optimal LRP', the minimum achievable LRP error representing the best achievable configuration of the detector in terms of recall-precision and the tightness of the boxes. In contrast to AP, which considers precisions over the entire recall domain, Optimal LRP determines the 'best' confidence score threshold for a class, which balances the trade-off between localization and recall-precision. In our experiments, we show that, for state-of-the-art (SOTA) object detectors, Optimal LRP provides richer and more discriminative information than AP. We also demonstrate that the best confidence score thresholds vary significantly among classes and detectors. Moreover, we present LRP results of a simple online video object detector which uses a SOTA still image object detector and show that the class-specific optimized thresholds increase the accuracy against the common approach of using a general threshold for all classes. At https://github.com/cancam/LRP we provide the source code that can compute LRP for the PASCAL VOC and MSCOCO datasets. Our source code can easily be adapted to other datasets as well.


Authors (4)

Citations (103)


Summary

The paper introduces LRP Error, which combines bounding box localization accuracy with false positive and false negative rates to address AP's limitations.
The paper shows that Optimal LRP (oLRP) identifies the best confidence score threshold for a class, balancing localization against recall-precision, whereas AP averages precision over the entire recall domain.
The paper evaluates LRP on SOTA detectors such as Faster R-CNN, RetinaNet, and SSD, and shows that the best confidence score thresholds vary across classes and detectors.


The paper introduces a new performance metric for object detection, the Localization Recall Precision (LRP) Error, intended to address the limitations of the widely used Average Precision (AP) measure. Although AP is the accepted standard in object detection, it has two notable shortcomings: it cannot distinguish between very different Recall-Precision (RP) curves, and it does not directly measure bounding box localization accuracy. The paper analyzes these deficiencies and proposes LRP as an alternative that covers localization, false positives, and false negatives in a single measure.
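The first shortcoming can be illustrated with a small sketch. The two RP curves below are hypothetical (they are not taken from the paper), and the step-wise area computation is a simplification of the interpolated AP used by real benchmarks. The point is only that two very different detectors can end up with the same area under the curve.

```python
from typing import List, Tuple


def average_precision(curve: List[Tuple[float, float]]) -> float:
    """Area under a piecewise-constant RP curve.

    `curve` is a list of (recall, precision) points sorted by recall; each
    precision value is assumed to hold on the recall interval that ends at
    its recall value. This is a simplified stand-in for benchmark AP.
    """
    area = 0.0
    prev_recall = 0.0
    for recall, precision in curve:
        area += (recall - prev_recall) * precision
        prev_recall = recall
    return area


# Hypothetical detector A: perfect precision, but it never exceeds recall 0.5.
curve_a = [(0.5, 1.0), (1.0, 0.0)]
# Hypothetical detector B: reaches full recall, but at precision 0.5 throughout.
curve_b = [(1.0, 0.5)]

ap_a = average_precision(curve_a)
ap_b = average_precision(curve_b)

if __name__ == "__main__":
    print(f"AP of A: {ap_a}, AP of B: {ap_b}")
```

Both curves yield the same area, so AP alone cannot tell a precise but low-recall detector from a high-recall but imprecise one.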

Key Contributions

LRP Metric Definition: LRP Error combines three components: bounding box localization error, false positive (FP) rate, and false negative (FN) rate. From it the paper derives the Optimal LRP (oLRP) error, the minimum LRP error achievable over confidence score thresholds. Unlike AP, oLRP identifies the confidence score threshold that best balances localization accuracy against recall-precision.
Component Analysis: Each LRP component captures a distinct aspect of detection performance, with localization quantified by an IoU-based component. Whereas AP considers precision over the entire recall domain, oLRP evaluates the detector at a single optimal operating point.
Experimental Validation: The authors evaluate state-of-the-art (SOTA) detectors, including Faster R-CNN, RetinaNet, and SSD, on established benchmarks such as MSCOCO. The results indicate that oLRP provides richer and more discriminative information than AP. The experiments also show that the best confidence score thresholds vary significantly among classes and detectors.
Threshold Optimization: oLRP yields a confidence score threshold for each class. In the paper's simple online video object detector, built on a SOTA still-image detector, these class-specific thresholds improve accuracy over the common practice of using one general threshold for all classes.
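The summary above does not reproduce the formula, so the following is a minimal sketch rather than the authors' implementation (their code is at the repository linked in the abstract). It assumes the commonly stated form of the definition: the localization error of each true positive, (1 - IoU) / (1 - tau), is summed with the FP and FN counts and divided by N_TP + N_FP + N_FN, where tau is the IoU threshold for accepting a match (0.5 is assumed here). It further assumes that detections have already been matched to ground truth greedily in score order, so that raising the threshold only removes low-scoring detections. The function names and the input layout are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple


def lrp_error(tp_ious: Sequence[float], num_fp: int, num_fn: int,
              tau: float = 0.5) -> float:
    """LRP error for one class at one confidence threshold (assumed form).

    tp_ious: IoU of each true positive with its matched ground-truth box.
    tau:     IoU threshold used to validate a true positive (assumption: 0.5).
    """
    total = len(tp_ious) + num_fp + num_fn
    if total == 0:
        return 0.0  # assumption: no boxes and no detections means no error
    localization = sum((1.0 - iou) / (1.0 - tau) for iou in tp_ious)
    return (localization + num_fp + num_fn) / total


def optimal_lrp(detections: Sequence[Tuple[float, Optional[float]]],
                num_gt: int, tau: float = 0.5) -> Tuple[float, Optional[float]]:
    """Minimum LRP error over candidate thresholds, and the threshold reaching it.

    detections: (score, iou) pairs; iou is None for an unmatched detection (FP).
    num_gt:     number of ground-truth boxes of this class.
    """
    # Keeping no detection at all makes every ground-truth box a false negative.
    best_error = lrp_error([], 0, num_gt, tau)
    best_threshold: Optional[float] = None
    for threshold in sorted({score for score, _ in detections}):
        kept = [iou for score, iou in detections if score >= threshold]
        tp_ious = [iou for iou in kept if iou is not None]
        num_fp = len(kept) - len(tp_ious)
        num_fn = num_gt - len(tp_ious)
        error = lrp_error(tp_ious, num_fp, num_fn, tau)
        if error < best_error:
            best_error, best_threshold = error, threshold
    return best_error, best_threshold


# Hypothetical detections for one class with three ground-truth boxes.
example = [(0.9, 0.8), (0.8, 0.9), (0.6, None), (0.4, 0.6), (0.3, None), (0.2, None)]
olrp, best_score = optimal_lrp(example, num_gt=3)
```

In this toy example the minimum is reached by keeping only the two highest-scoring detections: admitting the third true positive at score 0.4 also admits a false positive and a loosely localized box, which costs more than the false negative it removes.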

Implications and Future Directions

LRP and oLRP advance the evaluation of object detection models by integrating localization accuracy directly into the performance measure. Adopting them could change how detectors are evaluated and ranked in future research and benchmarks, and could shift attention toward metrics that reflect a detector's behavior at its deployed operating point. The paper also opens avenues for further study, such as refining LRP's parameters and testing its applicability across detection tasks and environments, including scenarios with varying object scales or occlusion levels.

Moreover, as the field moves toward more dynamic and real-time detection tasks, such as video object detection, LRP's class-specific threshold optimization may prove useful. Future research could explore integrating LRP with emerging detection paradigms, particularly methods that demand high localization accuracy and tailored precision-recall configurations. The source code provided for computing LRP on PASCAL VOC and MSCOCO gives the community a starting point for adaptation and further empirical evaluation.
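As a rough illustration of how class-specific thresholds might be applied at inference time, the sketch below filters detections with one threshold per class and falls back to a general threshold for classes without one. The tuple layout, the function name, and the threshold values are hypothetical; in practice each threshold would be chosen per class on a validation set, for example as the score that minimizes LRP.

```python
from typing import Dict, List, Tuple

# Hypothetical detection layout: (class label, confidence score, box as x1, y1, x2, y2).
Detection = Tuple[str, float, Tuple[float, float, float, float]]


def filter_detections(detections: List[Detection],
                      class_thresholds: Dict[str, float],
                      general_threshold: float = 0.5) -> List[Detection]:
    """Keep a detection if its score reaches the threshold of its own class.

    Classes missing from `class_thresholds` use `general_threshold`.
    """
    return [
        det for det in detections
        if det[1] >= class_thresholds.get(det[0], general_threshold)
    ]


# Hypothetical frame output and thresholds, for illustration only.
frame_detections: List[Detection] = [
    ("person", 0.55, (10.0, 10.0, 50.0, 120.0)),
    ("dog", 0.55, (60.0, 80.0, 110.0, 130.0)),
    ("cat", 0.30, (5.0, 5.0, 20.0, 20.0)),
]
per_class = filter_detections(frame_detections, {"person": 0.6, "dog": 0.4})
general = filter_detections(frame_detections, {})
```

With the general threshold both the person and the dog are kept; with the per-class values only the dog survives, which shows how the same scores can lead to different outputs once thresholds are tuned per class.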

In conclusion, LRP offers a single metric that captures both the recall-precision behavior and the spatial accuracy of detections, and oLRP ties that metric to a concrete operating threshold, which could help practitioners build and compare detectors more reliably.
