TY - JOUR
T1 - Efficient annotation reduction with active learning for computer vision-based Retail Product Recognition
AU - Griffioen, Niels
AU - Ranković, Nevena
AU - Zamberlan, Federico
AU - Punith, Monisha
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/4
Y1 - 2024/4
N2 - The retail industry encounters huge obstacles with computer vision (CV) technology due to frequent model retraining with changing products and time-consuming, costly data annotation. Previous research in this field has been primarily focused on optimizing model performance rather than minimizing annotation effort. Therefore, the main idea of this paper is to evaluate active learning as a method to minimize annotation effort in the retail industry. The MVTEC Densely Segmented Supermarket dataset is used to evaluate various active learning methods such as the Least Confident, Entropy and Cost-Effective Active Learning (CEAL) along with Mask R-CNN model. The results demonstrate that annotating only 20.83-24.34% of the data achieves 95% of the full dataset’s performance. When training, out-of-sample data share similar characteristics, the Least Confident and CEAL methods reduce annotation requirements by 7.7-15.7% while maintaining 95% and 97% of the full dataset’s performance. However, the Entropy method under-performs compared to the random selection baseline. Ultimately, none of the methods show a clear advantage when the data characteristics differ between training and out-of-sample data. Finally, the proposed active learning methods on an industry-specific retail dataset remarkably propels the development of highly efficient and cost-effective CV solutions meticulously tailored for the retail industry.
AB - The retail industry encounters huge obstacles with computer vision (CV) technology due to frequent model retraining with changing products and time-consuming, costly data annotation. Previous research in this field has been primarily focused on optimizing model performance rather than minimizing annotation effort. Therefore, the main idea of this paper is to evaluate active learning as a method to minimize annotation effort in the retail industry. The MVTEC Densely Segmented Supermarket dataset is used to evaluate various active learning methods such as the Least Confident, Entropy and Cost-Effective Active Learning (CEAL) along with Mask R-CNN model. The results demonstrate that annotating only 20.83-24.34% of the data achieves 95% of the full dataset’s performance. When training, out-of-sample data share similar characteristics, the Least Confident and CEAL methods reduce annotation requirements by 7.7-15.7% while maintaining 95% and 97% of the full dataset’s performance. However, the Entropy method under-performs compared to the random selection baseline. Ultimately, none of the methods show a clear advantage when the data characteristics differ between training and out-of-sample data. Finally, the proposed active learning methods on an industry-specific retail dataset remarkably propels the development of highly efficient and cost-effective CV solutions meticulously tailored for the retail industry.
KW - Active learning
KW - Annotation effort
KW - Instance segmentation
KW - Retail image analysis
UR - http://www.scopus.com/inward/record.url?scp=85190087582&partnerID=8YFLogxK
U2 - 10.1007/s42001-024-00266-7
DO - 10.1007/s42001-024-00266-7
M3 - Article
SN - 2432-2717
JO - Journal of Computational Social Science
JF - Journal of Computational Social Science
ER -