Abstract
Accurately locating product target information is crucial for improving competitiveness and brand image. However, traditional methods are often inefficient and lack robustness in complex visual environments. This study proposes an improved product target information localization model (I-PFPN), which uses a dense connection module as its backbone to extract multi-scale feature information. A dynamic convolution module then adaptively fuses the responses of different convolution kernels, while an attention mechanism enhances key regions. Finally, a multi-stage feature refinement module progressively optimizes edge and structural information, producing high-quality saliency maps and improving localization accuracy and model robustness. Compared with the baseline model without refinement, introducing three feature refinement modules increases the F-measure by 0.026, while dynamic convolution achieves an optimal F-measure of 0.951. Moreover, combining two feature refinement modules with dynamic convolution reduces the MAE by 0.018. Compared with four state-of-the-art models (Capsal, PiCANet, PoolNet, and DGRL), I-PFPN consistently outperforms them in F-measure and PR curve evaluations. In practice, the model completes product target localization within approximately 0.5 s per image, making it a fast and effective tool for enterprise-level applications in dynamic market environments.
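The dynamic convolution step described above (adaptively fusing responses from several candidate kernels) can be illustrated with a minimal NumPy sketch. This is not the paper's implementation; the shapes, the single 1×1 convolution, and the gating weights are illustrative assumptions: a global-average-pooled descriptor of the input drives a softmax over K candidate kernels, and the attention-weighted mixture of kernels is applied as one convolution.

```python
import numpy as np

def softmax(x):
    # numerically stable softmax over a 1-D vector
    e = np.exp(x - x.max())
    return e / e.sum()

def dynamic_conv1x1(x, kernels, gate_w):
    # x: input feature map, shape (C_in, H, W)          -- illustrative shapes
    # kernels: K candidate 1x1 kernels, (K, C_out, C_in)
    # gate_w: gating weights for the attention branch, (K, C_in)
    pooled = x.mean(axis=(1, 2))                   # global average pooling -> (C_in,)
    attn = softmax(gate_w @ pooled)                # per-kernel attention -> (K,)
    fused = np.tensordot(attn, kernels, axes=1)    # attention-weighted kernel (C_out, C_in)
    return np.tensordot(fused, x, axes=([1], [0])) # apply fused 1x1 conv -> (C_out, H, W)

rng = np.random.default_rng(0)
x = rng.standard_normal((8, 4, 4))
kernels = rng.standard_normal((3, 16, 8))
gate_w = rng.standard_normal((3, 8))
y = dynamic_conv1x1(x, kernels, gate_w)
print(y.shape)  # -> (16, 4, 4)
```

Because the kernels are mixed before the convolution is applied, the cost per input is close to a single static convolution even though K kernels are learned.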
