Abstract
Noisy labels are pervasive in large, readily collected datasets, and training DNNs on them degrades generalization. Semi-Supervised Learning (SSL) approaches have shown promise by predicting pseudo-labels to correct noisy labels, but we identify and demonstrate a fundamental limitation of SSL-based label correction: hard samples near decision boundaries substantially weaken the memorization effect that these methods rely on. The result is erroneous pseudo-labels and a negative feedback loop in which the model gradually memorizes its own errors, further degrading performance. To address this issue, and inspired by AdaBoost, we propose HEALON (Hard samplE Adaptive Labeling with Optimal reweighting for Noisy labels), a novel framework for learning with noisy labels that tempers the influence of hard samples through an optimal weighting strategy during training. The framework primarily consists of two steps: Weighted Implicit Ensemble (
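The abstract states that HEALON's reweighting is inspired by AdaBoost. As background only (this is the classic AdaBoost weight update, not the HEALON algorithm, whose details appear in the paper body), one reweighting step upweights misclassified, i.e. hard, samples and downweights the rest:

```python
import numpy as np

def adaboost_reweight(weights, correct):
    """One AdaBoost-style reweighting step: samples the current learner
    gets wrong are upweighted, correctly classified ones downweighted."""
    eps = np.sum(weights[~correct]) / np.sum(weights)  # weighted error rate
    eps = np.clip(eps, 1e-10, 1 - 1e-10)               # guard against log(0)
    alpha = 0.5 * np.log((1 - eps) / eps)              # learner confidence
    new_w = weights * np.exp(np.where(correct, -alpha, alpha))
    return new_w / new_w.sum()                         # renormalize to sum 1

# four samples, uniform initial weights; sample 3 is misclassified ("hard")
w = np.full(4, 0.25)
correct = np.array([True, True, True, False])
w = adaboost_reweight(w, correct)
# the hard sample now carries half of the total weight: [1/6, 1/6, 1/6, 1/2]
```

HEALON's contribution, per the abstract, is to *balance* rather than maximize this influence, since hard clean samples and noisy-label samples can look alike near the decision boundary.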