Abstract
ESCO skill classifiers, which scan job ads to identify skills at the finest level of the taxonomy, are widely used in international statistical projects and by national employment agencies. However, to our knowledge, no systematic evaluation of these classifiers has been conducted across large sets (i.e., thousands) of ESCO skills. We introduce a method for evaluating ESCO skill classifiers that addresses two key challenges: the large number of skills (up to around 14 000) and severe class imbalance. Our approach relies on matrix sampling of skills and job ads, with clustering and stratification, and uses bootstrapping to estimate standard errors for classifier comparisons. We apply the method to three classifiers using a sample of Luxembourgish IT and finance job ads. Our results indicate that the classifiers achieve acceptable accuracy despite low recall. Notably, the text-matching classifier performs competitively, even against methods based on large language models.
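The bootstrapping step mentioned above can be illustrated with a minimal sketch. This is not the authors' implementation; it assumes a simplified setting in which, for each sampled job ad, we record whether each of two classifiers correctly recovered an annotated skill, and we resample job ads with replacement to estimate the standard error of the accuracy difference. All variable names and the toy data are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: for each of n job ads, a binary indicator of
# whether classifier A / classifier B was judged correct on that ad.
n = 500
correct_a = rng.random(n) < 0.62
correct_b = rng.random(n) < 0.55


def bootstrap_se_of_diff(a, b, n_boot=2000, seed=1):
    """Bootstrap standard error of the accuracy difference between two
    classifiers, resampling job ads (not skills) with replacement."""
    rng = np.random.default_rng(seed)
    n = len(a)
    diffs = np.empty(n_boot)
    for i in range(n_boot):
        idx = rng.integers(0, n, size=n)  # one bootstrap resample of ads
        diffs[i] = a[idx].mean() - b[idx].mean()
    return diffs.std(ddof=1)


se = bootstrap_se_of_diff(correct_a, correct_b)
```

In practice the paired resampling shown here (drawing the same ad indices for both classifiers) is what makes the comparison efficient: between-ad variation cancels out of the difference, giving a tighter standard error than comparing two independent samples.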
