Applicability of Large Language Models (LLMs) to library classification: Analysis of ChatGPT-4o,DeepSeek and Gemini 2.0 for assigning Dewey Decimal Classification (DDC) numbers

Abstract

In this era of AI technologies and LLMs, libraries are also at the forefront of experimenting with how these technologies can improve and augment library operations, particularly classification and cataloging. This study investigates how well the LLMs, specifically ChatGPT-4o, DeepSeek, and Gemini 2.0, can perform library classification using the DDC scheme. Previous studies have evaluated various AI models for classification and cataloging purposes, mostly using content analysis or similarity measures, providing limited insights into where, how, and by what measure these models make errors. Our study develops a hierarchical evaluation scale that respects the structural characteristics of the DDC system. We tested the selected models on a dataset of 110 book titles, spanning across all main classes of the DDC, with expert-assigned numbers as a benchmark. Models were tested for accuracy, mismatch distribution, direction of misclassification, and cross-model compensation of error, addressing a crucial gap and adding novel findings to the existing body of knowledge. The results indicate that all three models handle broader levels of classification well, particularly up to the second and third digit. DeepSeek performed best overall, with an average match score of 56.43 out of 100, followed by ChatGPT-4o (51.82), while Gemini 2.0 produced the most variable outcomes of the three (45.73). Most errors occur at the section (third digit) and early decimal levels, indicating that such granular distinctions demand contextual understanding beyond the current model capabilities. Misclassifications at the main level were rare (ChatGPT: 9.09%; DeepSeek: 0.91%; Gemini: 8.18%). Interestingly, the cross-model compensation matrix revealed that different models perform differently across the hierarchical bins. DeepSeek was found to be excellent at broader-level classification, while ChatGPT-4o performed better at granular-level classification, indicating future potential for hierarchy-aware model combinations for the given task.

Keywords

library classification AI dewey decimal classification ChatGPT DeepSeek Gemini knowledge organization

Get full access to this article

View all access options for this article.

References

Abubakar

Aderinto

Abdulsalam

, et al. (2024) The prospects and challenges of artificial intelligence utilization for cataloguing of information resources in Nigerian Academic libraries. In: Proceedings of the 3rd international conference on ICT for national development and its sustainability, Ilorin, Nigeria, 2024, pp.251–267. Available at: https://www.researchgate.net/profile/Bolaji-Oladokun/publication/382947230_Bridging_the_Digital_Divide_Empowering_Nigerian_Universities_through_Technological_Advancements_in_Academic_Libraries/links/66b4749c8f7e1236bc43f5e7/Bridging-the-Digital-Divide-Empowering-Nigerian-Universities-through-Technological-Advancements-in-Academic-Libraries.pdf#page=258

Adetayo

(2023) Artificial intelligence chatbots in academic libraries: The rise of ChatGPT. Library Hi Tech News 40(3): 18–21.

Adetayo

Aborisade

Sanni

(2024) Microsoft copilot and anthropic Claude AI in education and library service. Library Hi Tech News. Epub ahead of print 18 January 2024. https://doi.org/10.1108/lhtn-01-2024-0002

Aldoseri

Al-Khalifa

Hamouda

(2024) AI-Powered innovation in digital transformation: Key pillars and industry impact. Sustainability 16(5): 1790.

Ali

Naeem

Bhatti

(2020) Artificial intelligence tools and perspectives of university librarians: An overview. Business Information Review 37(3): 116–124.

Ali

Naeem

Bhatti

(2021) Artificial Intelligence (AI) in Pakistani university library services. Library Hi Tech News 38(8): 12–15.

Apell

Eriksson

(2023) Artificial intelligence (AI) healthcare technology innovations: The current state and challenges from a life science industry perspective. Technology Analysis and Strategic Management 35(2): 179–193.

Asemi

Nowkarizi

(2021) Intelligent libraries: A review on expert systems, artificial intelligence, and robot. Library Hi Tech 39(2): 412–434.

Balnaves

(2024) Artificial intelligence and libraries: An introduction. In: Balnaves

Bultrini

Cox

, et al. (eds) New Horizons in Artificial Intelligence in Libraries. Walter de Gruyter GmbH & Co KG, pp.3–13.

10.

Bawden

Robinson

(2016) Library and Information Science. In: Jensen

Rothenbuhler

Pooley

, et al. (eds) The International Encyclopedia of Communication Theory and Philosophy. John Wiley & Sons, pp.1–5. https://doi.org/10.1002/9781118766804.wbiect113

11.

Bodenhamer

(2023) Reliability and Usability of ChatGPT for Library Metadata. OSU–Faculty and Staff Publications. Epub ahead of print 2023.

12.

Boppana

Bhadoria

Kodali

(2024) An open-source RAG architecture for LLMs. In: TENCON 2024 - 2024 IEEE region 10 conference (TENCON), December 2024, pp.43–46. Available at: https://ieeexplore.ieee.org/document/10903064 (accessed 27 October 2025).

13.

Borovič

Tomovski

Li Dobnik

, et al. (2025) Evaluating proprietary and open-weight large language models as universal decimal classification recommender systems. Applied Sciences 15(14): 7666.

14.

Brzustowicz

(2023) From ChatGPT to CatGPT: The implications of artificial intelligence on library cataloging. Information Technology and Libraries 42(3): 16295.

15.

Bultrini

(2024) Current directions for artificial intelligence in libraries: An introductory overview. In: Balnaves

Bultrini

Cox

, et al. (eds) New Horizons in Artificial Intelligence in Libraries. De Gruyter, pp.17–23. https://doi.org/10.1515/9783111336435-003

16.

Casheekar

Lahiri

Rath

, et al. (2024) A contemporary review on chatbots, AI-powered virtual conversational agents, ChatGPT: Applications, open challenges and future research directions. Computer Science Review 52: 100632.

17.

Chen

Zhang

Langrené

, et al. (2025) Unleashing the potential of prompt engineering for large language models. Patterns 6(6): 101260.

18.

Chen

(2020) IoT, cloud, big data and AI in interdisciplinary domains. Simulation Modelling Practice and Theory 102: 102070.

19.

Chow

EHC

Kao

(2024) An experiment with the use of ChatGPT for LCSH subject assignment on electronic theses and dissertations. Cataloging & Classification Quarterly 62(5): 574–588.

20.

Cirillo

Desiato

Polese

, et al. (2025) Exploring the ability of emerging large language models to detect cyberbullying in social posts through new prompt-based classification approaches. Information Processing & Management 62(3): 104043.

21.

Cox

(2023) How artificial intelligence might change academic library work: Applying the competencies literature and the theory of the professions. Journal of the Association for Information Science and Technology 74(3): 367–380.

22.

da Silva

de Sousa

(2024) Inteligência Artificial e o ChatGPT:: perspectivas e desafios para a Classificação Bibliográfica [Artificial Intelligence and ChatGPT: Perspectives and challenges for Bibliographic Classification]. Revista Ibero-Americana de Ciência da Informação 17(1): 44–65.

23.

Debnath

Siddiky

Rahman

, et al. (2025) A comprehensive survey of prompt engineering techniques in large language models. Institute of Electrical and Electronics Engineers (IEEE), 8 March. Available at: https://doi.org/10.36227/techrxiv.174140719.96375390/v2 (accessed 12 March 2025).

24.

Deegan

Tanner

(2013) Digital librarians: New roles for the information age. In: Digital Futures: Strategies for the Information Age. Digital Futures. Facet, pp.209–231. Available at: https://www.cambridge.org/core/books/digital-futures/digital-librarians-new-roles-for-the-information-age/7A63075640B8292FFBDCDB2AD0ADD8A3

25.

Desmarchelier

Djellal

Gallouj

(2025) Innovation in libraries: A service-oriented perspective. Research Policy 54(1): 105110.

26.

Dobreski

Hastings

(2025) AI Chatbots and subject cataloging: A performance test. Library Resources and Technical Services 69(02): 8440.

27.

Fahmi

Sofiyani

Margono

, et al. (2025) AI-based book classification using book titles: investigating two-stage NLP approach. In: 2025 IEEE 11th International conference on computing, engineering and design (ICCED), Cairo, Egypt, 13 November 2025, pp.1–6. IEEE. Available at: https://ieeexplore.ieee.org/document/11324715/ (accessed 4 February 2026).

28.

Feng

(2025) AI-powered knowledge organization: A next-generation approach to library classification using DeepSeek-R1. Scientific Reports 15(1): 38394.

29.

Gao

Jin

, et al. (2025) A comparison of DeepSeek and Other LLMs. arXiv:2502.03688. arXiv. Available at: http://arxiv.org/abs/2502.03688 (accessed 23 October 2025).

30.

Grzybowski

Pawlikowska–Łagód

Lambert

(2024) A history of artificial intelligence. Clinical Dermatology 42(3): 221–229.

31.

Guo

Tian

Tang

, et al. (2025) Multi-pattern retrieval-augmented framework for Text-to-SQL with Poincaré-Skeleton retrieval and meta-instruction reasoning. Information Processing & Management 62(3): 103978.

32.

Hamad

Al-Fadel

Shehata

AMK

(2024) The level of digital competencies for the provision of smart information service at academic libraries in Jordan. Global Knowledge, Memory and Communication 73(4/5): 614–633.

33.

Hamdan

Hassanien

Khamis

, et al. (eds) (2021) Applications of Artificial Intelligence in Business, Education and Healthcare. Studies in Computational Intelligence. Springer International Publishing. https://doi.org/10.1007/978-3-030-72080-3

34.

Harisanty

Anna

NEV

Putri

, et al. (2025) Is adopting artificial intelligence in libraries urgency or a buzzword? A systematic literature review. Journal of Information Science 51(2): 511–522.

35.

Hibner

Kelly

(eds) (2013) Making a Collection Count: A Holistic Approach to Library Collection Management, 2nd edn. Chandos Publishing.

36.

Hodonu-Wusu

(2025) The rise of artificial intelligence in libraries: The ethical and equitable methodologies, and prospects for empowering library users. AI and Ethics 5: 755–765. https://doi.org/10.1007/s43681-024-00432-7

37.

Hussain

(2023) Use of artificial intelligence in the library services: Prospects and challenges. Library Hi Tech News 40(2): 15–17.

38.

(2025) Application of large language models for digital libraries. In: Proceedings of the 24th ACM/IEEE joint conference on digital libraries.,119, pp.1–2. Association for Computing Machinery. https://doi.org/10.1145/3677389.3702617.

39.

Jayavadivel

Arunachalam

Nagarajan

, et al. (2024) Historical overview of AI adoption in libraries. In: Senthikumar

(ed.) AI-Assisted Library Reconstruction. IGI Global, pp.267–289. https://doi.org/10.4018/979-8-3693-2782-1.ch015

40.

Jha

(2023) Application of artificial intelligence in libraries and information centers services: Prospects and challenges. Library Hi Tech News 40(7): 1–5.

41.

Jia

Guo

, et al. (2026) Chinese ethnic minority book classification by large language models within CLC. Electronic Library 44: 251–270.

42.

Khalid

Witmer

A-P

(2025) Prompt engineering for large language model-assisted inductive thematic analysis. Social Science Computer Review. Epub ahead of print 2025. https://doi.org/10.1177/08944393251388098

43.

Krishnamurthy

Satija

Martínez-Ávila

(2023) Classification of classifications: Species of library classifications. Cataloging & Classification Quarterly 61(2): 228–248.

44.

Lazarinis

(2015) Dewey decimal classification. In: Cataloguing and Classification. Chandos Publishing, pp.153–176, Available at: https://www.sciencedirect.com/science/article/pii/B9780081001615000087 (accessed 23 October 2025).

45.

Liu

Cao

Liu

, et al. (2025) Datasets for large language models: a comprehensive survey. Artificial Intelligence Review 58(12): 403.

46.

Lund

(2025) Classification schemes: Universal, special, national. In: Baker

Ellis

(eds) Encyclopedia of Libraries, Librarianship, and Information Science. Elsevier, pp.501–513. Available at: https://linkinghub.elsevier.com/retrieve/pii/B9780323956895000109 (accessed 3 December 2025).

47.

Luo

Hong

Nie

(2025) Automatic classification of research data sets into the Chinese library classification with generative large language model. Electronic Library 43(4): 600–618.

48.

Mahmud

(2024) AI in automating library cataloging and classification. Library Hi Tech News. Epub ahead of print 2024. https://doi.org/10.1108/LHTN-07-2024-0114

49.

Mallikarjuna

(2024) An analysis of integrating artificial intelligence in academic libraries. DESIDOC Journal of Library & Information Technology 44(2): 124–129.

50.

Marr

(2019) Artificial Intelligence in Practice: How 50 Successful Companies Used AI and Machine Learning to Solve Problems. John Wiley & Sons.

51.

Martins

(2024) Artificial intelligence-assisted classification of library resources: The case of Claude AI. Library Philosophy and Practice: 8159.

52.

McCarthy

(2004) What is artificial intelligence? Epub ahead of print 24 November 2004.

53.

Mishra

Sarma

H and M S

(2025) PageLLM: Incremental approach for updating a security knowledge graph by using page ranking and large language model. Information Processing & Management 62(3): 104045.

54.

Mogali

(2014) Artificial intelligence and its applications in libraries. In: Conference: Bilingual international conference on information technology: yesterday, today and tomorrow., Defence Scientific Information and Documentation Centre, Ministry of Defence Delhi. Available at: https://www.researchgate.net/profile/Shivaranjini-Mogali/publication/287878456_Artificial_Intelligence_and_its_applications_in_Libraries/links/567a404708ae361c2f6826dc/Artificial-Intelligence-and-its-applications-in-Libraries.pdf (accessed 2 June 2025).

55.

Mojjada

(2025) ICT in Libraries: A Theoretical Approach. Chyren Publication.

56.

Nazim

Munshi

Ashar

(2023) Librarians self-efficacy in ICT-based library operations and services: A survey of librarians working in libraries of Aligarh Muslim University Library System. Journal of Librarianship and Information Science 55(4): 1028–1043.

57.

Nirudi

Parichi

(2024) Artificial intelligence in libraries: An overview. 5080670, SSRN Scholarly Paper. Social Science Research Network. Available at: https://papers.ssrn.com/abstract=5080670 (accessed 25 September 2025).

58.

Nursalimah

Ali

(2024) The role of librarians as digital curators in the digital information era in the Muhammadiyah University of North Sumatra Library. Journal La Edusci 5(4): 198–215.

59.

Ojuri

Han

Chiong

, et al. (2025) Optimizing text-to-SQL conversion techniques through the integration of intelligent agents and large language models. Information Processing & Management 62(5): 104136.

60.

Omame

Alex-Nmecha

(2020) Artificial intelligence in libraries. In: Osuigwe

(ed.) Managing and Adopting Library Information Services for Future Users. IGI Global Scientific Publishing, pp.120–144. https://doi.org/10.4018/978-1-7998-1116-9.ch008

61.

Oniani

Hilsman

Peng

, et al. (2023) Adopting and expanding ethical principles for generative artificial intelligence from military to healthcare. npj Digital Medicine 6: 225.

62.

Salaba

Chan

(2023) Cataloging and Classification: An Introduction. Bloomsbury Publishing.

63.

Satija

Gupta

(2024) S R Ranganathan: Making of the man and his method. Annals of Library and Information Studies 71(1): 11–24.

64.

Shahi

Hummel

(2025) On the effectiveness of large language models in automating categorization of scientific texts. In: Proceedings of the 27th International conference on enterprise information systems, Porto, Portugal, 2025, pp.544–554. SCITEPRESS - Science and Technology Publications. https://doi.org/10.5220/0013299100003929

65.

Stephens

(ed.) (1998) Public Library Collection Development in the Information Age, 1st Boca Raton edn. CRC Press.

66.

Suman

Patel

(2025) Awareness and usage of ICT-based library and information services among the library users of Vivekanand Central Library of Central University of Jharkhand, Ranchi: A case study. SSRN Electronic Journal. Epub ahead of print 2025. https://doi.org/10.2139/ssrn.5134486

67.

Tait

Martzoukou

Reid

(2016) Libraries for the future: The role of IT utilities in the transformation of academic libraries. Palgrave Communications 2: 16070.

68.

Tait

Pierson

(2022) Artificial intelligence and robots in libraries: Opportunities in LIS curriculum for preparing the librarians of tomorrow. Journal of the Australian Library and Information Association 71(3): 256–274.

69.

Wang

Zhao

, et al. (2024) Investigating the impact of prompt engineering on the performance of large language models for standardizing obstetric diagnosis text: Comparative study. JMIR Formative Research 8: e53216.

70.

Will

(1997) Dewey for Windows. Electronic Library 15(3): 192–195.

71.

Xia

Abuassba

AOM

(2020) Recent research on AI in games. In: 2020 International wireless communications and mobile computing (IWCMC), June 2020, pp.505–510. Available at: https://ieeexplore.ieee.org/document/9148327 (accessed 23 October 2025).