Integrating large language models in care,research,and education in multiple sclerosis management

Abstract

Use of techniques derived from generative artificial intelligence (AI), specifically large language models (LLMs), offer a transformative potential on the management of multiple sclerosis (MS). Recent LLMs have exhibited remarkable skills in producing and understanding human-like texts. The integration of AI in imaging applications and the deployment of foundation models for the classification and prognosis of disease course, including disability progression and even therapy response, have received considerable attention. However, the use of LLMs within the context of MS remains relatively underexplored. LLMs have the potential to support several activities related to MS management. Clinical decision support systems could help selecting proper disease-modifying therapies; AI-based tools could leverage unstructured real-world data for research or virtual tutors may provide adaptive education materials for neurologists and people with MS in the foreseeable future. In this focused review, we explore practical applications of LLMs across the continuum of MS management as an initial scope for future analyses, reflecting on regulatory hurdles and the indispensable role of human supervision.

Keywords

Multiple sclerosis large language models (LLMs)artificial intelligence applications disease management

Introduction

Digital tools and artificial intelligence (AI) in healthcare promise a new era in the management of multiple sclerosis (MS).¹ MS presents unique challenges as a chronic and unpredictable disease, such as multidimensionality and heterogeneity of pathological findings and clinical manifestations, or an extremely variable disease course. Thereby, there is an urgent need for cutting-edge technologies to capitalize recent advancements in diagnostics, monitoring, and treatment for people with MS (pwMS).

In this evolving landscape, large language models (LLMs) stand as a key component of the field of generative AI with the capacity to enhance MS management with efficiency and already impressive precision.^2,3 Since MS commonly affects younger individuals, notably women between 20 and 40 years, they may be especially receptive to an online second opinion provided through such platforms.^4,5

LLMs have emerged demonstrating an unparalleled ability to generate texts that emulate human quality, adapting to required contexts and extending their utility to specialized fields in biomedicine.² The applications of LLMs in managing MS are diverse and expanding, being possibly useful not only for MS specialists but for pwMS as well. We believe these applications could extend way beyond interactive chatbots, where pwMS can administrate their appointments or report symptoms. They could also include advanced educational tools and programs for text analysis or data extraction in setting of real-world analyses. These models are already explored for simulating and supporting clinical decision-making in other fields.⁶ In MS, integration of multimodal data obtained from various instruments and facilitated through AI algorithms with current medical knowledge may be used to support decision of immune or symptomatic treatments.²

An expanding literature highlights the utility of AI in MS, spanning from diagnosis to prognosis and monitoring, with a particular focus in imaging and multimodal data integration.⁷ However, there is still a substantial gap in the field of LLMs and their use in MS management remains relatively unexplored. Research is required for their use in direct patient care, MS-research, and education of stakeholders. In this focused review, we aim to approach these applications expanding the discourse on upcoming applications of LLMs within MS frameworks and with a clinical focus.^8,9 Complementary to previous reports of LLMs in medicine, we examine the role of LLMs and how these could innovatively address complexities in the MS healthcare continuum.⁸ These complexities are not fully addressed by existing AI tools, considering the distinctive characteristics and varying disability progression of MS. We examine some of the unique challenges posed by MS, such as the heterogeneity of symptoms or the need for individualized treatment and how LLMs can approach these by offering innovative tools. Furthermore, we explore into the regulatory and ethical implications that accompany the implementation of these technologies, highlighting the importance of prioritizing patient safety. We aim to stimulate discussion about the potential of LLMs in MS, exploring their applications in leveraging AI in this complex disease.

How do LLMs work?

LLMs employ complex algorithms that excel in producing remarkably human-like texts in the field of generative AI. Clusmann et al.⁸ provided a comprehensive analysis of the potential applications of LLMs within the medical field, accompanied by a useful glossary of computational terms. These models employ complex neural networks, often based on transformer architectures such as generative pre-trained transformer (GPT).¹⁰ This model, which relies on attention mechanisms, has revolutionized sequence transduction tasks by enhancing the ability to process contextual information without the need for recurrent or convolutional networks.

The capabilities of LLMs extend beyond simple question-answering as they can summarize, paraphrase, translate, or even transform verbal information.^11,12 With the advent of several open-access LLMs, such as ChatGPT (OpenAI, San Francisco, California, USA), or Gemini (Google LLC, Mountain View, California, USA), both the general public and academic communities have started to recognize their potential for language-based applications.^13,14 In the previous years, advances from GPT-3.5 to GPT-4o reflect significant improvements in performance, with enhanced understanding and improved responses from the latter. In addition, domain-specific models such as BioBERT, BioMegatron, and PubMedBERT have been developed, targeting the biomedical domain.^15–17

LLMs use linguistic patterns and semantics through a self-attention mechanism that can recognize contextual nuances and the relationships between words and sentences.⁸ This understanding surpasses that of traditional machine learning models, which may rely on manual feature encoding or rule-based systems. By leveraging the self-attention mechanism, it enables more efficient and accurate predictions, making it an ideal foundation for developing advanced LLM applications in MS healthcare. Context here is also key: while the world “apple” can refer to a “fruit” or “technology” if accompanied by “edible” or “watch,” the word “attack” can refer to a “relapse” or “pseudo-relapse” depending on further data. A “lesion” may refer to an electrophysiological finding, optic nerve affection, or to histological characteristics. A “lesion” + “active” can have a different meaning if it involves “early” or “late” demyelinating or “mixed + inactive” as well (see Supplementary Figure). Based on this context, the probabilities in the transformer architecture are adjusted, influencing the produced outputs to ensure they are contextually accurate and relevant to the specific MS use-case or scenario.

LLMs undergo pre-training on expansive datasets, or corpora, that include several interned-based sources to refine their language processing abilities. However, given that training corpora of currently available LLMs are often not disclosed and not fully known, it has been, to our knowledge, not yet been specifically tailored for MS-centered applications. Strategies for optimizing LLM output for specific tasks and domains are under discussion.¹⁸ For highly specific use-cases, LLMs can be fine-tuned on proprietary or local datasets. Recently, a group presented a fine-tuned model for detecting disease progression in MS with relatively little information from routine clinical practice.¹⁹ A particularly promising approach is the use of retrieval-augmented generation (RAG), which incorporates additional datasets during the response generation process.¹⁸ For instance, MS-specific literature—such as recent or regional guidelines, consensus papers, or scientific conference reports—can be used as source material in the form of targeted “chunks” of information to refine the foundational training of pre-existing models (see Figure 1). This enhancement can help to address issues such as the creation of erroneous content (“hallucinations”), reliance on outdated information, or responses lacking MS-specific knowledge by anchoring the LLMs’ responses in targeted, precise input as currently discussed exemplary in oncology.^12,20

Figure 1.

An integrated approach to large language models (LLMs) in multiple sclerosis (MS) management: a schematic representation.

LLMs in MS care

Processing and interpreting large datasets are fundamental to supporting clinical decisions in MS management. LLMs could integrate patient records, the latest research publications, and treatment protocols to provide evidence-informed guidance. Through neural networks and prediction models, they have shown proficiency in leveraging substantial medical knowledge and simulating complex reasoning skills typical of healthcare professionals.^21–23 Presenting information in a user-friendly manner, such as through a chatbot interface, is particularly promising for MS specialists in routine care, where current evidence is updating practically on daily basis. Already with few data or inputs deep learning approaches have been able to predict disease courses.¹⁹ LLMs hold the potential to refine not just diagnosis and selection of disease-modifying therapies (DMTs), but also symptomatic treatments or rehabilitation. Moreover, LLMs can optimize information flow through digital pathways or monitoring platforms with cutting-edge MS-knowledge, enhancing communication channels with caregivers.²⁴

For example, these models could serve as interactive interfaces to integrate data regarding risk for highly active disease course (e.g. description of relapses and their severity or MS lesion burden), hidden symptoms, demographics or inclusive financial aspects for treatment choice, summarizing not only data provided by MS specialists, but also directly from patients themselves. LLMs may also assist in creating personalized rehabilitation programs by analyzing patient preferences, conditions, or progress and suggesting modifications.

Effective communication is fundamental in MS care, where patient adherence can significantly influence treatment success and overall outcomes. A deep understanding of MS pathophysiology, symptoms, and treatment options is therefore essential. LLMs enable more digestible dialogues between MS specialists and pwMS. By translating difficult medical terminology into more accessible language, LLMs may assist pwMS in explaining their condition and therapies, as suggested by prior evaluations, for example, in a series of MS clinical scenarios.²⁵ In addition, LLMs can tailor educational content to educational backgrounds and respond to common inquiries. For instance, brain or spinal magnetic resonance imaging (MRI) reports could be translated into layperson-friendly summaries. LLMs can also offer instant responses to patient questions when medical professionals are not immediately reachable. In an Italian cohort, a group of pwMS perceived responses provided by a chatbot to be higher empathetic than answers provided from specialists.²⁶

The administrative load associated with documentation is well-known resource-consuming in medical domains, although few research is available in MS.²⁷ However, LLMs may ease automating writing of clinical notes, communication with pwMS, and other necessary paperwork in ambulatory or stationary settings (e.g. writing tailored medical reports for family physicians, insurance entities, or social service entities), reducing the burden of tasks such as applying for assistive devices. This could begin with integrating speech-to-text technologies during consultation documentation. Further calculations of disability beyond the Expanded Disability Status Scale (EDSS), as already explored with machine learning, may be possible.²⁸ Tests evaluating with MS-specific reports generated by LLMs are, however, not available. Moreover, LLMs can be used to streamline the process of obtaining insurance approvals for treatments by auto-generating necessary documentation and justifications based on data from medical records, for example, last EDSS, as data from symptoms documentation or symptomatic treatment may serve this purpose. Labels of approved DMTs could be checked with the patients records to assess potential drug interactions, contraindications, or appropriate dosing. This assistance may go beyond speeding up bureaucratic tasks, enabling MS specialists to dedicate more time for face-to-face patient interaction with potential to reduce medical expenses.

In efforts of implementing newer technologies into MS-healthcare, AI-based digital pathways are being integrated into a “digital MS twin” modeling the complexities of MS care within a virtual framework.²⁹ These digital pathways could become integrated components to reflect and expand real-world clinical scenarios,³⁰ improving involvement of pwMS, adherence, and outcomes.³¹

LLMs in MS research

An expanding generation of knowledge in MS regarding the pathophysiology, treatment and prognostic of MS is occurring. Uses of LLMs in MS research share several aspects with the application in other medical fields or neurological diseases. LLMs can be useful by summarizing current findings, aiding MS researchers in accessing synthesized versions of scientific literature.^23,32 Similarly, identification of emerging research opportunities, translational approaches, and innovative treatment strategies from other medical fields can be facilitated by LLMs. These can highlight the potential application of newer biomarkers used in other diseases, suggest novel therapeutic targets, and inspire cross-disciplinary approaches that could be beneficial in MS research. However, it is crucial to acknowledge that the usefulness of such content depends widely on the quality and sources of their training corpora, with the potential biases or conflicts of interest that may result from this. LLMs do not reflect their own output or weigh and balance arguments as humans, their output is based on calculations and predictions.

LLMs can particularly address complex MS-related datasets by streamlining the extraction and organization of unstructured textual data without the need for human reasoning, outperforming traditional machine learning methods.^19,33,34 Although data from randomized controlled trials provide the highest levels of evidence, more research is required to optimize use of real-world input data from clinical routine settings.³⁵ These, despite its raw and unformatted nature, provide insights beyond the setting of controlled trials.³⁶ Given the variability in the real-world clinical presentation of MS, LLMs can assist researchers recognizing patterns and errors for improving data quality and consistency (e.g. inconsistent EDSS, lack of compliance in tests), integrating multimodal data sources or providing real-time feedback during data collection. Yet, the integration of “big data” presents challenges (e.g. heterogeneity and biases) that can be also present using AI.

Data on electronic health records, not only coming from discharge summaries, but also, from round notes or any written communications may serve as source of real-world data.^19,37 A notable limitation in this area is the inherent subjectivity and potential bias from stakeholders in the primary documentation. Grading spasticity, describing eye movements, or characteristics of movement disorders is for neurologists or pwMS, still, relatively subjective. Nonetheless, beyond textual notes, data encompassed in different forms of language (including scales such as EDSS) and established clinical outcomes (including timed walk tests or assessments of visual acuity), or other types of data such as brain imaging, neurodestruction biomarkers, or digital sources (e.g. dynamometers, accelerometers, and smartwatches) could be incorporated.^38–40

While certain models are already operational across different medical fields, a large unmet potential remains within MS research.^41,42 For instance, research has shown the ability of a specifically tuned LLM to detect Alzheimer’s disease from medical records, surpassing the performance of human experts.⁴³ Efficient identification of prodromal MS from unstructured real-world data (including apparently unrelated consultations or data from healthcare funding bodies) may also be possible to estimate real disease trajectories and investigate novel MS biomarkers.

In addition, LLMs could facilitate interactions among scientific collaborators. The adoption of LLM in drafting manuscripts or preparing grant proposals is a topic of active debate within various research communities.^44,45 A position or statement from MS societies would likely be welcomed to address this aspect as a collective. We consider that questions regarding the appropriateness of LLM for draft generation, the boundaries of authorship and intellectual property, and whether LLM activities amount to those of a writing aid require careful consideration. AI systems hold the potential to streamline time-intensive research tasks, yet the generation of original content and conceptual thought may remain a human task.

LLMs in MS education

Contrary to concerns of AI in education because of reduced learning or memorization, LLMs may offer innovative pedagogical approaches.⁴⁶ Their integration into the medical curricula at an undergraduate level can provide instant access to high-quality medical knowledge, while including experiences of pwMS. Incorporation of latest MS corpora from medical guidelines, scientific literature, and medical conferences, for example, through RAG, would keep both students and healthcare professionals updated on best practices.⁴⁶

Flexible LLMs could help creating interactive educational modules. Case-based learning, an established and efficacious educational strategy, may be further strengthened by LLMs, permitting stakeholders to virtually engage with the challenging diagnostic and therapeutic scenarios associated with MS. While existing online or digital models offer predetermined responses to learning scenarios, LLMs may provide a dynamic open interaction. LLM-based chatbots may reflect certain challenging nuances of symptom description by pwMS, as pain can range between somatic, visceral, and neuropathic types. This could enhance critical thinking and decision-making skills beyond multiple-choice or pre-established answers.

These models can be fine-tuned to “guide” users (e.g. students) in the desired direction.¹⁸ Cases may include managing various stages of MS, from initial diagnosis to handling advanced symptoms and complications. Thus, the open-ended depiction of disease course or the interpretation of diagnostic tests could be trained in an unsupervised free-response format. However, we should carefully consider individualized treatment decision-making and availability of resources, which may widely vary according to local resources or regulations and a ground truth or gold standard is frequently not available.⁴⁷ Adapting the models to local characteristics and needs is important to avoid treatment gaps, as certain immune therapies may not be available or approved in specific regions. Moreover, treatment strategies may be considered in the customization, as the learning tool may adopt different approaches (e.g. hit hard and early vs escalation).

In addition, LLMs can change the development training exams and questions, as they could adjust to previous user results, although this application remains still untested in neurology and MS settings.⁴⁶ Furthermore, LLMs can serve as instrumental resources preparing content, such as educational lectures or presentations. A request to “prepare a presentation discussing the diagnosis of secondary progressive MS” can yield a constructive template, which a junior lecturer can further refine.²⁷

As mentioned above, patient education is fundamental in MS management. LLMs can play a significant role in disseminating understandable and accurate information to patients in a patient-friendly language. For instance, a newly diagnosed pwMS could interact with an LLM-based platform to understand their diagnosis or the suggested DMTs by the MS specialist. Management and interpretation of symptoms and decision support of possible relapse consultations may also serve in MS pathways or individual patient journeys.⁴⁸ LLMs can be prompted and adjusted to deliver personalized educational content adapted to individual profiles, including age, background, or specific symptoms, while also considering cognitive capacities of pwMS. This may include interpreting clinical trial findings, negative outcomes from phase III studies, or information about complementary or alternative therapies. Digital health literacy is necessary also in MS, as pwMS could take advantage of digital resources for their health.

Challenges and regulations

While LLMs hold immense potential in healthcare and MS management, their deployment in areas requiring medical interpretation must be approached with careful oversight and human involvement. If AI systems are used in complex clinical scenarios for applications in diagnosis, treatment, monitoring, or prognosis of MS, they are subject to stringent regulation to ensure they are utilized safely and effectively.^49,50 The integration of LLMs into MS poses tangible challenges common to other medical fields, including risks of disseminating misinformation, privacy breaches, inherent biases from training datasets, and the danger of abuse.

Ensuring that the employment of patient data through LLMs adheres to data protection regulations is imperative, as the confidentiality and integrity of sensitive health information are non-negotiable. Leakage of such data remains a concerning threat that must be addressed.⁵⁰ Federated learning, as an example, is an emerging collaborative approach that could enable the contribution between MS centers to a shared MS research model without exchanging sensitive patient data, maintaining privacy and data ownership.⁵¹

The European Union’s recent enactment of the first AI Act is a landmark development, categorizing AI systems according to their potential risk levels and delineating responsibilities for both developers and users.⁵² Unsurprisingly, AI systems with the potential to significantly impact healthcare—where the stakes can involve life-or-death outcomes and substantial financial implications—are identified as high risk and subjected to rigorous regulative measures. This legislation underscores the importance of transparency and risk management throughout the AI development lifecycle.

Addressing specific challenges in incorporating LLMs into MS management necessitates thoughtful strategies, as laid out in Table 1. Many of these considerations—reinforcing ethical development, bolstering cyber-secure infrastructures, and promoting data stewardship—are encapsulated within the framework of the European Union’s AI Act.

Table 1.

Navigating Challenges and Pioneering Solutions for Enhancing AI and LLM in MS Management.

Challenges	Approaches
Multicenter data availability and with heterogeneous data characteristics and collection method	Establishment of data-sharing agreements and collaborations (e.g. federate learning). Open-data initiatives
Data bias and representation, bias propagation	Use of diverse datasets involving pwMS from diverse characteristics and backgrounds can enhance training, refinement, and relevance of LLMs
Lack of defined “ground truth” in dynamic MS advances and regional differences	Periodical updates of LLMs, incorporation of diverse international perspectives in training and testing of models. Real-time employment of data-retrieval strategies (e.g. RAG)
Limited contextual understanding and nuanced judgment	Training enhancement and customization, (e.g. fine-tuning)
Over-reliance in AI technologies (automation bias)	Human oversight and validation to complement AI-driven decisions
Hallucination	Robust anomaly detection mechanisms and model recalibration strategies (e.g. feedback loops, early warning, adaptive learning, or regular updates through an MS societal task force)
Data privacy	Robust data security measures in training and use of LLMs (e.g. encryption, anonymization, and access controls to protect sensitive data)
Lack of validation studies	Rigorous real-world validation studies across diverse settings
Unknown structure and training corpus in closed-source models	Advocate for transparency and regulatory scrutiny of proprietary algorithms (e.g. EU AI act)
Technical infrastructure with restricted access across shareholders from different backgrounds	Scalable cloud computing resources for widespread accessibility
Fragmentation and inefficiency in development due to parallel working groups	Utilization of multicenter joint platforms, task forces or working groups; workshops and webinars

pwMS: people with MS; MS: multiple sclerosis; LLM: large language model; RAG: retrieval-augmented generation; AI: artificial intelligence; EU: European Union.

The widespread adoption of LLMs in patient treatment will require balancing their flexibility with ethical, legal, and procedural safeguards. As such, safe integration of this technology into MS management will rely on an ongoing dialogue between developers, healthcare professionals, pwMS, regulators, and the wider community, with safety as the foundational priority.

Vision: integrating large multi-modal models in MS care

Although research and use of LLMs in MS are still in early stages, other established AI technologies such as machine learning algorithms for imaging and pattern recognition are already more developed and tested, being possibly suitable for use in the coming future.⁵³ As previously noted, these technologies may complement the capabilities of LLMs. Multi-modal models could interpret data from different forms, encompassing several aspects of the disease and even making predictions in real-world scenarios.

In neuroimaging, non-LLM-based AI algorithms are already showing considerable promise in enhancing the understanding and management of MS.^19,54 These algorithms offer capabilities for identifying and monitoring MS lesions, as well as quantifying changes in brain tissue, such as regional or global atrophy, over time. Some of these AI-driven models are currently near to approval process, heralding their potential integration into clinical practice for MS management.^55,56 Furthermore, AI applications in neuroimaging are expanding, with research exploring the identification of novel biomarkers, supporting differential diagnoses of white matter anomalies, and predicting disease progression through imaging data. In addition, optical coherence tomography (OCT) leverages AI to augment the detection of MS, reflecting the versatility of AI in different imaging modalities.⁵⁷

AI’s proficiency identifying longitudinal dynamics in biomarkers presents an exciting opportunity for early detection of disease progression or the efficacy of DMTs. Beyond imaging, data from wearable technologies offer real-time insights into physical parameters, including movement patterns, heart rate variability, sleep quality, and activity levels, which are especially interesting given the symptoms such as fatigue and mobility in pwMS.⁵⁸

Recognizing subtle changes, particularly in early stages of MS, is challenging. AI provides a refined tool to detect and interpret these nuances. In synthesizing these insights, the integration of multi-modal data through AI together with LLMs can significantly enhance patient management across all spectrums of MS.

Further research and efforts are needed to assess the potential applications of LLMs, as commented above. Combining diverse data streams with clinical information systems supported by LLMs stands to transform the landscape of MS care, research, and education. By integrating the strengths of various AI tools, a holistic approach is possible, one that delivers a comprehensive, patient-centric care of pwMS. This may offer a platform that not only involves answering questions on a chatbot, but also aims for a personalized disease management. As we move forward, increasing applications of generative AI seem to be possible in the near future for a novel digital MS management.

Supplemental Material

sj-jpg-1-msj-10.1177_13524585241277376 – Supplemental material for Integrating large language models in care, research, and education in multiple sclerosis management

Supplemental material, sj-jpg-1-msj-10.1177_13524585241277376 for Integrating large language models in care, research, and education in multiple sclerosis management by Hernan Inojosa, Isabel Voigt, Judith Wenk, Dyke Ferber, Isabella Wiest, Dario Antweiler, Eva Weicken, Stephen Gilbert, Jakob Nikolas Kather, Katja Akgün and Tjalf Ziemssen in Multiple Sclerosis Journal

Footnotes

Data Availability Statement

Data sharing is not applicable to this article as no datasets were generated or analyzed during this study.

Declaration of Conflicting Interests

The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: H.I. received speaker honoraria from Roche and financial support for research activities from Teva, Biogen, and Alexion. S.G. declares a nonfinancial interest as an Advisory Group member of the EY-coordinated “Study on Regulatory Governance and Innovation in the Field of Medical Devices” conducted on behalf of the DG SANTE of the European Commission. S.G. declares the following competing financial interests: he has or has had consulting relationships with Una Health GmbH, Lindus Health Ltd., Flo Ltd, Thymia Ltd., FORUM Institut für Management GmbH, High-Tech Gründerfonds Management GmbH, and Ada Health GmbH and holds share options in Ada Health GmbH. S.G. and J.N.K. are supported by the German Federal Ministry of Health (DEEP LIVER, ZMVI1-2520DAT111; SWAG, 01KD2215B), the Max-Eder-Programme of the German Cancer Aid (grant no. 70113864), the German Federal Ministry of Education and Research (PEARL, 01KD2104C; CAMINO, 01EO2101; SWAG, 01KD2215A; TRANSFORM LIVER, 031L0312A; TANGERINE, 01KT2302 through ERA-NET Transcan), the German Academic Exchange Service (SECAI, 57616814), the German Federal Joint Committee (Transplant.KI, 01VSF21048) the European Union’s Horizon Europe and innovation programme (ODELIA, 101057091; GENIAL, 101096312) and the National Institute for Health and Care Research (NIHR, NIHR213331) Leeds Biomedical Research Centre. K.A. received personal compensation from Novartis, Biogen Idec, Teva, Sanofi, and Roche for consulting services. T.Z. reports scientific advisory board and/or consulting for Biogen, Roche, Novartis, Celgene, and Merck; compensation for serving on speakers bureaus for Roche, Novartis, Merck, Sanofi, Celgene, and Biogen; and research support from Biogen, Novartis, Merck, and Sanofi. E.W., I.V., D.A., J.W., I.W., and D.F. have nothing to declare.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iDs

Hernan Inojosa

Isabella Wiest

Dario Antweiler

Supplemental Material

Supplemental material for this article is available online.

References

Goldenberg

. Multiple sclerosis review. P T 2012; 37: 175–184.

Singhal

Azizi

, et al. Large language models encode clinical knowledge. Nature 2023; 620: 172–180.

Haupt

Marks

. AI-generated medical advice-GPT and beyond. JAMA 2023; 329: 1349–1350.

Wendebourg

Heesen

Finlayson

, et al. Patient education for people with multiple sclerosis-associated fatigue: A systematic review. PLoS ONE 2017; 12: e0173025.

Sahebalzamani

Zamiri

Rashvand

. The effects of self-care training on quality of life in patients with multiple sclerosis. Iran J Nurs Midwifery Res 2012; 17: 7–11.

Benary

Wang

Schmidt

, et al. Leveraging large language models for decision support in personalized oncology. JAMA Netw Open 2023; 6: e2343689.

Bonacchi

Filippi

Rocca

. Role of artificial intelligence in MS clinical practice. NeuroImage Clin 2022; 35: 103065.

Clusmann

Kolbinger

Muti

, et al. The future landscape of large language models in medicine. Commun Med 2023; 3: 141.

Romano

Shih

Paschalidis

, et al. Large language models in neurology research and future practice. Neurology 2023; 101: 1058–1067.

10.

Vaswani

Shazeer

Parmar

, et al. Attention is all you need. Adv Neural Inf Process Syst 2017; 30: 1–11.

11.

Harrer

. Attention is not all you need: The complicated case of ethically using large language models in healthcare and medicine. EBioMedicine 2023; 90: 104512.

12.

Lappin

. Assessing the strengths and weaknesses of large language models. J Log Lang Inf 2023; 33: 9–20.

13.

Achiam

Adler

Agarwal

, et al. Gpt-4 technical report. arXiv preprint arXiv:230308774 2023.

14.

Pichai

. An important next step on our AI journey. Google Blog, 6 February 2023. https://blog.google/technology/ai/bard-google-ai-search-updates/

15.

Lee

Yoon

Kim

, et al. BioBERT: A pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 2019; 36: 1234–1240.

16.

Shin

H-C

Zhang

Bakhturina

, et al. BioMegatron: Larger biomedical domain language model. arXiv preprint arXiv:201006060 2020.

17.

Tinn

Cheng

, et al. Domain-specific language model pretraining for biomedical natural language processing. ACM Trans Comput Healthc 2021; 3: 2.

18.

Dodgson

Nanzheng

Peh

, et al. Establishing performance baselines in fine-tuning, retrieval-augmented generation and soft-prompting for non-specialist LLM users. arXiv preprint arXiv:231105903 2023.

19.

Zhan

. Precision monitoring for disease progression in patients with multiple sclerosis: A deep learning approach, 2023, https://ses.library.usyd.edu.au/handle/2123/31910

20.

Truhn

Eckardt

J-N

Ferber

, et al. Large language models and multimodal foundation models for precision oncology. npj Precis Oncol 2024; 8: 72.

21.

Savage

Nayak

Gallo

, et al. Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine. npj Digit Med 2024; 7: 20.

22.

Schubert

Wick

Venkataramani

. Performance of large language models on a neurology board–style examination. JAMA Netw Open 2023; 6: e2346721.

23.

Van Veen

Van Uden

Blankemeier

, et al. Adapted large language models can outperform medical experts in clinical text summarization. Nat Med 2024; 30: 1134–1142.

24.

Voigt

Inojosa

Wenk

, et al. Building a monitoring matrix for the management of multiple sclerosis. Autoimmun Rev 2023; 22: 103358.

25.

Inojosa

Gilbert

Kather

, et al. Can ChatGPT explain it? Use of artificial intelligence in multiple sclerosis communication. Neurol Res Pract 2023; 5: 48.

26.

Maida

Moccia

Palladino

, et al. ChatGPT vs. neurologists: A cross-sectional study investigating preference, satisfaction ratings and perceived empathy in responses among people living with multiple sclerosis. J Neurol 2024; 271: 4057–4066.

27.

Sinsky

Colligan

, et al. Allocation of physician time in ambulatory practice: A time and motion study in 4 specialties. Ann Intern Med 2016; 165: 753–760.

28.

Alves

Green

Leavy

, et al. Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis. Mult Scler J Exp Transl Clin 2022; 8: 20552173221108635.

29.

Voigt

Inojosa

Dillenseger

, et al. Digital twins for multiple sclerosis. Front Immunol 2021; 12: 669811.

30.

Wenk

Voigt

Inojosa

, et al. Building digital patient pathways for the management and treatment of multiple sclerosis. Front Immunol 2024; 15: 1356436.

31.

Tan

Cai

Agarwal

, et al. Impact of adherence to disease-modifying therapies on clinical and economic outcomes among patients with multiple sclerosis. Adv Ther 2011; 28: 51–61.

32.

Tang

Sun

Idnay

, et al. Evaluating large language models on medical evidence summarization. npj Digit Med 2023; 6: 158.

33.

Feder

Vainstein

Rosenfeld

, et al. Active deep learning to detect demographic traits in free-form clinical notes. J Biomed Inform 2020; 107: 103436.

34.

Bisercic

Nikolic

van der Schaar

, et al. Interpretable medical diagnostics with structured data extraction by large language models. arXiv preprint arXiv:230605052 2023.

35.

Kwakkenbos

Imran

McCall

, et al. CONSORT extension for the reporting of randomised controlled trials conducted using cohorts and routinely collected data (CONSORT-ROUTINE): Checklist with explanation and elaboration. BMJ 2021; 373: n857.

36.

Cohen

Trojano

Mowry

, et al. Leveraging real-world data to investigate multiple sclerosis disease behavior, prognosis, and treatment. Mult Scler 2020; 26: 23–37.

37.

Subbiah

. The next generation of evidence-based medicine. Nat Med 2023; 29: 49–58.

38.

Dillenseger

Weidemann

Trentzsch

, et al. Digital biomarkers in multiple sclerosis. Brain Sci 2021; 11: 1519.

39.

Ziemssen

Haase

. Digital innovation in multiple sclerosis management. Brain Sci 2021; 12: 40.

40.

Inojosa

Schriefer

Ziemssen

. Clinical outcome measures in multiple sclerosis: A review. Autoimmun Rev 2020; 19: 102512.

41.

Reichenpfader

Müller

Denecke

. Protocol: Large language model-based information extraction from free-text radiology reports: A scoping review protocol. BMJ Open 2023; 13: e076865.

42.

Delk

, et al. A comparison of a large language model vs manual chart review for the extraction of data elements from the electronic health record. Gastroenterology 2024; 166: 707–709.e3.

43.

Wang

. Two directions for clinical data generation with large language models: Data-to-label and label-to-data. Proc Conf Empir Methods Nat Lang Process 2023; 2023: 7129–7143.

44.

Thorp

. ChatGPT is fun, but not an author. Science 2023; 379: 313.

45.

Moon

Purkayastha

, et al. Ethics of large language models in medicine and medical research. Lancet Digit Health 2023; 5: e333–e335.

46.

Figari Jordan

Sandrone

Southerland

. Opportunities and challenges for incorporating artificial intelligence and natural language processing in neurology education. Neurol Educ 2024; 3: e200116.

47.

Zeineddine

Al-Hajje

Salameh

, et al. Barriers to accessing multiple sclerosis disease-modifying therapies in the Middle East and North Africa region: A regional survey-based study. Mult Scler Relat Disord 2023; 79: 104959.

48.

Abd-Alrazaq

AlSaad

Alhuwail

, et al. Large language models in medical education: Opportunities, challenges, and future directions. JMIR Med Educ 2023; 9: e48291.

49.

Gilbert

Harvey

Melvin

, et al. Large language model AI chatbots require approval as medical devices. Nat Med 2023; 29: 2396–2398.

50.

Meskó

Topol

. The imperative for regulatory oversight of large language models (or generative AI) in healthcare. npj Digit Med 2023; 6: 120.

51.

Sheller

Edwards

Reina

, et al. Federated learning in medicine: Facilitating multi-institutional collaborations without sharing patient data. Sci Rep 2020; 10: 12598.

52.

Commission E. Interinstitutional File: 2021/0106(COD). Proposal for a Regulation of the European Parliament and of the Council laying down harmonised rules on artificial intelligence (Artificial Intelligence Act) and amending certain Union legislative acts. 2024.

53.

Denissen

Nagels

. Artificial intelligence will change MS care within the next 10 years: Yes. Mult Scler J 2022; 28: 2171–2173.

54.

Afzal

Luo

Ramadan

, et al. The emerging role of artificial intelligence in multiple sclerosis imaging. Mult Scler J 2022; 28: 849–858.

55.

Shoeibi

Khodatars

Jafari

, et al. Applications of deep learning techniques for automated multiple sclerosis detection using magnetic resonance imaging: A review. Comput Biol Med 2021; 136: 104697.

56.

La Rosa

Wynen

Al-Louzi

, et al. Cortical lesions, central vein sign, and paramagnetic rim lesions in multiple sclerosis: Emerging machine learning techniques and future avenues. NeuroImage Clin 2022; 36: 103205.

57.

Kenney

Liu

Hasanaj

, et al. The role of optical coherence tomography criteria and machine learning in multiple sclerosis and optic neuritis diagnosis. Neurology 2022; 99: e1100–e1112.

58.

Graves

Montalban

. Biosensors to monitor MS activity. Mult Scler J 2020; 26: 605–608.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.14 MB