Abstract
Background:
Psoriatic arthritis (PsA) is a complex inflammatory disease with significant musculoskeletal, dermatological, and comorbidity burden. Despite therapeutic advances, many patients remain difficult to treat (D2T). Conventional definitions often overlook distinctions between inflammatory and non-inflammatory drivers, risking inappropriate treatment escalation. Two consensus frameworks—the European Alliance of Associations for Rheumatology (EULAR) and the Group for Research and Assessment of Psoriasis and Psoriatic Arthritis (GRAPPA)—have recently been proposed to better capture this complexity based on disease drivers.
Objectives:
To evaluate the applicability, overlap, and clinical utility of EULAR and GRAPPA definitions for difficult/complex-to-manage (D2M/C2M) and treatment-refractory PsA (TR-PsA) in a real-world cohort.
Methods:
We conducted a retrospective analysis within the prospective Ottawa Rheumatology CompreHEnSive TReatment and Assessment (ORCHESTRA) cohort, including consecutive PsA patients initiating or switching advanced therapy between April 2022 and September 2025. Patients were classified according to EULAR and GRAPPA frameworks into the broader D2M-PsA (EULAR) and C2M-PsA (GRAPPA) groups, and narrower inflammation-restricted TR-PsA subsets. Prevalence, overlap, and agreement were assessed using descriptive statistics and kappa analysis. Baseline clinical, comorbidity, and ultrasound characteristics were compared across groups.
Results:
Seventy-six PsA patients were included. GRAPPA identified more patients as C2M-PsA (35/76; 46%) than EULAR’s D2M-PsA (16/76; 21%). All EULAR D2M-PsA patients were included in C2M-PsA, whereas 19 additional patients were uniquely identified by GRAPPA due to less stringent treatment failure thresholds. Agreement between EULAR D2M-PsA and GRAPPA C2M-PsA was moderate (κ = 0.48). TR-PsA classification showed perfect agreement (κ = 1.000), with identical patients identified by both frameworks. D2M/C2M patients had longer disease duration and greater prior therapy exposure than non-D2T patients, while objective inflammatory measures were largely comparable.
Conclusion:
EULAR and GRAPPA D2T PsA frameworks share conceptual intent but differ in real-world clinical capture. EULAR preferentially enriches for multidomain involvement and inflammatory refractoriness after multiple therapy failures, whereas GRAPPA identifies a broader population reflecting earlier disease complexity. While both frameworks identified the same patients using inflammation-restricted criteria, differences in operationalization limit assumptions of interchangeability.
Plain language summary
Psoriatic arthritis (PsA) is a chronic inflammatory condition that can affect joints, skin, and many other aspects of health. Although several effective treatments are available, some patients remain difficult to manage. In these patients, ongoing symptoms may be caused by active inflammation, but also by other factors such as pain sensitization, comorbid conditions, or treatment intolerance. Distinguishing between these causes is important to avoid unnecessary escalation of immune-suppressing therapy. Recently, two expert groups—GRAPPA and EULAR—proposed new frameworks to better define “difficult-to-treat” PsA by considering disease drivers rather than treatment failure alone. However, how these frameworks perform in real-world clinical practice has not been well studied. In this study, we applied both frameworks to a real-world cohort of PsA patients from the ORCHESTRA registry in Canada who were starting or changing advanced therapies. We compared how often each framework identified patients as difficult-to-manage, how much they overlapped, and whether they identified similar clinical profiles. We found that the GRAPPA framework identified more patients as having complex or difficult-to-manage PsA, capturing patients earlier in their disease course with fewer treatment failures. In contrast, the EULAR framework identified a smaller group with more advanced disease and greater treatment exposure. When focusing only on patients with true inflammatory treatment refractoriness, both frameworks identified exactly the same patients. Overall, our findings suggest that while GRAPPA and EULAR share similar goals, they identify different patient populations in practice. GRAPPA may be more useful for recognizing early disease complexity, while EULAR may better identify patients with established inflammatory refractoriness. These differences should be considered when using these frameworks in clinical care and research.
Keywords
Introduction
Psoriatic arthritis (PsA) is a chronic, multifaceted inflammatory disorder characterized by involvement of the joints, entheses, digits, axial skeleton, skin, and nails. 1 Beyond its musculoskeletal and dermatological manifestations, PsA imposes a substantial burden through a range of comorbidities, including fibromyalgia, depression, obesity, and cardiovascular disease, all of which contribute to significant health and psychosocial consequences.2–5 Despite notable advances in therapeutic options, a substantial proportion of patients remains difficult to treat (D2T). 6 The complexity of these D2T cases extends well beyond true inflammatory refractoriness, as non-inflammatory comorbidities frequently impact disease activity measures. In addition, treatment obstacles may arise from medication side effects, contraindications, or limited access. The existing literature on D2T PsA presents considerable heterogeneity in its definition, having variable nomenclature and threshold criteria, often centered on the number of failed therapies 7 and largely adapted from the European Alliance of Associations for Rheumatology (EULAR) D2T rheumatoid arthritis (RA) framework. 8 Consequently, most definitions of D2T PsA do not stratify patients according to their specific disease state, those with persistent inflammatory activity and those whose disease burden is predominantly driven by non-inflammatory factors; rather, they encompass all cases within a single D2T category.6,9–11 This conflation risks obscuring the underlying mechanisms of non-response, hindering the identification of distinct patient subgroups, and may lead to the inappropriate escalation of immunosuppressive therapy when inflammation is not the primary driver of disease activity
To address these challenges, recent initiatives from the EULAR and the Group for Research and Assessment of Psoriasis and Psoriatic Arthritis (GRAPPA) have focused on developing clearer and standardized definitions, which were recently published.12,13 Both frameworks introduced definitions that capture a broad category of patients, irrespective of the underlying driver of failure, encompassing not only biological non-response but also comorbidities and treatment-related adverse events, as well as a narrower category reflecting true inflammatory refractoriness. Although the specific criteria differ slightly, both frameworks share a similar conceptual structure. One key distinction, however, lies in their intended application: the EULAR criteria were primarily developed to facilitate the identification of more homogeneous patient populations for research, whereas the GRAPPA framework was designed to support clinical shared decision-making and application in observational studies.
Within the broad disease-state category, EULAR proposed difficult-to-manage PsA (D2M-PsA), defined by failure of ⩾2 advanced therapies with different mechanisms of action, together with patient- or physician-perceived inadequate control and unmet treatment targets. In parallel, GRAPPA introduced the broad category of complex-to-manage PsA (C2M-PsA), defined as persistent disease impact despite at least one adequate trial of advanced therapy
Within each framework, a narrower subset of true treatment-refractory cases requires objective evidence of active inflammation (clinical, laboratory, or imaging). EULAR and GRAPPA define this group as treatment-refractory PsA (TR-PsA). Although the criteria within these definitions differ slightly, the concepts are closely aligned. These consensus-based definitions represent an important advance, but their applicability and limitations in routine clinical practice remain untested. The primary objective of this study was to determine the applicability, prevalence, and agreement of TR-PsA and D2M/C2M-PsA EULAR and GRAPPA consensus definitions within a real-world cohort who are initiating a new advanced therapy. Secondary objectives included characterizing and comparing the demographic, clinical, and comorbidity profiles of patients who fulfilled versus did not fulfill these definitions.
Methods
Study design and population
This single-center observational study is a retrospective analysis of data from the prospective longitudinal cohort, the Ottawa Rheumatology CompreHEnSiveness TReatment and Assessment Clinic (ORCHESTRA), established in April 2022 at the Ottawa University Hospital. The reporting of this study adheres to the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines. 14
ORCHESTRA operates within a shared-care model with referring primary rheumatologists, and it includes patients with inflammatory arthritis: RA, PsA, and ankylosing spondylitis patients who are initiating or switching to a new biologic disease-modifying antirheumatic drug (bDMARD) or targeted synthetic disease-modifying antirheumatic drug (tsDMARD) therapies at the Ottawa Hospital Rheumatology Division. Patients are assessed at baseline (prior to therapy initiation), at 3- and 6-months following treatment initiation, and subsequently every 3 months until individualized treatment targets, defined as low disease activity or remission, are achieved. At each visit, a standardized comprehensive assessment is performed, including demographic characteristics, clinical and ultrasound disease activity measures, laboratory markers of inflammation (erythrocyte sedimentation rate (ESR) and C-reactive protein (CRP)), and comorbidities.
For the present analysis, all consecutive PsA patients enrolled in the ORCHESTRA cohort between April 2022 and September 2025 who met the eligibility criteria were included. As this was a retrospective, exploratory analysis of an ongoing registry, no formal sample size calculation was performed.
Inclusion criteria
Age ⩾18 years at the time of ORCHESTRA enrollment.
Confirmed diagnosis of PsA, as determined by the treating rheumatologist.
Enrollment in the ORCHESTRA cohort, which captures patients referred for initiation or switching of advanced therapy.
Availability of baseline clinical data required to apply EULAR and GRAPPA definitions.
Exclusion criteria
Diagnosis of inflammatory arthritis other than PsA
Age <18 years
Absence of sufficient baseline data to operationalize EULAR or GRAPPA definitions
The study was approved by the Ottawa Health Science Network Research Ethics Board (OHSN-REB 20220187-01H) and conducted in accordance with the Declaration of Helsinki and Good Clinical Practice guidelines. Written informed consent was obtained from all participants.
Operational definitions
Baseline was defined as the initial ORCHESTRA visit at which patients were assessed to determine whether they fulfilled the proposed definition criteria for both EULAR and GRAPPA based on historical treatment exposure, current clinical assessment, disease activity measures, and patient- and physician-reported perceptions of disease impact (Table 1).12,13 We adhered closely to the original definitions and their suggested approaches to operationalization, applying minor pragmatic adaptations only when explicit thresholds or measures were not specified in either framework.
Summary of proposed definitions for D2M/C2M-PsA and TR-PsA according to EULAR and GRAPPA frameworks.
This is multifactorial drivers beyond objective inflammation and include multiple drivers that are leading to the C2M state (e.g., fibromyalgia, mood/sleep disorders, obesity, OA/CPPD, access/adherence treatment side effects).
Providing that no other causes likely explain these laboratory findings.
Uveitis (ophthalmologic confirmation) or inflammatory bowel disease (clinical/ endoscopic/ histologic or fecal calprotectin evidence).
Exclusion criteria: Comorbidities or other conditions may contribute to symptoms or biomarker elevations, such as infection, obesity-related CRP, metabolic syndrome, dyslipidemia, fibromyalgia/central sensitization, generalized hypermobility, or mechanical/degenerative pain, gout or CPPD, anxiety, and depression. These factors should be recognized as potential contributors, but do not preclude TR-PsA classification if PsA activity is the primary driver of disease manifestations. Exclusion should occur when these conditions better explain the clinical picture.
BASDAI, bath ankylosing spondylitis disease activity index; bDMARD, biologic disease-modifying antirheumatic drug; BSA, body surface area; C2M-PsA, complex-to-manage psoriatic arthritis; CRP, C-reactive protein; D2M-PsA, difficult-to-manage psoriatic arthritis; DAPSA, disease activity in psoriatic arthritis score; ESR, erythrocyte sedimentation rate; EULAR, European Alliance of Associations for Rheumatology; GRAPPA, Group for Research and Assessment of Psoriasis and Psoriatic Arthritis; IBD, inflammatory bowel disease; MDA, minimal disease activity; MRI, magnetic resonance imaging; PhGA, physician global assessment; PsA, psoriatic arthritis; PtGA, patient global assessment; SJC, swollen joint count; SPARCC, spondyloarthritis research consortium of Canada enthesitis index; TJC, tender joint count; TR-PsA, treatment-refractory psoriatic arthritis; tsDMARD, targeted synthetic DMARD; WBC, white blood cell count.
Both the EULAR and GRAPPA frameworks are conceptually designed to identify patients with PsA who are challenging to manage, distinguishing between predominantly non-inflammatory complexity (D2M-PsA, EULAR; or C2M-PsA, GRAPPA) and true inflammatory treatment refractoriness (TR-PsA). Accordingly, operationalization incorporated three core domains applicable to all patients, with two additional criteria applied to identify the subset with true inflammatory TR-PsA.
Prior advanced therapy failure
The first requirement was a history of failure of prior advanced therapy after an adequate therapeutic trial, guided by EULAR and GRAPPA PsA treatment recommendations.15,16 Advanced therapies included tumor necrosis factor inhibitors (TNFi), interleukin-17 inhibitors (IL-17i), interleukin-12/23 inhibitors (IL-12/23i), and Janus kinase inhibitors (JAKi). A minimum treatment duration of approximately 12 weeks was required, in line with treat-to-target consensus principles, which recommend early assessment for improvement at around 3 months and target attainment by 6 months. This minimum duration was used to preserve inclusion, recognizing that in routine clinical practice, treatment switching may occur earlier in cases of clear non-response. Treatment duration varied across the cohort.
Persistent disease activity
The second requirement was evidence of persistent disease activity despite therapy, assessed using clinical, laboratory, and imaging measures, with thresholds applied as specified in the original EULAR and GRAPPA operationalizations. For axial disease, EULAR refers to inflammatory back pain without specifying a quantitative activity measure, whereas GRAPPA suggests a Bath Ankylosing Spondylitis Disease Activity Index (BASDAI) ⩾ 4. Given the retrospective nature of data capture and the absence of a validated axial PsA activity instrument, the BASDAI ⩾ 4 threshold was pragmatically applied to both definitions to ensure consistency.
Perception of problematic disease
The third domain captured the perception of disease as problematic from both the patient and physician perspectives. In accordance with GRAPPA guidance, patient and physician global assessments were used, applying a cutoff of ⩾4, acknowledging that this threshold is pragmatic and not formally validated.
Additional criteria for identification of treatment-refractory PsA (TR-PsA)
Two additional criteria were required to classify patients as having true inflammatory TR-PsA. These included objective evidence of ongoing inflammation and exclusion of non-inflammatory drivers of symptoms. Patients whose treatment changes were primarily due to contraindications or intolerance, or whose disease activity was predominantly attributable to non-inflammatory conditions such as osteoarthritis or fibromyalgia, were not classified as having inflammatory treatment-refractory disease.
Musculoskeletal ultrasound assessment
Musculoskeletal ultrasound (US) examinations were performed at the end of each clinical visit as part of routine clinical care by a rheumatologist experienced in musculoskeletal sonography, who was not involved in D2T subgroup classification, as D2T phenotypes were applied retrospectively. All scans were conducted using a Logiq E10 ultrasound system (GE Healthcare, Milwaukee, WI, USA) equipped with a broadband 6–20 MHz linear probe.
A standardized scanning protocol was followed, obtaining dorsal and palmar views in a neutral joint position. A total of 46 joints were examined bilaterally: metacarpophalangeal (MCP) joints 1–5, proximal interphalangeal (PIP) joints 1–5, distal interphalangeal (DIP) joints 1–5, wrists, elbows, knees, ankles, and metatarsophalangeal (MTP) joints 2–5. Power Doppler (PD) settings were standardized with a pulse repetition frequency of 500 Hz, low wall filter, and gain adjusted until background signal disappearance. Synovitis was defined according to the OMERACT US Task Force definitions. A semi-quantitative scoring system (0–3) was used for grading both gray-scale (GS) and PD components 17 :
GS synovitis: 0 = none; 1 = mild; 2 = moderate; 3 = marked synovial hypertrophy.
PD signal: 0 = no flow; 1 = ⩽ 2 vessel (or signals) in small joints / ⩽ 3 in large joints; 2 = PD signal > grade 1 but < 50% of synovial area; 3 = PD signal occupying > 50% of the GS synovial area.
A combined OMERACT–EULAR synovitis score (0–3) was derived for each joint based on GS and PD findings, and the global OMERACT–EULAR Synovitis Score (GLOESS) was calculated by summing combined synovitis scores across the 46 assessed joints (range 0–138). 18
Enthesitis was assessed by the GS features (hypoechogenicity, thickening, enthesophytes, erosions, and calcifications) and PD signal at enthesis sites. All findings were scored on a scale between 0 and 3. Relevant features were summed to achieve an inflammatory and chronicity score, as described in our group’s previous publications. 19
In this analysis, the criteria component of objective sonographic inflammation was defined as follows:
Ultrasound synovitis: gray-scale (GS) score ⩾1 AND PD score ⩾1
Ultrasound tenosynovitis: GS score ⩾1 and PD score ⩾1
Ultrasound enthesitis: presence of hypoechogenicity or entheseal thickening, and PD signal
Statistical analysis
Descriptive statistics were employed to summarize the demographic and clinical characteristics of the patients. The normality of data distribution was assessed using the Kolmogorov–Smirnov test. Continuous variables were presented as mean ± standard deviation or median with interquartile range (IQR), depending on the distribution. Categorical variables were expressed as frequencies and percentages. Between-group comparisons were performed using the Chi-square test or Fisher’s exact test for categorical variables, and the independent Student’s t-test or Mann–Whitney U test for continuous variables, as appropriate. The agreement between the EULAR and GRAPPA definitions was evaluated using kappa analysis. The strength of agreement was interpreted according to the guidelines proposed by Landis and Koch: 0.01–0.20 slight, 0.21–0.40 fair, 0.41–0.60 moderate, 0.61–0.80 substantial, and 0.81–1.00 almost perfect. A p-value <0.05 was considered statistically significant. 20 Given the limited sample size and low number of outcome events within several subgroups, analyses were restricted to descriptive summaries and unadjusted comparisons only. No multivariable regression analyses were performed. These analyses were considered exploratory and hypothesis-generating. All statistical analyses were conducted using SPSS (version 22.0; IBM Corp., Armonk, NY, USA).
Results
Baseline characteristics and ultrasound findings
Seventy-six patients with PsA were enrolled in the study, of whom 42 (55.3%) were female. The mean age was 48.3 ± 12.7 years. The median disease duration was 4.0 years (IQR, 1.0–14.8 years), and the median symptom duration was 8.0 years (IQR, 3.0–18.0 years). The most common clinical phenotype was symmetric polyarthritis, observed in 64 patients (84.2%), while enthesitis and dactylitis were observed in 39 (51.3%) and 42 (55.3%) patients, respectively. Ultrasound assessment revealed a median GLOESS score of 19.0 (IQR, 11.0–28.0), indicating moderate synovial inflammation. The median Enthesitis Inflammation Score was 6.0 (IQR, 2.0–11.0), with low median enthesis Doppler activity of 0.0 (IQR, 0.0–1.0). Additional sociodemographic, clinical, and ultrasound findings across definitions are summarized in Table 2.
Baseline sociodemographic, clinical characteristics, and ultrasound scores of PsA patients in the ORCHESTRA cohort and across different definitions.
Median (IQR).
Mean ± SD.
n (%).
BASDAI, Bath Ankylosing Spondylitis Disease Activity Index; BSA, body surface area; csDMARDs, conventional synthetic disease-modifying antirheumatic drugs; GLOESS, global OMERACT–EULAR synovitis score; HAQ, health assessment questionnaire; SPARCC, Spondyloarthritis Research Consortium of Canada enthesitis index; VAS, visual analog scale.
EULAR and GRAPPA definitions’ prevalence, and agreements
According to the EULAR definition, 16 patients (21.1%) met D2M-PsA criteria, including one patient (1.3%) with D2M alone and 14 (18.4%) who met the criteria for TR-PsA. Overall, 60 (78.9%) were non-D2M-PsA. Based on the GRAPPA definition, 35 patients (46.1%) were categorized as having C2M-PsA, including 21 (27.6%) who had C2M without objective inflammation (non-TR-PsA), and 14 (18.4%) who met the criteria for TR-PsA, whereas the remaining 41 (53.9%) were classified as non-C2M PsA.
There was moderate agreement between the EULAR D2M-PsA and GRAPPA C2M-PsA classifications (κ = 0.476). All patients classified as D2M-PsA by EULAR were also categorized as C2M-PsA by GRAPPA; however, 19 of the 35 patients (54.2%) identified as C2M-PsA by GRAPPA were not identified as D2M-PsA under EULAR criteria because they did not meet the required entry criteria of failure of at least two advanced therapies with different MOA. The absolute agreement between the two definitions was 75% (Table 3).
Agreement between GRAPPA C2M-PsA and EULAR D2M-PsA definitions, and between TR-PsA GRAPPA and EULAR definitions.
C2M-PsA, complex-to-manage psoriatic arthritis; D2M-PsA, difficult-to-manage psoriatic arthritis; EULAR, European alliance of associations for rheumatism; GRAPPA, group for research and assessment of psoriasis and psoriatic arthritis; TR-PsA, treatment-refractory psoriatic arthritis.
In contrast, nearly perfect agreement was observed for the classification of TR-PsA between EULAR and GRAPPA frameworks (κ = 1.00). The same 14 patients were identified as TR-PsA by both EULAR and GRAPPA, yielding 100% absolute agreement (Table 3). The degree of overlap between EULAR and GRAPPA classification frameworks is depicted in Figure 1.

Nested Overlap of EULAR and GRAPPA Definitions of D2M/C2M PsA and TR-PsA Within the ORCHESTRA Cohort. Numbers indicate the number of patients at each step of the nested overlap (none, GRAPPA-only, D2M-only, and TR-PsA overlap).
Comparative characteristics by D2M-PsA, C2M-PsA, and TR-PsA status
Baseline demographic, clinical, and ultrasound characteristics according to EULAR D2M-PsA and GRAPPA C2M-PsA classifications are summarized in Table 2, with unadjusted comparisons available in Table 4.
Comparison of demographic, clinical, and ultrasound characteristics between D2M/C2M PsA patients according to EULAR and GRAPPA definitions.
Median (IQR).
Mean ± SD.
n (%).
Statistically significant values (p < 0.05) are shown in bold.
BASDAI, bath ankylosing spondylitis disease activity index; B-mode, gray-scale ultrasound score; BSA, body surface area; C2M-PsA, complex-to-manage psoriatic arthritis; CRP, C-reactive protein; csDMARDs, conventional synthetic disease-modifying antirheumatic drugs; D2M-PsA, difficult-to-manage psoriatic arthritis; DIP, distal interphalangeal; ESR, erythrocyte sedimentation rate; EULAR, European alliance of associations for rheumatology; GLOESS, global OMERACT–EULAR synovitis score; GRAPPA, group for research and assessment of psoriasis and psoriatic arthritis; HAQ, health assessment questionnaire; MDA, minimal disease activity; NiPsA, non-inflammatory psoriatic arthritis; PiPsA, persistent inflammatory psoriatic arthritis; PsA, psoriatic arthritis; SPARCC, spondyloarthritis research consortium of Canada score; TR-PsA, treatment-refractory psoriatic arthritis; VAS, visual analog scale.
Across both frameworks, D2M-PsA and C2M-PsA patients had significantly longer disease and symptom durations and greater prior exposure to advanced therapies than those not meeting the definitions (all p ⩽ 0.002; Table 4). No significant differences were observed in age, sex, body mass index, fibromyalgia, depression, inflammatory markers, clinical joint counts, patient- and physician-reported disease activity, functional status, or ultrasound measures of synovitis and enthesitis across either definition.
Cutaneous and nail psoriasis were numerically more frequent among D2M or C2M patients, whereas dactylitis was more commonly observed in non-D2M patients, particularly in the GRAPPA comparison.
Because EULAR and GRAPPA identified an identical group of patients with TR-PsA, baseline characteristics for TR-PsA were examined relative to non-TR-PsA patients irrespective of framework (Table 4). As expected, TR-PsA patients also demonstrated longer disease and symptom duration and greater prior exposure to advanced therapies. In contrast, measures of inflammation, including clinical assessment, US, and laboratory testing, were largely comparable between TR-PsA and non-TR-PsA patients.
Then we further analyzed patients captured by GRAPPA C2M-PsA but not by EULAR (the discordant group). This discordant group exhibited worse patient- and physician-reported global assessments and greater functional impairment compared with patients not meeting either definition (non-D2T disease). On the other hand, no differences were observed in clinical and ultrasound measures of activity between the discordant and EULAR D2M-PsA groups. Together, these patterns suggest that the discordant GRAPPA-C2M group is more similar to the D2M phenotype rather than to non-D2T disease. Details are presented in the Table S1.
Discussion
Despite the increasing number of therapy options becoming available over the last two decades, D2T-PsA remains a major challenge in clinical practice with a substantial burden on healthcare systems.21,22 Few prior studies have examined D2T-PsA6,7,9–11; however, most have relied on adaptations of the D2T-RA framework.8,23 Such adaptations do not fully address the multifaceted nature of PsA, including its multidomain involvement and high burden of comorbidities, nor do they adequately distinguish inflammatory from non-inflammatory disease drivers. These limitations may hinder the implementation of targeted, holistic management approaches and contribute to suboptimal achievement of treatment targets. To address this unmet need, EULAR and GRAPPA have recently developed PsA-specific D2T frameworks to stratify complex cases by disease drivers, sharing a common conceptual goal but differing in their operationalization.12,13 To the best of our knowledge, this study is the first to apply both the EULAR and GRAPPA consensus definitions of D2T-PsA in a real-world cohort, providing empirical insight into their clinical applicability, overlap, and performance.
Our findings revealed key differences in patient capture, particularly within the broader, non–inflammation-restricted classifications. The GRAPPA C2M-PsA framework, by requiring failure of only one advanced therapy, identified a larger proportion of patients. In contrast, the EULAR D2M-PsA criteria applied stricter entry thresholds, resulting in a smaller cohort and failing to capture more than half of the patients identified by GRAPPA, yielding only moderate agreement between the two definitions. Importantly, all patients classified by EULAR were also captured by GRAPPA, indicating complete inclusion of the EULAR-defined group within the GRAPPA framework.
When comparing patients identified by the two frameworks (EULAR D2M-PsA and GRAPPA C2M-PsA), notable differences emerged. Within the more stringent EULAR D2M-PsA cohort, the majority of patients were also classified as TR-PsA (87.5%), whereas only 42% of GRAPPA C2M-PsA patients met TR-PsA criteria. This finding indicates that the EULAR definition preferentially enriches for patients with true inflammatory refractoriness, while the broader GRAPPA framework captures a wider spectrum of disease complexity beyond biologic non-response. Moreover, the trend toward the EULAR D2M-PsA group showing a greater concentration of multidomain PsA manifestations, including enthesitis, dactylitis, uveitis, psoriasis, inflammatory bowel disease, and gout, than the GRAPPA C2M-PsA group, may suggest that patients meeting the more stringent EULAR definition could represent a more biologically complex and systemically involved PsA phenotype. However, given the smaller size of the EULAR-defined group, this apparent enrichment may partly reflect a denominator effect and should be interpreted cautiously as hypothesis-generating.
Taken together, these findings raise the question of whether the GRAPPA framework may be overly inclusive. To explore this, we examined the discordant group captured by GRAPPA but not by EULAR and assessed whether this subset more closely resembled patients with D2M disease or those without D2T features. We found that the discordant group exhibited characteristics similar to those of the D2M-PsA patients with respect to clinical disease activity and ultrasound inflammation. In contrast, compared with non-D2T patients, the discordant group had worse patient- and physician-reported outcomes and functional impairment, despite similar objective inflammatory measures. This pattern supports the interpretation that GRAPPA may identify complex patients earlier along a disease trajectory, rather than inappropriately inflating the D2T population. Indeed, this broader inclusiveness is consistent with the GRAPPA task force’s stated intent, which acknowledges C2M-PsA as a dynamic, heterogeneous, and multifactorial state, and by requiring failure of only one approved therapy, the GRAPPA definition intentionally captures a spectrum ranging from patients with longstanding complexity to those who may demonstrate unexpectedly favorable responses to intervention early on.
Therefore, in clinical practice, EULAR and GRAPPA definitions appear to serve complementary rather than interchangeable roles. The GRAPPA C2M-PsA framework is more inclusive and potentially clinically meaningful for real-world practice, on the basis of discordant patients being more similar to D2M patients for the patient-reported outcomes and functional status. This framework may be capturing patients with early or emerging disease complexity driven by multidomain involvement, comorbidities, and functional impairment, even after limited therapeutic exposure. In routine care, this broader framework may help flag patients who require multidisciplinary management, reassessment of treatment goals, and optimization of non-inflammatory contributors before further immunologic escalation. In contrast, the EULAR D2M-PsA framework preferentially enriches for patients with multidomain disease in whom inflammatory refractoriness remains clinically relevant. This greater stringency may be advantageous by concentrating patients with D2T characteristics, thereby yielding a more homogeneous cohort and facilitating patient selection in research, consistent with the EULAR main objective in developing these criteria. On the other hand, in clinical practice, it may simply indicate that patients warrant more urgent reassessment of treatment response and consideration of alternative therapeutic strategies when similar patient profiles are encountered. Notably, this increased precision comes at the cost of reduced inclusiveness, underscoring the importance of selecting definitions based on the intended clinical or research context.
With respect to definitions intended to identify true inflammatory biologic non-response, both EULAR and GRAPPA demonstrated perfect agreement in classifying patients as TR-PsA. Despite this concordance, the two frameworks are not necessarily interchangeable, as they differ in how objective inflammation is conceptualized and operationalized. EULAR permits broader clinical indicators of inflammation, including tender or swollen joints, tender entheses, and inflammatory back pain, whereas GRAPPA emphasizes more specific clinical findings such as swollen joints, dactylitis, or defined psoriasis body surface area thresholds, and relies more heavily on imaging for domains such as axial disease and enthesitis, where clinical examination alone may lack specificity. In the present cohort, the absence of patients with isolated axial or entheseal-predominant disease, together with the modest sample size, may have limited our ability to detect meaningful differences arising from these operational distinctions. Larger and more heterogeneous cohorts enriched for axial and entheseal phenotypes are needed to determine whether these definitional differences translate into clinically meaningful variation in patient classification.
Notably, compared with non-TR-PsA patients, those classified as TR-PsA did not exhibit higher clinical or ultrasound inflammatory activity. This underscores the need to interpret inflammatory activity within the broader context of disease trajectory, cumulative therapeutic exposure, and patterns of treatment failure—including primary versus secondary non-response—and to more precisely delineate what constitutes true inflammatory activity in PsA across its domains. In this context, recent work by Zabotti et al. introduced an imaging-based framework distinguishing Persistent Inflammatory PsA (PiPsA) and Non-Inflammatory PsA (NiPsA) as mutually exclusive strata based on objective evidence of inflammation on imaging. 24 While conceptually aligned with GRAPPA and EULAR, this approach may serve as a subsequent refinement, enabling clearer stratification into exclusively inflammatory versus non-inflammatory disease and facilitating more targeted, mechanism-driven treatment strategies. Accordingly, future work will focus on applying the PiPsA/NiPsA framework within larger, real-world cohorts to evaluate its added value beyond existing D2T classifications.
Limitations
While this study provides valuable real-world insights, several limitations should be acknowledged. First, the modest sample size limits the statistical power and the strength of inferences drawn from subgroup comparisons. In addition, prevalence estimates reflect patients referred for initiation or switching of advanced therapy in a tertiary care setting and may not capture individuals who were unable to start or escalate therapy due to comorbidities, contraindications, or access-related barriers. Finally, although this analysis was conducted within a prospective cohort, application of the D2T definitions was retrospective. As such, pragmatic proxies—such as BASDAI to infer axial involvement—were used in the absence of real-time clinical adjudication, which may be less accurate or may not reliably reflect active inflammatory disease when confounding factors are present. In patients with concurrent activity across multiple disease domains or with psychosocial comorbidities, this approach may further limit the ability to definitively identify the dominant driver of treatment failure or therapy
Conclusion
This study provides the first real-world comparison of the EULAR and GRAPPA frameworks for D2T PsA, demonstrating shared conceptual goals but distinct clinical performance. Differences were most apparent within the non–inflammation-restricted classifications, where GRAPPA identified a broader and clinically meaningful population, reflecting greater inclusiveness and earlier capture of disease complexity, whereas EULAR identified a more concentrated subset enriched for multidomain involvement and inflammatory refractoriness after multiple therapeutic exposures. Although both frameworks identified the same patients when applying inflammation-restricted definitions for TR-PsA in this cohort, differences in their operationalization preclude assuming interchangeability. Larger prospective studies are needed to formally assess performance and to refine stratified, mechanism-driven management strategies in PsA.
Supplemental Material
sj-docx-1-tab-10.1177_1759720X261443557 – Supplemental material for Real-world performance of EULAR and GRAPPA definitions of difficult-to-treat psoriatic arthritis within the ORCHESTRA cohort: an observational cohort study
Supplemental material, sj-docx-1-tab-10.1177_1759720X261443557 for Real-world performance of EULAR and GRAPPA definitions of difficult-to-treat psoriatic arthritis within the ORCHESTRA cohort: an observational cohort study by Rahaf Zyad Attar, Seyyid Bilal Acikgoz, Elliot Hepworth, Ozun Bayindir Tsechelidis, Ricardo Sabido-Sauri, Tara Swami and Sibel Zehra Aydin in Therapeutic Advances in Musculoskeletal Disease
Supplemental Material
sj-docx-2-tab-10.1177_1759720X261443557 – Supplemental material for Real-world performance of EULAR and GRAPPA definitions of difficult-to-treat psoriatic arthritis within the ORCHESTRA cohort: an observational cohort study
Supplemental material, sj-docx-2-tab-10.1177_1759720X261443557 for Real-world performance of EULAR and GRAPPA definitions of difficult-to-treat psoriatic arthritis within the ORCHESTRA cohort: an observational cohort study by Rahaf Zyad Attar, Seyyid Bilal Acikgoz, Elliot Hepworth, Ozun Bayindir Tsechelidis, Ricardo Sabido-Sauri, Tara Swami and Sibel Zehra Aydin in Therapeutic Advances in Musculoskeletal Disease
Footnotes
Acknowledgements
The authors would like to express their sincere gratitude to all patients who contributed their time and data to the ORCHESTRA registry, making this research possible. We also thank the dedicated clinical and research teams involved in data collection, management, and coordination of the registry for their ongoing commitment to advancing inflammatory arthritis research.
Declarations
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
