Development,initial validation and reliability testing of a web-based,generic feline health-related quality-of-life instrument

Abstract

Objectives

The objective of this study was to develop a valid, reliable, web-based generic feline health-related quality-of-life (HRQoL) questionnaire instrument to measure the affective impact of chronic disease.

Methods

A large initial item pool, obtained through interviews with cat owners, was reduced using predetermined criteria, survey scores for relevance and clarity, and the ability of individual items to discriminate between healthy and sick cats when owners completed a prototype questionnaire. Using these data, factor analysis was used to derive a scoring algorithm and provide evidence for factorial validity. Validity was demonstrated further in a field trial using a ‘known groups’ approach (sick vs healthy cats will have a different HRQoL profile, and the HRQoL profile of cats will deteriorate as comorbidities increase). Test–retest reliability was assessed using intra-class correlation coefficients (ICCs).

Results

In total, 165 items were reduced to 20 and, on the basis of a factor analysis that explained 72.3% of the variation in scores input by 71 owners of 30 healthy and 41 sick cats using the prototype, these were allocated to three domains (vitality, comfort and emotional wellbeing [EWB]) with a scoring algorithm derived using item loadings. Subsequently, the owners of 36 healthy and 58 sick cats completed one or two (n = 48) assessments. Median scores (healthy vs sick) for all domains were significantly different (P <0.001), 78% of cats were correctly classified as healthy or sick and for comorbidities the correlation coefficients were moderate (vitality 0.64; comfort 0.63; EWB 0.50). Test–retest reliability was good (ICC vitality 0.635; comfort 0.716; EWB 0.853).

Conclusions and relevance

This study provides initial evidence for the validity and reliability of a novel HRQoL instrument to aid the assessment and management of chronic diseases of cats.

Introduction

It is the unpleasant feelings (affective component) associated with chronic disease that cause an individual to suffer. The medical profession recognises the importance of the valid and reliable measurement of how people feel and has addressed this through the development of instruments to measure health-related quality of life (HRQoL) for disease detection (discriminative purposes) or to measure change in health status over time (evaluative purposes).¹ Structured questionnaire instruments are developed and tested using well-established psychometric methodology.^2–4

Instruments to measure HRQoL in companion animals consist of questions for the owner, who is well placed to report on the subtle changes in behaviour, attitude and demeanour that occur with chronic disease. While several feline disease-specific instruments exist,^5–9 to date no validated generic HRQoL instrument exists for the purpose of comparing treatments or disease states.¹⁰ A generic instrument (CHEW [Cat HEalth and Wellbeing]), based on owner-perceived health status has been reported,¹¹ but not validated, in sick cats. Similarly CatQol (Bijsmans) focuses on general health, eating, behaviour and management, but has been validated only in cats with chronic kidney disease (CKD).¹² More recently, Tatlock et al have described an owner-reported feline quality-of-life (QoL) scale for healthy cats.¹³

Briefly, the psychometric approach to instrument design consists of the generation of a pool of items (questions), most often through interviews with key informants, reduction of these by various techniques, including expert judgement of the relevance and adequacy of items,¹⁴ the identification of items that do not discriminate well between known groups of subjects,¹⁵ and the use of a statistical technique called factor analysis (FA),¹⁶ before pretesting and then testing for validity and reliability.

Evidence for any new instrument’s validity and reliability is essential before use in a clinical context. Various kinds of validity may be sought. For example, content validity is a measure of the extent to which the items included in a questionnaire are relevant and adequate for its purpose: it is established during its construction and assessed by expert judgement. Criterion validity is the agreement of a new instrument with some existing ‘gold standard’, but where that does not exist, evidence can be gathered to support concurrent criterion validity (comparison with a validated measure of a related construct) or predictive criterion validity where performance of the new measure successfully predicts that of a later measure. Construct validity – evidence that the instrument is measuring the construct that it is intended to measure – is considered to be the most robust and fundamental form of validity.¹

A construct is something that is not directly observable or measurable, such as ‘happiness’. The construct being measured here is HRQoL, which is the subjective evaluation by an individual of its circumstances, which include an altered health state and the impact of related interventions.¹⁷ Construct validity is established by a process of hypothesis testing, where hypotheses are based upon how an instrument should perform if it is measuring the construct of interest. For example, factorial validity applies if FA of data generated using the instrument reveals an interpretable factor structure that fits the construct the instrument was designed to measure.¹⁸ In a ‘known-groups’ approach to construct validation, predictions are made about how scores obtained with the instrument will differ between groups, such as healthy and sick animals, or will reflect disease burden, and these predictions are tested.

A reliable instrument will produce the same score when an unchanging subject is measured at two time points by the same observer (repeatability/intra-rater reliability), or when two people measure the same subject at one time (reproducibility/inter-rater reliability).²

Previously a novel generic instrument to measure HRQoL in dogs was developed in which most of the items reported aspects of behaviour that owners believed were expressions of a dog’s subjective experience (feelings),^17,19 and evidence for the validity and reliability of a web-based version was reported.¹⁵ Subsequent shortening resulted in a 22-item instrument that retains the capacity of the prototype to measure the animal’s feelings.²⁰ The aim of this study was to develop an equivalent generic instrument for cats and to provide first evidence for its content validity, construct validity and intra-rater reliability.

Materials and methods

Ethical approval was granted by the University of Glasgow, and all participants gave informed consent.

Item generation and initial selection

Semi-structured interviews were conducted with the owners of healthy cats and cats with conditions likely to affect QoL, recruited through the University of Glasgow Small Animal Hospital (UGSAH) and several veterinary practices. The interviews were recorded and transcribed verbatim to generate items consisting of terms and phrases used by interviewees to describe their cats when healthy and sick. Interviews were continued until no new information emerged and redundancy was reached.^2,10 Questions (open and closed) were worded carefully to limit response bias.^21,22 Qualitative analysis of the transcripts was conducted using established methods in grounded theory, a methodology commonly used in the social sciences that involves the gathering and analysis of data to construct a theoretical framework for whatever is being studied. This is in contrast to conventional methods, which adopt an existing theoretical framework, and then collect data to show whether or not it applies to the phenomenon being studied.²³

Each item was considered by the authors and excluded if it was deemed to be related to individual personality traits; disease-specific; lacking in clarity/readability; more relevant to clinical examination rather than owner report; not relevant to HRQoL; or where a more appropriate description of that observation had been offered.

Content validation

An online survey (SurveyMonkey, San Mateo, CA, USA) of the remaining items was conducted in which groups of cat owners and clinicians were asked to judge their clarity and their relevance to the measurement of feline HRQoL. Relevance was scored using a 4-point Likert scale (0 = ‘not relevant’, 3 = ‘very relevant’).²⁴ The clarity of each item was determined using binary response options, ‘clear or not clear’.²⁵ Participants were asked for feedback on why they found an item not relevant or unclear, and were invited to suggest additional items.

Degree of relevance scores were then dichotomised by assigning scores of 0 and 1 as ‘not relevant’ and scores of 2 and 3 as ‘relevant’. Content validity index scores for relevance (I-CVI_R) and clarity (I-CVI_C) of each item were derived by averaging the scores given to each item and dividing by the number of respondents.²⁴ Items were excluded if I-CVI_R ⩽0.60 or considered for revision/exclusion if I-CVI_C <0.70.

Prototype construction and pre-testing

In the prototype instrument, each item was accompanied by a 7-point Likert scale (0 = ‘not at all’, 6 = ‘couldn’t be more’) to allow the respondent to rate the extent to which the item described his or her cat. Some of these anchors were re-worded to suit the form (word or phrase) of the item.

Software developers, Kyria Ltd (www.kyria.co.uk) produced a web-based prototype instrument for pre-testing with a number of cat owners. The prototype was revised as required to ensure optimal utility, functionality and lack of ambiguity.

To compare owners’ impressions of health status with that of clinicians, which has been found to differ in the dog according to previous work (unpublished), an additional owner question, ‘Is your cat perfectly healthy?’ – Yes/No, was included in the prototype instrument but did not form part of the assessment.

Field test 1 for item reduction, factorial validation and determination of scoring algorithm

Cat owners were recruited from first-opinion practices, feline specialist practices and the UGSAH. Owners completed one assessment for their cat and the attending clinician completed a general health assessment (supplementary material Appendix S1) to verify the cat’s health status.

The research team reviewed dot plots of the response scores (0–6) generated for each of the items and eliminated any item that was judged by all not to discriminate well between healthy and sick cats.

To establish evidence for factorial validity, and to determine a scoring algorithm for the instrument, a FA (principal components method with a varimax rotation) was performed with remaining items. The scores attributed to each item by the owners were used for the FA. The analysis allocates each item to a factor with a loading (0–1), which determines the closeness of its relationship to the factor. Resulting loadings were sorted, and any item with a loading <0.4 was excluded. A scree test and the Kaiser criterion were used to identify the optimum number of factors and the interpretability of a range of factor models was examined. Factors were interpreted based on how those items loading onto a particular factor were related, and a factor model was chosen that accounted for an acceptable amount of the variability in the data, was readily interpretable and did not include any factors containing only one or two items. An algorithm, based on the item–factor associations of the selected factor model, was derived in order to generate a domain score for each of the resultant factors/domains.

Field test 2 for construct validity and reliability testing

A new group of cat owners was recruited from first opinion practices, feline specialist practices and the UGSAH. The attending clinician completed a general health assessment (supplementary material Appendix S1) to verify the cat’s health status.

Owners of healthy and sick cats, grouped according to the clinical judgement of the consulting clinician, completed at least one assessment. A number of owners of healthy cats completed two assessments, 2 weeks apart, and test–retest reliability was assessed using the intra-class correlation coefficient (ICC). A one-way random model was assumed where the subjects (cats) are assumed random.²⁶

Using the first assessment for each cat, box plots and descriptive statistics were used to identify differences between healthy and sick cats, followed by formal statistical analysis using non-parametric Mann–Whitney tests owing to the non-normality of the data. Linear discriminant analysis was used to determine the ability of the instrument to differentiate healthy from sick cats. The correlation between the number of comorbidities affecting each cat and their HRQoL scores was investigated using linear regression and Pearson correlation coefficient for all three domains was calculated for healthy cats, cats with 1–2 comorbidities and cats with ⩾3 comorbidities.

The following hypotheses were tested: (1) that the HRQoL profile of scores will differ between healthy cats and sick cats; and (2) that the HRQoL profile will be worse for cats with poorer health status, as defined by the number of comorbidities present in individuals.

Results

An overview of the development process is shown in Figure 1.

Figure 1

Summary of the study design for developing a generic health-related quality-of-life (HRQoL) instrument for cats

Item generation and initial selection

Semi-structured interviews conducted with the owners (n = 18) of healthy (n = 19) and sick (n = 10) cats generated an initial pool of 165 items for consideration (Table 1). Table 2 illustrates the format of interview questions. One hundred and six items met our criteria for exclusion or revision (for examples see Table 3). Fifty-one items were retained for content validation.

Table 1

Details of owners and cats involved in different stages of instrument development and validation including semi-structured interviews, field test 1 (item reduction and scoring algorithm generation) and field test 2 (validity and reliability testing)

Semi-structured interview	Field test 1 – online testing of prototype instrument	Field test 2 – testing for validity and reliability
• Cat owners (18: 5 males; 13 females) • Single and multi-cat households (2–4 cats) • Cat age range 1.75–21 years • 19 healthy and 10 sick (four OA and hyperthyroidism; two OA and CKD; four OA only)	• 71 single owner assessments (30 healthy cats; 41 sick cats) (43 males; 28 females) • Mean age of healthy cats 6.5 years (range 0.3–16.5 years) • Mean age of sick cats 11.5 years (range 1.2–19.5 years) • 66% DSH; 34% purebred (eg, Maine Coon, Persian, Siamese, etc) • 95% comorbidities (see Table 5 for conditions) • 29% of owners of unhealthy cats misclassified their cats as being healthy – in contrast to clinical assessment	• 94 single owner assessments (26 healthy cats; 58 sick cats) (48 male cats; 46 female cats) • 48 repeat assessments • Mean age of healthy cats 4.6 years (range 1–10 years) • Mean age of sick cats 11.7 years (range 1.1–19.9 years) • 86% DSH; 14% purebred • 72% comorbidities (see Table 5 for conditions) • 26% of owners of unhealthy cats misclassified their cats as being healthy – in contrast to clinical assessment

Demographics of cats include age, health status and presenting conditions, sex and breed. Misclassification between owner impression and clinician report of health status is reported for each study

OA = osteoarthritis; CKD = chronic kidney disease; DSH = domestic shorthair

Table 2

Examples of questions asked of owners during semi-structured interviews

Examples of questions asked of cat owners
Please describe your cat’s daily routine
Can you tell how your cat is feeling, and, if so, describe how?
How do you know when your cat is unwell? Healthy? Feeling happy?
How did you first know your cat was unwell? Were there any behavioural changes you noticed specifically?
How do you monitor that the disease is getting worse?
How do you know that treatment is working? Or not working?
What areas of your cat’s life are most impacted by the condition?

Table 3

Number of potential items reported by cat owners that were eliminated or revised throughout development following review by the research team (CN, LW, MS, AN, JR), including the rationale and select examples

Number of potential items	Rationale	Examples
21 eliminated	Explicitly described underlying personality traits	‘Gentle’, ‘mischievous’, ‘bold’
22 eliminated	Specific to one disease	‘Yowling’, ‘needing manual evacuation’, ‘night howling’
	Clinical potential items	‘Doesn’t like joint manipulation at the vet’, ‘muscle wastage’, weight loss/gain’
	Specific to individual cat or not easily recognisable	‘Whiskers fanned out’, ‘easy to give medication to when she’s feeling well’, ‘head butts’
	Not relevant to measuring HRQoL	‘Runs away after difficulty giving him his medication’, ‘cloudy eyes’, ‘bright eyes’
63 eliminated	True synonyms/more appropriate descriptor commonly used	‘Friendly’ was most commonly reported and synonym of ‘sociable’, ‘follows me around’ and ‘come to greet you’
	More appropriate descriptor available	‘Content’ was chosen instead of ‘chilled’ as it was believed more appropriate for a wider audience
	Another descriptor would adequately capture a behaviour over one that is ‘too specific’	‘Interested in his/her food’ was retained, covering ‘loss of appetite’, ‘enjoying food’, ‘less hungry’
11 revised	Revised to improve clarity/readability	‘Doesn’t go out in winter anymore’ was revised to ‘going out in cold weather’; ‘getting up and down the stairs’ to ‘managing getting up and down the stairs’

HRQoL = health-related quality of life

Content validation

Fifty-eight participants (48 owners of 14 sick cats and 34 healthy cats) and 10 clinicians completed surveys assessing the clarity and relevance of the remaining 51 items. As a result of not having met I-CVI criteria, 13 of these items were eliminated and 11 items were revised, with one of those being split into two separate items (Figure 1).

Prototype construction and pre-testing

The prototype for field test 1 consisted of 39 items, 27 single words with the standard response option (0 = ‘not at all’; 6 = ‘couldn’t be more’) and 12 items where response options were reworded to suit the form of the item (eg, ‘hiding away’ – not hiding away at all/ couldn’t be hiding away more).

Pre-testing of the online instrument was conducted with 15 owners of five healthy cats and 10 sick cats. Following this, response options for two items – ‘Jumping or climbing up/down’ and ‘Usual sleeping patterns and/or places’ – were revised to improve readability and comprehension.

Field test 1 for item reduction, factorial validation and determination of scoring algorithm

Using the online prototype instrument for field test 1, 71 owner and clinician assessments from UGSAH, five general practices and one feline specialist practice were completed over a period of 5 months for 30 healthy cats and 41 cats diagnosed with a chronic condition expected to affect QoL (Table 1). Ninety-five percent of cats presented with 1–6 comorbidities (Table 5). Review of dot plots of these item responses suggested that 19 items were unlikely, in the opinion of the research team, to discriminate between sick and healthy cats (Figure 2); these were removed leaving 20 items to be included in the instrument for field test 2 (Table 4).

Figure 2

Examples of dot plots for items that were (a) excluded and (b) retained on the basis of their discriminatory potential as assessed by the research team. The x-axis represents the response values selected by owners of healthy (red square) and unhealthy (blue circle) cats

Table 4

Twenty items that make up the feline health-related quality-of-life scale and their response options

1 Please tell us how well this word describes (cat name) as he is today: Active Couldn’t be more active – Not at all active: 6–0 2 Please tell us how well this word describes (cat name) as he is today: Unsteady Couldn’t be more unsteady – Not at all unsteady: 6–0 3 Please tell us how well this word describes (cat name) as he is today: Energetic Couldn’t be more energetic – Not at all energetic: 6–0 4 Please tell us how well this word describes (cat name) as he is today: Comfortable Couldn’t be more comfortable – Not at all comfortable: 6–0 5 Please tell us how well this word describes (cat name) as he is today: Lethargic Couldn’t be more lethargic – Not at all lethargic: 6–0 6 Please tell us how well this word describes (cat name) as he is today: Showing hunting behaviour Couldn’t be showing hunting behaviour more – Not at all showing hunting behaviour: 6–0 7 Please tell us how well this word describes (cat name) as he is today: Lively Couldn’t be more lively – Not at all lively: 6–0 8 Please tell us how well this word describes (cat name) as he is today: Alert Couldn’t be more alert – Not at all alert: 6–0 9 Please tell us how well this word describes (cat name) as he is today: Sore Couldn’t be more sore – Not at all sore: 6–010 Please tell us how well this word describes (cat name) as he is today: Content Couldn’t be more content – Not at all content: 6–011 Please tell us how well this word describes (cat name) as he is today: Playful Couldn’t be more playful – Not at all playful: 6–012 Please tell us how well this word describes (cat name) as he is today: Uncomfortable Couldn’t be more uncomfortable – Not at all uncomfortable: 6–013 Please tell us how well this word describes (cat name) as he is today: Enjoying the things he usually does Couldn’t be enjoying the things he usually does more – Not at all enjoying the things he usually does: 6–014 Please tell us how well this word describes (cat name) as he is today: Jumping or climbing up/down as usual Jumping or climbing up/down as usual – Not jumping or climbing up/down as usual: 6–015 Please tell us how well this word describes (cat name) as he is today: Exploring Couldn’t be exploring more – Not at all exploring: 6–016 Please tell us how well this word describes (cat name) as he is today: Feeling himself Couldn’t be feeling himself more – Not at all feeling himself: 6–017 Please tell us how well this word describes (cat name) as he is today: Stiff Couldn’t be more stiff – Not at all stiff: 6–018 Please tell us how well this word describes (cat name) as he is today: Happy Couldn’t be more happy – Not at all happy: 6–019 Please tell us how well this word describes (cat name) as he is today: Inquisitive Couldn’t be more inquisitive – Not at all inquisitive: 6–020 Please tell us how well this word describes (cat name) as he is today: Slow Couldn’t be more slow – Not at all slow: 6–0

A FA was conducted using the responses to these 20 items, all of which had loadings >0.4. A model containing three factors was considered to be optimal, accounting for 72.3% of the variance in the owner response data and consisting of factors that could be interpreted as HRQoL domains which were named by the research team as ‘vitality’ (11 items), ‘comfort’ (eight items) and ‘emotional wellbeing’ (EWB) (seven items). Some items loaded onto more than one factor. An algorithm, based on the item–factor associations for the three-factor model, was derived in order to generate three domain scores. However, for commercial reasons a description of the factor composition and the algorithm are not presented.

Field test 2 for construct validity and reliability testing

Using the resulting online instrument for field test 2, the owners of 36 healthy cats and 58 sick cats as determined by clinician assessment, representing a comprehensive range of breeds (Table 1), completed one assessment and, of these 94 owners, 48 owners completed two assessments. According to their responses to a direct question, a total of 26% of owners of sick cats believed their cat to be perfectly healthy, despite a clinician diagnosis of ill health. Sick cats presented with a range of conditions and 72% had 1–6 additional comorbidities (Table 5).

Table 5

Conditions reported by clinicians for each assessment completed for 41 cats from field test 1 and the 58 cats from field test 2 that in the clinician’s opinion were not perfectly healthy

Presenting conditions	Field test 1	Field test 2
Degenerative joint disease	23	33
Obesity	8	11
Painful cancer	0	1
Non-painful cancer	3	4
Chronic skin disease	7	1
Chronic medical condition	28	21
Cardiac disease	11	4
Neurological disease	2	1
Chronic ear disease	3	2
Chronic dental disease	21	9
Chronic kidney disease	20	16
Hyperthyroidism	6	9
Chronic lower urinary tract disease	2	6
Cat flu	2	3
Chronic gastrointestinal disease	10	5
Previous physical trauma	4	3
Underweight	12	21
Other	7*	2*

Comorbid conditions were reported in 95% and 72% of cases in field tests 1 and 2, respectively

Other conditions included: field test 1 – cognitive decline (n = 3), cancer in remission (n = 2), diabetes (n = 2), proliferative gum disease (asymptomatic), liver disease and otitis externa right ear and scratches to pinna; field test 2 – hypertension (n = 2)

Construct validity

Differences between healthy and sick cats existed for all three domains (Table 6), supporting hypothesis 1 (null hypothesis: no difference in median score between healthy and sick cats, rejected at P <0.01), with greater variability in the sick group than in the healthy group (Figure 3). Linear discriminant analysis (using cross-validation) showed that the instrument correctly classified as either healthy or sick 78% (healthy 71%; sick 89% classified correctly) of the cats assessed. An increase in the number of comorbidities was associated with a deterioration in HRQoL profile (Figure 4). The Pearson correlation coefficients for vitality, comfort and EWB, and comorbidities (healthy, 1–2 and >3) were −0.64, −0.63 and −0.50, respectively.

Table 6

Descriptive statistics and Mann–Whitney test results comparing the scores of healthy and sick cats for each of the three domains (vitality, comfort and emotional wellbeing [EWB])

Domain	Number of cats	Mean ± SD	IQR	Median	Mann–Whitneydifference in median(healthy – unwell)	P value	95% CI
Vitality
Healthy	36	3.30 ± 0.53	0.84	3.37	1.04	<0.001	0.71–1.32
Sick	58	2.32 ± 0.75	1.08	2.23
Comfort
Healthy	36	5.90 ± 0.20	0.23	6.00	0.63	<0.001	0.40–0.98
Sick	58	5.06 ± 0.73	1.06	5.25
EWB
Healthy	36	3.73 ± 0.32	0.45	3.87	0.56	<0.001	0.33–0.77
Sick	58	3.15 ± 0.63	0.83

IQR = interquartile range; CI = confidence interval

Figure 3

Plots of scores for three domains of health-related quality of life (HRQoL; vitality, comfort and emotional wellbeing [EWB]) generated by the owners of 36 healthy control cats and 58 sick cats using the 20-item web-based generic HRQoL instrument. Each blue box represents the scores obtained for between 25% (bottom line) and 75% (top line) of the group, with the line in the middle representing the median score

Figure 4

Fitted line plots of linear regressions performed for three domains of the health-related quality-of-life (HRQoL) instrument. Statistics presented include the residual standard deviation (S) and r² values for the three domains. The number of presenting conditions is on the x-axis, whereas the HRQoL scores are on the y-axis

Reliability

Forty eight owners completed a second assessment for their cats with a minimum of 14 days between assessments and the ICCs (95% confidence intervals [CIs]) for vitality, comfort and EWB were 0.635 (0.044–0.862), 0.716 (0.256–0.893) and 0.853 (0.615–0.945), respectively.

Discussion

One hundred and sixty-five potential items were collected from the owners of sick and healthy cats using best practice for qualitative research.² As information obtained from key informants underpins content validity, comprehensive representation of all relevant populations is necessary. Freeman et al described a generic HRQoL scale for cats,¹¹ where key informants were restricted to owners/caregivers of healthy cats. Similarly Tatlock et al used pet owners of healthy cats as key informants.¹³ However, 29% and 27% of owners (field tests 1 and 2, respectively) in this study thought their cats were healthy when clinicians deemed them to be sick, reinforcing that such judgement may be unreliable.²⁷ Bijsmans et al used owners as informants for CatQol, but no details are available regarding the health status of these cats.¹² In contrast, the health status of the 19 healthy cats and 10 sick cats belonging to owners recruited as key informants in this study was verified by a veterinary surgeon. Although this number of cats could be considered to be low, interviews were conducted until no new information emerged. In addition, 48 different owners involved in the content validation process were invited to suggest additional items if they felt the collection of items was inadequate. Initial reduction of the 165 items was based on criteria devised by the investigators, an approach considered to be appropriate in human medicine.^28–30

In veterinary science, many rely on cognitive debriefing interviews to establish the content validity of an instrument scale,^11,31 or simply ask owners to judge whether the instrument appears to be capable of measuring what it is intended to measure (face validity).³² However, in this study a group of vets, as well as a large group of owners, were involved in the validation process, adding to the robustness of this stage in the process.

In human medicine and the social sciences, the quantification of content validity has been introduced. One approach, used here, asks relevant ‘experts’ to rate the relevance and clarity of items using a rating scale, and those ratings are used to calculate a CVI for each individual item on the scale (I-CVI), providing objective information to guide researchers in revising, deleting, or substituting items.³³ The instrument described here is the first instrument in veterinary science to quantify and establish content validity of each item using this technique.

Following field test 1, 19 items were excluded based on research team judgement that they did not discriminate healthy from sick cats. Although it was considered unlikely that an item that was unable to discriminate healthy from sick cats would prove useful in an evaluative context, the possibility cannot be discounted and items removed at this stage may be reassessed for inclusion if the instrument proves not to be responsive to clinical change in further longitudinal studies.

The remaining 20 items all loaded >0.4 in the FA. Factor loadings of 0.3, 0.5 and 0.7 are generally considered to be low, medium and high, respectively,³⁴ with loadings of 0.3 deemed to be the minimum consideration level for FA. Increasing the loading threshold to 0.4 may have accounted for the fact that no further items were removed at this stage. Furthermore, the fact that the majority of loadings were >0.6 indicates stability of the factor model.³⁵ Although FA provides any number of factor models for a given data set, there are established methods of identifying how many factors could sensibly be extracted, including the scree test and the Kaiser criterion,³⁶ both considered in this study. The three-factor model adopted here, accounting for 72% of the variability in the owner response data, compares favourably with the canine HRQoL (64%)¹⁵ and 72% for the shortened instrument,²⁰ an 11-factor questionnaire designed to measure the behaviour and temperament of pet dogs (57%)³⁷ and a four-factor QoL questionnaire regarding infants (45%),³⁸ and confirms factorial validity.

Scores in all three domains of HRQoL were significantly different between healthy and sick cats. The fact that domain scores in the sick cats showed more variation than the healthy cats is not surprising given the heterogeneity of disease in the sick cat population. Interestingly, the variability in healthy cat vitality was similar to that of the sick cats, but this probably reflects the tendency for the individual variation in vitality that tends to exist in this species. Furthermore, cats with greater comorbidity had lower HRQoL scores, indicating a poorer QoL, with moderate Pearson correlation coefficients for all three domains,³⁹ thus upholding known-groups hypothesis, providing additional evidence for construct validity.

Evidence for known-group validity relating to health status of other generic HRQoL instruments in cats is sparse. Freeman et al investigated the validity of their scale in a large group of 1303 cats,¹¹ but only eight of these were categorised as being ‘not very healthy’ or ‘not at all healthy’ by their owners – in any case a judgement that we have shown in this study to be unreliable. Bijsmans et al demonstrated that their instrument detected difference between healthy cats and those with CKD,¹² but this evidence is of limited value in relation to the proposed generic nature of their instrument.

Discriminant analysis indicated an overall misclassification rate of 22% vs that reported for dogs with chronic pain (12% misclassification)¹⁹ and for a proxy instrument for pain measurement in communicatively impaired children (13%).⁴⁰ Misclassifications in the study reported here may have been a result of measurement error, or may have occurred because the QoL of some healthy cats was compromised at the time for reasons other than poor health, or because some sick cats may have been experiencing a good QoL at the time.

Criterion validity was not carried out because no gold standard instrument for the measurement of HRQoL exists, but the authors do not discount the possibility of being able to demonstrate concurrent or predictive criterion validity in future studies when suitable measures become available.

Test–retest reliability was carried out on data for healthy cats only, for which health status would be less likely to change over a 2 week period than would that of sick cats. A 2 week period between the completion of questionnaires is commonly chosen for this purpose, being a short enough period for change in health status to be unlikely but being a long enough period for respondents to be unlikely to remember their previous responses. The ICC values for the comfort and EWB domains were >0.7 and >0.8, respectively, indicating that test–retest reliability for those domains was good, and it was moderate for vitality (ICC >0.6).⁴¹

Conclusions

The measurement of feline HRQoL is becoming more necessary as chronic diseases such as CKD, hyperthyroidism, cognitive decline and osteoarthritis affect the QoL of an increasing number of ageing cats, and evidence-based medicine requires that robust measures of clinical impact be developed. This study has provided initial evidence for the reliability and validity of a novel generic instrument that measures the affective component of the chronic disease experience. However, it is important to emphasise that validity is not determined by a single statistic, but by a body of research that supports the claim that the instrument is valid for particular purposes, with defined populations and in specified contexts.² Accordingly, future research will seek to provide such evidence, as well as evidence for the instrument’s responsiveness to clinical change including that following treatment. The instrument is available for clinical use and for clinical trials from NewMetrica (www.newmetrica.com). For further information please contact the corresponding author.

Supplemental Material

Appendix S1

Veterinary assessment

Footnotes

Acknowledgements

The authors wish to thank all the cat owners, as well as the veterinary surgeons and nurses in practice and at the University of Glasgow Small Animal Hospital, who willingly participated in our studies

Accepted: 16 January 2018

Supplementary material

Appendix S1: Veterinary assessment.

Conflict of interest

Professor Reid is a shareholder of NewMetrica, which is the developer and supplier of the instrument.

Funding

We acknowledge Scottish Enterprise for the award of a SMART grant, which enabled us to undertake the study, and Boehringer Ingelheim for additional financial support.

References

Fayers

Machin

. Quality of life: the assessment, analysis and interpretation of patient-reported outcomes. John Wiley & Sons, 2007.

Streiner

Norman

Cairney

. Health measurement scales: a practical guide to their development and use. New York: Oxford University Press, 2015.

Abell

Springer

Kamata

Developing and validating rapid assessment instruments. Oxford: Oxford University Press, 2009.

Brod

Tesler

Christensen

TL.

Qualitative research and content validity: developing best practices based on science and experience. Qual Life Res 2009; 18: 1263–1278.

Freeman

Rush

Oyama

et al . Development and evaluation of a questionnaire for assessment of health-related quality of life in cats with cardiac disease. J Am Vet Med Assoc 2012; 240: 1188–1193.

Zamprogno

Hansen

Bondell

et al . Item generation and design testing of a questionnaire to assess degenerative joint disease-associated pain in cats. Am J Vet Res 2010; 71: 1417–1424.

Niessen

SJM

Powney

Guitian

et al . Evaluation of a quality-of-life tool for cats with diabetes mellitus. J Vet Intern Med 2010; 24: 1098–1105.

Noli

Minafò

Galzerano

Quality of life of dogs with skin diseases and their owners. Part 1: development and validation of a questionnaire. Vet Dermatol 2011; 22: 335–343.

Noli

Borio

Varina

et al . Development and validation of a questionnaire to evaluate the quality of life of cats with skin disease and their owners, and its use in 185 cats with skin disease. Vet Dermatol 2016; 27: 247–e58.

10.

Giuffrida

Kerrigan

SM.

Quality of life measurement in prospective studies of cancer treatments in dogs and cats. J Vet Intern Med 2014; 28: 1824–1829.

11.

Freeman

Rodenberg

Narayanan

et al . Development and initial validation of the Cat HEalth and Wellbeing (CHEW) Questionnaire: a generic health-related quality of life instrument for cats. J Feline Med Surg 2016; 18: 689–701.

12.

Bijsmans

Jepson

Syme

et al . Psychometric validation of a general health quality of life tool for cats used to compare healthy cats and cats with chronic kidney disease. J Vet Intern Med 2016; 30: 183–191.

13.

Tatlock

Gober

Williamson

et al . Development and preliminary psychometric evaluation of an owner-completed measure of feline quality of life. Vet J 2017; 228: 22–32.

14.

Osse

BHP

Vernooij-Dassen

MJFJ

Schadé

et al . A practical instrument to explore patients’ needs in palliative care: the Problems and Needs in Palliative Care questionnaire – short version. Palliat Med 2007; 21: 391–399.

15.

Reid

Wiseman-Orr

Scott

et al . Development, validation and reliability of a web-based questionnaire to measure health-related quality of life in dogs. J Small Anim Pract 2013; 54: 227–233.

16.

Las Hayas

Quintana

Padierna

et al . Use of rasch methodology to develop a short version of the Health Related Quality of life for Eating Disorders questionnaire: a prospective study. Health Qual Life Outcomes 2010; 8: 29.

17.

Wiseman-Orr

Scott

Reid

et al . Validation of a structured questionnaire as an instrument to measure chronic pain in dogs on the basis of effects on health-related quality of life. Am J Vet Res 2006; 67: 1826–1836.

18.

Johnston

File

SE.

Sex differences in animal tests of anxiety. Physiol Behav 1991; 49: 245–250.

19.

Wiseman-Orr

Nolan

Reid

et al . Development of a questionnaire to measure the effects of chronic pain on health-related quality of life in dogs. Am J Vet Res 2004; 65: 1077–1084.

20.

Reid

Wiseman-Orr

Scott

EM.

Shortening of an existing online health-related quality of life (HRQL) instrument for dogs. J Small Anim Pract 2018; 59: 334–342.

21.

Oppenheim

AN.

Questionnaire design, interviewing and attitude measurement. 2nd ed. London: Bloomsbury Publishing, 2000.

22.

Foddy

WH.

Constructing questions for interviews and questionnaires: theory and practice in social research. Cambridge: Cambridge University Press, 1993.

23.

Glaser

Grounded theory methodology. Introd Qual Res Psychol 2013; 3: 69–82.

24.

Lynn

. Determination and quantification of content validity. Nurs Res; 35: 382–385.

25.

van Schelven

Dikken

Sillekens

LGM

et al . Content validation of the Dutch version of the ‘Older Patients in Acute Care Survey’, an instrument to measure the attitude of hospital nurses towards older patients. Int J Clin Med 2015; 6: 7–18.

26.

Shrout

Fleiss

JL.

Intraclass correlations: uses in assessing rater reliability. Psychol Bull 1979; 86: 420–428.

27.

Spofford

Lefebvre

McCune

et al . Should the veterinary profession invest in developing methods to assess quality of life in healthy dogs and cats? J Am Vet Med Assoc 2013; 243: 952–956.

28.

Melzack

The McGill Pain Questionnaire: major properties and scoring methods. Pain 1975; 1: 277–299.

29.

Juniper

Guyatt

Jaeschke

How to develop and validate a new health-related quality of life instrument. In: Spilker

(ed). Quality of life and pharmacoeconomics in clinical trials. 2nd ed. Philadelphia, PA: Lippincott-Raven, 1996, pp 49–56.

30.

Armstrong

Toledano

Miloslavich

et al . The Miami pediatric quality of life questionnaire: parent scale. Int J Cancer Suppl 1999; 12: 11–17.

31.

Favrot

Linek

Mueller

et al . Development of a questionnaire to assess the impact of atopic dermatitis on health-related quality of life of affected dogs and their owners. Vet Dermatol 2010; 21: 64–70.

32.

Walton

Cowderoy

Lascelles

et al . Evaluation of construct and criterion validity for the ‘Liverpool Osteoarthritis in Dogs’ (LOAD) clinical metrology instrument and comparison to two other instruments. PLoS One 2013; 8: e58125.

33.

Polit

Beck

CT.

The content validity index: are you sure you know what’s being reported? Critique and recommendations. Res Nurs Health 2006; 29: 489–497.

34.

Shevlin

Miles

JNV

. Effects of sample size, model specification and factor loadings on the GFI in confirmatory factor analysis. Pers Individ Dif 1998; 25: 85–90.

35.

MacCallum

Widaman

Zhang

et al . Sample size in factor analysis. Psychol Methods 1999; 4: 84–99.

36.

Coste

Bouée

Ecosse

et al . Methodological issues in determining the dimensionality of composite health measures using principal component analysis: case illustration and suggestions for practice. Qual Life Res 2005; 14: 641–654.

37.

Hsu

Serpell

JA.

Development and validation of a questionnaire for measuring behavior and temperament traits in pet dogs. J Am Vet Med Assoc 2003; 223: 1293–300.

38.

Manificat

A new instrument to evaluate infant quality of life. Qual Life Newsl MAPI Res Inst 1999; 23: 7–8.

39.

Stallard

Williams

Velleman

et al . The development and evaluation of the pain indicator for communicatively impaired children (PICIC). Pain 2002; 98: 145–149.

40.

Mukaka

MM.

A guide to appropriate use of correlation coefficient in medical research. Malawi Med J 2012; 24: 69–71.

41.

Rosner

. Fundamentals of biostatistics. 8th ed. Boston: Brooks Cole, 2016.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.17 MB