Psychometric Properties of the Patient Activation Measure in Community-Dwelling Adults in Singapore

Abstract

Introduction

Measuring health activation in general population using valid instruments is needed to facilitate the evaluation of health education and behavioral programs in community. The 13-item Patient Activation Measure was well validated in patients with different chronic diseases but rarely validated in general population. The objective of this study was to assess the psychometric properties of the Patient Activation Measure among community-dwelling adults in Singapore.

Methods

Data of participants having valid responses to the English-version measure (N = 824) were analyzed. The psychometric properties were assessed by demonstrating evidence for uni-dimensionality using Rasch Principal Component Analysis of Residuals, known-group validity, convergent and divergent validity, and internal consistency reliability using Cronbach’s alpha.

Results

The uni-dimensionality of the Patient Activation Measure was supported by the Rasch Principal Component Analysis of Residuals results. Participants having multimorbidity or polypharmacy and being inactive in physical activity had significantly lower activation scores. The activation score was positively and moderately correlated with health confidence measured by the Health Confidence Measure (r = .38, P < .001), and negatively and weakly correlated with depressive symptoms measured by the Patient Health Questionnaire (r = − .13, P < .001). The internal reliability was good with a Cronbach’s alpha of .82.

Conclusion

The 13-item Patient Activation Measure has acceptable construct validity and good internal consistency among community-dwelling adults. It is a potential instrument to measure health activation in this population. Further research is required to investigate the expansion of response options, validate the cut-off scores for the activation levels and examine the test-retest reliability and responsiveness.

Keywords

patient activation measure activation validation psychometric properties community-dwelling adults

Highlights

What do we already know about this topic?

• In recent years, healthcare consumers are increasingly recognized as critical partners in health services planning, delivery, and evaluation; and are increasingly required to play a more active role in their own care.

• The 13-item Patient Activation Measure is the most widely used measure for activation which has been validated in many countries among various disease population.

How does your research contribute to the field?

• The findings of our research provide additional evidence for the validity and reliability of Patient Activation Measure from Asian perspective.

• Different response patterns were observed in community-dwelling adult population compared to patients with specific or mixed chronic conditions as reported in other Patient Activation Measure validation studies.

• Patient Activation Measure had good structural and construct validity and acceptable internal consistency reliability in community-dwelling adult population.

What are your research’s implications towards theory, practice, or policy?

• Patient Activation Measure can be used among community-dwelling adult population. However, further research is required to refine the instruction for administration, investigate the expansion of response options, validate the cut-off scores for the activation levels, and examine the test-retest reliability and responsiveness in community-dwelling adult population.

Introduction

With the ageing population and changes in societal behaviors, chronic diseases are on a steady rise and are the leading contributors to death and disability globally.¹ The prevalence of chronic diseases greatly exceeds the capacity of health system to care for patients. To combat this growing health crisis in a sustainable way, healthcare systems around the world have recognized the importance of activating and empowering individuals to take active roles in their own health and health care, and multi-level strategies have been taken to prevent and improve outcomes of chronic diseases.^2,3

Consumer or patient activation refers to how well an individual understands one’s own role in their health management and how competent they are to fulfill that role.^4,5 Numerous studies examining the relationship between an individual’s level of activation and engagement in health management have consistently reported that highly activated individuals were more likely to engage in a variety of healthy behaviors such as regular physical exercise, healthy diet, maintaining recommended weight, and managing stress^6-8; monitor health conditions; and adhere to treatment.

Several instruments have been developed to measure patient activation or engagement. The 13-item Patient Activation Measure (PAM-13), which was developed to assess an individual’s self-reported knowledge, skills and confidence to manage one’s own health and healthcare needs,^9,10 has become the most widely used generic measure for patient activation irrespective of the underlying health condition. The PAM-13 had shown good psychometric properties in the English-speaking general¹⁰ or specific disease-specific population.^11,12 It has been translated into many languages and validated in disease-specific outpatient and inpatient care settings^11,13-18 with mixed but generally good psychometric properties. However, its applicability to community-dwelling adults with diverse health status is not well established.

In Singapore, multiple health education and behavioral programs are taking place in community to improve community-dwelling adults’ health behaviors and capability of health management. Measuring activation of this population not only allows for designing tailored programs for individuals at different activation levels, but also for monitoring their changes in activation as one component of short-term outcomes of the programs. As such, validating the PAM-13 in this population becomes necessary. As a multiracial and multicultural country with prevalent multi-language literacy (74.3% in 2020), English is the most predominant language in Singapore.¹⁹ The objective of this study was to assess the psychometric properties of the English version PAM-13 among community-dwelling adults in Singapore.

Methods

Study Design and Participants

Data of this cross-sectional study was collected face-to-face by trained interviewers during a follow-up survey of the Population Health Index (PHI) study. Details of the PHI study has been previously published.^20,21 Briefly, PHI participants were aged 21 years old and above and residing in the randomly selected households at Central region of Singapore for at least 6 months in the past year when recruited. A total of 824 participants who 1) declared without any pre-existing intellectual disability, cognitive impairment, or psychiatric diseases; 2) completed the English version of the PAM-13 independently during the second follow-up survey; and 3) had valid responses to the PAM-13 (“Not Applicable” response to less than three items, and at least one non “Agree” response to any of the 13 items) were included in the analysis.

To ensure the sufficiency of sample size for the study, we calculated the required sample size based on the main analyses being conducted. Firstly, when using Rasch analysis to examine item characteristics of rating scale tools, a minimum of 250 subjects were recommended for polytomous items to ensure the stability and robustness of Rasch analysis.²² Secondly, a power analysis for simple correlation was conducted. To detect a correlation of −.16 between PAM-13 and the Patient Health Questionnaire (PHQ-9)¹¹ using a two-sided test at 5% significance level (α = .05) with power of 80% (β = .2), the required sample size would be 304. As such, the sample size of 824 participants was sufficient.

The PHI study was approved by the ethics review committee of the National Healthcare Group Domain Specific Review Board (Reference Number: 2015/00269). Written informed consent was obtained prior to the study initiation from all individual participants after they were fully informed of the study objectives and procedures.

Measures

Participant characteristics

Self-reported demographics used for this analysis included age, gender (male or female), ethnicity (Chinese, Malay, Indian, or others), highest education level, employment status, and self-perceived money sufficiency for essential daily living. A participant was considered to have multimorbidity if he/she had two or more self-reported chronic diseases. Participants were also asked to report the number of prescribed medications taken regularly for chronic diseases. The use of three or more medications daily was considered as polypharmacy.

Patient Activation Measure −13

The PAM-13 is a unidimensional, Guttmann-like measure that has 13 items measuring self-assessed knowledge, skills and confidence for self-management of health.¹⁰ The items are ordered by level of difficulty of activation. Each item has four response options: 1=strongly disagree, 2=disagree, 3=agree, 4=strongly agree, and additional “Not Applicable (NA)” option. Raw total PAM scores (derived by [raw score]/[number of items answered excepting non-applicable items] * 13) can be transformed to a scale (PAM score) with a theoretical range of 0-100 based on calibration tables, with higher scores indicating higher activation. The scaled score can be converted into four activation levels using the pre-defined cut-offs.¹⁶ Individuals at Level 1 tended to be more passive recipients of their own healthcare; Level 2 had the confidence and knowledge to take actions; Level 3 were taking actions to maintain or improve health; and Level 4 were maintaining guideline health behaviors under stress. In this study, each item score was entered into the PAM-13 scoring spreadsheet 2017 provided by the developer to derive the PAM score and activation level for each participant.

Patient Health Questionnaire

The PHQ-9 is a short, well-validated screening tool for symptoms of depression using 9 items. It captures self-reported response to each item based on past two weeks condition using a 4-point scale (0 = not at all, 1 = several days, 2 = more than half the days, 3 = nearly every day). The total score is calculated by summing up each item score and ranges from 0 to 27 with higher scores indicating more severe depressive symptoms. PHQ-9 had been validated among primary care patients in Singapore with good internal consistency reliability (Cronbach’s alpha .87).²³

Physical activity participation and self-perceived health status

A single item phased “How often do you take part in regular fitness program?” was used to measure the physical activity participation with five response options: “very often”, “often”, “once in a while”, “almost never”, and “never”. Participants were categorized into two groups based on their response to the question: physically active group if they reported “very often” or “often”, and physically inactive group if they reported “once in a while”, “almost never” or “never”. Self-perceived health status was determined based on the response to a single question phased “In comparison with other people of the same age, how do you consider your health status?” with four response options: “not as good”, “do not know”, “as good”, and “better”. The first two options were grouped together as “not as good/do not know” due to the small number of participants.

Health Confidence Measure

The Health Confidence Measure is considered as a simple and effective proxy for health engagement.²⁴ It measures a person’s health engagement using a single question: “How confident are you that you can control and manage most of your health problems?”. Individuals were asked to rate their health confidence on a scale from 0 (not very confident) to 10 (very confident). A cut-off score of 7 was used to categorized individuals into two groups: high health confidence (scored 7-10), and low health confidence (scored 0-6).

Statistical Analysis

Descriptive statistics were employed to summarize participants’ characteristics with continuous variables being described by the mean and standard deviation (SD), and categorical variables being described using the frequency and percentage.

Item-level descriptive statistics including responses distribution (including “NA” responses) and floor and ceiling effects of individual PAM-13 items were reported. Floor and ceiling effects were considered present if more than 15% of the participants achieved the lowest (disagree strongly) or highest (agree strongly) possible score.²⁵

The construct validity of the PAM-13 was examined from three aspects: dimensionality and Rasch item fit, known-group validity, and convergent and divergent validity. The internal consistency reliability was examined in this study.

Uni-Dimensionality and Rasch Item Fit

Rating Scale Model (RSM)²⁶ was used to estimate the primary measurement dimension for polytomous data from the Likert response format since the PAM-13 is anticipated to be unidimensional. Rating scale model requires each item to have the same number of response options and have one set of threshold values for all the items in the scale. As “disagree strongly” option was not selected for 6 out of 13 items and rarely selected for the rest of the items, it was combined with “disagree” option for the estimation of the RSM. As such, the initial five responses for each PAM-13 item were recoded as follows: 0=disagree strongly or disagree, 1=agree, 2=agree strongly, and NA=not applicable.

Analysis of uni-dimensionality of the PAM-13 was a two-step process. Firstly, the measurement dimension of the scale was estimated using the RSM. Secondly, a principal component analysis of residuals (PCAR) was used to determine whether substantial subdimensions existed within the items after fitting the RSM.²⁷ This analysis involves computing the standardized residuals, that is, (observed responses- expected values)/(model standard error). If the items measured a single latent dimension as estimated by the Rasch model, the remaining residual variance should reflect random variation. In operation, uni-dimensionality was evaluated based on four criteria: 1) at least 50% of the total variance should be explained by the measurement dimension²⁸; 2) the variance explained by the first principal component of the residuals be no more than 15%²⁹; 3) A minimum ratio of 3:1 for the variance in the measurement dimension compared to the variance of the first principal component of residuals³⁰; and 4) the Eigenvalue of the first contrast³¹ should be less than 2.0.

Rasch item fit statistics were used to examine how accurately the data fit the Rasch measurement model, i.e. how well item difficulty or an individual’s ability contributes to the underlying construct of the test.³² Infit statistics including Infit mean-square (MNSQ) and Infit value of t-test (ZSTD) and Outfit statistics including Outfit MNSQ and Outfit ZSTD were reported. The reasonable Infit and Outfit MNSQ range was .6 - 1.4 for a rating scale³³ and the reasonable Infit and Outfit ZSTD range³⁴ was between −2 and 2. Items not fitting the Rasch model implied a lack of uni-dimensionality which could be either due to misunderstanding of the item or that it was measuring another construct.

Known-Group Validity

Known-group validity was conducted to evaluate the ability of the PAM-13 to discriminate among known distinct groups. The PAM scores and the activation levels were compared between subgroups of multimorbidity, polypharmacy, and physical activity using independent samples t-tests and Chi-squared tests, respectively. The difference in PAM scores and activation levels across self-perceived health status groups was evaluated using the one-way ANOVA test and Chi-squared test, respectively. We hypothesized that individuals without multimorbidity or polypharmacy, being more active in physical activity, and having better self-perceived health would have higher PAM scores and higher activation levels compared to their respective counterparts.

Convergent and Divergent Validity

The convergent validity of the PAM-13 was measured by assessing the association between PAM score and health confidence score measured by Health Confidence Measure using Pearson’s correlation coefficient (r) and association between activation level and health confidence level using Chi-squared test. We conjectured that there was at least moderate (r > .30) and positive relationship between PAM score and health confidence score and participants with higher activation levels were more likely to have higher health confidence. The divergent validity of the PAM-13 was measured by assessing the association between PAM score and PHQ-9 depressive symptom score using Pearson’s correlation coefficient. We hypothesized that there was little or weak correlation (|r| < .30) between these two. We expected that the correlation coefficient of PAM score and health confidence score was higher than that of PAM score and depressive symptom score.

Reliability

Internal consistency reliability was measured using Cronbach’s alpha, average inter-item correlation and item-to-rest correlation. An alpha of .70 or higher was considered as the acceptable value.³⁵ Average inter-item correlation was derived by taking the average of all correlation coefficients for each pair of items on a test that measured the same construct. The ideal range of average inter-item correlation³⁶ was .15 to .50. Item-rest correlation was to evaluate the correlation between an item and the scale formed by all other items. A correlation of .3 or higher indicated a moderate correlation while a correlation of .5 or higher indicated a strong correlation.³⁷

The RSM and PCAR were conducted using the statistical program R with the extended Rasch model package (eRM) and Rasch Model Parameters by Pairwise Algorithm package (pairwise). All the other analyses were performed using Stata 16.0 for Windows. The result was considered significant if a P value was <.05 (one-tailed tests for independent t test and one-way ANOVA).

Results

Participant Characteristics

The characteristics of all participants are described in Table 1. The mean age of participants was 47.7 years (SD: 14.1 years), ranging from 21 to 75 years, 432 were females (52.4%) and 70.0% were Chinese. About 9.0% had primary school education or lower and 74.6% were employed. About 29% had multimorbidity and 12.3% had polypharmacy issue.

Table 1.

Participant characteristics and PAM scores (N = 824).

Variable	Total	PAM Score	P-value
Variable	n (%)	Mean ± SD	P-value
Age (Mean ± SD)	47.7±14.1		.004
21- 64 years	702 (85.2)	66.7 ± 12.2
65 years and above	122 (14.8)	63.3 ± 10.9
Gender			.081
Male	392 (47.6)	67.0 ± 12.6
Female	432 (52.4)	65.5 ± 11.5
Ethnicity			.136
Chinese	577 (70.0)	65.9 ± 11.8
Malay	75 (9.1)	64.7 ± 10.3
Indian	143 (17.4)	68.2 ± 13.7
Others	29 (3.5)	66.3 ± 13.2
Highest education level			<.001
No formal education	18 (2.2)	59.1 ± 6.6
Primary	57 (6.9)	62.5 ± 9.5
Secondary	254 (30.8)	64.5 ± 11.0
Post-secondary and above	495 (60.1)	67.8 ± 12.7
Employment status			.015
Employed	615 (74.6)	66.8 ± 12.2
Unemployed	123 (14.9)	65.9 ± 11.9
Inactive	82 (10.0)	62.7 ± 10.7
Unfit for work	4 (.5)	57.6 ± 5.4
Self-reported money sufficiency			<.001
Sufficient	720 (87.4)	66.7 ± 12.2
Insufficient	104 (12.6)	62.6 ± 10.8
Multimorbidity			.024
No	584 (70.9)	66.7 ± 12.3
Yes	240 (29.1)	64.9 ± 11.4
Polypharmacy			.006
No	723 (87.7)	66.6 ± 12.0
Yes	101 (12.3)	63.4 ± 12.4

Item-Level Descriptive Statistics

The responses distribution of the PAM-13 (without any missing values) for all 824 participants is presented in Table 2. The proportion of “NA” responses ranged from 0 (PAM_1) to 22.6% (PAM_4). Responses across the other four response options were unevenly distributed for all 13 items: The “agree” and “agree strongly” options were the most frequently selected responses while the “disagree strongly” was the least selected responses for all items (range: 0 – .4%), indicating no floor effects (Table 3). Ceiling effects assessed based on the proportion of “agree strongly” responses in all 13 items ranged from 16.1% (PAM_12) to 64.8% (PAM_1).

Table 2.

Distribution of responses for the items in the PAM-13, n (%).

Item	Disagree Strongly	Disagree	Agree	Agree Strongly	Not Applicable
PAM_1	0 (0)	1 (.1)	289 (35.1)	534 (64.8)	0 (0)
PAM_2	0 (0)	9 (1.1)	363 (44.1)	451 (54.7)	1 (.1)
PAM_3	1 (.1)	25 (3)	531 (64.4)	266 (32.3)	1 (.1)
PAM_4	0 (0)	15 (1.8)	308 (37.4)	315 (38.2)	186 (22.6)
PAM_5	2 (.2)	10 (1.2)	530 (64.3)	278 (33.7)	4 (.5)
PAM_6	0 (0)	24 (2.9)	548 (66.5)	250 (30.3)	2 (.2)
PAM_7	0 (0)	16 (1.9)	488 (59.2)	304 (36.9)	16 (1.9)
PAM_8	1 (.1)	40 (4.9)	516 (62.6)	186 (22.6)	81 (9.8)
PAM_9	1 (.1)	65 (7.9)	487 (59.1)	142 (17.2)	129 (15.7)
PAM_10	1 (.1)	40 (4.9)	482 (58.5)	297 (36)	4 (.5)
PAM_11	0 (0)	46 (5.6)	606 (73.5)	171 (20.8)	1 (.1)
PAM_12	3 (.4)	118 (14.3)	567 (68.8)	133 (16.1)	3 (.4)
PAM_13	1 (.1)	44 (5.3)	499 (60.6)	278 (33.7)	2 (.2)

Table 3.

Floor and ceiling effects, and item level correlations.

Item	Floor Effect (%)	Ceiling Effect (%)	Item-Rest Correlation	Average Inter-item Correlation	Alpha
PAM_1	.0	64.8	.38	.26	.81
PAM_2	.0	54.7	.43	.26	.81
PAM_3	.1	32.3	.46	.26	.80
PAM_4	.0	38.2	.31	.27	.82
PAM_5	.2	33.7	.46	.26	.81
PAM_6	.0	30.3	.48	.25	.80
PAM_7	.0	36.9	.42	.26	.81
PAM_8	.1	22.6	.44	.26	.81
PAM_9	.1	17.2	.40	.26	.81
PAM_10	.1	36.0	.49	.25	.80
PAM_11	.0	20.8	.61	.24	.79
PAM_12	.4	16.1	.56	.25	.80
PAM_13	.1	33.7	.47	.25	.80
Test scale				.26	.82

Uni-Dimensionality and Rasch Item Fit

Assessing uni-dimensionality of the PAM-13 using the PCAR analysis showed that the variance explained by the measure was 63.3% (above 50% of the total variance), and the variance explained by the first principal component of residuals was 14.2% (slightly lower than 15%), forming a ratio of 4.5:1, which met the 3:1 criterion for uni-dimensionality. The eigenvalue of the first contrast was 1.85 (lower than 2.0). Overall, the uni-dimensionality of the PAM-13 was supported by the data.

The results of Rasch item fit statistics in Table 4 showed that both Infit MNSQ (range: .63 to 1.14) and outfit MNSQ (range: .57 to 1.25) were within the acceptable range of .6 to 1.4 for all items, suggesting that this is a productive measurement with a good fit of the Rasch model. However, there were seven items having Infit ZSTD and eight items having Outfit ZSTD beyond the acceptable range, suggesting that these items were useful to the measurement but required further refinement.

Table 4.

Item difficulty, standard error and infit and outfit statistics.

Item Sequence Based on Item Difficulty	Item	Item Difficulty	Standard Error	Location	Threshold 1	Threshold 2	Infit		Outfit
Item Sequence Based on Item Difficulty	Item	Item Difficulty	Standard Error	Location	Threshold 1	Threshold 2	MNSQ	ZSTD	MNSQ	ZSTD
1	PAM_1	—	—	.65	−1.82	3.13	.93	−1.69	1.21	2.12
2	PAM_2	−1.23	.08	1.24	−1.23	3.71	1.02	.61	1.02	.30
3	PAM_4	−.87	.09	1.60	−.87	4.07	1.14	2.89	1.25	3.20
4	PAM_7	−.30	.08	2.17	−.30	4.64	.93	−1.43	.91	−1.49
5	PAM_5	−.14	.08	2.34	−.14	4.81	.84	−3.48	.83	−2.96
6	PAM_10	−.07	.08	2.40	−.07	4.87	1.06	1.25	1.06	.92
7	PAM_3	.04	.08	2.51	.04	4.98	.95	−1.01	.95	−.90
8	PAM_13	.09	.08	2.56	.09	5.03	1.10	1.89	1.09	1.42
9	PAM_6	.14	.08	2.61	.14	5.08	.86	−2.86	.83	−2.83
10	PAM_8	.60	.08	3.07	.60	5.54	.79	−3.79	.75	−4.00
11	PAM_11	.83	.08	3.30	.83	5.77	.63	−7.50	.57	−7.75
12	PAM_9	1.12	.09	3.59	1.12	6.06	.86	−2.22	.82	−2.59
13	PAM_12	1.62	.09	4.09	1.62	6.56	.87	−2.44	.85	−2.49

Note. MNSQ: mean square (values between .60 and 1.40 are within acceptable limits for the Rasch model). ZSTD: value of t-test (values between 2 and 2 are within acceptable limits for the Rasch model).

Based on the difficulty parameters (Table 4), the ranking of items derived from this cohort was different as compared to the ranking of items in the original PAM-13. Participants in this study found it easier to agree to PAM_7, PAM_10 and PAM_13. On the other hand, items including PAM_3, PAM_6, PAM_8 and PAM_9 were harder to be endorsed by the participants. The item location parameter ranged from .65 for PAM_1, which was the easiest item, to 4.09 for PAM_12, the most difficult one.

Known-Group Validity

As shown in Table 5, participants who had multimorbidity or polypharmacy had significantly lower PAM scores (mean score: 64.9 and 64.3, respectively) compared to their respective counterparts (mean score: 66.7 and 66.6, respectively) and were more likely to have lower activation levels. Participants who were more active in physical activity had higher PAM scores or activation levels than their counterparts. Those who perceived their health as good as or even better than their peers had higher PAM scores and higher proportions with higher activation levels than those perceived their health not as good as their peers. These results provided the evidence of known-group validity.

Table 5.

Associations between PAM-13 and other variables or measures.

Variable	n (%)	PAM Level				P-value	PAM Score	P-value
		Level 1	Level 2	Level 3	Level 4
		(n = 20)	(n = 79)	(n = 563)	(n = 162)		Mean±SD/r (95%CI)
Multimorbidity						.020^a		.021^b
No	584 (70.9)	11 (55.0)	52 (65.8)	392 (69.6)	129 (79.6)		66.7 ± 12.3
Yes	240 (29.1)	9 (45.0)	27 (34.2)	171 (30.4)	33 (20.4)		64.9 ± 11.4
Polypharmacy						.001^a		.008^b
No	723 (87.7)	15 (75.0)	60 (76.0)	500 (88.8)	148 (91.4)		66.6 ± 12.0
Yes	101 (12.3)	5 (25.0)	19 (24.0)	63 (11.2)	14 (8.6)		63.4 ± 12.4
Physical activity participation
Inactive	364 (44.2)	11 (55.0)	48 (60.8)	254 (45.1)	51 (31.5)	<.001^a	63.9 ± 10.9	<.001^b
Active	460 (55.8)	9 (45.0)	31 (39.2)	309 (54.9)	111 (68.5)	<.001^a	68.0 ± 12.7	<.001^b
Self-perceived health status
Not as good/do not know	108 (13.1)	8 (40.0)	27 (34.2)	64 (11.4)	9 (5.6)	<.001^a	60.2 ± 10.6	<.001^c
As good	353 (42.8)	6 (30.0)	31 (39.2)	248 (44.0)	68 (42.0)		65.8 ± 11.7
Better	363 (44.1)	6 (30.0)	21 (26.6)	251 (44.6)	85 (52.5)		68.4 ± 12.3
PHQ-9 depressive symptoms	824 (100)	2.3 ± 3.5	1.3 ± 3.2	.4 ± 1.8	.3 ± 1.0	<.001^c	−.13 (−.19, −.06)	<.001^d
Health confidence	824 (100)	6.1 ± 1.9	6.9 ± 1.3	7.8 ± 1.0	8.5 ± 1.1	<.001^c	.38 (.32, .44)	<.001^d
Low	89 (10.8)	13 (65.0)	24 (30.4)	47 (8.3)	5 (3.1)	<.001^a	58.7 ± 11.0	<.001^b
High	735 (89.2)	7 (35.0)	55 (69.6)	516 (91.7)	157 (96.9)	<.001^a	67.1 ± 11.9	<.001^b

^aChi-squared test.

^bIndependent sample t-test.

^cOne-way ANOVA.

^dPearson’s correlation test.

Convergent and Divergent Validity

PAM score was positively associated with health confidence (r = .38) as shown in Table 5. Individuals with high health confidence level had significantly higher PAM scores compared to those with low health confidence level (mean score: 67.1 and 58.7, respectively). However, the association between PAM score and PHQ-9 depressive symptom score was low (r = − .13) although higher activation levels corresponded to lower depressive symptom scores. The correlation between PAM score and health confidence score (r = .38) was higher than that between PAM score and depressive symptom score (r = −.13), providing the evidence of good convergent and divergent validity.

Internal Consistency Reliability

Cronbach’s alpha for the PAM-13 was .82 and item-rest correlations ranged from moderate (items PAM_1 - PAM_10, and PAM_13) to strong (items PAM_11 and PAM_12). The average inter-item correlation was .26 with individual inter-item correlations ranging from .24 to .27 (Table 2), well within the ideal range of .15 to .50, indicating strong internal consistency reliability.

Discussion

This study examined the psychometric properties of the English-version PAM-13 in terms of construct validity (including uni-dimensionality, known-group validity, and convergent and divergent validity) and internal consistency reliability among community-dwelling adult population in Singapore.

Findings from the item-level descriptive analysis showed that “disagree strongly” and “disagree” were the options least selected by this population. The ceiling effect existed in all 13 items and was the highest for the first two items and the lowest for item PAM_12. This pattern was consistent with the study conducted among cardiac patients in outpatient clinics¹¹ although the ceiling effect for each item was more prevalent in our cohort. While high ceiling effect was expected for PAM_1 and PAM_2 as they are the easiest items in the measure, the high ceiling effect for the rest of the items suggests that the scaling of the PAM-13 may not be adequate to capture the appropriate responses for this population. This might also suggest that the PAM-13 with existing rating scale may have low discriminating power or be less responsive to changes when being used among a relatively healthy population. Increasing the response categories towards the high end may allow participants to provide a more accurate response to what they truly feel and reduce ceiling effects.³⁸

In this study, the “NA” option was mostly selected for items PAM_4 (use of prescribed medications), PAM_9 (available treatment options), and PAM_8 (knowledge on cause of health problems), which was different from the response pattern reported in other studies in patients with either specific (eg, cardiac conditions or mental problems) or mixed chronic diseases.^11,16,39 This could be partially explained by the difference in health profile of the participants as these three items might be less applicable to very healthy individuals, especially those without any health problems. The relatively higher responses of “NA” option among community-dwelling adults brings up the necessity of rephasing the instructions of the measure while using it in a population with diverse health states to make it clearer that the health problems stated in the items describe any health problems that are not specific to any chronic conditions.

The uni-dimensionality of the PAM-13 was generally supported by the data from our study, however, the high Infit and Outfit ZSTD for seven out of 12 items suggested for further refinement of these items. Furthermore, as what have been found in a cognitive interviewing study conducted among cardiac patients in Singapore,³⁸ some of the terms/phases such as “when all is said and done” in item PAM_1 or the sentence structures such as “tell doctor concerns I have” in item PAM_6 are not commonly used by local population, and the interpretations of some wording such as “active role” in item PAM_2 and “medical treatments” in item PAM_7 varied due to ambiguity. This is probably due to the differences in health beliefs and needs or practices of self-management of health linked to the local cultural background.³⁸ Another issue highlighted by the trained interviewers was that participants with multiple conditions in our study were more inclined to ask questions like “Which condition should the answer be based on?”, suggesting that the settings where the participants are recruited or interviewed determine the “default” condition the responses are based on.

Similar to the validation studies of either the English^10,12 or other language versions of the PAM-13 conducted in other countries,^13,17,40 ranking of items according to levels of difficulty in this study was different from the original ranking of items derived from an American population with chronic diseases. Interestingly, within the same country, the ranking of items and their response patterns derived from outpatient cardiac patients and the community-dwelling adult population also differed.¹¹ This highlights the necessity in validating the measure when using in different populations or settings.

The differences in PAM scores between subgroups based on multimorbidity, polypharmacy, and activity participation suggest that these three conditions have good distinguishing capability for activation. Similar to other validation studies,^15,18,40,41 the significant difference in PAM scores across three self-perceived health status groups in our study population provided further evidence for known-group validity.

Confidence is a postulated aspect of activation. The convergent validity of the PAM-13 was supported by its positive and moderate correlation with the confidence in terms of controlling and managing most of health problems. The PHQ-9, a screening tool for depressive symptoms, was not constructed to measure any aspects of activation. As such, its negative and weak association with the PAM-13 provided the evidence of divergent validity.

Similar to the validation studies conducted in different disease-specific populations,^11-13,15,40 this study adds additional evidence for good internal consistency reliability.

Examining the distribution of activation levels between/across known-groups and comparison of PHQ-9 and health confidence scores across activation levels produced consistent findings as compared to examining using PAM scores, suggesting that the current cut-off scores for activation levels have the discriminating power based on the variables examined in this study. However, as the existing four levels of activation were derived based on the original item difficulty, further research is required to determine the validity of the cut-off scores of the four levels of activation in different populations, especially if the ranking of the item difficulty differs from its original one.

There are several limitations for the study. Firstly, the original PAM-13 was directly used for data collection without any adaption to the health beliefs of local population, this might result in unintended variation in responses. Secondly, the interviewers were instructed not to interpret the statements to individuals as suggested by the administration guideline of PAM-13, variation in interpretation or comprehension from participants could not be avoided, which might increase the variation in responses. Thirdly, due to the cross-sectional study design, we were unable to examine the test-retest reliability and the responsiveness of the PAM-13, which is of interest for future research.

Conclusions

This study demonstrates that the PAM-13 has acceptable construct validity and good internal consistency among community-dwelling adults in Singapore. Further research is required to refine the instruction of PAM administration, investigate the expansion of response options, validate the cut-off scores for the activation levels and examine the test-retest reliability and responsiveness.

Footnotes

Acknowledgments

The authors thank Reuben Ong for administrative assistance with project management, all the trained surveyors for assistance with data collection and the subjects whose participation made this study possible.

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Availability of data and materials

According to the Data Protection Act Commission Singapore-Advisory Guidelines for the Healthcare Sector, all the individual data collected for the Population Health Index study are protected under the Act. As such, the datasets analyzed during the current study are not publicly available. However, minimal dataset underlying the findings in the manuscript is available from the corresponding author on reasonable request.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Healthcare Group Pte Ltd in the form of salaries for all authors.

Ethics approval and consent to participate

The study was approved by the ethics review committee of the National Healthcare Group Domain Specific Review Board (Reference Number: 2015/00269). The study was conducted in accordance with the Declaration of Helsinki. Written informed consent was obtained from all individual participants after they were being informed about the study objectives and the safeguards put in place so that confidentiality of the collected data is maintained.

ORCID iD

Lixia Ge

References

World Health Organization . Noncommunicable Diseases. WHO. https://www.who.int/news-room/fact-sheets/detail/noncommunicable-diseases. Published June 1 2018, Accessed March 7, 2021.

Dietz

Brownson

Douglas

, et al. Chronic disease prevention: tobacco, physical activity, and nutrition for a healthy start: A vital direction for health and health care. NAM Perspect. 2016;6(9). doi:10.31478/201609j.

Institute of Medicine . Living Well with Chronic Illness: A Call for Public Health. Washington, DC: The National Academies Press; 2012. doi:10.17226/13272.

Hibbard

. Patient activation and the use of information to support informed health decisions. Patient Educ Counsel. 2017;100(1):5-7. doi:10.1016/j.pec.2016.07.006.

Hibbard

Mahoney

. Toward a theory of patient and consumer activation. Patient Educ Counsel. 2010;78(3):377-381. doi:10.1016/j.pec.2009.12.015.

Hibbard

Mahoney

Stock

Tusler

. Do increases in patient activation result in improved self-management behaviors? Health Serv Res. 2007;42(4):1443-1463. doi:10.1111/j.1475-6773.2006.00669.x.

Regeer

van Empelen

Bilo

HJG

de Koning EJP Huisman

. Change is possible: How increased patient activation is associated with favorable changes in well-being, self-management and health outcomes among people with type 2 diabetes mellitus: A prospective longitudinal study. Patient Educ Counsel. 2022;105(4):821-827. doi:10.1016/j.pec.2021.07.014.

Harvey

Fowles

Terry

. When activation changes, what else changes? The relationship between change in patient activation measure (PAM) and employees’ health status and health behaviors. Patient Educ Couns. 2012;88(2):338-343. doi:10.1016/j.pec.2012.02.005.

Hibbard

Stockard

Mahoney

Tusler

. Development of the Patient Activation Measure (PAM): Conceptualizing and measuring activation in patients and consumers. Health Serv Res. 2004;39(4 Pt 1):1005-1026. doi:10.1111/j.1475-6773.2004.00269.x.

10.

Hibbard

Mahoney

Stockard

Tusler

. Development and testing of a short form of the patient activation measure. Health Serv Res. 2005;40(6 Pt 1):1918-1930. doi:10.1111/j.1475-6773.2005.00438.x.

11.

Ngooi

Packer

Kephart

, et al. Validation of the Patient Activation Measure (PAM-13) among adults with cardiac conditions in Singapore. Qual Life Res. 2017;26(4):1071-1080. doi:10.1007/s11136-016-1412-5.

12.

Lightfoot

Wilkinson

Memory

Palmer

Smith

. Reliability and validity of the patient activation measure in kidney disease: Results of Rasch analysis. CJASN. 2021;16(6):880-888. doi:10.2215/CJN.19611220.

13.

Ahn

Ham

Kim

. Psychometric properties of the Korean version of the “Patient Activation Measure 13” (PAM13-K) in patients with osteoarthritis. Eval Health Prof. 2015;38(2):255-264. doi:10.1177/0163278714540915.

14.

Graffigna

Barello

Bonanomi

Lozza

Hibbard

. Measuring patient activation in Italy: Translation, adaptation and validation of the Italian version of the patient activation measure 13 (PAM13-I). BMC Med Inform Decis Mak. 2015;15:109. doi:10.1186/s12911-015-0232-9.

15.

Maindal

Sokolowski

Vedsted

. Translation, adaptation and validation of the American short form Patient Activation Measure (PAM13) in a Danish version. BMC Publ Health. 2009;9:209. doi:10.1186/1471-2458-9-209.

16.

Moljord

IEO

Lara-Cabrera

Perestelo-Pérez

Rivero-Santana

Eriksen

Linaker

. Psychometric properties of the patient activation measure-13 among out-patients waiting for mental health treatment: A validation study in Norway. Patient Educ Couns. 2015;98(11):1410-1417. doi:10.1016/j.pec.2015.06.009.

17.

Packer

Kephart

Ghahari

Audulv

Versnel

Warner

. The patient activation measure: A validation study in a neurological population. Qual Life Res. 2015;24(7):1587-1596. doi:10.1007/s11136-014-0908-0.

18.

Zill

Dwinger

Kriston

Rohenkohl

Härter

Dirmaier

. Psychometric evaluation of the German version of the Patient Activation Measure (PAM13). BMC Publ Health. 2013;13:1027. doi:10.1186/1471-2458-13-1027.

19.

Ong

. English Most Spoken at Home for Nearly Half of S’pore Residents: Population Census. The Straits Times. https://www.straitstimes.com/singapore/english-most-spoken-at-home-for-nearly-half-of-spore-residents-population-census. Published June 16 2021, Accessed April 20, 2022.

20.

Yap

Ong

Heng

. Social isolation, loneliness and their relationships with depressive symptoms: A population-based study. PLoS One. 2017;12(8):e0182145. doi:10.1371/journal.pone.0182145.

21.

Yap

Ong

Heng

. Development of a scalable and extendable multi-dimensional health index to measure the health of individuals. PLoS One. 2020;15(10):e0240302. doi:10.1371/journal.pone.0240302.

22.

Chen

Lenderking

Jin

Wyrwich

Gelhorn

Revicki

. Is Rasch model analysis applicable in small sample size pilot studies for assessing item characteristics? An example using PROMIS pain behavior item bank data. Qual Life Res. 2014;23(2):485-493. doi:10.1007/s11136-013-0487-5.

23.

Sung

Low

CCH

Fung

DSS

Chan

. Screening for major and minor depression in a multiethnic sample of Asian primary care patients: A comparison of the nine-item Patient Health Questionnaire (PHQ-9) and the 16-item Quick Inventory of Depressive Symptomatology - Self-Report (QIDS-SR16 ). Asia Pac Psychiatr. 2013;5(4):249-258. doi:10.1111/appy.12101.

24.

Wasson

Coleman

. Health confidence: A simple, essential measure for patient engagement and better practice. FPM. 2014;21(5):8-12.

25.

McHorney

Tarlov

. Individual-patient monitoring in clinical practice: Are available health status surveys adequate? Qual Life Res. 1995;4(4):293-307.

26.

Andrich

. A rating formulation for ordered response categories. Psychometrika. 1978;43(4):561-573. doi:10.1007/BF02293814.

27.

Smith

. Detecting and evaluating the impact of multidimensionality using item fit statistics and principal component analysis of residuals. J Appl Meas. 2002;3(2):205-231.

28.

Linacre

. Data variance explained by Rasch measures. R M T. 2006;20:1045.

29.

McCreary

Conrad

Scott

Funk

Dennis

. Using the Rasch measurement model in psychometric analysis of the family effectiveness measure. Nurs Res. 2013;62(3):149-159. doi:10.1097/NNR.0b013e31828eafe6.

30.

Embretson

Reise

. Item Response Theory for Psychologists. Mahwah, New Jersey: Lawrence Erlbaum Associates, Inc.; 2000.

31.

Boone

Staver

. Principal component analysis of residuals (PCAR). In: Boone

Staver

, eds Advances in Rasch Analyses in the Human Sciences. Cham: Springer International Publishing; 2020:13-24. doi:10.1007/978-3-030-43420-5_2.

32.

Linacre

. What do infit and outfit, mean-square and standardized mean? R M T. 2002;16(2):878.

33.

Wright

Linacre

. Reasonable mean-square fit values. R M T. 1994;8(3):370.

34.

Bond

Fox

. Applying the Rasch Model: Fundamental Measurement in the Human Sciences. 2nd ed. Mahwah, NJ, US: Lawrence Erlbaum Associates Publishers; 2007:340.

35.

Bland

Altman

. Statistics notes: Cronbach’s alpha. BMJ. 1997;314(7080):572. doi:10.1136/bmj.314.7080.572.

36.

Stephanie

. Average inter-Item correlation: Definition, example. Statistics HowTo.com: Elementary statistics for the rest of us! https://www.statisticshowto.com/average-inter-item-correlation/. Published June 17 2018, Accessed August 11, 2020.

37.

Clark

Watson

. Constructing validity: Basic issues in objective scale development. Psychol Assess. 1995;7(3):309-319. doi:10.1037/1040-3590.7.3.309.

38.

Ngooi

Packer

Warner

, et al. How adults with cardiac conditions in Singapore understand the Patient Activation Measure (PAM-13) items: a cognitive interviewing study. Disabil Rehabil. 2018;40(5):587-596. doi:10.1080/09638288.2016.1261413.

39.

Hellström

Kassaye Tessma

Flink

Dahlgren

Schildmeijer

Ekstedt

. Validation of the patient activation measure in patients at discharge from hospitals and at distance from hospital care in Sweden. BMC Publ Health. 2019;19(1):1701. doi:10.1186/s12889-019-8025-1.

40.

Brenk-Franz

Hibbard

Herrmann

, et al. Validation of the German version of the Patient Activation Measure 13 (PAM13-D) in an international multicentre study of primary care patients. PLoS One. 2013;8(9):e74786. doi:10.1371/journal.pone.0074786.

41.

Rademakers

Nijman

van der Hoek

Heijmans

Rijken

. Measuring patient activation in the Netherlands: translation and validation of the American short form Patient Activation Measure (PAM13). BMC Publ Health. 2012;12:577. doi:10.1186/1471-2458-12-577.