Abstract
Objective
This exploratory experimental study compared young people’s credibility appraisals and behavioural intentions following exposure to depression treatment information on a Health 2.0 website versus a traditional website. The traditional website listed evidence-based treatment recommendations for depression as judged by field experts. The Health 2.0 website contained information about how helpful each treatment was, as aggregated from feedback from young people with lived experience of depression.
Method
Participants (
Results
Participants in the traditional website condition rated their website as significantly more influential than did participants presented with the Health 2.0 website. This difference in treatment influence was fully accounted for the participants’ perception of credibility of the information provided by the websites.
Conclusion
The traditional website was rated as significantly more credible and influential than the Health 2.0 website. Treatment decisions appeared to be based on the extent to which online information appears credible. In conclusion, health-related content was perceived by users as more credible when endorsed by experts than by other users, and perceived message credibility appears to be a powerful determinant of behavioural intentions within the e-health setting.
Although young people value and use the Internet to seek information on mental health,1–3 surprisingly little is known about the impact of such information on young people’s intentions to seek help,4–6 and factors that influence such impact. The advent of Health 2.0 has provided users with an alternative source of information about health compared with traditional expert-sponsored websites. Given the variety of online information sources now available, it is important to understand the impact of different types of information on young people’s decisions to seek treatment, and the factors that account for such impact. As the e-health landscape continues to evolve, several questions emerge regarding the impact of website information that is crowdsourced (that is, based on user-generated reviews and opinions) compared with the more traditional websites that are based on expert consensus and research.
The terms Medicine 2.0 and Health 2.0 are sometimes used synonymously and at other times they are defined differently. Health 2.0 is the more prevalent and widely accepted term.
7
In the absence of a consensus on an appropriate definition,
7
we use the following definition of Medicine 2.0 provided by Hughes and colleagues: ‘
Credibility appraisals are appraisals about the extent to which information or an information source is believable. 12 The emergence of Health 2.0 has highlighted the importance of evaluating the credibility of e-health content,13–15 as there is often no quality control procedure in place to assess the content prior to it being uploaded. Furthermore, individuals publishing content are not required to have any particular expertise in healthcare. As the e-health landscape continues to evolve, researchers have shifted from examining the role of content quality to the role of the user’s perceived credibility of the website and content in treatment decisions.
Several research questions need to be examined in order to better understand how e-health users appraise the credibility of different types of e-health content.
First, there is a need to explore the extent to which e-health consumers believe that e-health content derived from lay expertise (i.e. ‘user-generated’ data) is as credible as that derived from professional expertise (i.e. traditional e-health data).13,14 Emerging research indicates that content provided by patients on online health forums is valued as a credible source of health advice by e-health users.16,17 Investigating how e-health consumers appraise the credibility of user-generated health data, including crowdsourced treatment reviews, has been identified as a research priority.14,15,17 Crowdsourced medical data is frequently accessed.18,19 Moreover, evidence indicates that medical treatment advice derived from crowdsourced data is likely to differ from evidence-based guidelines.20–23 For example, one study found that medications that were subject to United States Food and Drug Administration black box warnings were rated more poorly in crowdsourced reviews than in the empirical literature, whereas alternative medications or those with addictive properties were rated more positively. 21 Studies comparing crowdsourced treatment reviews of dermatological treatments to randomised controlled trial results found that, for some treatments, crowdsourced reviews were more positive than would be expected on the basis of their empirical support, while for others, treatments were rated more negatively.20,22,24 This raises the question as to how e-health users rate the credibility of crowdsourced treatment data compared with evidence-based guidelines.
A second question that needs to be examined is the extent to which such crowdsourced e-health content influences young people’s help-seeking intentions, and the extent to which credibility appraisal of such content plays a role in such influence. Internationally, there has been an increased emphasis on attempting to better harness the potential of online technologies to meet the needs of young people experiencing mental health difficulties. 25 While some websites provide e-therapy, many others provide mental health information with the intent of improving young people’s awareness of their mental health needs, and their ability to source appropriate interventions offline. Most young people search for mental health information by entering a general search term on a search engine (e.g. ‘depression’ on Google) rather than going straight to a trusted website.2,26,27 The retrieved results are likely to vary in information quality. 6 Therefore, it is important to understand which sources of online advice young people are likely to adopt.
Third, there is a need to explore how e-health users appraise the credibility of the content that they find online. The credibility literature has largely focused on the perceived credibility of information sources (i.e. perceived source credibility), with far fewer studies examining how individuals appraise the credibility of messages themselves (i.e. perceived message credibility). 10 However, an individual’s credibility appraisal methods are likely to vary dependent on the information domain (e.g. entertainment versus health) and whether they are acquiring information online or offline.14,28–30 Researchers have cautioned against the risk of overlooking the importance of message features in contributing to credibility perceptions of e-health content.31,32 Indeed, there is evidence to suggest that e-health users focus on message rather than source credibility cues when appraising information derived from both experiential expertise and scientific expertise in online health forms. 17
Within the communications literature, research has consistently demonstrated that sources that are perceived to be credible have a greater impact on information receivers’ attitude and behaviour than those that are not.10,11 However, research suggests that in conditions of high user involvement (i.e. when information receivers are heavily invested in the topic), it is not the most important predictor of persuasion. 11 Rather, the perceived credibility of message content (i.e. message credibility) may be more influential. Wilson and Sherrell 11 conducted a meta-analysis examining the impact of source credibility on persuasion when user involvement was considered. They found that in 67% of the studies, source credibility only had an impact on persuasion in low user involvement conditions (i.e. when information receivers had low levels of investment in the topic being discussed). This finding is consistent with the Elaboration Likelihood Model of information processing (ELM).33,34 ELM suggests that in low involvement conditions, individuals may use source cues (e.g. the website domain name) as a simple heuristic (or ‘rule of thumb’) to make a quick decision on whether or not their message is credible (e.g. the source looks professional). In contrast, when individuals are highly involved in a topic, if they have the ability to do so, they will invest cognitive effort in critically appraising message content (i.e. using the central route of cognitive processing) rather than relying on simple heuristics. A similar dual processing model has been applied to the context of assessing credibility in the online environment, emphasising the key role that user motivation and ability play. 35
The online context and the nature of the task of searching for medical treatment information promote an emphasis on critically evaluating the characteristics of e-health content itself (i.e. perceived message credibility) in informing behavioural intentions.17,33,36 There are two reasons for this. First, adopting inaccurate or misguided treatment advice could be very costly to the wellbeing of e-health users. Therefore, they are by definition a highly involved user group who are motivated to make an accurate credibility appraisal.14,16,17,29 It can be argued that young people seeking mental health advice online are a particularly involved user group compared with other age groups, as they are particularly reluctant to seek mental health advice offline. 37 This suggests that they are less likely than older adults to verify the accuracy of the information they find online with offline sources (e.g. medical professionals, family members or peers). This creates extra vulnerability, as verification has been found to be the most important strategy that e-health users use to appraise the credibility of both scientific and experiential e-health content. 17 Second, e-health users are gathering treatment information in an environment in which source cues are often ambiguous or absent and, even when provided, they may not be trustworthy.14,29,30,38,39 This promotes an emphasis on the influence of message over source cues in influencing credibility appraisals. 30
Even when relying on heuristics to make credibility appraisals, in the context of searching for medical treatment information, e-health users are more likely to rely on message rather than source cues. This is because they are ‘accuracy-motivated’ users. 40 One heuristic that is likely to be particularly salient when searching for medical treatment information is that of ‘self-confirmation’, which reflects ‘a tendency to notice and place greater value on information that supports one’s beliefs, while discounting information that refutes those beliefs.’ (Metzger and Flanagin, 29 p.453). The ‘consistency heuristic’ is also likely to be important; this reflects a tendency to believe information is credible if it is consistent with information provided by other sources. 40 In contrast, heuristic cues related to the surface characteristics of websites (e.g. the design and layout, interactivity and domain name) that have been found to influence Internet users’ credibility appraisals29,41 are likely to be less influential when an individual is searching for information in a situation where a poor decision could come at a high personal cost.29,40
Health 2.0 websites that utilise crowdsourcing allow e-health users to share and access experiential expertise in unprecedented and potentially more powerful ways than ever before. The ‘shared experiential expertise’ 42 that is available through these websites may be particularly valued by e-health users as a credible source of health advice. First, it may be perceived to be more credible than other forms of user-generated content, as it is more difficult to manipulate. As the volume of consumer feedback increases, the overall output (i.e. the ‘rating’) cannot be easily manipulated by any one reviewer’s input. 42 Second, as the volume of feedback increases, it may be perceived to be less subjective and therefore less biased than any one individual’s feedback. 17 Third, it may be more current than content on traditional websites, as it can be updated very quickly to consider new evidence or controversy. 43
One study has compared the relative perceived credibility of crowdsourced data versus ‘expert-generated’ data. The perceived credibility of content based on ‘expert-generated’ versus ‘user-generated’ movie ratings was investigated. 42 User-generated ratings were only perceived to be more credible than expert-generated ratings if the volume of ratings was high. This was in contrast to a previous study which found that users neglected to attend to the number of ratings when considering reviews of e-commerce products. 44 Participants were more likely to perceive the crowdsourced reviews to be credible if they had contributed crowdsourced data themselves (i.e. posted online reviews and recommendations) and reported familiarity with crowdsourced content. 42 As credibility appraisal methods are likely to vary between information domains, it may not be appropriate to generalise research from an e-commerce to an e-health setting.
The current exploratory study aimed to explore the extent to which traditional and Health 2.0 websites were perceived by young consumers as influential in treatment decisions and if so why. First, it explored whether treatment intentions differed after viewing the two types of website (traditional versus Health 2.0). Second, it examined whether the association between such intentions and website type was mediated by perceived message credibility. It was hypothesised that if behavioural intentions varied as a function of website allocation, this effect would be fully mediated by perceived message credibility.
The study attempted to promote external validity as the majority of research on the credibility of e-health content has lacked ecological validity.10,45,46 The study used live and pre-existing websites in order to maximize the external validity of the information sources. A priori power analyses indicated that in order to determine whether there was a difference between experimental groups in behavioural intentions, a sample size of 210 would be required (anticipating a medium effect size and desired statistical power level of 0.95). While it would have been ideal to recruit a sample of actual users of the websites, or of individuals with current diagnoses of depression, it was not possible to do so within the resource constraints of this study. Therefore, a convenience sample was recruited, primarily comprising female university students. Given the exploratory nature of this study, this approach was deemed to be appropriate.
Method
Participants
Participants were required to be between 18 and 25 years of age and able to read English. They were recruited from social, family and professional networks of the researchers, and from a pool of undergraduate psychology students in the Research Experience Program (REP) at Swinburne University. Student participants were provided with course credit, while others were entered into a draw to win AU$50 gift vouchers. Participation was voluntary and participants could withdraw from the study at any time.
Only participants who were recruited through networks of the authors were offered the option of entering the prize draw and completing the experiment in their own time, as it was assumed that they would comply with the experiment instructions. In order to ensure that all other participants complied with instructions, they completed the experiment in a laboratory session with one of the authors present in the background.
The only check for data validity was for participants to enter the name of the website that they had just viewed. Any participant who failed to do so was excluded from all further analyses.
Design and procedure
The study used a between-subjects experimental design. Using an online algorithm (www.randomizer.org), participants were randomly assigned to one of two e-health websites:
In contrast, the primary purpose of the
Summary of some key similarities and differences between the two websites.
The
Participants were asked to: ‘Imagine that you have recently been diagnosed with depression. You are unsure of what treatments you should try so you decide to go online to find some information about “what works” for treating depression.
A manipulation check was used to determine whether participants had correctly perceived whether the information on the website was provided by an expert (‘researchers with expertise in depression’ for
Measures
Participants completed a demographics form (sex, age, education level and employment status) and the following questionnaires prior to viewing their allocated website.
Percentage of participants who had previously engaged in Health 2.0 activities.
Topic salience
53
was measured by four items that were used to determine how relevant the topic of depression was to participants’ lives. Participants were asked, ‘have you ever been diagnosed with depression?’ and ‘has someone close to you (e.g. a family member or friend) ever been diagnosed with depression?’ Response options were ‘yes’, ‘no’ and ‘I don’t know’. Participants were asked to rate their perceived knowledge about depression and its treatment on a scale of 1 (
Disposition to trust 54 was measured by the four-item Disposition to Trust Scale which assessed participants’ general tendency toward trusting others. This scale was initially developed as a five-item scale on the basis of the literature pertaining to trait-based trust. Following initial psychometric analyses of the subscale within a broader e-commerce questionnaire using confirmatory factor analysis, one item was deleted on the basis of shared variance. 54 Unidimensionality of the four-item scale was confirmed using goodness of fit indices for the full structural model (GFI = 0.92, AGFI = 0.88, NFI = 0.93, and RMR = 0.04; 54 with factor loadings ranging from 0.61 to 0.89). The four-item scale demonstrated acceptable internal reliability in the current study (Cronbach’s α = 0.75).
A three-item measure of website design assessed the degree to which the website was perceived to be: ‘attractive’, ‘colourful’ and ‘likeable’. The scale demonstrated acceptable internal reliability (Cronbach’s
The 12-item website credibility subscale was refined using confirmatory factor analyses. Several items had to be deleted due to metric invariance between the two experimental groups. This indicated that the measure was being interpreted differently across the two experimental groups. As a result, it was not valid to compare the two groups on the subscale. In order to achieve metric invariance, it was necessary to remove the problematic item ‘trustworthiness’ from the scale, as it loaded significantly more heavily on the construct of website credibility for the
Analytical approach
Preliminary analyses comprised three steps. First, psychometric analyses were conducted to determine the reliability and validity of the included scales. Second, in order to establish whether randomisation was successful, a series of analyses were conducted to examine whether there were any differences between the experimental groups on user characteristics. Third, frequencies were used to establish the manipulation check results (i.e. did the group allocated to
In order to examine the first research question, an independent samples
Finally, a series of independent samples t-tests were used to ascertain whether there were any between-group differences on other website characteristics or in confirmation of prior belief, that may have contributed to differences in behavioural intentions between the experimental groups.
Results
Participants
A desired sample size of 210 was based on a priori power analysis for a two-tailed student
Participants’ disposition to trust, internet and e-health literacy skills (n = 279).
Note: *eHEALS scores were presented as means scores ranging from 1 to 5 rather than total scores ranging from 8 to 40.
Depression was a salient topic; most participants (189/279, 67.7%) reported that a close friend or family member had been diagnosed with depression, and one in five (57/279, 20.4%) reported a lifetime history of depression. Almost three quarters of the sample (207/279, 74.5%) reported knowing ‘a reasonable amount’ or a ‘great deal’ about depression and its treatment. Most (244/279, 87.5%) reported being ‘moderately’ or ‘very’ interested in learning more about depression and its treatment.
Randomisation
A series of non-parametric Mann–Whitney
Performance on the manipulation check
Failure rates for identifying the message source for
Research questions
The mean behavioural intention rating was significantly higher for the group allocated to the
Results of mediation analysis for the total sample (
PMC, perceived message credibility; BI, behavioural intentions.

Results of a simple mediation analysis (n = 279).
Exploratory independent-samples t-tests (two-tailed) of between-group differences on website characteristics.
Discussion
This exploratory experimental study compared young people’s behavioural intention toward acting on depression treatment information provided on a traditional e-health website, compared with a Health 2.0 website. It was hypothesised that if behavioural intentions varied as a function of website allocation, this effect would be fully mediated by perceived message credibility. In response to calls for studies examining online credibility to be externally valid, 10 live and pre-existing websites were used in this study. As recommended by previous researchers, 65 efforts were made to choose websites that were similar on several important characteristics (e.g. similar website sponsors, lack of advertising, similar treatment rating scales used). The sample used in this study was comprised primarily of female university students. This sample limits the generalizability of the findings.
Behavioural Intention ratings were significantly higher among participants in the
While the sample in this study was not clinical, participants were well suited to addressing the research questions, as they had sufficient ability to complete the task as demonstrated by high levels of reported Internet use, experience, perceived expertise and adequate e-health literacy. In addition, the majority (87.5%) reported being ‘moderately’ or ‘very’ interested in learning more about depression and its treatment. Most (67.7%) had a friend or family member who had been diagnosed with depression. The prevalence of a lifetime history of depression (20.5%) was slightly higher than that found in the general population. 66 Previous research has found elevated levels of psychological distress among student populations. 67 Therefore, the sample succeeded in mirroring the high user-involvement condition that would be typical of most e-health users seeking treatment advice.
The finding that message credibility fully mediated the relationship between website allocation and behavioural intention is consistent with ELM33,34 and Metzger’s dual processing model of credibility assessment, 35 which posit that under conditions of high topic involvement, users will scrutinise message characteristics. Moreover, the results of this study were similar to those of McKnight and Kacmar 9 who found that message credibility fully mediated the relationship between initial impressions of a website (e.g. perceived website quality, credibility, and reputation) and the adoption of online legal advice. The results of this study also support other research that demonstrated that message characteristics were important predictors of the adoption of e-health advice. 31 They are also consistent with the finding that message credibility can be an important determinant of e-health users’ appraisals of the credibility of e-health data regardless of whether it is derived from experiential and scientific expertise.16,17
Further analyses supported the inference that the difference in behavioural intentions ratings between groups was not due to differences in other website features. There was no difference in perceived sponsor credibility between the groups, supporting the inference that differences in the perceived credibility of the website sponsors did not account for the main effect of website on behavioural intention. While there were significant differences between groups in ratings of website design and dynamism features, both of these differences were in favour of the Health 2.0 website, and hence cannot explain the higher ratings of behavioural intention for the traditional website.
A significant minority of participants failed the manipulation check. These findings are similar to those of previous research that found that significant percentages of participants fail correctly identify the source of information. 60 As this study was prioritising external validity, participants’ data was included in the main analyses regardless of their performance on the manipulation check. Full mediation was also evident in the subset of participants who passed the manipulation check.
Researchers are beginning to investigate the impact of age on how e-health users appraise the credibility of e-health data. There is some evidence to suggest that young people may be more critical consumers of traditional e-health content than older adults. However, when Health 2.0 content is considered, the impact of age on credibility appraisals seems to be more complex. 68
Limitations and future research
This study had several limitations that should be considered in interpreting the results.
First, in promoting external validity of the information sources used, the internal validity of the study was attenuated. The websites differed in several ways in terms of the content offered, their domain names (.org versus .edu.au), website design and interactivity features. Some of these variables were measured and controlled for within the study design. However, it was not possible to control for all potential confounding variables. As such, the results of this study cannot be solely attributed to differences in credibility appraisals of treatment data derived from experiential versus professional expertise. Instead, they need to be conceptualised as overall differences in credibility appraisals and behavioural intentions related to the websites as a whole.
Second, this exploratory study only included one prototype of a Health 2.0 and a traditional e-health website. Therefore, it is not possible to generalise the results to other websites. As an example of a Health 2.0 website,
Third, participants were predominantly female and highly educated, therefore is not possible to generalise the findings to other populations. The results need to be interpreted bearing this in mind. There is evidence to suggest that references are a particularly important indicator of credibility to university students.70,71 As the sample in this study was predominantly made up of students, they may have been particularly influenced by evidence-based information provided on
Fourth, while the study attempted to enhance external validity, it was still conducted in an artificial situation. Participants recruited outside of the researchers’ personal and professional networks were required to complete the experiment in a laboratory with one of the authors supervising in the background. This is likely to have promoted the validity of the data, however it also compromised the external validity of the study, as the experimental setting was more artificial than it would have been if all participants could complete the experiment privately. Moreover, in an artificial situation, certain variables cannot be controlled for. For example, young people usually use search engines to find e-health content2,26,27,72 and the process by which young people find a website has been found to influence their credibility appraisals of online content. 73 Furthermore, while the use of a vignette can encourage goal-directed searching and encourage individuals to act as they would in a real-life situation, hypothetical situations cannot replicate real-world circumstances. Depressive symptoms (e.g. amotivation, hopelessness and helplessness) may reduce the likelihood that individuals who are currently depressed would act on any e-health content accessed. Furthermore, cognitive symptoms of depression, such as poor concentration, and fatigue may also interfere with credibility appraisal processes that are observed in non-depressed populations. Future research should recruit more diverse samples of young people and include those with current diagnoses of mental illness. Researchers should consider recruiting directly from the websites in question to investigate the opinions of active users. Similarly, several important predictors of persuasion were not included in the analyses. Two particularly important variables that were not accounted for were perceived source expertise (lay expertise and professional expertise) and homophily.60,74 Future research should also consider incorporating more complex data validity checks to improve the generalizability of results, particularly if self-report data is being used.
Finally, future research should go beyond measuring behavioural intention toward acting on treatment advice to measure actual behaviour change that results from accessing e-health data. While behavioural intentions are assumed to lead to behaviour, this is not always the case. 75
Contributions of this research to the field
This is the first study investigating the relative impact of mental health treatment advice provided on a Health 2.0 versus a traditional e-health website. Furthermore, it is the first study to compare the perceived credibility of these two types of mental health treatment advice, from the perspective of end-users. Behavioural intentions ratings for the traditional e-health website exceeded those of the group allocated to the traditional e-health. However, the effect size was small and mean behavioural intentions ratings for both websites were above the midpoint on the rating scale. The Health 2.0 website used in this study had several limitations that may have detracted from the perceived credibility of the treatment ratings provided, therefore, the results suggest that e-health consumers are likely to perceive more established Health 2.0 websites as quite credible sources of medical treatment advice.
The results of this study also highlight the persuasive power of message credibility within e-health. They add weight to previous findings that e-health users rely heavily on message credibility cues when appraising the credibility of both user-generated and scientific health information online.16,17 Consistent with previous research, 31 message characteristics were very important in influencing participants’ behavioural intention toward adopting online advice. The ultimate aim of e-health websites is to positively influence the wellbeing of their users. Therefore, it is imperative to understand how to promote message credibility to fully harness their potential.
Footnotes
Acknowledgements
The authors would like to thank the participants who took part in this study for their time and interest. In addition, the authors would like to thank the developers of the
Contributorship
FS researched literature and conceived the study with input from SB, AJ and NR on the conceptualization and design. FS and SB were involved in protocol development and gaining ethical approval with input from AJ and NR on reviewing draft documents. FS was responsible for participant recruitment, data collection and analysis. DM provided statistical input on guiding and reviewing data analysis. FS wrote the first draft of the manuscript. All authors reviewed and edited the manuscript and approved the final version of the manuscript
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: The Health 2.0 website used in this study (WhatWorks4u.org) is run by researchers in the Population Mental Health Group at The Melbourne School of Population and Global Health (MSPGH), Australia. This group is led by AJ. In addition to AJ, NR and FS were also involved in developing the WhatWorks4u.org website. AJ is also a former contributor to the BluePages website.
Ethical approval
The Swinburne University Human Research Ethics Committee (Melbourne, Australia) approved this study (SUHREC: 2012/112)
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Guarantor
FS.
Peer review
This manuscript was reviewed by Philip Batterham, Australian National University, and Nikki Newhouse, University of Oxford.
