Abstract
Online labour markets (OLMs) are a vital source of income for globally diverse and dispersed freelancers. Despite their promise of neutrality, OLMs are known to perpetuate hiring discrimination, vested in how OLMs are designed and what kinds of interactions they enable between freelancers and hirers. In this study, we go beyond understanding mechanisms of hiring discrimination in OLMs, to identifying platform design features that can minimise hiring discrimination. To do so, we draw on a methodology guided by the design justice ethos. Drawing on a survey on UK-based freelancers and interviews with a purposefully drawn sub-sample, we collaboratively identify five platform design interventions to minimise hiring discrimination in OLMs: community composition, identity-signalling flairs, text only reviews, union membership, and an antidiscrimination prompt. The core of our study is an innovative experiment conducted on a purpose-built, mock OLM,
Introduction
Online labour markets (OLMs) have witnessed rapid growth in recent years, expanding opportunities for freelancing (Stephany et al., 2021). Wood et al. (2019) estimate that 163 million freelancers are registered on OLMs worldwide, with 4.2 million in the UK reporting freelancing as their main job. 1 Data suggest that a third of freelancers come from India, especially for software development projects (Stephany et al., 2021). Freelancers on OLMs are occupationally heterogeneous and globally diverse (Leung, 2018). Women comprise about 40% of online freelancers (Stephany et al., 2021). Freelancing jobs range from manual labour such as carpentry and plumbing, domestic labour to software development, writing, and creative and multimedia work (Kässi and Lehdonvirta, 2018). Part of OLMs’ appeal to employers is the ability to hire on a short-term basis for specific needs and to do so by accessing a skilled global labour force. Following platform economy scholars, such as Kässi and Lehdonvirta, (2018) and Stephany et al., (2021), we conceptualise these platforms as marketplaces to highlight our focus on the labour market (i.e. hiring) aspects of platforms.
Given the ethnic and gender diversity that encompasses online freelancing, attention needs to be paid to how job opportunities are distributed across OLM freelancers. Who are the people who get selected for jobs? Discrimination in labour markets works through mechanisms which often sort individuals into occupations deemed most suitable given an individual's status characteristics, such as ethnicity and gender (Hong, 2016; Ridgeway and Cornell, 2006). Labour markets with high-status and high-paid jobs typically tend to favour White men, who benefit from stereotypes of expertise and assumed leadership ability. Women are often perceived by hirers as ‘naturally’ less skilled in technical professions and less committed to competitive workplaces (Brooke, 2021; Rivera and Tilsik, 2016). In lower wage labour markets, there is also a racialised and gendered preference (Hong, 2016), such as for women from the Global South for sewing and embroidery based on these women's ostensibly ‘nimble fingers’ and the preference for Asian women for beauty work in the US (Kang, 2010). Different labour markets thus have different mechanisms of discrimination.
In offline labour markets, research has also shown how hiring discrimination can be managed through slight tweaks to the hiring process, for example, by having a blinded orchestra audition (Goldin and Rouse, 2000). On online platforms, scholars have used the conceptual tool of ‘affordances’ (Davis, 2020) to understand discrimination. Affordances emphasise how an object (i.e. platform) interfaces with its social environment to, at times, produce biased and discriminatory outcomes (Noble, 2018; Schwartz and Neff, 2019). OLMs are conducive for examining how labour market discrimination can be minimised. Platform owners can make platform design decisions that amplify or minimise different biases via the hiring platform. This relative ease of manipulating platform design is partly why OLMs have been framed as spaces where hiring discrimination may be minimised (Stringhi, 2022). An example of a platform design decision is curtailing users’ active decision-making in hiring someone in favour of algorithmic ‘matching’ (Wood and Lehdonvirta, 2021). For example, Uber does not let riders pick from a range of drivers (Levy and Barocas, 2017). Leaving matching to algorithms brings its own host of problems, wherein discrimination may be baked into the very algorithm (Jarrahi et al., 2021; Noble, 2018).
While prior research has successfully identified mechanisms of labour market discrimination, how hiring platforms can be designed to minimise discrimination has received less attention. We follow the lead of scholars seeking to identify and minimise labour market discrimination, by focusing on online freelancing (Correll, 2017; Levy and Barocas, 2017). Researching the functioning of discrimination in freelancing is crucial to ensuring that hiring discrimination is challenged as work moves increasingly online. Building on large-scale statistical analysis of OLMs and qualitative research on the experiences of freelancers (Hannák et al., 2017), we develop an innovative mixed-methods study to test which design interventions can help minimise biases and discrimination in freelancing work adjudicated via OLMs. We ask the following research questions:
Our methodology is motivated by design justice (Baumer, 2017; Costanza-Chock, 2020), drawing on a survey (
Our key contribution is to show design interventions that can, with relative ease, minimise discrimination in OLMs. The study furthers social science research to identify how and when bias and discrimination are most acute and when they can be adjusted to ensure that all freelancers gain access to freelance work through OLMs. This is important because OLMs are a source of income for millions of workers. OLMs also become critical as an income source during economic crises (International Labour Organisation, 2021). Our findings demonstrate evidence of a general preference for South Asian women freelancers, which increases under specific interventions. We find community composition and flairs to be especially useful for minimising discrimination in this study. Anonymity, previously seen as potentially curtailing discrimination in online and offline labour markets (Stefanelli and Lukac, 2020), does not appear to benefit freelancers in this study. Our findings contribute to research that, in addition to identifying mechanisms of labour market discrimination, also provides evidence on designing interventions to reduce discrimination.
Literature review
Mechanisms and conditions of affordances in OLMs
Algorithms and platforms are not neutral, objective entities that inevitably produce equal outcomes (Davis, 2020; Eubanks, 2018; Noble, 2018). Instead, technologies are social. Scholars studying how technologies interface with social environments emphasise how biases can be built into technologies producing discriminatory outcomes (Noble, 2018). Especially salient in this line of conceptualisation is the notion of ‘affordances’ which takes into account how, beyond the technical capacity of a design object, the use of an object depends substantially on its social environment and conditions (Davis, 2020). Moving beyond a dichotomy of what can or cannot be done with a technology, Davis (2020) develops the ‘mechanisms and conditions framework’. The
The mechanisms and conditions of affordances framework are useful for understanding how platforms shape labour practices in online work (Bellesia et al., 2023; Schwartz and Neff, 2019; Wood et al., 2019). In her study of the ride-hailing app Uber, Rosenblatt (2019) shows that while Uber promotes driving as characterised by freedom and independence, Uber's platform design disadvantages its workers through how it structures labour and pay. Through the affordances of
Like Rosenblatt (2019), Wood et al.'s (2019) study of online freelancing found that flexibility promised by platforms was the primary driver for freelancers seeking work through OLMs. However, freelancers had to employ algorithmic management techniques through curating platform-based ratings and reputation systems to maintain autonomy in their work by appealing to hirers. How platforms compute the scores, or ratings and rankings that a user has on a particular platform, are often opaque to users. Users thus deploy algorithmic management in varied ways. Some freelancers
Affordances have individual and collective implications for structuring interactions between different users. A mechanism that may appear neutral to one group may be discouraging or even hostile to another (Davis, 2020). Schwartz and Neff (2019) further the concept of affordances by arguing how affordances are gendered because they ‘enable different users to take different actions based on the gendered social and cultural repertories available to users and technology designers’ (Schwartz and Neff, 2019: p.2407). Stringhi (2022) illustrates how the affordances associated with controls over information flows and interactions between different categories of users extend gender affordances to freelancing platforms. To establish reputation and build trust, OLMs encourage the creation of personal profiles that feature first names, encourage an identifying profile picture, and even linking to external social media accounts. Stringhi (2022) shows that this mechanism of affordance triggers unintended consequences, such as facilitating racial discrimination and harassment of freelancers. This personalisation in OLMs emphasises the significance of the identity of the transacting parties, enhancing the probability for gender discrimination (Stringhi, 2022). Yet, technologies, and specifically platforms, can also be thoughtfully designed keeping in mind what kind of affordances each design choice renders possible.
Hiring discrimination in OLMs
Platform design features are key in shaping both the mechanisms and conditions of affordances in discrimination in OLMs. It is important to thus pinpoint precisely which features, when, and for whom, produce discrimination and how such discrimination may be mitigated. Research on discrimination in OLMs is essential, albeit currently nascent, given how OLMs are a vital source of labour for many (Stephany et al., 2021). The comprehensive research on discrimination in traditional labour markets informs us that ethnicity and gender are two status characteristics that tend to confer advantage on White men (Gaddis, 2015; Quadlin, 2018; Rivera and Tilcsik, 2016; Tak et al., 2019). White men benefit from being considered competent and being viewed as having leadership skills and ‘natural’ ability for some of the highest-paying positions. This translates into hiring, pay, and promotion advantages. In contrast, stereotypes about women, like their suitability for tasks viewed to be masculine, such as coding (Brooke, 2021) and Math, often disadvantage them, especially in elite labour markets (Quadlin, 2018; Rivera and Tilcsik, 2016). Similar mechanisms may be at play in OLMs. OLM employers overwhelming tend to be based in majority White and developed countries of the Global North such as the United States and the United Kingdom (UK), while the highly educated workforce of freelancers they hire from is substantially located in countries of the Global South, such as India (Stephany et al., 2021). This creates a space ripe for racialised and gendered stereotypes to be activated as White hirers make decisions about hiring from a racially diverse workforce.
The emerging literature on discrimination in OLMs (Galperin, 2021; Hangartner, Kopp and Siegenthaler, 2021; Pallais, 2014) finds that hiring biases advantage women from less developed countries (Chan and Wang, 2018). Biases advantaging women in OLMs tend to be concentrated in jobs that are feminine-typed, such as administration (Cerqua and Urwin, 2018). Women's disadvantage remains for high-value projects that pay more (Cerqua and Urwin, 2018). Women from the Global South have often been viewed as suited for detail-oriented tasks (Kang, 2010). Mechanisms leading to the preference for South Asian women in OLMs may be vested in similar stereotypes about women. Gendered occupational deviation appears to disadvantage men and women in OLMs. In a large-scale study of an online Swiss recruitment platform, Hangartner et al. (2021) track how much time recruiters spend on each profile. Although recruiters spend similar amounts of time on all profiles, they are less likely to contact women in male-dominated fields and men in female-dominated fields, as well as individuals from immigrant and minority ethnic groups. The mechanism for penalty for women in masculine-typed jobs appears to be that employers fall back on gender stereotypes in the absence of information, and these stereotypes reduce women's chances of getting hired (Hong, 2016).
Hirers’ decisions could be tethered to offline gendered biases, but another possibility is that hirers’ biases are reinforced by each platform. A common feature on OLMs is showing the ‘community’ of workers who have completed a particular task. A task could be reinforced as being masculine-typed if hirers are shown images of men who have previously completed this or a similar task. But this same assumption of a task as masculine-typed could be undermined if hirers are shown a more gender-balanced community of workers as previously completing this task. Indeed, employers appear to update their assumptions about workers as they gain familiarity with different features of a platform (Galperin, 2021). This indicates that OLM employers can be trained to hire in ways that minimise discrimination.
Intervening in discrimination in OLMs
Scholarship from the tradition of human–computer interaction has argued that platforms should challenge implicit processes of discrimination through
The information required for an online freelancing profile typically consists at a basic level of an image (or profile photo), username, average rating, number of reviews, location, and skills. Prior research indicates that reputation systems on platforms, based on ratings and reviews, are a crucial mechanism in making some freelancers more coveted than others on OLMs. Reputation systems replicate broader cultural prejudices despite ostensibly being founded in quantifiable approaches to fairness and justice (Hannák et al., 2017). Even the architecture of ratings is conducive to amplifying gendered discrimination (Rivera and Tilcsik, 2019). One study shows that when candidates were ranked on a scale of 1– 6 versus 1–10, the gender gap in evaluation widens substantially. In the 1–10 scale, the gender gap in the evaluation was significantly larger, with the mechanism being that participants were wary of ranking women a 10, which they viewed as indicating an overly generous score for women. Evaluators were far more comfortable scoring women at the highest level of 6 on a 1–6 scale because they perceived that this scale did not indicate an unduly inflated evaluation (Rivera and Tilcsik, 2019). Scores and rankings, so central in OLMs, are often a biased metric.
A mechanism of affordance is how platforms can allow businesses to highlight specific aspects of their identity. In 2021, both Instagram and Google added a feature that allowed businesses to add a ‘business identity attribute’ or flair, such as ‘black-owned’ or ‘woman-owned’. Research has found that labelling restaurants as Black-owned on Yelp.com increased customer engagement and the number of reviews left by White customers (Aneja et al., 2023). Similarly, Babar et al. (2023) found that the woman-owned identity signalling also increases business for profiles on Google Maps. Aneja et al. (2023) suggest that these positive effects are because consumers desire to compensate for the disadvantages and discrimination that women and ethnic minorities face. Research on identity-signalling flairs is restricted to reviews of offline businesses through review platforms and has not included freelancing work to date. While anonymity can protect freelancers from discrimination, there is also evidence that identity signalling too is beneficial.
Because prior research has pointed to how men in feminine-typed tasks and women in masculine-typed tasks are more likely to experience discrimination in OLMs in terms of being hired, it makes sense to pay attention to this aspect of hiring in OLMs through focusing on the community of workers. Moreover, given that the racial and gender background of an individual matters for hiring in both offline and OLMs, information that indicates this background (i.e. through photograph, name, or an explicit declaration, such as being a minority-owned or woman-owned business) requires attention as well. We draw on previous scholarship on the affordances approach, hiring discrimination, and design justice to first identify specific platform design features most conducive to amplifying hiring discrimination and then to manipulate these features to provide experimental evidence to minimise discrimination in OLMs.
Our key contribution is methodological: we create a mock platform that allows us to experimentally test different platform design choices. Scholars have typically not been privy to the design choices of real OLMs like Freelancer or Upwork. The algorithmic decisions made on those platforms remain a black box. Further, the valuable conceptual work on affordances and design justice is often not empirically tested. This arguably provides platforms with a justification for not pursuing such interventions. Because we created our own mock platform, we can cleanly manipulate the relevant design feature and separate out how and when hiring discrimination is amplified and how it can be minimised through selective and considered manipulation of each design feature. This study extends prior research on discrimination in OLMs which has tended to rely on survey and tracking data (Hangartner et al., 2021). Our study goes beyond describing that discrimination occurs to testing whether, and which, design features can be most effectively manipulated to minimise discrimination. This methodological contribution enables us to contribute to the growing scholarship on minimising labour market discrimination.
Methodology
Our research design consisted of a survey, interviews, and a discrete-choice experiment. Given RQ1, we designed a survey of UK freelancers to elicit their perceptions of hiring discriminations on OLMs. Next, we conducted interviews with a purposefully selected sub-sample of freelancers who participated in our survey. Through these interviews, we probed for platform design changes that freelancers think can minimise hiring discrimination. This aspect of our methodology, treating freelancers as ‘peer researchers’ who are involved in the platform design, is shaped by design justice (Davis, 2020) and bridges RQ1 and RQ2 by encouraging freelancers to draw on their experiences of discrimination to inform interventions. Our final step, to answer RQ2, was to conduct a discrete-choice experiment to understand which platform design choices can help minimise hiring discrimination.
Survey of freelancers
We developed a survey to collect freelancers’ experiences on how they experience discrimination. The survey was completed in July 2022 and yielded 884 valid responses. Respondents were reimbursed £7.50 for their time and to motivate detailed, free-text answers. We contacted freelancers by posting on social media forums for freelancers in the UK, specifically Facebook groups (i.e. Freelancers UK) and Reddit (r/freelanceuk). While non-probability sampling is often the gold standard in survey research, for comparison with existing research, we limited the sample to those who either lived in the UK or had British citizenship (Lehdonvirta et al., 2021). We focused on freelancers living in the UK to ensure internal validity as we are focusing on culturally shared notions of gender and ethnicity. However, the sampling frame does limit the generalisability of the findings outside of the UK. We determined that the sample characteristics were representative of the gender and ethnicity of the wider UK online freelancing population in terms of equivalent stratification identity categories. I 55% of the sample was women, 45% men, and <1% identified as non-binary or gender-queer. The mean age was 31 years. 93% of respondents disclosed their ethnicity; 53% were White, 30% identified as from Multiple Ethnic Groups, 6% were South Asian, and 4% were Black. There is likely an economic self-selection effect (Lehdonvirta et al., 2021), where the opportunity cost of participating in the survey is lower for lower-paid freelancers. A summary of the main themes of the survey is in Table 1.
Summary of the survey instrument themes and content.
Interviews with freelancers
At the close of the survey, freelancers indicated if they could be contacted for an interview. 39% of freelancers who completed the survey agreed to be contacted for interview. Given our research focus on discrimination in online freelancing, we prioritised interviewing freelancers across a wide range of those who felt they were not discriminated against at all in OLMs to those who felt they were. We created a metric of discrimination frequency by combining and re-coding the frequency of discriminatory experiences from the survey to a numeric scale. In our survey, this corresponded to the following item:

Density of discrimination from clients, colleagues, and freelancing platforms as reported in the survey.
We aimed for our interview sample to capture a variation in pay rates, as we discuss later in the results. We also paid attention to interviewing freelancers across salient dimensions of stratification, including gender, ethnicity, and variation in six occupation classes as identified by Kässi and Lehdonvirta (2018). There were three freelancers from each of the occupation classes: (a) Clerical and data entry, (b) Creative and multimedia, (c) Professional services, (d) Sales and marketing support, (e) Software development and technology, and (f) Writing and translation. These occupation classes categorise the breadth of online labour undertaken on freelancing platforms (Stephany et al., 2021). The sample consisted of 10 women and eight men. In terms of ethnicity: five identified as White, four as Black, three as East Asian, and six as South Asian.
We designed our interview protocol to gain deeper insight into what aspects of OLMs freelancers saw as most salient in their experiences of discrimination and what platform design features they saw as beneficial to minimising discrimination (see Table 2). The first author and a graduate research assistant conducted the interviews via online video. Interviews lasted approximately 30–40 min. Interviewees were reimbursed £30.00 each, paid via Amazon voucher. This sum was to appropriately acknowledge interviewees’ input and time towards the research goals. Our interview and survey data combined led us to prioritise the interventions in to test in our experiment.
Summary interview schedule with example questions.
Twelve interviewees highlighted the importance of being a native English speaker for being hired. Freelancers shared that so-called ‘proficiency of English’, regardless of spoken ability, was used to justify refusing work or firing freelancers from a project. Freelancers referred to English proficiency as a ‘proxy’ for racism. Given the prominence of this theme in the interviews, we included English language proficiency as a variable of interest in the study. In addition, interviewees also proposed that membership in a freelancing union could challenge discrimination through a potential dispute resolution process where (a) the prices of labour were standardised to avoid undercutting, and (b) freelancers and clients could be reported to a union for discrimination and face sanctions on the platform. We thus included ‘union membership’ as one of our five design interventions. Table 3 outlines the design-based interventions that were selected for testing. We include a brief description of each intervention and the literature that the strategy was developed from. When a specific strategy was suggested by freelancers independently in interviews, this is noted in the source column with ‘interview’.
Description and source of discrimination intervention strategies.
Experimental study design and mock platform
The experimental study was conducted from September to October 2022 through the standalone web application hosted on

Mock freelancing platform landing page.
Following the procedure for a-priori analysis laid out by Stefanelli & Lukac (2020), we determined that a sample of 1500 hirers was adequate to uncover an effect of at least a 3-percent difference in choice propensity (
Variables included in the research design.
The jobs that formed each trial and intervention in

Example structure of the mock freelancing platform choice page.

Structure and flow of the experimental research design.
The freelancer profiles consisted of an image, user name, average rating, number of reviews, English language proficiency, location (all UK-based), and list of skills displayed as text on each freelancer's profile that mirrored the skills in the advert. Hirers were expected to infer the gender and ethnicity of freelancers from the images and usernames in freelancer profiles, a common practice in computational research on big data (Hofstra and De Schipper, 2018). The usernames were selected from the 100 most popular names in the UK census, followed by an underscore and up to four randomly generated numbers to simulate unique usernames. We used AI-generated images for freelancers’ photos, selecting photos that appeared natural and were smiling to mimic typical freelancer profiles on OLMs. 3 We encoded the images to ensure no hirer saw the same freelancer more than once. For ratings, freelancers had low (1), medium (2.5), or high (5) scores that were depicted as stars out of a maximum of five stars. Accurately representing frequency as a categorical variable for the number of reviews was trickier. Instead, we populated the ratings with values that were within one standard deviation of the values 10 (Low), 50 (Medium), and 100 (High). Operationalising rating and reviews as categorical variables allowed us to compare freelancers, while retaining ecological validity by displaying numeric values to participants. The frequency of each category of ratings and reviews was matched proportionally across baseline and trials. The order in which freelancers were presented to hirers was not significant in post-hoc testing.
In addition to the attributes outlined in Table 4, we included an ‘anonymous’ freelancer profile. Our survey showed that freelancers may operate anonymously on freelancing platforms to avoid experiencing discrimination. This was coded as a dummy variable (0; Known attributes, 1; Anonymous), where a random known freelancer was compared with an anonymous profile. Both freelancers had identical skill sets that matched the job specifications displayed. The attributes in Table 4 reflect the components that make up each freelancer profile. Hierarchical Bayes (HB) was used to estimate the model parameters due to its ability to estimate individual-level parameters. Here, the model parameters are the part-worth utilities (shortened to ‘utilities’), interpreted as preferences towards specific attribute levels,
Our implementation of DCA contained four sections: (a) Reference trials 1—7, a hiring choice without intervention for a baseline, (b) Interventions 1—5 shown in Table 3, (c) Feedback on the website design in a free-text box, and (d) Socio-demographic questions. The implementation of the interventions is shown in Figure 4. The experiment presented participants with seven discrete-choice tasks without interventions in the reference trials. The reference trials were structured to replicate the choices generated for the interventions to allow for comparison. The flow of the experimental research design is shown in Figure 5.

Screenshots of example intervention strategies.
In the research design, we prioritised experimental orthogonality, ensuring that all specified parameters may be estimated independently of any other. The final model's variance inflation factor (VIF) scores indicated that the attributes were close to orthogonal, except for the Anonymous Dummy. A factional (or partial) research design was employed, using a representative subset of the complete set of possible combinations of attribute levels resulting in 144 freelancer profiles (of a potential 270), combined into choice sets of two as specified above. We used the polycor package to estimate the correlations between the factors and the VIF in Table 5.
Multicollinearity diagnostics (variation inflation factor).
VIF: variance inflation factor.
Results
Freelancers experiences of discrimination and payment
Addressing RQ1, the survey examined if payment and experiences with freelancing platforms vary by gender and ethnicity. The mean day rate for freelancers in the survey was £154.20. There was no significant gender or ethnicity difference in the day rate for the freelancers who responded to the survey (

Freelancer day rate by gender and ethnicity from survey.
Our survey also looked at discrimination based on payment that might not be directly represented in the day rate of freelancers. We asked freelancers if they ever (a) had a discount requested, (b) had a discount of more than 50% requested, or (c) been asked to work on a project for free. This was broken down by ethnicity and gender through analysis of covariance (ANCOVA). The results of the ANCOVA for gender and ethnicity for requesting discounted labour are displayed in Table 6.
Frequency of requested discount by gender and ethnicity.
Table 6 shows that while women are equally likely as men to have a discount requested, they are significantly more likely than men to be asked to do the work for free for ‘experience’. Freelancers who are White or Black were significantly less likely to be asked for a discount. Post-hoc testing with Tukey HSD revealed significant differences in having any discount requested between White and Multiple Ethnicity freelancers (
Figure 7 shows the frequency of specific experiences of discrimination. Only 20% of freelancers reported never having any of the negative experiences detailed in the survey; 80% of freelancers surveyed had reported at least one instance of discrimination in the last year. Of those who had negative experiences, 76% of survey respondents attributed these experiences to their status characteristics. Our survey indicated that most freelancers turned to the relevant freelancing platform for assistance when they experienced discrimination.

Distribution of specific experiences of platform discrimination from survey.
Baseline aggregated preferences towards hiring freelancers
We sought to categorise intersectional hiring discrimination in OLMs, adding depth to RQ1. Using the HB method, we assessed the utility parameters for each hirer in the sample, as well as the entire sample. Table 7 shows the average part-worth utilities for each level of each attribute and the importance scores for each of the attributes. Part-worths explain the contribution of the levels of each attribute to the overall utility, allowing one to understand how changing specific characteristics of the freelancer affects hirers’ preferences for a particular hiring choice. Higher part-worth utility values indicate a greater preference for hiring a freelancer with that particular attribute. The part-worth utilities of each attribute are scaled to sum to 1, so they should be considered as relative values within the experimental setting. Figure 8 includes a non-scaled plot of the part-worths of each attribute.

Baseline attribute part-worths in hiring mock freelancer.
Importance of each freelancer attribute in the decision to hire (baseline attribute part-worth).
Table 7 shows that the average rating had the most significant influence on hirers’ preferences, with a relative importance of 41%. The number of reviews was the second most important attribute, responsible for 32% of the hiring decision. As expected, hirers preferred freelancers with the highest average rating and number of reviews. Notably, there is a greater difference between a medium and high number of reviews than between a medium and low number of reviews. There is thus a relative advantage for a high number of reviews compared to a medium than a medium number of reviews compared to low. Freelancer ethnicity (as a combined attribute) was reasonably salient, contributing 14% of the choice to hire. The preferred freelancer is South Asian. Hirers showed the least preference for freelancers who are East Asian. We find a preference for native English speakers over fluent speakers, which seems reasonable given the experiences freelancers cited in our interviews. Hirers in this study generally preferred women, and freelancer gender explained 5% of the overall hiring decision, as shown in Table 7. This supports previous research (i.e. Chan and Wang, 2018), which also suggest that in contrast to hiring in offline labour markets, hiring on OLMs favours women.
We excluded comparisons with anonymous freelancers who had no discernible gender or ethnicity visible on their profile. The survey found that 76% of freelancers conduct business under their given name, with 51% using an image of themselves and 48% also using a logo. The figure below indicates how anonymity compared to the other possible combinations of gender and ethnicity. Anonymity generally disadvantages freelancers. Figure 9 shows that compared to being anonymous, a freelancer who shows their gender is preferred by hirers. Anonymity only benefits freelancers who are East Asian men. This aligns with several interviewees reflecting that they may be hired from an anonymous profile but be removed from a project when they appeared in a video call.

Baseline gender and ethnicity part-worths, including anonymity.
Finally, we examined the selected platform-based interventions (Table 3) that affect hiring decisions and discrimination in OLMs (RQ2). Table 8 demonstrates a comparison of the utility of each attribute under each intervention with the baseline model. The baseline and intervention models all passed significant testing measures. Across all interventions, hirers preferred a high number of reviews and average rating. A-priori power analysis indicated sufficient hirers under each condition.
Comparison of attribute part-worth under each intervention strategy.
Under the community condition, English language proficiency (20%) and ethnicity (17%) increase in importance for hirers’ decision-making. While the average rating and number of reviews are relatively more important factors, their influence decreased by close to 8%. Under this intervention, there is a preference for Black freelancers, with East Asian, South Asian, and White freelancers negatively affected. In this intervention, hirers were shown a community of ‘
Under the flairs condition (i.e. ‘Woman Owned Business’), ethnicity (35%) and gender (34%) are the most important attributes, followed by the average rating (19%), number of reviews (9%), and English language proficiency (1%). For ethnicity, hirers strongly preferred South Asian (18%) followed by East Asian (11%) freelancers. For gender, the preference for women rose to 18%, a significant increase of 17% from the baseline model. Inclusivity Flairs increased the preference for hiring women and countered the bias against East Asian freelancers. Flairs present a potential intervention for design justice in OLMs.
In our text reviews condition, we examine how text-based reviews affect attribute preferences in hiring decisions compared to the baseline model. This intervention only allows a limited comparison with the intervention model, as the two attributes – number of reviews and average rating – are replaced with equivalent text reviews. However, the model has interesting findings. Under the text reviews condition, ethnicity is the strongest indicator of preference, followed by gender and English language proficiency. Under the text reviews condition, hirers indicated a preference for South Asian freelancers, followed by Black freelancers. White freelancers were the least preferred. Women remained preferred. Without a quantitative basis for comparison, ethnicity appears to become the strongest indicator of freelancer hiring.
Under the ‘union membership’ intervention, where freelancers’ profiles indicated that they were members of a Freelancers’ Union, average rating, and number of reviews (31%) were equally important for hirer's preferences. Ethnicity was also important, responsible for 17% of the hirer's decision. There was a preference for East Asian freelancers. Notably, there was also an increased preference for White freelancers by 5%. Despite the over-representation of White hirers in our sample, an increase in preference is noteworthy because it demonstrates a shift from the reference model; 16% of the preference can be attributed to English Language Proficiency. The shift in preferences for freelancers could be due to an assumed decrease in ‘undercutting’ prices that freelancers in the interviews said they would benefit from, with the Union assisting with standardising rates. The Union intervention upends the bias experienced by East Asian freelancers, compared with the baseline hiring model.
Under the discrimination prompt, which asked users if they would like to review their hiring decision, the number of reviews remained most important (29%), followed by ethnicity (27%). There was an increased preference for South Asian freelancers, with a bias against East Asian and White freelancers. There remained a preference for women, with gender responsible for 22% of the choice. Actively challenging potentially discriminatory behaviour may increase the importance of ethnicity in hiring.
Design intervention techniques are most effective at increasing the importance of ethnicity and gender when used as tools for positive discrimination or
Discussion and conclusions
Affordances are critical in shaping how technologies can be best marshalled to minimise discrimination that often characterises social and economic life. Prior studies have identified how design features of platforms shape the affordances of those platforms (Schwartz and Neff, 2019; Stringhi, 2022), including in how platforms impact worker autonomy (Wood et al., 2019; Bellesia et al., 2013). We have extended this line of research to manipulate platform design features precisely to better align design justice principles to practise. The mechanisms of affordance that we focused on were designed to affect what Davis (2020: p.65) defines as the ‘condition of cultural and institutional legitimacy’ or how larger social-structure norms inform human-technology relations. By default, technologies tend to reproduce existing power structures. To subvert these structures, we collaborated with freelancers to develop design interventions (affordance
As our initial survey of freelancers illustrated, the norm for freelancers is to operate on OLMs under their personal identity. The rating and review systems of OLMs are typically assumed to be a neutral mechanism conveying a freelancer's ability and quality of previous work to a potential hirer. Indeed, our experiment found that a freelancer rating and number of reviews were responsible for the majority (72%) of the decision to hire in the baseline. Prior studies have noted the importance of reputation in OLMs, as ascertained by numerical ratings and especially by text reviews (Pallais, 2014). In the absence of other information, reputation appears to stand in as a testament to the skills and reliability of a freelancer. With the inclusion of text reviews as a mechanism for intervention, White freelancers were least preferred. Hannak et al. (2017) found that freelancers who were not White generally received less text reviews on OLMs. We find that that an equal volume of text reviews between White and non-White freelancers benefits the latter. In the context of our experiment, it is possible that ethnicity may simply be far more visible in the choices that hirers make. Hirers may thus be correcting for racial biases they see themselves as having, with text reviews providing the appropriate mechanism of affordance where hirers feel they can confidently ascertain a freelancer's skill and suitability, enabling hirers to make bias-correcting decisions in a less risky context. Given rating inflation in OLMs, it is likely that the number of reviews and ethnicity of freelancer will influence hiring preferences. Hannák et al. (2017) find that perceived ethnicity is significantly correlated with ratings and text evaluations. In one U.S.-based study of TaskRabbit and Fiverr, researchers found that Black and (East) Asian workers received lower ratings and more negative adjectives. However, the ethnic group that was ranked lowest actually shifted based on geographic location within the US (Hannák et al., 2017). Hirers do not treat ‘non-White’ groups as an undistinguishable monolith, but have their own hierarchies. In our UK-based study, our findings suggest that South Asian freelancers benefit from rating inflation, and possibly at a cost to East Asian freelancers, since even text reviews do not seem to benefit East Asian freelancers.
Scholars have suggested that being anonymous on OLMs could mitigate race and gender biases, for instance, in terms of preferring women for lower-value projects or minimising the disadvantage for East Asian freelancers. Unlike offline labour markets, it is ostensibly easier for freelancers to be anonymous on platforms if they choose, for instance, to forgo a profile picture or share a handle in place of their name. We found that anonymity was generally undesirable for freelancers, only increasing the likelihood of being hired for East Asian freelancers, who otherwise tend to be least preferred for hiring. This somewhat counter-intuitive finding may be because of a norm in OLMs to share considerable information to humanise oneself on the platforms. Anonymous profiles appear to go against the cultural and institutional condition of affordances (Davis, 2020), with OLMs functioning as spaces that expect some level of information about freelancers. This may come down to trustworthiness, which hirers may be searching for as they quickly examine freelancer profiles. It could also be an extension of the expectation in offline labour markets to establish a sense of ‘chemistry’ with employers by emphasising one's personal attributes, and the cultural expectations of bringing passion to one's work (Rao and Tobias Neely, 2019; Sharone, 2013). The contemporary cultural context of work appears to discourage anonymity, even on OLMs. That said, some platforms operate on algorithmic matching systems, such as Uber, where drivers appear indistinct as do riders. Why anonymity works for some in OLMs, but not others is an avenue of further inquiry that may be fruitfully pursued through future scholarship.
Flairs, such as a decorative logo denoting a business as ‘minority-owned’, are also crucial in minimising discrimination in OLMs. Flairs lead hirers in our study to choose women and non-White freelancers. Given the disadvantage that East Asian freelancers (especially East Asian men) encounter, it is possible that an appropriately designed flair could mitigate these biases within OLMs. Our data are not suitable for explaining the mechanisms through which flairs counter prejudices against East Asians. One mechanism could be that in the context of similar choices, flairs serve as a low-cost but feel-good decision wherein hirers may feel that they are behaving beneficently by selecting the freelancer in question.
Our study has explored how interventions for design justice can impact hiring decisions. In offline studies, discrimination against women is especially rife in elite labour markets where decision-makers’ biases about women's suitability for particular occupations are foregrounded (Rivera and Tilcsik, 2016). Freelancers are often better educated and come from a more privileged background than the average population. Yet the jobs they are typically vying for through OLMs are not elite jobs with expectations of stability, benefits, and even some semblance of security. The projects offered on OLMs, even for highly educated workers, are usually time-limited. The characteristics of the job in question may thus explain the differences in mechanisms of discrimination. The duration of many of these jobs and other such aspects may lead hirers to privilege women, especially Asian women, who are often stereotyped as detail-oriented, diligent, and good at time and organisational management (Leung, 2018).
Based on our findings, we offer the following recommendations for platform designers and researchers:
Guided by the conceptual framework of affordances and scholarship on design justice, our methodological contribution is creating a mock OLM that allows us to experimentally test platform design interventions. Such experimentation has not been possible in prior studies because typically platform owners, for example Upwork, Freelancer, TaskRabbit and so on, tend to protect details about the inner functioning of their platforms. Scholars have repeatedly noted the lack of transparency of how platform algorithms are produced (Baumer, 2017; Jarrahi et al., 2021; Noble, 2018). Our research design allows us to innovatively overcome this challenge. However, there are limitations to this study. Most importantly, in an experimental procedure the results occur in a tightly controlled research setting. We are thus cautious about any claims to generalisability. For example, our study modelled the choice between hiring one of two freelancers and limiting the information available to hirers. While choosing between two freelancers does not represent OLMs such as Upwork.com, it allows us to model preferential decisions in a hiring context. Leung (2021) highlights how competition in freelancing can cause a ‘race to the bottom’ where a platform like Upwork can downgrade the rate of pay. There appears to be an assumption that South and East Asian workers will inevitably accept fees lower than their White counterparts for the same work, coupled with a disadvantage based on assumptions of being non-native English speakers (Leung, 2021). Our research provides statistical evidence to support these assertions. Previous scholarship also suggests that stereotypes influence hiring decisions in freelancing, with hirers preferring workers whom they can pay a lower rate to complete the task (Galperin, 2021). As this study did not include price, our findings cannot account for these theories explicitly.
Future research should model if interventions can affect the price a freelancer is awarded for a project. While our data do not allow us to infer the mechanisms for this preference, other studies have suggested that gendered assumptions may lead to these observed outcomes (Cerqua and Urwin, 2018; Galperin, 2021; Hong, 2016). Our study could be expanded upon in the future by testing if South and East Asian freelancers may be preferentially hired due to assumed lower rates. Given the significant effects of community, flairs, and text interventions, our study indicates that more research on user-interface is warranted. For instance, researchers should examine how hirers interact with filtering mechanisms on OLMs to find freelancers and if identity-signalling flairs could assist with designing justice on OLMs. Future research should complement investigations of algorithmic structures with a critical analysis of how platform design communicates affordance to its users. Our study can additionally be expanded upon with a natural experiment, such as that conducted by Hangartner et al. (2021), to ascertain if the findings are repeated in a natural setting. How gender and ethnicity biases are reflected in these metrics and in freelancers’ payment for their work will be a fruitful avenue for future research. This study provides experimental evidence of paths forward that can enable OLMs to function in a more equitable manner.
Footnotes
Acknowledgements
The authors thank the interview participants for their willingness to speak with us and for providing critical insights. The authors would additionally like to recognise the editor and anonymous reviewers for their contributions to this work. We would also like to acknowledge Fabian Stephany, Julian Albert, Vili Lehdonvirta, Martin Lukac for their valuable feedback to this study. We thank Nayana Prakash for her work as Research Assistant on this project.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the British Academy/Leverhulme Small Research Grant (SRG21\210549) and The Suntory and Toyota International Centres for Economics and Related Disciplines.
