Sage Journals: Discover world-class research

Abstract

This article analyzes the relative age effect (RAE) in German elite youth soccer. We hypothesize that, to get selected by elite youth academies, players with relative age disadvantages must be relatively more talented; especially at the margin of getting selected. Using data on 2,383 former elite youth players and their later market values, we show that the RAE was substantial in German elite youth soccer between 2000 and 2020. Moreover, in the sample of former elite youth academy players, those with relative age disadvantages reach significantly higher market values. Our results indicate that relative age disadvantages of elite academy players are positively correlated with their unobserved ability; suggesting that, on average, the RAE results in a loss of talent—and market value. We find that clubs could generate 30.6 to 72.8% higher market values without the RAE.

Keywords

relative age effect in elite youth soccer selection,market values unobserved ability misallocation of talent

Introduction

In talent selection, it is crucial to distinguish between current and potential performance levels. The person who is the best right now need not be the same as the person who will be the best in five years. Evaluating a heterogeneous talent pool based on current performance only can therefore have negative effects in the long run. In soccer, for instance, many young athletes who are considered elite today will no longer be elite tomorrow. These young athletes will be called fading shooting stars in this article. Fading shooting stars, in this sense, shine bright today but will never appear on the sky of professional soccer.

It would, of course, require clairvoyant abilities to predict which soccer talent will eventually make it to the professional stage. Performance development in soccer as well as in most other settings is too complex and ambiguous to make exact predictions. Yet, there is also a systematic reason for the abundance of fading shooting stars, which can easily be identified and targeted: the relative age effect (RAE). In youth soccer, athletes are grouped by years of birth in most countries. This creates an arbitrary age cut-off. Consequently, adolescents born in January are almost one year older than their December born peers. When playing in the same team, players born in January have a relative age advantage. They are relatively faster, stronger, more mature, and therefore momentarily better athletes on average. Under the ‘‘non-astrology” (Allen & Barnsley, 1993) assumption that ability is uncorrelated with birth dates, observing elite youth academies to select more relatively older players would mean that clubs focus too much on current performance rather than the potential performance level of players. The resulting over-representation of relatively older players likely leads to a waste of talent and resources.

A large literature documents the existence of the RAE, drawing on data from several countries, soccer teams, and time periods (e.g., Ashworth & Heyndels, 2007; Barnsley et al., 1985; Cobley et al., 2008; Jackson & Comber, 2020; Mujika et al., 2009; Musch & Grondin, 2001; Musch & Hay, 1999; Pérez-González et al., 2021; Tribolet et al., 2019). Many publications also provide recommendations on how to mitigate the RAE in talent selection processes (Cumming et al., 2018; Lagestad et al., 2018; Mann & van Ginneken, 2017; Martindale et al., 2005; Roberts et al., 2021). Yet, little has changed: Even though the RAE in professional soccer is known since 35 years, it is still very prevalent and, overall, even intensified over time (Sierra-Díaz et al., 2017). Roberts et al. (2021) thus argue that researchers need to consider new approaches to target the RAE in professional soccer to better understand the phenomenon and quantify its consequences.

Our article investigates the RAE in German elite youth soccer. We use data on 2,383 former youth players of the 17 most successful German Bundesliga Youth Academies (BYA) and their market values in the period between 2002 and 2020. Our article contributes to the literature in four dimensions. First, despite the attempts to mitigate the problem, we show that the RAE is still prevalent in German BYAs. Second, we introduce a new theoretical model of a player’s performance development over time, which facilitates the understanding and analysis of the RAE, and allows us to derive testable hypotheses. Third, using econometric methods and novel data on former German BYA players, we test the implications of the RAE for talent selection and the distribution of unobserved ability in elite youth academies. Fourth, using data on players’ market values, we aim to quantify the costs caused by the RAE in professional German soccer, which, in today’s highly capitalized soccer, could be a strong argument for changing talent selection practices.

Specifically, we derive from our theoretical model the hypothesis that, among all players selected into BYAs, players with relative age disadvantages have on average higher unobserved ability than their relatively older peers. This builds on the simple observation that relatively younger players must compensate the disadvantages caused by their relative age with more ability to still get selected. We further hypothesize that this effect is particularly pronounced among those players who, at the point of selection, just met the threshold requirements.

Our first finding is that the RAE in BYAs is both substantial and persistent. 71.5% of former Under-19 (U19) BYA players were born in the first half and 44.6% in the first quarter of the year. Moreover, the RAE has even increased slightly in the last two decades. Second, we find that, elite youth players that were born towards the end of the year, in fact, reach significantly higher market values on average. Using an instrumental variable approach, we further show that, at the margin of getting selected, relative age disadvantages are positively correlated with (unobserved) ability. This empirical result supports our theoretical hypotheses and rationalizes also why we find a positive correlation between relative age disadvantages and market values on average in our sample of former elite youth players. Our third finding is that the RAE is very expensive for BYAs: we estimate that Bundesliga clubs could generate 30.6 to 72.8% higher market values through their BYAs when eliminating the RAE in talent selection. This result can be considered as rather conservative as we only model the costs of bad selection related to the RAE. Relative maturity differences during adolescence presumably cause additional costs.

The findings and mechanisms we describe are also relevant for talent identification, development, and recruitment outside of sports. Various studies from different fields show that initial differences in (relative) performance have significant consequences on selection outcomes and achievement, and that eliminating structural biases in recruitment comes with sizeable (economic) gains (e.g., Balboni et al., 2022; Cullen et al., 2006; Dustmann et al., 2016; Friebel et al., 2019; Hanushek & Rivkin, 2009; Hsieh et al., 2019; Murphy & Weinhardt, 2020). The world of soccer lends itself, in particular, to the analysis of how initial performance differences affect selection and individual careers in the short and long run because of the excellent data available.¹

The article is organized as follows. Section “Related Literature” discusses the related literature on the existence and consequences of the RAE. Section “Theoretical Framework” proposes a model of player’s performance development, which allows to illustrate the mechanisms involved in the RAE as well as derive hypotheses for the empirical analysis. Section “Empirical Setting” describes the institutional setting in Germany and our data. Section “Empirical Results” presents our empirical analysis and results. Finally, Section “Conclusion” discusses the implications of our analysis and concludes.

Related Literature

The Relative Age Effect in Soccer

The existence of the RAE in sports was for the first time shown by Barnsley et al. (1985), who report skewed birth date distributions in Canadian youth ice hockey. In the 1990s, first soccer-related RAE studies were published. Musch and Hay (1999), for example, find evidence for strong RAEs in professional soccer across several countries including Germany. Decades of research have produced a large body of evidence on the RAE. Yet, the RAE has continued to exist in both youth and professional soccer. Therefore, Roberts et al. (2021) see the need to identify new data capture techniques and more sensitive measures of the RAE to foster a deeper understanding of the effect and its consequences.

While Allen and Barnsley (1993) outline a basic model, the only formalized model of the RAE in sports so far is developed by Pierson et al. (2014), who model the RAE as a reinforcing feedback loop and apply it to Canadian youth hockey. Moreover, Dawid and Muehlheusser (2015) present a dynamic model of repeated talent selection with heterogeneity in ability and relative age, which can also be applied to sports. Besides that, most publications have only relied on descriptive statistics so far.

Cobley et al. (2008) track the RAE in professional German soccer from 1963 to 2007. Using $χ^{2}$ tests, they show that the RAE grew consistently and progressively within the period examined. The proportion of players born in the first half of the year is a very popular estimate for the RAE. Referring to the review by Musch and Grondin (2001), early studies on elite youth soccer players in the UK and Sweden found that the proportion of players born in the first half of the year amounts to between 62 and 87%. Based on the same descriptive measure, more recent studies on elite youth academies report the following figures: 85.9% for U9 British Premier League players (Jackson & Comber, 2020), 65.4% for Australian U19 elite soccer academies (Tribolet et al., 2019), 75.2% for the AC Bilbao elite youth (Mujika et al., 2009), and 65.6% for international youth championships between 2017 and 2019 (Pérez-González et al., 2021). Overall, age groups examined and measures used differ largely across studies, while the results are unequivocal: the RAE still exists in elite (youth) soccer teams.

Approaches to relate the RAE to players’ monetary valuations, have not yet yielded conclusive results. Pérez-González et al. (2020), for instance, analyze players of ten highly successful European soccer clubs and show that players’ market values are not significantly correlated with their relative age. Moreover, Fumarco and Rossi (2018) show that professional soccer players born in the last quarter of the year earn significantly lower wages than players born in the first quarter. However, with similar statistical precision, they find that players born in the third quarter of the year earn substantially more than those born in the first quarter. Ashworth and Heyndels (2007) use data from professional soccer players in the German Bundesliga for the seasons 1997/1998 and 1998/1999. Based on estimated gross wages, they find that the late-born players in a cohort earn higher wages. Hence, within the sample of professional soccer players—those shooting stars that made it to the professional stage—the relation between players’ monetary valuation and relative age is not unambiguously clear.

Production Function of Elite Youth Academies and the Optimal Selection Policy

The existence and implications of the RAE in German elite youth soccer highly depend on the production function of elite youth academies; in other words, on how elite youth academies employ different kinds of training and selection strategies to optimally exhaust the talent pool. Dawid and Muehlheusser (2015) show that, when initial relative age advantages are strong, clubs can maximize the quality of the talent pool in the long term if they initially resist the temptation to select players based on momentary performance signals.³ In other words, scarce training resources are misallocated if clubs always select the momentarily best despite strong relative age advantages.

While Dawid and Muehlheusser (2015) assume that “planners” want to maximize the average talent level in a given population at the end of the training process, which we will call the average shooting star strategy, it could be possible that soccer clubs have different objectives and thus a different production function. For example, clubs could consider it most effective to focus on the performance development of a small subgroup of 3 to 5 very promising players, which we will denote as the top shooting star strategy. To support the few top shooting stars optimally, clubs might surround them at every given stage with the currently best players available which tend to be relatively older and more mature on average. This strategy of largely utilizing the RAE might even be necessary to retain and attract the best. To give a better-informed assessment of the production function of the elite youth academies, we briefly summarize the relevant literature.

In terms of short-term success, it is optimal to fully follow the average shooting star strategy. Grossmann and Lames (2013) show that youth clubs can increase their momentary competitiveness by exploiting the RAE. As the RAE tends to be more pronounced in elite youth leagues (Del Campo et al., 2010; Jackson & Comber, 2020) and in clubs which are regarded as successful and have an excellent reputation (Jiménez & Pain, 2008), elite youth clubs indeed show a preference for short-term success and momentary competitiveness. Moreover, Jiménez and Pain (2008) argue that the first aim of clubs is to be successful at all stages instead of promoting the greatest talents and taking a long-term perspective. This short-term orientation is further intensified by coaches’ incentives who perceive pressure to select players based on short-term goals (Hill & Sotiriadou, 2016; Roach, 2022). While these findings do not necessarily contradict the top shooting star strategy which also largely relies on the utilization of the RAE, it is apparent that talent development does not just follow a long-term plan but is subject to many short-term constraints.

Furthermore, the top shooting star strategy requires that elite youth academies are able to identify top talents already at early stages of selection and that the selection of these top shooting stars is independent of the RAE. Both are rather strong assumptions. The RAE, in fact, is still significant in adult elite leagues (see Sierra-Díaz et al., 2017 and Figure C3 in Appendix C) which indicates the inability of elite youth academies to identify their top players independently of the RAE. Although we cannot fully dismiss the top shooting star strategy, in this article, we will assume that elite youth academies cannot identify the most promising talents at early stages of selection but, being subject to short-term constraints, primarily aim at maximizing the average talent level.

Stylized Facts on the RAE and Performance Development

Before developing our model, we present stylized facts from the literature on the RAE and youth players’ performance development. A model that is faithful to the evidence must recognize these empirical findings. First, relative age and maturity advantages are generally beneficial in soccer (Lovell et al., 2015; Malina et al., 2007, 2000; Rommers et al., 2019; Rösch et al., 2000; Votteler & Höner, 2014). Second, relative maturity differences can be substantial during adolescence, are greatest around the age of 13 and decline afterwards (Malina et al., 2004; Walker, 2016). Third, the RAE in elite youth soccer follows this maturity pattern, increasing initially and peaking around the age of 13 to 15. Yet, the RAE does not disappear eventually but remains significant even at the professional level (Cobley et al., 2008; Patel et al., 2019; Pierson et al., 2014; Sierra-Díaz et al., 2017). Fourth, initial age and maturity advantages likely lead to a path dependency due to access to better training and other factors such as players’ increased self-confidence, parents’ behavior, and coaches’ perceptions (Musch & Grondin, 2001; Pierson et al., 2014). Fifth, as discussed above, the RAE is more pronounced in elite leagues and youth clubs can increase their momentary competitiveness by exploiting the RAE (Del Campo et al., 2010; Grossmann & Lames, 2013; Jackson & Comber, 2020; Jiménez & Pain, 2008).

From these stylized facts, it is also apparent that the RAE is complemented by a relative maturity effect (RME), that is, differences in maturation status which are independent from relative age (see Malina et al., 2000). Hence, analyzing the impact of only the RAE (and not the RME) on talent allocation will most likely lead to conservative results when it comes to skewed talent selection and misallocation of talent.

Theoretical Framework

Basic Setup

The simple theoretical model introduced in this section aims to illustrate the problems caused by the RAE. The model goes beyond the previous analysis of ability, relative age, selection, training, and monetary valuation in soccer (e.g., Ashworth & Heyndels, 2007) and will serve as the basis for deriving our hypotheses. Let $P_{i}$ denote player $i$ ’s realized performance level as a function of time $t$ , which is measured in years and refers to his age. $P_{i}$ aims to represent a player’s ability, exercise, and routine as well as soccer specific attitudes (e.g., tactical sense) and physical characteristics (e.g., fitness, height, and speed)—in short, everything that determines how good a player is (see Reilly et al., 2000). As we focus on the earlier stages of a player’s career from childhood to the professional age, we rely on a logistic growth function. Here, players’ realized performance levels increase with age. This approach allows to incorporate heterogeneity in ability, training, and relative age, but, owing to simplicity, misses to represent the decline in performance at later stages of the career.⁴ Finally, $P_{i}^{⋆}$ captures player $i$ ’s maximum performance level, which is the performance level that the logistic growth function will eventually converge to. Player $i$ ’s performance level in period $t$ is $P_{i} (t) = \frac{P_{i}^{⋆}}{1 + (P_{i}^{⋆} - 1) \times e x p (- t)}$ , which implies $lim_{t \to \infty} P_{i} (t) = P_{i}^{⋆}$ .

Let $m_{i}$ denote player $i$ ’s month of birth which shifts the performance development function to the right according to the player’s relative age. The development of a player born in December starts 11/12 of a year later than the development of his peers born in January of the same year. The starting point of the performance development function is, thus, defined by player $i$ ’s birth month. This yields

P_{i} (t) = \frac{P_{i}^{⋆}}{1 + (P_{i}^{⋆} - 1) \times e x p (- t - \frac{m_{i}}{12})} .

(1)

Figure 1 plots the performance development functions of 4 distinct players born in the same year. Two of these players were born in January (red solid lines) and two in December (blue dashed lines). Moreover, per birth month, one player is relatively talented (higher

P_{i}^{⋆}

) and the other player is relatively untalented (lower

P_{i}^{⋆}

). The performance development function of players of the same talent level is specified equivalently apart from the fact that they are shifted according to the respective birth month. Figure 1 furthermore indicates two points of selection,

A

and

B

, where elite youth academies choose a certain number of adolescents. The plotted performance development functions show, first, that players’ realized performance levels increase with age. The slope of the function initially increases and eventually decreases, which perfectly represents the fact that male adolescent height velocity peaks around the age of 13 (Walker, 2016), so that marginal maturity and performance levels around that age are greatest (Malina et al., 2004). Second, Figure 1 illustrates that, among players with the same

P_{i}^{⋆}

, the performance level of the December-born player is always temporally behind the performance level of his January-born peer. This reflects the stylized fact that relative age advantages are generally beneficial in soccer. Finally, when players approach professional age, relative age differences lose their significance.

Figure 1.

Simple performance development model with two different birth months and two different talent levels.

For illustration, assume that only two players can get selected by an elite youth academy at selection point $A$ . If the selection is based on current performance levels, both January-born players are chosen. It is obvious that this is not the best choice from a long-run perspective. One might, however, argue that the academy could still pick both players with relative higher ability at selection point B and end up with the two most talented players. The next subsection, however, shows why the RAE might still continue to affect selection decisions.

The Effect of ‘Superior’ Elite Academy Training

So far, the model did not incorporate the training effect of soccer elite academies relative to other youth clubs. We assume that elite youth academies indeed offer superior training—the treatment—and let the maximum performance level of player $i$ , $P_{i d_{i}}^{⋆}$ , depend on treatment $d_{i} = {0, 1}$ .⁵ We make the established assumption that training and ability are complements in the sense that the former is more effective for individuals with higher potential (see Cunha & Heckman, 2007; Dawid & Muehlheusser, 2015). After the point of selection $s$ , player $i$ ’s maximum performance level depends on whether he receives elite youth academy training ( $d_{i} = 1$ ) or not ( $d_{i} = 0$ ): $P_{i 1}^{⋆} > P_{i 0}^{⋆}$ . Player $i$ ’s realized performance level as a function of time $t$ , thus, reads:

for t \leq s : P_{i} (t) = \frac{P_{i}^{⋆}}{1 + (P_{i}^{⋆} - 1) \times e x p (- t - \frac{m_{i}}{12})},

(2)

for t > s : P_{i d_{i}} (t) = \frac{P_{i d_{i}}^{⋆}}{1 + (P_{i d_{i}}^{⋆} - 1) \times e x p (- t - \frac{m_{i}}{12})} .

(3)

The expansion of the maximum performance level through elite training is illustrated in Figure 2, which shows a relatively untalented January-born (red) and a relatively talented December-born (blue). Out of mathematical ease, the BYA training effect is modelled as an immediate jump to a higher performance development curve after selection.⁶

Figure 2.

Performance development model: Illustrating fading and late blooming shooting stars.

The lighter red and blue lines represent possible examples of the counterfactuals. Selection of the relatively untalented January-born player lifts his performance level after selection point A, so that even at selection point B it remains higher. This visualizes the main problem caused by the RAE: Although, at selection point A, the relatively talented December-born was currently worse than the relatively untalented January-born, the long-term return of selecting the December-born is much higher. At selection point B, the counterfactual performance of the relatively talented and treated December-born is higher than the performance of the relatively untalented and treated January-born. Eventually, the maximum performance level of the relatively talented and treated player exceeds the maximum performance level of the relatively untalented and treated considerably. Meanwhile the performance curve of the relatively untalented January-born presents the case of fading shooting stars vividly: Shining lightly in early selection rounds due to their relative age advantage, they eventually fade before entering the professional soccer stage. In line with existing evidence on the RAE, the model illustrates how the RAE remains even when maturity differences vanish, in particular, in a highly competitive environment. Based on this, we derive our first hypothesis:

Hypothesis 1 (RAE Existence and Path Dependency):

Given (i) selection cut-offs during performance development, (ii) relative age-based performance differences, and (iii) positive effects of elite youth academy training, the RAE occurs in a competitive environment and is sustained even when relative age differences fade.

As described in Section “The Relative Age Effect in Soccer”, the existence of the RAE in professional youth sports is a well-established result, as is its persistence when relative age differences fade. While Hypothesis 1 can therefore almost be considered a stylized fact, it also provides a mechanism behind the RAE which can be related to other fields.

Marginally Selected Players and Ability

Our theoretical considerations suggest that the ability of those players who were just good enough to get selected into elite academies is not evenly distributed over birth months, although, in the general population, ability is uncorrelated with birth dates. Players who just got selected will be denoted as the marginally selected. To define the concept of the marginally selected, we assume that, at selection point A ( $t = s$ ), all players above a certain current performance level $P_{δ} (s)$ get selected into youth elite academies ( $d_{i} = 1$ ), while all players below are rejected:

d_{i} = {\begin{matrix} 1 & if\ P_{i} (t = s) \geq P_{δ} (s) \\ 0 & if\ P_{i} (t = s) < P_{δ} (s) \end{matrix}

(4)

The marginally selected is the player for which

P_{i} (t = s) = P_{δ} (s)

. Conditioning the performance level on the birth month

m

, the performance level of the marginally selected can be denoted as

P_{δ} (t | m)

. It becomes evident that the maximum performance level of the January-born marginally selected is lower than the maximum performance level of the December-born marginally selected:

lim_{t \to \infty} P_{δ} (t | m = 1) = P_{δ}^{⋆} (m = 1) < P_{δ}^{⋆} (m = 12) = lim_{t \to \infty} P_{δ} (t | m = 12) .

(5)

This can be further generalized. The marginally selected player of a certain month has a higher maximum performance level than the marginally selected from the previous month apart from the December–January cut-off:

P_{δ}^{⋆} (m + 1) > P_{δ}^{⋆} (m) .

(6)

However, no further statements can be made about the exact relation of the talent of the marginally selected players from different months. A function of the marginally selected depending on month of birth could be convex, concave, or approximately linear, depending on the performance level cut-off at selection

P_{δ} (s)

, the point of selection

s

, and a general scaling parameter determining the course of the function, which we omitted for the sake of simplicity.

Based on Equation (6), we derive the following hypothesis:

Hypothesis 2 (Ability Premium at the Margin):

Among the marginally selected players, relative age disadvantages (later birth months) are positively correlated with players’ maximum performance levels: $C o r r (P_{δ, i}^{⋆}, m_{i}) > 0$ .

As maximum performance levels reflect players soccer-specific ability, we can rephrase this hypothesis as: Among the marginally selected, those with relative age disadvantages have relatively higher unobserved ability. We will test this hypothesis in Section “Relative Age Advantages, Training, Market Values, and Ability”.

Figure 3 illustrates that marginally selected players from different birth months eventually end up having very different maximum performance levels. The player born in December who was just good enough to get selected has a much higher maximum performance level than the January-born marginally selected.

Figure 3.

Marginally selected players and maximum performance levels.

In terms of ability, the upper bound of the players—the very best—selected into BYAs per birth month is assumed to be identical for all birth months, as ability is evenly distributed in the general population. However, the lower bound—the marginally selected—is skewed with the January-born marginally selected being less talented than the December-born marginally selected. Consequently, the theoretical model suggests that also the average ability of selected January-born players is lower than the average ability of their December-born peers. In general, the average ability—and thus their average maximum performance level ${\bar{P}}^{⋆} (m)$ —of players born in a certain month of the year exceeds the average talent of players born in the previous month apart from the December–January cut-off. Hence, because the marginally selected ability is skewed, also the average ability is skewed over birth months.

{\bar{P}}^{⋆} (m + 1) > {\bar{P}}^{⋆} (m) .

(7)

Being born just before the cut-off, in the end of the year, is thus related to an average ability premium. Based on Equation (7), we derive our third hypothesis:

Hypothesis 3 (Ability Premium on Average):

Average performance levels are positively correlated with players’ relative age disadvantages in elite youth academies.

Maximum Performance Levels and Foregone Market Values

Our main empirical approach is to rely on the highest market values HMV as a proxy for the maximum performance level $P_{i}^{⋆}$ of players with different treatment status $d_{i}$ . Concentrating merely on the maximum performance level has the advantage that the individual performance development function can remain unknown. Assuming that development functions are subject to the same development processes and determinants on average, a lot can still be inferred about performance development. Yet, the HMV is not just a function of a player’s maximum performance level, but also depends on other characteristics (see Kempa, 2022). Let player $i$ ’s highest market value $H M V_{i}$ be determined by $P_{i}^{⋆}$ plus the influence of other factors $X_{i}$ such as position, youth team or year born and an unobserved error term $u_{i}$ . Logarithmising the HMV takes the positive skew of market values into account and establishes linearity between HMV and the covariates. When controlling for the covariates $X_{i}$ , the logHMV is therefore plausibly an applicable proxy for the maximum performance level of players, based on which the RAE in BYAs can be analyzed in more detail. Based on that and Hypothesis 3, we derive our fourth hypothesis:

Hypothesis 4 (Foregone Market Value):

Among former youth elite academies players, market values are positively correlated with relative age disadvantages; implying that recruiting players more evenly across birth months increases average market values generated by elite youth academies.

In the following sections, we will empirically test our hypotheses after presenting our data and the setting we analyze.

Empirical Setting

Bundesliga Youth Academies

As a response to the bad performance of the German national team in the World Cup of 1998 and the European Championship of 2000,⁷ German youth soccer was radically reorganized and modernized. A new licensing regulation, passed in 2001, required every club in the first two divisions (Bundesliga and 2. Liga) to build up Bundesliga youth academies (BYA, German: ‘Nachwuchsleistungszentren’). The two primary goals of BYAs are ‘internationally outstanding Bundesliga and German national teams’ and ‘optimal exhaustion of the talent pool’ (Deutsche Fußball Liga, 2020a).⁸ Linking BYAs’ talent selection to the RAE, unobserved ability, and market values, this article will particularly address the second goal by asking whether BYAs exhaust their talent pool optimally.

BYAs are highly standardized, which will prove to be of great advantage for our analysis.⁹ The focus of soccer training is accurately regulated for certain age cohorts. Only from the U15 onwards, BYAs are allowed to conduct ‘performance-oriented training’, where specializations are stabilized and further developed as direct preparation for a professional soccer career (Deutsche Fußball Liga, 2020a).¹⁰ Between the U15 and U19, investments are highest, competition is biggest, and training is most intensive. As players develop most during this performance-oriented training, U15 to U19 squad selection is pivotal.

Today’s Bundesliga teams invest millions in their BYAs, while most money is spent on the U15 to U19 teams (Sponsors, 2019). Hoffenheim, for instance, has a staff of more than 50 full-time employees responsible for about 150 youth players which play in Hoffenheim’s seven BYA teams (Sponsors, 2019). In total, 5.400 adolescents played for 279 teams in 54 BYAs¹¹ in Germany in 2017 (Franzke, 2017). To put this figure into context, about 484.000 adolescents between the age of 15 to 19 play soccer in Germany (Deutscher Fußball Bund, 2020). Hence, only about 1% of active adolescent players make it to a BYA. From this top one percent, again less than 5% (60–70 players per year) will eventually succeed in getting a professional contract in Europe’s top leagues (Franzke, 2017; Sponsors, 2019). The total investment of the 36 Bundesliga and $2.$ Liga clubs in BYAs amounted to 177 and 186 million Euro in the seasons 2017/18 and 2018/19, respectively (Deutsche Fußball Liga, 2019, 2020b). Overall, more than 1.6 billion Euro have been invested in BYAs since 2001 (Deutsche Fußball Liga, 2018). Due to these high investments, it is safe to assume that BYAs indeed offer superior training compared to non-BYA youth clubs, which generally have much fewer resources available.

Data

In this subsection, we summarize the most important aspects of the data. A more detailed description is provided in Appendix B. We use data on former BYA youth players retrieved from the sports website transfermarkt.de.¹² Next to other information about professional soccer players (name, birth date, strong foot, height, transfer history, etc.), the focus of the website relies on market values. Market values are estimated and discussed by non-expert users for more than 800,000 soccer players worldwide and are regularly updated (Keppel & Claessens, 2020). Data from transfermarkt.de was used before in different scientific publications (e.g., Augste & Lames, 2011; Grossmann & Lames, 2013; Herm et al., 2014; Bryson et al., 2013; Pérez-González et al., 2020). While the data quality was viewed with criticism first (e.g., Sundermeyer, 2009), market values on transfermarkt.de were found to be highly correlated with expert estimates from well-respected sources (Franck & Nüesch, 2012). Peeters (2018) finds that transfermarkt.de data on market values performs better than other indicators in predicting a team’s strength. Moreover, he does not find evidence for ‘wishful thinking bias’, which would result in overestimating market values of popular players and teams. Müller et al. (2017) show that the crowd-based estimates from transfermarkt.de are equally accurate as estimates from a multiple regression algorithm and even outperform the algorithm for high-priced players.

When constructing the data set, there was a trade-off between quality and quantity. In other words, the aim was to include as many BYAs as possible without jeopardizing completeness and quality of the data. As a baseline, we examined the aggregated standings of the U19 Bundesliga since 2001. We further supplemented this information with rankings of the most successful BYAs from two different websites (fussballfieber.de, 2017; ran.de, 2015) and compiled a short list of the 36 most successful BYAs. Yet, going from the top to the bottom of the list, the data became increasingly incomplete. Finally, our data set consists of the U17 and U19 Bundesliga cadres of the 17 most successful youth teams between 2001 and 2020.¹³ Every additional club would have implied incomplete data.

We restrict our data to players with German nationality, as other players might have undergone elite youth academies of different qualities in their home countries before being selected. Additionally, players who were mentioned in BYA cadres but without concordant reference to this in their transfer history were dropped. This was necessary because we need to calculate the number of days that youth players spent in BYAs based on their transfer histories. The final data set contains 3,835 observations. Among them, 2,383 played for a U19 BYA and were born between 1988 and 2001, that is, could potentially have gotten five full years (U15-U19) of BYA performance-oriented training.

The variable BYAyears captures the time a player spent in one of the 17 BYAs chosen, ranging continuously from zero up to a maximum of five years. We only consider the period of performance-oriented training between the U15 and U19 as competition, investment, and training quality are highest in these years. BYAyears excludes spells during which players were first trained at one of the remaining 37 BYAs and joined one of the 17 selected clubs later.¹⁴ Two main arguments justify this specification: First, close examination of the data reveals that transfers from other BYAs (out of the sample) to the 17 first-tier BYAs (in the sample) are rather rare. Second, not all BYAs provide the same quality of training. More than 70% of total BYA investment is made by the 18 Bundesliga clubs (Sponsors, 2019). Investment in BYAs is, thus, likely to be skewed towards the most successful ones. Hence, BYAyears is an appropriate measure for the years that adolescents received distinguished soccer training, guaranteeing the highest possible level of homogeneity by not treating first- and second-tier BYAs as the same.

In the last two decades, a sharp increase in market values could be observed, so that highest market values are hardly comparable across years. To overcome this issue, we calculate Bundesliga market value inflation rates based on the total market values of all Bundesliga teams’ 11 most expensive players in all years between 2005 and 2020. We chose the 11 most expensive players from all 18 Bundesliga clubs in every given year because this yields a ‘‘player basket’’ of 198 players in each year which remains comparable over time. When merely looking at absolute market values or average market values, the inflation rate might be skewed by the number of players which clubs register in different years. While the number of players per club is also motivated by the ’starting eleven’, the 198 players in the player basket is a large enough number that the absolute market values are not influenced too much by individual players.

Absolute market values of all Bundesliga teams’ top 11 players and the respective inflation rates are shown in Figure C1 in Appendix C. On first sight, inflation rates of over 30% might appear unrealistic, but Poli et al. (2019) also find inflation rates above 30% for European soccer leagues between 2011 and 2019. Using the calculated inflation rates and the date when a player’s highest market value was reached, we convert highest market values to 2020 inflation-adjusted highest market values (HMV).¹⁵ In our analyses, we rely on logarithmized values (logHMV) to counteract the progressive nature of market values.

All variables are available for all observations except for the players’ specific positions and body height which are missing for about 5 and 20% of the observations, respectively. Table 1 reports descriptive statistics of our data set.

Table 1.

Descriptive Statistics of Key Variables.

	Obs.	Mean	Min	Max	Std. Dev.
HMV in 1,000EUR	2383	1283.702	0	128856.5	7054.63
logHMV	2383	3.83	0	11.8	2.82
BYAyears	2383	2.964	0.1	5.0	1.344
yearBorn	2383	1995.23	1988	2001	3.76
monthBorn	2383	4.69	1	12	3.16
weekBorn	2383	18.17	0.14	52.28	13.8
Born Jan–Jun, dummy	2383	0.715	0	1	0.45
Born Jan–Mar, dummy	2383	0.446	0	1	0.49
Specific Positions, categorical	2281	5.152	1	12	3.38
BuLi Pro, dummy	2383	0.242	0	1	0.43
Right-Footed, dummy	2383	0.469	0	1	0.50
Left-Footed, dummy	2383	0.185	0	1	0.39
Two-Footed, dummy	2383	0.176	0	1	0.38
U19 BYA Team, categorical	2383	9.0	1	17	4.89
U17 BYA Team, categorical	1688	9.3	1	17	4.85
National team, dummy	2383	0.022	0.0	1.0	0.146
Height in cm	1977	182.12	163	202	6.28

Data on the 17 most successful BYA U19 clubs from transfermarkt.de. Players born between 1988 and 2001. Variables on individual player level: 2020 highest market values adjusted for inflation in 1,000EUR (HMV), logarithmized values of HMV (logHMV), years spent in BYA (BYAyears), birth year (yearBorn), birth month (monthBorn), week born in players’ respective birth year (weekBorn), dummy variables for being born in the first half (Born Jan–Jun) and first quarter of the year (Born Jan–Mar), specific positions (goalkeeper, center back, right back, left back, central defensive, central midfield, central offensive, right midfield, left midfield, center forward, left wing, or right wing), dummy variable if played in the Bundesliga at least once (BuLi Pro), dummy variables for strong foot (Right-Footed, Left-Footed, Two-Footed), categorical variables for the 17 selected U19 BYA clubs (U19 BYA Team) and the U17 BYA clubs (U17 BYA Team), dummy variable for having played at least once for the German national team (National team, dummy), and height in cm (Height).

Empirical Results

Relative Age Effect in Bundesliga Youth Academies

To test Hypothesis 1 and quantify the RAE in BYAs, we calculate the share of players born in the first half and the first quarter of the year; two well established RAE indicators (see, e.g., Jackson & Comber, 2020; Mujika et al., 2009; Musch & Grondin, 2001; Tribolet et al., 2019). Table 2 shows that 71.5% of U19 youth players were born in the first half and 44.6% in the first quarter of the year. Both numbers are well above the equal birthday distributions, 50% and 25%.¹⁶ Table 2 reveals that the RAE is very pronounced across all the 17 BYAs. While certain differences exist, they are not extremely large. The proportion of players born in the first half of the year varies between 77.1% (VfL Wolfsburg) and 65.8% (Schalke 04), while the share of players born in the first quarter of the year ranges between 56.7% (Borussia Dortmund) and 38.0% (Hoffenheim).

Table 2.

The Relative Age Effect: Summary Statistics by U19 BYA.

	% born Jan–Jun	% born Jan–Mar	Mean HMV in 1,000€	Obs.
Full sample	71.5	44.6	1283.702	2383
VfL Wolfsburg U19	77.1	45.0	1054.406	131
Borussia Dortmund U19	75.6	56.7	1960.316	127
FC Bayern München U19	75.4	46.5	3599.560	114
VfB Stuttgart U19	74.4	45.1	2349.844	134
Bayer 04 Leverkusen U19	73.1	50.0	914.710	130
TSV 1860 Mönchen U19	71.8	46.6	1067.845	163
Eintracht Frankfurt U19	72.6	42.5	262.893	146
Werder Bremen U19	71.5	47.2	886.345	144
1.FSV Mainz 05 U19	71.5	42.3	1270.778	130
SC Freiburg U19	71.2	42.9	707.105	156
Hamburger SV U19	70.8	44.6	966.160	129
TSG 1899 Hoffenheim U19	70.1	38.0	1008.937	137
1.FC Köln U19	69.5	42.4	1013.030	151
Borussia Münchengladbach U19	69.2	45.3	1101.972	159
Hannover 96 U19	69.7	41.0	528.013	121
Hertha BSC U19	68.7	43.6	956.346	164
FC Schalke 04 U19	65.8	40.4	2867.211	147

Data on the 17 most successful BYA U19 clubs from transfermarkt.de. Players born between 1988 and 2001. Differences in the number of observations per club can be attributed to missing data and different proportions of foreign youth players, who are not considered here.

Table C1 in Appendix C replicates these findings for the U17 BYA teams, showing an even larger RAE than in U19 BYA teams. The pattern of stronger RAE in U17 teams and a slightly smaller RAE in U19 teams, presumably owing to declining maturity differences, was also found in other studies which we discussed in the literature review (e.g., Jackson & Comber, 2020; Malina et al., 2004; Patel et al., 2019).

Relative age differences should have faded to a large extent already in U19 BYA teams, and fully in professional adult leagues. The finding that a significant RAE still exists in U19 BYA teams (see Table 2) as well as in the two top German professional adults leagues (see Figure C3 in Appendix C), highlights the persistence of the RAE and is in line with Hypothesis 1. In the context of selection cut-offs during performance development, initial relative age-based performance differences, and positive effects of elite youth academy training, the RAE does not only arise, but it also persists.

Table 2, furthermore, presents average highest market values (HMV). As there is only little variation in the size of the RAE and market values are influenced by various other factors, it is not surprising that the size of the RAE and HMV do not seem to be correlated across clubs. Average HMV, however, need to be treated with caution as values are likely to be affected by a few very expensive players. Yet, it is clear that the existence of the RAE is economically interesting given BYA players’ (future) market values.

Figure 4 illustrates the development of the RAE over time, by showing the proportion of players born in the first half of the year between 1985 and 2005. The two main insights from this figure are that, first, the RAE did not decline since the introduction of BYAs and, second, the proportion of players born in the first half of a year is significantly different from 50% (i.e., the equal distribution) at the 95% confidence interval for every birth cohort. At the beginning of the period examined in this article (birth cohorts 1988 and 1989), the proportion of players born in the first half of the year was around 65%. The RAE indicator increased to around 75% in the following 10 years and remained approximately unchanged since then.

Figure 4.

The RAE over time: Proportion of BYA players born in first half of the year by age cohorts. Figure displays values of all former U19 BYA players born between 1988 and 2001. The respective age cohorts are indicated by vertical lines. Confidence intervals at 95% and equal distribution as reference.

Overall, the descriptive statistics show that the RAE has not declined, but was rather amplified since the introduction of BYAs. The primary goal of BYAs, the ‘‘optimal exhaustion of the talent pool’’ (Deutsche Fußball Liga, 2020a), is thus probably missed.¹⁷ As unobserved ability is plausibly independent of birth dates, the preferred selection of relatively older adolescents suggests that talent is lost: Some late blooming shooting stars are deprived of the chance to shine.

Relative Age Advantages, Training, Market Values, and Ability

To test our Hypotheses 2 and 3—that performance levels are positively correlated with players’ relative age disadvantages, at the margin of getting selected as well as on average—we first run regressions of logHMV on players’ quarters of birth. In doing so, the logHMV serve as a proxy for players’ maximum performance levels. In all specifications, we control for year of birth and U19 club fixed effects. Column 1 of Table 3 shows a clear picture: BYA players born later in the year reach significantly higher market values during their careers. Specifically, former U19 youth players that were born in the third (fourth) quarter of the year reach 41.9% (58.7%) higher market values compared to their peers born in the first quarter of the year. Our results are supported by quantile regressions (see Table C3 in Appendix C).

Table 3.

OLS Regressions of Market Values, BYA Training, and Birth Quarters.

	(1)	(2)	(3)	(4)	(5)
	logHMV	logHMV	BYAyears	logHMV	logHMV
Q2: Apr–Jun	0.0599	0.303	$-$ 0.0656		0.0122
	(0.126)	(0.186)	(0.0634)		(0.129)
Q3: Jul–Sep	0.364***	0.315*	$-$ 0.191**		0.322**
	(0.140)	(0.178)	(0.0759)		(0.135)
Q4: Oct–Dec	0.462***	0.212	$-$ 0.224**		0.453***
	(0.177)	(0.213)	(0.0965)		(0.168)
BYAyears				0.614***	0.532***
				(0.0384)	(0.0396)
Height (cm)					0.0627***
					(0.00937)
Position control	No	No	No	No	Yes
Strong foot control	No	No	No	No	Yes
Birth year FE	Yes	Yes	Yes	Yes	Yes
U19 Club FE	Yes	Yes	Yes	Yes	Yes
Observations	2383	577	2383	2383	1873
R squared	0.231	0.276	0.0826	0.306	0.332
Sample	Full	BuLiPro	Full	Full	Full
Cov. with weekBorn	2.626	2.062	$-$ 1.383	2.626	2.092
Cov. with BYAyears	0.736	0.106	1.807	0.736	0.650

The full sample includes all former U19 BYA players who were born between 1988 and 2001. The BuLiPro sample in column 2 includes only players that have at least once played in the Bundesliga or 2. Bundesliga during their career. In columns 1, 2, 3, and 5 players born in the first quarter of the year (Q1: Jan-Mar) are omitted and constitute the baseline. Because the logarithm of the market values is the dependent variable (in column 1, 2, 4 and 5), the coefficient needs to be converted as following: $100 \times (e x p (\hat{β}) - 1)$ . Based on that, a selection of coefficients and their respective percentage changes are shown in the format $β = x$ %: $- 0.2 = - 18.1$ %, $0.212 = 23.6$ %, $0.322 = 38.0$ %, $0.364 = 43.9$ %, $0.453 = 57.3$ %, $0.462 = 58.7$ %, $0.532 = 70.2$ %, and $0.614 = 84.8$ %. The ’position control’ consists of 12 different position specializations. The ‘strong foot control’ refers to player’s strong foot: left, right, or both. The smaller sample in column 5 can be explained by the fact that the ‘‘height’’-variable is missing for some players. The last two rows report the unconditional covariance of the dependent variable with the variables ‘weekborn’ and ‘BYAyears’, respectively. Heteroskedasticity-robust Huber-White standard errors in parentheses. *Significant at the 10% level, **significant at the 5% level, ***significant at the 1% level.

At first sight, this result appears to contradict the findings of Fumarco and Rossi (2018); Pérez-González et al. (2020) who show that professional soccer players’ monetary valuations are not significantly correlated with their relative age. However, this can partly be explained by different samples. We study former U19 BYA players, while Fumarco and Rossi (2018); Pérez-González et al. (2020) concentrate only on professional adult players. Our results, however, are more in line with Ashworth and Heyndels (2007) who analyze the (estimated) wages of professional players from the German Bundesliga for two seasons between 1997 and 1999 and find that the late-born players receive a wage premium. In Column 2 of Table 3, we similarly restrict our sample to those former U19 BYA players who have at least once played in one of the top two German professional adult leagues. When doing so, the association between players’ quarter of birth and their market values declines and loses significance; while still being positive. Our findings are thus generally in line with the literature. Within the sample of those players who made it to the professional stage, the association between players’ monetary valuations and relative age does neither seem to be clearly positive nor clearly negative. In fact, we would not expect significant differences, if players with relative age advantages are a bit less talented, on average, but receive a bit more time of distinguished elite academy training. Yet, in our sample of all former U19 BYA players—including the fading shooting stars that never make it to the professional stage—the unambiguously positive association between relative age disadvantages and market values is striking.

As players’ quarter of birth is, by nature, uncorrelated with their talent, skill and thus maximum performance level, the positive correlation between relative age disadvantages and logHMV can be explained by unobserved ability and sample selection: Players born towards the end of the year need to compensate with higher ability for their relative age disadvantage in order to get selected by BYAs.

Still, the simple estimates presented in Column 1 of Table 3 might be biased: Column 3 of Table 3 shows that players with relative age disadvantages are selected relatively later by BYAs and therefore receive, on average, less distinguished elite training. At the same time, years of BYA training are strongly positively associated with market values (see Column 4 of Table 3). Moreover, other factors such as players’ positions, their height or even their strong foot, might be correlated with their relative age, unobserved ability and thus market values. In Table C2 in Appendix C, we report additional results on the association between players’ market values and player specific characteristics. We obtain three main results: (i) the positions of former BYA players that are associated with the highest market values are right wing, left wing, and central midfield, while center backs, right midfielders, and left midfielder reach the lowest market values, (ii) taller players reach on average higher market values, and (iii) two-footed players reach on average significantly lower market values than right-footed players, which indicates that BYAs overrate the importance of two-footedness when it comes to talent selection.

In Column 5 of Table 3, we therefore add controls for these player characteristics and also control for the number of BYA training years the players received. This does, however, not affect the results: The association between former BYA players’ relative age and their market values is still significantly positive. Players born in the last quarter of the year still have market values which are 57.3% larger than the market values of their peers born in the first quarter of the year. This indicates that in the sample of BYA players, relative age and unobserved ability are not evenly distributed. In other words, relatively disadvantaged players that still get selected are relatively more talented. While this finding strongly supports Hypothesis 3—that average performance levels are positively correlated with relative age disadvantages in elite youth academies—it does not directly verify Hypothesis 2. Based on the regressions in Table 3 alone we cannot conclude that it is indeed the marginally selected players who drive this positive correlation between relative age disadvantage and ability as stated in Hypothesis 2.

To test Hypothesis 2, we therefore need to go one step further. We do this by slightly abusing an instrumental variable (IV) approach: Instead of using an unbiased IV estimate to measure a causal effect, we use a biased IV estimate to learn about the covariance of some part of the error term (unobserved ability) and the instrument (relative age disadvantage) at the margin of getting selected. In the following paragraphs, we describe how we do this in detail.

The following two-stage least squares (2SLS) estimation equations provide the basis for testing Hypothesis 2:

{BYAyears}_{i} = π_{0} + π_{1} w e e k B o r n_{i} + δ_{y} + γ_{c} + ν_{i}

(8)

l o g H M V_{i} = β_{0} + β^{I V} {\hat{B Y A y e a r s}}_{i} + δ_{y} + γ_{c} + ϵ_{i},

(9)

where we first use players’ relative age as an instrument for the years of BYA training they received (first stage, Equation (8)), and then regress players’ market values on their instrumented years of BYA training (second stage, Equation (9)). Here,

δ_{y}

denote year of birth fixed effects and

γ_{c}

U19 BYA club fixed effects.

The instrument is motivated by the fact that relatively older adolescents have a higher propensity of getting selected early by BYAs. The literature suggests that performance differences between boys of contrasting maturity status are most pronounced between the age of 13 and 16 (see Section “Stylized Facts on the RAE and Performance Development”) so that boys with a relative age advantage are more likely to be selected at the U15 stage. As the maturity advantage of relatively older players decreases subsequently, some relatively younger players make it into the team at later stages. In the U19 team, then, relatively older players should have gotten more years of BYA training on average. Moreover, Column 3 of Table 3 and also Columns 1 and 3 of Table 4 show that relative age is significantly correlated with BYAyears.

Table 4.

Two-Stage Least Squares: Identifying the Marginally Selected Talent Bias.

	First-Stage	Second-Stage	First-Stage	Second-Stage
	(1)	(2)	(3)	(4)
	BYAyears	logHMV	BYAyears	logHMV
$\hat{B Y A y e a r s}$		$-$ 1.617*		$-$ 1.723*
		(0.863)		(0.881)
weekBorn	$-$ 0.00646***		$-$ 0.00647***
	(0.00198)		(0.00197)
Birth Year FE	Yes	Yes	Yes	Yes
U19 Club FE	No	No	Yes	Yes
Observations	2383	2383	2383	2383
F-Test	10.61		10.76

The sample includes all former U19 BYA players who were born between 1988 and 2001. Columns 1 and 3 show the first-stage results. Columns 2 and 4 show the second-stage results. Because the logarithm of the market values is the dependent variable (in columns 2 and 4), the coefficient needs to be converted as following: $100 \times (e x p (\hat{β}) - 1)$ . Based on that, coefficients can be interpreted as changes of the following size: $-$ 77.4% (column 2) and $-$ 82.1% (column 4). The Wu-Hausman F-statistics of the 2SLS regressions are 16.57 and 18.73, respectively, when running regressions analog to those in columns 2 and 4 without robust standard errors, as required by the Wu-Hausman test. We can therefore reject that BYAyears is exogenous in OLS regressions. Heteroskedasticity-robust Huber-White standard errors in parentheses. * Significant at the 10% level, ** significant at the 5% level, *** significant at the 1% level.

Following the idea of Angrist and Krueger (1992), birthdays are generally a valid instrument as they are plausibly random and uncorrelated with possible confounders. As Musch and Grondin (2001) argue, there are no seasonal circumstances which could explain why youth players are more likely to be selected by elite youth academies apart from relative age. Players born in December of one year and those born in January of the next year are exposed to the same conditions while growing up.¹⁸ It is therefore safe to assume that unobserved ability is distributed equally across birth months in the whole population.

However, unobserved ability is likely not independently distributed in our sample of those players that were selected into BYAs while it is in the general population. As unobserved ability likely influences how early a player gets selected and also how much BYA training he gets, it is also linked to market values. In our sample, the exclusion restriction—the necessary assumption to identify causal effects using IV—therefore likely fails. The IV estimation is biased. This implies that we cannot identify the causal effect of BYAyears on market values. Still, we can use the biased IV estimation to learn something about the covariance of the instrument (the relative age) and the confounder (unobserved ability).

Note that the biased IV estimator, not conditioning on fixed effects for simplicity, can be expressed as follows:¹⁹

β^{I V} = \underset{> 0}{\underset{⏟}{β^{u n b i a s e d}}} + C o v (ϵ_{i}, w e e k B o r n_{i}) / \underset{< 0}{\underset{⏟}{C o v ({BYAyears}_{i}, w e e k B o r n_{i})}},

(10)

where

β^{u n b i a s e d}

is the unbiased (causal) effect of one additional year of BYA training on players’ market values, which we cannot estimate but are safe to assume to be non-negative given the large amount of resources used to train and promote youth players in BYAs compared to soccer training outside of the BYA elite academies. Columns 4 and 5 of Table 3 also suggest that the effect of one additional year of BYA training is significantly positive. Moreover, we know from Column 3 of Table 3 that the covariance of weekBorn and BYAyears is negative. This is further supported by the first stage of the 2SLS estimation reported in Columns 1 and 3 of Table 4.

The only part of Equation (10) which we do not know the sign of is the covariance of the 2SLS regression error $ϵ$ and weekBorn. Note from the discussion of our instrument above that the only part of the 2SLS regression error that can be correlated with relative age (weekBorn) is players’ unobserved ability which determines if they get selected despite having relative age disadvantages. If the sign of the 2SLS estimator $β^{I V}$ is negative, we can therefore conclude that covariance of the 2SLS regression error $ϵ$ and the relative age disadvantage weekBorn²⁰ is positive: $β^{I V} < 0 \Rightarrow C o v (ϵ_{i}, w e e k B o r n_{i}) > 0$ .

In fact, Columns 2 and 4 of Table 4 show that the 2SLS estimator is significantly negative at the 10% level. Based on Equation (10), we thus conclude that the covariance of unobserved ability and relative age disadvantage is positive.

Using another feature of IV estimation, we can go even one last step further. The IV estimator $β^{I V}$ does not estimate an average effect, but a local average treatment effect (LATE). In general, this is the average ’treatment’ effect for those that got induced into ’treatment’ by the instrument and were not ’treated’ otherwise. In our context, this refers to those that only got more (or less) years of BYA training exclusively because of their relative age. It is apparent that this group is almost analogous to our definition of the marginally selected, for which small differences determine if they get selected into BYAs or not. As $β^{I V}$ can be considered a (biased) local estimator of the training effect for the marginally selected, we can also consider $C o v (ϵ_{i}, w e e k B o r n_{i})$ as the local covariance between the marginally selected players’ unobserved ability and their relative age disadvantage. The negative sign of the (biased) LATE in Columns 2 and 4 of Table 4 therefore implies that the covariance of unobserved ability and relative age disadvantages is negative for the marginally selected. Hence, we can conclude that there is an ability premium at the margin of getting selected as we have argued theoretically in Section “Theoretical Framework” and have stated in Hypothesis 2.

In conclusion, we have utilized a biased IV estimator to establish that the covariance of unobserved ability and relative age disadvantage is positive at the margin of getting selected. This does not only support Hypothesis 2, but also explains why we find an ability premium for those with relative age disadvantages on average (Hypothesis 3).

Finally, we want to note that the instrumental variable approach is very promising for further analyses of the mechanisms involved in the RAE. New data on both elite and non-elite youth players would promise two advantages. First, the prevalence of the RAE in BYAs would likely further increase the instrument’s strength. Second, the exclusion restriction would hold, as talent in the whole population is independent from birth dates so that a causal effect of elite youth academy training on market values could be estimated. IV regressions can therefore be a promising starting point for further research.

Foregone Market Values

This section aims to test Hypothesis 4. The key questions are: Do BYAs forego higher market values by not selecting youth players evenly across birth months? And, can we quantify how much additional market value, if any, BYAs could generate when eliminating the RAE in talent selection? To answer these questions, we start with the findings derived in the previous section. We found that, in the sample of former BYA youth players, those with relative age disadvantages have on average higher ability levels; in particular, at the margin of getting selected. This implies, for example, that the December-born player who was just not selected by a U15 BYA is likely to have higher (unobserved) ability than the January-born player who just got selected by the U15 BYA. From Figure 3 in Section “Theoretical Framework”, it is apparent that this exemplary selection decision can be improved: By selecting the marginal December-born instead of the marginal January-born, ability in the BYA could be increased; as well as market values in the long term.

Of course, one would need clairvoyant abilities to predict the actual performance potential of youth players. The relationships established in Hypotheses 2 and 3 are obviously not valid for each individual player. Still, the positive association of relative age disadvantages and unobserved ability, which exists on average, indicates that selecting more evenly across months of birth will also have effects on average. To test Hypothesis 4, we therefore compare groups of players, not individual players. The idea is rather simple: We compare a group of players that is representative for the current average level of ability in BYAs to a group of players that is representative for the average level of ability that would emerge if selection was independent of the RAE.

Specifically, using month of birth as a grouping variable, we compare a credible status quo reference group (ref) to a plausible state-of-no-RAE group $D$ . By taking differences in (conditional) mean logHMV, we obtain an estimate for the costs of the RAE in BYAs.

\frac{H M V^{D} - H M V^{r e f}}{H M V^{r e f}} \approx l o g H M V^{D} - l o g H M V^{r e f} = β_{D}^{r e f}

(11)

Figure 5 shows the distribution of average logHMV by birth month. In line with the regression results in Table 3, logHMV increase over birth months. The mean logHMV (the red horizontal line) is surpassed in the middle of the year. We, therefore, consider players born in June and July as the natural choice for the reference group for the status quo talent level in BYAs. Alternative choices for the status quo group are players born between May and June or players born between July and August. As logHMV of these players are just below (above) the mean logHMV, we expect to obtain upper (lower) bound cost estimates when using these alternative status quo groups as a baseline.²¹

Figure 5.

Mean logarithmized highest market values by month of birth. Note: The sample includes all former U19 BYA players who were born between 1988 and 2001. The red line refers to the average logHMV of all players born between 1988 and 2001.

We now turn to the question of how high the average talent level in BYAs could be when eliminating the RAE. The natural choice for the state-of-no-RAE group are players born between September and December. We split this group into a September–October and a November–December group. In forming these groups, we can account a bit for the fluctuations in logHMV across birth months (see Figure 5). We consider the November–December group as a more optimistic state-of-no-RAE group, as among these players the average talent level should be the highest, while the September–October state-of-no-RAE group is a more conservative choice.

Building on Equation (11), we estimate the cost of the RAE in BYAs using OLS regressions. For each status quo reference group (ref), we estimate differences in means with respect to a set of state-of-no-RAE groups $Γ = {d_{1}, d_{2}, \dots D}$ . Groups are defined by month of birth ( $m o n t h B o r n$ ) of player $i$ . Building on the notation introduced above,²¹ we estimate the following regression model:

l o g H M V_{i} = β_{0} + \sum_{d = 1}^{D} β_{d} \times 1 [m o n t h B o r n_{i} \in d] + X_{i} Λ + γ_{c} + δ_{y} + u_{i}

(12)

\begin{aligned} if m o n t h B o r n_{i} \in {r e f \cup Γ} . \end{aligned}

Our coefficient of main interest,

β_{d}

, captures the difference in mean logHMV between players born in the state-of-no-RAE month of birth group

d

and players born in the status quo reference group (

r e f

). As shown in Equation (11), we can interpret

β_{d}

as the proportional average market value premium of the state-of-no-RAE group relative to the status quo reference group. In the regressions, only players who were born in either the status quo reference group or one of the state-of-no-RAE groups are considered. All players born in other months are not included in the respective regression samples.

Table 5 reports the coefficient estimates for different status quo reference groups and state-of-no-RAE groups. Columns 3 to 5 show estimates with players born between May and August as the reference group which, as discussed above, constitute the most credible baseline for how high the average talent level currently is in BYAs. Taking players born between September and October and those born between November and December as the state-of-no-RAE groups and following our reasoning above, we find that BYAs could generate 30.6 to 72.8% higher HMV when eliminating the RAE.

Table 5.

The Foregone Market Values and the Relative Age Effect in Bundesliga Youth Academies.

	(1)	(2)	(3)	(4)	(5)
	logHMV	logHMV	logHMV	logHMV	logHMV
Reference Months	Jan–Feb	Mar–Apr	May–Jun	Jun–Jul	Jul–Aug
May–Jun	0.0329	0.00994
	(0.156)	(0.170)
Jul–Aug	0.229	0.231	0.193
	(0.158)	(0.175)	(0.187)
Sep–Oct	0.470**	0.457**	0.450**	0.337	0.262
	(0.196)	(0.209)	(0.218)	(0.214)	(0.220)
Nov–Dec	0.590***	0.556**	0.540**	0.478*	0.365
	(0.226)	(0.234)	(0.246)	(0.248)	(0.247)
Birth Year FE	Yes	Yes	Yes	Yes	Yes
U19 Club FE	Yes	Yes	Yes	Yes	Yes
Observations	1846	1606	1069	711	680
$R^{2}$	0.244	0.215	0.227	0.255	0.269

The sample includes all former U19 BYA players who were born between 1988 and 2001. Column 1 compares logarithmized market values of youth players born in January and February with those born in the other birth months groups shown. In the other columns, the reference birth month combinations are March and April (column 2), May and June (column 3), June and July (column 4), and July and August (column 5). Because the logarithm of the market values is the dependent variable, the coefficient needs to be converted as following: $100 \times (e x p (\hat{β}) - 1)$ . Based on that, a selection of coefficients and their respective percentage changes are shown in the format $β$ = x%: 0.193 = 21.3%, 0.23 = 25.9%, 0.337 = 40.1%, 0.45 = 56.8%, 0.478 = 61.3%, 0.54 = 71.6%, and 0.59 = 80.4%. Heteroskedasticity-robust Huber-White standard errors in parentheses. * Significant at the 10% level, ** significant at the 5% level, *** significant at the 1% level.

Taking the natural choice for the status quo group, players born in June and July, as reference (Column 4 of Table 5), eliminating the RAE in talent selection is associated with 38.8 to 64.5% larger HMV. The estimator of the September–October state-of-no-RAE group, however, is not statistically significant. The estimate of the November–December state-of-no-RAE group is meanwhile statistically significant at the 5%-level. Overall, the estimates show that eliminating the RAE could lead to sizeable effects on average market values of former elite youth players.

Considering the alternative status quo groups, May–June (Column 3) and July–August (Column 5), we see that the estimates follow the expected patterns. Mean differences are larger in size and statistically significant for the May–June reference group, while they are smaller and insignificant for the July–August reference group. Moreover, the November–December state-of-no-RAE group leads to generally larger mean differences than the September–October group. Hence, the difference in logHMV between the July–August and the September–October groups (Column 5, 30.6%) plausibly constitutes a lower bound for the opportunity costs of the RAE in BYAs. Likewise, the difference in means between the May–June and November–October groups (Column 3, 72.8%) can be considered an upper bound. The estimates are robust to controlling for individual player’s height and strong foot (see Table C4 in Appendix C).

Overall, these results support Hypothesis 4: BYAs forego higher average market values by maintaining selection characterized by the RAE instead of selecting youth players evenly across birth months. Market values differ considerably over birth months with players born later in the year having generally higher market values. Eliminating the RAE in talent selection, average unobserved ability levels in BYAs could be increased. We find that this higher average ability would translate into 30.6 to 72.8% higher market values of former U19 BYA players. Hence, professional German clubs could generate substantially more value through their BYAs than they are currently doing. To express this in numbers, if the average former BYA player sells for 1.284 million EUR today (see Table 1), he could sell for 1.677 to 2.219 million EUR in absence of the RAE.

One could criticize our calculations arguing that relatively younger players might compensate for their relative disadvantage with greater effort and might even adapt their performance level to the relative physiological advantage of their peers (see Mann & van Ginneken, 2017; Votteler & Höner, 2014). In other words, relative age disadvantages could come with positive spillovers from relatively advantaged team mates. This is also discussed as peer effects in the literature (see, e.g., Ashworth & Heyndels, 2007). A disproportionately higher share of older players within a youth team would then not be an indication of the RAE, but could be a strategic tool to promote a few exceptionally promising players (see Section “Production Function of Elite Youth Academies and the Optimal Selection Policy”). Two important observations speak against this argumentation. First, evidence suggests that relatively younger and relatively less physically developed youth players tend to receive less match playing time than their relatively older and stronger peers (Deprez et al., 2015; Sæther, 2016; Vaeyens et al., 2005). This is also in line with the observation that clubs aim to be successful at all stages and utilize the RAE in pursuit of short-term success (Jiménez & Pain, 2008). For relatively younger players, positive spillovers during training might thus be balanced out by the negative effect of less and shorter match experience. Second, the RAE can still be observed at the professional level (see Sierra-Díaz et al., 2017 and Figure C3 in Appendix C) which indicates the inability of elite youth academies to identify their top players independently of the RAE.

Finally, we need to emphasize again that we only estimate the possible gains of eliminating the RAE. We have not considered the opportunity costs of the relative maturity effect which co-exists next to the RAE and is also influential in BYA talent selection (see Malina et al., 2000). The overall costs of selecting along the lines of momentary instead of potential performance levels are therefore expected to be even higher.

Conclusion

This article investigates the RAE in German elite youth soccer. We develop a simple theoretical model which illustrates the underlying mechanism in talent selection and the negative consequences of the RAE. Based on our model, we derive four hypotheses: First, given selection cut-offs during performance development, relative age-based performance differences, and positive effects of elite youth academy training, we hypothesize that the RAE occurs in an competitive environment, and is sustained even when relative age differences fade. Second, we hypothesize that among marginally selected players, relative age disadvantages (later birth months) are positively correlated with players’ unobserved ability. Third, we derive from the second hypothesis that, among all youth players selected by elite academies, average ability levels are positively correlated with players’ relative age disadvantages. Fourth, we postulate that elite youth academies forego the creation of higher market values by not selecting youth players more evenly across months of birth. Our data includes information on the players of the most successful German Bundesliga Youth Academies (BYAs) for the period 2002–2020.

Even though the RAE has been well-documented since decades, our results show that it still exists in German elite Bundesliga youth academies. While there is no reason to assume that ability is not distributed evenly across birth dates, we find that 71.5% (44.6%) of the players in BYAs in the under 19 teams were born in the first 6 (3) months of a year. In this competitive environment with key date assessments, relatively older players within a cohort accordingly have a higher probability of getting selected. Moreover, the RAE also persists in professional German adult soccer. This supports our first hypothesis.

We show that elite youth players who were born towards the end of the year, on average, reach significantly higher market values during their career. This suggests that, within the sample of all players that were selected by BYAs, elite youth players must compensate for their relative age disadvantage by having higher ability. This is also expressed in market values and supports our third hypothesis.

With respect to our second hypothesis, we rely on an instrumental variable approach to learn about the covariance of some part of the error term (unobserved ability) and the instrument (relative age disadvantage). We find that relative age disadvantages are positively correlated with unobserved ability at the margin of getting selected. This supports our second hypothesis and explains the significantly positive relation between market values and relative age disadvantages.

Finally, the results of our analyses reveal that the RAE causes substantial financial losses for the clubs as it reduces players’ market values. According to our estimations, future market values of BYA players could be between 30.6 and 72.8% higher if the clubs were able to eliminate the RAE in talent selection. These figures support our fourth hypothesis and show that the RAE does not only cause substantial costs in terms of team performance, but also in the financial dimension.

The mechanisms we have described are also relevant for talent identification, development, and recruitment outside of sports. There are many (structural) reasons that give individuals short-term advantages or disadvantages (e.g., parental background, gender, ethnicity, networks, ordinal ranks, language, mobility, environmental shocks, etc.) which might mask their real potential. This can be the case in several contexts: firms’ hiring or promotion decisions, the admission to certain schools or study programs, tracking decisions in school, allocation to math or reading groups or other enrichment programs in (primary) school, the award of scholarships, program participation among unemployed or in development aid, et cetera. Failing to account for these short-term factors reproduces and deepens inequalities (e.g., Hanushek & Rivkin, 2009; Murphy & Weinhardt, 2020) and poverty (e.g., Balboni et al., 2022), leads to a waste of talent, and makes later compensatory investments more expensive, especially if relative disadvantages occur in early childhood (see Cunha & Heckman, 2007).

We show that distinguishing between adolescents’ current and potential performance levels is crucial for the efficient allocation of talent and resources. Beyond that, our article contributes in two ways to the debate on how to improve the allocation of talent in society. First, we offer a conceptional framework and an exemplary application, highlighting the key mechanisms and implications of talent selection in the nexus between current and potential performance levels. Second, we show that the economic gains can be large if initial differences are eliminated rather than perpetuated.

Footnotes

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors received no financial support for the research, authorship and/or publication of this article.

ORCID iDs

Lukas Tohoff

Mario Mechtel

Notes

Author Biographies

Lukas Tohoff completed his Master of Science in Economic and Social Sciences at Bocconi University in 2023 and attended the General Course at the London School of Economics during his undergraduate studies. He holds a Bachelor of Science in Economics and a Bachelor of Science in Business Administration, both from Leuphana University Lüneburg. He is now studying for a PhD in Economics with the Berlin School of Economics at Humboldt University of Berlin. His main research interests are in applied econometrics, labor economics, health economics, and economics of sports.

Mario Mechtel studied Economics at Johannes Gutenberg University Mainz and earned his PhD in Economics from Eberhard Karls University Tübingen. Since 2018, he has been a professor of Empirical Microeconomics at Leuphana University Lüneburg. His research focuses on behavioral economics, environmental and sustainability economics, digitalization, and labor and personnel economics, the latter with applications in sports economics.

Appendix

References

Allen

Barnsley

(1993). Streams and tiers: The interaction of ability, maturity, and training in systems with age-dependent recursive selection. The Journal of Human Resources, 28(3), 649–659.

Angrist

J. D.

Krueger

A. B.

(1992). The effect of age at school entry on educational attainment: An application of instrumental variables with moments from two samples. Journal of the American Statistical Association, 87(418), 328–336.

Angrist

J. D.

Pischke

J. S.

(2009). Mostly harmless econometrics: An empiricist’s companion. Princeton University Press.

Ashworth

Heyndels

(2007). Selection bias and peer effects in team sports. Journal of Sports Economics, 8(4), 355–377.

Augste

Lames

(2011). The relative age effect and success in German elite U-17 soccer teams. Journal of Sports Sciences, 29(9), 983–987.

Balboni

Bandiera

Burgess

Ghatak

Heil

(2022). Why do people stay poor? Quarterly Journal of Economics, 137(2), 785–844.

Barnsley

R. H.

Thompson

A. H.

Barnsley

P. E.

(1985). Hockey success and birthdate: The relative age effect. Canadian Association for Health, Physical Education, and Recreation, 51(1), 23–28.

Berger

Nieken

(2016). Heterogeneous contestants and the intensity of tournaments. Journal of Sports Economics, 17(7), 631–660.

Bryson

Frick

Simmons

(2013). The returns to scarce talent: Footedness and player remuneration in European soccer. Journal of Sports Economics, 14(6), 606–628.

10.

Cobley

S. P.

Schorer

Baker

(2008). Relative age effects in professional German soccer: A historical analysis. Journal of Sports Sciences, 26(14), 1531–1538.

11.

Cohen-Zada

Krumer

Shapir

O. M.

(2018). Testing the effect of serve order in tennis tiebreak. Journal of Economic Behavior & Organization, 146, 106–115.

12.

Cullen

J. B.

Jacob

B. A.

Levitt

(2006). The effect of school choice on participants: Evidence from randomized lotteries. Econometrica, 74(5), 1191–1230.

13.

Cumming

S. P.

Brown

D. J.

Mitchell

Bunce

Hunt

Hedges

Malina

R. M.

(2018). Premier league academy soccer players’ experiences of competing in a tournament bio-banded for biological maturation. Journal of Sports Sciences, 36(7), 757–765.

14.

Cunha

Heckman

(2007). The technology of skill formation. American Economic Review, 97(2), 31–47.

15.

Dawid

Muehlheusser

(2015). Repeated selection with heterogeneous individuals and relative age effects. Journal of Economic Behavior & Organization, 116, 387–406.

16.

Del Campo

D. G. D.

Vicedo

J. C. P.

Villora

S. G.

Jordan

O. R. C.

(2010). The relative age effect in youth soccer players from Spain. Journal of Sports Science & Medicine, 9(2), 190.

17.

Deprez

D. N.

Fransen

Lenoir

Philippaerts

R. M.

Vaeyens

(2015). A retrospective study on anthropometrical, physical fitness, and motor coordination characteristics that influence dropout, contract status, and first-team playing time in high-level soccer players aged eight to eighteen years. The Journal of Strength & Conditioning Research, 29(6), 1692–1704.

18.

Deutsche Fußball Liga. (2018). Die Entwicklung der Leistungszentren. https://www.dfl.de/de/hintergrund/nachwuchsleistungszentren/wende-am-tiefpunkt-die-nachwuchsleistungszentren/

19.

Deutsche Fußball Liga. (2019). Wirtschaftsreport 2019. https://media.dfl.de/sites/2/2019/02/DFL_Wirtschaftsreport_2019_DE_M-1.pdf

20.

Deutsche Fußball Liga. (2020a). Anhang V: Richtlinien für die Unterrichtung und Unterhaltung von Leistungszentren der Teilnehmer der Lizenzligen. https://media.dfl.de/sites/2/2019/12/Anhang-V-zur-LO-2019-12-04-Stand.pdf

21.

Deutsche Fußball Liga. (2020b). Wirtschaftsreport 2020. https://media.dfl.de/sites/2/2020/02/DE_DFL_Wirtschaftsreport_2020_M.pdf

22.

Deutscher Fußball Bund. (2020). Mitglieder-Statistik 2020. https://www.dfb.de/fileadmin/_dfbdam/223584-Mitgliederstatistik.pdf

23.

Dustmann

Puhani

P. A.

Schönberg

(2016). The long-term effects of early track choice. Economic Journal, 127(603), 1348–1380.

24.

Feess

Gerfin

Muehlheusser

(2015). Contracts as rent-seeking devices: Evidence from German soccer. Economic Inquiry, 53(1), 714–730.

25.

Flepp

Franck

(2021). The performance effects of wise and unwise managerial dismissals. Economic Inquiry, 59(1), 186–198.

26.

Franck

Nüesch

(2012). Talent and/or popularity: What does it take to be a superstar? Economic Inquiry, 50(1), 202–216.

27.

Franzke

(2017). Nachwuchsförderung: 1,23 Milliarden Euro für die NLZs. Kicker. https://www.kicker.de/nachwuchsfoerderung_123-milliarden-euro-fuer-die-nlzs-711418/artikel

28.

Friebel

Heinz

Hoffman

Zubanov

(2019). What do employee referral programs do? Measuring the direct and overall effects of a management practice. NBER Working Paper (w25920).

29.

Fumarco

Rossi

(2018). The relative age effect on labour market outcomes–Evidence from Italian football. European Sport Management Quarterly, 18(4), 501–516.

30.

fupa.net. (2020). https://www.fupa.net/

31.

fussballfieber.de. (2017). Top 7 - Die Besten Jugendakademien Deutschlands: Sie legen besonders viel Wert auf Jugendarbeit. Fussballfieber. https://www.fussballfieber.de/p/2ha9td/top-7-die-besten-jugendakademien-deutschlands

32.

González-Díaz

Gossner

Rogers

B. W.

(2012). Performing best when it matters most: Evidence from professional tennis. Journal of Economic Behavior & Organization, 84(3), 767–781.

33.

Grossmann

Lames

(2013). Relative Age Effect (RAE) in football talents–The role of youth academies in transition to professional status in Germany. International Journal of Performance Analysis in Sport, 13(1), 120–134.

34.

Hanushek

E. A.

Rivkin

S. G.

(2009). Harming the best: How schools affect the black-white achievement gap. Journal of Policy Analysis and Management, 28(3), 366–393.

35.

Harb-Wu

Krumer

(2019). Choking under pressure in front of a supportive audience: Evidence from professional biathlon. Journal of Economic Behavior & Organization, 166, 246–262.

36.

Herm

Callsen-Bracker

H. M.

Kreis

(2014). When the crowd evaluates soccer players’ market values: Accuracy and evaluation attributes of an online community. Sport Management Review, 17(4), 484–492.

37.

Hill

Remer

(2020). Race and employment outcomes: Evidence from NBA coaches. Economic Inquiry, 58(3), 1469–1486.

38.

Hill

Sotiriadou

(2016). Coach decision-making and the relative age effect on talent selection in football. European Sport Management Quarterly, 16(3), 292–315.

39.

Hsieh

C.-T.

Hurst

Jones

C. I.

Klenow

P. J.

(2019). The allocation of talent and U.S. economic growth. Econometrica, 87(5), 1439–1474.

40.

Jackson

R. C.

Comber

(2020). Hill on a mountaintop: A longitudinal and cross-sectional analysis of the relative age effect in competitive youth football. Journal of Sports Sciences, 38(11-12), 1352–1358.

41.

Jiménez

I. P.

Pain

M. T.

(2008). Relative age effect in Spanish association football: Its extent and implications for wasted potential. Journal of Sports Sciences, 26(10), 995–1003.

42.

Kempa

(2022). Task-specific human capital and returns to specialization: Evidence from association football. Oxford Economic Papers, 74(1), 136–154.

43.

Keppel

Claessens

(2020). How the volunteers of data website Transfermarkt became influential players at European top football clubs. Follow The Money. https://www.ftm.nl/artikelen/transfermarkt-volunteers-european-football

44.

Kitchens

C. T.

(2015). Are winners promoted too often? Evidence from the NFL draft 1999–2012. Economic Inquiry, 53(2), 1317–1330.

45.

Lagestad

Steen

Dalen

(2018). Inevitable relative age effects in different stages of the selection process among male and female youth soccer players. Sports, 6(2), 29.

46.

Lovell

Towlson

Parkin

Portas

Vaeyens

Cobley

(2015). Soccer player characteristics in English lower-league development programmes: The relationships between relative age, maturation, anthropometry and physical fitness. PLoS one, 10(9), 1–14.

47.

Malina

R. M.

Chamorro

Serratosa

Morate

(2007). TW3 and Fels skeletal ages in elite youth soccer players. Annals of Human Biology, 34(2), 265–272.

48.

Malina

R. M.

Eisenmann

J. C.

Cumming

S. P.

Ribeiro

Aroso

(2004). Maturity-associated variation in the growth and functional capacities of youth football (soccer) players 13–15 years. European Journal of Applied Physiology, 91(5-6), 555–562.

49.

Malina

R. M.

Reyes

M. P.

Eisenmann

J. C.

Horta

Rodrigues

Miller

(2000). Height, mass and skeletal maturity of elite Portuguese soccer players aged 11–16 years. Journal of Sports Sciences, 18(9), 685–693.

50.

Mann

D. L.

van Ginneken

P. J.

(2017). Age-ordered shirt numbering reduces the selection bias associated with the relative age effect. Journal of Sports Sciences, 35(8), 784–790.

51.

Martindale

R. J.

Collins

Daubney

(2005). Talent development: A guide for practice and research within sport. Quest (Grand Rapids, Mich.), 57(4), 353–375.

52.

Mechtel

Bäker

Brändle

Vetter

(2011). Red cards: Not such bad news for penalized guest teams. Journal of Sports Economics, 12(6), 621–646.

53.

Muehlheusser

Schneemann

Sliwka

Wallmeier

(2018). The contribution of managers to organizational success: Evidence from German soccer. Journal of Sports Economics, 9(6), 786–819.

54.

Mujika

Vaeyens

Matthys

S. P.

Santisteban

Goiriena

Philippaerts

(2009). The relative age effect in a professional football club setting. Journal of Sports Sciences, 27(11), 1153–1158.

55.

Müller

Simons

Weinmann

(2017). Beyond crowd judgments: Data-driven estimation of market value in association football. European Journal of Operational Research, 263(2), 611–624.

56.

Murphy

Weinhardt

(2020). Top of the class: The importance of ordinal rank. The Review of Economic Studies, 87(6), 2777–2826.

57.

Musch

Grondin

(2001). Unequal competition as an impediment to personal development: A review of the relative age effect in sport. Developmental Review, 21(2), 147–167.

58.

Musch

Hay

(1999). The relative age effect in soccer: Cross-cultural evidence for a systematic discrimination against children born late in the competition year. Sociology of Sport Journal, 16(1), 54–64.

59.

Patel

Nevill

Cloak

Smith

Wyon

(2019). Relative age, maturation, anthropometry and physical performance characteristics of players within an Elite Youth Football Academy. International Journal of Sports Science & Coaching, 14(6), 714–725.

60.

Peeters

(2018). Testing the Wisdom of Crowds in the field: Transfermarkt valuations and international soccer results. International Journal of Forecasting, 34(1), 17–29.

61.

Pérez-González

Fernandez-Luna

Castillo

Burillo

(2020). Are European soccer players worth more if they are born early in the year? Relative age effect on player market value. International Journal of Environmental Research and Public Health, 17(9), 3301.

62.

Pérez-González

León-Quismondo

Bonal

Burillo

Fernández-Luna

Á.

(2021). The new generation of professional soccer talent is born under the bias of the RAE: Relative age effect in international male youth soccer championships. Children, 8(12), 1117.

63.

Pierson

Addona

Yates

(2014). A behavioural dynamic model of the relative age effect. Journal of Sports Sciences, 32(8), 776–784.

64.

Poli

Revenel

Besson

(2019). Financial analysis of the transfer market in the big-5 European leagues (2010–2019). CIES Football Observatory Monthly Report, 47, 1–8.

65.

ran.de. (2015). Top 10: Die besten Jugendakademien Deutschlands. ran. https://www.ran.de/fussball/bundesliga/news/top-10-die-besten-jugendakademien-deutschlands-153443

66.

Reilly

Williams

A. M.

Nevill

Franks

(2000). A multidisciplinary approach to talent identification in soccer. Journal of Sports Sciences, 18(9), 695–702.

67.

Roach

M. A.

(2022). Career concerns and personnel investment in the Major League Baseball player draft. Economic Inquiry, 60(1), 413–426.

68.

Roberts

S. J.

McRobert

A. P.

Rudd

Enright

Reeves

M. J.

(2021). Research in another un-examined (RAE) context. A chronology of 35 years of relative age effect research in soccer: Is it time to move on? Science and Medicine in Football, 5(4), 301–309.

69.

Rommers

Mostaert

Goossens

Vaeyens

Witvrouw

Lenoir

D’Hondt

(2019). Age and maturity related differences in motor coordination among male elite youth soccer players. Journal of Sports Sciences, 37(2), 196–203.

70.

Rösch

Hodgson

Peterson

Graf-Baumann

Junge

Chomiak

Dvorak

(2000). Assessment and evaluation of football performance. The American Journal of Sports Medicine, 28(5), 29–39.

71.

Sierra-Díaz

M. J.

González-Víllora

Pastor-Vicedo

J. C.

Serra-Olivares

(2017). Soccer and relative age effect: A walk among elite players and young players. Sports, 5(1), 5.

72.

Sponsors. (2019). Nachwuchsarbeit in der Bundesliga: Ein Geschäftsmodell? Sponsors online. https://www.sponsors.de/news/themen/nachwuchsarbeit-der-bundesliga-ein-geschaeftsmodell

73.

Statista. (2020). Anzahl der Unique User von transfermarkt.de von Mai 2018 bis November 2020. Statista. https://de.statista.com/statistik/daten/studie/945403/umfrage/besucherzahlen-von-transfermarktde-direkt-als-zeitreihe/

74.

Sæther

S. A.

(2016). Presence of the relative age effect and its effect on playing time among under-20 players in the Norwegian Premier League Tippeligaen—A four-year follow up. Montenegrin Journal of Sports Science and Medicine, 5(1), 11.

75.

Sundermeyer

(2009). Manipulierter Marktwert. ZEIT online. https://www.zeit.de/sport/fussball/2009-12/transfermarkt-spieler-marktwert

76.

transfermarkt.de. (2020). https://www.transfermarkt.de/

77.

Tribolet

Watsford

M. L.

Coutts

A. J.

Smith

Fransen

(2019). From entry to elite: The relative age effect in the Australian football talent pathway. Journal of Science and Medicine in Sport, 22(6), 741–745.

78.

Vaeyens

Philippaerts

R. M.

Malina

R. M.

(2005). The relative age effect in soccer: A match-related perspective. Journal of sports sciences, 23(7), 747–756.

79.

Votteler

Höner

(2014). The relative age effect in the German Football TID Programme: Biases in motor performance diagnostics and effects on single motor abilities and skills in groups of selected players. European Journal of Sport Science, 14(5), 433–442.

80.

Walker

(2016). Peak height velocity: The period of time a child experiences the largest increase in stature. https://www.scienceforsport.com/peak-height-velocity/#top

Fading Shooting Stars—The Relative Age Effect,Ability,and Foregone Market Values in German Elite Youth Soccer

Abstract

Keywords

Introduction

Related Literature

The Relative Age Effect in Soccer

Production Function of Elite Youth Academies and the Optimal Selection Policy

Stylized Facts on the RAE and Performance Development

Theoretical Framework

Basic Setup

The Effect of ‘Superior’ Elite Academy Training

Marginally Selected Players and Ability

Maximum Performance Levels and Foregone Market Values

Hypothesis 4 (Foregone Market Value):

Empirical Setting

Bundesliga Youth Academies

Data

Empirical Results

Relative Age Effect in Bundesliga Youth Academies

Relative Age Advantages, Training, Market Values, and Ability

Foregone Market Values

Conclusion

Footnotes

Declaration of conflicting interests

Funding

ORCID iDs

Notes

Author Biographies

Appendix

References