Exploring the relationships between potential,performance,and athleticism in elite youth soccer players

Abstract

Talent identification of young soccer players is a significant consideration for elite youth soccer organisations, with decisions on players often being made based on the subjective expert opinion of scouts and coaches. Previous research has suggested that scouts and coaches purportedly consider a multitude of factors before arriving at their decisions. However, to date, limited research exists regarding how coaches evaluate player potential within such organisations and how the consideration of further data may influence such evaluations of a player's future potential. Therefore, the aim of this study was to present ratings of perceived potential in an elite youth soccer academy and explore the relationships between a player's perceived potential, athleticism, and perceived level of current match performance. Using 181 player observations, findings demonstrated that ratings of players’ perceived potential decreased within older age groups and were the lowest in the senior academy age group (U18). Furthermore, a mixed effects ordinal regression model suggested that coaches’ perceptions of a player's future potential demonstrated strong relationships with current match performance and athleticism. Increases in a player's mean match rating significantly increased the odds of them being perceived to have higher potential and increases in athleticism also demonstrated a positive relationship with increased perceptions of potential. The findings have implications for youth soccer organisations, providing insight into the distribution of ratings of potential within an elite youth soccer academy, and demonstrating that further data (match performance and athleticism) is related to, and may influence, coaches’ perceptions of player potential.

Keywords

Association football coach bias subjective player assessment talent identification and development

Introduction

Identifying and developing talented young soccer players is a key aim for elite youth soccer academies. Talent identification (TID) involves the discovery of young players perceived to have the potential for future success^1,2 and can occur as early as 8 years of age in soccer, resulting in entry to a soccer academy. Talent development (TD) aims to provide a suitable learning environment to accelerate player potential towards the elite level through exposure to highly professionalized coaching, training, and support systems.^1,2 During the TID and TD process, players are often (de)selected at various stages of an academy pathway based upon their future potential, their development, and their performance within an academy programme. These TID, TD and (de)selection processes,^1,2 are traditionally coach-driven, contingent upon the comprehensive and intuitive knowledge of a coach being channelled into a decision on what a talented player is and/or what a talented player needs. Multiple approaches exist within the current literature regarding TID and (de)selection processes, with data coming from a variety of multidisciplinary sources and assessment methods (e.g., signs, samples, subjective expert opinion; see.³ However, minimal research exists exploring the information that coaches’ may (sub)consciously utilise and combine in their decision-making processes.⁴

Previous research has called into question the validity of TID and (de)selection judgements based on the subjective expert opinion (SEO) of high-level coaches, highlighting an absence of structure in (de)selection processes and a lack of specific criteria upon which decisions are based.⁵ Research has highlighted that coaches often utilise their gut instinct, making choices based on their own experience, personal taste, and their perceived ability to improve a player, rather than exclusively considering player potential and performance.^6–8 Due to the difficulty in objectively quantifying an explicit measure of performance in a dynamic, open-skilled sport such as soccer, the perceived performance levels of players are often used to indicate differences in skill levels.⁹ Whilst such an approach seems theoretically and practically suitable, such an approach often leads to the incorrect assumption that current performance is indicative of future potential.^10,11 This highlights the need for further research exploring coaches’ ratings of potential and any possible relationships between perceptions of player potential and other performance metrics (e.g., match performance, athleticism).

Further, although coaches consider multiple attributes when forming their opinions,^12–14 coaches SEO's are likely to be heavily influenced based on their observations of a player's performance in match situations.^9,15,16 However, studies in youth soccer TID have demonstrated biases (e.g., relative age, maturity biases), leading to further confusion regarding performance and potential ratings.^17,18 Extrapolating current performance as a measure of future skill level disregards the non-linear and individual nature of TD,^19–21 leading to a higher risk of mis-identifying high potential, talented young athletes. Whilst non-linear individual development and the complexity of human judgement cannot be solved using one single approach,²² there is a need for further research to understand how a coach's perception of current match performance, may influence their expectation and evaluation of future player potential.

Considering the limitations of purely subjective judgements, research has commonly used objective sources of data in examining TID, (de)selection, and TD processes.^23–26 Due to the complex and multifaceted nature of soccer, objective data has generally fallen into specific domains (e.g., physical, technical, tactical, psychological, anthropometric, sociological²); thought to relate to some element of soccer performance. Whilst a multidisciplinary approach considering all domains may be considered best practice, research within the physical domain is more commonplace.^25,26 However, research has demonstrated that coaches are unable to accurately assess players’ physical abilities subjectively in both elite youth soccer²⁷ and rugby league,²⁸ perhaps suggesting a difficulty for coaches in subjectively evaluating the athelticisim of homogeneous groups.²⁷ Despite this, several research articles have highlighted an abundance of physical qualities in coaches' self-reported attributes deemed important for TID, TD, and (de)selection processes.^12–16,29 Therefore, whilst it is unlikely that each distinct attribute (e.g., strength, speed, aerobic capacity) is accounted for in isolation by technical coaches, further research is required to determine if an athlete's physical qualities (i.e., athleticism) play a prominent role in coaches’ SEO and evaluations of future potential.

As previously highlighted, research has suggested that the use of clinical judgement (i.e., making decisions based solely on SEO in one's head^30,31); may be at risk of several errors and (sub)conscious biases.^9,31,32 Although the SEO of coaches is currently the primary method for TID and (de)selection within a soccer academy, limited research has explored how player's athleticism and match performance may affect perceptions of potential in elite youth soccer players. Such research would provide insight into the factors that contribute to the instinctual reasoning and tacit knowledge of coaches, that allows them to make their judgements.^6,7,32 Therefore, this study firstly presented coaches’ ratings of perceived potential in elite youth male soccer players across the U12 to U18 age groups within a professional UK academy. Secondly, the study explored the relationship between academy soccer player's athleticism (i.e., combination of physical qualities) and coaches’ subjective ratings of players’ current match performance with coaches’ perceptions of future player potential.

Methods

Participants

Six academy coaches working full-time within an elite youth soccer academy participated in this study across 2020/21, 2021/22 and 2022/23 playing seasons. Each participant was a lead coach for an individual squad across the U12-U18 age groups. On average, the participants had been coaching for 16.5 ± 6.7 years and had worked within the current academy for a minimum of one year at the commencement of the 2020/21 season. Additionally, all coaches were qualified to a minimum of a UEFA B Licence and as such were considered a qualified, representative, and experienced sample of elite youth soccer coaches.

Ethics

As part of registration to the academy, players and their parents/guardians consented to data being collected and gatekeeper consent was provided by the academy for the study. Institutional ethical approval was granted by Leeds Beckett University (Ethics Application No: 119713) approving the study.

Design

During the three seasons data was collected on one hundred and one elite youth male soccer players registered to the academy. This resulted in player observations (n = 181) for outfield playing positions in the U12 (n = 13), U13 (n = 34), U14 (n = 51), U15 (n = 31), U16 (n = 26) and U18 (n = 26) age groups. Data was collected from competitive matches (match performance), fitness testing (athleticism), and multidisciplinary team (MDT) meetings (player potential), to allow relationships between match performance, athleticism, and potential for future success to be explored.

Measures

Fitness tests

Objective data was collected within the academy based upon four measures of athleticism, which were also recommended as part of a national strategy under the Elite Player Performance Plan (EPPP).³³ All tests were familiar to players and were conducted by practitioners from the academy's physical development department who provided verbal instructions and demonstrations to players. Tests were conducted in September, January and April of each season on an indoor third generation synthetic surface (3G) a minimum of 48 h following competitive fixtures. The battery of fitness tests chosen, are commonly used in elite youth soccer populations^34–37 and their reliability has previously been demonstrated.^38–41 Tests were conducted in the subsequent order, following a standardized warm-up protocol of light jogging, dynamic stretching and progressive familiarization attempts: countermovement vertical jump (CMJ),⁴² twenty-metre linear sprint test (20 m),³⁵ modified version of the 505 change of direction test (505).⁴³ Additionally, a one-kilometre running time-trial (1KM)^35,42 was undertaken (U15 age group and older only) which was not a part of the EPPP benchmark fitness testing battery but formed a regular part of players’ annual physical development programme. The 1KM permitted a low-cost, time efficient method to measure aerobic fitness, over a distance that has been utilised in previous soccer literature.^37,44

For the CMJ athletes stood on dual force plates (ForceDecks Lite, Vald Performance, Brisbane, Australia) sampling at 1000 Hz, with their hands on their waist. Following a stance phase to determine bodyweight,⁴⁵ players were instructed to jump as high and as fast as possible following a self-selected countermovement depth. In the 20 m sprint test, players began in a standing start placed 1 metre behind the first timing gates (TCi System, Brower Timing System, Utah, USA). Players were instructed to maximally sprint through the timing gates without slowing before the final timing gate. Similarly, for the 505 test, players began from a standing start 1 metre behind the timing gates (TCi System, Brower Timing System, Utah, USA) and were instructed to maximally sprint to a line 5 metres away, change direction ensuring at least one foot made contact with the turning line, before sprinting back through the timing gates. For the 1KM, players were individually timed using a handheld stopwatch (Casio HS-80TW, Casio, Tokyo, Japan) completing a single lap of a custom 1 kilometre running track based at the academy's training ground. For the CMJ and 20 m tests, players completed 3 maximal trials, for the 505 players completed 4 maximal trials (turning twice on each leg), and for the 1KM players completed 1 maximal trial. Recovery periods were a minimum of 2 min following each trial for the CMJ, 20 m and 505 tests. The best value from each test for CMJ height (cm), 20 m time (s), 505 time (s) and 1KM time (s) were selected for subsequent analysis.

Total score of athleticism

The Total Score of Athleticism (TSA) was calculated through averaging a set of standardized scores (z-scores) from a series of tests (i.e., CMJ, 20 m, 505, 1KM) to provide a single score of general athleticism that is comparable within a squad or team. Z-scores were calculated by subtracting the sample mean from the player's best score and dividing by a sample standard deviation (SD) for each individual test. Sample means (benchmarks) and SDs were provided by the Premier League as part of the national fitness testing strategy for each individual age group,³³ with the exception of the 1KM where the mean and SD were the result of historical club data records. In situations where negative z-score values were indicative of superior performance (e.g., a 20 m time below the sample mean), z-scores were multiplied by −1 to transform negative values into positive values allowing for aligned polarity across individual tests contributing to the TSA.⁴⁶ The highest TSA of the corresponding season (across the four testing timepoints) was included for subsequent analysis as it theoretically represented a players best physical performance for that season.

Match ratings

Match ratings were collected for each player for each match they participated in. Coach ratings were assigned to each player based on their perceived level of match performance relative to other players of a similar age and skill level within the academy. Ratings were provided independently following each match by a single lead coach, within each respective age group, based on that coach's SEO, with no input from other staff members (i.e., the lead coach from the U12-U18 age groups provided the match ratings for players within their team). Ratings were conducted on a 7-point scale, with the following descriptors: 1 – below academy standard, 2 – approaching academy standard, 3 – meeting academy standard, 4 – exceeding academy standard, and 0.5 unit increments between each descriptor for players falling between standards. A seasonal mean match rating was utilised for analysis, as this represented general performance over multiple matches, discounting possible one-off or atypical performances from individual players. Additionally, a mean match rating was a variable utilised within the academy when comparing players, allowing some form of distinction between players with similar ratings, rather than players being dichotomised into groups based on a singular score from the match rating descriptors. Players had competed in a minimum of five matches per season, and for at least 40 min in their respective matches for the match rating to be included in analysis. The average number of match ratings per player per season was 20 ± 10, whilst the range for the number of match ratings was 5–46. Such coach-based ratings have been utilised in previous research,^18,47,48 with coach subjective ratings demonstrating acceptable intra-rater reliability correlations (.50 ≤ r_tt≤ .85²⁹; but poor inter-rater reliability (ICC = 0.14, 95% CI = −0.04–0.39;.⁴⁹ Internal estimates demonstrated slight to moderate inter-rater reliability assessed via the Krippendorff's alpha coefficient (α = 0.34, CI = 0.12–0.53;.⁵⁰

Perceived potential

As part of MDT meetings conducted within the academy, coaches were asked to consider their perceptions of a player's future potential (i.e., the perception the player would have a successful career at the elite senior level). Similarly to match ratings, lead coaches for each age group were individually responsible for rating perceived future potential, following conversations with other coaches and support staff. Ratings demonstrated good absolute agreement, using a two-way mixed effect intraclass correlation coefficient (ICC = 0.84, p < 0.001) with the lead coach as a single rater,⁵¹ indicating coaches were fairly consistent in their own repeated ratings of individual players’ potential. Ratings were based on a RAG + B (Red, Amber, Green, Blue) colour grading system, similar to ratings commonly used in team sport monitoring and decision-support systems,⁵² and previous research relating to player ratings.⁴⁸ Blue ratings represented players with the highest potential, with Red ratings representing players perceived to have the least chance of achieving future elite status (i.e., lowest potential). Six MDT meetings were conducted per season with perceptions of a player's future potential provided in each meeting. The most frequent rating (mode) from the RAG + B system was used to represent coach's perceived potential for each individual player in each season.

Statistical analysis

Data was analysed via R Studio.⁵³ Descriptive statistics were calculated representing the match rating and TSA associated with each level of perceived potential. A series of cumulative link mixed models using maximum likelihood estimates were fitted using the R package ordinal.⁵⁴ Cumulative link mixed models (CLMM) are a type of ordinal regression which permitted analysing the ordinal response variable (perceived potential), whilst accounting for the hierarchical structure of the data, with observations nested within coaches.

Ordinal regression permitted the dependent variable (perceived potential) to be treated as an ordered categorical variable (with a non-equidistant relationship), without being falsely modelled as a continuous or unordered response. This method modelled the relationships between the variables and predicted the probability of an observation (player) being classified at a specific level of perceived potential, based upon their mean match rating and TSA values. Perceived potential labels (RAG + B) were transformed to adapt to ordinal regression and reassigned values of: Red = 1, Amber = 2, Green = 3, Blue = 4. A forward stepwise approach to the models was undertaken, starting with a basic null (intercept only) model, before adding parameters in a stepwise fashion, comparing model fit following each iteration.⁵⁵ Likelihood ratio tests and the Akaike Information Criterion (AIC) were used as measures to assess goodness-of-fit, with the AIC accounting for the number of parameters estimated and correcting for model complexity.⁵⁵ If additional parameters did not improve model fit, they were removed from the subsequent iteration. The first model included only the response variable (perceived potential), without the addition of any predictors variables (Null Model). This model represented a reference point to evaluate the performance and significance of more complex models. A second model was then fitted with the first predictor variable (Match Rating) and random intercepts, allowing for individual variation based on the repeated measurements of individual players by individual coaches (Model 1). In the third model, the second predictor variable (TSA) was included as a fixed effect (Model 2). The fourth model added season as a fixed effect and included any interaction effects with both match rating and TSA, in order to determine if any associations with the predictors and the outcome perceived potential differed across seasons (Model 3). Finally, Models 5 and 6 were fitted with match rating and TSA with random effects for coaches (Model 5) and players (Model 6), allowing their slopes to vary to determine if the effects of match rating and TSA differed between individual coaches and players when assessing perceived potential.

Results

Table 1 presents the mean and SD for match ratings and TSA, for each level of perceived potential rating (Red, Amber, Green, Blue), within each age group. Whilst sample sizes were small for some groups this reflects the applied nature of the study and was considered within the statistical analysis applied.

Table 1.

Descriptive statistics showing means (M) and standard deviations (SD) of match ratings and TSA across levels of perceived potential within age groups.

	Red					Amber					Green					Blue					All
		Match Rating		TSA			Match Rating		TSA			Match Rating		TSA			Match Rating		TSA			Match Rating		TSA
	n	M	SD	M	SD	n	M	SD	M	SD	n	M	SD	M	SD	n	M	SD	M	SD	n	M	SD	M	SD
U12	2	2.32	0.31	−0.06	0.25						2	2.93	0.35	0.31	0.38	9	3.42	0.31	0.58	0.62	13	3.18	0.51	0.44	0.58
U13	2	1.86	0.19	−0.33	0.07	7	2.23	0.09	0.33	1.14	22	2.68	0.35	0.41	0.78	3	3.64	0.08	1.14	0.40	34	2.62	0.50	0.41	0.85
U14	5	2.64	0.14	0.17	1.30	8	2.52	0.17	−0.14	0.37	30	2.91	0.19	0.49	0.60	8	3.12	0.30	0.93	0.40	51	2.86	0.27	0.43	0.70
U15						5	2.62	0.43	0.83	0.48	18	2.78	0.28	0.58	0.60	8	3.16	0.38	0.62	0.34	31	2.85	0.38	0.63	0.52
U16						7	2.62	0.26	−0.07	0.39	13	2.91	0.26	0.54	0.57	6	3.15	0.15	0.51	0.31	26	2.89	0.30	0.37	0.53
U18	12	2.45	0.24	0.23	0.70	6	2.49	0.26	0.55	0.40	4	2.73	0.15	0.18	0.31	4	2.74	0.41	0.10	0.55	26	2.55	0.28	0.27	0.56
Total	21	2.43	0.29	0.14	0.80	33	2.49	0.28	0.25	0.70	89	2.82	0.28	0.47	0.62	38	3.21	0.37	0.64	0.50	181	2.79	0.40	0.43	0.65

When observing match ratings, the U18 age group had the lowest overall mean match rating based on all levels of potential (2.55 ± 0.28) with the highest mean match rating occurring in the U12 age group (3.18 ± 0.51). The U18 age group also had the lowest overall mean TSA (0.27 ± 0.56), with the U15 age group demonstrating the highest overall mean TSA (0.63 ± 0.52).

In relation to perceived potential ratings, a green rating had the highest total frequency (n = 89; 49.2%), followed by blue (n = 38; 21.0%), amber (n = 33; 18.2%) and red (n = 21; 11.6%). Figure 1 shows the distribution of perceived potential ratings across the age groups. The U12 age group had the highest tally of blue perceived potential responses (n = 9), with the U18 age group recording the highest tallies for the lowest perceived potential (Red, n = 12). Of note, no players playing in the U15 or U16 age groups across the three seasons were ever regarded as having the lowest perceived potential (Red).

Figure 1.

Frequency counts of the dependent variable Perceived Potential across age groups. Red = Lowest Perceived Potential, Blue = Highest Perceived Potential.

Ordinal regression was used to explore the relationships between player potential, match performance and TSA. Model selection based on the AIC resulted in Model 2 being treated as the final model. Table 2 highlights comparisons between the CLMM's following each iteration against the best fitting model. Model 2 had a high degree of model fit (AIC = 349.88) and provided a significantly better fit than the null model (AIC = 453.78; χ2 = 111.90, df = 4, p < 0.001), and Model 1 (AIC = 351.89; χ2 = 4.01, df = 1, p < 0.05). Model 3 did not improve significantly upon Model 2 (χ2 = 5.36, df = 2, p > 0.05) but resulted in a lower AIC (AIC = 348.52). However, Model 2 was chosen as the most parsimonious model, providing a good balance between model complexity and fit. The addition of interaction effects for season (Model 4), random effects for match rating and TSA within coaches (Model 5), and random effects for match rating and TSA within players, failed to improve upon Model 3. This suggests that the effects of season and allowing the slopes for match rating and TSA to vary per individual coach and player, had no significant effect on improving the modelled relationship between the variables and perceived potential. Additionally, a correlation coefficient of 0.22 between the predictor variables (Match Rating and TSA) suggested that use of the CLMM accurately captured the relationships between variables with no effects of multicollinearity amongst the predictor variables.

Table 2.

Comparison of model fit indices.

Cumulative Link Mixed Models Iterations	AIC	χ2	df	p
Null vs. Model 1 (Match Rating + Random Intercepts Model)	351.89	107.89	3	<0.001**
Model 1 vs. Model 2 (Match Rating + TSA + Random Intercepts Model)	349.88	4.01	1	<0.05*
Model 2 vs. Model 3 (Match Rating + TSA + Season + Random Intercepts Model)	348.52	5.36	2	0.07
Model 2 vs. Model 4 (Match Rating* Season + TSA* Season + Random Intercepts Model)	351.85	10.03	6	0.12
Model 2 vs. Model 5 (Match Rating + TSA + Random Intercepts and Random Slopes [Coaches] Model)	348.84	11.04	5	0.05
Model 2 vs. Model 6 (Match Rating + TSA + Random Intercepts and Random Slopes [Players] Model)	359.83	0.05	5	1.00

Notes: ** = p < 0.001, * = p < 0.05.

The final model chosen (Model 2) was fitted as a CLMM with the Laplace approximation, modelling the relationship between the predictors match rating and TSA and the ordinal response variable perceived potential. The model incorporated random intercepts for the grouping variables coach and player, which represented repeated measures within individual coaches players. Model fit statistics are presented in Table 3 including the coefficients for fixed effects (β), their standard error estimates (SE), their significance (p) assessed via the Wald statistic (Wald χ2), and finally the calculated odds ratios (OR) and their 95% confidence intervals (CI). Random intercept variance (τ00), intraclass correlation coefficient (ICC) and number of individual players (n) are also presented for the random effect of player.

Table 3.

Cumulative link mixed model for associations between match rating and TSA on rating of perceived potential.

Fixed Effects	β	SE	Wald χ2	p	OR	95% CI
Match Rating	4.78	0.71	6.77	<0.001*	119.35	20.90–476.40
TSA	0.59	0.33	1.75	0.08	1.79	0.93–3.45

Random Effects	τ00	ICC	n
Coach	0.36	0.26	6
Player	0.79	0.26	101

Notes: Bold = p < 0.001.

Considering the fixed effects, Model 2 demonstrated a significant positive association between match rating and being perceived in a higher category of potential (β= 4.78, SE = 0.71, p < 0.001, 95% CI = 3.40–6.17). The odds ratio for match rating was 119.35 (95% CI = 20.90–476.40). This suggests that for every one-unit increase in mean match rating, the odds of individual players being perceived in a higher potential category are over 100 times greater, indicating a strong positive association between higher match ratings and the likelihood of being perceived as a higher potential player.

Predicted probabilities demonstrating the relationship between match rating and perceived potential are presented in Figure 2 which highlights a non-linear relationship between match rating and level of perceived potential. The highest probabilities of being perceived in each category occur simultaneously with increasing mean match rating scores.

Figure 2.

Predicted probabilities of being perceived at each level of potential in relation to mean match rating.

TSA also indicated a positive but non-significant association with perceived potential (β= 0.64, SE = 0.34, p > 0.05, 95%CI = -0.07–1.24). Odds ratios suggest that for a one unit increase in TSA, the odds of a player being perceived in a higher category of potential are almost twice as likely (OR = 1.79, 95% CI = 0.93–3.45). Figure 3 displays the predicted probabilities highlighting the relationship between TSA and the probability of being perceived at a specific level of potential.

Figure 3.

Predicted probabilities of being perceived at each level of potential in relation to TSA.

The random effects of the model (i.e., players), account for the variability in the perceived potential response that can’t be explained by the predictor variables (mean match rating and TSA), allowing each player to have a different response category for perceived potential whilst considering the repeated measurements within that player. The estimated random intercept variance (between-subject variance) is τ00 = 0.36 for the random effect of coaches (n = 6) and τ00 = 0.79 for the random effect of players (n = 101), indicating the variability in response attributable to individual differences. The ICC of 0.26 suggests that 26% of total variation in perceived potential response is due to differences between individual coaches and players that is unexplained by mean match rating or TSA alone.

A graphical representation of the findings, displaying players TSA, mean matching rating and level of future perceived potential is plotted in Figures 4 (distinct levels of potential) and 6 (combined plot), highlighting a general observation of better current performance and physical ability leading to a higher level of perceived future potential. The blue ellipse in Figure 5 represents those perceived to have the highest future potential, with the majority of players perceived to have high potential showing above average scores for both mean matching rating and TSA.

Figure 4.

Relationships between mean match rating, TSA, and perceived potential separated by category of perceived potential.

Figure 5.

Relationships between mean match rating, TSA, and perceived potential across all categories of perceived potential.

Discussion

Talent selection risks have previously been presented as a combination of potential and performance, often informed by coaches’ SEOs.¹⁰ However, limited research exists exploring evaluations of player potential^5,7,33 through coaches’ SEOs and the relationships between data that may influence such evaluations (e.g., match performance, athleticism). Therefore, this study presented ratings of perceived potential from players in an elite youth soccer academy and explored the relationships between perceived potential, athleticism, and match performance. The study findings showed that ratings of player potential decreased as age groups increased and were lowest within older age groups (e.g., U18). Secondly, a key finding was that coaches’ perceptions of a player's future potential are strongly related to a player's current match performance, based on (mean) match ratings from the current season. Additionally, findings also highlighted an association between a player's current athleticism (TSA) from the current season and their perceived future potential, however, this finding was statistically non-significant. These findings have implications for TID, the rating of future potential, and the factors that may impact upon possible ratings for monitoring player development within soccer academies.

Player potential

In relation to players perceived potential ratings, within the U12-U18 age groups, over 70% of players were perceived to have high levels of potential (Green or Blue ratings). The U12 and U15 age groups demonstrated the greatest percentage of players perceived to have high potential with 85% and 84% of players within those groups respectively being rated as green or blue potential players. Such findings may be reflective of a degree of uncertainty amongst coaches, who do not wish to “miss out” on potential future elite players and may also represent a form self-serving bias from the coaches within these age groups, who possibly overestimate player's potential to justify the impact their coaching has had on such players.^6,32

The highest count of players perceived to have the lowest level of potential (Red) occurred within the U18 age group. Only 31% of players within this age group were perceived to have high potential (Green or Blue). Such findings are in agreement with the work of Kite et al., (2023), who demonstrated coaches’ perceptions of player potential were capable of determining (de)selection status, but only in close proximity to the (de)selection decision itself. Intuitively, such findings are repeated in the current study, with a lower frequency of high potential ratings at the U18 age group, reflective of the limited quantity and opportunity for players to obtain professional contracts at the elite level.⁵⁶ Finally, as the U18 age groups signifies the culmination of the developmental academy pathway (i.e., final age group prior to possibly receiving a professional contract), the lower frequency of high potential ratings may coincide with a coach perceiving such players have limited remaining time for development.

Objective and subjective assessments

Findings demonstrated that both subjective ratings of soccer performance (mean match rating) and objective measures related to soccer performance (TSA) may be linked to coaches’ perceptions of future potential. These findings are in agreement with previous research highlighting the use of both objective and subjective data in providing valid prognostic relevance for future career level.^23,26 For example, Sieghartsleitner et al., (2019), found that the use of coaches’ SEO (rating of in-game performance, i.e., match rating) combined with objective assessments (physical attributes, biological maturation, psychological skills, familial support, technical skills and training history) provided significantly better predictions of future career (de)selection than the use of objective assessment alone in elite youth soccer players. Similarly to the current study, in-game performance was shown to have a significant contribution as a single variable. Whilst undoubtedly a valuable assessment tool, such findings demonstrate prior evidence of the possible misuse of current performance as an indicator of potential given they are regarded as distinct constructs. Additionally, Höner et al., (2021), assessed a variety of soccer related performance factors using both objective (sprint, agility, dribbling, ball control, juggling) and subjective assessments (kicking skills, endurance, tactical skills, psychosocial skills), demonstrating that combining both objective and subjective assessments provided better predictions of future career success. Such findings support the notion that player related attributes (e.g., athleticism) influence coaches’ perceptions of future success (i.e., potential).

Associations between match rating and perceived potential

The strong relationship between match rating and perceived potential in the current study is consistent with the increased use of coaches’ SEO within both research and practice.^26,47,48 In the results of the current study, the U18 age group demonstrated consistently lower match ratings and a low percentage of high potential ratings (31%), with the U12 age group demonstrating the highest match ratings and the highest frequency of high potential ratings (n = 9) in comparison to other age groups. Additionally, match ratings increased concurrently with ratings of perceived potential across all age groups. Such findings further highlight the possibility that coaches were (un)intentionally extrapolating current performance to infer future potential.¹⁰ Equally, it could be argued that the U18 age group represents a significantly greater standard of play and expectations in comparison to age groups within the YDP phase, partly explaining the lower mean match ratings. Further, it could be suggested that as the U18 age group represents a biannual age category (i.e., U17 and U18 players), the combination of more (U18) and less (U17) experienced players who coaches will have spent more and less time with those groups respectively, may have impacted coaches’ judgements, as match ratings are likely to be an integration of technical/tactical aspects and knowledge of the player in other domains (e.g., psychological skills, training history, familial support^26,32;

Associations between TSA and perceived potential

Whilst statistically non-significant in the current study, the positive relationship between TSA (representative of a player's physical ability) and future potential is in keeping with previous research highlighting favourable physical differences between future selected and de-selected players.^36,57 Results from the current study suggest that, generally, more athletic players were more likely to be considered higher potential by coaches with coaches perceiving them to have greater chance of future elite success. Although physical abilities are directly important to soccer match play, our findings perhaps highlight that high levels of ability across multiple disciplines are important when considering the future potential of young soccer players.^23,24,26 However, caution is warranted when considering a player's level of athleticism in relation to future potential, given the well documented differences in the development of athleticism when considering individual player maturation.^35,58,59

Limitations

Firstly, limitations arise when considering the findings of the current study are limited to a single professional soccer academy in the UK, where findings may not be representative of all elite youth male soccer academies who may adopt different philosophies and processes in relation to TID, TD and (de)selection. Additionally, evaluations of both subjective opinion of soccer performance (match rating) and objective scores of player athleticism (TSA) may have represented static observations of athlete qualities.³² Whilst our current data aggregated repeated measurements of a player across a season, the data still represents an evaluation that fails to consider changes over time (i.e., development). Further, when assessing a player's TSA, biological maturation was not accounted for, which is known to affect measures of athleticism and coaches’ evaluations of match performance in academy soccer players.^35,47,58 Additionally, sample sizes were small within levels of perceived potential - within age groups. However, limited sample sizes can be expected given the applied nature of the research within a single soccer academy and despite the limitations, our findings may in fact represent a reality and true reflection of observations across numerous soccer academies. Finally, only match ratings and TSA were considered within the current study representing only certain elements that may contribute to player potential (i.e., technical, tactical, physical). A multidisciplinary approach is recommended for future research that considers physical,⁶⁰ technical / tactical,^61,62 psychological,⁶³ and sociological⁶⁴ factors that may contribute to TID and perceptions of player potential.^2,65

Practical applications

The positive associations found between the predictor variables (match rating, TSA) and future perceived potential perhaps suggest a limitation of coaches in their ability to evaluate the non-linear, dynamic development of talent. Practically, understanding how potential is evaluated within elite youth soccer players throughout various age groups and understanding relationships that may contribute to such evaluations may be a key application and reflection for youth soccer academies. As a hypothetical example, overlaying the data from the current study with the proposed risk matrix from Baker et al. (2018) as shown in Figure 6, would result in no player in the current study being categorised as high potential (Blue) but under performing (Box 1). Similarly, very few players categorised as having lower potential (Red/Amber), would currently be performing well (Box 9). Disregarding the development of talent and assuming it is a fixed capacity, that is reflected through current performance, remains a central problem for TID and TD as it increases the risks associated with (de)selection decisions.¹⁰ As such, clear definitions of performance (e.g.,⁶⁶ and potential should be made explicit within organisations to facilitate objective and valid assessments and improve methods relating to coaches’ assessments.⁶³

Figure 6.

Overlay of mean match rating and perceived potential results on a redrawn version of Baker et al.'s (2018) risk matrix. Note: points have been jittered along perceived potential axis for illustrative purposes.

Findings from the current study demonstrate lower ratings of potential within the U18 age group in comparison to U12-U16 age groups and highlight that current match performance and a player's level of athleticism has a strong relationship with coaches’ perceptions of player potential. Undertaking such analysis within organisations can aid in discerning possible (sub)conscious biases relating to physical abilities^17,35 and perceived match performance^18,47 from coaches’ perspectives, and can highlight a possible misunderstanding between performance and potential.

Conclusion

This study presented player potential ratings from a soccer academy and explored the relationships between potential, athleticism and match performance ratings. Ratings of potential were highest in the U12 age group and lowest in the U18 age group, with 70% of all players being perceived to have high potential (Green or Blue). Overall, players rated as having the highest potential (Blue), demonstrated higher mean match ratings and TSA, with the exception of TSA in the U16 and U18 age groups. An ordinal regression model established a significant and positive association between a player's match rating and the odds of them being rated in a higher category of perceived potential. Additionally, a player's TSA was also found to have a positive association with coaches’ ratings of perceived potential, although there was some uncertainty in the coefficient estimate. These key findings highlight that a coach's perceived levels of future performance for individual players may be strongly linked to those players’ levels of current performance, as evidenced through the coaches’ match ratings. With this in mind, it is difficult to conclude how a single discrete rating of performance can be fully explanatory of a coach's evaluation of talent. More work is required to develop a deeper understanding of what coaches may consider when evaluating player performances in match situations, as this study highlights such considerations may heavily influence perceptions of future potential. Additionally, this research also found that a player's current level of athleticism (TSA) may impact the coach's perceived level of player potential, with more athletic players regarded as having a higher potential for future success in soccer. Coaches should be aware of potential biases that affect their subjective player assessments and be cautious of the differences between performance and potential when evaluating their players, given the longitudinal, non-linear, dynamic nature of talent development.

Footnotes

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Sam Barraclough

References

Williams

Reilly

. Talent identification and development in soccer. J Sports Sci 2000; 18: 657–667.

Williams

Ford

Drust

. Talent identification and development in soccer since the millennium. J Sports Sci 2020; 38: 1199–1210.

Barraclough

Till

Kerr

, et al. Methodological Approaches to Talent Identification in Team Sports: A Narrative Review. Sports 2022; 10: 81.

Bergkamp

TLG

Frencken

WGP

Niessen

ASM

, et al. How soccer scouts identify talented players. Eur J Sport Sci 2021; 22: 994–1004.

Johansson

Fahlén

. Simply the best, better than all the rest? Validity issues in selections in elite sport. Int J Sports Sci Coach 2017; 12: 470–480.

Roberts

Greenwood

Stanley

, et al. Understanding the “gut instinct” of expert coaches during talent identification. J Sports Sci 2021; 39: 359–367.

Christensen

. An eye for talent’: talent identification and the ‘practical sense’ of top-level soccer coaches. Sociol Sport J 2009; 26: 365–382.

Lund

Söderström

. To see or not to see: talent identification in the Swedish football association. Sociol Sport J 2017; 34: 248–258.

Bergkamp

TLG

Niessen

ASM

den Hartigh

RJR

, et al. Methodological issues in soccer talent identification research. Sports Med 2019; 49: 1317–1335.

10.

Baker

Schorer

Wattie

. Compromising talent: issues in identifying and selecting talent in sport. Quest 2018; 70: 48–63.

11.

Vaeyens

Lenoir

Williams

, et al. Talent identification and development programmes in sport current models and future directions. Sports Med 2008; 38: 703–714.

12.

Roberts

McRobert

Lewis

, et al. Establishing consensus of position-specific predictors for elite youth soccer in England. Science and Medicine in Football 2019; 3: 205–213.

13.

Towlson

Cope

Perry

, et al. Practitioners’ multi-disciplinary perspectives of soccer talent according to phase of development and playing position. Int J Sports Sci Coach 2019; 14: 528–540.

14.

Kite

Noon

Morris

, et al. British Soccer academy personnel perceive psychological and technical/tactical attributes as the most important contributors to development. Journal of Science in Sport and Exercise 2021; 4: 37–48.

15.

Fuhre

Øygard

Saether

. Coaches’ criteria for talent identification of youth male soccer players. Sports 2022; 10: 14.

16.

Larkin

O’Connor

. Talent identification and recruitment in youth soccer: Recruiter’s perceptions of the key attributes for player recruitment. PLoS One 2017; 12: e0175716.

17.

Towlson

Cobley

Midgley

, et al. Relative age, maturation and physical biases on position allocation in elite-youth soccer. Int J Sports Med 2017; 38: 201–209.

18.

Hill

Scott

McGee

, et al.

Are relative age and biological ages associated with coaches’ evaluations of match performance in male academy soccer players?

Int J Sports Sci Coach 2021; 16: 227–235.

19.

Cobley

Till

O’Hara

, et al. Variable and changing trajectories in youth athlete development: further verification in advocating a long-term inclusive tracking approach. J Strength Cond Res 2014; 28: 1959–1970.

20.

Abbott

Button

Pepping

G-J

, et al. Unnatural selection: Talent identification and development in sport. Nonlinear Dynamics Psychol Life Sci 2005; 9: 61–88.

21.

Baker

Wattie

Schorer

. A proposed conceptualization of talent in sport: the first step in a long and winding road. Psychol Sport Exerc 2019; 43: 27–33.

22.

Karelaia

Hogarth

. Determinants of linear judgment: a meta-analysis of lens model studies’: correction. Psychol Bull 2008; 134: 741–741.

23.

Höner

Murr

Larkin

, et al. Nationwide subjective and objective assessments of potential talent predictors in elite youth soccer: an investigation of prognostic validity in a prospective study. Front Sports Act Living 2021; 3: 638227.

24.

Vaeyens

Malina

Janssens

, et al. A multidisciplinary selection model for youth soccer: the Ghent Youth Soccer Project. Br J Sports Med 2006; 40: 928–934.

25.

Forsman

Blomqvist

Davids

, et al. Identifying technical, physiological, tactical and psychological characteristics that contribute to career progression in soccer. Int J Sports Sci Coach 2016; 11: 505–513.

26.

Sieghartsleitner

Zuber

Zibung

, et al. Or Coaches’ eye?-both! beneficial collaboration of multidimensional measurements and coach assessments for efficient talent selection in elite youth football. J Sports Sci Med 2019; 18: 32–43.

27.

Dugdale

Sanders

Myers

, et al. A case study comparison of objective and subjective evaluation methods of physical qualities in youth soccer players. J Sports Sci 2020; 38: 1304–1312.

28.

McCormack

Jones

Elliott

, et al. Coaches’ assessment of players physical performance: subjective and objective measures are needed when profiling players. Eur J Sport Sci 2021; 22: 1177–1187.

29.

Jokuschies

Gut

Conzelmann

. Systematizing coaches’ ‘eye for talent’: player assessments based on expert coaches’ subjective talent criteria in top-level youth soccer. Int J Sports Sci Coach 2017; 12: 565–576.

30.

Dawes

Faust

Meehl

. Clinical versus actuarial judgment. Science (1979) 1989; 243: 1668–1674.

31.

Den Hartigh

RJR

Niessen

ASM

Frencken

WGP

, et al. Selection procedures in sports: improving predictions of athletes’ future performance. Eur J Sport Sci 2018; 18: 1191–1198.

32.

Roberts

Greenwood

Stanley

, et al. Coach knowledge in talent identification: a systematic review and meta-synthesis. J Sci Med Sport 2019; 22: 1163–1172.

33.

The Premier League. Elite Player Performance Plan. 2011.

34.

Moran

Paxton

Jones

, et al. Variable long-term developmental trajectories of short sprint speed and jumping height in English Premier League academy soccer players: an applied case study. J Sports Sci 2020; 38: 2525–2531.

35.

Murtagh

Brownlee

O’Boyle

, et al. Importance of speed and power in elite youth soccer depends on maturation Status. J Strength Cond Res 2018; 32: 297–303.

36.

Emmonds

Till

Jones

, et al.

Anthropometric, speed and endurance characteristics of English academy soccer players: do they influence obtaining a professional contract at 18 years of age?

Int J Sports Sci Coach 2016; 11: 2212–2218.

37.

Sperlich

De Marées

Koehler

, et al. Effects of 5 weeks' high-intensity interval training vs. volume training in 14-year-old soccer players. The Journal of Strength and Conditioning Research 2011; 25: 1271–1278.

38.

Dugdale

Arthur

Sanders

, et al. Reliability and validity of field-based fitness tests in youth soccer players. Eur J Sport Sci 2019; 19: 745–756.

39.

Paul

Nassis

. Physical fitness testing in youth soccer: issues and considerations regarding reliability, validity and sensitivity. Pediatr Exerc Sci 2015; 27: 301–313.

40.

Altmann

Ringhof

Neumann

, et al. Validity and reliability of speed tests used in soccer: A systematic review. PLoS One 2019; 14: e0220982.

41.

Rago

Brito

Figueiredo

, et al. Countermovement jump analysis using different portable devices: Implications for field testing. Sports 2018; 6: 91.

42.

Glatthorn

Gouge

Nussbaumer

, et al. Validity and reliability of optojump photoelectric cells for estimating vertical jump height. J Strength Cond Res 2011; 25: 556–560.

43.

Taylor

Cunningham

Hood

, et al. The reliability of a modified 505 test and change-of-direction deficit time in elite youth football players. Science and Medicine in Football 2019; 3: 157–162.

44.

Clancy

Green

Curnyn

, et al. The concurrent validity and between-session reliability of a 1000 m time trial for the assessment of aerobic fitness in elite development soccer players. Sport Performance and Science Reports 2020; 92: 1–3.

45.

McMahon

Suchomel

Lake

, et al. Understanding the key phases of the countermovement jump force-time curve. Strength Cond J 2018; 40: 96–106.

46.

Turner

Jones

Stewart

, et al. Total score of athleticism: holistic athlete profiling to enhance decision-making. Strength Cond J 2019; 41: 91–101.

47.

Hill

Scott

McGee

, et al. Coaches’ evaluations of match performance in academy soccer players in relation to the adolescent growth spurt. Journal of Science in Sport and Exercise 2020; 2: 359–366.

48.

Kite

Noon

Morris

, et al. Observations of player (de)Selection within a professional UK soccer academy. Journal of Science in Sport and Exercise 2023; 6: 71–80.

49.

Bergkamp

TLG

Meijer

den Hartigh

RJR

, et al. Examining the reliability and predictive validity of performance assessments by soccer coaches and scouts: The influence of structured collection and mechanical combination of information. Psychol Sport Exerc 2022; 63: 102257.

50.

Krippendorff

Computing Krippendorff’s Alpha-Reliability, http://repository.upenn.edu/asc_papershttp://repository.upenn.edu/asc_papers/43 (accessed 23 January 2024), 2011.

51.

Koo

. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med 2016; 15: 155–163.

52.

Robertson

Bartlett

Gastin

. Red, amber, or green? Athlete monitoring in team sport: the need for decision-support systems. Int J Sports Physiol Perform 2017; 12: 73–79.

53.

R Core Team. R: A Language and Environment for Statistical Computing, https://www.R-project.org . 2023.

54.

Christensen

RHB.

ordinal—Regression Models for Ordinal Data, https://CRAN.R-project.org/package=ordinal . 2022.

55.

Field

. Discovering Statistics Using IBM SPSS Statistics. Fifth Edition. London: Sage Publications, 2017.

56.

Allen

. 0.5% OF U9S Will Make It to a Pro Club First Team. https://www.thefa.com/news/2016/Sep/14/richard-allen-interview-boot-room-talent-id-courses (accessed 31 January 2024). 2016.

57.

Deprez

Fransen

Lenoir

, et al. A retrospective study on anthropometrical, physical fitness, and motor coordination characteristics that influence dropout, contract status, and first-team playing time in high-level soccer players aged eight to eighteen years. J Strength Cond Res 2015; 29: 1692–1704.

58.

Parr

Winwood

Hodson-Tole

, et al. The main and interactive effects of biological maturity and relative age on physical performance in elite youth soccer players. Journal of Sports Medicine 2020; 2020: 1–11.

59.

Meylan

Cronin

Oliver

, et al. Talent identification in soccer: the role of maturity status on physical, physiological and technical characteristics. Int J Sports Sci Coach 2010; 5: 571–592.

60.

Saward

Hulse

Morris

, et al.

Longitudinal physical development of future professional male soccer players: Implications for talent identification and development?

Front Sports Act Living 2020; 2: 578203.

61.

Sieghartsleitner

Zuber

Zibung

, et al. Talent selection in youth football: Technical skills rather than general motor performance predict future player status of football talents. Current Issues in Sport Science (CISS) 2019; 4: 011.

62.

Kelly

Williams

Wilson

. Technical testing and match analysis statistics as predictors for age-specific talent development in an English football academy. Int J Perform Anal Sport 2020; 20: 1035–1051.

63.

Musculus

Lobinger

. Psychological characteristics in talented soccer players - Recommendations on how to improve coaches’ assessment. Front Psychol 2018; 9: 41.

64.

Reeves

McRobert

Littlewood

, et al. A scoping review of the potential sociological predictors of talent in junior-elite football: 2000–2016. Soccer Soc 2018; 19: 1085–1105.

65.

Kelly

Williams

Wilson

. Developing a football-specific talent identification and development profiling concept - The Locking Wheel Nut Model. Applied Coaching Research Journal 2018; 2: 32–41.

66.

Barraclough

Piggott

Till

, et al. Creating a shared mental model of performance: Coaches’ perspectives of key position-specific soccer actions. Int J Sports Sci Coach 2024; 19: 586–603.