
Abstract

Each failure of a novel drug to translate from preclinical experiments into the clinic increases the pressure to justify the use of animals in these experiments and to ensure that the interpretation of the findings is valid and convincing. This is especially the case with research on psychedelics, because they have already been tested extensively in humans. One reason for failure to translate is that preclinical research findings and conclusions are rarely, if ever, confirmed independently. Another is an incautious interpretation of the validity of the animal models that are used, in respect of their relevance to the human disorder of interest and its treatment. This article discusses both these points in the context of preclinical investigations of psychedelics as fast-acting antidepressants, but the generic points are relevant to other human disorders as well.

Keywords

Antidepressant, experimental design, forced swim test, head-twitch (response), open field test, psychedelic, reproducibility, sucrose preference test

Introduction

After many years in the doldrums, there has been a resurgence of interest in the hunt for new classes of antidepressants. Alongside ketamine, a major focus for preclinical studies of psychedelics has been their potential action as fast-acting antidepressants. Although the efficacy of existing compounds is already being tested in humans, preclinical experiments are still needed to investigate their mechanism(s) of action and to search for new congeners with promising prospects for clinical development (reviewed by Liebnau et al., 2025).

This is happening at a time when the justification for preclinical research that uses animals is being challenged more than ever before. The ethical burden of preclinical experiments is exacerbated by the poor reproducibility of many research findings (de Oliveira Andrade, 2025) and their high rate of failure to translate into the human clinic (Yamaguchi et al., 2021). There are many reasons why a compound might not translate successfully into humans, including the emergence of unwanted side effects. However, if a compound was predicted to be efficacious on the basis of evidence from preclinical models, but turned out not to have any beneficial effects, then all the animals have been wasted. In some cases, that might be because the preclinical evidence has been misconstrued, rather than arising from a fundamental flaw in the animal model per se (see Stanford, 2025). Nevertheless, such failures have inflamed criticism of, and loss of confidence in, the reliability and relevance of findings from animal studies in the research of human disorders and their treatment. For all those reasons, it is essential that preclinical procedures are not only reproducible but also fit for purpose.

So far, most efforts to improve reproducibility have placed a strong emphasis on randomization and blinding as essential for reducing the risk of bias. Preregistration of research plans is also encouraged to help ensure compliance with the intended (and approved) research plan. However, successful translation depends not only on appropriate experimental design but also on the validity of the experimental procedures; both these elements must be satisfied if the findings are to be unambiguous and interpreted correctly.

This article first outlines steps being taken to improve the design and reporting of preclinical investigations. However, initiatives that are intended to improve reproducibility offer no benefit at all if the procedures are not carried out properly, or if they lack the validity needed to fulfil the experimental objectives (Pratt et al., 2022). That issue is discussed next. Although the discussion focuses on behavioural procedures that are widely used in the research of potential psychedelic antidepressants, many of the points are generic and apply to other fields in the behavioural neurosciences.

Improving reproducibility and validity of research findings: General points

For preclinical findings to be translatable, there needs to be assurance that the investigation was planned properly and that the design of individual experiments is appropriate. To that end, several strategies and online tools have been developed to help improve the reproducibility of preclinical research (Table 1).

Table 1.

The pathway to improving reproducibility of research findings.

Process	Stage	Purpose	Guidelines/references
Make decisions on the study concept, objectives and experimental procedures	1	To consider practical matters and overall strategy before planning experiments	PREPARE; Smith et al. (2018)
	2	To ensure that the validity of the experimental ‘model’ is appropriate for meeting the objectives of the study and interpretation of the findings	iTRIPP; Bailey et al. (2023)
Plan the entire investigation	3	(a) Pilot study	Bate et al. (2025)
		(b) Hypothesis-generating experiment
		(c) Hypothesis-confirming experiment
Design each study/experiment	4a	Pilot study: To identify factors to be included in, or excluded from, subsequent steps of the investigation	Experimental design assistant; Percie du Sert et al. (2017) https://nc3rs.org.uk/our-portfolio/experimental-design-assistant-eda
	4b	Hypothesis-generating experiment: To gather preliminary (unpredicted) findings. Also, to provide information to inform power analysis, which is needed to estimate the appropriate sample size for the hypothesis-confirming experiment
	4c	Hypothesis-confirming experiment: To confirm whether or not findings predicted from the hypothesis-generating experiment are reproducible: that is, accept or reject the statistical null hypothesis. The design is selected to control for false positives and false negatives (Type I and II error rates)
Report the experimental procedures and findings	5	To ensure that all key experimental factors and procedures are included when reporting the experimental procedures and the findings	ARRIVE 2.0; Percie du Sert et al. (2020)
		Be CLEAR: Ensure transparent reporting of the data and how each experiment was conducted (as well as individual techniques), to help identify sources of bias/variability and their possible influence on interpretation of the findings	IUPHAR Transparency and Reproducibility Committee (CLEAR guidelines under development)

As part of a step-wise approach to the design of a preclinical investigation, the first consideration is the PREPARE guidelines (Planning Research and Experimental Procedures on Animals: Recommendations for Excellence; Smith et al., 2018), which offer a broad checklist of factors that need to be addressed before embarking on any experiment that uses animals. These include the following: proper formulation of the study; effective communication between researchers and the animal facility; and quality control of reagents and the animals’ health status.

An important component of that process will be deciding what experimental procedures are to be used and how the information gathered will be appropriate for meeting the objectives of the study. Those decisions require a dispassionate appraisal of the validity of the procedures/models, bearing in mind that each of them could be valid for one purpose (e.g. as a predictive screen), but not another (e.g. as a model of a complex, multifactorial disorder in humans; see below). The Improving Translational Relevance in Preclinical Psychopharmacology (iTRIPP) guidelines offer advice on those points when designing experiments to study a ‘model’ of a psychiatric disorder and/or its treatment (Bailey et al., 2023).

The next step is to plan the entire investigation: that is, the series of experiments that aim to reach a safe conclusion (Table 1). The prospective development of this overall plan is important because that approach will reduce the risk of false positives and enable the most efficient use of resources, including animals (Bate et al., 2025). The overall plan should start with pilot studies, to help decide which experimental factors merit inclusion in subsequent steps of the investigation and also to identify the fixed levels of other factors (e.g. timeline or range of drug doses), to maximize the window of opportunity or optimize the process. This should be followed by exploratory hypothesis-generating experiments and, finally, the hypothesis-confirming experiment. Whereas that sequence is typically standard practice in industrial settings, blue-skies research tends to evolve more gradually. However, it is important to note that an investigation that approaches the conclusion by devising one experiment at a time, adding new factors as the investigation progresses, risks wasting resources and will take much longer. It can also fail to identify important interactions between the experimental factors.

Notably, many preclinical exploratory findings are reported as conclusive before they have been confirmed independently. This is a premature and risky strategy; it should be made clear that any such findings are merely exploratory and tentative (Bate et al., 2025; Table 1). Conclusive findings require a test of an unequivocal prediction for the effect of a given experimental intervention (the hypothesis): for example, that a novel psychedelic will produce the response of interest, of meaningful magnitude, over a predicted range of doses. This precaution reduces the risk of being misled by false-positive and/or false-negative findings.

In addition to prompting the hypothesis, the results from exploratory experiments are useful because they also provide information on the likely magnitude of the response to the experimental intervention and the variability of the measure of interest. Both are needed for power analysis, which is used to estimate the appropriate sample size in the hypothesis-confirming experiment. That estimation helps to ensure that the sample sizes are neither too small (i.e. the study is underpowered and lacks adequate sensitivity) nor too large (i.e. the study is overpowered and risks detecting changes that are too small to be of any importance).
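As a rough illustration of the estimation described above, the following sketch converts an expected response magnitude and its variability into a per-group sample size for a comparison of two groups. It is a minimal sketch only, assuming a two-sided test, equal group sizes and a normal approximation (which slightly underestimates the number that a t-test would require). The function names, defaults and example values are hypothetical and are not taken from any of the cited studies or tools.

```python
import math
from statistics import NormalDist


def standardized_effect(mean_difference, pooled_sd):
    """Cohen's d: the expected difference between group means, in SD units."""
    if pooled_sd <= 0:
        raise ValueError("pooled_sd must be positive")
    return abs(mean_difference) / pooled_sd


def sample_size_per_group(effect_size, alpha=0.05, power=0.80):
    """Approximate animals per group for a two-sided, two-group comparison.

    Uses the normal approximation n = 2 * ((z_alpha + z_beta) / d) ** 2,
    where alpha is the Type I error rate and (1 - power) the Type II rate.
    """
    if effect_size <= 0:
        raise ValueError("effect_size must be positive")
    if not (0 < alpha < 1 and 0 < power < 1):
        raise ValueError("alpha and power must lie between 0 and 1")
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    return math.ceil(2 * ((z_alpha + z_beta) / effect_size) ** 2)


if __name__ == "__main__":
    # Hypothetical values from an exploratory experiment (illustrative only).
    d = standardized_effect(mean_difference=6.0, pooled_sd=6.0)
    print(sample_size_per_group(d))
```

Under these assumptions, a standardized effect of 1.0 calls for 16 animals per group, whereas halving the effect to 0.5 raises the estimate to 63, which shows how strongly the estimate depends on the magnitude and variability gathered in the exploratory stage.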

Only after carrying out a properly powered hypothesis-confirming experiment can it be inferred that the conclusion is likely to be correct and not a false positive (internal validity), but even that precaution does not guarantee reproducibility. Ideally, a further step would be to confirm that the same findings emerge when the experiment is carried out in a different laboratory setting and, even better, when testing the hypothesis in a different strain/species of animal (external validity). Unfortunately, the motivation to carry out those important checks is diminished by the need for confidentiality, which is driven by the competitive research environment of both the commercial and academic sectors. Nevertheless, findings that are confirmed as reproducible in that final category of tests are less likely to be false positives and more likely to offer promising translational potential.

Having planned the overall investigation, the next step is to design the individual experiments (Table 1). At this point, it is important to recognize that the design of each experiment depends on its role within the investigation: that is, whether it is an exploratory pilot study, an exploratory hypothesis-generating experiment or a hypothesis-confirming experiment (Bate et al., 2025). To help with that process, the National Centre for the 3Rs, UK, has developed the Experimental Design Assistant (EDA; https://nc3rs.org.uk/our-portfolio/experimental-design-assistant-eda). This enables researchers to build a schematic flowchart of a proposed experiment and to populate the plan with information about the experimental factors and important aspects of the procedure (e.g. species, strategies for blinding and randomization and the configuration of the samples within the experiment; Percie du Sert et al., 2017). The plan includes the intended strategy for statistical analysis (Bate et al., 2017). The EDA will then critique the overall design and analysis and will either confirm that they are appropriate and compatible, or flag points that raise concerns or that need more information; these flags are linked to online pages of advice on how to reconcile such issue(s). Finally, the EDA can produce a text report, which lists all the details that have been specified in the design. Importantly, this report makes it easy to check whether there are any issues that are important for promoting reproducibility, but have not been considered in the design.
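The randomization and blinding strategies mentioned above can be illustrated with a short sketch. This is not the EDA's own procedure; it is a hypothetical, minimal example that assumes equal group sizes, a recorded random seed (so that the allocation can be audited later) and single-letter codes that hide the treatment identity from the experimenter who scores the behaviour. All names are illustrative.

```python
import random


def allocate_animals(animal_ids, groups, seed):
    """Randomly assign animals to treatment groups of equal size."""
    if len(animal_ids) % len(groups) != 0:
        raise ValueError("animals must divide evenly between groups")
    rng = random.Random(seed)  # seeded so the allocation is reproducible
    labels = list(groups) * (len(animal_ids) // len(groups))
    rng.shuffle(labels)
    return dict(zip(animal_ids, labels))


def blinding_codes(groups, seed):
    """Map each treatment group to a shuffled letter code (A, B, C, ...)."""
    if len(groups) > 26:
        raise ValueError("this sketch supports at most 26 groups")
    rng = random.Random(seed)
    codes = [chr(ord("A") + i) for i in range(len(groups))]
    rng.shuffle(codes)
    return dict(zip(groups, codes))


if __name__ == "__main__":
    animals = [f"mouse-{i:02d}" for i in range(1, 13)]
    treatments = ["vehicle", "low dose", "high dose"]
    allocation = allocate_animals(animals, treatments, seed=2025)
    key = blinding_codes(treatments, seed=2025)
    # The scorer sees only the coded label; the key is held by a colleague.
    for animal, treatment in allocation.items():
        print(animal, key[treatment])
```

Keeping the allocation and the blinding key as separate outputs makes it straightforward to withhold the key until the scoring is complete.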

The final step, after carrying out the experiment, is to report the experimental procedure and the findings to a standard that enables the experiment to be repeated independently (Table 1). Shortcomings in this reporting process have been identified as a major factor that contributes to poor reproducibility (Kilkenny et al., 2009). The ARRIVE Guidelines were developed to address that problem but, despite most leading journals formally endorsing those guidelines, there is evidently scope for improving the compliance of the articles they publish (Hair et al., 2019; Lilley et al., 2020). As a further remedy, the ARRIVE 2.0 Guidelines were devised. This version differs from the original by categorizing factors that should be included in original research reports as either ‘Essential’ or ‘Recommended’ (Percie du Sert et al., 2020), but it is too early to assess whether that adjustment has improved matters. In the meantime, a preprint of the RIVER Guidelines has been released (RIVER Working Group, 2023), which will offer equivalent advice on the reporting of experiments carried out in vitro.

To further consolidate the reporting process, the Transparency and Reproducibility Committee of the International Union of Basic and Clinical Pharmacology (IUPHAR) is developing guidance on Clarity, Evaluation, Assessment, Rigour (CLEAR). This guidance emphasizes the need for data transparency together with its experimental context, which needs a full description of how the experimental design was implemented. This information is essential because it will help to identify sources of bias and variability that could affect the interpretation of the findings.

Despite all these initiatives, even reproducible findings will be merely expensive distractions if it turns out that the experimental model lacked the required validity. However, the criteria for validity depend on the objectives of the study (Almanasreh et al., 2019; Campbell and Fiske, 1959; Slack and Drougalis, 2001; Willner, 1986):

Construct validity: The neurobiology and pharmacology of the model are consistent with our understanding of the disorder and its treatment, which need continual revalidation as new evidence emerges. This can be further assessed in respect of:

Content validity: The extent to which the animal model incorporates all (or only some) features of the human disorder, or its treatment.

Concurrent validity: The extent to which the response to a novel experimental challenge replicates that expressed in an established (validated) model (the ‘gold standard’).

Divergent validity: The extent to which it is possible to distinguish between the construct validity of an established model and that of a novel challenge with a different (or overlapping) construct.

Convergent validity: The extent to which the effect(s) of a novel experimental challenge matches/correlates with those from a different model that assesses the same response (construct).

Predictive validity: The response to a given category of experimental challenge(s) in animals is consistent and can be used to predict the response in humans. This is the essence of preclinical drug screening.

Face validity: The observed response to the experimental challenge in animals resembles the effects in humans. This category of validity is vulnerable to misleading anthropomorphism.

Internal validity: The response to an experimental challenge is reproducible when the experiment is repeated under identical circumstances.

External validity: The response is reproducible when the experiment is carried out in different circumstances (a different laboratory or strain of animal, for instance). This category is essential for translational validity.

Translational validity: The response to the experimental challenge in animals is borne out in humans.

In short, for preclinical research to offer any scientific benefits and to have a realistic possibility of translating into humans, the research findings must be valid for the purposes of the study objectives, as well as reproducible. The following sections address that former point, focusing on key aspects of preclinical research of psychedelics.

The hallucinogenic profile of psychedelics

There are two main classes of psychedelics: phenethylamines (e.g. mescaline) and indolamines. The latter comprises two subgroups: tryptamines (e.g. dimethyltryptamine and psilocybin) and ergolines (also known as lysergamides, e.g. LSD). Conclusions from early studies of the pharmacology of psychedelics were confounded by the lack of any knowledge of the multiple 5-HT receptor subtypes that have now been identified. As a consequence, LSD was described as a ‘5-HTD’ receptor antagonist (Burris and Sanders-Bush, 1992; Gaddum, 1957), whereas Aghajanian believed LSD activated inhibitory 5-HT autoreceptors in the midbrain (Aghajanian et al., 1968). There is now plenty of evidence that all these compounds are either full agonists or partial agonists of 5-HT2a receptors and that this action is responsible for their hallucinogenic effect.

Until recently, there was little human data to underpin this view (for obvious reasons), apart from studies such as Sadzot et al. (1989) who reported that the threshold for doses of psychedelics that induce hallucinations correlated with their affinity for human (and rat) 5-HT2a receptors. However, many later studies have reported that the psychotomimetic effects of psychedelics are blocked by the 5-HT2a receptor antagonist, ketanserin (e.g. Kraehenmann et al., 2017; Vollenweider et al., 1998; see Jalal, 2018). A recent imaging study, using positron emission tomography, has further confirmed that both 5-HT2a occupancy by psilocin and its plasma concentration correlate with the subjective (mystical) experience induced by its prodrug parent, psilocybin (Madsen et al., 2019; Stenbæk et al., 2021).

However, several 5-HT2a receptor agonists are widely purported to lack hallucinogenic actions. Amongst these are 2-bromo-lysergic acid diethylamide (2-bromo-LSD; Cerletti and Rothlin, 1955; Ginzel and Meyer-Gross, 1956; Lewis et al., 2023), lisuride and Ariadne (Cunningham et al., 2023). The piperazine, quipazine, although from a different chemical class, is also often cited as another example. But, as discussed below, these assessments are highly questionable (see also Fiorella et al., 1995).

The prevailing view that 2-bromo-LSD does not induce hallucinations is open to question, or qualification. For instance, at doses used to test for relief of cluster headache, most subjects felt ‘slightly tipsy’ (see: Karst et al., 2010) and another study reported ‘a delirious reaction similar in almost all respects to that of LSD’ (Richards et al., 1958). Given that this compound is a partial (and biased) agonist at 5-HT2a receptors, but a full agonist, antagonist, or even inverse agonist at many other G protein-coupled receptors (Lewis et al., 2023), it has not been possible to identify which action(s) could contribute to such a response.

It is also hard to understand why lisuride, which is a congener of LSD and a 5-HT2a receptor agonist, is widely regarded as non-hallucinogenic. In a clinical trial, again to treat headache, ‘visual hallucinations’ were specified as a side-effect that warranted cessation of the study (Somerville and Herrmann, 1978). Hallucinations are also listed as a potential side-effect in data sheets for formulations of this compound, which is used to treat Parkinson’s disease and (off-label) migraine.

The evidence that quipazine does not induce hallucinations is similarly tenuous and seems to rely on a small-scale human study that focused on its effects on hormone secretion (Parati et al., 1980). Peripheral side-effects are listed, and they all point to nausea and gastrointestinal problems, which are likely to be attributed to quipazine’s action as a 5-HT3 receptor agonist. This action will constrain the dose range that can be tested in humans, which could account for the dearth of reports of any hallucinogenic action. This limitation would not be evident in rodent studies, given that neither rats nor mice express the emetic reflex (Horn et al., 2013). As a consequence, compounds that are predicted to be efficacious, on the basis of preclinical evidence, might not translate into humans due to limitations imposed by the maximum tolerated dose. Nevertheless, as summarized by de la Fuente Revenga et al. (2021), the human data for the lack of hallucinogenic effects of quipazine are ‘both scant and fragmentary, when not contradictory’.

Of particular relevance to this article is the evidence that the behavioural effects of these compounds in animal studies (specifically, the head-twitch response (HTR; see below)) resemble those of psychedelics that are acknowledged to be hallucinogenic. Yet, because they are widely regarded as non-hallucinogenic (and, in the case of quipazine, non-psychedelic), this action has led to them being branded as ‘false positives’ (e.g. White et al., 1981). This could be unfortunate because that judgement, if incorrect, unjustifiably undermines confidence in the validity of preclinical investigations of the effects of psychedelics on rodent behaviour (see also: Kehler and Lindskov, 2025).

Another complication is the evidence that 5-HT2a agonism is necessary, but not sufficient, for the hallucinogenic actions of psychedelics. A full discussion of that topic is beyond the scope of this article but, in brief, evidence points to activation of 5-HT1a receptors as an important contribution to the hallucinogenic effect of tryptamines (Pokorny et al., 2016), while activation of 5-HT2c receptors contributes to the actions of phenethylamines (Custodio et al., 2023; Fantegrossi et al., 2008) and ergolines (Fiorella et al., 1995), including lisuride. In addition, each psychedelic compound has its own profile of binding to the full range of receptors for 5-HT and other neurotransmitters, each with different receptor reserves, which will further distinguish them from one another (Holze et al., 2024; McKenna et al., 1990; Millan et al., 2002). The recruitment of several different second messenger cascades that follow activation of (G-protein coupled) 5-HT2a receptors, together with biased agonism, and their heterodimerization with receptors for other neurotransmitters, will further refine and define the response to individual psychedelics.

Little is known about the functional consequences of all these variables, but emerging evidence suggests that they have important implications for the psychotropic effects of these compounds and their therapeutic potential. This is particularly the case in respect of compounds with preferential binding to 5-HT2a receptors, such as LSD and psilocybin, which are regarded as ‘typical psychedelics’ (Kamal et al., 2023). By contrast, others, such as 5-methoxy-N,N-dimethyltryptamine, which has higher affinity for 5-HT1a than 5-HT2a receptors, together with ketamine (an NMDA-receptor antagonist) and methylenedioxymethamphetamine (a mixed-action, serotonin-releasing agent) are described as ‘atypical psychedelics’ (‘entheogens’). These disparate pharmacological profiles and topographical distributions of their receptor targets (Delli Pizzi et al., 2023) are thought to explain differences in their psychotropic effects, especially core features of the hallucinations, and their potential as fast-acting antidepressants (Bosch et al., 2022; Dourron et al., 2023).

All these differences, together with their respective pharmacokinetic profiles, could affect findings from preclinical behavioural studies in crucial ways. This variability is reminiscent of the detailed interrogations of the actions of sedative-hypnotics in studies of stimulus discrimination by Griffiths (e.g. Ator and Griffiths, 1986, 1989), who discovered that benzodiazepines and other compounds that bind to their allosteric-binding site on the GABAA-receptor do not all generalize to the same interoceptive cue. That topic has been discussed in detail elsewhere (Heal et al., 2025).

Against that background, the following sections discuss preclinical investigations of the behavioural responses to psychedelics, as 5-HT2a receptor agonists with antidepressant potential.

The validity of predictive screens

Head-twitches, hallucinations and 5-HT2a receptors

Psychedelics and other hallucinogens induce a behavioural syndrome in rodents, which includes head-twitches and wet-dog shakes (e.g. Titeler et al., 1988, but see Silva and Calil, 1975). There is long-standing evidence that head-twitches in mice (Goodwin and Green, 1985) and rats (Schreiber et al., 1995) are mediated by activation of 5-HT2a receptors. Moreover, the expression of head-twitches is prevented by the 5-HT2a receptor antagonist, ketanserin (Darmani et al., 1990) and is not seen in 5-HT2a receptor gene knockout mice (htr2^−/−; González-Maeso et al., 2007). Interestingly, a recent study reported that head-twitches induced by psilocybin are also not evident in genetically altered mice lacking the 5-HT transporter (Gattuso et al., 2025).

It is not known whether activation of 5-HT2a receptors or a hallucinogenic (mystical) experience, or both, is essential for an antidepressant response to psychedelics (but see: Yaden and Griffiths, 2020). Nevertheless, many preclinical studies of putative psychedelic antidepressants have scored their induction of head-twitches in rodents, as an index of 5-HT2a receptor activation and, more speculatively, as a predictive screen for a hallucinogenic response (Halberstadt et al., 2020). Even if the latter assumption is correct, that still raises questions about whether particular features of the response to the test compound, including hallucinations (the discriminative stimulus), have any bearing on the HTR as a predictor of an antidepressant response (see above and Gouzoulis-Mayfrank et al., 2005).

The action of the LSD analogue, lisuride, has been somewhat problematic in this context because this 5-HT2a agonist does not cause head-twitches in mice, even at high doses. Given the evidence that lisuride does cause hallucinations (see above – but contrary to many descriptions of its actions), this finding would be regarded as a false-negative response were it not for its efficacy in the least shrew (Cryptotis parva; see: Darmani et al., 1994; Table 2). This disparity highlights the importance of species differences in this response.

Table 2.

Typical and atypical behavioural and pharmacological profiles of psychedelics and related compounds.

Compound	Hallucinations	Head-twitches	5-HT2a receptor agonist	Comments
Typical psychedelics
Tryptamines (e.g. dimethyltryptamine, psilocybin)	✓	✓	✓
Ergolines (e.g. LSD)	✓	✓	✓
Phenethylamines (e.g. mescaline)	✓	✓	✓
2-bromo-LSD	No	No	Partial agonist
Ergometrine	No	✓	Partial agonist
Lisuride	✓	Not mice, but species differences	✓	Widely claimed to lack hallucinogenic activity, but hallucinations have been reported in clinical trials
Quipazine	?	✓	✓	No reports of hallucinations, but doses are probably restricted by activation of 5-HT3 receptors, which causes nausea and vomiting

The ergoline, ergometrine, is another apparent anomaly. There are reports that this compound has psychotropic (‘entheogenic’/‘psychotomimetic’) effects that are equivalent to a low dose of LSD (Bigwood et al., 1979; Selva et al., 1989), but no frank hallucinations have been reported for this 5-HT2a partial agonist. This may be because the unpleasant somatic effects of this compound are dose-limiting (Ott and Neely, 1980). Yet, it does induce head-twitches in mice, albeit at a concentration that is 5- to 200-fold higher than is needed for psilocybin, dimethyltryptamine or LSD (Balsara et al., 1986; Corne and Pickering, 1967). Similarly, there are no reports of hallucinations for the 5-HT2a agonist, quipazine (but see above), and yet this piperazine produces an HTR (de la Fuente Revenga et al., 2021). As a consequence, both ergometrine and quipazine are widely regarded as producing false-positive responses in the head-twitch test.

If hallucinations are essential for the efficacy of psychedelics as antidepressants, then purported false-negative (lisuride) and false-positive (quipazine and ergometrine) findings in the HTR test would undermine its use as a preclinical predictive screen for (hallucinogenic) psychedelic antidepressants. Clearly, it is important to resolve all these points because using animals to score head-twitches cannot be justified if hallucinations are not necessary and/or the test is unreliable (Figure 1).

Figure 1.

A flowchart to indicate the criteria that justify the use of the head-twitch response (HTR) to predict the therapeutic effects of putative fast-acting antidepressants and their mechanism of action.

Notwithstanding those uncertainties, another (alternative) reason for scoring head-twitches assumes that 5-HT2a receptor agonism, alone, predicts a fast-acting antidepressant response. In that context, the validity of this test rests merely on the expression of head-twitches as an objective criterion. But, if assessment of 5-HT2a receptor activation is the intended objective, then there is no need to regard either ergometrine or quipazine as false positives. In fact, their provocation of head-twitches consolidates the validity of this behavioural assay as a predictive screen for 5-HT2a agonists, albeit with possible differences in species’ sensitivity.

That possibility, along with the belief that some tryptamines bind to 5-HT2a receptors but do not induce hallucinations (see above), has kindled the development of congeners as putative antidepressants that will lack an unwanted hallucinogenic side-effect (reviewed by Duan et al., 2024). However, if that is the case, it is not necessary to use animals at all because 5-HT2a receptor activation can be measured in vitro. That approach has the added advantage of enabling investigations of caveats such as biased agonism and coupling to different second messengers (Figure 1).

Yet another scenario is that activation of 5-HT2a receptors is essential, but not sufficient, for an antidepressant response. In fact, the need for co-activation of 5-HT2a and other neurotransmitter receptor(s), which would need to be identified, is the only situation for which the use of animals to score head-twitches can be fully justified (Figure 1).

Another factor to bear in mind, as pointed out by Halberstadt and Geyer (2013), is that what is scored as a ‘head-twitch’ can show poor inter-observer consistency (Silva and Calil, 1975) and is vulnerable to subjective bias, which confounds comparisons across different studies. That problem can be avoided by ensuring that head-twitches are scored only by proficiently trained experimenters, after confirmation of their intra- and inter-scorer consistency (>95% inter-rater reliability: Canal and Morgan, 2012; Garcia et al., 2007). This precaution is essential because the full-blown response to psychedelics in rodents is not confined to head-twitches, but includes reciprocal forepaw treading, flat body posture, head-weaving, hind limb abduction and Straub tail. This ‘serotonin syndrome’ in rodents is believed to be analogous to ‘serotonin toxicity’ in humans, which is a life-threatening delirium caused by excessive serotonergic transmission (Haberzettl et al., 2013; Stanford et al., 2010).
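One simple way to check inter-scorer consistency before accepting head-twitch counts is sketched below. The agreement metric used here (the smaller count divided by the larger count for each session, averaged across sessions) is an assumption made for illustration; it is not necessarily the measure of inter-rater reliability used in the studies cited above, and more formal statistics (e.g. an intraclass correlation) might be preferred. The function names and counts are hypothetical.

```python
def count_agreement(count_a, count_b):
    """Agreement between two scorers for one session, from 0.0 to 1.0."""
    if count_a < 0 or count_b < 0:
        raise ValueError("counts cannot be negative")
    largest = max(count_a, count_b)
    if largest == 0:
        return 1.0  # both scorers recorded no head-twitches
    return min(count_a, count_b) / largest


def mean_agreement(scores_a, scores_b):
    """Average session-by-session agreement between two scorers."""
    if not scores_a or len(scores_a) != len(scores_b):
        raise ValueError("both scorers must score the same sessions")
    pairs = zip(scores_a, scores_b)
    return sum(count_agreement(a, b) for a, b in pairs) / len(scores_a)


def scorers_are_consistent(scores_a, scores_b, threshold=0.95):
    """True if mean agreement exceeds the chosen threshold (default 95%)."""
    return mean_agreement(scores_a, scores_b) > threshold


if __name__ == "__main__":
    # Hypothetical head-twitch counts from the same three video recordings.
    scorer_1 = [10, 20, 0]
    scorer_2 = [10, 19, 0]
    print(scorers_are_consistent(scorer_1, scorer_2))
```

The same functions can be applied to two scoring passes by one experimenter to check intra-scorer consistency.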

This might be an important, but neglected, confounder because reciprocal forepaw treading and flat body posture, at least, have been attributed to activation of 5-HT1a receptors in rats (Arvidsson et al., 1981; Smith and Peroutka, 1986) and mice (Yamada et al., 1988). Moreover, expression of both head-twitches and other aspects of the serotonin syndrome is evidently influenced by interactions with other serotonin receptors and other neurotransmitter systems (Goodwin et al., 1987; Heal et al., 1986). This is likely to be particularly important for compounds, such as lisuride, which bind to many different neurotransmitter receptors (Millan et al., 2002) and are purported to have anomalous profiles in this test.

Related to this broader profile is the evidence that expression of head-twitches after treatment with psychedelics has a bell-shaped dose–response profile. This could be explained by high drug doses activating stereotypic behaviours, associated with the full-blown serotonin syndrome, that prevent or mask expression of head-twitches (Corne and Pickering, 1967). Clearly, the choice of drug dose (and possibly the route of administration (Yamada et al., 1988)) needs to be considered carefully when assessing the effects of novel psychedelics in this test, not least because these variables, and their behavioural consequences, could contribute to the inconsistent reports of the effects of different compounds. In fact, this hormetic confounder could explain why lisuride does not provoke head-twitches in mice (but see above) and has been proposed by Glatfelter et al. (2024) to account for the apparent lack of hallucinogenic effects of this compound.

In summary, it is not at all certain that hallucinations per se are a necessary component of an antidepressant response to psychedelics, whether as a marker for a therapeutic dose and/or by inducing a change of psychological status that drives benefits outlasting the acute effects of the drug. It is also not certain whether activation of 5-HT2a receptors, alone, is sufficient either to induce hallucinations or to relieve depression. Whatever the case, the validity of the HTR as a predictive screen for hallucinations and/or antidepressants rests on psychedelics producing a consistent change in animals’ behaviour; that validity is challenged by claims that this test is vulnerable to false negatives and false positives.

There is clearly a need to reconcile and validate all these different objectives and confounders. That process is important for ethical and scientific reasons, especially in respect of the need to confirm that in vitro (non-animal) alternatives cannot provide the required information.

The Forced Swim Test: Validity depends on the experimental objective

The Forced Swim Test (FST) has been used for over 50 years on the basis that the immobility of rodents, which develops after a short period of inescapable confinement in a cylinder of water (usually 6 min maximum), is an expression of depression, or ‘depression-like’ behaviour. A change of mindset on this point followed the long-overdue acknowledgement that depression in humans is typically a chronic relapsing disorder, whereas the immobility in the FST is state-dependent and dissipates when the animal is removed from the water. Also, the immobility is diminished by subchronic administration of an established antidepressant shortly before the test (c. 24 h), but a therapeutic response to established (monoamine-targeting) antidepressants in humans needs several weeks of treatment, at least. Interestingly, the latter limitation does not apply to psychedelics, which are being investigated as fast-acting antidepressants. Nevertheless, the use of the FST to study depression (or depression-like behaviour) is now widely deprecated and no longer permitted in the UK and some other jurisdictions, on the grounds of its lack of validity as a model of depression.

The only procedure to produce a change in animals’ phenotype that arguably produces many key features of depression is olfactory bulbectomy. Interestingly, the HTR to the psychedelic, 2,5-dimethoxy-4-iodoamphetamine (DOI), is particularly sensitive in olfactory bulbectomized animals and is abolished by an antagonist of either 5-HT2a or 5-HT2c receptors or chronic administration of the selective serotonin reuptake inhibitor (SSRI), fluvoxamine (Oba et al., 2013). Repeated LSD treatment also relieves the deficit in active avoidance in these animals, as did chronic administration of the established antidepressant, imipramine (Buchborn et al., 2014). There is clearly scope for more research on psychedelics in this model of depression.

The misunderstanding that led to the FST being used to study depression derives from publications that first reported the procedure. The authors commented that the immobility appears to ‘reflect a state of despair in the rat’ (Porsolt et al., 1978) and that ‘having learned that escape was impossible and their having given up hope. Immobility was therefore given the name “behavioral despair”’ (Porsolt et al., 2001). The immobility was arbitrarily given the name ‘behavioural despair’ to distinguish it from ‘learned helplessness’, which was used to describe the deficit in active avoidance that develops when animals experience a series of inescapable, uncontrollable foot-shocks.

Learned helplessness was abandoned as a model of depression more than 40 years ago (Maier, 1984) but, because ‘despair’ and ‘hopelessness’ are prominent features of depression, the assumption that the forced swimming induces depression in rodents still prevails despite the lack of any scientific justification. The only evidence for that interpretation resorts to ‘face validity’, which is vulnerable to anthropomorphic mistakes. The immobility that characterizes both behavioural despair and learned helplessness is now thought to reflect a coping strategy/stress resilience (Maier and Watkins, 2010; Molendijk and de Kloet, 2015; see: Stanford, 2020).

By contrast, all established antidepressants produce a positive response in the FST, and so it is still used as a predictive screen for new candidate treatments. The procedure has been refined since it was first developed, mainly because it was thought that SSRIs did not reduce immobility. That setback was resolved by scoring several components of animals’ behaviour (swimming and climbing), as well as overt immobility (Cryan et al., 2005).

A common criticism of the use of the FST as a screen for antidepressants is that the reduction in immobility (or increase in swimming/climbing) does not emulate any aspect of antidepression in humans. Even though an increase in motor behaviour (or motor motivation) could be beneficial in depression, this criticism is a red herring. The key point is that a predictive screen does not need the behavioural response of the animal to emulate any aspect of the treatment in humans: the only requirement is for all drugs of a given therapeutic class to induce the same (any) change in animals’ behaviour. So far, the FST is an effective screen for all antidepressants that are licensed for that indication, including SSRIs, and others that have been developed since then. Its predictive validity has been endorsed by a recent systematic review, which concluded that this test is ‘necessary and evidence based’ (Brandwein et al., 2023).

However, most studies using the FST have tested long-established antidepressants that augment monoamine transmission. The predictive validity of the FST for putative fast-acting antidepressants, which fall into a different pharmacological and therapeutic category, is less certain. Strong support for such validity has nevertheless emerged from the flurry of studies of the effects of esketamine (the S-enantiomer of ketamine) in the FST, which was prompted by its recent licensed approval as a fast-acting antidepressant.

So far, nearly all studies have found that S-ketamine reduces immobility of both male and female rats (e.g. Arjmand et al., 2023; Koncz et al., 2023; Pereira et al., 2019) and mice, even when administered as a nebulized formulation (Brandão et al., 2023). Many more positive findings have emerged for tests of the racemic mixture of ketamine in rats (e.g. Clark et al., 2024) and mice (e.g. Bulthuis et al., 2024). There is also evidence that its efficacy in this test is long-lasting (Viana et al., 2020), as is the therapeutic response to S-ketamine. One exception is where rats were repeatedly treated with a high dose of ketamine before carrying out the FST in a way that did not conform to the usual protocol (Zhou et al., 2025). Another is where even the active controls, imipramine and fluoxetine, were ineffective in mice (Medeiros et al., 2025).

Whether or not the predictive validity of the FST extends to psychedelics has yet to be confirmed, and that will be an important challenge. So far, comparatively few studies have looked at the effects of psychedelics in this test, but the majority have reported a reduction in the immobility of rats. They do seem to be less effective in mice, but that limitation might be resolved by refining the dose schedule or other test parameters (see: Brandwein et al., 2023; Table 3).

Table 3.

Examples of findings for the effects of psychedelics in the Forced Swim Test (FST).

Species	Compound	Dose (I.P.) (mg/kg)*	Dose time before test	Immobility	References
Rat	N,N‑dimethyltryptamine	10	1, 2 and 3.5 h	↓	Cameron et al. (2018)
	5-MeO-N,N-DMT	0.5 and 5	1–24 h	NC	Kitada et al. (1983)
	Psilocybin	1	1–5 weeks	↓	Hibicke et al. (2020)
	Psilocybin	1	24 h	↓	Wang et al. (2025)
	Psilocybin	2, 3 and 10	3 h	NC (FSL)	Jefsen et al. (2019)
	Psilocin	0.5 and 2	3 h	NC (FSL)	Jefsen et al. (2019)
	Psilocybin	2 and 10	24 h	↑	Wojtas et al. (2022)
	Psilocybin	0.3	9 days	↓ (WKY)	Kolasa et al. (2024)
	Psilocybin	0.3	23 days	↓ (WH)	Kolasa et al. (2024)
	Psilocybin	1	24 h	↓	Rakoczy et al. (2024)
	LSD	0.15	5 weeks	↓	Hibicke et al. (2020)
Mouse	5-methoxy-N,N-dimethyltryptamine	10	24 h	↓	Cameron et al. (2023)
	2,5-dimethoxy-4-iodoamphetamine (DOI)	0.25–	45 min	NC	Redrobe and Bourin (1997)
	2,5-dimethoxy-4-iodoamphetamine (DOI)	0.2 and 2	24 h	NC	Pędzich et al. (2022)
	2,5-dimethoxy-4-iodoamphetamine (DOI)	0.05–2	24 h–3 weeks	↓ 0.01 and 0.25	Takaba et al. (2024)
	Psilocyn	1.5–4	24 h–3 weeks	↓ (1.5 mg/kg)	Takaba et al. (2024)
	LSD	30 µg/kg	20 min	NC	de Gregorio et al. (2022)
	Psilocybin	1	1, 3 and 7 days	NC	Hesselgrave et al. (2021)
	Psilocybin	1	1, 3 and 8 days	NC	Gattuso et al. (2025)

FSL: Flinders Sensitive Line; NC: no change; WH: Wistar Han; WKY: Wistar Kyoto.

*Dose (mg/kg) unless otherwise stated.
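The pattern in Table 3 can be summarized by tallying the reported outcomes for each species. The following is a minimal Python sketch, assuming that each row of the table is treated as one entry (so a study that contributes several rows is counted more than once) and using hypothetical labels ("down", "nc", "up") for a decrease, no change or an increase in immobility. It is a bookkeeping aid only and not a substitute for a formal meta-analysis.

```python
from collections import Counter
from typing import Dict, List, Tuple

# One (species, outcome) pair per row of Table 3, in table order.
# Labels are illustrative: "down" = reduced immobility, "nc" = no change,
# "up" = increased immobility.
TABLE_3_ROWS: List[Tuple[str, str]] = [
    ("rat", "down"), ("rat", "nc"), ("rat", "down"), ("rat", "down"),
    ("rat", "nc"), ("rat", "nc"), ("rat", "up"), ("rat", "down"),
    ("rat", "down"), ("rat", "down"), ("rat", "down"),
    ("mouse", "down"), ("mouse", "nc"), ("mouse", "nc"), ("mouse", "down"),
    ("mouse", "down"), ("mouse", "nc"), ("mouse", "nc"), ("mouse", "nc"),
]


def tally_by_species(rows: List[Tuple[str, str]]) -> Dict[str, Counter]:
    """Count how many table rows report each outcome, separately per species."""
    tallies: Dict[str, Counter] = {}
    for species, outcome in rows:
        tallies.setdefault(species, Counter())[outcome] += 1
    return tallies


tallies = tally_by_species(TABLE_3_ROWS)
for species, counts in tallies.items():
    total = sum(counts.values())
    print(f"{species}: {counts['down']} of {total} rows report reduced immobility")
```

On this count, 7 of the 11 rat entries but only 3 of the 8 mouse entries report a reduction in immobility, which is consistent with the impression that psychedelics are less effective in mice in this test.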

Of particular note was the finding that psilocybin did not affect the immobility of the Flinders Sensitive Line strain of rats, which is used in preclinical research of depression (Jefsen et al., 2019). However, this strain has a low density of 5-HT2a receptor mRNA expression in key limbic areas (Osterlund et al., 1999), and so their apparently anomalous response to psilocybin in the FST might actually serve to endorse the reliability of the FST instead.

The focus on 5-HT2a agonism as a key component of an antidepressant response to psychedelics prompts the question of whether their effect on immobility in the FST can be attributed to activation of these receptors. Not many studies have addressed that question, but the reduction in immobility following treatment with psilocyn or DOI (Takaba et al., 2024) or 5-MeO-DMT (Cameron et al., 2023) was prevented by the 5-HT2a antagonist, ketanserin. Functional ablation of the 5-HT2a receptor gene does not affect baseline immobility in this test (Jaggar et al., 2017), and there is inconsistent evidence for its effect on the immobility response to psychedelics. Whereas the response to DOI and lisuride was abolished, the response to psilocybin remained intact (Sekssaoui et al., 2024). There is also evidence that 5-HT1a receptor co-activation makes an important contribution to a reduction in the immobility caused by psychedelics (Głuch-Lutwin et al., 2023), which again suggests that activation of 5-HT2a receptors might be necessary, but not sufficient, for a psychedelic response.

The wide range of experimental parameters that have been incorporated into different studies of the actions of psychedelics in the FST (particularly drug doses and treatment schedules) could explain why no consistent response profile has emerged so far. As a consequence, we can neither be confident about nor rule out the use of the FST as a reliable predictive screen for psychedelic antidepressants, particularly in mice. Given that there are no alternative procedures in prospect, with predictive validity that might match the track record of the FST, there is a pressing need to resolve this question.

The Open Field Test

The Open Field Test (OFT) is a deceptively straightforward, high-throughput procedure that apparently involves merely placing animals in a novel arena and scoring their movements for a few minutes.

When first developed, the Open Field was a large circular arena (at least 1 m in diameter for testing rats), which enabled low-level and even illumination throughout. The measure of primary interest was the rats’ defaecation rate when placed in the apparatus. Although rats with a comparatively high defaecation rate tended to spend more time in the periphery of the arena (‘thigmotaxis’), it was their defaecation rate that was thought to be an indication of their ‘emotional reactivity’. That proposal prompted a program of selective inbreeding of rats (high vs low rate of defaecation in the Open Field) to understand the neurobiological basis of ‘emotionality’ (see: Broadhurst, 1975). The culmination of that effort was the development of the Maudsley Reactive (MR) and Maudsley NonReactive (MNR) strains of rats. However, defaecation, as the primary measure of interest, attracted some criticism (Cunha and Masur, 1978) and was later eclipsed by assessment of animals’ preference for, versus avoidance of, the central zone of the arena. That shift was based on the unconfirmed assumption that animals that spent more time in the central zone were less ‘anxious’ than those that preferred the periphery.

The proposal that emotionality differed in MR and MNR strains of rat was always controversial (Broadhurst, 1976; Commissaris et al., 1986). The possibility that emotionality in the Open Field, and other preclinical tests, is analogous to anxiety in humans was even more uncertain (see: Lister, 1990). Such scepticism was endorsed by studies comparing the effects of anxiolytic drugs on several different aspects of the behaviour of MR and MNR rats, including the Open Field, which failed to validate the proposal (e.g. Rowan and Flaherty, 1991). An objective assessment of the effect of antianxiety drugs on animals’ behaviour in the OFT was that they reduce ambulation (Crawley, 1985), but do not reduce thigmotaxis. In short, it is hard to understand why animals’ centre-field behaviour in this test is still being used as an index of anxiety.

The OFT is also used to evaluate animals’ locomotor activity. This is particularly the case when investigating the actions of putative antidepressants. The intention in this context is to gather evidence that a positive response to a novel antidepressant in tests such as the FST predicts a change in mood/motivation in humans, rather than causing a non-specific increase in motor activity. Yet, there are two reasons why fulfilling that objective is probably not necessary: (1) as explained above, an increase in motor activity could help treat psychomotor retardation in depression and (2) a predictive screen for antidepressants does not need to measure any aspect of mood.

Whatever the case, interpreting results from the OFT needs to take into account the animals’ response to the stress of exposure to a novel arena. This could either increase their movement (rapid exploration to achieve escape) or reduce it (freezing, in extreme). Because animals’ locomotor activity in the OFT is affected by their stress resilience, which is controversially interpreted as ‘anxiety’, these are not independent variables and so to evaluate one, but not the other, will be misleading. This will be a particular problem when using centre-field activity as an index of the animal’s stress response (‘anxiety’). To take account of this interaction, the score for movement within, or directed towards, the central zone for each animal needs to be expressed as a proportion of its total activity in the arena (Salmon and Stanford, 1989). Yet, when using the OFT to assess animals’ ‘anxiety’, most studies merely report the % time animals spend in the central zone, a measure that does not resolve the issue.
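The adjustment for total activity can be illustrated with a short calculation. The following is a minimal Python sketch, assuming that activity is scored as counts (e.g. line crossings) and that centre-zone counts are a subset of the total. The function name, the unit of activity and the example values are hypothetical and are not taken from Salmon and Stanford (1989).

```python
from typing import Optional


def centre_activity_proportion(
    centre_activity: float, total_activity: float
) -> Optional[float]:
    """Centre-zone activity as a proportion of the animal's total activity.

    Returns None when the animal did not move at all, because the
    proportion is then undefined rather than zero.
    """
    if centre_activity < 0 or total_activity < 0:
        raise ValueError("Activity scores cannot be negative")
    if centre_activity > total_activity:
        raise ValueError("Centre-zone activity cannot exceed total activity")
    if total_activity == 0:
        return None
    return centre_activity / total_activity


# Two hypothetical animals with the same centre-zone score (20 crossings)
# but very different overall activity.
active_animal = centre_activity_proportion(20, 200)
inactive_animal = centre_activity_proportion(20, 50)
```

The two hypothetical animals have identical raw centre-zone scores, yet centre-zone movement makes up 10% of the activity of one and 40% of the other. A comparison based on the raw score alone would miss that difference.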

Other potential confounds are that, despite Broadhurst’s detailed attention to the dimensions and illumination of the apparatus, contemporary Open Fields are usually square and made of Perspex or similar material. Those changes in the construction of the apparatus not only introduce corners, which will be a preferred region of the arena, but also make it difficult to prevent shadows and reflective glare. All those technical features, which will affect animals’ behaviour in the OFT (Stanford, 2007; Voikar and Stanford, 2023; Walsh and Cummings, 1976), could be particularly important for psychedelics in view of evidence that LSD affects animals’ sensitivity to light (Cunha and Masur, 1978).

Another complication is that locomotor activity in the OFT will depend on what else the animals are doing. If the experimental intervention affects grooming or rearing, for instance, then ambulation will be affected indirectly because they cannot carry out those behaviours and move around the arena at the same time. This is potentially particularly important for studies of psychedelics, which induce head-twitches, wet-dog shakes and other motor stereotypies that might disrupt ambulation long after their overt expression, or subliminally with low doses of psychedelics (Bysiek et al., 2025). It follows that any inference about the effect of psychedelics on animals’ locomotor activity in the OFT needs to be considered in the context of a full behavioural profile and depends on confirmation that any change in locomotor activity is not explained by their effects on other aspects of motor behaviour (e.g. Herpfer et al., 2005).

Although there have been many studies of the effects of psychedelics on locomotor activity and other exploratory/vegetative behaviours (Geyer et al., 1986; Halberstadt and Geyer, 2018; Halberstadt et al., 2019), they have all been carried out in what is essentially a novel environment, as in the OFT, or other form of activity meter, and do not consider interactions between different aspects of the animals’ behaviour. This is important, especially in light of a comparison of animals’ behaviour in a novel environment and their home cage, which revealed that the effects of test compounds (including hallucinogens) on locomotor activity differ markedly in a novel test arena and the home cage (Robinson and Riedel, 2014).

As a consequence, any or all of the factors above could help explain why the effects of psychedelics on these behaviours, especially their locomotor activity, are not yet consistent enough to support any conclusions about their effects on either motor activity or emotionality (Table 4). Gaining a better understanding of the effects of psychedelics on spontaneous behaviours, without the complication of the stress of exposure to a novel environment, needs a clear assessment of the animals’ baseline activity throughout the circadian cycle while in their home cage (e.g. Porter et al., 2015) and a comparative assessment of the effects of test compounds on their behaviours.

Table 4.

Examples of the effects of psychedelics on defaecation and locomotor activity in the Open Field Test.

Species	Compound	Defaecation	Locomotor activity	Dose mg/kg*	Dose – test interval	References
Rat	DMT	↓		2–5	1.5–3 h	Brimblecombe et al. (1964)
	DOI		↓	0.125–1.0		Kaur and Ahlenius (1997)
	DOM	↓	↑	0.5–1.0	0.5–2.0 h	Yamamoto and Ueki (1975)
	DOM	↓	↓ < ↑	>5.0	0.5–2.0 h	Yamamoto and Ueki (1975)
	Lisuride		↑	0.1–0.5	30 min	Fink and Morgenstern (1985)
	Lisuride		↓	>0.5	30 min	Fink and Morgenstern (1985)
	LSD		↑	2–500 µg/kg	15 min	Dandiya et al. (1969)
	LSD	↓	NC	0.1	10 min	Silva and Calil (1975)
	Mescaline	↓ or NC strain differences	↑ or NC strain differences	35	1–2 h	Lush (1975)
	Mescaline		↑	>10		Yamamoto and Ueki (1975)
	Mescaline	↓	NC	5–20	10 min	Silva and Calil (1975)
	Psilocybin		NC	2–10	20 min	Wojtas et al. (2022)
	Psilocybin		↓ NC	2–10	1 and 24 h	Wojtas et al. (2023)
	Psilocyn		↓	0.25–4.0		Tylš et al. (2016)
Mouse	Lisuride		↓	0.01–4.0	31–120 min	Pogorelov et al. (2023)
	LSD		↑	0.3	0–30 min	Kaplan et al. (2022)
	Psilocybin		↑	0.01–0.3	0–2 h	Higgins et al. (2021)
	Psilocybin		↑	1	60 min	Gattuso et al. (2025)

*Dose (mg/kg) unless otherwise stated.

The sucrose preference test (‘anhedonia’)

The reduction in rodents’ sucrose preference after experiencing a regime of chronic, unpredictable mild stressors (CUMS) is interpreted as analogous to anhedonia in humans and so serves as an animal model of this aspect of depression (but see Stanford, 2020). The validation of this model rests mainly on evidence that established antidepressants prevent this stress-induced reduction in motivation to seek reward (reviewed by Willner, 2017).
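For readers unfamiliar with the outcome measure, sucrose preference in a two-bottle choice is commonly expressed as sucrose intake as a percentage of total fluid intake. The following is a minimal Python sketch of that calculation. The function name and the example intakes are hypothetical, and individual studies may differ in how they measure intake (e.g. by weight or volume) and over what period.

```python
from typing import Optional


def sucrose_preference(sucrose_intake: float, water_intake: float) -> Optional[float]:
    """Sucrose intake as a percentage of total fluid intake (two-bottle choice).

    Both intakes must be in the same unit (e.g. g or ml). Returns None when
    the animal drank nothing, because the preference is then undefined.
    """
    if sucrose_intake < 0 or water_intake < 0:
        raise ValueError("Intake cannot be negative")
    total_intake = sucrose_intake + water_intake
    if total_intake == 0:
        return None
    return 100.0 * sucrose_intake / total_intake


# Hypothetical intakes: 8 g of sucrose solution and 2 g of water
preference = sucrose_preference(8.0, 2.0)
```

With these hypothetical intakes, the preference is 80.0%. A value of 50% would indicate no preference between the two bottles.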

It is too early to tell whether this antidepressant action extends to psychedelics because only a limited range of compounds has been tested so far. The majority of studies have used either LSD or psilocybin in mice and a dose of 1 mg/kg, regardless of the test compound. An increase in sucrose preference is a common, but not invariable, finding (Table 5).

Table 5.

Examples of the effects of psychedelics on animals’ preference for sucrose in the sucrose preference test.

Species	Compound	Dose mg/kg	Treatment before a drug test	Change in sucrose preference	References
Rat	LSD	0.16	26 weeks of LSD treatment	↓	Marona-Lewicka et al. (2011)
Rat	Psilocybin	1.0	Social instability	↑	Wang et al. (2025)
Mouse	DOI	1.0	Repeated swim stress	↑	Sekssaoui et al. (2024)
	Lisuride	1.0	Repeated swim stress	↑	Sekssaoui et al. (2024)
	LSD	0.05 or 1.0	Repeated saline injection	NC	Elsilä et al. (2022)
	Psilocybin	1.0	Wild type or 5-HT transporter knockout	NC	Gattuso et al. (2025)
	Psilocybin	1.0	Chronic multi-modal stress	↑	Hesselgrave et al. (2021)
	Psilocybin	1.0	Repeated swim stress	↑	Sekssaoui et al. (2024)

DOI: 2,5-dimethoxy-4-iodoamphetamine; NC: no change.

When considering the validity of this test to study depression, or as a predictive screen for putative antidepressants, it should be borne in mind that, although anhedonia is a feature of major depressive disorder in humans, it is also a symptom of autism spectrum disorder, substance use disorder and schizophrenia. Its status as a symptom of schizophrenia is especially relevant given that early preclinical research of the neurobiology of that disorder used psychedelics as the experimental challenge, to produce a rodent model of schizophrenia on the basis that hallucinations are common to both.

In the context of research of depression and its treatment, it should also be noted that procedures for imposing CUMS that are used currently often differ markedly from the protocol developed by Willner (1986). The original procedure used the rodent equivalent of a series of daily ‘hassles’ (mild psychological stressors). However, many CUMS studies now incorporate a series of physiological stressors, such as 24 h food and/or water deprivation, swimming in ice-cold water and prolonged heat stress (45°C), each of which is moderate or severe, particularly when their cumulative severity is taken into account. The justification for such CUMS procedures is not assured, especially when they are compared with the types of non-noxious (psychological) stressors that can trigger, or exacerbate, depression in humans. It follows that the validity of their use, either to study the neurobiology of depression or as predictive screens for putative antidepressants, is highly questionable, both scientifically and ethically.

Finally, the wide range of stressors that have been used to induce anhedonia could well contribute to the disparate effects of psychedelics in these studies (Table 5). As with the FST, when used as a predictive screen, it may well be that the predictive validity of the sucrose preference test would improve if there were a better understanding of: the environmental factors that contribute to variability in the response to the drug; the effective dose regimen; and compliance with the need to apply chronic mild stress, both in terms of its duration and intensity.

Final comments

With the possible exception of the HTR, no clear pattern has emerged to confirm the effects of psychedelics on rodent behaviour. Doubtless, this is partly because the pharmacokinetics and pharmacodynamics of all these compounds differ substantially in ways that will affect both the magnitude and the time course of the behavioural response. To allay the burgeoning scepticism about the merits of preclinical experiments, it is essential to pin down and understand the key variables. The success of those investigations will depend on the appropriate design of the entire investigation, as well as individual experiments, and confirmation that the findings are reproducible.

However, more caution should be applied to the anthropomorphic interpretation of individual behaviours (face validity), which is controversial at best and could even be totally spurious. Some outcome measures are clearly more objective and do have obvious analogous relevance to human behaviour. These include, for instance, measures of drug self-administration (to assess the risk of misuse of a new drug), monitoring sleep/eating architecture in studies of insomnia and eating disorders, or assessment of cognitive performance in operant training tasks. By contrast, assessment of animals’ subjective state (mood) is particularly challenging, especially when drawing inferences that are based merely on changes in their motor activity or gustatory preference.

The justification for using animals to develop new psychedelic compounds and to study their underlying biological mechanisms is predicated on the need for the findings to be reliable predictors of therapeutic efficacy (and harms). At the moment, every failure to translate into humans is cited as evidence that animal experiments are irrelevant and misleading. Rebutting this criticism requires the experimental procedures to be valid and the inferences to be cautious and realistic. Only then can we be confident that animal models will make an unassailable contribution to the research of novel psychedelics, as fast-acting antidepressants.

Footnotes

Acknowledgements

I wish to thank the referees for their helpful comments on the first version of this article.

Declaration of conflicting interests

The author declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.

Funding

The author received no financial support for the research, authorship and/or publication of this article.

ORCID iD

S Clare Stanford

References

Aghajanian

Foote

Sheard

(1968) Lysergic acid diethylamide: Sensitive neuronal units in the midbrain raphe. Science 161: 706–708.

Almanasreh

Moles

Chen

(2019) Evaluation of methods used for estimating content validity. Res Social Adm Pharm 15: 214–221.

Arjmand

Vadstrup Pedersen

Silva

, et al. (2023) Sex and estrous cycle are not mediators of S-ketamine’s rapid-antidepressant behavioral effects in a genetic rat model of depression. Int J Neuropsychopharmacol 26: 350–358.

Arvidsson

Hacksell

Nilsson

, et al. (1981) 8-Hydroxy-2-(di-n-propylamino)tetralin, a new centrally acting 5-hydroxytryptamine receptor agonist. J Med Chem 24: 921–923.

Ator

Griffiths

(1986) Discriminative stimulus effects of atypical anxiolytics in baboons and rats. J Pharmacol Exp Ther 237: 393–403.

Ator

Griffiths

(1989) Differential generalization to pentobarbital in rats trained to discriminate lorazepam, chlordiazepoxide, diazepam, or triazolam. Psychopharmacology (Berl) 98: 20–30.

Bailey

Bast

Chaby

, et al. (2023) Improving translational relevance in preclinical psychopharmacology (iTRIPP). J Psychopharmacol 37: 1051–1057.

Balsara

Bapat

Nandal

, et al. (1986) Head-twitch response induced by ergometrine in mice: Behavioural evidence for direct stimulation of central 5-hydroxytryptamine receptors by ergometrine. Psychopharmacology (Berl) 88: 275–278.

Bate

Clark

Stanford

(2017) Using InVivoStat to perform the statistical analysis of experiments. J Psychopharmacol 31: 644–652.

10.

Bate

Stanford

Page

(2025) A three-stage strategy for conducting an experimental investigation: A recommendation to improve the reproducibility of reported conclusions. J Psychopharmacol 39: 301–312.

11.

Bigwood

Ott

Thompson

, et al. (1979) Entheogenic effects of ergonovine. J Psychedelic Drugs 11: 147–149.

12.

Bosch

Halm

Seifritz

(2022) Psychedelics in the treatment of unipolar and bipolar depression. Int J Bipolar Disord 10: 18.

13.

Brandão

AAC

Deus

DLS

Duarte-Filho

LAMS

, et al. (2023) Nebulized and intraperitoneal ketamine have equivalent antidepressant-like effect in the forced swim and tail suspension tests in mice. Pharmacol Biochem Behav 233: 173674.

14.

Brandwein

Leenaars

CHC

Becker

, et al. (2023) A systematic mapping review of the evolution of the rat Forced Swim Test: Protocols and outcome parameters. Pharmacol Res 196: 106917.

15.

Brimblecombe

Downing

Green

, et al. (1964) Some pharmacological effects of a series of tryptamine derivatives. Br J Pharmacol Chemother 23: 43–54.

16.

Broadhurst

(1975) The Maudsley reactive and nonreactive strains of rats: A survey. Behav Genet 5: 299–319.

17.

Broadhurst

(1976) The Maudsley reactive and nonreactive strains of rats: A clarification. Behav Genet 6: 363–365

18.

Buchborn

Schröder

Höllt

, et al. (2014) Repeated lysergic acid diethylamide in an animal model of depression: Normalisation of learning behaviour and hippocampal serotonin 5-HT2 signalling. J Psychopharmacol 28: 545–552.

19.

Bulthuis

McGowan

Ladner

, et al. (2024) GluN2B on adult-born granule cells modulates (R,S)-ketamine’s rapid-acting effects in mice. Int J Neuropsychopharmacol 27: pyae036.

20.

Burris

Sanders-Bush

(1992) Unsurmountable antagonism of brain 5-hydroxytryptamine2 receptors by (+)-lysergic acid diethylamide and bromo-lysergic acid diethylamide. Mol Pharmacol 42: 826–830.

21.

Bysiek

Wojtas

Szpręgiel

, et al. (2025) The effect of low-dose psilocybin on brain neurotransmission and rat behavior. Prog Neuropsychopharmacol Biol Psychiatry 27: 111347.

22.

Cameron

Benson

Dunlap

, et al. (2018) Effects of N, N-dimethyltryptamine on rat behaviors relevant to anxiety and depression. ACS Chem Neurosci 9: 1582–1590.

23.

Cameron

Patel

Vargas

, et al. (2023) 5-HT2ARs mediate therapeutic behavioral effects of psychedelic tryptamines. ACS Chem Neurosci 14: 351–358.

24.

Campbell

Fiske

(1959) Convergent and discriminant validation by the multitrait-multimethod matrix. Psychol Bull 56: 81–105.

25.

Canal

Morgan

(2012) Head-twitch response in rodents induced by the hallucinogen 2,5-dimethoxy-4-iodoamphetamine: A comprehensive history, a re-evaluation of mechanisms, and its utility as a model. Drug Test Anal 4: 556–576.

26.

Cerletti

Rothlin

(1955) Role of 5-hydroxytryptamine in mental diseases and its antagonism to lysergic acid derivatives. Nature 176: 785–786.

27.

Clark

Wang

Hanania

, et al. (2024) 5-HT_1B receptor activation produces rapid antidepressant-like effects in rodents. Pharmacol Biochem Behav 247: 173917.

28.

Commissaris

Harrington

Ortiz

, et al. (1986) Maudsley reactive and non-reactive rat strains: Differential performance in a conflict task. Physiol Behav 38: 291–294.

29. Corne and Pickering (1967) A possible correlation between drug-induced hallucinations in man and a behavioural response in mice. Psychopharmacologia 11: 65–78.

30. Crawley (1985) Exploratory behavior models of anxiety in mice. Neurosci Biobehav Rev 9: 37–44.

31. Cryan, Valentino and Lucki (2005) Assessing substrates underlying the behavioral effects of antidepressants using the modified rat forced swimming test. Neurosci Biobehav Rev 29: 547–569.

32. Cunha and Masur (1978) Evaluation of psychotropic drugs with a modified open field test. Pharmacology 16: 259–267.

33. Cunningham, Bock, Serrano, et al. (2023) Pharmacological mechanism of the non-hallucinogenic 5-HT2A agonist Ariadne and analogs. ACS Chem Neurosci 14: 119–135.

34. Custodio RJP, Ortiz, Lee, et al. (2023) Serotonin 2C receptors are also important in head-twitch responses in male mice. Psychopharmacology (Berl) 242: 1585–1605.

35. Dandiya, Gupta, et al. (1969) Effects of LSD on open field performance in rats. Psychopharmacologia 15: 333–340.

36. Darmani, Martin, Pandey, et al. (1990) Do functional relationships exist between 5-HT1A and 5-HT2 receptors? Pharmacol Biochem Behav 36: 901–906.

37. Darmani, Mock, Towns, et al. (1994) The head-twitch response in the least shrew (Cryptotis parva) is a 5-HT2- and not a 5-HT1C-mediated phenomenon. Pharmacol Biochem Behav 48: 383–396.

38. De Gregorio, Inserra, Enns, et al. (2022) Repeated lysergic acid diethylamide (LSD) reverses stress-induced anxiety-like behavior, cortical synaptogenesis deficits and serotonergic neurotransmission decline. Neuropsychopharmacology 47: 1188–1198.

39. de la Fuente Revenga, Shah, Nassehi, et al. (2021) Psychedelic-like properties of quipazine and its structural analogues in mice. ACS Chem Neurosci 12: 831–844.

40. Delli Pizzi, Chiacchiaretta, Sestieri, et al. (2023) Spatial correspondence of LSD-induced variations on brain functioning at rest with serotonin receptor expression. Biol Psychiatry Cogn Neurosci Neuroimaging 8: 768–776.

41. de Oliveira Andrade (2025) Huge reproducibility project fails to validate dozens of biomedical studies. Nature 641: 293–294.

42. Dourron, Nichols, Simonsson, et al. (2023) 5-MeO-DMT: An atypical psychedelic with unique pharmacology, phenomenology and risk? Psychopharmacology (Berl) 242: 1457–1479.

43. Duan, Cao, Wang, et al. (2024) Serotonin 2A receptor (5-HT2AR) agonists: Psychedelics and non-hallucinogenic analogues as emerging antidepressants. J Chem Rev 124: 124–163.

44. Elsilä, Harkki, Enberg, et al. (2022) Effects of acute lysergic acid diethylamide on intermittent ethanol and sucrose drinking and intracranial self-stimulation in C57BL/6 mice. J Psychopharmacol 36: 860–874.

45. Fantegrossi, Reissig, Katz, et al. (2008) Hallucinogen-like effects of N,N-dipropyltryptamine (DPT): Possible mediation by serotonin 5-HT1A and 5-HT2A receptors in rodents. Pharmacol Biochem Behav 88: 358–365.

46. Fink and Morgenstern (1985) Locomotor effects of lisuride: A consequence of dopaminergic and serotonergic actions. Psychopharmacology (Berl) 85: 464–468.

47. Fiorella, Rabin and Winter (1995) Role of 5-HT2A and 5-HT2C receptors in the stimulus effects of hallucinogenic drugs. II: Reassessment of LSD false positives. Psychopharmacology (Berl) 121: 357–363.

48. Gaddum (1957) Serotonin-LSD interactions. Ann N Y Acad Sci 66: 643–647.

49. Garcia, Smith and Sanders-Bush (2007) Role of G(q) protein in behavioral effects of the hallucinogenic drug 1-(2,5-dimethoxy-4-iodophenyl)-2-aminopropane. Neuropharmacology 52: 1671–1677.

50. Gattuso, Wilson, et al. (2025) Mice lacking the serotonin transporter do not respond to the behavioural effects of psilocybin. Eur J Pharmacol 991: 177304.

51. Geyer, Russo and Masten (1986) Multivariate assessment of locomotor behavior: Pharmacological and behavioral analyses. Pharmacol Biochem Behav 25: 277–288.

52. Ginzel and Meyer-Gross (1956) Prevention of psychological effects of d-lysergic acid diethylamide (LSD 25) by its 2-brom derivative (BOL 148). Nature 178: 210.

53. Glatfelter, Pottie, Partilla, et al. (2024) Comparative pharmacological effects of lisuride and lysergic acid diethylamide revisited. ACS Pharmacol Transl Sci 7: 641–653.

54. Głuch-Lutwin, Sałaciak, Pytka, et al. (2023) The 5-HT1A receptor biased agonist, NLX-204, shows rapid-acting antidepressant-like properties and neurochemical changes in two mouse models of depression. Behav Brain Res 438: 114207.

55. González-Maeso, Weisstaub, Zhou, et al. (2007) Hallucinogens recruit specific cortical 5-HT(2A) receptor-mediated signaling pathways to affect behavior. Neuron 53: 439–452.

56. Goodwin and Green (1985) A behavioural and biochemical study in mice and rats of putative selective agonists and antagonists for 5-HT1 and 5-HT2 receptors. Br J Pharmacol 84: 743–753.

57. Goodwin, De Souza, Green, et al. (1987) The pharmacology of the behavioural and hypothermic responses of rats to 8-hydroxy-2-(di-n-propylamino)tetralin (8-OH-DPAT). Psychopharmacology (Berl) 91: 506–511.

58. Gouzoulis-Mayfrank, Heekeren, Neukirch, et al. (2005) Psychological effects of (S)-ketamine and N,N-dimethyltryptamine (DMT): A double-blind, cross-over study in healthy volunteers. Pharmacopsychiatry 38: 301–311.

59. Haberzettl, Bert, Fink, et al. (2013) Animal models of the serotonin syndrome: A systematic review. Behav Brain Res 256: 328–345.

60. Hair, Macleod, Sena, et al. (2019) A randomised controlled trial of an intervention to improve compliance with the ARRIVE guidelines (IICARus). Res Integr Peer Rev 4: 12.

61. Halberstadt and Geyer (2013) Characterization of the head-twitch response induced by hallucinogens in mice: Detection of the behavior based on the dynamics of head movement. Psychopharmacology (Berl) 227: 727–739.

62. Halberstadt, van der Zee JVF, Chatha, et al. (2019) Chronic treatment with a metabotropic mGlu2/3 receptor agonist diminishes behavioral response to a phenethylamine hallucinogen. Psychopharmacology (Berl) 236: 821–830.

63. Halberstadt, Chatha, Klein, et al. (2020) Correlation between the potency of hallucinogens in the mouse head-twitch response assay and their behavioral and subjective effects in other species. Neuropharmacology 167: 107933.

64. Halberstadt and Geyer (2018) Effect of hallucinogens on unconditioned behavior. Curr Top Behav Neurosci 36: 159–199.

65. Heal, Philpot, O’Shaughnessy, et al. (1986) The influence of central noradrenergic function on 5-HT2-mediated head-twitch responses in mice: Possible implications for the actions of antidepressant drugs. Psychopharmacology (Berl) 89: 414–420.

66. Heal, Smith, Gosden, et al. (2025) Discriminating evidence - use and misuse of the drug-discrimination test in abuse potential assessment of novel CNS drugs. J Psychopharmacol 39: 629–651.

67. Herpfer, Hunt and Stanford (2005) A comparison of neurokinin 1 receptor knock-out (NK1-/-) and wildtype mice: Exploratory behaviour and extracellular noradrenaline concentration in the cerebral cortex of anaesthetised subjects. Neuropharmacology 48: 706–719.

68. Hesselgrave, Troppoli, Wulff, et al. (2021) Harnessing psilocybin: Antidepressant-like behavioral and synaptic actions of psilocybin are independent of 5-HT2R activation in mice. Proc Natl Acad Sci U S A 118: e2022489118.

69. Hibicke, Landry, Kramer, et al. (2020) Psychedelics, but not ketamine, produce persistent antidepressant-like effects in a rodent experimental system for the study of depression. ACS Chem Neurosci 11: 864–871.

70. Higgins, Carroll, Brown, et al. (2021) Low doses of psilocybin and ketamine enhance motivation and attention in poor performing rats: Evidence for an antidepressant property. Front Pharmacol 12: 640241.

71. Holze, Singh, Liechti, et al. (2024) Serotonergic psychedelics: A comparative review of efficacy, safety, pharmacokinetics, and binding profile. Biol Psychiatry Cogn Neurosci Neuroimaging 9: 472–489.

72. Horn, Kimball, Wang, et al. (2013) Why can’t rodents vomit? A comparative behavioral, anatomical, and physiological study. PLoS One 8: e60537.

73. Jaggar, Weisstaub, Gingrich, et al. (2017) 5-HT2A receptor deficiency alters the metabolic and transcriptional, but not the behavioral, consequences of chronic unpredictable stress. Neurobiol Stress 7: 89–102.

74. Jalal (2018) The neuropharmacology of sleep paralysis hallucinations: Serotonin 2A activation and a novel therapeutic drug. Psychopharmacology (Berl) 235: 3083–3091.

75. Jefsen, Højgaard, Christiansen, et al. (2019) Psilocybin lacks antidepressant-like effect in the Flinders Sensitive Line rat. Acta Neuropsychiatr 31: 213–219.

76. Kamal, Jha and Radhakrishnan (2023) Role of psychedelics in treatment-resistant depression. Psychiatr Clin North Am 46: 291–305.

77. Kaplan, Confair, Kim, et al. (2022) Bespoke library docking for 5-HT2A receptor agonists with antidepressant activity. Nature 610: 582–591.

78. Karst, Halpern, Bernateck, et al. (2010) The non-hallucinogen 2-bromo-lysergic acid diethylamide as preventative treatment for cluster headache: An open, non-randomized case series. Cephalalgia 30: 1140–1144.

79. Kaur and Ahlenius (1997) Potentiation of DOI-induced forward locomotion in rats by (-)-pindolol pretreatment. Neural Transm (Vienna) 104: 605–614.

80. Kehler and Lindskov MST (2025) Are the LSD-analogs lisuride and ergotamine examples of non-hallucinogenic serotonin 5-HT2A receptor agonists? J Psychopharmacol 2698811251330741. DOI: 10.1177/02698811251330741.

81. Kilkenny, Parsons, Kadyszewski, et al. (2009) Survey of the quality of experimental design, statistical analysis and reporting of research using animals. PLoS One 4: e7824.

82. Kitada, Miyauchi, Kanazawa, et al. (1983) Involvement of alpha- and beta 1-adrenergic mechanisms in the immobility-reducing action of desipramine in the forced swimming test. Neuropharmacology 22: 1055–1060.

83. Kolasa, Nikiforuk, Korlatowicz, et al. (2024) Correction to: Unraveling psilocybin’s therapeutic potential: Behavioral and neuroplasticity insights in Wistar-Kyoto and Wistar male rat models of treatment resistant depression. Psychopharmacology (Berl) 242: 1455.

84. Koncz, Papp, Pothorszki, et al. (2023) (S)-Ketamine but not (R)-ketamine shows acute effects on depression-like behavior and sleep-wake architecture in rats. Int J Neuropsychopharmacol 26: 618–626.

85. Kraehenmann, Pokorny, Vollenweider, et al. (2017) Dreamlike effects of LSD on waking imagery in humans depend on serotonin 2A receptor activation. Psychopharmacology (Berl) 234: 2031–2046.

86. Lewis, Bonniwell, Lanham, et al. (2023) A non-hallucinogenic LSD analog with therapeutic potential for mood disorders. Cell Rep 42: 112203.

87. Liebnau, Betzler and Kerber (2025) Catalyst for change: Psilocybin’s antidepressant mechanisms – A systematic review. J Psychopharmacol 39: 397–415.

88. Lilley, Stanford, Kendall, et al. (2020) ARRIVE 2.0 and the British Journal of Pharmacology: Updated guidance for 2020. Br J Pharmacol 177: 3611–3616.

89. Lush (1975) A comparison of the effect of mescaline on activity and emotional defaecation in seven strains of mice. Br J Pharmacol 55: 133–139.

90. Lister (1990) Ethologically-based animal models of anxiety disorders. Pharmacol Ther 46: 321–340.

91. Madsen, Fisher, Burmester, et al. (2019) Psychedelic effects of psilocybin correlate with serotonin 2A receptor occupancy and plasma psilocin levels. Neuropsychopharmacology 44: 1328–1334.

92. Maier (1984) Learned helplessness and animal models of depression. Prog Neuropsychopharmacol Biol Psychiatry 8: 435–446.

93. Maier and Watkins (2010) Role of the medial prefrontal cortex in coping and resilience. Brain Res 1355: 52–60.

94. McKenna, Repke, et al. (1990) Differential interactions of indolealkylamines with 5-hydroxytryptamine receptor subtypes. J Neuropharmacol 29: 193–198.

95. Marona-Lewicka and Nichols (2011) An animal model of schizophrenia based on chronic LSD administration: Old idea, new results. Neuropharmacology 61: 503–512.

96. Medeiros, Rosa, Vissintainer, et al. (2025) Lack of antidepressant-like effect of imipramine, fluoxetine and ketamine in the forced swim test in mice: How much standardization does a positive control require? Eur Neuropsychopharmacol 93: 24–26.

97. Millan, Maiofiss, Cussac, et al. (2002) Differential actions of antiparkinson agents at multiple classes of monoaminergic receptor. I. A multivariate analysis of the binding profiles of 14 drugs at 21 native and cloned human receptor subtypes. J Pharmacol Exp Ther 303: 791–804.

98. Molendijk and de Kloet (2015) Immobility in the forced swim test is adaptive and does not reflect depression. Psychoneuroendocrinology 62: 389–391.

99. Oba, Nakagawasai, Onogi, et al. (2013) Chronic fluvoxamine treatment changes 5-HT(2A/2C) receptor-mediated behavior in olfactory bulbectomized mice. Life Sci 92: 119–124.

100. Osterlund, Overstreet and Hurd (1999) The flinders sensitive line rats, a genetic model of depression, show abnormal serotonin receptor mRNA expression in the brain that is reversed by 17beta-estradiol. Brain Res Mol Brain Res 74: 158–166.

101. Ott and Neely (1980) Entheogenic (hallucinogenic) effects of methylergonovine. J Psychedelic Drugs 12: 165–166.

102. Parati, Zanardi, Cocchi, et al. (1980) Neuroendocrine effects of quipazine in man in health state or with neurological disorders. J Neural Transm 47: 273–297.

103. Pędzich, Medrano, Buckinx, et al. (2022) Psychedelic-induced serotonin 2A receptor downregulation does not predict swim stress coping in mice. Int J Mol Sci 23: 15284.

104. Percie du Sert, Bamsey, Bate, et al. (2017) The experimental design assistant. Nat Methods 14: 1024–1025.

105. Percie du Sert, Ahluwalia, Alam, et al. (2020) Reporting animal research: Explanation and elaboration for the ARRIVE guidelines 2.0. PLoS Biol 18: e3000411.

106. Pereira, Joca SRL, Harvey, et al. (2019) Esketamine and rapastinel, but not imipramine, have antidepressant-like effect in a treatment-resistant animal model of depression. Acta Neuropsychiatr 31: 258–265.

107. Pogorelov, Rodriguiz, Roth, et al. (2023) The G protein biased serotonin 5-HT2A receptor agonist lisuride exerts anti-depressant drug-like activities in mice. Front Mol Biosci 10: 1233743.

108. Pokorny, Preller, Kraehenmann, et al. (2016) Modulatory effect of the 5-HT1A agonist buspirone and the mixed non-hallucinogenic 5-HT1A/2A agonist ergotamine on psilocybin-induced psychedelic experience. Eur Neuropsychopharmacol 26: 756–766.

109. Porsolt, Anton, Blavet, et al. (1978) Behavioural despair in rats: A new model sensitive to antidepressant treatments. Eur J Pharmacol 47: 379–391.

110. Porsolt, Brossard, Hautbois, et al. (2001) Rodent models of depression: Forced swimming and tail suspension behavioral despair tests in rats and mice. Curr Protoc Neurosci Chapter 8: Unit 8.10A.

111. Porter, Pillidge, Tsai, et al. (2015) A lack of functional NK1 receptors explains most, but not all, abnormal behaviours of NK1R-/- mice. Genes Brain Behav 14: 189–199.

112. Pratt, Robinson, Fernandes, et al. (2022) BAP editorial: Improving the validity and translation of preclinical research. J Psychopharmacol 36: 779–780.

113. Rakoczy, Runge, Sen, et al. (2024) Pharmacological and behavioural effects of tryptamines present in psilocybin-containing mushrooms. Br J Pharmacol 181: 3627–3641.

114. Redrobe and Bourin (1997) Partial role of 5-HT2 and 5-HT3 receptors in the activity of antidepressants in the mouse forced swimming test. Eur J Pharmacol 325: 129–135.

115. Richards, Chapman, Goodell, et al. (1958) LSD-like delirium following ingestion of a small amount of its brom analog (BOL-148). Ann Intern Med 48: 1078–1082.

116. RIVER Working Group (2023) Reporting in vitro experiments responsibly – the RIVER Recommendations. Available at: https://osf.io/preprints/metaarxiv/x6aut_v1 (accessed 11 August 2025).

117. Robinson and Riedel (2014) Comparison of automated home cage monitoring systems: Emphasis on feeding behaviour, activity and spatial learning following pharmacological interventions. J Neurosci Methods 234: 13–25.

118. Rowan and Flaherty (1991) Behavior of Maudsley reactive and nonreactive rats (Rattus norvegicus) in three consummatory contrast paradigms. J Comp Psychol 105: 115–124.

119. Sadzot, Baraban, Glennon, et al. (1989) Hallucinogenic drug interactions at human brain 5-HT2 receptors: Implications for treating LSD-induced hallucinogenesis. Psychopharmacology (Berl) 98: 495–499.

120. Salmon and Stanford (1989) Beta-adrenoceptor binding correlates with behaviour of rats in the open field. Psychopharmacology (Berl) 98: 412–416.

121. Schreiber, Brocco, Audinot, et al. (1995) (1-(2,5-dimethoxy-4 iodophenyl)-2-aminopropane)-induced head-twitches in the rat are mediated by 5-hydroxytryptamine (5-HT) 2A receptors: Modulation by novel 5-HT2A/2C antagonists, D1 antagonists and 5-HT1A agonists. J Pharmacol Exp Ther 273: 101–112.

122. Sekssaoui, Bockaert, Marin, et al. (2024) Antidepressant-like effects of psychedelics in a chronic despair mouse model: Is the 5-HT2A receptor the unique player? Neuropsychopharmacology 49: 747–756.

123. Selva, Rodríquez, Tornos, et al. (1989) The ergonovine test: An episode of hallucinations. Rev Esp Cardiol 42: 214–215.

124. Silva and Calil (1975) Screening hallucinogenic drugs: Systematic study of three behavioral tests. Psychopharmacologia 42: 163–171.

125. Slack and Draugalis (2001) Establishing the internal and external validity of experimental studies. Am J Health Syst Pharm 58: 2173–2181.

126. Smith and Peroutka (1986) Differential effects of 5-hydroxytryptamine1a selective drugs on the 5-HT behavioral syndrome. Pharmacol Biochem Behav 24: 1513–1519.

127. Smith, Clutton, Lilley, et al. (2018) PREPARE: Guidelines for planning animal research and testing. Lab Anim 52: 135–141.

128. Somerville and Herrmann (1978) Migraine prophylaxis with lisuride hydrogen maleate—a double blind study of lisuride versus placebo. Headache 18: 75–79.

129. Stanford (2007) The open field test: Reinventing the wheel. J Psychopharmacol 21: 134–135.

130. Stanford (2020) Some reasons why preclinical studies of psychiatric disorders fail to translate: What can be rescued from the misunderstanding and misuse of animal ‘Models’? Altern Lab Anim 48: 106–115.

131. Stanford (2025) From a putative antidepressant to a validated mouse model of ADHD via NK1 receptors. In: Vink (ed.) Substance P From Pain to Cancer. Molecular Mediators in Health and Disease: How Cells Communicate. New York, NY: Academic Press, pp. 525–553.

132. Stanford and Gillman (2010) Risk of severe serotonin toxicity following co-administration of methylene blue and serotonin reuptake inhibitors: An update on a case report of post-operative delirium. J Psychopharmacol 24: 1433–1438.

133. Stenbæk, Madsen and Ozenne (2021) Brain serotonin 2A receptor binding predicts subjective temporal and mystical effects of psilocybin in healthy humans. J Psychopharmacol 35: 459–468.

134. Takaba, Ibi, Yoshida, et al. (2024) Ethopharmacological evaluation of antidepressant-like effect of serotonergic psychedelics in C57BL/6J male mice. Naunyn Schmiedebergs Arch Pharmacol 397: 3019–3035.

135. Titeler, Lyon and Glennon (1988) Radioligand binding evidence implicates the brain 5-HT2 receptor as a site of action for LSD and phenylisopropylamine hallucinogens. Psychopharmacology (Berl) 94: 213–216.

136. Tylš, Páleníček, Kadeřábek, et al. (2016) Sex differences and serotonergic mechanisms in the behavioural effects of psilocin. J Behav Pharmacol 27: 309–320.

137. Viana GSB, Vale EMD, Araujo ARA, et al. (2020) Rapid and long-lasting antidepressant-like effects of ketamine and their relationship with the expression of brain enzymes, BDNF, and astrocytes. Braz J Med Biol Res 54: e10107.

138. Voikar and Stanford (2023) The open field test. In: Harro (ed.) Psychiatric Vulnerability, Mood, and Anxiety Disorders. Tests and Models in Mice and Rats. Totowa, NJ: Humana Press, pp. 9–29.

139. Vollenweider, Vollenweider-Scherpenhuyzen, Bäbler, et al. (1998) Psilocybin induces schizophrenia-like psychosis in humans via a serotonin-2 agonist action. Neuroreport 9: 3897–3902.

140. Walsh and Cummins (1976) The open field test – a critical review. Psychol Bull 83: 482–504.

141. Wang, Robbins, Zhuang, et al. (2025) Early psilocybin intervention alleviates behavioral despair and cognitive impairment in stressed Wistar rats. Prog Neuropsychopharmacol Biol Psychiatry 136: 111243.

142. White, Holohean and Appel (1981) Lack of specificity of an animal behavior model for hallucinogenic drug action. Pharmacol Biochem Behav 14: 339–343.

143. Willner (2017) The chronic mild stress (CMS) model of depression: History, evaluation and usage. Neurobiol Stress 6: 78–93.

144. Willner (1986) Validation criteria for animal models of human mental disorders: Learned helplessness as a paradigm case. Prog Neuropsychopharmacol Biol Psychiatry 10: 677–690.

145. Wojtas, Bysiek, Wawrzczak-Bargiela, et al. (2022) Effect of psilocybin and ketamine on brain neurotransmitters, glutamate receptors, DNA and rat behavior. Int J Mol Sci 3: 6713.

146. Wojtas, Bysiek, Wawrzczak-Bargiela, et al. (2023) Limbic system response to psilocybin and ketamine administration in rats: A neurochemical and behavioral study. Int J Mol Sci 25: 100.

147. Yaden and Griffiths (2020) The subjective effects of psychedelics are necessary for their enduring therapeutic effects. ACS Pharmacol Transl Sci 4: 568–572.

148. Yamada, Sugimoto and Horisaka (1988) The behavioural effects of 8-hydroxy-2-(di-n-propylamino)tetralin (8-OH-DPAT) in mice. Eur J Pharmacol 154: 299–304.

149. Yamaguchi, Kaneko and Narukawa (2021) Approval success rates of drug candidates based on target, action, modality, application, and their combinations. Clin Transl Sci 14: 1113–1122.

150. Yamamoto and Ueki (1975) Behavioral effects of 2,5-dimethoxy-4-methylamphetamine (DOM) in rats and mice. Eur J Pharmacol 32: 156–162.

151. Zhou, Wang, Cao, et al. (2025) Repeated high-dose esketamine in early postnatal rats leads to behavioural deficits with long-term modifications in white matter microstructural integrity. Brain Res 1847: 149311.

Appraisal of the validity of preclinical investigations to predict the efficacy of psychedelic drugs as fast-acting antidepressants
