Meaningful measurement requires substantive formal theory

Abstract

In this article, we take the opportunity to elaborate on some aspects of our article “Squaring the Circle: From Latent Variables to Theory-Based Measurement” (Borgstede & Eggert, 2023) that gave rise to the concerns uttered by Hasselman (2023) and Slaney (2023), and to clarify why we think that theory-based measurement is indeed necessary and sufficient for the establishment of meaningful psychological measurement procedures. Moreover, we will illustrate how theory-based measurement might be accomplished in psychology by means of an example from behavioral selection theory.

Keywords

covariance based law of effect formal theory measurement in psychology multilevel model of behavioral selection theory-based measurement

It’s hip to be square—Towards a metrology of psychology

In his comment, Hasselman (2023) generally seems to agree with our position in Borgstede and Eggert (2023) that substantive formal theory is necessary to justify meaningful measurement. However, he does not agree with our claim that substantive formal theory is also sufficient for meaningful measurement. Hasselman builds his position on a formal account of the measurement process, where measurement is construed as the identification of the state of a system by means of a measurement device. In classical physical measurement (ideal measurement), the value provided by the measurement device is identical to the state of the system. However, there are instances of measurement that involve a nonnegligible interaction between the system and the measurement device. A prominent example of such nonnegligible interaction is given by quantum measurement, but analogous examples can even be found in classical physics (e.g., using a thermometer that has been cooled down in a freezer prior to the measurement of temperature; Hasselman, 2023, pp. 147–148). Hasselman (2023) seems to think that we construe psychological measurement as ideal measurement whereas he himself argues that psychological measurement almost inevitably involves nonnegligible interactions between the system and the measurement device (p. 148). Hasselman (2023) claims that prior to substantive theory building in psychology, a general theory about these nonnegligible interactions is needed. Therefore, he concludes, substantive formal theory cannot be sufficient for measurement in psychology (p. 146).

In fact, we do not think that psychological measurement is restricted to ideal measurement. However, contrary to Hasselman (2023), we conceive the measurement device to be an integral part of the system one wishes to describe. For example, a beam scale only provides a measure of mass because the act of putting objects in the pans until an equilibrium is reached can be described using the theory of classical mechanics. In other words, measurement is not something external to the system. Therefore, our account of theory-based measurement naturally entails what Hasselman calls a “theory of measurement.” The same holds for Hasselman’s temperature example where the measurement device (the cooled down thermometer) and the system (water in a cup) interact with regard to the temperature during the act of measuring (i.e., the thermometer changes the temperature of the water and vice versa). In our framework, the thermometer and the water are both physical objects and their interaction is guided by the same theory (thermodynamics) that provides the semantics of the attribute of interest (temperature). Therefore, what Hasselman calls “theory of measurement” is only a special application of the kind of substantive formal theories that are required for theory-based measurement (cf. Holzhauser & Eggert, 2019).

Hasselman (2023) correctly points towards the problem that most psychological assessment procedures, such as rating scales or standardized tests, lack a substantive formal theory of the processes that generate the behavior of an individual in such contexts. In our view, most psychological assessment procedures are in fact interpersonal (mostly verbal) interactions. Consequently, if we attempt to measure anything with an ordinal rating scale, we would indeed need a theory of verbal interaction that covers the usage of rating scales. Although this may seem to imply a special measurement theory for psychology, we think that the usage of rating scales itself lies in the domain of psychology. Therefore, substantive formal theory is in fact not only necessary, but also sufficient for measurement in psychology. Explain behavior and eventually you will end up with a theory that (among many other phenomena) explains the interactions of humans with psychological assessment devices. Such a theory will certainly be very different from the formalisms employed in psychometrics, partly because these formalisms build on questionable assumptions—such as the ergodicity property, as pointed out by Hasselman (2023, p. 149). More importantly, however, we cannot expect latent variable modelling (LVM) to solve the problem of psychological measurement because latent variables neither account for the social interactions involved in psychological assessments, nor for the phenomena they seek to explain.

Going around in circles—Theory, folk psychology, and the meaning of “meaning”

In her commentary, Slaney (2023) proposes that scientific terms cannot be independent from common language terms because “A theory from which clear hypotheses can be derived and tested presupposes that the meanings of the concepts out of which the theory is built are already meaningful” (p. 140). Taking the position that the meaning of scientific terms depends on the meaning of the common language terms from which they are derived, Slaney argues that the framework of theory-based measurement cannot provide meaning to psychological concepts unless the corresponding common language terms are already meaningful. Building on this premise, Slaney (2023) points to several apparent problems in our article concerning the meaning of the word “meaning” (pp. 139–140). She references Wittgenstein’s account of meaning as the use of a concept in context (Wittgenstein, 1953/1989) as opposed to what she considers to be our position.

In fact, our account is totally in line with Wittgenstein’s account of meaning. In our view, scientific theories are just a special kind of language game, and as such, not unlike common language grammars. However, in contrast to the meaning of a term in common language, which is usually characterized by a collection of different contexts, each providing its own rules for the use of the word that may in part overlap with one another, a formal scientific theory has very clear boundaries. Within the theory, the grammar of the theoretical terms is in fact completely determined by the theoretical principles (i.e., by scientific laws). In other words, a formal theory is a language game with exceptionally strict rules. For example, in classical mechanics, force is exactly the product of mass and acceleration (and nothing else!). This is what we mean when we say that formal theory “provides meaning” to theoretical terms. It does so in the same way that common language grammars provide meaning—but in a more restrictive way. In other words, whereas common language concepts often have several meanings that only bear some degree of similarity (family resemblance, in Wittgenstein’s terminology), formal scientific concepts are unambiguous. We further argue that only concepts with an unambiguous meaning in this sense are measurable. Hence, we do not “conflate the meaningfulness of a (theoretical) concept with the measurability of the attribute” (Slaney, 2023, p. 140), we only point to the fact that measurement requires unambiguous grammars for the underlying theoretical terms.

What then is the relation between common language terms and scientific terms? According to Slaney (2023), scientific terms are defined on the grounds of common language terms (p. 140). This position clearly adheres to the syntactic view of scientific theories, which we explicitly reject. In the syntactic view, scientific terms are defined by means of correspondence rules that relate the abstract terms used in the theory to common language terms (Carnap, 1974). In this view, scientific terms are essentially reducible to common language terms. Consequently, it seems that scientific theory can never generate new concepts or provide meaning to theoretical terms that would be independent of common language concepts (Slaney, 2023, p. 143). We formalized such reduction relations between abstract concepts and common language terms in Buntins et al. (2016), with the result that, although they might justify psychological testing procedures on pragmatic grounds, they cannot provide a sound foundation for scientific theory construction. We further argued in Buntins et al. (2017) that the attempt to ground substantive psychological theory on formalized common language concepts introduces the (pseudo-)problem of validity, which stems from the unavoidable deviations between such formalized concepts and their common language counterparts.

In the semantic view, these problems dissolve because the semantics of the theoretical terms are given by the theory net of all scientific laws that are relevant to the use of the terms (Balzer et al., 1987) which themselves constitute a language game. Following the semantic view, the relation between theoretical terms and common language terms is mediated by the fact that different language games may be connected to the same empirical phenomena. For example, we may say that a car “quickly drives around a corner” (common language term), while a physicist may say that the same car’s movement is “subject to angular acceleration“ (scientific term). Both terms can be used to describe the same phenomenon, but they follow different grammars (i.e., they are part of different language games). The relation between the two, therefore, is not conceptual (or definitional), it is indeed empirical in that the usage of the terms is constrained by the phenomena they seek to describe. Moreover, the grammar of the term “angular acceleration” is so unambiguous that we can actually calculate the angular acceleration from the trajectory of the car—that is, because of the strict grammar imposed by the theory of classical mechanics, angular acceleration is a measurable attribute whereas “quickly around a corner” is not.

Having clarified these misunderstandings, the question remains: what kind of theory could possibly be the starting point for theory-based measurement in psychology? We think that our own theoretical developments in the context of behavioral selection theory (Borgstede, 2020; Borgstede & Eggert, 2021) may well illustrate how we envision the development of theory-based measurement in psychology. Behavioral selection theory applies to a domain of phenomena that involve context-dependent changes in individual behavior as a result of environmental consequences. In common language, we describe such phenomena using terms like learning, experience, or trial-and-error, each being part of a language game that imposes a certain grammar (i.e., a common language meaning) to the corresponding terms. Although the meaning of a term like learning is clear enough in many contexts, it is not well-defined in the sense that it allows one to identify, say, the amount or effectiveness of a learning in a specific context. In other words, the common language term does not allow for measurement.

In our own work, we provide a formal theory of behavioral selection that builds on the abstract Price equation (Price, 1970). The Price equation gives a formal definition of selection that is independent of the specific domain of application. Applying this definition to the domain of individual behavior, Borgstede and Eggert (2021) derive a fundamental principle of behavior, the covariance based law of effect (CLOE), within a substantive formal theory, the multilevel model of behavioral selection (MLBS). Figure 1 illustrates a simplified theory net that builds on the CLOE as a fundamental principle (cf. Borgstede & Luque, 2021). As the core of the theory, the CLOE specifies the grammar of the basic terminology that we apply to describe the phenomena involved in behavioral selection. Note that the CLOE is not a restatement of or an abstraction from common language terms like learning or trial-and-error. It provides a formal definition of selection ∆_s in terms of a weighted covariance between behavior $b$ and fitness predictors $p$ : $Δ_{S} b = β_{w p} Cov (b, p)$ . Within the theory of behavioral selection, the meaning of the term ∆_sb is given exclusively by the CLOE and only connected to common language terms such as learning due to the fact that we use them to describe one and the same phenomenon: behavior change due to environmental consequences. We thus have an unambiguously defined theoretical term, behavioral selection, that is defined by a fundamental scientific principle in the context of a larger theory net. As such, the concept of selection is neither reducible to common language terminology, nor does it presuppose any of the psychological concepts it seeks to replace as argued by Slaney (2023, p. 140). At the same time, the MLBS provides a grammar for the underlying concepts that is so unambiguous, that selection can actually be calculated from the observed behavior of an individual when the environmental feedback is known (Borgstede & Anselme, 2022). In other words, the amount of behavioral selection in a certain context can be measured using the conceptual framework of the MLBS whereas the common language concept of learning is not measurable. Of course, if one is interested in the referents of such common language concepts (e.g., learning, experience, or intention), one might be inclined to conclude that psychological measurement is not possible at all (Slaney, 2023, pp. 142–143). However, we think that the primary objective of psychology is not how people talk about psychological phenomena, but what the actual principles underlying these phenomena are.

Figure 1.

Simplified theory net for behavioral selection theory. At the top of the hierarchy stands the fundamental principle, the covariance based law of effect (CLOE). The subordinate laws are specializations of the CLOE that restrict the class of structures described by the abstract principle. For example, residual (i.e., non-selection) change is further speciated by the law of adjunctive behavior and the law of induction. Similarly, behavioral selection is further speciated by applying it to different schedules of reinforcement, such as the feedback obtained from ratio schedules or interval schedules. The combined application of these laws yields the formal framework for modeling a specific behavioral experiment (see Borgstede & Luque, 2021 for a detailed exposition).

Footnotes

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

ORCID iDs

Matthias Borgstede

Frank Eggert

Author biographies

Matthias Borgstede is a senior researcher at the University of Bamberg. His research focuses on the methodological foundations of measurement and the development of formal theory in psychology from an evolutionary perspective.

Frank Eggert is a professor of Research Methods and Biopsychology at the University of Braunschweig. His primary research interests are methodological problems of psychology, and the theoretical integration of behavioural psychology and behavioural ecology.

References

Balzer

Moulines

C. U.

Sneed

J. D.

(1987). An architectonic for science: The structuralist program. Reidel.

Borgstede

(2020). An evolutionary model of reinforcer value. Behavioural Processes, 175, Article 104109. https://doi.org/10.1016/j.beproc.2020.104109

Borgstede

Anselme

(2022). A new method to quantify the causal effects of reinforcement in terms of behavioral selection. bioRxiv Preprints. https://doi.org/10.1101/2022.07.22.501082

Borgstede

Eggert

(2021). The formal foundation of an evolutionary theory of reinforcement. Behavioural Processes, 186, Article 104370. https://doi.org/10.1016/j.beproc.2021.104370

Borgstede

Eggert

(2023). Squaring the circle: From latent variables to theory-based measurement. Theory & Psychology, 33(1), 118–137. https://doi.org/10.1177/09593543221127985

Borgstede

Luque

V. J.

(2021). The covariance based law of effect: A fundamental principle of behavior. Behavior and Philosophy, 49, 63–81. https://behavior.org/wp-content/uploads/2022/01/BP-v49-Borgstede2_corrected.pdf

Buntins

Eggert

(2016). Psychological tests from a (fuzzy-)logical point of view. Quality & Quantity, 50(6), 2395–2416. https://doi.org/10.1007/s11135-015-0268-z

Buntins

Eggert

(2017). Clarifying the concept of validity: From measurement to everyday language. Theory & Psychology, 27(5), 703–710. https://doi.org/10.1177/0959354317702256

Carnap

(1974). An introduction to the philosophy of science. Dover Publications.

10.

Hasselman

(2023). Going round in squares: Theory-based measurement requires a theory of measurement. Theory & Psychology, 33(1), 145–152. https://doi.org/10.1177/09593543221131511

11.

Holzhauser

Eggert

(2019). The role of measurement in theorising about the world. Social Science Information, 58(2), 301–326. https://doi.org/10.1177/0539018419860082

12.

Price

G. R.

(1970). Selection and covariance. Nature, 227(5257), 520–521. https://doi.org/10.1038/227520a0

13.

Slaney

K. L.

(2023). Why force a square peg into a round hole? The ongoing (pseudo-)problem of psychological measurement. Theory & Psychology, 33(1), 138–144. https://doi.org/10.1177/09593543221128522

14.

Wittgenstein

(1989). Philosophical investigations (3rd ed., Reprint of English text with index). Blackwell. (Original work published 1953)