Abstract
Idealization is ubiquitous in human cognition, and so is the inclination to be puzzled by it: what to make of ideal gas, infinitely large populations,
Keywords
1. Introduction
Mass point, frictionless plane, vacuum, ideal gas, infinitely large populations, immortal humans, perfect competition, perfectly informed agents, zero transaction costs, and so on. Across the history of natural and social science, idealizations of various kinds have prospered, and it appears science has made a great deal of progress thanks to—or despite—them. Yet, there is persistent puzzlement about what to make of idealizations simply because they appear to be in stark conflict with the facts as we know them and because their useful functions are seldom fully transparent. Confusion and puzzlement have occasionally given rise to heated debates.
In what follows, I will examine these puzzlements in the social sciences, through authors such as J. H. von Thünen, J. S. Mill, Max Weber, Milton Friedman, and Thomas Schelling; in contemporary political philosophy and the recent methodological debates therein around “ideal and non-ideal theory” since John Rawls and others; and in contemporary philosophy of science and the massive investigation into scientific modeling. There are interesting affinities and variations between these conversations as well as obvious opportunities for them to learn from one another.
My overall goal is to improve the transparency of idealizations in use. I will not argue for or against any particular idealization, but will outline a framework—a few organizing concepts and general principles—within which debates over any idealization can be fruitfully carried out. This requires clarifying the very idea of idealization, the alternative intended
2. What to Make of Extreme and Obvious Falsehood
I will provide some opening observations about the key components in the conundrum. I am suggesting that idealization is ubiquitous, and so is puzzlement about it. We need to be clear about what we are talking about.
2.1. Puzzlement
On the one hand, there is puzzled
In judging that an idealization does not get the facts right, one may draw from a variety of background information. In much of social science, commonsense observation plays major roles. Almost anyone, regardless of educational background, can tell that market agents are not perfectly informed lightning calculators, just based on one’s experience with human action and interaction, or on general cultural understanding of human life (cf. the idea of “phenomenological pressure” in Mäki 2013, 89-90). In contrast, it takes a little education in physics to know that nothing travels faster than light and that therefore the idealized concept of rigid body—that assumes that force is transmitted from one part of the body to another in no time—gets the facts wrong.
The puzzled judgment comes with various intensities. They may range from caution to suspicion to outright repudiation. More weakly, one may merely have stronger doubts about a theory or model that employs unadorned idealizations than about one that does not. More strongly, one may disallow idealizations as unacceptable. In this vein, Peter Blau (1956, 35) stated, in the context of his account of bureaucracy, that “[s]ince generalizations about idealized states defy testing in systematic research, they have no place in science.” In the context of political philosophy, Onora O’Neill (1987, 56; italics in the original) finds it objectionable that in idealization “much (too much) that is false of human agents is
On the other hand, there are puzzled
The recommendable attitudes should derive from selective judgment that would be passed largely on a case-by-case basis. This will require understanding what each particular idealization says and does as well as what standards would apply in assessing its capacities for promoting the goals of inquiry. It is the purpose of this article to facilitate such judgments.
2.2. Idealization
Next consider what these hesitations, resistances, and defenses are about. What is an
As I see it, idealization can be taken to have distinct features that dissociate it from the above two characterizations. First, if taken as factual propositions, idealizations are not just false, but deliberately so, associated with an
Second, idealizations do not appear to be just false, but their ideality is also connected to some sort of
In some cases, ideality is conceived normatively. It designates normative ideals that are to be recommended and pursued. Their desirability may have moral, political, aesthetic, or other non-epistemic origins. Idealizations in political philosophy are often intended to articulate normative ideals.
2.3. Why Idealize?
Many authors consider idealization a matter of self-evidence in first-rate science. Doing science in accordance to what Leszek Nowak (1980) calls the “idealizational conception of science” is taken by him to have been the best practice from Galileo and Marx and beyond. 2 Yet in scientific practice itself, puzzlement, confusion, and diversity of opinions prevail. The issues are far from settled, so we need to go on asking questions and trying to answer them. What to make of idealization? Why idealize? Let me provide a first quick and rough idea of the reasons and possible justifications for using idealizations in our epistemic pursuits. There are at least three sets of such reasons (cf. Potochnik 2017):
The world is immensely rich in attributes and complex in relationships.
Our cognitive capacities are limited.
Our cognitive interests are multifarious.
Reasons 1 and 2 alone imply the necessity to simplify and idealize in our cognitive ventures. My main focus in this essay will be on Reason 3, itself sufficient for the importance of idealization or its equivalents for our epistemic aspirations.
2.4. Functions of Idealization
I will next provide some preliminary elaboration of Reason 3. The key is to understand the
On Step 2, we ask what [1] Idealizations may serve the function of [2] Idealizations may help envisage [3] Idealizations may be used for exploring [4] Idealizations may serve the function of exploring
2.5. Contents of Idealizing Assumptions
It is a mistake to treat idealizations as (false)
None of these make the (false) claim that an economy is closed. These are alternative altered versions of a straight idealization that assumes some extreme perfection. These altered versions can be used for explicating the functions of idealizations and thereby for alleviating puzzlements around them. Note that a straightforward idealization, if interpreted literally and considered a truth-apt claim, is known to be false, while these altered versions have a chance of being true (Mäki 2012; Musgrave 1981).
2.6. Puzzlements Again
We can now be a little more specific about the nature of puzzlements about idealization. There are at least three sources of puzzlement. First, one may not have understood the
The sections to follow aim to offer tools for improving the transparency of idealizations in terms of their contents and functions. Note that idealizations, supposing they are explicitly stated or at least available for being explicitly stated, are relatively transparent themselves—especially when contrasted with the massive amount of silent and implicit omission that is unavoidable in modeling. However, idealizations as explicitly stated are not transparent enough, since their formulations typically do not reveal their intended (or required) contents and functions. Their transparency needs upgrading.
3. Minimal Model Idealization
A major function of idealization is to help to isolate a minimal causal fragment embedded in a larger causal structure. 3 Some of the more puzzling and controversial cases belong to this category.
3.1. J. S. Mill 1836
J. S. Mill’s ([1836] 1844) famous thoughts about how political economy treats human behavior exemplify this function. Only a small selection of behavioral dispositions is isolated by economic theory: It does
Economic theory thus isolates the rational maximization of wealth (together with two counteracting dispositions, namely aversion to labor and desire of present enjoyment) from all other motives that are implicitly assumed to have zero strength. One can then draw inferences about what happens in such an imagined economic world. In response to suspicions about employing such a narrow view of human motivation, the obvious justification is to argue that the isolated dispositions are the most powerful and stable ones in the economic context. This was Mill’s own (realist) line. We can put it in terms of
3.2. Milton Friedman 1953
It is not always appreciated that Milton Friedman’s (1953) confused defense of the maximization assumption, more than a century later, is similar to that of Mill. Friedman defended the assumption against criticisms based on empirical research that had concluded that business firms do not actually seek to maximize their expected returns. He famously generalized this into an encompassing argument that the unrealisticness of assumptions in economic theory does not matter. This has been interpreted by many commentators to imply an instrumentalist, anti-realist conception of theory and model. I have argued this is a mistake: whatever Friedman’s authentic philosophical outlook may have been, the essay (F53 for short)
A meaningful scientific hypothesis or theory typically asserts that certain forces are, and other forces are not, important in understanding a particular class of phenomena. (F53, 40)
This is in line with suggesting that unimportant forces are to be idealized away. The falsity of such idealizations could be accompanied by a true negligibility claim: those forces are there, but they make a negligible impact on the phenomena under study, thus idealization is justified. Here is another rendering: It is frequently convenient to present such a hypothesis by stating that the phenomena it is desired to predict behave in the world of observation
It is another mistake to believe that once an
However, F53 is quite confused in its defense of unrealistic assumptions. One example is the conflation between nothing but the truth and the whole truth in envisioning “a completely ‘realistic’ theory of the wheat market” (F53, 32) against critics of the maximization assumption (see Mäki 2009, 97-98). Another comes from the use of a presumed analogy with assuming a vacuum in Galileo’s law of freely falling bodies. Analogies often illuminate, but this one obscures (see Mäki 2009, 99-101). F53 argues that if a vacuum is fine, then so is profit maximization. However, a vacuum (air pressure = 0) is an idealization with functions similar to those of assuming magnetic forces = 0, all other forces = 0, and the radius of the Earth = ∞ (flat Earth) in the case of Galileo’s law, and assuming that the strength of all other motives except the maximization motive = 0 in the case of theory of competitive firms. The adequate analogy that is left unexamined by F53 is between profit maximization and gravity, both isolated by idealizing other factors away.
Galileo’s law is useful for illustrating the roles of negligibility and applicability considerations. F53 does not employ these labels, but it provides arguments that manifest the concepts. F53 says Galileo’s law is mostly applicable to falling cannon balls in atmospheric conditions, but not to falling feathers. The reason is that the deviation of the actual behavior of the cannon ball from what Galileo’s law predicts is negligible for many purposes, whereas the deviation in the case of a feather is not. This is to say the falsity of the vacuum idealization about actual atmospheric conditions is negligible in some cases (to which it is then applicable) but not in others.
3.3. J. H. von Thünen 1826/1842
J. H. von Thünen’s famous and still influential model of the isolated state not only isolates the state from the rest of the world, but it also isolates a small mechanism that is believed to make a major impact on the distribution of land use (see, for example, Mäki 2011). The model is dependent on many highly idealizing assumptions: the area is flat; the soil is uniform in fertility; the area is uniform in climate; there is just one point-like Town in the middle of the area; the agents are perfectly informed rational maximizers; and so on. 5
Given these assumptions it is possible to derive a neat land-use pattern of concentric rings, the famous Thünen rings. The function of the various idealizing assumptions is that of controlling for several causally relevant factors (such as mountains, rivers, fertility and climate differences, other towns, biases in decision-making), excluding or neutralizing them, cleaning them away, as it were. Thünen thereby prepares a purified stage where just one purified mechanism shapes the outcome. Idealizing assumptions help isolate a major mechanism and its characteristic way of operation. This is the
It is remarkable that Thünen himself was not in the least confused about the method he applied. He was aware that his idealizations are false about the world, but he argued that they “are a necessary part of my argument, allowing me to establish the operation of a certain factor, a factor whose operation we see but dimly in reality, where it is in incessant conflict with others of its kind” (von Thünen 1966, 3-4). By using his method (of isolation) “we may divest an acting force of all incidental conditions and everything accidental, and only in this way can we recognize its role in producing the phenomena before us” (von Thünen 1910, 274; my translation).
4. Benchmark Model Idealization
Minimal models of the sort discussed above highlight the causal contribution of a causal unit to the emergence or existence of a phenomenon, using idealizations for neutralizing other causal factors. By contrast, using models as benchmarks highlights the way in which such idealized factors, once released from control, play their causal roles. This can mean several different things.
4.1. Thünen Again
The Thünen model can again be used to illustrate. In the benchmark, the Thünen Mechanism, when acting alone, shapes the outcome pattern in the model, that of concentric rings (Figure 1, left). This is not sufficient for explaining the far less neat and regular patterns in the actual world (illustrated by an image of the larger Beijing area in Figure 1, right). The causal contribution of the Thünen Mechanism to shaping the actual pattern is modified by several other causes and conditions that can be mobilized in explaining the actual pattern by relaxing the idealizing assumptions that were used for neutralizing their impact in the simple model that isolated the Thünen Mechanism from everything else. The model is now de-isolated by de-idealization to let multiple other factors play their causal roles in shaping the outcome—such as rivers and mountains, several urban concentrations and their internal structure, various other heterogeneities as well as limitations in decision-makers’ rationality.

Thünen model applied, without and with relaxed idealizations.
The whole empirical pattern is the
In a different strategy, one uses the Thünen model as a benchmark for identifying deviations from it in the actual world, and then sets out to explain those deviations by relaxing the idealizations of the model. The Thünen Mechanism is no more invoked in these explanations (Figure 2).

Thünen model used as benchmark, deviations explained.
Here
We may put the difference between minimal model idealization and benchmark model idealization in terms of explanatory burden as follows. In the former,
4.2. Max Weber and Ideal Types
Puzzlement reigns around the notion of ideal type. Max Weber himself was famously far from clear about what ideal types are. He did not provide one elaborate and unambiguous definition of the concept, and his examples of ideal types (bureaucracy, capitalism,
It seems ideal types in Weber’s sense are
As to “ideal” in “ideal type,” there is no one clear-cut definition that we can find in Weber, but he mainly seems to think of it in terms of abstraction and accentuation (
This calls for some scrutiny. Abstracting common features of a number of instances can be taken to yield generic types that are exemplified by those instances; this would be in line with ordinary usages of “type.” This feature of types (i.e., that they are exemplified by token instances) then vanishes at the next step. This step is
I have two quick comments on this. First, we might conclude that Weber’s thinking has
Second, now focusing on “ideality” alone, it does not seem to be entirely clear that Weberian ideal types are idealizations, at least in our narrow sense. It seems Weber thinks ideality derives from
Having made some remarks about the puzzling notion of ideal type itself, I next suggest what seems an obvious way in which ideal types are supposed to be
Weber implies that ideal types
The key point is that ideal types are supposed to be compared to actual situations, and it is not until these comparisons have been made and deviations have been identified that explanatory activity may start. For example, the idealizing assumptions of rational action in theoretical economics are indispensable means for explanatory purposes: “Only in this way is it possible to assess the causal significance of irrational factors as accounting for the deviations from this [ideal] type” (Weber 1947, 92). All errors and emotional influences can be seen as
These observations should go at least some way toward removing puzzled confusions about how to use and assess Weberian ideal typical accounts. For example, Weber’s ideal model of bureaucracy has been criticized unduly for misrepresenting actual bureaucracies by distorting or missing various important details in their structure, functioning, and context. This is a mistake about applicability conditions. As Lamond (1990) reminds us, Weber’s model is not intended for being used in a microanalysis of organization, but rather in a comparative—cross-cultural and historical—analysis of the organization of society as a whole.
5. How-Possibly Model Idealization
Minimal models typically deal with how-actually cases. Even though such models do not cover the whole range of causes and conditions that jointly bring about the phenomenon to be explained, they are often used to suggest that the isolated cause actually contributes to its emergence or existence. The epistemic ambitions may also be more modest, such that only how-possibly explanations are pursued.
Economic model building often explores possible causal scenarios. One first recognizes a phenomenon, such as a pattern of behavior. One then proceeds abductively, asking and attempting to answer the question, “What mechanism(s) could have generated this pattern?” In answering the question, one proposes a model that accounts for the pattern by isolating a possible mechanism that could be causally responsible for the pattern. It is not established or even proposed as the mechanism that
While there might be just one correct how-actually explanation (given the precise explanatory question that has been asked), there may be several correct how-possibly explanations (this is one justification for entertaining multiple models at the same time). For the latter to be judged correct, they must meet some constraints. Not just any imaginative logical possibility will do. The space of possibilities may be constrained by what is known about the nature of the real world, including natural, cognitive, and social facts. This delimits the set of hypothetical scenarios that are considered worthy of further examination.
Historical examples in social science include Carl Menger’s “conjectural history” of the emergence of money—as generally accepted medium of exchange—from barter (Menger 1892; see Aydinonat 2008). We know that such a medium exists, and Menger provides a simple how-possibly explanation for its emergence. It is based on an idealized model that excludes everything but separate self-interested economizing actors in the market, engaged in direct exchange of goods with varying salability. In this idealized world, devoid of things such as some other institutions, government activity, and cultural meanings of money, money emerges as an unintended consequence of separate individuals’ interactions. It is not necessarily argued that this is how money has historically emerged as a matter of actual fact, just that this is how money could possibly emerge. This is supposed to contribute to our understanding of the nature of money.
More recent examples include economic models of science (Mäki 2005a) and Thomas Schelling’s models of segregation (Aydinonat 2007, 2008; Grüne-Yanoff 2009; Mäki 2009; Schelling 1978; Verreault-Julien 2019). Schelling’s models have been much discussed recently, so I can briefly focus on just their function as how-possibly models.
Schelling’s model is an agent-based model that traces processes that can generate segregated social structures. In one racial housing market version, the modeler starts with a random distribution of housing between blacks and whites. One then lets the residents follow rules governed by their (non-racist) preferences for their neighbors’ skin color in their residential location decisions. Finally, racial segregation will emerge in the model city. This is presented as an unintended (and unwanted) consequence of interactive individual decisions.
Again, various idealizations contribute to cleaning the stage for the simple invisible-hand mechanism to produce the segregated outcome in isolation. It may be assumed that the city is spatially homogeneous and closed; that is, there are no neighborhoods in the city and no moves to and from the city. Agents have uniform preferences and behave rationally. Agents make decisions about whether to move or stay based on just their preferences about their immediate neighbors, thus either have no other preferences or their other preferences have no influence on moving decisions. Agents move from one slot to another at no cost. Information about a slot left behind by an agent is immediately available to all agents. There are no differences in housing prices and budget constraints. Socioeconomic factors play no role. No external interference, such as by zoning authorities, shapes the process.
We know the world is not like that. Yet, the model might be informative about a possible way in which segregation emerges—or perhaps rather, about a possible contribution that something like this invisible-hand mechanism makes to the outcome. Whether it is informative about such a possibility is not dependent on whether it provides a complete story about how the segregated outcome actually comes about generally or in any particular case. Removing any possible confusions about these functions has implications for how Schelling’s models are to be appropriately critically assessed.
It is remarkable that how-possibly models may have policy relevance. The Schelling case illuminates. First there is the observed fact that racial segregation obtains. Second, we have a typical commonsense explanation: segregated structures obtain because people want to live next to their “own race.” Third, an alternative how-possibly explanation is given by the model: the segregated structure is a possible unintended consequence of many alternative preferences, racist and non-racist alike. Fourth, a political conclusion transpires: rather than simply “let people have what they want” there may be possible grounds for a political intervention, by way of housing and zoning policies for example. Indeed, exploring possible causal scenarios with practical relevance is yet another possible function of idealizations.
6. Normative Ideal Idealization
We now come to our final example, one that involves normativity. In post-Rawlsian political philosophy, there is a methodological debate under way, criticizing or justifying what they call ideal theory in contrast to non-ideal theory. Some of the key concepts—including those of ideal and non-ideal theory—tend to be ambiguous (see, for example, Hamlin and Stemplowska 2012; Valentini 2012). Heightened puzzlement reigns. We need to ask questions about the concept of ideal theory itself, as well as about the functions of idealization in ideal theory. The issues are vast and tangled, so only a few brief and comparative opening observations can be submitted.
In putting forth his “ideal theory,” Rawls sets the stage for dealing with the problem of social justice by assuming a society that is well-ordered and self-contained, 7 that its population consists of physically and mentally fully capable adults, and that everything takes place under “reasonably favorable” natural and historical conditions, such as in the absence of wars and natural disasters. Importantly, the agents are assumed to strictly comply with his principles of justice. Ideal theory “assumes strict compliance and works out the principles that characterize a well-ordered society under favorable circumstances” (Rawls 1971, 245).
I have two observations to make about full or strict compliance, the key idealization to assume away any motivational, cognitive, and behavioral deficiencies, required for isolating the fundamental principles of justice. First, regarding the contents of the idealization, saying that such weaknesses must be excluded when they are “beyond reasonable extent” (Valentini 2009, 339) is to pass a negligibility judgment, an altered version of the idealization of full compliance. This is the claim that any such actual weakness
Second, one may be puzzled about the [I]f we compare the operation of societies ordered by competing principles of justice while assuming strict compliance with those principles, the different effects we observe can reasonably be taken to be wholly the responsibility of the different ordering principles themselves. So our comparison turns out to be quite strictly a comparison only of the principles of justice.
This can be put as follows. By neutralizing the effects of any varying and hard-to-measure degrees of non-full compliance, we can focus on examining the effects of the ordering principles without the interference of failures to fully comply. This manifests an “experimental” use of minimal model idealization. On the other hand, if this procedure is not applied, and the degree of compliance below perfection is not neutralized for its effects, we lose the prospect of isolating the causal effects of other factors as they are intermingled with those of imperfect compliance. On top of possibly unfavorable conditions, these other factors include other valuable goals such as peace, security, order, legitimacy, and democracy. An idealized theory of justice identifies and examines principles of justice in isolation from all other normative principles. 8
The above does not exhaust the functions of ideal theory. Normativity is also involved. Ideal theory provides a conception of a just society that we are to achieve if we can. Existing institutions are to be judged in the light of this conception. (Rawls 1971, 246)
This evokes two normative functions of the ideal. One is a
The other normative function is
Non-ideal theory exhibits characteristics of later-step models in scientific modeling, those constructed by relaxing one or more of the idealizations of ideal theory to explain further features in the
These comparative observations have illuminated some of the similarities and differences between idealization in scientific modeling and in political philosophy. Admittedly, we are far from having removed all the puzzlements and confusions—but luckily that particular ideal end-state was not our goal, so it is enough if we have managed to step on the non-ideal path of transition toward it! 10
7. Conclusion
Idealization is a ubiquitous, powerful, and risky cognitive instrument. That there are suspicions and confusions about idealizations in general or about particular idealizations is unsurprising given that they appear to conflict with known facts and that it is often unclear what their point is.
I have proposed alleviating these puzzlements by drawing attention to the variety of claims that idealizations can be used to make (about negligibility, applicability, early-step status, etc.) as well as to the variety of functions that idealizations can serve (such as minimal, benchmark, and how-possibly modeling). I am not pretending to have identified a full list of possibilities, but the main point is that any particular idealization is to be assessed against some such decent function, not solely in terms of its truthlikeness or the like. This should reduce puzzlement and improve the quality of critical conversation on idealizations. Particular idealizations may go astray, including those that I have used as illustrations in this article. But an idealization with this or that content fails or succeeds in relation to this or that function; therefore, we need to get those contents and functions clear to be able to determine whether an idealization does its job or fails to do so.
Footnotes
Acknowledgements
The article derives from a keynote lecture delivered at the Asian Conference on the Philosophy of the Social Sciences, held at Nankai University, Tianjin, China, on June 1-2, 2019. The paper has also been presented at GATE, University of Jean Monnet, Saint-Etienne, France; and at TRIANGLE, University of Lyon 2, France. Thanks for helpful comments go to the participants of these events and to two anonymous referees.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Funding from Nankai University is gratefully acknowledged.
