Abstract

A key but neglected issue in the search for collective intelligence principles is the role of noise. Does noise inhibit collective intelligence or can it amplify the discovery of intelligent solutions? In this exchange of letters, the authors explore the pros and cons of noise. —Jessica Flack, Co-Editor-in-Chief
Benefits of Noise in Collective Systems: the Four-fold way
David C. Krakauer and David Wolpert
Dear Drs. Kahneman, Sibony, and Sunstein,
We read your book and letter with great interest. You make a very compelling case for increasing awareness of the negative consequences of noise, especially in socio-political-economic decision making. This helps to balance all of the attention that has traditionally been paid to issues of bias. Your most convincing arguments relate to the why and how reducing noise leads to improved outcomes across numerous domains of individual human judgment, from medicine to the law.
As we read you, “bias” describes how far a prediction is from the average value of a quantity. The perennial optimist has positive bias, and the incorrigible pessimist has negative bias, but both are biased. “Noise” (or variance, to give it its formal name) captures the scatter around the average prediction. A meteorologist who predicts every day that the temperature will have exactly its average value is unbiased—but their predictions can have a lot of noise.
It might interest you to know that the philosopher Arthur Schopenhauer wrote in 1905 that “the amount of noise that anyone can bear undisturbed stands in inverse proportion to his mental capacity and therefore be regarded as a pretty fair measure of it” (Arthur et al., n.d). In a certain respect, you are channeling Schopenhauer and imploring us to increase our mental capacities.
Your primary point is that if the decisions and or predictions are made by a single person, at a single moment, then noise can be (highly) deleterious. We call such scenarios the domain of individual exploitation. You recognize that there will be situations where noise in one’s decisions can be exploited to reduce cost or increase individual freedom. But far more numerous are the situations where noise cannot be exploited and is deleterious. This point is well taken and the community at large should be heedful of developing blind-spots in judgment by exclusive attention to matters of bias.
However, there is another domain in which either a single person is engaged in a time-extended sequence of multiple predictions and decisions, or a set of multiple people are making predictions and decisions together. Let’s call this the domain of collective exploration. In these scenarios, information is derived across an ensemble of individuals and/or moments, and it is rarely a point estimate that is being made but a process that needs to be advanced. In most of these cases, as you most likely would agree, noise is an essential functional requirement. And more often than not, this noisy phase of activity precedes the individual exploitation phase where noise becomes increasingly detrimental.
William James celebrates the benefit of the freedom to explore in his influential paper “The Dilemma of Determinism” where he writes, “Let us not fear to shout it from the house-tops if need be; for we now know that the idea of chance is, at bottom, exactly the same thing as the idea of gift” (James, n.d).
Perhaps the great research challenge moving forward is finding balanced approaches to managing both desirable and undesirables sources of noise. And in order to do so, we need a firmer understanding of collective exploration. As a tentative foray in this direction, we have included a simple figure in order to categorize the space of functional noise in collective systems. We label the vertices of a quadrilateral with problems of Optimization; Computation and Inference; Signal Detection; and Mechanics. In what follows, we provide a very brief introduction to each of these vertices (Figure 1). The four-fold value of functional noise in collective systems. Functional noise is organized into four categories, and several examples are provided for each. We have in mind a taxonomy of ways that noise can be harnessed to benefit a system without which function at the collective level would be rendered impossible.
In complex optimization problems, utility or fitness functions are typically non-isotropic (multi-peaked), degenerate (many equivalent optima), non-convex, and/or dynamically changing. In these problems, a collection of measurements is made, extending over time and/or space, so as to decide on how to move in a suitable configuration space. In the engineering sciences, these problems of how to behave across a collection of times arise in active learning (Settles, 2012), experimental design (Ryan and Morgan, 2007), optimization theory (Rao, 2019), control theory (Doyle et al., 2013), or reinforcement learning more generally (Sutton and Barto, 2018). Similarly, collective processes in the natural world, like mutation-selection (Charlesworth, 1990), and their engineered analogs including genetic algorithms (Mitchell, 1998), and simulated annealing (Van Laarhoven and Aarts, 1987), employ random perturbations in order to discover paths toward viable solutions within a population of agents collectively exploring a space. Once at global equilibrium, as KSS argue, noise becomes a liability. In population genetics, the associated elimination of noise at a fitness maximum is called the general reduction principle (Altenberg and Feldman, 1987).
Modern collective and distributed forms of computation make extensive use of noise. This includes the very fundamental nature of random processes associated with quantum entanglement and quantum computation (Steane, 2004), the need to operate in the critical regime of collective spins in Ising systems so as to ensure high memory capacity in the Hopfield model (Hopfield, n.d), and the requirement that Boltzmann machines operate in regimes of non-zero temperature/noise (Ackley et al., 1985). Noise is also an essential feature of most modern forms of model-based inference, most notably the principle of maximum entropy, where noise is required for the estimation of constrained Bayesian priors (Jaynes, 1982).
And at the level of mechanical and collective molecular processes, randomness is essential for the construction of stochastic machines including Brownian ratchets (Fornés, 2021) that support contractile forces, mechanisms of stochastic self-assembly in distributed agents (Klavins et al., 2006), cell receptor densification (Shimizu and Bray, 2021), and as a system parameter for modulating ultra-sensitivity required for hysteresis supporting molecular memory and switches in populations of enzymes (Smith et al., 2011). And in large populations of cells classified into tissues, such as heart muscle, noise is essential in maintaining adaptability. The loss of noise in cardiac rhythms leads to a fatal determinism associated with congestive heart failure (Goldberger et al., 2002).
More generally, if we consider a multiplicity of individuals, each optimizing a different objective function which involves the choices of the other players as well as herself, then equilibrium might require noise. The simple game of “penny-matching” illustrates the point. Suppose there are two individuals (“players”) each of whom can choose heads or tails. Suppose the first player’s objective is maximized when they make the same choice as their opponent, while the second player’s objective is maximized when they make the opposite choice of their opponent. As Nash emphasized, in his Nobel-prize–winning contribution to game theory, the only equilibrium of this penny-matching game, the only way for the players not to get caught in an endless cycle, is for both players to choose heads randomly, with 50% probability (Binmore and Samuelson, 1997; Camerer, 2003; Myerson, 1991).
Moving clockwise to the final vertex in our figure, in signaling domains, signals are often not detectable until noise is injected into a system. KSS make the point that noise can displace a true signal, but when a signal is hitherto invisible, noise can come to our rescue. This use of noise is called stochastic resonance (Harmer et al., 2002), using distributions of noise to move regular signals above the threshold where it might then be processed. A broad range of related principles to include stochastic focusing (Paulsson et al., 2000) and stochastic amplification (Krakauer and Sasaki, 2002) achieves a similar goal of revealing concealed signals in collectives by adding essential variability. When signals are sparse, it becomes possible to almost perfectly re-construct a putative under-determined system using compressed sensing (Donoho, 2006). A key element of compressed sensing is the use of noisy or random sampling and random wave-forms to more effectively make measurements (Unser and Tafti, n.d).
We are not sure if you remember the story of Ludwig Boltzmann’s 1905 trip to the United States. From this trip, he wrote his celebrated essay, Journey of a German Professor to Eldorado. In the essay, he introduces an informal idea of noise that strongly resonates with yours, “My memory for numbers, usually quite reliable, always has difficulty keeping a record of glasses of beer” (Boltzmann, 1991). There are few ideas more challenging than formalizing the idea of noise and randomness. With or without a beer. But perhaps the most famous definition can be found inscribed on Boltzmann’s melancholic tomb (Villani, n.d) whose epitaph is the definition of entropy
The variable W is the number of micro-states of a system and the key ingredient is the collection of particles that can adopt different configurations in a prescribed state space.
In the 1960s, Solomonoff, Kolmogorov, and Chaitin extended this idea using new ideas of algorithmic complexity (Partovi, 2001), in which randomness is defined in terms of degrees of compressibility: a perfectly random string is one for which a computer program needs to be at least as long as the string. And the length of the program is lower bounded by the entropy of the string. Through the work of Boltzmann, Kolmogorov, and others, we now know that randomness is not only essential to function in many body systems, but that it provides the foundation for the rigorous analysis of complex systems more generally.
As we see it, these relatively recent breakthroughs demonstrate that you are part of a profound tradition of bringing the subject of noise to the foreground of our thinking. We are all are asking how best to model and manage variability in complex systems. And we probably all agree that when it comes to problems of collective dynamics, attending to the theory of noise is a necessary bias!
Yours,
David and David.
