Controlled Hallucination: Three Theories Explaining How the Brain and Life Work
In the past decade, several theories have emerged that distill generations of interdisciplinary scientific experience into accessible frameworks. Perception, cognitive biases, adaptive strategies: all share a common principle.
Consider the famous painting by Belgian artist René Magritte. It depicts a smoking pipe, with the French caption “This is not a pipe.” When you compare the image and the text, you might experience a subtle mental conflict between expectation and perception. This internal “clash with reality” was described by American social psychologist Leon Festinger in 1957 as the “Theory of Cognitive Dissonance.” The musical origin of the term (from Latin dissonantia: disagreement, discord, inconsistency) intuitively hints at the core idea: a sharp, jarring note disrupts the smooth, harmonious process of perceiving reality.
Cognitive dissonance isn’t always a psychological conflict that leads to frustration; it’s more of a spectrum of sensations, from confusion and uncertainty about what to do next to mild puzzlement, like the riddle that the brave soldier Schweik used to baffle forensic doctors: “There’s a four-story building, each floor has eight windows, the roof has two dormer windows and two chimneys, each floor has two tenants. Now tell me, gentlemen, in what year did the doorman’s grandmother die?”
In popular culture, cognitive dissonance is often seen only as psychological conflict, overlooking the second part: the mechanism for resolving this conflict, or reconciling expectations with reality. Festinger’s theory includes not just the stress and discomfort from new, contradictory information, but also the ways we reduce this dissonance.
Where Do Expectations Come From?
Recall the dualistic model of visual perception: sensory stimulation activates processes in the brain. Imagine the complex chain of events, from a photon hitting the light-sensitive cells (rods and cones) in the retina, to the assembly of a complex visual image in the higher brain regions, placed in a specific context. Now scale this up to all available visual stimuli. And that’s just vision, one of many “channels” of incoming information about the world. The mind can’t possibly process the overwhelming flood of sensory signals we receive every second. If perception worked according to this outdated view, life simply wouldn’t exist: it’s impossible to keep up with the ever-changing train of reality. If you can’t keep up, you have to anticipate.
Imagine the brain, locked inside the skull: it sees and hears nothing directly. It just receives a stream of signals and must guess what’s happening outside. In fact, it doesn’t just guess, it must predict, so the body can prepare and react in time.
Predictive Processing Theory
The brain functions as a multi-level prediction machine, where a top-down stream of predictions (what we expect from the world) is constantly compared and adjusted against a bottom-up stream of sensory data (what our senses perceive). The top-down stream is everything we know about the world: our best heuristics (quick, simplified reasoning for efficiency), our prior beliefs and expectations, all our previous experience, from E = mc² to “London is the capital of Great Britain.” The bottom-up stream consists of three parts: exteroception (what’s happening outside the body), interoception (what’s happening inside the body), and proprioception (the position and movement of the body), all combined into a multimodal model. All our knowledge forms the foundation for constructing predictions about what we should feel.
How It Works
The brain generates mental models (called generative models) that predict what the sensory apparatus should receive as input. These predictions are called prior beliefs. Predictive models are layered in a hierarchy, reflecting the brain’s organization from lower to higher, from simple to complex: higher levels send predictions downward, and lower levels send incoming sensory data upward.
If top-down predictions don’t match bottom-up sensory data, a prediction error occurs, and the model either updates its priors or ignores the incoming data as noise, keeping its previous assumptions.
Example
Think about vision. We never see the world as it appears on the retina. First, the image on the retina is inverted (the eye is a camera obscura, and the brain flips the image). Second, it’s blurry at the periphery due to uneven distribution of visual cells. Third, there’s a layer of blood vessels over the retina (inverted retina). Fourth, there’s a blind spot where the optic nerve exits. Plus, our eyes make countless tiny, rapid movements (saccades), “scanning” the space. Yet we enjoy a full-color, three-dimensional, stabilized image, already interpreted. Our brain even predicts light and shadow, as in visual illusions.
How the Brain Uses Bayesian Statistics
A critical parameter for both streams is the level of precision. We care not just about the data, but also its accuracy or probabilistic “weight.” A bottom-up signal like “there’s an elephant in front of you” has high weight; a vague silhouette in the fog has low weight. A top-down prediction that water is probably wet has very high weight; “the Dow Jones Index should drop a couple of points because of rising diaper prices” has very low weight.
Both streams, bottom-up and top-down, constantly interact at every level, and this ongoing probability adjustment can be described using Bayesian statistics. Bayes’ theorem tells us how to revise the probability of an event in light of new evidence, starting from what we already knew. In simple terms: if the shot glass you drank from last night with some shady people smells like acetone, you’ll probably feel bad in the morning.
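For readers who want the formula behind the shot-glass intuition, here is Bayes’ theorem in its usual textbook form. The symbols H (a hypothesis, such as “I’ll feel bad in the morning”) and D (the data, such as “the glass smells like acetone”) are introduced here for illustration and don’t appear in the original text.

```latex
P(H \mid D) = \frac{P(D \mid H)\, P(H)}{P(D)}
```

Here P(H) is the prior, P(D | H) is the likelihood of the data if the hypothesis is true, and P(H | D) is the posterior: the updated belief after the evidence arrives.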
In a graph of Bayesian inference with Gaussian distributions, Expectation is our prior, Reality is the actual data, and Estimate is our perception, a compromise between the two. The X-axis is any parameter we’re trying to predict; the Y-axis is the probability of each value. Uncertainty is the spread of the expectation; Noise is the spread of the incoming data. The process:
- There’s an expectation (prior), whose precision depends on uncertainty.
- There’s sensory input (likelihood), or reality, whose precision depends on noise.
- Between expectation and reality is what we perceive: the posterior. We adjust our prior based on the new signal and get a posterior probability.
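The three steps above can be sketched in a few lines of Python. This is a minimal illustration, assuming both the expectation and the sensory input are Gaussian; the function name `fuse_gaussian` and all the numbers are made up for the example. Precision is taken to be the inverse of variance, so the estimate ends up closer to whichever side is more precise.

```python
def fuse_gaussian(prior_mean, prior_var, obs_mean, obs_var):
    """Combine a Gaussian expectation with a Gaussian observation.

    Precision is 1 / variance. The posterior mean is the
    precision-weighted average of expectation and sensory input.
    """
    prior_precision = 1.0 / prior_var   # low uncertainty -> high precision
    obs_precision = 1.0 / obs_var       # low noise -> high precision
    post_precision = prior_precision + obs_precision
    post_mean = (prior_precision * prior_mean
                 + obs_precision * obs_mean) / post_precision
    return post_mean, 1.0 / post_precision


# Hypothetical numbers: we expect 10, the senses report 20.
confident_prior = fuse_gaussian(10.0, 1.0, 20.0, 4.0)   # sharp expectation, noisy data
confident_senses = fuse_gaussian(10.0, 4.0, 20.0, 1.0)  # vague expectation, clean data

print(confident_prior)
print(confident_senses)
```

With a sharp expectation and noisy data the estimate stays near the expectation (12); with a vague expectation and clean data it moves toward the data (18). In both cases the posterior is more precise than either input alone.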
A Simple Example
You decide to skip work, assuming your boss is on a business trip and won’t notice. That’s your prior. The accuracy of your prediction depends on uncertainty: are you sure he left? Did anything change? Where’s the info from? The less you know, the higher the uncertainty, the less accurate the prediction.
You start gathering information: you ask colleagues, managers, even check his flight online. The accuracy of this data depends on noise: did you hear it in the break room (low precision), from your project manager (medium precision), or from his assistant who bought his tickets and saw him off (high precision)?
Your final decision is the posterior, a balance between your prediction and what you learned. If your predictions and data were accurate, your unsanctioned day off goes unnoticed. Prediction error is small. But if you relied on vague assumptions and random data, your prediction fails, the boss just stepped out, your “trusted sources” rat you out, and you get in trouble. Next time, you’ll analyze more carefully and update your beliefs accordingly.
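To put rough numbers on the story, here is a small sketch of a single Bayesian update for the question “is the boss away?”. Everything in it is an assumption made for illustration: the 50/50 prior, the function name `update`, and the reliability figures for each source (how often that source says “he’s away” when he really is, and when he isn’t).

```python
def update(prior, p_report_if_away, p_report_if_present):
    """Posterior probability that the boss is away after one report saying so."""
    evidence = (p_report_if_away * prior
                + p_report_if_present * (1.0 - prior))
    return p_report_if_away * prior / evidence


prior = 0.5  # hypothetical: before asking anyone, it's a coin flip

# Hypothetical reliabilities: (P(report | away), P(report | present))
sources = {
    "break room rumor": (0.6, 0.4),    # low precision
    "project manager": (0.8, 0.2),     # medium precision
    "assistant": (0.95, 0.05),         # high precision
}

posteriors = {name: update(prior, *lik) for name, lik in sources.items()}

for name, p in posteriors.items():
    print(f"{name}: P(boss is away) = {p:.2f}")
```

Under these assumed numbers, the rumor barely moves the belief (about 0.6), the project manager moves it further (about 0.8), and the assistant makes skipping work a much safer bet (about 0.95).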
Surfing Uncertainty
Now that we’ve covered the “Bayesian brain hypothesis,” let’s expand our understanding of how predictions interact with incoming data. There are three scenarios:
- If predictions roughly match sensory data, everything is calm: predictions come true, and all is well.
- If low-precision sensory data contradicts high-level predictions, Bayesian math may decide the predictions are correct and the data is faulty. Lower levels “fit the data” to the prediction, and higher levels stick to their expectations.
- If high-precision sensory data conflicts with predictions, Bayesian math concludes the predictions are wrong. The involved neurons signal “Alert! Something’s off!” The greater the mismatch and the higher the data’s precision, the bigger the surprise and the louder the internal alarm.
Each level’s main task is to minimize surprise. Ideally, the brain predicts the world so well that surprises are rare, because each surprise triggers a flurry of activity to update the generative model until calm is restored. All this happens in fractions of a second. Lower levels bombard higher levels with data, which adjust their hypotheses and send predictions back down. After countless cycles, everything is more or less predicted and expected, until the next crisis.
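The three scenarios and the “countless cycles” of adjustment can be caricatured in code. This is a toy, single-level sketch and not a model of real neurons: the names `settle` and `alarm`, the step size, and all the numbers are assumptions made for illustration. The estimate starts at the prediction and is nudged, cycle after cycle, by two precision-weighted errors until they balance; `alarm` is a rough stand-in for surprise, the squared mismatch weighted by the data’s precision.

```python
def settle(prior_mean, prior_var, obs, obs_var, rate=0.05, steps=500):
    """Iteratively adjust an estimate until the weighted errors balance out."""
    estimate = prior_mean  # start from the top-down prediction
    for _ in range(steps):
        sensory_error = (obs - estimate) / obs_var         # bottom-up pull, weighted by data precision
        prior_error = (prior_mean - estimate) / prior_var  # top-down pull back toward the prediction
        estimate += rate * (sensory_error + prior_error)
    return estimate


def alarm(prior_mean, obs, obs_var):
    """Precision-weighted squared mismatch before any updating."""
    return (obs - prior_mean) ** 2 / obs_var


# Hypothetical scenarios as (observation, observation variance).
# The prediction is 10 with variance 1 in every case.
scenarios = {
    "match": (10.2, 1.0),
    "vague contradiction": (20.0, 100.0),
    "sharp contradiction": (20.0, 0.1),
}

results = {
    name: (settle(10.0, 1.0, obs, var), alarm(10.0, obs, var))
    for name, (obs, var) in scenarios.items()
}

for name, (estimate, level) in results.items():
    print(f"{name}: estimate = {estimate:.2f}, alarm = {level:.2f}")
```

With these numbers, the estimate stays close to 10 in the first two scenarios and jumps to about 19 in the third, where the alarm level is also by far the highest.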
Andy Clark, in his book Surfing Uncertainty, compared this predictive process to surfing: “To act quickly and flexibly in an unstable and noisy world, the brain must become a master of prediction, riding the waves of noisy and ambiguous sensory stimulation, trying to stay ahead. An experienced surfer stays in the ‘pocket’: close, but just ahead of where the wave breaks. The wave carries you, but doesn’t catch you. The brain’s task is the same. By constantly trying to predict incoming sensory signals, we can learn about the world, think, and act in it.”
The result is perception, which predictive processing theory calls a “controlled hallucination.” We don’t perceive the world as it is, but our predictions about it, corrected by incoming data. As Anil Seth said in his TED talk, it’s “our brain’s best guess.”
Active Inference
We’ve explored the leading theory of brain function, predictive processing, to understand where our expectations come from. Now we can grasp what’s meant by the “Bayesian brain.” After the work-skipping and factory fire examples, the following diagram should be clear. It condenses predictive processing to show which processes happen in the brain and which happen outside. The brain builds an internal model of the world, makes predictions, compares them to incoming information, updates its worldview, and the cycle repeats. Note the background color: everything on beige is the external environment, everything on white is internal. Sensory data and actions are at the boundary.
Let’s look at another diagram. It’s almost the same: world model, expectations/forecast, prediction, prediction error, model update. Forecasting is just another word for “prediction.” Here, we add a boundary between the system (internal) and the external world, shown as a dashed line. All the processes we’ve discussed happen inside the system, while actions and sensory data are at the boundary with the outside world.
To simplify further: Sensory states are sensations, our sensory input. Active states are actions or behavior. Internal states are what results from all these processes inside the system. External states are the states of the surrounding world, our environment.
The states of the world (S) determine our sensory states (o), which, after internal processing, become our internal states (s), which determine our active states (a), which change the world, closing the causal loop. This is called active inference and is essentially how autonomous agents function in a dynamic environment.
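The causal loop can be illustrated with a deliberately simple simulation. This is a toy sketch under stated assumptions, not Friston’s formal scheme: the “world” is a single number (think of a temperature), the agent expects to sense a preferred value, and the function name `run_loop`, the gains 0.5 and 0.2, and the noise level are all invented for the example.

```python
import random


def run_loop(steps=200, preferred=37.0, seed=0):
    """Toy perception-action loop: S -> o -> s -> a -> S."""
    rng = random.Random(seed)   # seeded so the run is repeatable
    world = 30.0                # S: external state, starts far from the preferred value
    belief = preferred          # s: internal state, starts at what the agent expects
    for _ in range(steps):
        observation = world + rng.gauss(0.0, 0.1)  # o: noisy sensory state
        belief += 0.5 * (observation - belief)     # perception: update the belief toward the data
        action = 0.2 * (preferred - belief)        # a: act to bring sensations back to the expectation
        world += action                            # the action changes the world, closing the loop
    return world, belief


final_world, final_belief = run_loop()
print(f"world = {final_world:.2f}, belief = {final_belief:.2f}")
```

The agent reduces the gap between expectation and sensation in two ways at once: by revising its belief and by acting on the world. Over the run, the external state is pulled toward the value the agent expects to sense.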
We Are All Markov Blankets
The term “Markov blanket” was coined by Israeli-American scientist and philosopher Judea Pearl, who works on probabilistic approaches to AI and Bayesian networks. Andrey Markov (senior), whose name it bears, was a pioneer in the study of stochastic (random) processes and probability theory, giving us Markov chains and Markov processes. His son, also Andrey Markov (junior), was an equally outstanding mathematician, known for his work on constructive mathematics and the theory of algorithms.
The Markov “blanket,” or “boundary,” is a concept that goes far beyond consciousness and neuroscience; it’s even more fundamental. Absolutely anything that exists has a Markov blanket. Without it, you couldn’t draw a boundary between something and everything else. If something doesn’t have a Markov blanket, it simply doesn’t exist. Everything in our world is a Markov blanket, “nested” within other Markov blankets, as far as scaling allows.
“If the Markov blanket is minimal, meaning it can’t drop any variable without losing information, it’s called a Markov boundary.” This is the very boundary where we end and the world begins, and vice versa.
Without any of these components (sensory, internal, or active states) we wouldn’t exist as autonomous subjects. Our Markov boundary protects us from the causal complexity of the world.
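In Pearl’s Bayesian networks the idea has a concrete definition: the Markov blanket of a node consists of its parents, its children, and the other parents of its children. The sketch below computes it for a small graph. The graph itself is an assumption made for illustration: it unrolls one turn of the loop described earlier (external, sensory, internal, active, and the next external state), and the node names are hypothetical.

```python
def markov_blanket(node, parents):
    """Return the Markov blanket of `node` in a directed acyclic graph.

    `parents` maps each node to the set of its direct parents. The blanket
    is the node's parents, its children, and its children's other parents.
    """
    children = {child for child, ps in parents.items() if node in ps}
    co_parents = {p for child in children for p in parents[child]}
    return (set(parents.get(node, ())) | children | co_parents) - {node}


# Hypothetical graph: one turn of the perception-action loop, unrolled in time.
parents = {
    "external": set(),
    "sensory": {"external"},
    "internal": {"sensory"},
    "active": {"internal"},
    "external_next": {"external", "active"},
}

blanket = markov_blanket("internal", parents)
print(sorted(blanket))  # prints ['active', 'sensory']
```

In this toy graph the internal states touch the world only through sensory and active states, which is exactly the boundary the text describes.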
The Free Energy Principle
What are all living organisms doing in this chaotic, unpredictable, and, most importantly, non-equilibrium world? First and foremost, simply existing: maintaining their boundaries that separate them from the environment and preserve some internal structure and processes. To do this, they must perceive the world (Bayesian math), represent it internally (a generative model), predict (hierarchical predictive processing), and act (active inference) to update their internal model.
We “probe” the world through active inference, create its internal model via predictive processing, and update this model (learn) using Bayes’ theorem. The final, perhaps key, element: all these processes can be reduced to optimizing a single parameter, the difference between expectation and reality. All our complex adaptive strategies boil down to reducing uncertainty. This parameter is called variational free energy.
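For the mathematically curious, variational free energy is usually written as follows. This is the standard form from variational inference, given here as background rather than as something stated in the text above: S stands for the states of the world and o for sensory states, as before, while q (the organism’s internal belief about the world) and p (its generative model) are symbols introduced here for illustration.

```latex
F = \mathbb{E}_{q(S)}\big[\ln q(S) - \ln p(o, S)\big]
  = D_{\mathrm{KL}}\big[\,q(S)\,\|\,p(S \mid o)\,\big] - \ln p(o)
```

Because the KL divergence is never negative, F is always at least as large as the surprise, -ln p(o). Lowering F therefore does two things at once: it brings the internal belief closer to the true posterior, and it puts a ceiling on how surprising the sensations can be.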
In essence, we’ve just encountered the Free Energy Principle, which some now consider as explanatory as the theory of evolution by natural selection.
Karl Friston, the author of the Free Energy Principle and a leading contributor to predictive processing theory, has a citation index higher than Einstein’s, with 1,200+ scientific publications. Anyone even slightly familiar with his work is left with the impression that he’s incredibly brilliant.
It’s hard not to appreciate the elegance of the idea that all living things are generators of predictions about the states of the world, engaged in self-maintenance and self-organization by separating themselves from the environment and minimizing their prediction errors.
Drawing Parallels and Conclusions
Festinger’s theory of cognitive dissonance, describing the conflict between expectations and reality and the mechanisms for resolving it, was a precursor to newer, more complex, and far-reaching theories. Starting with explanations of mental processes, they evolved to address the very essence of adaptive strategies in all living things. A good theory is like a prism: it lets us see what’s hidden from the naked eye. The world we perceive is a generative model, built on our brain’s guesses about what’s happening outside: a controlled hallucination. We can’t escape this fact, but we can listen more closely to what our senses tell us and not be afraid to update and complicate our worldview. Only those who never learn or try new things avoid making mistakes.