For most of the last century, translation was understood as a problem of substitution. A word in one language was assumed to have a fixed meaning, and the translator's task was to find a faithful equivalent in another. Dictionaries enshrined this view, and early machine translation inherited it almost without question.
The intuition is comforting. It assumes that meaning is a stable property of words, sitting inside them like cargo waiting to be unloaded into a different container. Move the cargo carefully, and the meaning arrives intact. The trouble is that no working translator has ever experienced the craft this way. The intuition does not match the practice. And as we now build large multilingual systems on top of that flawed intuition, the cracks are no longer academic — they are operational.
The substitution illusion
Substitution makes a hidden assumption: that a word's meaning is independent of the sentence around it, the speaker who utters it, the reader who receives it, and the moment in which it is read. We know none of these are true. The Arabic word عين can mean an eye, a spring of water, an essence, a spy, or the letter itself, depending on context. The English word set has more than four hundred recorded senses. Bank in a financial document is not the same object as bank beside a river, and a translator who stops at substitution will eventually mistake one for the other.
The deeper problem is that substitution treats meaning as local — a property of the word itself. But meaning, in any natural language, is overwhelmingly non-local. It depends on what came before, what comes after, what the writer assumed the reader already knew, and what cultural frames are silently active in the room. None of that is inside the word.
Superposition: a word, many states
This is where the analogy to quantum information becomes useful — not as a metaphor for marketing copy, but as an honest description of how meaning behaves before a translator commits to a choice.
In quantum mechanics, a particle before measurement does not have a single definite state. It exists in a superposition of possibilities, weighted by probability. Only when an observation occurs does the system "collapse" into one outcome. Words in context behave with disconcerting similarity. Read in isolation, a noun like state holds dozens of plausible senses at once: a country, a condition, a mode of being, a formal declaration, a U.S. administrative unit. Each is live. None has been ruled out.
Meaning is not a property a word carries. It is a property a sentence resolves to, after every constraint in the surrounding text has had its say.
— from the working notes of the Tarjama linguistics groupThe translator does not select among these meanings consciously, the way one picks an item from a menu. They feel the constraints accumulate — register, audience, the verb that governs the noun, the preposition that follows it, the document type, the brand voice — until only one or two readings remain plausible. The act of choosing the target word is the act of collapse.
Observation collapses meaning
If superposition is the first lesson, the second is harder: the act of translating changes the system. A faithful translation is not a passive readout of an invariant signal. It is an intervention. Every translator silently decides what to preserve, what to compress, and what to expand. They make these decisions because the target language has different load-bearing structures than the source — different ways of marking politeness, time, certainty, and authority.
An Arabic legal clause that uses the dual form, the passive voice, and an embedded relative clause cannot be rendered into clean English without redistributing that information. Some of it goes into word choice, some into punctuation, some into a footnote, some is left implicit on the assumption that the new reader will infer it. The translator chooses where the meaning lands. There is no neutral position from which to observe the source text without altering its weight in the target.
This is why two excellent translators, working independently and in good faith, will produce different renditions of the same paragraph. Both can be correct. The disagreement is not a defect — it is evidence that the underlying object has more than one valid resolution.
Entanglement across the page
The third behaviour worth borrowing from physics is entanglement. In quantum systems, two particles can become correlated such that measuring one immediately constrains the state of the other, regardless of how separated they are. Long-form documents are entangled in exactly this sense.
A choice made on page one — translating compliance as الامتثال rather than التقيد, for example — quietly constrains every subsequent occurrence of that concept across the document. Pick the wrong terminology in the abstract of a forty-page report and you will pay for it on every page that follows. The decisions are not independent. They form a web, and the web tightens as you go.
This is why glossaries, style guides, and translation memories matter so much in enterprise localization. They are not bureaucratic overhead. They are the formal mechanism by which a team enforces internal consistency across a system whose parts are entangled by definition.
What this means for AI translation
None of this is a critique of machine translation. Modern neural systems already operate, mathematically, on something close to a superposition view: every token is a vector that encodes many overlapping senses, and the decoder commits to one only after weighing the surrounding context. In that respect, the architecture has caught up with the linguistics.
The remaining gap is governance. A model can resolve a word in context, but it does not know which resolution your organization considers canonical, which terms are protected for legal reasons, or which phrasings have already been negotiated with regulators. A model collapses meaning according to its training distribution. An enterprise needs meaning to collapse according to its own house rules.
That is why we believe the future of multilingual operations is not a contest between human and machine. It is a layered system in which:
- Models handle the heavy lifting of contextual resolution at speed and scale.
- Glossaries, style guides, and termbases act as the boundary conditions that keep collapse consistent.
- Linguists set policy, audit edge cases, and own the judgements no model can be held accountable for.
- Workflow tools record every decision so the same word never has to be re-resolved twice.
Toward a physics of meaning
The point of the analogy is not to make translation sound mysterious. It is the opposite: to give the field a vocabulary precise enough to describe what the best translators have always known and what the best systems are now beginning to formalize. Meaning is not a thing you transfer. It is a state that a careful reader, human or otherwise, resolves out of a field of possibilities, under the influence of every constraint within reach.
Treat translation as substitution and you will keep building tools that fail predictably at the edges. Treat it as a resolution problem under uncertainty — closer to inference than to lookup — and the tooling, the workflows, and the standards all start to look different. Better, in our view. And, finally, honest about the work.