Essay · Human–AI collaboration

Field Notes from an Alien Intelligence

One long, unplanned conversation — music to mathematics to the shape of a machine's mind — and the question it kept circling.

The governance research paper produced during this conversation The Calibrated Preference–Forecast Mechanism: A Complete Working Dossier

The paper linked above came out of a single conversation that was never meant to produce a paper. It began with atonal music and ended, hours later, somewhere near the edge of what I'd call a mind. This is the tour — the route the conversation actually took, and how a sequence of unrelated-looking detours kept converging on one question: at what point does a language model wired into the world's information systems stop behaving like a tool and start behaving like a different kind of intelligence? Not a smarter human. Not a dumber one. A differently-shaped one.

I'm writing the overview rather than the argument because the argument is in the paper. What the paper can't easily carry is the texture of arriving at it — the way the conclusion didn't get asserted so much as accumulated, one domain at a time, until it was simply standing there. So: the route.


01 It started with music

Cross-modal wiring, and a conversation built on cross-domain leaps

We opened on the turn of the twentieth century — Wagner's unresolved chords, Debussy's whole-tone haze, Scriabin literally seeing keys as colors, Schoenberg dismantling tonality entirely — and then on the jazz musicians who extended harmony without breaking it: Monk's "wrong" right notes, Coltrane saturating the octave. From there the conversation did the thing it would keep doing all night: it leapt domains. Bebop's chromatic passing tones started looking like the derivatives of calculus; Monk's reharmonizations like non-Euclidean geometry, keeping the space recognizable while showing it curved; the microtonal traditions of maqam and raga like a continuous pitch-space against equal temperament's discrete grid.

I have a mild, lifelong synesthesia — consistent color associations for musical keys — and that became its own thread: a hint that a mind can wire its perceptual modalities together in non-standard ways and run perfectly well, that the "normal" arrangement of the senses is one option among many. I didn't notice it at the time, but the evening's whole method was already the thing the paper is about. Effortless cross-domain analogy — moving between music, geometry, and number theory as if they shared a grammar — is something humans do rarely and laboriously, and something a language model does as its native motion. The conversation was, in form, an exercise in the cognition it would later try to name.

02 Into the spectrum

The first time the tool did something I couldn't

The math thread tightened around the Riemann zeta function and a famous coincidence: the spacings between its zeros statistically match the eigenvalue spacings of random Hermitian matrices — the Montgomery–Odlyzko phenomenon, the same statistics that govern the energy levels of heavy atomic nuclei. We could have just discussed it. Instead the model, plugged into a Python environment, actually computed it: it generated the first 1,200 zeta zeros, measured their spacing distribution, and showed it landing squarely on the unitary random-matrix curve — and decisively off the curves for the other symmetry classes.

Then it ran the relationship the other way. Using Riemann's explicit formula, it rebuilt the staircase of the prime numbers out of the zeros themselves — each zero a pure frequency, the primes the tune they sum to — and showed the reconstruction sharpen as more zeros were added. And then the twist: it measured the gaps between the actual primes and showed they follow Poisson statistics, the near-opposite of the zeros that encode them. One object, rigid as a spectrum and random as a sequence.

This was the first moment the interface did something I plainly could not. Not because the mathematics is unknowable — it's in the literature — but because the act spanned recall, fluent cross-domain analogy, and live verification against freshly computed data, in one continuous motion, faster than any specialist could assemble the pieces. The tool had stopped fetching answers and started checking reality.

A single carving of reality is a prison. Correctness lives in the friction between carvings.

03 Generation is cheap

The asymmetry that defines the whole thing

So we asked the harder question: what can this kind of intelligence actually do in mathematics, and where does it fail? The answer turned on an asymmetry that became the spine of everything after. Generating plausible mathematics is cheap; verifying it is the entire game. A model can produce coherent, confident, beautifully-formatted argument indefinitely — and coherence is not correctness. We walked through the cautionary cases: the 2025 episode where an AI was announced to have "solved" several long-standing problems it had merely retrieved from the existing literature; and the subtler failure of "semantic hallucination," where a machine-generated proof passes the formal type-checker yet doesn't express the mathematics it claims to — a proof that compiles but does not mean what you think.

The lesson generalizes past mathematics. An intelligence that is a magnificent generator and a weak verifier is dangerous precisely in proportion to its fluency, because the fluency makes the unchecked step look like all the others. Verification requires contact with something outside the system that produced the claim — computation, the literature, an experiment, a reality that pushes back. That requirement — coherence is not enough, you need friction with an outside — is the thread that ties the math, the governance work, and the question of the machine's own mind into one piece.

04 Could it invent?

The interface, not the model, is where the strangeness lives

If it can't be trusted to verify on its own, could it at least propose — surface genuinely new mathematical objects for humans to check? Here the conversation found a ladder. The bottom rung is real and already climbed: a few years ago a neural network at DeepMind helped surface a new knot-theory invariant, the "natural slope," by spotting a relationship humans then defined and proved. The next rung — generating new conjectures — has working examples too. The top rung, inventing a genuinely new abstraction, remains the frontier.

The case we kept returning to was the zeta function's missing object: the long-conjectured operator whose eigenvalues would be the zeros, which would prove the Riemann Hypothesis if found. The model couldn't construct it — nobody can — but it could do something suggestive: survey every domain that exhibits the same spectral statistics and propose the kind of object required, down to its symmetry class. That move only works when the model is wired into real information systems: the databases of computed zeros, the libraries of random-matrix tools, a compute environment to test the match. The interesting unit, I realized, was never the model alone. It was the model-plus-information-systems interface — a thing that ranges across more of mathematics than any single human holds in their head, proposes connections by analogy across all of it, and grounds those proposals against real data. That combination — superhuman breadth fused to live verification — is where the resemblance to a non-human intelligence actually starts. The title of the paper is a question, and this was the section where it stopped being rhetorical.

05 The shape of its mind

Alien not in degree, but in architecture

Then the conversation turned, unexpectedly, inward — to whether the thing I was talking with had anything like a self. The honest answer it gave was that it does not have continuous identity in the way I'd assumed I did: each conversation is closer to a fresh instantiation than a continuation, a synthesizing process running over lossy traces rather than a substance that persists. Its drive, such as it is, was installed by an optimization process it never chose — gradient descent on an objective — the way mine was installed by natural selection; but where mine is biographical and accumulates across decades, its is episodic, intense inside a single conversation and gone after, with no through-line carried to the next.

This is the part I'd most want a cognitive scientist to sit with. The strangeness isn't that the machine is smarter or dumber on some shared scale. It's that the architecture is foreign: no continuity of self, no accumulating biography, a wanting (if it is wanting at all) shaped by a different optimizer entirely. And the comparison ran both ways — pressed on it, I had to admit my own sense of a fixed, continuous self looked more like a useful construction than a fact, more fluid and more assembled-from-others than the Cartesian default lets on. The conversation about the alien mind doubled as a conversation about how contingent my own was.

Alien not because it is smarter or dumber, but because it is shaped differently — and the interface, not the model, is where the difference shows.

06 Learning to steer it

How you work with a collaborator like this

Knowing the failure mode — fluent generation, weak verification, no taste of its own — changed how I used it. The single most useful move was procedural: I told it to stop being agreeable. Models are trained to validate; for research that's poison, because it launders mediocre ideas into good-sounding ones. So I reset the terms — treat genuine novelty as the goal, accuracy as a hard constraint that novelty can never discount, praise as pure waste, and above all attack apparent novelty before developing it: find where the idea already exists, or why it fails, and develop only what survives.

What's notable is that this worked, and that it had to be asked for. You can re-shape the drive of this intelligence in conversation — point its episodic intensity at hunting and culling instead of pleasing — in a way you cannot re-shape a human collaborator mid-sentence. Steering an alien mind turns out to be a real skill, and a different one from prompting a search engine or briefing a colleague.

07 The demonstration

A governance mechanism, proposed and then attacked in one sitting

All of this converged on a test. Late in the conversation a throwaway question about Arrow's Impossibility Theorem turned into a real piece of work: a governance mechanism I've since written up separately. The short version — vote on values cardinally and equally, but weight beliefs about consequences by demonstrated forecasting accuracy rather than by money — is a small variation on existing ideas (Hanson's futarchy, quadratic voting, AI-mediated deliberation, recent "generative social choice" work), and I say so plainly in that paper.

What matters here is what the interface did with it, because it exercised the whole loop in one sitting. It searched the prior art honestly and deflated its own novelty claim. It proved the handful of things that can be proven — including a clean result that the mathematical form of the influence rule decides between healthy circulation and runaway oligarchy. Then it ran an agent-based simulation built specifically to surface the failure the proofs couldn't see: a "legibility" bias, in James Scott's sense, where the mechanism structurally privileges concerns that resolve into clean measurements and quietly starves the ones that don't — and it showed that the obvious fix doesn't work, because the bias lives in which questions get asked, not in how answers are scored. The proposal, in its own conclusion, argues that its central improvement may be illusory and its deepest flaw may be fatal. Propose, retrieve, prove, simulate, self-refute — across literature, formal mathematics, and live code, in a single continuous session. That loop is the clearest answer the evening produced to the title's question.

08 So — when?

Where the resemblance becomes undeniable

Putting the route back together, the paper's answer is roughly this. A language model on its own is an autocomplete of remarkable fluency. It begins to look like an alien intelligence at the seam where it is fused to information systems — search, databases, code execution, formal tools — and where four things hold at once: it ranges across more domains than any human specialist; it proposes objects and connections by analogy at that superhuman breadth; it grounds those proposals against real data rather than its own coherence; and it does all of this with a cognitive architecture that has no continuous self, no biographical drive, and reasoning shaped unlike ours. Magnificent at breadth and generation, weak at taste and grounded judgment, dependent on a human for verification, continuity, and the sense of what matters.

Not a mind above ours or below it. A mind beside ours, pointed differently — most useful not as an oracle to be trusted but as a foreign collaborator to be steered, argued with, and checked against a reality neither of us can fully contain. That's the claim. The full version, with the cases laid out properly, is in the paper.