Octonions and the Standard Model

16 June, 2026

Paul Schwahn and I have come out with a new paper about octonions and the Standard Model:

The Standard Model gauge group from the exceptional Jordan algebra

It builds on things I’ve discussed here, but it goes further. Let me explain a bit.

A bit is just a binary alternative: 1 or 0, true or false. That’s how it works in classical logic. We could also have a ‘trit’, meaning 3 alternatives.

In quantum physics we instead have qubits and qutrits.

Qubits and qutrits are usually described using complex numbers. The algebra of observables of a qubit is the Jordan algebra \mathfrak{h}_2(\mathbb{C}), consisting of 2 \times 2 self-adjoint complex matrices. Similarly, the algebra of observables of an qutrit is the Jordan algebra \mathfrak{h}_3(\mathbb{C}), consisting of 3 \times 3 self-adjoint complex matrices.

We can also study systems with more than 3 alternative ways to be. They work the same way, using the Jordan algebras \mathfrak{h}_n(\mathbb{C}) with n > 3.

But we can also do quantum mechanics using other number systems! The options have been mapped out, and the largest allowed number system for this purpose is the algebra of octonions.

A weird thing is that Jordan algebras built using octonions can describe qutrits, but not quantum systems with more than 3 alternative ways to be. The algebra of observables of an octonionic qutrit is the so-called ‘exceptional’ Jordan algebra \mathfrak{h}_3(\mathbb{O}), consisting of 3 \times 3 self-adjoint octonion matrices. What makes it exceptional is that \mathfrak{h}_n(\mathbb{O}) is not a Jordan algebra when n is bigger than 3.

So, there’s something special about octonionic qutrits—and it turns out that every symmetry in the gauge group of the Standard Model is a symmetry of an octonionic qutrit!

Not every symmetry of an octonionic qutrit is a symmetry of the Standard Model. But those that do have a simple description. They are those that restrict to give symmetries of an ordinary qutrit sitting inside the octonionic qutrit… and an ordinary qubit sitting inside that!

That sounds exciting, but also vague, so let me make it precise.

While lots of people say the gauge group of the Standard Model of particle physics is \text{U}(1) \times \text{SU}(2) \times \text{SU}(3), in fact a certain subgroup of this acts trivially on all known particles. If we mod out by that, we’re left with a group called \text{S}(\text{U}(2) \times \text{U}(3)), which is

\Big\{ x \in \text{SU}(5) : x =   \left(   \begin{array}{c c c c c}  \ast & \ast & 0 & 0 & 0 \\  \ast & \ast & 0 & 0 & 0 \\  0 & 0 & \ast & \ast & \ast \\  0 & 0 & \ast & \ast & \ast \\  0 & 0 & \ast & \ast & \ast   \end{array}  \right) \; \Big\}.

and this is the group I’m talking about.

We proved two theorems describing this group in terms of the symmetries of an octonionic qutrit. The group of automorphisms of the exceptional Jordan algebra \mathfrak{h}_3(\mathbb{O}) is a 52-dimensional Lie group known affectionately as \text{F}_4—so that’s what I mean by the symmetries of an octonionic qutrit.

Here’s our main result:

Theorem 1. Suppose X,B are Jordan subalgebras of \mathfrak{h}_3(\mathbb{O}) such that

X \cong \mathfrak{h}_2(\mathbb{C}), \;\; B \cong \mathfrak{h}_3(\mathbb{C}), \;\; X \subset B.

Then

\text{Stab}(X) \cap \text{Stab}(B)_0 \cong \text{S}(\text{U}(2) \times \text{U}(3)).

Here \text{Stab}(X) is the stabilizer of X—that is, the subgroup of \text{F}_4 consisting of elements that map X to itself—while \text{Stab}(B)_0 is the identity component of the stabilizer of B.

This ‘identity component’ business is rather sneaky, but it turns out that guys in \text{Stab}(B)_0 are symmetries of an ordinary qutrit that can be described as unitary operators on \mathbb{C}, while \text{Stab}(B) also contains those symmetries that are described by antiunitary operators. The CPT symmetry of the Standard Model is antiunitary, for example.

Theorem 1 emerged from a related result, which grew out of the work of Todorov and Dubois-Violette:

Theorem 2. Suppose A,B are Jordan subalgebras of \mathfrak{h}_3(\mathbb{O}) such that

A \cong \mathfrak{h}_2(\mathbb{O}), \;\; B \cong \mathfrak{h}_3(\mathbb{C}), \;\; A \cap B \cong \mathfrak{h}_2(\mathbb{C}).

Then

\text{Stab}(A) \cap \text{Stab}(B)_0 \cong \text{S}(\text{U}(2) \times \text{U}(3)).

Todorov and Dubois–Violette proved this for a certain standard choice of subalgebras A and B. Thus, the challenge in proving Theorem 2 was to show that every other choice can be mapped to this standard choice using the action of \text{F}_4. This shows that the theorem is not an artifact of a specific choice, but rather a general fact.

How do we prove these results?

We start by constructing the octonion product from \text{SU}(3)-invariant operations on \mathbb{C} and \mathbb{C}^3. We then use this description to reprove Todorov and Dubois–Violette’s special case of Theorem 2. Then we show that \text{F}_4 acts transitively on the set of subalgebras of \mathfrak{h}_3(\mathbb{O}) that are isomorphic to \mathfrak{h}_3(\mathbb{C}). We also show every Jordan subalgebra of \mathfrak{h}_3(\mathbb{O}) isomorphic to \mathfrak{h}_2(\mathbb{C}) is contained in a unique Jordan subalgebra isomorphic to \mathfrak{h}_2(\mathbb{O}). This lets us prove that \text{F}_4 acts transitively on the set of pairs of Jordan subalgebra A, B \subset \mathfrak{h}_3(\mathbb{O}) with A \cong \mathfrak{h}_2(\mathbb{O}), B \cong \mathfrak{h}_3(\mathbb{C}) and A \cap B \cong \mathfrak{h}_3(\mathbb{C}). Theorem 2 then follows from Todorov and Dubois-Violette’s special case. We conclude by using these results to prove Theorem 1.

However, if you want to get into the details of the physics, the interesting part is how the strong force gauge group \text{SU}(3) and the electroweak \text{S}(\text{U}(1) \times \text{U}(2)) show up from the relation between octonionic qutrits, complex qutrits and complex qubits. You’ll see that in the proof of Lemma 4.

And if you want to get into the details of the math, the main interesting thing here is the use of Jordan algebra technology like ‘Peirce decompositions’ and ‘Jordan frames’ to figure out what it must be like when you have a Jordan algebra \mathfrak{h}_2(\mathbb{L}) or \mathfrak{h}_3(\mathbb{L}) sitting inside \mathfrak{h}_3(\mathbb{K}), where \mathbb{L} is some normed division algebra contained in a bigger normed division algebra \mathbb{K}.

What it all ‘really means’, if anything, is a question for later. It could be just a coincidence. Of course I hope not.


Interview with Micah Zarin

2 June, 2026

I’m not completely happy with this interview with Micah Zarin. It was nothing he did, it was me. I forgot to say that current-day AI wastes a lot of energy, and companies hope to use it to lay off people, and oligarchs are using it to extract lots of money from everyone. While obvious, these things are tremendously important and I should have emphasized them.

I was distracted by Micah’s fear that AI would make a career in math pointless, which really surprised me. So instead of giving my general thoughts on AI, I focused on putting myself in his place and imagining what to do in that situation. I suggested doing math with the help of AI as a way to overcome his fear and go ahead doing math while keeping abreast of new developments. If AI overtakes humans in math in his lifetime, which is far from certain, this could be a way to keep productively participating in math throughout this process. But I warned him to be very critical of what LLMs say, to lessen the danger of getting caught up in the ‘AI vortex’ that is turning many people into crackpots.

Mathematics, in case you haven’t been paying attention, is different from some other subjects because it’s an area where LLMs have shown some truly impressive problem-solving ability: read the various mathematicians’ comments in Remarks on the disproof of the unit distance conjecture where they grapple with this. But nobody really knows where this is going. So far LLMs have not shown much ability to invent new theories of mathematics, so it would be jumping to conclusions to assume AI will soon overtake humans in that realm. It would also be jumping to conclusions to assume it won’t.

Whatever happens, the real danger is not that AI will become too good, but that it will become too evil—most likely because of the oligarchs, corporations and governments behind it. I wish I had emphasized that point, which is always on my mind.

I think I succeeded in making another point, which is that life will not become pointless simply because some other entity gets better than humans at something and knocks us off our throne. To think that the meaning of life resides in our superiority is a childish attitude.


Summing the Reciprocals of Primes

1 June, 2026

The sum of the reciprocals of the primes diverges, but very slowly. The sum of the reciprocals of the first 100 primes is

2.106…

The sum of the reciprocals of the first 1,000 primes is

2.457…

For the first 10,000 it’s

2.709…

And it keeps creeping up, ever more slowly. To get the sum to reach 6, you need to add up the reciprocals of the first 3 × 10¹³² primes—far more than the number of atoms in the observable universe! Luckily there is no shortage of primes.

Here’s how you can see that the sum diverges, and that it diverges very slowly. First, remember Euler’s product formula for the Riemann zeta function:

\displaystyle{ \sum_{n=1}^\infty \frac{1}{n^{s}} = \prod_{p \text{\, prime}}\left(1-\frac{1}{p^{s}}\right)^{-1} }

This diverges logarithmically when s = 1. Taking logs we see

\displaystyle{ \sum_{p \text{\, prime}}\ln\!\left(1-\frac{1}{p}\right)^{-1} \;\approx\; \sum_{p \text{\, prime}}\frac{1}{p} }

must diverge too—but only log-logarithmically!

A deeper result, called Merten’s Second Theorem and proved here, says that

\displaystyle{ \sum_{p \le n} \frac{1}{p} = \ln\ln n + M + o(1) }

for some constant M. This constant is called the Meissel–Mertens constant. You can think of it as a fancier relative of Euler’s constant \gamma, which is defined by

\displaystyle{   \sum_{k = 1}^n \frac{1}{k} = \ln n + \gamma + o(1) }

But it’s much less widespread in mathematics than Euler’s constant, much as primes are less widespread than natural numbers.

Here’s how it works:

Image

What’s a bit surprising, given the slowness of convergence and the somewhat erratic behavior of the primes, is that people can compute the Meissel–Mertens constant very precisely:

M \approx 0.26149721284764278375542683860869585905\ldots

The trick, of course, is to use another formula for this constant, which lets you compute it much more efficiently than the definition. Here it is:

\displaystyle{ M = \gamma + \sum_{n = 2}^\infty \frac{\mu(n)}{n} \ln \zeta(n)}

where \mu is the Möbius function and \zeta is the Riemann zeta function. By the way, this formula shows that calling M a fancier relative of \gamma is not just talk.

When you know how to compute the Meissel–Mertens constant, you can compare the sum of reciprocals of primes to \ln\ln n + M, and the agreement is very good:

Image

In 1983, Guy Robin proved the curve goes above and below the actual sum infinitely many times, i.e.

\displaystyle{\left(\sum _{p\leq n}{\frac {1}{p}}\right) -\ln \ln n-M}

changes sign infinitely many times. You can see a bit of that happening here:

Image

Puzzle. Can you find a naturally occurring sum that diverges even more slowly, for example like \ln \ln \ln n ?

Acknowledgements

The pictures were created by Dcoetzee, Marek Wolf and Saroad, respectively, and placed into the public domain on Wikicommons. Click on the pictures for more details.

Image


From Pentagons to Pentagrams

29 May, 2026

I recently showed you that if you take the regular icosahedron:


Image

considered in a coordinate system based on the golden ratio, and then replace √5 by -√5 in all your formulas, you get the great icosahedron:


Image

But this fact isn’t an isolated one-off! If we do the same for the regular dodecahedron:


Image

we get the great stellated dodecahedron:


Image

There’s also a star polyhedron called the great dodecahedron:


Image

and if we play the same game, replacing \sqrt{5} by -\sqrt{5} in all the formulas, we get the small stellated dodecahedron:


Image

These six polyhedra form a family; the four nonconvex ones are called the Kepler–Poinsot polyhedra. I never understood what was so great about them, though of course they look ravishingly attractive. So it was nice to learn that if we include the convex ones, they come in three pairs related by the operation of replacing \sqrt{5} by -\sqrt{5}, which is called Galois conjugation. This is mentioned near the end of this book:

• John Horton Conway, Heidi Burgiel and Chaim Goodman-Strauss, The Symmetries of Things, A K Peters, Natick, Massachusetts, 2008.

These authors spend more energy describing three other relations among this family of polyhedra:


Image

But I’m more interested in Galois conjugation, which carries each polyhedron in this picture to the one at the opposite corner of the hexagon. I got interested in Galois conjugation because it interchanges two kinds of quasiparticles that propagate in icosahedral quasicrystals, called phonons and phasons:

Phasons in Quasicrystals.

But there’s some simple geometry behind it, which I’d like to discuss here.

You’ll notice that in all the examples I gave, Galois conjugation takes regular pentagons to regular pentagrams. And that turns out to be a general fact!

Let \mathbb{Q}(\sqrt{5}) be the golden field: that is, the set of all numbers

a + b \sqrt{5}

with a,b rational, equipped with the usual addition, multiplication, subtraction and division. Define Galois conjugation

f \colon \mathbb{Q}(\sqrt{5}) \to \mathbb{Q}(\sqrt{5})

by

f(a + b \sqrt{5}) = a - b \sqrt{5}

This map preserves all the field operations, and if you apply it twice you get back where you started. Thus, it’s like complex conjugation in many respects.

The golden field gets its name because it contains the golden ratio

\displaystyle{ \Phi = \frac{1 + \sqrt{5}}{2} = 1.6180339\dots }

If we apply Galois conjugation to the golden ratio, we get its negative reciprocal:

\displaystyle{ -1/\Phi = \frac{1 - \sqrt{5}}{2} \approx -0.6180339\dots }

This suggests that Galois conjugation should somehow map regular pentagons to regular pentagrams! Why? Well, in a regular pentagon, each exterior turning angle is 2\pi/ 5 = 72^\circ:


Image

while in a regular pentagram, each exterior turning angle is 4 \pi /5 = 144^\circ:


Image

The cosine of the exterior turning angle for the pentagon is

\cos(2\pi/5) = 1/2 \Phi

and we apply Galois conjugation to this, we get the cosine of the exterior turning angle for the pentagram!

\cos(4 \pi/5) = -\Phi/2

This is not quite a proof that Galois conjugation turns regular pentagons into regular pentagrams—indeed, we have to clarify what we even mean by that claim. But it’s a key ingredient of the proof.

To be more precise, let’s consider a regular pentagon in \mathbb{R}^n whose vertices lie in \mathbb{Q}(\sqrt{5})^n. Beware: such a pentagon is impossible in the plane!

Puzzle. Show this.

But it’s possible in 3 or more dimensions. For example:

\begin{array}{ccl}  v_1 &=& (1,1,1)  \\  v_2 &=& (0, 1/\Phi,\Phi) \\  v_3 &=& (-1,1,1) \\  v_4 &=& (-1/\Phi,\Phi,0) \\  v_5 &=& (1/\Phi,\Phi,0)   \end{array}

taken in cyclic order, are the vertices of a regular pentagon in 3 dimensions. And once you can get one, you can get plenty, by translations and rotations. It may take a bit of thought to dream up rotation matrices with entries in the golden field, but there are lots: even rotation matrices with rational entries are dense among all rotations.

Now, Galois conjugation acts on \mathbb{Q}(\sqrt{5})^n coordinatewise; let’s abuse language and call this map

f \colon \mathbb{Q}(\sqrt{5})^n \to \mathbb{Q}(\sqrt{5})^n

If we take a regular pentagon and apply this map to its vertices and edges, what do we get? A regular pentagram, I claim!

We can simplify the proof by noticing that only the cyclic ordering on the vertices is needed to distinguish a regular pentagon and a regular pentagram.

Theorem. Let v_1, \dots, v_5 \in \mathbb{Q}(\sqrt{5})^n be the vertices of a regular pentagon, listed in cyclic order. Then f(v_1), \dots, f(v_5), listed in the same cyclic order, are the vertices of a regular pentagram.

Proof. Define the edge vectors

e_i \;=\; v_{i+1} - v_i \;\in\; \mathbb{Q}(\sqrt{5})^n, \qquad i = 1, \dots, 5 \pmod 5

All these have the same squared length

e_i \cdot e_i = L^2 \in \mathbb{Q}(\sqrt{5})

The exterior turning angle at each vertex of the pentagon is 2\pi/5, so this is the angle between the consecutive edge vectors e_i and e_{i+1}. Since

\cos(2\pi/5) = 1/2\Phi

we have

e_i \cdot e_{i+1} \;=\; \cos(2\pi/5) L^2  \;=\; (1/2\Phi) \, L^2

Now let v_i' = f(v_i) and e_i' = f(e_i) = v_{i+1}' - v_i'. It is easy to check that the usual dot product of v, w \in \mathbb{Q}(\sqrt{5})^n obeys

f(v) \cdot f(w) = f(v \cdot w)

so

e_i' \cdot e_i' \;=\; f(L^2)

and

e_i' \cdot e_{i+1}' \;=\;  f(1/2\Phi) \, f(L^2) \;=\; -(\Phi/2) \, f(L^2)

Therefore the cosine of the angle between consecutive edge vectors e_i' is

\displaystyle{  \frac{e_i' \cdot e_{i+1}'}{\sqrt{(e_i' \cdot e_i')(e_{i+1}' \cdot e_{i+1}')}}     \;=\;  \frac{-(\Phi/2) f(L^2)}{\sqrt{f(L^2)^2}}  \; = \; -\Phi/2 }

Since

\cos(4\pi/5)   \; = \; -\Phi /2

it follows that the angle between consecutive edge vectors is 4\pi/5.

But wait! The above calculation secretly assumed f(L^2) is positive, because we claimed that the usual positive square root of f(L^2)^2 equals f(L^2), and we also felt free to divide by f(L^2). Why is f(L^2) positive? Writing

e_i = (c_1, \dots, c_n)

with c_j \in \mathbb{Q}(\sqrt{5}), we have

L^2 = \sum_j c_j^2

and thus

f(L^2) = \sum_j f(c_j)^2

This is a sum of squares of real numbers, hence nonnegative. It is strictly positive because f is injective, so the f(c_j) are not all zero.

The five points v_i' are coplanar, since coplanarity amounts to the vanishing of certain 3 × 3 minors in the matrix of coordinate differences, which is a polynomial condition over \mathbb{Q}(\Phi), hence preserved by f. These points are also distinct, since f is injective. The edges e'_i thus form a planar closed 5-gon with equal edge lengths \sqrt{f(L^2)} and constant exterior turning angle 4\pi/5 at each vertex. This is a regular pentagram.   █

The same style of argument shows that applying Galois conjugation to a regular pentagram with vertices in \mathbb{Q}(\sqrt{5})^n, we get back a regular pentagon. And if you’re worried about what happened to the plane, fear not! We can draw regular pentagons in the plane whose vertices have coordinates in a certain quadratic extension of \mathbb{Q}(\sqrt{5}). This larger field again has an automorphism that carries regular pentagons to regular pentagrams.

We can also play similar games with heptagons and the like, using different fields.

Acknowledgments and addenda

The red pictures of polyhedra were made using were created using Robert Webb’s Stella software and placed on Wikicommons. The diagram of Kepler–Poinsot polyhedra was created by Tilman Piesk and placed on Wikicommons.

For a description of the small stellated dodecahedron and great dodecahedron as Riemann surfaces—branched coverings of the sphere—try this:

• John Baez, Small stellated dodecahedron, Visual Insight, 15 June 2016.

I do not know how these branched coverings are related to the Galois theory perspective given here!

On Mastodon, J. M. animated an icosahedron morphing into a great icosahedron:

Image

and a dodecahedron morphing to a great stellated dodecahedron:

Image

Here’s another fun example. If you take a rhombicosidodecahedron with vertex coordinates all in the golden field:

Image

and apply the Galois transformation \sqrt{5} \mapsto -\sqrt{5}, the pentagons turn into pentagrams, while the squares stay squares and the equilateral triangles stay equilateral triangles. It gets messy if we draw everything, but if we draw just the pentagrams it’s beautiful:

Image

Interestingly the squares and triangles, not drawn, stay the same size—because the squared lengths of their edges are rational! But the squared lengths of the pentagon edges involve \sqrt{5}, so they change as they become pentagram edges.


The Great Icosahedron

27 May, 2026

I never knew what was so great about the ‘great icosahedron’. Now I do.

Take a regular icosahedron whose vertices have coordinates in the field ℚ[√5], which consists of numbers a + b√5 with a and b rational.

Image

Apply the nontrivial element of the Galois group of this field: that is, simply replace √5 by -√5 in all your formulas. You get the great icosahedron:

Image

What do I mean by this, exactly? For example, take the regular icosahedron whose 12 vertices are

(±1,±Φ,0),   (0,±1,±Φ),   (±Φ,0,±1)

where Φ = (1 + √5)/2 is the golden ratio. Form a bunch of

• points (all the vertices of the icosahedron),
• lines (containing all the edges of the icosahedron), and
• planes (containing all the faces of the icosahedron)

Now replace √5 by -√5 in all your formulas. You get the equations for a new bunch of points, lines and planes. And:

• the points are the vertices a great icosahedron;
• the lines contain all the edges of this great icosahedron;
• the planes contain all the faces of this great icosahedron.

So, replacing √5 by -√5 makes the icosahedron great!

On Mastodon, J. M. animated this using an interpolation process called ‘tweening’:

Image

If you apply the Galois transformation again you get back the icosahedron. If you apply it yet again you make the icosahedron great again.

I thank and andeux for their help on this.

This fact is asserted without proof in the section on regular star-polytopes near the end of this book:

• John Horton Conway, Heidi Burgiel and Chaim Goodman-Strauss, The Symmetries of Things, A K Peters, Natick, Massachusetts, 2008.

Let me sketch an uninspired, purely computational proof. Take each of these twelve icosahedron vertices:

(±1,±Φ,0),   (0,±1,±Φ),   (±Φ,0,±1)

and find its five nearest neighbors. Then apply the Galois transformation, which amounts to replacing Φ with -1/Φ. You get twelve new points

(±1,±1/Φ,0),   (0,±1,±1/Φ),   (±1/Φ,0,±1)

which turn out to be vertices of a new, smaller icosahedron!

Then, for each vertex of the original icosahedron, see where it goes under this transformation, and see where its nearest neighbors go. You’ll see they become second nearest neighbors.

Thus the edges of the original icosahedron, which connect nearest neighbor vertices, go to edges connecting second nearest neighbors of a new smaller icosahedron.

What about the faces? The faces of the original icosahedron are equilateral triangles whose corners are triples of icosahedron vertices that are all nearest neighbors of each other. So, they get sent to triangles whose corners are triples of vertices that are all second nearest neighbors.

And these facts characterize the great icosahedron: it has the vertices of a regular icosahedron, edges connecting all pairs of second nearest neighbor vertices:

Image

and faces formed by all triples of second nearest neighbor vertices. You can’t see any of these triangular faces in its entirety in this picture—but if you look hard you can see most of one.

These are all the possible squared distances between vertices in the original icosahedron and the new great icosahedron:

Original icosahedron   Great icosahedron
0 0
4 4
4 + 4Φ ≈ 10.47 4 − 4/Φ ≈ 1.53
8 + 4Φ ≈ 14.47 8 − 4/Φ ≈ 5.53

To get the new squared distances from the old ones we simply replace Φ by -1/Φ, as we must, since Galois transformations preserve the field operations and squared distances are computed using field operations.

The nearest neighbors had distance squared 4, and these must be mapped to new points that still have distance squared 4—but now these are second nearest neighbors.

Of course it would be nice to have a deeper, more general understanding of what the Galois transformation sending \sqrt{5} to -\sqrt{5} does to the geometry of shapes made from regular pentagons. Soon I’ll dig a bit deeper into this!


What Are Atoms Made Of?

24 May, 2026

This post starts with the the slides of an elementary talk I gave at Sloans Bar and Grill, in Glasgow, as part of a wonderful series called A Pint of Science. At the end I include some fascinating details which I only had time to briefly touch on in my talk. If you already know plenty of physics, this is the juicy stuff.

Image


Start with a hair:

Image

A red blood cell is ten times smaller across:

Image

A flu virus is 10 times smaller across than that:

Image

The flagellum of this cell is ten times smaller across than that:

Image

A molecule of hemoglobin is about ten times smaller than that:

Image

A water molecule is ten times smaller than that:

Image

A hydrogen atom is 5 times smaller than that:

Image

This is an actual image of a single hydrogen atom, which is amazing. (Read more here.)

But how did people figure out that atoms even exist, back before we could see individual atoms? And how did we figure out what they’re made of?

It started with chemistry. Two liters of hydrogen burn with one of oxygen to form one liter of water vapor, so we guess water is made of 2 atoms of hydrogen and 1 of oxygen: H2O. But one cubic meter of oxygen is 16 times heavier than one of hydrogen, so we guess O is 16 times heavier than H.

And so on — this took a lot of detective work, back in the 1800s.

By 1871, Mendeleev created the periodic table, listing atoms in order of weight. Here’s a modern version:

Image

But what are the numbers on these elements? They are called atomic numbers. They are not the atom’s weights. Now we know it’s the number of electrons they have! But how did we find out about electrons?

In 1869, William Crookes made a beam by evacuating a glass tube and running an electric current though it:

Image

In 1897, J. J. Thomson showed that this beam is made of particles less than one thousandth the mass of hydrogen atoms! We now call these particle electrons though Thomson disliked this term. (Read more here.)

Electrons are negatively charged, but atoms have no charge, so there must be something positively charged in the atom. Thomson proposed that an atom is a ball of positive charge with electrons speckled throughout it. A journalist called this the plum-pudding atom, and the name stuck. (Read more here.)

Image

But in 1886, Eugen Goldstein created a beam moving in the opposite direction from the electrons in an evacuated tube! (Read more here.)

Image

In 1898 Wilhelm Wien showed this beam is made of positively charged particles with about the same mass as a hydrogen atom. It took a long time to realize the full importance of these particle. We now call them protons. (Read more here.)

In 1909 Rutherford’s team in Manchester showed that the positive charge in an atom is concentrated in a small part near its center: the nucleus.

Image

His assistants Geiger and Marsden fired particles (which we now know are helium nuclei) at some gold leaf, and some bounced back. When Rutherford saw the results of this experiment, he wrote “it was almost as incredible as if you had fired a 15-inch shell at a piece of tissue paper and it came back and hit you”. The plum-pudding model was out!

But there’s a big problem. The “atomic weight” is the little number at the bottom of each square here:

Image

If protons account for almost all the mass of the atom, the atomic weight of an atom should be the number of protons it contains. But since the atomic weight is bigger than the atomic number, this would mean atoms would have more protons than electrons — hence be positively charged. They’re not!

Let’s not even worry now about the fact that the atomic weight is not always close to an integer. Let’s just worry about this: how could an atom have more protons than electrons, if it’s electrically neutral?

Here’s one possible solution: the nucleus contains extra electrons to cancel out the excess positive charge. Rutherford argued for this in 1920.

For example helium has atomic number 2, but atomic mass 4. With 2 electrons and 4 protons, helium would have charge +2. It doesn’t! But if it also had 2 extra electrons in the nucleus it would be neutral, as observed.

But what would make some of the electrons stay in the nucleus, while others orbit it?

In 1930, Marie Curie’s daughter Irène and her husband created a beam of electrically neutral particles by bombarding the metal beryllium
with radiation.

Image

In Cambridge, James Chadwick carried these experiments further and proved there was a neutral particle with almost the same mass as a proton: the neutron. (Read more here.)

It became clear that the nucleus of an atom is made of protons and neutrons!

Later experiments showed a nucleus is about 1/60,000 as big across as an atom. If the hydrogen atom were a sphere 30 meters across, the proton would be a grain of salt at the center.

This raises another huge problem. If the protons are confined in the tiny nucleus, and they’re all positively charged, they must repel each other immensely! What holds them together?

But that’s another story! I’ve sketched out how we figured out the basics of atoms — that’s atomic physics. The next story would be nuclear physics. There are always new puzzles, leading us onward.


Digression: photographing a hydrogen atom

The photograph is not just a picture of the wavefunction of the electron in a hydrogen atom; it’s the result of a complicated process. For more try these:

The second is more careful than its title. It does the unusual and welcome thing of saying that “the experimentally observed nodal structures originate from the transverse nodal structure of the initial state that is formed upon laser excitation” — which is the actual claim, properly hedged, rather than “we photographed the orbital.” It also notes the comparison between measured and predicted images, which makes clear that this is a quantitative experiment with theory backing it, not a literal photograph.

Digression: J. J. Thomson

In fact, the man who discovered the electron through some of the most delicate vacuum-tube experiments ever performed was reportedly a complete disaster with his hands.

J. J. Thomson was hopeless in the lab. His assistant Ebenezer Everett forbade Thomson from touching anything delicate on the grounds that he was “exceptionally helpless with his hands”. According to Ainissa Ramirez:

For such a small man, he was a Victorian bull in a china shop. When he visited his students in the laboratory, they’d wince when he offered help, and quickly tried to move fragile things out of his way. They took deep breaths when he sat on a lab stool to speak. Life was no better at home. J. J.’s wife did not permit him to use a hammer in the house.

The irony: the 1897 electron experiments depended absolutely on glassware that almost no one in the world could make. You needed cathode-ray tubes capable of holding a vacuum so high that any ordinary tube would shatter, with metal electrodes sealed through the glass without leaking. Everett, a self-taught glassblower who had migrated over from the chemistry department, made every tube by hand. Thomson didn’t touch them.

Thomson called the particles he discovered “corpuscules”. The word “electron” was already in circulation when he made his announcement: George Johnstone Stoney had coined it in 1891 for the unit of charge in electrolysis, and Joseph Larmor was already using it in 1894 for a theoretical particle in his electromagnetic ether theory. Within months of Thomson’s April 1897 lecture, George FitzGerald suggested that the corpuscle identified by Thomson from cathode rays and proposed as parts of an atom was a “free electron,” as described by physicist Joseph Larmor and Hendrik Lorentz. While Thomson did not adopt the terminology, the connection convinced other scientists that cathode rays were particles.

Thomson dug in. He kept saying “corpuscle” through the whole period when he was actually building his case for the particle — the 1899 paper showing the photoelectric particles had the same m/e, the 1904 plum-pudding model paper, the 1906 Nobel Prize, all discussed “corpuscles.” Thomson himself continued to use the term corpuscle until 1913 — about sixteen years after his announcement, and well after the rest of the physics community had moved on. His 1906 Nobel citation reflects how oddly Thomson’s contribution was framed at the time: “His 1906 Nobel Prize was granted ‘in recognition of the great merits of his theoretical and experimental investigations on the conduction of electricity by gases,’ not for any specific discovery, let alone the electron (which he kept calling the corpuscle)”.

Part of the resistance was substantive, not just stubborn. “Electron” in Larmor’s and Lorentz’s usage was a theoretical entity tied to an ether-based electromagnetic worldview, with specific commitments Thomson didn’t want to inherit. Calling his thing a “corpuscle” let him keep it as a mechanical, material particle — a building block of the atom in his own picture — without buying into the Continental electromagnetic program.

So the textbook line “Thomson discovered the electron in 1897” smooths over a real conceptual identification that took a decade and a half, and that Thomson himself was among the last to embrace.

For more, try:

• Isobel Falconer, Corpuscles to electrons in Histories of the Electron, Buchwald and Warwick, eds., MIT Press, Cambridge, 2001.

• Ainissa Ramirez, The Alchemy of Us—How Humans and Matter Transformed One Another, MIT Press, Cambridge, Massaschusetts, 2020.

Digression: the plum-pudding atom

The “plum pudding” name was invented by an anonymous popular-science writer in 1906. You can see a careful historical reconstruction here:

• Giora Hon and Bernard R. Goldstein, J. J. Thomson’s plum-pudding atomic model: the making of a scientific myth, Annalen der Physik 525 (2013), A129–A133.

Hon and Goldstein tracked down the earliest occurrence: an anonymous reporter in the British pharmaceutical trade magazine The Chemist and Druggist, August 1906, who described the atom as having “minute specks, the negative corpuscles, swimming about in a sphere of positive electrification, like raisins in a parsimonious plum-pudding”. “The analogy was never used by Thomson nor his colleagues. It seems to have been coined by popular science writers to make the model easier to understand for the layman.”

In fact, by the time the Chemist and Druggist writer reached for the pudding image in 1906, Thomson’s model had already moved past the version it was supposedly describing. As Hon and Goldstein put it: “according to Thomson’s theory of 1906, the electrons revolve on rings about the center of the atom, and they are not distributed throughout the ‘pudding’ as ‘raisins swimming about.'” The pudding image was obsolete from the moment it was coined.

What Thomson actually used as his guiding analogy was much more interesting: floating magnets. Thomson drew an analogy with experiments by Alfred Marshall Mayer (1836–1897). Piercing small magnetic needles into corks and letting them float in water below a strong magnet, Mayer had observed in 1878/79 that the magnetized floating needles quasi-automatically positioned themselves in characteristic configurations depending on their number. With more than six magnetic needles present, a seventh and eighth would inevitably position itself inside the outer ring of six. As the number of floating magnets increased, more and more rings would form. Thomson hoped that a similar ring-structure composed of corpuscles could be found inside chemical atoms, and suspected that each of these rings would be analogous to the chemical periods. So in Thomson’s head, the atom looked like Mayer’s needle-in-cork rings — dynamic, self-organizing, suggestive of the periodic table, not a dessert.

Another question: was the model even Thomson’s alone? Britannica notes that the model was “proposed about 1900 by William Thomson (Lord Kelvin) and strongly supported by Sir Joseph John Thomson, who had discovered (1897) the electron”. Kelvin had been working on similar uniform-positive-sphere ideas a few years before J.J. Thomson’s 1904 Philosophical Magazine paper, so the standard “Thomson’s plum pudding” of textbooks elides both a co-originator and the actual originator of the name.

• A. M. Mayer, Floating magnets, Nature 17 (1878), 487–488.

• Klaus Hentschel, Atomic Models, J.J. Thomson’s ‘Plum Pudding’ Model, in Compendium of Quantum Physics, eds. D. Greenberger, K. Hentschel and F. Weinert, Springer, Berlin, 2009.

• John Heilbron, J.J. Thomson and the Bohr atom, Physics Today 30 (1977), 23–30.

• Wikipedia, Plum pudding model.

• Britannica, Thomson atomic model.


Digression: Eugen Goldstein

In Crookes’ tube, electrons flew from a negatively charged metal tip (the cathode) to the positively charged one (the anode). Thus, electrons were called cathode rays, and this apparatus was called a cathode ray tube.

By the mid-1880s, the phenomena created by these tubes was a hot but baffling field. There was a vigorous debate about whether cathode rays were charged particles (the Crookes/Schuster view) or some kind of wave in the aether (the German view, which Goldstein himself leaned toward). In 1876, Goldstein discovered that cathode rays were emitted perpendicular to the cathode surface. This inspired the design of concave cathodes to produce concentrated or focused rays. So experimenters were routinely fiddling with cathode geometries — different shapes, different sizes, different configurations — to study cathode ray behavior. Goldstein spent his time doing this.

Eventually, around 1886, he tried drilling small holes in the cathode — Kanäle, meaning “channels” or “canals”. He discovered something surprising: wth the tube running, a faint glow appeared streaming out the back of the cathode, opposite to the direction of the cathode rays. Busch writes: “Goldstein therefore called them ‘canal rays’ (German ‘Kanalstrahlen’). Since the canal rays went in the opposite direction as the ‘cathode rays’, Goldstein speculated that these rays consisted of positively charged particles.”

Later these canal rays were called “anode rays” as a parallel to “cathode rays,” but it’s somewhat misleading — these rays don’t actually originate at the anode. They “are produced by positively charged ions after impact ionization in the cathode ray tubes. These ions are accelerated towards the cathode, pass — if the cathode is provided with holes — through these holes (‘channels’) due to their inertia and can be detected behind the cathode by their luminous phenomena.” So they’re lone protons formed in the gas, accelerated toward the cathode, that shoot through the holes by inertia and emerge on the far side.

• Britannica, Eugen Goldstein.

• Wikipedia, Eugen Goldstein.

• Encyclopedia of Scientific Biography, Eugen Goldstein.

• Uwe Busch, Claims of priority — The scientific path to the discovery of X-rays, Z. Med. Phys. 19 (2023), 230–242.

Digression: Wilhelm Wien

Wilhelm Wien, famous for his work on blackbody radiation, was working at Aachen when in 1898 he turned to the problem Goldstein had left dangling. Goldstein himself had tried to deflect canal rays magnetically and failed. Using the strongest magnet he had, one that certainly had an effect on the cathode rays, Goldstein attempted to deflect his canal rays, but he observed no change in their path. So for twelve years the rays were a sort of mystery — clearly something real, but resistant to the magnetic deflection trick that was the standard probe of the era.

Wien managed to bend canal ways using brute force. To deflect the cathode rays, you only needed a modest field. To deflect the canal rays, Wien needed a magnetic field of 3250 gauss and a electrical field of 2000 V — and his rays moved a grand total of 6 mm. The thing was enormously more sluggish to deflect than electrons, and this meant the particles were vastly more massive per unit charge. Goldstein hadn’t been doing anything wrong — he just hadn’t been hitting them hard enough!

The really nice piece of physics is how Wien figured out the charge/mass ratio of canal rays. He invented what we now call the Wien filter: a charged-particle velocity filter with orthogonal electric and magnetic fields. The trick is elegant. The magnetic force on a particle depends on its velocity; the electric force doesn’t. So if you tune the fields against each other, only particles of one particular velocity sail straight through undeflected — everything else gets bent. Once you know the velocity, the deflection in a pure magnetic field tells you charge-to-mass. It’s a beautiful idea, and it’s still the working principle behind ion-beam instruments today.

Wilhelm Wien analyzed these positive rays in 1898 and found that the particles have a mass-to-charge ratio more than 1,000 times larger than that of the electron. Because the ratio of the particles is also comparable to the mass-to-charge ratio of the residual atoms in the discharge tubes, scientists suspected that the rays were actually ions from the gases in the tube. So Wien did not discover the proton, contrary to what’s sometimes claimed. What he found was:

• The deflection was in the opposite direction from cathode rays — so the particles were positively charged.

• The charge-to-mass ratio depended on the gas in the tube, not on the cathode material — so these weren’t some universal particle like the electron; they were ions of whatever gas you’d filled the tube with.

• The mass-to-charge ratio was comparable to atomic masses, meaning these were essentially atoms missing electrons.

The hydrogen ion — which we now call the proton — was just one of many ions Wien could produce, and there was nothing special about it in his work. The identification of the hydrogen ion as a fundamental constituent of all nuclei came much later, with Rutherford’s nuclear-scattering experiments and his 1919 demonstration that he could knock hydrogen nuclei out of nitrogen. Rutherford named it the proton around 1920.

So the lineage is this: Goldstein 1886 saw a glow, Wien 1898 showed it was ionized gas, and finally Rutherford 1919–1920 identified the hydrogen ion as a universal nuclear building block.

Digression: Discovery of the neutron

The discovery of the neutron was an extremely complicated business, starting with the the problem of understanding atomic mass (Z) and atomic number (A) in the periodic table.

Prout, isotopes, and the Z-vs-A puzzle. The puzzle has roots in William Prout’s 1815 hypothesis that atomic weights are integer multiples of hydrogen, suggesting hydrogen as a building block for all the other elements. Through the nineteenth century this looked broken: chlorine sat at 35.5, and most elements were not clean integers. Two developments rehabilitated it. Soddy in 1913 introduced the concept of isotopes — the same element can exist in chemically identical forms of different mass — explaining the non-integer atomic weights as weighted averages. J.J. Thomson’s 1912–13 work on neon gave the first stable-element example, and Aston’s mass spectrograph (1919 onwards) established what Aston called the “whole-number rule”: individual isotopes have atomic masses very nearly integral in units of hydrogen, with small deviations here and there. In parallel, Moseley’s 1913 work using X-rays pinned down atomic number Z as the nuclear charge, not just an ordinal label. Putting these together you got the central puzzle: for elements heavier than hydrogen, the atomic mass A is larger than the atomic number Z!

The proton-electron nucleus. The obvious move was: maybe the nucleus contains A protons and A − Z electrons, giving the right charge and mass. The model was appealing on three grounds beyond accounting. First, only two particles were known (proton and electron), so parsimony favored it. Second, beta decay does eject electrons from nuclei, so the electrons appeared to be there to begin with. Third, alpha particles (helium-4 nuclei) could be assembled as 4p + 2e, neatly. Rutherford himself adopted this picture in his 1920 Bakerian Lecture:

• Ernest Rutherford, Nuclear constitution of atoms, Proc. Roy. Soc. A 97 (1920), 374–400.

In that lecture he went further: he proposed that some proton-electron pairs inside the nucleus might be bound so tightly as to form a compact neutral unit: “the idea of the possible existence of an atom of mass one, which has a zero nuclear charge”. He floated the term “neutron” for it. (W.D. Harkins coined the same term independently in 1921.) Note that Rutherford’s neutron was not a new elementary particle; it was a compound, a sort of miniature hydrogen atom. This conception persisted for over a decade and shaped how the discovery was eventually interpreted.

Three killer problems with electrons in the nucleus. Through the 1920s, three independent objections accumulated.

The confinement problem. Confining an electron to a nucleus of radius ~10⁻¹⁴ m gives, by the uncertainty principle, a momentum spread of order ℏ/r and hence a kinetic energy of order ~20 MeV — far larger than any nuclear binding energy then known, and far larger than the energies of beta-decay electrons. With Dirac’s relativistic electron theory (1928), things got worse: Klein’s paradox showed that electrons in sufficiently strong potentials don’t bind cleanly at all. Bohr was so disturbed by this that around 1929–30 he was openly speculating that energy and momentum conservation might fail at nuclear scales.

The nitrogen-14 anomaly. This is the cleanest of the three and the one that broke the model. Studies of the spectrum of molecular nitrogen in the late 1920s showed that the nitrogen-14 nucleus is a boson, hence must have integer spin. (Direct measurement gave spin 1.) But in the theory where a neutron is a proton-electron combination model, nitrogen-14 would be 14p + 7e = 21 spin-½ fermions, which must have half-integer total spin and thus be a fermion. The model predicted the wrong thing for the most common nitrogen isotope!

The continuous beta spectrum. Chadwick himself, back in 1914, had shown that beta decay produces a continuous energy spectrum, not a line spectrum. Ellis, Wooster, and (decisively) Meitner and Orthmann’s 1929 calorimetric experiment confirmed energy was apparently not conserved. Pauli’s famous December 1930 letter “Dear Radioactive Ladies and Gentlemen” proposed a neutral, spin-½, light particle inside the nucleus to fix both energy conservation and the spin-statistics problem simultaneously — he called his particle a “neutron” too. (Fermi renamed it “neutrino” in 1933 after Chadwick’s discovery, to avoid the name collision.)

So by 1930 there were three structurally independent reasons to think the proton-electron nucleus was wrong, and Rutherford had a candidate replacement that nobody had yet found.

The 1930–32 experimental crescendo. The discovery story is what the Encyclopedia of Scientific Biography calls a “Tale of Three Cities” — Berlin, Paris, and Cambridge. “Walther Bothe and his assistant Herbert Becker [were] working in the Physikalisch-Technische Reichsanstalt (Imperial Physical-Technical Institute) in Charlottenburg, a suburb of Berlin; Irène Curie and her husband Frédéric Joliot working in the Institut du Radium in Paris; and James Chadwick working in the Cavendish Laboratory in Cambridge. The story reached its crescendo between June 1930 and February 1932.”

Berlin, 1930. Walther Bothe and Herbert Becker observed that bombarding beryllium with alpha particles emitted a highly penetrating, electrically neutral radiation. They, along with most of the scientific community, incorrectly assumed this radiation was exceptionally high-energy gamma rays. Lead absorption coefficients suggested gamma rays of unprecedented energy — odd, since the nuclear binding energies available from α + ⁹Be shouldn’t have been enough.

• Paris, late 1931 – January 1932. Irène Joliot-Curie and Frédéric Joliot, with the world’s strongest polonium alpha source (an inheritance, essentially, from Marie Curie’s lab), pushed the Bothe radiation through a paraffin window. They found that this radiation knocked loose protons from hydrogen atoms in that target, and those protons recoiled with very high velocity. They interpreted the result as the scattering of high-energy photons off protons — which, as Chadwick noted in his Feb 27, 1932 Nature letter, required “that the beryllium radiation had a quantum energy of 50 × 10⁶ electron volts”. That was about ten times more energy than any plausible nuclear gamma source could produce. Curie and Joliot reported their findings on 18 January 1932, and the Comptes Rendus arrived in Cambridge within a couple of weeks.

Cambridge, February 1932. Chadwick read the paper and, primed by Rutherford’s 1920 prediction and his own decade-long search for the neutron, instantly suspected the right answer. Rutherford’s reported reaction was characteristic: “I do not believe it!” Chadwick built the experiment immediately — borrowed polonium, the Cavendish’s beryllium, an ionization chamber. The crucial idea was simply to substitute multiple other targets for the paraffin: hydrogen, nitrogen, helium, lithium. If the projectile is a massive neutral particle of unknown mass m, you can work out the maximum recoil velocity of a nucleus of mass M. Comparing the maximum recoil velocities for hydrogen and nitrogen, Chadwick could solve algebraically for m. He found m is close to the proton mass. Three weeks after reading the Paris paper, he submitted a paper on the possible existence of a neutron on February 17th, 1932, which was published ten days later:

• James Chadwick, Possible existence of a neutron, Nature 129 (1932), 312.

He sent a fuller, more confidently titled paper to the Royal Society on the 10th of May:

• Chadwick, The existence of a neutron, Proc. Roy. Soc. A 136 (1932), 692–708.

Aftermath: from compound to elementary. Chadwick himself, in 1932, still followed Rutherford and described the neutron as a tight proton-electron compound — he wasn’t yet claiming a new elementary particle. The shift came over the next two years. Heisenberg’s three “On the structure of atomic nuclei” papers (July–December 1932) treated the proton and neutron as two states of a single ‘nucleon’ with the strong force coupling them via isospin — a structure that doesn’t work at all if the neutron has an electron lurking inside it. Fermi’s January 1934 theory of beta decay treated the electron as created at the moment of decay (the electromagnetic-radiation analogy: photons aren’t “in” the atom before emission either), removing the last motivation to put electrons in the nucleus. And the final empirical nail, suggested to Chadwick by the young Maurice Goldhaber, was photodisintegration of the deuteron: γ + d → p + n. By energy conservation, the neutron mass came out at greater than 1.0077 and less than 1.0086 in atomic mass units. Thus, the neutron was more massive than the hydrogen atom which had an accurately determined mass of 1.0078 amu. A bound p+e would weigh less than free p plus free e (binding energy is negative); since the neutron weighs more than a hydrogen atom, it cannot be such a bound state. By 1934 the neutron was firmly an elementary particle.

Primary sources and good histories:

• Lawrence Rutherford, Bakerian Lecture: nuclear constitution of atoms, Roy. Soc. Proc. A 97 (1920), 374–400.

• Irène Joliot-Curie and Frédéric Joliot-Curie, Émission de protons de grande vitesse par les substances hydrogénées sous l’influence des rayons γ très pénétrants, Comptes Rendus 194 (1932): 273.

• APS News, Chadwick reports the discovery of the neutron, May 1932.

• S. M. Bilenky, Neutrino: history of a unique particle.

• Roger Stuewer, The nuclear electron hypothesis, in Otto Hahn and the Rise of Nuclear Physics, Reidel, 1983.


Phasons in Quasicrystals

19 May, 2026
Image

In 2025, researchers studied a quasicrystal forged in a hypervelocity asteroid collision 600 million years ago—and found that it contains ‘phasons’!

It’s not a perfect icosahedral quasicrystal: it’s slightly distorted. 6 gentle ‘phason waves’ run through it, oriented along the 6 fivefold symmetry axes of an icosahedron. These waves were locked in when the alloy quickly cooled after impact, and they’ve been sitting there frozen in the structure ever since.

This quasicrystal is made of ‘icosahedrite’. The easiest way to describe this is the ‘slice and project’ method. You start with a lattice in 6 dimensions, choose a 3d slice, thicken that up a bit, take the lattice points that lie in the thickened slice, and project them down to 3d space. The atoms in the icosahedrite are exactly the projections of the 6d lattice points that happen to fall inside the thickened slice.

But now imagine wiggling the slice gently—letting it ripple in the other three dimensions, the ones perpendicular to physical space. Some 6d lattice points slip out of the slice and others slip in. In physical space this looks like atoms suddenly hopping from one position to a nearby alternative one.

These atomic hops are called ‘phason flips’, and a wave of these hops is a ‘phason’. Sound waves involve atoms swaying smoothly; phasons involve atoms jumping between alternative positions, and they’re special to quasicrystals.

These phasons give a fossil record of the collision that made the quasicrystal: the instant of cooling, preserved as a piece of warped 6-dimensional geometry, sitting inside a rock for 600 million years!

References

For more about quasicrystals in this particular meteorite, which is called the Khatyrka meteorite, read my post:

Naturally occurring quasicrystals.

This paper reported the discovery of icosahedrite in the Khatyrka meteorite:

• L. Bindi, J. M. Eiler, Y. Guan, L. S. Hollister, G. MacPherson, P. J. Steinhardt and N. Yao, Evidence for the extraterrestrial origin of a natural quasicrystal, Proceedings of the National Academy of Sciences 31 (2012), 1396-401.

This is the paper that first reported phasons in this sample of icosahedrite:

• Hiroyuki Takakuru, Asuka Ishikawa, Tsutomu Ishimasa, Luca Bindi and Paul J. Steinhardt, High-resolution synchrotron X-ray study of natural icosahedrite, IUCrJ 12 (2025), 435–443.

Quasicrystal phasons had already been understood long before:

• Dov Levin, T. C. Lubensky, Stellan Ostlund, Sriram Ramaswamy, Paul J. Steinhardt and John Toner, Elasticity and dislocations in pentagonal and icosahedral quasicrystals, Physical Review Letters 54 (1985), 1520–1523.

• T. C. Lubensky, Sriram Ramaswamy and John Toner, Hydrodynamics of icosahedral quasicrystals, Physical Review Letters 32 (1985), 7444–7452.

These papers use some nice math. The lattice I mentioned above is called the D6 lattice: a ‘checkboard’ lattice in 6 dimensions consisting of all points with integer coordinates that sum to an even integer. To understand phasons in icosahedral quasicrystals we need to understand how this lattice connects to the representation theory of the rotational symmetry group of the icosahedron, which is the alternating group A5. This group has an obvious 3-dimensional representation, called 3, where group elements just rotate an icosahedron and drag 3d Euclidean space along with that. But it also has another 3-dimensional representation, the Galois conjugate 3′, where we replace \sqrt{5} by -\sqrt{5} in every formula describing the representation 3.

Beautifully, the 6d space containing the D6 lattice transforms as the representation 33′. It turns out that that 3 describes phonons while the 3′ describes phasons! As mentioned, phonons are waves where atoms move smoothly in 3d space, while phasons involve atoms hopping between alternative positions.

Using representation theory we can classify all the possible formulas, allowed by symmetry, for how an icosahedral quasicrystal can oscillate. The possible choices turn out to involves various constants: two that describe the behavior of phonons, two for phasons, and one for phonon-phason interactions.

I feel like saying more, but this is not the right place!


Naturally Occurring Quasicrystals

14 May, 2026

Image

I love quasicrystals—like crystals, but with patterns that never repeat, like Penrose tiles. But they’ve very rare in nature. They’re created only by the most exotic and violent events: a high-speed collision of asteroids, lightning hitting a downed power cable in a sand dune—or an atomic bomb!

Amazingly, the first 3 kinds of naturally occurring quasicrystal were discovered in a single meteorite that landed in Khatyrka, in the far east of Russia. They’ve never been found anywhere else! And this meteorite is highly anomalous: it’s the only meteorite known that contains metallic aluminum—and it seems to have been formed in a ultra-high-velocity collision between asteroids.

The only other quasicrystal I know that may be naturally created came from a bolt of lightning hitting a sand dune near a downed power cable in Nebraska. Then there was one found amid the fused desert sand and copper transmission cable left behind by the first atomic bomb test at Trinity, New Mexico. That’s not quite ‘naturally created’.

And that’s all. As far as I can tell, the rest have been made in labs!

Here are the 3 kinds of quasicrystal found in the Khatyrka meteorite, in order of their discovery:

• Icosahedrite, Al63Cu24Fe13. By the way, these numbers are percentages, since quasicrystals don’t have integer-ratio molecular formulas the way conventional compounds do, due to their aperiodic nature.

Icosahedrite has full three-dimensional icosahedral symmetry and is quasiperiodic in all 3 directions. The easiest way to describe it mathematically is the ‘slice and project’ method. You start with a lattice in 6 dimensions, choose a 3d slice, thicken that up a bit, take the lattice points that lie in this slice, and project them down to 3d space. The lattice in 6 dimensions is called the D6 lattice: it consists of those 6-tuples of integers that sum to an even integer. If we slice along the correct 3d subspace the result has icosahedral symmetry. A lot of math in this mineral!

• Decagonite, Al71Ni24Fe5. This a ‘stacked’ quasicrystal: quasiperiodic with tenfold rotational symmetry in two dimensions, and ordinary periodic stacking in the third direction. We can build the 2d pattern using the slice and project method starting from the A₄ lattice in 4 dimensions. The A₄ lattice, in turn, is gotten by taking 5-tuples of integers, but only those that sum to zero. This quasicrystal looks like this:

High-resolution electron-microscopy image of natural Al71Ni24Fe5 quasicrystal found in the Khatyrka meteorite, showing decagonal patterns that never quite repeat.    Photo by Paul J. Steinhardt et al. from http://www.nature.com/srep/2015/150313/srep09111/full/srep09111.html

• i-Phase II, Al62Cu31Fe7. Like icosahedrite this has full icosahedral symmetry and you get it mathematically using the slice-and-project method starting from the D₆ lattice. But it has a distinctly different composition: more copper and less aluminum. It’s the first quasicrystal composition discovered in nature before being made in the lab.

The Nebraska quasicrystal shows how blurry the concept of ‘natural’ can be. It was found inside a ‘fulgurite’: a rock made when lightning hits sand. They found it in the Sand Hills near Hyannis, Nebraska, near a downed power line during a storm. It’s unclear whether this fulgurite was created by a lightning strike or by the falling power line creating its own arc, so the ‘natural vs manmade’ status is genuinely ambiguous.

What’s more important is it was a new kind of quasicrystal produced by a high-current, high-temperature, rapid-quench event on Earth’s surface! It has a composition of roughly Mn72.3Si15.6Cr9.7Al1.8Ni0.6. Its atomic planes have 12-fold symmetry in a nonrepeating pattern, and these planes are stacked periodically along the perpendicular direction.

Here’s a picture of this quasicrystal:

tunnelling electron microscope data obtained on a dodecagonal quasicrystal from a fulgurite. (A) The black circle in the acicular, quasicrystalline grain indicates the region where the electron diffraction pattern (Inset) has been collected. (B) A HAADF-TEM image of a portion of the quasicrystalline grain.  From here: www.pnas.org/doi/10.1073/pnas.2215484119

References

On icosahedrite, the first natural quasicrystal to be found in the meteorite from Khatyrka:

• L. Bindi, J. M. Eiler, Y. Guan, L. S. Hollister, G. MacPherson, P. J. Steinhardt and N. Yao, Evidence for the extraterrestrial origin of a natural quasicrystal, Proceedings of the National Academy of Sciences 31 (2012), 1396-401.

On decagonite, the second to be found:

• I. Buganski and L. Bindi, Insight into the structure of decagonite—the extraterrestrial decagonal quasicrystal, IUCrJ 8 (2021), 87–101.

On i-phase II, which is the provisional designation of the third quasicrystal found in that meteorite:

• L. Bindi, C. Lin, C. Ma and P. J. Steinhardt, Collisions in outer space produced an icosahedral phase in the Khatyrka meteorite never observed previously in the laboratory, Scientific Reports 6 (2016), 38117.

On the dodecagonal quasicrystal found in the dune in Nebraska:

• L. Bindi, M. A. Pasek, C. Ma, J. Hu, G. Cheng, N. Yao, P. D. Asimow, and P. J. Steinhardt, Electrical discharge triggers quasicrystal formation in an eolian dune, Proceedings of the National Academy of Sciences 120 (2023), e2215484119.

On the quasicrystal found at the atomic bomb test site:

• L. Bindi, W. Kolb, G. N. Eby, P. D. Asimow, T. C. Wallace and P. J. Steinhardt, Accidental synthesis of a previously unknown quasicrystal in the first atomic bomb test, Proceedings of the National Academy of Sciences 118 22 (2021), e2101350118.

This has icosahedral symmetry, but it’s quite different than the other quasicrystals I’ve mentioned, since it’s mostly made of silicon! Its formula is Si61Cu30Ca2Fe2.


Freiman’s Constant

7 May, 2026

 

Image

I recently asked people on Mastodon “What’s the most surprising fact you’ve learned in the last couple of weeks?” It was a nice way to learn a lot of interesting things. My own biggest recent surprise was this: the number

\displaystyle{ F = \frac{2221564096 + 283748 \sqrt{462}}{491993569} }

plays a fundamental role in number theory!

For any irrational x, we define its Lagrange number to be the supremum of numbers c such that

\displaystyle{ \left| \frac{p}{q} - x \right| < \frac{1}{cq^2} }

has infinitely many solutions for rationals p/q. So, the easier x is to approximate by rational numbers, the bigger its Lagrange number is.

Quite famously, the golden ratio has the smallest possible Lagrange number, namely √5. This means it’s as hard as possible to approximate using rational numbers.

The set of all Lagrange numbers is very complicated, and very interesting. But here’s the shocking fact: if

\displaystyle{ F = \frac{2221564096 + 283748 \sqrt{462}}{491993569} }

then every real number \ge F is a Lagrange number, and F is the smallest number with this property!

F is called ‘Freiman’s constant’, because he proved this fact. His proof is 100 pages, and I don’t want to read it… not even counting the fact that it’s only available in Russian:

• G. A. Freiman, Diophantine Approximations and the Geometry of Numbers (Markov’s Problem) [in Russian]. Kalinin State University Press, Kalinin, 1975.

There’s a lot more crazy stuff known about the set of all Lagrange numbers, which is called the Lagrange spectrum. As mentioned, the smallest number in the Lagrange spectrum is \sqrt{5}. The next is \sqrt{8}. The next is \sqrt{221}/5. The next is \sqrt{1517}/13. In 1879, Markov showed that such numbers form an increasing sequence that converges to 3. They are precisely the Lagrange numbers of numbers whose continued fraction expansion and eventually consists only of 1’s and 2’s and is eventually periodic, like this:

3 + \cfrac{1}{7 + \cfrac{1}{1 + \cfrac{1}{2 + \cfrac{1}{1 + \cfrac{1}{2 + \cdots}}}}}

3 is the Lagrange number of every number whose continued fraction expansion eventually consists only of 1’s and 2’s and is not eventually periodic. Above 3 the Lagrange spectrum becomes much richer. It’s a fractal: it has infinitely many gaps, but positive Hausdorff dimension, with the dimension increasing as we move up.

Moreira showed that when we reach \sqrt{12} \approx 3.464 the Hausdorff dimension of the Lagrange spectrum hits 1. And as mentioned, Freiman showed that above

\displaystyle{ F = \frac{2221564096 + 283748 \sqrt{462}}{491993569} \approx 4.52782956616087914\dots }

the Lagrange spectrum is a half-line. Directly below Freiman’s constant, Freiman showed there is a gap of width roughly 3 \cdot 10^{-8}.

Here is the classic reference in English on this subject:

• Thomas W. Cusick and Mary E. Flahive, The Markov and Lagrange Spectra, AMS Mathematical Surveys and Monographs 30, AMS, Providence, Rhode Island, 1989.

The Markov spectrum is another set, containing the Lagrange spectrum, and their relationship is very interesting. Here’s a free online reference that reviews all the basics before doing more:

• Carlos Gustavo Moreira, Geometric properties of the Markov and Lagrange spectra, Annals of Mathematics 188, 145–170.

For a stripped-down account, go here:

• Wikipedia, Markov spectrum.


Quantum Mechanics of the Inverse Cube Force Law

1 May, 2026

Image

In the last episode of my column in Notices of the American Mathematical Society, we looked at a particle moving in an attractive central force whose strength is proportional to the inverse cube of the distance from the origin. Among other things, we saw that a particle moving in such a force can spiral in to the origin in a finite time. But that was classical mechanics. What about quantum mechanics?

Here things get more tricky. The uncertainty principle tends to prevent the particle from falling in to the origin. But when the attractive force is strong enough, the particle can still fall in. We can make up a theory where the particle shoots back out, but there are choices involved: we need to say how the particle changes phase when it shoots back out. So there is not just a single theory, but many!

Why does the particle come back out? There are theories where it does not. In these theories, at least those studied so far, time evolution is nonunitary: that is, the probability of finding the particle somewhere or other does not stay equal to 1, because the particle simply disappears when it hits the origin. Here we focus on theories where time evolution is unitary and the particle comes back out. Many people have written about these, running into ‘paradoxes’ when they weren’t careful enough. Only rather recently have things been straightened out.

Let us dig into the details. In quantum mechanics, the Hilbert space of states of a particle in \mathbb{R}^3 is L^2(\mathbb{R}^3). In a central force whose strength is proportional to 1/r^3, such a particle has a Hamiltonian of this form:

H = -\nabla^2 + c r^{-2}

The first term describes the particle’s kinetic energy, while the second describes its potential energy: remember, taking the gradient of an inverse square potential gives an inverse cube force. I have set some constants to 1 to remove irrelevant clutter, but we need the constant c to say how strong the force is. When c \, < \, 0, the force is attractive.

In this game, analysis is paramount. We should interpret H as a densely defined linear operator on L^2(\mathbb{R}^3). For this, we choose a dense linear subspace D \subset L^2(\mathbb{R}^3) and treat H as a linear map from D to L^2(\mathbb{R}^3). Different choices of D correspond to different physical assumptions: for example, assumptions about what happens when the particle falls into the origin.

To get unitary time evolution in quantum mechanics, we need the Hamiltonian to be self-adjoint. But adjoints of densely defined operators are tricky. Let us briefly recall how they work. Given a Hilbert space \mathcal{H} and a linear operator A from a dense linear subspace D(A) \subseteq \mathcal{H} to \mathcal{H}, we define D(A^*) to be the set of all \psi \in \mathcal{H} for which there exist \psi' \in \mathcal{H} such that

\langle \psi' , \phi \rangle = \langle \psi, A \phi \rangle \; \text{ for all } \; \phi \in D(A)

If such a vector \psi' exists, it is unique, and it depends linearly on \psi. Thus, for \psi \in D(A^\ast) we define A^\ast \psi to be the vector \psi' with the above property. The adjoint of A is then the linear operator A^\ast \colon D(A^\ast) \to \mathcal{H}. We say A is self-adjoint if A = A^\ast. We say that A is essentially self-adjoint if it has a unique extension to a self-adjoint operator. If it does, this extension must be A^\ast.

All this raises the question of whether the Hamiltonian H for the inverse cube force law can be made self-adjoint with a suitable choice of domain. It turns out we can always do it, but sometimes in more than one way. There are three regimes:

c \ge 3/4. In this case we can start with the domain C_0^\infty(\mathbb{R}^3 - \{0\}) consisting of smooth functions that are compactly supported on \mathbb{R}^3 minus the origin. The operator H is unambiguously defined on this domain, and it is essentially self-adjoint.

-1/4 \le c \, < \, 3/4. In this case H is still well-defined on the domain C_0^\infty(\mathbb{R}^3 - \{0\}), but it is not essentially self-adjoint. In fact, it admits more than one self-adjoint extension! However, H is bounded below: there is a constant E_0 such that

\langle \psi, H \psi \rangle \ge E_0 \langle \psi, \psi \rangle

for all \psi \in C_0^\infty(\mathbb{R}^3 - \{0\}). Physically, this means that the particle’s energy is bounded below by E_0. Mathematically, this implies that H has a canonical choice of self-adjoint extension called the ‘Friedrichs extension’, with the smallest possible domain. But there is another canonical choice, the ‘Krein extension’, with the largest possible domain.

c \, < \, -1/4. In this case H is well-defined on the domain C_0^\infty(\mathbb{R}^3 - \{0\}), and it has more than one self-adjoint extension, but it is not bounded below.

These strange results demand explanation. For example, what is special about c =-1/4? In classical mechanics, the energy of a particle in the inverse cube force ceases to be bounded below as soon as c \, <  \,0. Quantum mechanics is different. To get a lot of negative potential energy, the particle’s wavefunction must be peaked near the origin, but that gives it kinetic energy. The tradeoff is captured by Hardy’s inequality. This says that for any \psi\in C_0^\infty(\mathbb{R}^3) we have

\langle \psi, (-\nabla^2 - \tfrac{1}{4} r^{-2}) \psi \rangle \ge 0

This is why H is bounded below when c \ge -1/4.

On the other hand, the constant 1/4 in Hardy’s inequality cannot be improved, so if c \, < \, 1/4 we can find \psi with \langle \psi, H \psi \rangle \, < \, 0. Then we can use a remarkable property of the r^{-2} potential to show that H is not bounded below. Namely, H has a kind of symmetry under dilations. You can guess this by noting that both the Laplacian and r^{-2} have units of 1/length2. Indeed, if you take any smooth function \psi, dilate it by a factor of \alpha, and then apply H, you get \alpha^{-2} times what you get if you do these operations in the other order. This implies that if

\langle \psi, H \psi \rangle = E \langle \psi, \psi \rangle

we can dilate \psi and get a function obeying the same equation with E replaced by \alpha^{-2} E. Thus, as soon as E can be negative, it can be made arbitrarily large and negative by choosing \alpha to be very small. Thus H is not bounded below.

Next, what is special about c = 3/4? This is more subtle. For any value of c \in \mathbb{R} we can find spherically symmetric solutions of

( -\nabla^2 + c r^{-2})\psi = i \psi

on \mathbb{R}^3 - \{0\} that are nonzero and smooth. When c \, < \, 3/4, and only in this case, some of these solutions \psi lie in L^2(\mathbb{R}^3). This dooms the chance of H being essentially self-adjoint, because it implies H^\ast \psi = i \psi. If H were essentially self-adjoint H^\ast would be self-adjoint, and it is easy to see that a self-adjoint operator cannot have i as an eigenvalue.

When c \, < \, \frac{3}{4} the operator H has more than one self-adjoint extension from C_0^\infty(\mathbb{R}^3 - \{0\}) to some larger domain. To classify these we can use separation of variables, writing \nabla^2 as a sum of a radial part and an angular part, assuming the angular dependence of \psi is given by a spherical harmonic Y_{\ell m}, and doing a change of variables u = \psi/r to reduce H to the ordinary differential operator

\displaystyle{ - \frac{d^2}{d r^2} + \left(c + \ell(\ell+1)\right) \frac{1}{r^2} }

on the half-line (0,\infty). We can completely classify self-adjoint extensions of this differential operators from C_0^\infty(0,\infty) to larger domains; the answer depends on c and \ell. A choice of self-adjoint extension is a choice of boundary conditions at r = 0, and this says how the phase of an incoming wave changes as it reflects off the origin and bounces back. Finally, we can assemble the results for different spherical harmonics to classify self-adjoint extensions of H.

There exist many self-adjoint extensions of H that respect the rotational symmetry of the inverse cube force law, but for c \, < \, -1/4 the extension must break the dilation symmetry discussed above. This is what physicists call an ‘anomaly’: a symmetry of a classical system that fails to be a symmetry of the corresponding quantum system. But intriguingly, for some still lower values of c one can choose a Hamiltonian that is symmetrical under a discrete subgroup of dilations! Determining precisely which values these are seems to be an open problem.

To explore the quantum mechanics of the inverse cube force law more thoroughly, I recommend first this:

• S. Gopalakrishnan, Self-Adjointness and the Renormalization of Singular Potentials, B.A. Thesis, Amherst College, 2006.

then this:

• D. M. Gitman, I. V. Tyutin and B. L. Voronov, Self-adjoint extensions and spectral analysis in Calogero problem.

and finally this:

• J. Dereziński and S. Richard, On Schrödinger operators with inverse square potentials on the half-line, Ann. Henri Poincaré 18 (2017), 869–928.

The first is an excellent overview of problems associated to singular potentials, including the inverse cube force. The second delves into self-adjoint extensions of the ordinary differential operators mentioned above, and the third works them out with exquisite thoroughness.