Because like charges repel, it is remarkable that the atomic nucleus
stays together. After all, the protons are all positively charged and
are repelled from each other electrically. To hold these particles so
closely together, physicists hypothesized a new force, the
strong force, strong enough to overcome the electric
repulsion of the protons. It must be strongest only at short
distances (about m), and then it must fall off rapidly, for
protons are repelled electrically unless their separation is that
small. Neutrons must also experience it, because they are bound to
the nucleus as well.
Physicists spent several decades trying to understand the strong force; it was one of the principal problems in physics in the mid-twentieth century. About 1932, Werner Heisenberg, pioneer in quantum mechanics, discovered one of the first clues to its nature. He proposed, in [15], that the proton and neutron might really be two states of the same particle, now called the nucleon. In modern terms, he attempted to unify the proton and neutron.
To understand how, we need to know a little quantum mechanics.
In quantum mechanics, the state of any physical system is given by a
unit vector in a complex Hilbert space, and it is possible to take
complex linear combinations of the system in different states. For
example, the state for a quantum system, like a particle on a line,
is a complex-valued function
We have special rules for combining quantum systems. If, say, we have two
particles in a box, particle 1 and particle 2, then the state is a
function of both particle 1's position and particle 2's:
Back to nucleons. According to Heisenberg's theory, a nucleon is a
proton or a neutron. If we use the simplest nontrivial Hilbert
space for both the proton and neutron, namely , then the Hilbert
space for the nucleon should be
The inner product in
then allows us to compute probabilities, using the
following rule coming from quantum mechanics: the probability that a system in
state
, a given Hilbert space, will be observed in state
is
In order for this to be interesting, however, there must be
processes that can turn protons and neutrons into different states of
the nucleon. Otherwise, there would be no point in having the full
space of states. Conversely, if there are processes which can
change protons into neutrons and back, it turns out we need all of
to describe them.
Heisenberg believed in such processes, because of an analogy between
nuclear physics and atomic physics. The analogy turned out to be poor,
based on the faulty notion that the neutron was composed of a proton
and an electron, but the idea of the nucleon with states in
proved to be a breakthrough.
The reason is that in 1936 a paper by Cassen and
Condon [7] appeared suggesting that the
nucleon's Hilbert space
is acted on by the symmetry group
. They emphasized the analogy between this and the spin of
the electron, which is also described by vectors in
, acted on
by the double cover of the 3d rotation group, which is also
.
In keeping with this analogy, they invented a concept called
`isospin'. The proton was declared the isospin up
state or
state, and the neutron was declared the
isospin down or
state. Cassen and Condon's
paper put isospin on its way to becoming a useful tool in nuclear
physics.
Isospin proved useful because it formalized the following idea, which
emerged from empirical data around the time of Cassen and Condon's
paper. Namely: the strong force, unlike the electromagnetic force, is
the same whether the particles involved are protons or neutrons.
Protons and neutrons are interchangeable, as long as we neglect the
small difference in their mass, and most importantly, as long as we
neglect electromagnetic effects. One can phrase this idea in terms of
group representation theory as follows: the strong force is
invariant under the action of .
Though this idea was later seen to be an oversimplification,
it foreshadowed modern ideas about unification. The proton, living in the
representation of the trivial group, and the neutron, living in a
different representation
of the trivial group, were unified into the
nucleon, with representation
of
. These symmetries hold
for the strong force, but not for electromagnetism: we say this force
`breaks'
symmetry.
But what does it mean, exactly, to say that a force is invariant under the
action of some group? It means that when we are studying particles
interacting via this force, the Hilbert space of each particle
should be equipped with a unitary representation of this group.
Moreover, any physical process caused by this force should be described
by an `intertwining operator': that is, a linear operator that
respects the action of this group. A bit more precisely, suppose
and
are finite-dimensional Hilbert spaces on which some group
acts as unitary operators. Then a linear operator
is
an intertwining operator if
Quite generally, symmetries give rise to conserved quantities.
In quantum mechanics this works as follows. Suppose that is
a Lie group with a unitary representation on the finite-dimensional
Hilbert space
and
. Then
and
automatically become
representations of
, the Lie algebra of
, and any intertwining operator
respects the action of
. In other words,
The element
will act as a skew-adjoint operator on any
unitary representation of
. Physicists prefer to work with
self-adjoint operators since these have real eigenvalues.
In quantum mechanics, self-adjoint operators are called `observables'.
We can get an observable by dividing
by
.
In Casson and Condon's isospin theory of the strong interaction,
the symmetry group is
. The Lie algebra
has
a basis consisting of three elements, and the quantity
arises
as above: it is just the eigenvalue of one of these elements,
divided by
to get a real number. Because any physical process
caused by the strong force is described by an intertwining operator,
is conserved. In other words, the total
of any system
remains unchanged after a process that involves only strong interactions.
Nevertheless, for the states in
which mix protons and neutrons to have
any meaning, there must be a mechanism which can convert protons into neutrons
and vice versa. Mathematically, we have a way to do this: the action of
. What does this correspond to, physically?
The answer originates in the work of Hideki Yukawa. In the early 1930s, he predicted the existence of a particle that mediates the strong force, much as the photon mediates the electromagnetic force. From known properties of the strong force, he was able to predict that this particle should be about 200 times as massive as the electron, or about a tenth the mass of a proton. He predicted that experimentalists would find a particle with a mass in this range, and that it would interact strongly when it collided with nuclei.
Partially because of the intervention of World War II, it took over
ten years for Yukawa's prediction to be vindicated. After a
famous false alarm (see Section 2.5),
it became clear by 1947 that a particle with the
expected properties had been found. It was called the pion and
it came in three varieties: one with positive charge, the , one
neutral, the
, and one with negative charge, the
.
The pion proved to be the mechanism that can transform nucleons. To wit, we observe processes like those in Figure 1, where we have drawn the Feynman diagrams which depict the nucleons absorbing pions, transforming where they are allowed to by charge conservation.
Because of isospin conservation, we can measure the of a pion by
looking at these interactions with the nucleons. It turns out that the
of a pion is the same as its charge:
Pion | ![]() |
![]() |
![]() |
![]() |
0 |
![]() |
![]() |
Nucleon | ![]() |
Charge |
![]() |
![]() |
1 |
![]() |
![]() |
0 |
Mathematically, being constant on `families' just means it is
constant on representations of the isospin symmetry group,
. The three pions, like the proton and neutron, are nearly
identical in terms of mass and their strong interactions. In
Heisenberg's theory, the different pions are just different isospin
states of the same particle. Since there are three, they have to span
a three-dimensional representation of
.
Up to isomorphism, there is only one three-dimensional complex irrep
of , which is
, the symmetric tensors of rank 2.
In general, the unique
-dimensional irrep of
is given
by
. Physicists call this the
spin-
representation of
, or in the present context,
the `isospin-
representation'. This representation
has a basis of vectors where
ranges from
to
in integer steps. Nucleons lie in the isospin-
representation,
while pions lie in the isospin-
representation.
This sets up an interesting puzzle. We know two ways to transform
nucleons: the mathematical action of , and their physical
interactions with pions. How are these related?
The answer lies in the representation theory. Just as the two nucleons
span the two-dimensional irrep of
of
, the pions
span the three-dimensional irrep
of
. But there
is another way to write this representation which sheds light on the
pions and the way they interact with nucleons: because
is
itself a three-dimensional real manifold, its Lie algebra
is a three-dimensional real
vector space.
acts on itself by conjugation, which fixes the
identity and thus induces linear transformations of
, giving a
representation of
on
called the adjoint
representation.
For simple Lie groups like , the adjoint representation is
irreducible. Thus
is a three-dimensional real irrep
of
. This is different from the three-dimensional complex
irrep
, but very related. Indeed,
is just the
complexification of
:
The pions thus live in
, a complex Lie algebra, and
this acts on
because
does. To be precise, Lie group
representations induce Lie algebra representations, so the real Lie algebra
has a representation on
. This then extends
to a representation of the complex Lie algebra
.
And this representation is even familiar--it is the fundamental
representation of
on
.
Quite generally, whenever is the Lie algebra of a
Lie group
, and
is a representation of
on some finite-dimensional vector space
, we get a
a representation of the Lie algebra
on
, which
we can think of as a linear map
Pions act on nucleons via precisely such an intertwining operator:
Physicists have invented a nice way to depict such intertwining operators--Feynman diagrams:
Here we see a nucleon coming in, absorbing a pion, and leaving. That is, this diagram depicts a basic interaction between pions and nucleons.
Feynman diagrams are calculational tools in physics, though to actually
use them as such, we need quantum field theory. Then, instead of just
standing for intertwining operators between representations of a compact
groups like , they depict intertwining operators between
representations of the product of this group and the Poincaré group,
which describes the symmetries of spacetime. Unfortunately, the
details are beyond the scope of this paper. By ignoring the
Poincaré group, we are, in the language of physics, restricting
our attention to `internal degrees of freedom', and their `internal'
(i.e., gauge) symmetries.
Nonetheless, we can put basic interactions like the one in Figure 2 together to form more complicated ones, like this:
Here, two nucleons interact by exchanging pions. This is the mechanism
for the strong force proposed by Yukawa, still considered
approximately right today. Better, though, it depicts all the
representation-theoretic ingredients of a modern gauge theory in
physics. That is, it shows two nucleons, which live in a
representation
of the gauge group
, interacting by the
exchange of a pion, which lives in the complexified adjoint rep,
. In the coming sections we will see how these
ideas underlie the Standard Model.
2010-01-11