An important goal of the enterprise of physics is to describe, not
just one physical system at a time, but also how a large complicated
system can be built out of smaller simpler ones. The simplest case is
a so-called `joint system': a system built out of two separate parts.
Our experience with the everyday world leads us to believe that to
specify the state of a joint system, it is necessary and sufficient to
specify states of its two parts. (Here and in what follows, by
`states' we always mean what physicists call `pure states'.) In other
words, a state of the joint system is just an ordered pair of states
of its parts. So, if the first part has as its set of states, and
the second part has
as its set of states, the joint system has the
cartesian product
as its set of states.
One of the more shocking discoveries of the twentieth century is that this is wrong. In both classical and quantum physics, given states of each part we get a state of the joint system. But only in classical physics is every state of the joint system of this form! In quantum physics are also `entangled' states, which can only be described as superpositions of states of this form. The reason is that in quantum theory, the states of a system are no longer described by a set, but by a Hilbert space. Moreover -- and this is really an extra assumption -- the states of a joint system are described not by the cartesian product of Hilbert spaces, but by their tensor product.
Quite generally, we can imagine using objects in any category to
describe physical systems, and morphisms between these to describe
processes. In order to handle joint systems, this category will need
to have some sort of `tensor product' that gives an object for any pair of objects
and
. As we shall explain,
categories of this sort are called `monoidal'. The category
is
a example where the tensor product is just the usual cartesian product
of sets. Similarly, the category
is a monoidal category where
the tensor product is the usual tensor product of Hilbert spaces.
However, these two examples are very different, because the product in
is `cartesian' in a certain technical sense, while the product
in
is not. This turns out to explain a lot about why joint
systems behave so counterintuively in quantum physics. Moreover, it
is yet another way in which
resembles
more than
.
To see this in detail, it pays to go back to the beginning and think
about cartesian products. Given two sets and
, we define
to be the set of all ordered pairs
with
and
. But what is an ordered pair? This depends on our
approach to set theory. We can use axioms in which ordered pairs are
a primitive construction, or we can define them in terms of other
concepts. For example, in 1914, Wiener defined the ordered pair
to be the set
. In
1922, Kuratowski gave the simpler definition
. We can use the still simpler definition
if our axioms exclude the possibility of sets
that contain themselves. Various other definitions have also been
tried [17]. In traditional set theory we arbitrarily
choose one approach to ordered pairs and then stick with it. Apart
from issues of convenience or elegance, it does not matter which we
choose, so long as it `gets the job done'. In other words, all these
approaches are all just technical tricks for implementing our goal, which
is to make sure that
if and only if
and
.
It is a bit annoying that the definition of ordered pair cannot get straight to the point and capture the concept without recourse to an arbitrary trick. It is natural to seek an approach that focuses more on the structural role of ordered pairs in mathematics and less on their implementation. This is what category theory provides.
The reason traditional set theory arbitarily chooses a specific
implementation of the ordered pair concept is that it seems difficult
to speak precisely about ``some thing -- I don't care what it
is -- with the property that
iff
and
''. So, the first move in category theory is to stop focussing on
ordered pairs and instead focus on cartesian products of sets. What
properties should the cartesian product
have? To make
our answer applicable not just to sets but to objects of other
categories, it should not refer to elements of
. So, the
second move in category theory is to describe the cartesian product
in terms of functions to and from this set.
The cartesian product has
functions called `projections' to the sets
and
:
Thus, given two sets and
, we define their cartesian product to be any set
equipped with
functions
,
such that for any set
and functions
,
, there exists a unique function
with
All this generalizes painlessly to an arbitrary category.
Given two objects and
in some category,
we define their cartesian product (or simply product)
to be any object
equipped with morphisms
We say a category has binary products if every pair of objects
has a a product. One can also talk about -ary products for other
values of
, but a category with binary products has
-ary
products for all
, since we can construct these as iterated
binary products. The case
is trivial, since the product of
one object is just that object itself (up to canonical isomorphism).
The only remaining case is
. This is surprisingly important.
A
-ary product is usually called a terminal object and
denoted
: it is an object such that that for any object
there
exists a unique morphism from
to
. Terminal objects are unique
up to canonical isomorphism, so we feel free to speak of `the'
terminal object in a category when one exists. The reason we denote
the terminal object by
is that in
, any set with one element
is a terminal object. If a category has a terminal object and binary
products, it has
-ary products for all
, so we say it has finite products.
It turns out that these concepts capture much of our intuition about
joint systems in classical physics. In the most stripped-down version
of classical physics, the states of a system are described as elements
of a mere set. In more elaborate versions, the states of a
system form an object in some fancier category, such as the category
of topological spaces or manifolds. But, just like
, these fancier categories have finite products -- and we use
this fact when describing the states of a joint system.
To sketch how this works in general, suppose we have any category with
finite products. To do physics with this, we think of any of the
objects of this category as describing some physical system. It
sounds a bit vague to say that a physical system is `described by'
some object , but we can make this more precise by saying that
states of this system are morphisms
. When our
category is
, a morphism of this sort simply picks out an
element of the set
. In the category of topological spaces, a
morphism of this sort picks out a point in the topological space
-- and similarly for the category of manifolds, and so on. For this
reason, category theorists call a morphism
an element of the object
.
Next, we think of any morphism
as a `process' carrying
states of the system described by
to states of the system described
by
. This works as follows: given a state of the first system,
say
, we can compose it with
to get a state of
the second system,
.
Then, given two systems that are described by the objects and
,
respectively, we decree that the joint system built from these is
described by the object
. The projection
can be thought of as a process that takes a state of
the joint system and discards all information about the second part,
retaining only the state of the first part. Similarly, the projection
retains only information about the second part.
Calling these projections `processes' may strike the reader as
strange, since `discarding information' sounds like a subjective
change of our description of the system, rather than an
objective physical process like time evolution. However, it is worth
noting that in special relativity, time evolution corresponds to a
change of coordinates
, which can also be thought of
as change of our description of the system. The novelty in thinking
of a projection as a physical process really comes, not from the fact
that it is `subjective', but from the fact that it is not invertible.
With this groundwork laid, we can use the definition of `product' to
show that a state of a joint system is just an ordered pair of states
of each part. First suppose we have states of each part, say
and
. Then there is a unique state
of the joint system, say
, which reduces to
the given state of each part when we discard information about the
other part:
and
. Conversely, every state
of the joint system arises this way, since given
we can recover
and
using these equations.
However, the situation changes drastically when we switch to quantum theory! The states of a quantum system can still be thought of as forming a set. However, we do not take the product of these sets to be the set of states for a joint quantum system. Instead, we describe states of a system as unit vectors in a Hilbert space, modulo phase. We define the Hilbert space for a joint system to be the tensor product of the Hilbert spaces for its parts.
The tensor product of Hilbert spaces is not a cartesian product
in the sense defined above, since given Hilbert spaces
and
there are no linear operators
and
with the required properties. This means that from a
(pure) state of a joint quantum system we cannot extract
(pure) states of its parts. This is the key to Bell's `failure
of local realism'. Indeed, under quite general conditions
one can derive Bell's inequality from the assumption that pure
states of a joint system determine pure states
of its parts [3,8], so violations of Bell's inequality should
be seen as an indication that this assumption fails.
The Wooters-Zurek argument that `one cannot clone a quantum state'
[32] is also based on the fact that the tensor product of Hilbert
spaces is not cartesian. To get some sense of this, note that
whenever is an object in some category for which the product
exists, there is a unique morphism
In , however, since the tensor product is not a product in the
category-theoretic sense, it makes no sense to speak of a diagonal
morphism
. In fact, a stronger
statement is true: there is no natural (i.e. basis-independent) way
to choose a linear operator from
to
other than the
zero operator. So, there is no way to duplicate information in
quantum theory.
Since the tensor product is not a cartesian product in the sense explained above, what exactly is it? To answer this, we need the definition of a `monoidal category'. Monoidal categories were introduced by Mac Lane [23] in early 1960s, precisely in order to capture those features common to all categories equipped with a well-behaved but not necessarily cartesian product. Since the definition is a bit long, let us first present it and then discuss it:
Definition. A monoidal category consists of:
This obviously requires some explanation! First, it makes use of some
notions we have not explained yet, ruining our otherwise admirably
self-contained treatment of category theory. For example, what is
in clause (ii) of the definition? This is just the
category whose objects are pairs of objects in
, and whose
morphisms are pairs of morphisms in
, with composition of
morphisms done componentwise. So, when we say that the tensor product
is a functor
, this implies
that for any pair of objects
there is an object
, while for any pair of morphisms
in
there is a morphism
in
. Morphisms are just as important
as objects! For example, in
, not only can we take the tensor
product of Hilbert spaces, but also we can take the tensor product of
bounded linear operators
and
,
obtaining a bounded linear operator
Similarly, in the tensor product is given by disjoint union,
both for objects and for morphisms. In Figure 8
we show
two spacetimes
and
and their tensor product
.
This as a way of letting two spacetimes `run in parallel', like
independently evolving separate universes. The resemblance to the
tensor product of morphisms in
should be clear.
Just as in
, the tensor product in
is not a cartesian
product: there are no projections with the required properties. There
is also no natural choice of a cobordism from
to
.
This means that the very nature of topology prevents us from finding
spacetimes that `discard' part of space, or `duplicate' space. Seen
in this light, the fact that we cannot discard or duplicate
information in quantum theory is not a flaw or peculiarity of this
theory. It is a further reflection of the deep structural
analogy between quantum theory and the conception of spacetime
embodied in general relativity.
Turning to clause (iii) in the definition, we see that a monoidal
category needs to have a `unit object' . This serves as the
multiplicative identity for the tensor product, at least up to
isomorphism: as we shall see in the next clause,
and
for every object
. In
the
unit object is
regarded as a Hilbert space, while in
it
is the empty set regarded as an
-dimensional manifold. Any
category with finite products gives a monoidal category in which the
unit object is the terminal object
.
This raises an interesting point of comparison. In classical physics
we describe systems using objects in a category with finite products,
and a state of the system corresponding to the object is just a
morphism
. In quantum physics we describe systems
using Hilbert spaces. Is a state of the system corresponding to the
Hilbert space
the same as a bounded linear operator
? Almost, but not quite! As we saw in Section 3,
such operators are in one-to-one correspondence with vectors in
:
any vector
corresponds to an operator
with
. States, on the other hand, are the same as
unit vectors modulo phase. Any nonzero vector in
gives a state
after we normalize it, but different vectors can give the same state,
and the zero vector does not give a state at all. So, quantum physics
is really different from classical physics in this way: we cannot
define states as morphisms from the unit object. Nonetheless, we
have seen that the morphisms
play a fundamental
role in quantum theory: they are just Dirac's `kets'.
Next, let us ponder clause (iv) of the definition of monoidal
category. Here we see that the tensor product is associative,
but only up to a specified isomorphism, called the `associator'.
For example, in we do not have
,
but there is an obvious isomorphism
Finally, what about clauses (v) and (vi) in the definition of monoidal
category? These are so-called `coherence laws', which let us
manipulate isomorphisms with the same ease as if they were equations.
Repeated use of the associator lets us construct an isomorphism from
any parenthesization of a tensor product of objects to any other
parenthesization -- for example, from
to
. However, we can
actually construct many such isomorphisms -- and in this
example, the pentagonal diagram in clause (v) shows two. We would
like to be sure that all such isomorphisms from one parenthesization
to another are equal. In his fundamental paper on monoidal categories,
Mac Lane [23] showed that the commuting pentagon in clause
(v) guarantees this, not just for a tensor product of four objects,
but for arbitrarily many. He also showed that clause (vi) gives a
similar guarantee for isomorphisms constructed using the left and
right unit laws.
© 2004 John Baez
baez@math.removethis.ucr.andthis.edu