An important goal of the enterprise of physics is to describe, not just one physical system at a time, but also how a large complicated system can be built out of smaller simpler ones. The simplest case is a so-called `joint system': a system built out of two separate parts. Our experience with the everyday world leads us to believe that to specify the state of a joint system, it is necessary and sufficient to specify states of its two parts. (Here and in what follows, by `states' we always mean what physicists call `pure states'.) In other words, a state of the joint system is just an ordered pair of states of its parts. So, if the first part has as its set of states, and the second part has as its set of states, the joint system has the cartesian product as its set of states.
One of the more shocking discoveries of the twentieth century is that this is wrong. In both classical and quantum physics, given states of each part we get a state of the joint system. But only in classical physics is every state of the joint system of this form! In quantum physics are also `entangled' states, which can only be described as superpositions of states of this form. The reason is that in quantum theory, the states of a system are no longer described by a set, but by a Hilbert space. Moreover -- and this is really an extra assumption -- the states of a joint system are described not by the cartesian product of Hilbert spaces, but by their tensor product.
Quite generally, we can imagine using objects in any category to describe physical systems, and morphisms between these to describe processes. In order to handle joint systems, this category will need to have some sort of `tensor product' that gives an object for any pair of objects and . As we shall explain, categories of this sort are called `monoidal'. The category is a example where the tensor product is just the usual cartesian product of sets. Similarly, the category is a monoidal category where the tensor product is the usual tensor product of Hilbert spaces. However, these two examples are very different, because the product in is `cartesian' in a certain technical sense, while the product in is not. This turns out to explain a lot about why joint systems behave so counterintuively in quantum physics. Moreover, it is yet another way in which resembles more than .
To see this in detail, it pays to go back to the beginning and think about cartesian products. Given two sets and , we define to be the set of all ordered pairs with and . But what is an ordered pair? This depends on our approach to set theory. We can use axioms in which ordered pairs are a primitive construction, or we can define them in terms of other concepts. For example, in 1914, Wiener defined the ordered pair to be the set . In 1922, Kuratowski gave the simpler definition . We can use the still simpler definition if our axioms exclude the possibility of sets that contain themselves. Various other definitions have also been tried [17]. In traditional set theory we arbitrarily choose one approach to ordered pairs and then stick with it. Apart from issues of convenience or elegance, it does not matter which we choose, so long as it `gets the job done'. In other words, all these approaches are all just technical tricks for implementing our goal, which is to make sure that if and only if and .
It is a bit annoying that the definition of ordered pair cannot get straight to the point and capture the concept without recourse to an arbitrary trick. It is natural to seek an approach that focuses more on the structural role of ordered pairs in mathematics and less on their implementation. This is what category theory provides.
The reason traditional set theory arbitarily chooses a specific implementation of the ordered pair concept is that it seems difficult to speak precisely about ``some thing -- I don't care what it is -- with the property that iff and ''. So, the first move in category theory is to stop focussing on ordered pairs and instead focus on cartesian products of sets. What properties should the cartesian product have? To make our answer applicable not just to sets but to objects of other categories, it should not refer to elements of . So, the second move in category theory is to describe the cartesian product in terms of functions to and from this set.
The cartesian product has
functions called `projections' to the sets and :
Thus, given two sets and , we define their cartesian product to be any set equipped with
functions
,
such that for any set and functions
,
, there exists a unique function
with
All this generalizes painlessly to an arbitrary category.
Given two objects and in some category,
we define their cartesian product (or simply product)
to be any object equipped with morphisms
We say a category has binary products if every pair of objects has a a product. One can also talk about -ary products for other values of , but a category with binary products has -ary products for all , since we can construct these as iterated binary products. The case is trivial, since the product of one object is just that object itself (up to canonical isomorphism). The only remaining case is . This is surprisingly important. A -ary product is usually called a terminal object and denoted : it is an object such that that for any object there exists a unique morphism from to . Terminal objects are unique up to canonical isomorphism, so we feel free to speak of `the' terminal object in a category when one exists. The reason we denote the terminal object by is that in , any set with one element is a terminal object. If a category has a terminal object and binary products, it has -ary products for all , so we say it has finite products.
It turns out that these concepts capture much of our intuition about joint systems in classical physics. In the most stripped-down version of classical physics, the states of a system are described as elements of a mere set. In more elaborate versions, the states of a system form an object in some fancier category, such as the category of topological spaces or manifolds. But, just like , these fancier categories have finite products -- and we use this fact when describing the states of a joint system.
To sketch how this works in general, suppose we have any category with finite products. To do physics with this, we think of any of the objects of this category as describing some physical system. It sounds a bit vague to say that a physical system is `described by' some object , but we can make this more precise by saying that states of this system are morphisms . When our category is , a morphism of this sort simply picks out an element of the set . In the category of topological spaces, a morphism of this sort picks out a point in the topological space -- and similarly for the category of manifolds, and so on. For this reason, category theorists call a morphism an element of the object .
Next, we think of any morphism as a `process' carrying states of the system described by to states of the system described by . This works as follows: given a state of the first system, say , we can compose it with to get a state of the second system, .
Then, given two systems that are described by the objects and , respectively, we decree that the joint system built from these is described by the object . The projection can be thought of as a process that takes a state of the joint system and discards all information about the second part, retaining only the state of the first part. Similarly, the projection retains only information about the second part.
Calling these projections `processes' may strike the reader as strange, since `discarding information' sounds like a subjective change of our description of the system, rather than an objective physical process like time evolution. However, it is worth noting that in special relativity, time evolution corresponds to a change of coordinates , which can also be thought of as change of our description of the system. The novelty in thinking of a projection as a physical process really comes, not from the fact that it is `subjective', but from the fact that it is not invertible.
With this groundwork laid, we can use the definition of `product' to show that a state of a joint system is just an ordered pair of states of each part. First suppose we have states of each part, say and . Then there is a unique state of the joint system, say , which reduces to the given state of each part when we discard information about the other part: and . Conversely, every state of the joint system arises this way, since given we can recover and using these equations.
However, the situation changes drastically when we switch to quantum theory! The states of a quantum system can still be thought of as forming a set. However, we do not take the product of these sets to be the set of states for a joint quantum system. Instead, we describe states of a system as unit vectors in a Hilbert space, modulo phase. We define the Hilbert space for a joint system to be the tensor product of the Hilbert spaces for its parts.
The tensor product of Hilbert spaces is not a cartesian product in the sense defined above, since given Hilbert spaces and there are no linear operators and with the required properties. This means that from a (pure) state of a joint quantum system we cannot extract (pure) states of its parts. This is the key to Bell's `failure of local realism'. Indeed, under quite general conditions one can derive Bell's inequality from the assumption that pure states of a joint system determine pure states of its parts [3,8], so violations of Bell's inequality should be seen as an indication that this assumption fails.
The Wooters-Zurek argument that `one cannot clone a quantum state'
[32] is also based on the fact that the tensor product of Hilbert
spaces is not cartesian. To get some sense of this, note that
whenever is an object in some category for which the product
exists, there is a unique morphism
In , however, since the tensor product is not a product in the category-theoretic sense, it makes no sense to speak of a diagonal morphism . In fact, a stronger statement is true: there is no natural (i.e. basis-independent) way to choose a linear operator from to other than the zero operator. So, there is no way to duplicate information in quantum theory.
Since the tensor product is not a cartesian product in the sense explained above, what exactly is it? To answer this, we need the definition of a `monoidal category'. Monoidal categories were introduced by Mac Lane [23] in early 1960s, precisely in order to capture those features common to all categories equipped with a well-behaved but not necessarily cartesian product. Since the definition is a bit long, let us first present it and then discuss it:
Definition. A monoidal category consists of:
This obviously requires some explanation! First, it makes use of some
notions we have not explained yet, ruining our otherwise admirably
self-contained treatment of category theory. For example, what is
in clause (ii) of the definition? This is just the
category whose objects are pairs of objects in , and whose
morphisms are pairs of morphisms in , with composition of
morphisms done componentwise. So, when we say that the tensor product
is a functor
, this implies
that for any pair of objects
there is an object
, while for any pair of morphisms
in there is a morphism
in . Morphisms are just as important
as objects! For example, in , not only can we take the tensor
product of Hilbert spaces, but also we can take the tensor product of
bounded linear operators
and
,
obtaining a bounded linear operator
Similarly, in the tensor product is given by disjoint union, both for objects and for morphisms. In Figure 8 we show two spacetimes and and their tensor product . This as a way of letting two spacetimes `run in parallel', like independently evolving separate universes. The resemblance to the tensor product of morphisms in should be clear. Just as in , the tensor product in is not a cartesian product: there are no projections with the required properties. There is also no natural choice of a cobordism from to . This means that the very nature of topology prevents us from finding spacetimes that `discard' part of space, or `duplicate' space. Seen in this light, the fact that we cannot discard or duplicate information in quantum theory is not a flaw or peculiarity of this theory. It is a further reflection of the deep structural analogy between quantum theory and the conception of spacetime embodied in general relativity.
Turning to clause (iii) in the definition, we see that a monoidal category needs to have a `unit object' . This serves as the multiplicative identity for the tensor product, at least up to isomorphism: as we shall see in the next clause, and for every object . In the unit object is regarded as a Hilbert space, while in it is the empty set regarded as an -dimensional manifold. Any category with finite products gives a monoidal category in which the unit object is the terminal object .
This raises an interesting point of comparison. In classical physics we describe systems using objects in a category with finite products, and a state of the system corresponding to the object is just a morphism . In quantum physics we describe systems using Hilbert spaces. Is a state of the system corresponding to the Hilbert space the same as a bounded linear operator ? Almost, but not quite! As we saw in Section 3, such operators are in one-to-one correspondence with vectors in : any vector corresponds to an operator with . States, on the other hand, are the same as unit vectors modulo phase. Any nonzero vector in gives a state after we normalize it, but different vectors can give the same state, and the zero vector does not give a state at all. So, quantum physics is really different from classical physics in this way: we cannot define states as morphisms from the unit object. Nonetheless, we have seen that the morphisms play a fundamental role in quantum theory: they are just Dirac's `kets'.
Next, let us ponder clause (iv) of the definition of monoidal
category. Here we see that the tensor product is associative,
but only up to a specified isomorphism, called the `associator'.
For example, in we do not have
,
but there is an obvious isomorphism
Finally, what about clauses (v) and (vi) in the definition of monoidal category? These are so-called `coherence laws', which let us manipulate isomorphisms with the same ease as if they were equations. Repeated use of the associator lets us construct an isomorphism from any parenthesization of a tensor product of objects to any other parenthesization -- for example, from to . However, we can actually construct many such isomorphisms -- and in this example, the pentagonal diagram in clause (v) shows two. We would like to be sure that all such isomorphisms from one parenthesization to another are equal. In his fundamental paper on monoidal categories, Mac Lane [23] showed that the commuting pentagon in clause (v) guarantees this, not just for a tensor product of four objects, but for arbitrarily many. He also showed that clause (vi) gives a similar guarantee for isomorphisms constructed using the left and right unit laws.
© 2004 John Baez
baez@math.removethis.ucr.andthis.edu