Die Seide is noch nich ibersetzt worn. Se guggen de englsche Originalversion.
The lesson begins with two equivalent mathematical descriptions of measurements:
General measurements can be described by collections of matrices, one for each measurement outcome, in a way that generalizes the description of projective measurements.
General measurements can be described as channels whose outputs are always classical states (represented by diagonal density matrices).
We'll restrict our attention to measurements having finitely many possible outcomes.
Although it is possible to define measurements with infinitely many possible outcomes, they're much less typically encountered in the context of computation and information processing, and they also require some additional mathematics (namely measure theory) to be properly formalized.
Our initial focus will be on so-called destructive measurements, where the output of the measurement is a classical measurement outcome alone — with no specification of the post-measurement quantum state of whatever system was measured.
Intuitively speaking, we can imagine that such a measurement destroys the quantum system itself, or that the system is immediately discarded once the measurement is made.
Later in the lesson we'll broaden our view and consider non-destructive measurements, where there's both a classical measurement outcome and a post-measurement quantum state of the measured system.
Suppose X is a system that is to be measured, and assume for simplicity that the classical state set of X is {0,…,n−1} for some positive integer n, so that density matrices representing quantum states of X are n×n matrices.
We won't actually have much need to refer to the classical states of X, but it will be convenient to refer to n, the number of classical states of X.
We'll also assume that the possible outcomes of the measurement are the integers 0,…,m−1 for some positive integer m.
Note that we're just using these names to keep things simple;
it's straightforward to generalize everything that follows to other finite sets of classical states and measurement outcomes, renaming them as desired.
Recall that a projective measurement is described by a collection of projection matrices that sum to the identity matrix.
In symbols,
{Π0,…,Πm−1}
describes a projective measurement of X if each Πa is an n×n projection matrix and the following condition is met.
Π0+⋯+Πm−1=IX
When such a measurement is performed on a system X while it's in a state described by some quantum state vector ∣ψ⟩, each outcome a is obtained with probability equal to ∥Πa∣ψ⟩∥2.
We also have that the post-measurement state of X is obtained by normalizing the vector Πa∣ψ⟩, but we're ignoring the post-measurement state for now.
If the state of X is described by a density matrix ρ rather than a quantum state vector ∣ψ⟩, then we can alternatively express the probability to obtain the outcome a as Tr(Πaρ).
If ρ=∣ψ⟩⟨ψ∣ is a pure state, then the two expressions are equal:
Here we're using the cyclic property of the trace for the second equality, and for the third equality we're using the fact that each Πa is a projection matrix, and therefore satisfies Πa2=Πa.
In general, if ρ is a convex combination
ρ=k=0∑N−1pk∣ψk⟩⟨ψk∣
of pure states, then the expression Tr(Πaρ) coincides with the average probability for the outcome a, owing to the fact that this expression is linear in ρ.
A mathematical description for general measurements is obtained by relaxing the definition of projective measurements.
Specifically, we allow the matrices in the collection describing the measurement to be arbitrary positive semidefinite matrices rather than projections.
(Projections are always positive semidefinite; they can alternatively be defined as positive semidefinite matrices whose eigenvalues are all either 0 or 1.)
In particular, a general measurement of a system X having outcomes 0,…,m−1 is specified by a collection of positive semidefinite matrices {P0,…,Pm−1} whose rows and columns correspond to the classical states of X and that meet the condition
P0+⋯+Pm−1=IX.
If the system X is measured while it is in a state described by the density matrix ρ, then each outcome
a∈{0,…,m−1} appears with probability Tr(Paρ).
As we must naturally demand, the vector of outcome probabilities
(Tr(P0ρ),…,Tr(Pm−1ρ))
of a general measurement always forms a probability vector, for any choice of a density matrix ρ.
The following two observations establish that this is the case.
Each value Tr(Paρ) must be nonnegative, owing to the fact that the trace of the product of any two positive semidefinite matrices is always nonnegative:
Q,R≥0⇒Tr(QR)≥0.
One way to argue this fact is to use spectral decompositions of Q and R together with the cyclic property of the trace to express the trace of the product QR as a sum of nonnegative real numbers, which must therefore be nonnegative.
The condition P0+⋯+Pm−1=IX together with the linearity of the trace ensures that the probabilities sum to 1.
Suppose X is a qubit, and define two matrices as follows.
P0=(32313131)P1=(31−31−3132)
These are both positive semidefinite matrices: they're Hermitian, and in both cases the eigenvalues happen to be 1/2±5/6, which are both positive.
We also have that P0+P1=I, and therefore {P0,P1} describes a measurement.
If the state of X is described by a density matrix ρ and we perform this measurement, then the probability of obtaining the outcome 0 is Tr(P0ρ) and the probability of obtaining the outcome 1 is
Tr(P1ρ).
For instance, if ρ=∣+⟩⟨+∣ then the probabilities for the two outcomes 0 and 1 are as follows.
These four states are perfectly spread out on the Bloch sphere, each one equidistant from the other three and with the angles between any two of them always being the same.
Now let us define a measurement {P0,P1,P2,P3} of a qubit by setting Pa as follows for each a=0,…,3.
Pa=2∣ϕa⟩⟨ϕa∣
We can verify that this is a valid measurement as follows.
Each Pa is evidently positive semidefinite, being a pure state divided by one-half.
That is, each one is a Hermitian matrix and has one eigenvalue equal to 1/2 and all other eigenvalues zero.
The sum of these matrices is the identity matrix: P0+P1+P2+P3=I.
The expressions of these matrices as linear combinations of Pauli matrices makes this straightforward to verify.
A second way to describe measurements in mathematical terms is as channels.
Classical information can be viewed as a special case of quantum information, insofar as we can identify probabilistic states with diagonal density matrices.
So, in operational terms, we can think about measurements as being channels whose inputs are matrices describing states of whatever system is being measured and whose outputs are diagonal density matrices describing the resulting distribution of measurement outcomes.
We'll see shortly that any channel having this property can always be written in a simple, canonical form that ties directly to the description of measurements as collections of positive semidefinite matrices.
Conversely, given an arbitrary measurement as a collection of matrices, there's always a valid channel having the diagonal output property that describes the given measurement as suggested in the previous paragraph.
Putting these observations together, we find that the two descriptions of general measurements are equivalent.
Before proceeding further, let's be more precise about the measurement, how we're viewing it as a channel, and what assumptions we're making about it.
As before, we'll suppose that X is the system to be measured, and that the possible outcomes of the measurement are the integers 0,…,m−1 for some positive integer m.
We let Y be the system that stores measurement outcomes, so its classical state set is {0,…,m−1}, and we represent the measurement as a channel named Φ from X to Y.
Our assumption is that Y is classical — which is to say that no matter what state we start with for X, the state of Y we obtain is represented by a diagonal density matrix.
We can express in mathematical terms that the output of Φ is always diagonal in the following way.
First define the completely dephasing channel Δm on Y.
Δm(σ)=a=0∑m−1⟨a∣σ∣a⟩∣a⟩⟨a∣
This channel is analogous to the completely dephasing qubit channel Δ from the previous lesson.
As a linear mapping, it zeros out all of the off-diagonal entries of an input matrix and leaves the diagonal alone.
And now, a simple way to express that a given density matrix σ is diagonal is by the equation
σ=Δm(σ).
In words, zeroing out all of the off-diagonal entries of a density matrix has no effect if and only if the off-diagonal entries were all zero to begin with.
The channel Φ therefore satisfies our assumption — that Y is classical — if and only if
Φ(ρ)=Δm(Φ(ρ))
for every density matrix ρ representing a state of X.
Thus, for these same matrices P0,…,Pm−1 we can express the channel Φ as follows.
Φ(ρ)=a=0∑m−1Tr(Paρ)∣a⟩⟨a∣
This expression is consistent with our description of general measurements in terms of matrices, as we see each measurement outcome appearing with probability Tr(Paρ).
Now let's observe that the two properties required of the collection of matrices {P0,…,Pm−1} to describe a general measurement are indeed satisfied.
The first property is that they're all positive semidefinite matrices.
One way to see this is to observe that, for every vector ∣ψ⟩ having entries in correspondence with the classical state of X we have
The transpose of each Pa is introduced for the third equality because
⟨c∣Pa∣b⟩=⟨b∣PaT∣c⟩.
This allows for the expressions ∣b⟩⟨b∣ and ∣c⟩⟨c∣ to appear, which simplify to the identity matrix upon summing over b and c, respectively.
By the assumption that P0,…,Pm−1 are positive semidefinite, so too are P0T,…,Pm−1T.
In particular, transposing a Hermitian matrix results in another Hermitian matrix, and the eigenvalues of any square matrix and its transpose always agree.
It follows that J(Φ) is positive semidefinite.
Tracing out the output system Y (which is the system on the right) yields
Suppose that we have multiple systems that are collectively in a quantum state, and a general measurement is performed on one of the systems.
This results in one of the measurement outcomes, selected at random according to probabilities determined by the measurement and the state of the system prior to the measurement.
The resulting state of the remaining systems will then, in general, depend on which measurement outcome was obtained.
Let's examine how this works for a pair of systems (X,Z) when the system X is measured.
(We're naming the system on the right Z because we'll take Y to be a system representing the classical output of the measurement when we view it as a channel.)
We can then easily generalize to the situation in which the systems are swapped as well as to three or more systems.
Suppose the state of (X,Z) prior to the measurement is described by a density matrix ρ, which we can write as follows.
ρ=b,c=0∑n−1∣b⟩⟨c∣⊗ρb,c
In this expression we're assuming the classical states of X are 0,…,n−1.
We'll assume that the measurement itself is described by the collection of matrices
{P0,…,Pm−1}.
This measurement may alternatively be described as a channel Φ from X to Y, where Y is a new system having classical state set {0,…,m−1}.
Specifically, the action of this channel can be expressed as follows.
We're considering a measurement of the system X, so the probabilities with which different measurement outcomes are obtained can depend only on ρX, the reduced state of X.
In particular, the probability for each outcome a∈{0,…,m−1} to appear can be expressed in three equivalent ways.
Tr(PaρX)=Tr(PaTrZ(ρ))=Tr((Pa⊗IZ)ρ)
The first expression naturally represents the probability to obtain the outcome a based on what we already know about measurements of a single system.
To get the second expression we're simply using the definition ρX=TrZ(ρ).
To get the third expression requires more thought — and learners are encouraged to convince themselves that it is true.
Here's a hint: The equivalence between the second and third expressions does not depend on ρ being a density matrix or on each Pa being positive semidefinite. Try showing it first for tensor products of the form ρ=M⊗N and then conclude that it must be true in general by linearity.
While the equivalence of the first and third expressions in the previous equation may not be immediate, it does make sense.
Starting from a measurement on X, we're effectively defining a measurement of (X,Z), where we simply throw away Z and measure X.
Like all measurements, this new measurement can be described by a collection of matrices, and it's not surprising that this measurement is described by the collection
If we want to determine not only the probabilities for the different outcomes but also the resulting state of Z conditioned on each measurement outcome, we can look to the channel description of the measurement.
In particular, let's examine the state we get when we apply Φ to X and do nothing to Z.
like we saw in the Density matrices lesson.
For each measurement outcome a∈{0,…,m−1}, we have with probability
p(a)=Tr((Pa⊗IZ)ρ)
that Y is in the classical state ∣a⟩⟨a∣ and Z is in the state
σa=Tr((Pa⊗IZ)ρ)TrX((Pa⊗IZ)ρ).(2)
That is, this is the density matrix we obtain by normalizing
TrX((Pa⊗IZ)ρ)
by dividing it by its trace.
(Formally speaking, the state σa is only defined when the probability p(a) is nonzero;
when p(a)=0 this state is irrelevant, for it refers to a discrete event that occurs with probability zero.)
Naturally, the outcome probabilities are consistent with our previous observations.
In summary, this is what happens when the measurement {P0,…,Pm−1} is performed on X when (X,Z) is in the state ρ.
Each outcome a appears with probability p(a)=Tr((Pa⊗IZ)ρ).
Conditioned on obtaining the outcome a, the state of Z is then represented by the density matrix σa shown in the equation (2), which is obtained by normalizing TrX((Pa⊗IZ)ρ).
We can adapt this description to other situations, such as when the ordering of the systems is reversed or when there are three or more systems.
Conceptually it is straightforward, although it can become cumbersome to write down the formulas.
In general, if we have r systems X1,…,Xr, the state of the compound system (X1,…,Xr) is ρ, and the measurement {P0,…,Pm−1} is performed on Xk, the following happens.
Each outcome a appears with probability
p(a)=Tr((IX1⊗⋯⊗IXk−1⊗Pa⊗IXk+1⊗⋯⊗IXr)ρ).
Conditioned on obtaining the outcome a, the state of (X1,…,Xk−1,Xk+1,…,Xr) is then represented by the following density matrix.