This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.
Paper The following article is Open access

'Quantum Cheshire Cat' as simple quantum interference

, , and

Published 27 May 2015 © 2015 IOP Publishing Ltd and Deutsche Physikalische Gesellschaft
, , Citation Raul Corrêa et al 2015 New J. Phys. 17 053042 DOI 10.1088/1367-2630/17/5/053042

1367-2630/17/5/053042

Abstract

In a recent work, Aharonov et al suggested that a photon could be separated from its polarization in an experiment involving pre- and post-selection (Aharonov et al 2013 New J. Phys. 15 113015). They named the effect 'quantum Cheshire Cat', in a reference to the cat that is separated from its grin in the novel Alice's Adventures in Wonderland. Following these ideas, Denkmayr et al performed a neutron interferometric experiment and interpreted the results suggesting that neutrons were separated from their spin (Denkmayr et al 2014 Nat. Commun. 5 4492). Here we show that these results can be interpreted as simple quantum interference, with no separation between the quantum particle and its internal degree of freedom. We thus hope to clarify the phenomenon with this work, by removing these apparent paradoxes.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

The concept of a quantum weak value, introduced in 1988 by Aharonov et al [1], has allowed the development of novel and important experimental techniques to study quantum systems [2]. The amplification of small signals beyond technical noise [3, 4], the direct determination of quantum states [5] and geometric phases [6], and the characterization of the nonclassical behavior of quantum systems [7] are a few examples of its usefulness (check [2] for a more extensive list of applications). Together with the new interferometric concept, the original paper and some follow-ups also suggested that weak measurements could lead to a new interpretation of quantum phenomena. These somewhat controversial ideas, however, were the target of lots of discussions as can be seen in the comments following the 1988 original paper [811].

Recently, a new set of proposals and experiments has revived some of this controversy by suggesting even more radical ways of reinterpreting quantum mechanics [1215]. For instance, in a recent theoretical work, Aharonov et al argued that a particular weak measurement setup for photons allowed one to state that 'in the curious way of quantum mechanics, photon polarization may exist where there is no photon at all' [12]. The idea was reinforced both in a news and views article of Nature Physics [13] and in an experiment performed with neutrons [14]. The first discusses the results of [12] concluding that 'polarization could be effectively isolated from the photons carrying it', while the second implements an equivalent interferometer to the one proposed in [12] and argues that 'the experimental results suggest that the system behaves as if the neutrons go through one beam path, while their magnetic moment travels along the other' [14]. The phenomenon was nicknamed 'quantum Cheshire Cat', in a reference to the cat that is separated from its grin in the novel Alice's Adventures in Wonderland, by Lewis Carroll.

According to [1], in order to obtain a weak value one must interact the system to be measured with a continuous degree of freedom described by a wavefunction, which will act as a quantum pointer. Before the interaction, the system is pre-selected in a known state $|{{\Phi }_{{\rm pre}}}\rangle $ and, after interacting with the pointer, a post-selection is made on a specific state of the measured system $|{{\Phi }_{{\rm post}}}\rangle $, finally preparing the pointer in a state that depends on both $|{{\Phi }_{{\rm pre}}}\rangle $ and $|{{\Phi }_{{\rm post}}}\rangle $. Given the weakness of the interaction, the wavefunction of the pointer is displaced by a small amount to which the weak value is proportional.

Most of the controversy in trying to extract a new interpretation out of weak measurements lies in the attempt to attach physical reality to the otherwise mathematically defined weak values. In [12], the continuous degrees of freedom acting as quantum meters are not considered in the construction of the argument for the new interpretation. In the present work we show that by taking these degrees of freedom into account, both the theoretical predictions and experimental results that motivated the somewhat unusual 'Cheshire Cat' interpretation can be explained as simple quantum interference where no detachment between the photon and its polarization or between the neutron and its magnetic moment is actually required. We begin by quickly revisiting the proposal of [12], then we proceed to explain it with standard quantum mechanics interference and we finally show how a similar approach also explains the experiment described in [14].

2. The Aharonov et al proposal

In [12], the authors base their proposal in the interferometer shown in figure 1 which is designed to prepare the photon in state $|\Psi \rangle $ and post-select it in state $|\Phi \rangle $, both specified below:

Equation (1)

Equation (2)

where $|I\rangle $ and $|II\rangle $ correspond to the photon being in each arm of the interferometer shown in figure 1, and $|H\rangle $ and $|V\rangle $ represent its horizontal and vertical polarization components. The left and right circular polarization states can be written as $|\pm \rangle =(|H\rangle \pm {\rm i}|V\rangle )/\sqrt{2}$ and these are the eigenstates of the 'spin' angular momentum operator for the photon.

Figure 1.

Figure 1. Interferometer setup from [12]. The photon reaches a beam splitter (BS1) with polarization H, right after which we have the state $|\Psi \rangle $ from equation (1), called pre-selection state. After reflection from mirrors on both arms I and II, we want to make the post-selection of the state $|\Phi \rangle $ from (2) on detector D1. A half-wave plate (HWP) interchanges H and V polarizations on arm I and a phase shifter (PS) adds a specific phase right after it, so that, after a second beam splitter (BS2) and a polarizing beam splitter (PBS) that transmits only horizontal polarization, we assure that D1 will always click for $|\Phi \rangle $ and never click for any state orthogonal to it.

Standard image High-resolution image

In order to do the desired pre- and post-selection, beam splitters (BS1 and BS2), a half-wave plate (HWP), a phase shifter and a polarizing beam splitter (PBS) are placed and adjusted in a way that a photon on the state $|\Phi \rangle $ right before HWP will always click at detector D1; if it is in any state orthogonal to $|\Phi \rangle $, it necessarily clicks elsewhere. The adjustments of the devices are described in [12].

After being prepared in the state $|\Psi \rangle $, the photon interacts with devices positioned in both interferometer paths. These devices act as probes and can perform either projective measurements (also called 'strong') or weak measurements. Although in [12] the authors considered them as quantum devices, the description of their quantum state is not included in the paper. We will explicitly represent their quantum states here, though, as this is the essential piece for describing the phenomenon as quantum interference. In arm I there is a polarization detector whose quantum state $|{{P}_{1}}\rangle $ changes to $|P_{1}^{+}\rangle (|P_{1}^{-}\rangle )$ if a photon of polarization $|+\rangle (|-\rangle )$ propagates through that arm, or stays at $|{{P}_{1}}\rangle $ if a photon propagates through arm II. Meanwhile, in arm II there is a quantum device to measure the presence of the photon, i.e. a device whose quantum state $|{{P}_{2}}\rangle $ changes into $|P_{2}^{+}\rangle $ if a photon propagates through that arm, or stays at $|{{P}_{2}}\rangle $ if a photon propagates through arm I. In the proposal of [12] the probe of arm I is a combination of birrefringent materials that produces a positive horizontal displacement for a photon with polarization $|+\rangle $ and a negative horizontal displacement for a photon with polarization $|-\rangle $. The pointer state $|{{P}_{1}}\rangle $ of the device is thus associated to the center of the beam in the horizontal direction. The probe in arm II is a glass sheet that displaces the photon beam up. The pointer state $|{{P}_{2}}\rangle $ of the device is thus associated to the center of the beam in the vertical direction. If the beam displacements produced by the measurement devices are greater than the beam diameter, then we have a projective measurement where we can associate the vertical displacement with the photon path and the horizontal displacement with the polarization of the photon propagating through arm I. If the beam displacements are much smaller than the beam diameters, then we are in the weak measurements regime. In this case we have, for instance, $\langle {{P}_{1}}|P_{1}^{+}\rangle \approx 1-\epsilon $ with $|\epsilon |\ll 1$, as opposite to $\langle {{P}_{1}}|P_{1}^{+}\rangle =0$ for a projective measurement.

After the interaction with the measurement devices in each path, the composite state of the photon and the pointers is

Equation (3)

Note that the changes in the state of the pointer are only defined by the shapes of the wavefunctions of the probes and by their displacements and, therefore, are unaffected by global phases and normalization factors. For that reason we have just added a factor C which accounts for these global factors (normalization and phase) but does not influence by any means the overall analysis.

To make a post-selection of the results only for photons that leave the PBS in direction to D1 means to project the above state (3) on $|\Phi \rangle $ from (2). We end up with the following quantum state for the pointers

Equation (4)

Note that in the state (4) there is entanglement between the pointers, meaning that there are quantum correlations between the measurement devices placed in arms I and II, even though they have never interacted directly with each other [16]. If the devices make projective measurements, there are three possibilities for each photon: either the photon has an up displacement greater than the beam diameter or the photon has a positive (negative) horizontal displacement greater than the beam diameter. In these cases, if one assumes a real trajectory for the photon, the first situation is compatible with propagation through arm II, and the second (third) is compatible with propagation through arm I and polarization $|+\rangle $($|-\rangle $). On the other hand, when the measurement devices interact weakly with the photon, producing vertical and horizontal displacements much smaller than the beam diameter, such assumptions about past trajectories of the photon simply cannot hold true anymore due to the quantum interference between the different possible paths. In fact, it is well known that attributing physical reality to the past of quantum particles inside interferometers leads to paradoxes [17]. However, that is exactly what the authors of [12] do when they extend the interpretation used in the projective measurements to the weak interaction case. They consider that if the average vertical displacement of a set of photons is the same as the displacement of the wavefunction of one photon eventually propagating through arm II, then this indicates that the photons had propagated through this arm. In the same way, they also assume that if the average horizontal displacement of a set of photons is the same of the wavefunction of one photon eventually propagating through arm I with polarization $|+\rangle $, then there is a $|+\rangle $ polarization in arm I. This is the origin of the paradox when concluding that 'the photon is in the left arm (...) while the angular momentum is in the right arm' [12]. As we show in the following, there is no such paradox if we describe the phenomenon as simple quantum interference.

Let us now explicitly describe the pointers as suggested in [12], which will be the transversal beam profiles of the photon in the fashion of a Bialynicki-Birula–Sipe photon wave function [1821] in the paraxial regime. In this regime, the beam propagation is highly directional, such that its state can be written as the product of a transversal function in the xy plane (with z-dependent parameters) and a function that describes its evolution while propagating in the z direction [22]. The beam polarization can also be treated as an independent parameter in this regime. We may consider that the transversal properties of the beam do not vary much with the propagation through the interferometer, such that the transversal wave function can be written as a function only of the transversal components x and y. In the proposal of [12], $|{{P}_{1}}\rangle $ corresponds to a wave function f(x) and $|{{P}_{2}}\rangle $ corresponds to g(y), such that $|P_{1}^{+}\rangle $, $|P_{1}^{-}\rangle $ and $|P_{2}^{+}\rangle $ correspond to the same functions displaced by ${{\delta }_{x}}$, $-{{\delta }_{x}}$ and ${{\delta }_{y}}$, respectively. Hence, by defining $F(x,y)\equiv f(x)g(y)$, the state (4) is described by the transversal wave function:

Equation (5)

where ${{C}_{1}}$ again is a normalization factor that depends on how much the components of (5) overlap with each other and on the shape of $F(x,y)$. If the detector D1 is a screen sensitive to the photon position, after many runs we can build the distribution $|{{F}_{1}}(x,y){{|}^{2}}$ on it. The pattern showed will depend on how orthogonal these three displaced functions above are, and how their overlap creates interference. In a strong (projective) measurement regime, the interaction displaces each term above the beam radius, such that their overlap is negligible and there is no interference. Each part of the function is then identifiable on the screen, offering trustworthy information about the measured quantities. This behaviour is clear when we realize that, without significant overlap of the pointer states, that is for instance $\langle {{P}_{1}}|P_{1}^{+}\rangle \to 0$, the probability density is approximately

Equation (6)

The weak measurement will happen in the case that the displaced functions are nearly completely overlapping. They hence interfere and $|{{F}_{1}}(x,y){{|}^{2}}$ cannot be approximated by (6)—it may even look like the same function displaced by some other amount. But, as we discussed before, the reason of the paradoxes in [12, 14] is to consider that the resultant displacement may be read as a measurement in the ordinary sense.

Using the approximations $F(x\pm {{\delta }_{x}},y)\approx F(x,y)\pm {{\delta }_{x}}\frac{\partial F(x,y)}{\partial x}$ and $F(x,y-{{\delta }_{y}})\approx F(x,y)-{{\delta }_{y}}\frac{\partial F(x,y)}{\partial y}$ in (5), we end up with

Equation (7)

According to the arguments of [12, 14], this result is compatible with the situation where the photon is measured in the left arm of the interferometer (the beam was displaced up by ${{\delta }_{y}}$) and, at the same time, there is positive angular momentum on the right arm of the interferometer (the beam was displaced sideways by ${{\delta }_{x}}$). However, as pointed out in our calculations, the probability of finding the photons at this particular range of positions can be interpreted as simple interference. Note that any quantum continuous variable could have been used as a probe. The photon wavefunction is a particularly effective example because it fulfils both the superposition principle and the approximations discussed above. Also note that the same holds true for the Schrödinger wavefunction. Actually, our treatment is suitable to describe even the light intensity distribution on the detector if classical electromagnetic waves are sent through the interferometer.

To make this issue clearer, in figure 2(a) we plot $2F(x,y-{{\delta }_{y}})=2f(x)f(y-{{\delta }_{y}})$, which is the component of the photon wavefunction that comes from the arm II in (5), with f(x) being a Gaussian function with width W centred at zero. In figure 2(b) we plot $F(x-{{\delta }_{x}},y)-F(x+{{\delta }_{x}},y)$, which is the component of the photon wavefunction that comes from arm I in (5). We can see that for ${{\delta }_{x}}={{\delta }_{y}}\ll W$ the major contribution to the wavefunction of (5) comes from $2F(x,y-{{\delta }_{y}})$. But the term $F(x-{{\delta }_{x}},y)-F(x+{{\delta }_{x}},y)$ interferes destructively with $2F(x,y-{{\delta }_{y}})$ for negative x and constructively for positive x, resulting in an overall positive displacement in the horizontal direction for the wavefunction even if this term is small, as can be seen in figure 2(c). A similar argument was used in [23] to present a classical explanation of the experimental results of [24]. In [24] the authors performed experiments with a nested Mach–Zehnder interferometer and concluded that 'the photons tell us that they have been in the parts of the interferometer through which they could not pass' [24]. But again, this odd conclusion was achieved through the attribution of a physical reality to the weak value of a weak measurement.

Figure 2.

Figure 2. Plots of the non-normalized functions (a) $2F(x,y-{{\delta }_{y}})$ and (b) $[F(x-{{\delta }_{x}},y)-F(x+{{\delta }_{x}},y)]$, for $F(x,y)={{{\rm e}}^{-({{x}^{2}}+{{y}^{2}})/{{W}^{2}}}}$ and ${{\delta }_{x}}={{\delta }_{y}}=0.1\;{\rm W}$. The amplitude should be noted, where in (a) the maximum is around 1.5, and in (b) it's around 0.012. In (c) we plot $|2F(x,y-{{\delta }_{y}})-F(x-{{\delta }_{x}},y)-F(x+{{\delta }_{x}},y){{|}^{2}}$, to show that the displacement is small enough so the pointer indicates the weak values.

Standard image High-resolution image

3. The experiment of Denkmayr et al

The experimental realization by Denkmayr et al [14] was similar to the theoretical proposal of [12] and is depicted in figure 3. After going through the first beam splitter and the spin rotators, the neutron is prepared in the state

Equation (8)

$|I\rangle $ and $|II\rangle $ represent the neutron states in the corresponding interferometer arm and $|\pm \rangle $ are the eigenvectors of the neutron x-component spin operator. There is a spin analyzer before D1, such that the detection of a neutron in D1 is associated to the projector

Equation (9)

where χ is a controllable phase. Since there is no spin analyzer before D2, the detection of a neutron in D2 is associated to the projector

Equation (10)

where ${{\hat{I}}_{{\rm s}}}$ corresponds to the identity operator for the spin degree of freedom.

Figure 3.

Figure 3. Experimental setup of [14]. A neutron beam reaches a beam splitter BS1 and the spin rotators (SR) on each arms I and II are set to leave each neutron on the state $|{{\Psi }_{n}}\rangle $ from (8). A phase χ between the paths may be controlled before the beams are combined at BS2. A spin analyzer (SA) selects only the $|-\rangle $ component of spin to be detected by the neutron detector D1. There is no spin selection for D2. Absorbers may be placed and magnetic fields can be applied to rotate the neutron spin in each path.

Standard image High-resolution image

In the first part of the experiments, Denkmayr et al place absorbers in each path and see if the detection counts in D1 reduce. Since D1 only detects neutrons with spin state $|-\rangle $ and according to (8) there is no $|-\rangle $ component for the part of the neutron wavefunction that propagates through arm I, the detections should not vary when an absorber is placed in path I and should decrease when the absorber is placed in path II. This behavior is observed in the experiments [14].

In the second part of the experiments, magnetic fields are applied in the interferometer paths to produce a small rotation of the neutrons spin. The phase χ of the projectors from (9) and (10) is varied and it is observed if the detection counts in D1 and D2 depend or not on χ. When no magnetic field is applied, there can be no interference since the spin states in each path are orthogonal for the state (8), so the counts on both detectors should not depend on χ. When a magnetic field is applied in path I changing the neutron spin state from $|+\rangle $ to $a|+\rangle +b|-\rangle $ with $|a{{|}^{2}}+|b{{|}^{2}}=1$, the $|-\rangle $ component of the wavefunction of this path can interfere with the wavefunction of path II, such that the counts in both detectors should depend on χ. When a magnetic field is applied in path II changing the neutron spin state from $|-\rangle $ to $c|-\rangle +d|+\rangle $ with $|c{{|}^{2}}+|d{{|}^{2}}=1$, the $|+\rangle $ component of the wavefunction of this path can interfere with the wavefunction of path I, such that the counts in detector D2 should depend on χ. But since the detector D1 selects only the $|-\rangle $ component of spin, the counts in this detector should not depend on χ. All these predictions are confirmed by the experiments [14].

The behaviors described above led the authors to say: ' (...) an absorber with high transmissivity has on average no significant effect on the measurement outcome if it is placed in path I. It is only effective if it is placed in path II. In contrast to that, a small magnetic field has on average a significant effect only in path I, while it has none in path II. Therefore, any probe system that interacts with the Cheshire Cat system weakly enough will on average be affected as if the neutron and its spin are spatially separated' [14]. This is what they mean when they say that 'the experimental results suggest that the system behaves as if the neutrons go through one beam path, while their magnetic moment travels along the other' [14]. As we have seen here, the results can be explained as simple quantum interference, with no separation between the neutron and its spin. There is no need of interpreting the results as if the neutron and its spin are spatially separated.

4. Final remarks

In their papers, both Aharonov et al [12] and Denkmayr et al [14] suggest a potential use of the quantum Cheshire Cat. In [12], the authors state: 'suppose that we wish to perform a measurement in which the magnetic moment plays the central role, whilst the charge causes unwanted disturbances. The question that arises is whether it might be possible to remove this disturbance, in a post-selected manner, by producing a Cheshire Cat where the charge is confined to a region of the experiment far from the magnetic moment.' In the last paragraph of the discussion in [14] there is an analogous proposal. In our framework, though, it is clear that disturbances in any degree of freedom of either the pre- or the post-selection state will change the final state of the pointer, due to a change of the amplitude or phase of each component in (4)—and therefore (7) will be different. The randomness of these disturbances leads to decoherence of the pointer state, therefore messing up the results.

In our opinion, the paradoxical conclusions that a photon may be separated from its polarization [12, 13] or that a neutron can be separated from its spin [14] presented as the 'quantum Cheshire Cat' effect are one more apparent paradox that arises whenever we attribute physical reality to quantum superposition states, for instance when describing a quantum particle inside an interferometer prior to its detection [17]. Recent works have also discussed the quantum Cheshire Cat, specifically the results of Denkmayr et al [25, 26]. Naive interpretations of delayed choice experiments [27, 28] or quantum erasers [2932] lead to similar apparent paradoxes. As we have shown here, the predictions of [12] and the experiments of [14] can be understood as simple quantum interference, with no separation between the quantum particles and their internal degrees of freedom, and we hope our results provide a better understanding of the phenomenon reinforcing that no interpretation more strange than standard quantum mechanics is required.

Acknowledgments

We would like to ackowledge Cristhiano Duarte for useful comments on the manuscript. This work was supported by the Brazilian agencies CNPq, CAPES, FAPEMIG, and PRPq/UFMG.

Please wait… references are loading.
10.1088/1367-2630/17/5/053042