Today we are going to study a relatively new effect ( new experimentally speaking, because it was first detected when I was an undergraduate student, in 2000) but it is not so new from the theoretical aside (theoretically, it was predicted in 1962). This effect is closely related to the Cherenkov effect. It is named Askaryan effect or Askaryan radiation, see below after a brief recapitulation of the Cherenkov effect last post we are going to do in the next lines.
We do know that charged particles moving faster than light through the vacuum emit Cherenkov radiation. How can a particle move faster than light? The weak speed of a charged particle can exceed the speed of light. That is all. About some speculations about the so-called tachyonic gamma ray emissions, let me say that the existence of superluminal energy transfer has not been established so far, and one may ask why. There are two options:
1) The simplest solution is that superluminal quanta just do not exist, the vacuum speed of light being the definitive upper bound.
2) The second solution is that the interaction of superluminal radiation with matter is very small, the quotient of tachyonic and electric fine-structure constants being . Therefore superluminal quanta and their substratum are hard to detect.
A related and very interesting question could be asked now related to the Cherenkov radiation we have studied here. What about neutral particles? Is there some analogue of Cherenkov radiation valid for chargeless or neutral particles? Because neutrinos are electrically neutral, conventional Cherenkov radiation of superluminal neutrinos does not arise or it is otherwise weakened. However neutrinos do carry electroweak charge and may emit certain Cherenkov-like radiation via weak interactions when traveling at superluminal speeds. The Askaryan effect/radiation is this Cherenkov-like effect for neutrinos, and we are going to enlighten your knowledge of this effect with this entry.
We are being bombarded by cosmic rays, and even more, we are being bombarded by neutrinos. Indeed, we expect that ultra-high energy (UHE) neutrinos or extreme ultra-high energy (EHE) neutrinos will hit us as too. When neutrinos interact wiht matter, they create some shower, specifically in dense media. Thus, we expect that the electrons and positrons which travel faster than the speed of light in these media or even in the air and they should emit (coherent) Cherenkov-like radiation.
Who was Gurgen Askaryan?
Let me quote what wikipedia say about him: Gurgen Askaryan (December 14, 1928-1997) was a prominent Soviet (armenian) physicist, famous for his discovery of the self-focusing of light, pioneering studies of light-matter interactions, and the discovery and investigation of the interaction of high-energy particles with condensed matter. He published more than 200 papers about different topics in high-energy physics.
Other interesting ideas by Askaryan: the bubble chamber (he discovered the idea independently to Glaser, but he did not published it so he did not win the Nobel Prize), laser self-focussing (one of the main contributions of Askaryan to non-linear optics was the self-focusing of light), and the acoustic UHECR detection proposal. Askaryan was the first to note that the outer few metres of the Moon’s surface, known as the regolith, would be a sufficiently transparent medium for detecting microwaves from the charge excess in particle showers. The radio transparency of the regolith has since been confirmed by the Apollo missions.
If you want to learn more about Askaryan ideas and his biography, you can read them here: http://en.wikipedia.org/wiki/Gurgen_Askaryan
What is the Askaryan effect?
The next figure is from the Askaryan radiation detected by the ANITA experiment:
The Askaryan effect is the phenomenon whereby a particle traveling faster than the phase velocity of light in a dense dielectric medium (such as salt, ice or the lunar regolith) produces a shower of secondary charged particles which contain a charge anisotropy and thus emits a cone of coherent radiation in the radio or microwave part of the electromagnetic spectrum. It is similar, or more precisely it is based on the Cherenkov effect.
High energy processes such as Compton, Bhabha and Moller scattering along with positron annihilation rapidly lead to about a 20%-30% negative charge asymmetry in the electron-photon part of a cascade. For instance, they can be initiated by UHE (higher than, e.g.,100 PeV) neutrinos.
1962, Askaryan first hypothesized this effect and suggested that it should lead to strong coherent radio and microwave Cherenkov emission for showers propagating within the dielectric. Since the dimensions of the clump of charged particles are small compared to the wavelength of the radio waves, the shower radiates coherent radio Cherenkov radiation whose power is proportional to the square of the net charge in the shower. The net charge in the shower is proportional to the primary energy so the radiated power scales quadratically with the shower energy, .
Indeed, these radio and coherent radiations are originated by the Cherenkov effect radiation. We do know that:
from the charged particle in a dense (refractive) medium experimenting Cherenkov radiation (CR). Every charge emittes a field . Then, the power is proportional to . In a dense medium:
We have two different experimental and interesting cases:
A) The optical case, with . Then, we expect random phases and .
B) The microwave case, with . In this situation, we expect coherent radiation/waves with .
We can exploit this effect in large natural volumes transparent to radio (dry): pure ice, salt formations, lunar regolith,…The peak of this coherent radiation for sand is produced at a frequency around , while the peak for ice is obtained around .
The first experimental confirmation of the Askaryan effect detection were the next two experiments:
1) 2000 Saltzberg et.al., SLAC. They used as target silica sand. The paper is this one http://arxiv.org/abs/hep-ex/0011001
2) 2002 Gorham et.al., SLAC. They used a synthetic salt target. The paper appeared in this place http://arxiv.org/abs/hep-ex/0108027
Indeed, in 1965, Askaryan himself proposes ice and salt as possible target media. The reasons are easy to understand:
1st. They provide high densities and then it means a higher probability for neutrino interaction.
2nd. They have a high refractive index. Therefore, the Cerenkov emission becomes important.
3rd. Salt and ice are radio transparent, and of course, they can be supplied in large volumes available throughout the world.
The advantages of radio detection of UHE neutrinos provided by the Askaryan effect are very interesting:
1) Low attenuation: clear signals from large detection volumes.
2) We can observe distant and inclined events.
3) It has a high duty cycle: good statistics in less time.
4) I has a relative low cost: large areas covered.
5) It is available for neutrinos and/or any other chargeless/neutral particle!
Problems with this Askaryan effect detection are, though: radio interference, correlation with shower parameters (still unclear), and that it is limited only to particles with very large energies, about .
Askaryan effect = coherent Cerenkov radiation from a charge excess induced by (likely) neutral/chargeless particles like (specially highly energetic) neutrinos passing through a dense medium.
Why the Askaryan effect matters?
It matters since it allows for the detection of UHE neutrinos, and it is “universal” for chargeless/neutral particles like neutrinos, just in the same way that the Cherenkov effect is universal for charged particles. And tracking UHE neutrinos is important because they point out towards its source, and it is suspected they can help us to solve the riddle of the origin and composition of cosmic rays, the acceleration mechanism of cosmic radiation, the nuclear interactions of astrophysical objects, and tracking the highest energy emissions of the Universe we can observe at current time.
Is it real? Has it been detected? Yes, after 38 years, it has been detected. This effect was firstly demonstrated in sand (2000), rock salt (2004) and ice (2006), all done in a laboratory at SLAC and later it has been checked in several independent experiments around the world. Indeed, I remember to have heard about this effect during my darker years as undergraduate student. Fortunately or not, I forgot about it till now. In spite of the beauty of it!
Moreover, it has extra applications to neutrino detection using the Moon as target: GLUE (detectors are Goldstone RTs), NuMoon (Westerbork array; LOFAR), or RESUN (EVLA), or the LUNASKA project. Using ice as target, there has been other experiments checking the reality of this effect: FORTE (satellite observing Greenland ice sheet), RICE (co-deployed on AMANDA strings, viewing Antarctic ice), and the celebrated ANITA (balloon-borne over Antarctica, viewing Antarctic ice) experiment.
Furthermore, even some experiments have used the Moon (an it is likely some others will be built in the near future) as a neutrino detector using the Askaryan radiation (the analogue for neutral particles of the Cherenkov effect, don’t forget the spot!).
Askaryan effect and the mysterious cosmic rays.
Askaryan radiation is important because is one of the portals of the UHE neutrino observation coming from cosmic rays. The mysteries of cosmic rays continue today. We have detected indeed extremely energetic cosmic rays beyond the scale. Their origin is yet unsolved. We hope that tracking neutrinos we will discover the sources of those rays and their nature/composition. We don’t understand or know any mechanism being able to accelerate particles up to those incredible particles. At current time, IceCube has not detected UHE neutrinos, and it is a serious issue for curren theories and models. It is a challenge if we don’t observe enough UHE neutrinos as the Standard Model would predict. Would it mean that cosmic rays are exclusively composed by heavy nuclei or protons? Are we making a bad modelling of the spectrum of the sources and the nuclear models of stars as it happened before the neutrino oscillations at SuperKamiokande and Kamikande were detected -e.g.:SN1987A? Is there some kind of new Physics living at those scales and avoiding the GZK limit we would naively expect from our current theories?
The Cherenkov effect/Cherenkov radiation, sometimes also called Vavilov-Cherenkov radiation, is our topic here in this post.
In 1934, P.A. Cherenkov was a post graduate student of S.I.Vavilov. He was investigating the luminescence of uranyl salts under the incidence of gamma rays from radium and he discovered a new type of luminiscence which could not be explained by the ordinary theory of fluorescence. It is well known that fluorescence arises as the result of transitions between excited states of atoms or molecules. The average duration of fluorescent emissions is about and the transition probability is altered by the addition of “quenching agents” or by some purification process of the material, some change in the ambient temperature, etc. It shows that none of these methods is able to quench the fluorescent emission totally, specifically the new radiation discovered by Cherenkov. A subsequent investigation of the new radiation ( named Cherenkov radiation by other scientists after the Cherenkov discovery of such a radiation) revealed some interesting features of its characteristics:
1st. The polarization of luminiscence changes sharply when we apply a magnetic field. Cherenkov radiation luminescence is then causes by charged particles rather than by photons, the -ray quanta! Cherenkov’s experiment showed that these particles could be electrons produced by the interaction of -photons with the medium due to the photoelectric effect or the Compton effect itself.
2nd. The intensity of the Cherenkov’s radiation is independent of the charge Z of the medium. Therefore, it can not be of radiative origin.
3rd. The radiation is observed at certain angle (specifically forming a cone) to the direction of motion of charged particles.
The Cherenkov radiation was explained in 1937 by Frank and Tamm based on the foundations of classical electrodynamics. For the discovery and explanation of Cherenkov effect, Cherenkov, Frank and Tamm were awarded the Nobel Prize in 1958. We will discuss the Frank-Tamm formula later, but let me first explain how the classical electrodynamics handle the Vavilov-Cherenkov radiation.
The main conclusion that Frank and Tamm obtained comes from the following observation. They observed that the statement of classical electrodynamics concerning the impossibility of energy loss by radiation for a charged particle moving uniformly and following a straight line in vacuum is no longer valid if we go over from the vacuum to a medium with certain refractive index . They went further with the aid of an easy argument based on the laws of conservation of momentum and energy, a principle that rests in the core of Physics as everybody knows. Imagine a charged partice moving uniformly in a straight line, and suppose it can loose energy and momentum through radiation. In that case, the next equation holds:
This equation can not be satisfied for the vacuum but it MAY be valid for a medium with a refractive index gretear than one . We will simplify our discussion if we consider that the refractive index is constant (but similar conclusions would be obtained if the refractive index is some function of the frequency).
By the other hand, the total energy E of a particle having a non-null mass and moving freely in vacuum with some momentum p and velocity v will be:
Moreover, the electromagnetic radiation in vaccum is given by the relativistic relationship
From this equation, we easily get that
Since the particle velocity is , we obtain that
In conclusion: the laws of conservation of energy and momentum prevent that a charged particle moving with a rectilinear and uniform motion in vacuum from giving away its energy and momentum in the form of electromagnetic radiation! The electromagnetic radiation can not accept the entire momentum given away by the charged particle.
Anyway, we realize that this restriction and constraint is removed and given up when the aprticle moves in a medium with a refractive index . In this case, the velocity of light in the medium would be
and the velocity v of the particle may not only become equal to the velocity of light in the medium, but even exceed it when the following phenomenological condition is satisfied:
It is obvious that, when the condition
will be satisfied for electromagnetic radiation emitted strictly in the direction of motion of the particle, i.e., in the direction of the angle . If , this equation is verified for some direction along with , where
is the projection of the particle velocity v on the observation direction. Then, in a medium with , the conservation laws of energy and momentum say that it is allowed that a charged particle with rectilinear and uniform motion, can loose fractions of energy and momentum and , whenever those lost energy and momentum is carried away by an electromagnetic radiation propagating in the medium at an angle/cone given by:
with respect to the observation direction of the particle motion.
These arguments, based on the conservation laws of momenergy, do not provide any ide about the real mechanism of the energy and momentum which are lost during the Cherenkov radiation. However, this mechanism must be associated with processes happening in the medium since the losses can not occur ( apparently) in vacuum under normal circumstances ( we will also discuss later the vacuum Cherenkov effect, and what it means in terms of Physics and symmetry breaking).
We have learned that Cherenkov radiation is of the same nature as certain other processes we do know and observer, for instance, in various media when bodies move in these media at a velocity exceeding that of the wave propagation. This is a remarkable result! Have you ever seen a V-shaped wave in the wake of a ship? Have you ever seen a conical wave caused by a supersonic boom of a plane or missile? In these examples, the wave field of the superfast object if found to be strongly perturbed in comparison with the field of a “slow” object ( in terms of the “velocity of sound” of the medium). It begins to decelerate the object!
Question: What is then the mechanism behind the superfast motion of a charged particle in a medium wiht a refractive index producing the Cherenkov effect/radiation?
Answer: The mechanism under the Cherenkov effect/radiation is the coherent emission by the dipoles formed due to the polarization of the medium atoms by the charged moving particle!
The idea is as follows. Dipoles are formed under the action of the electric field of the particle, which displaces the electrons of the sorrounding atoms relative to their nuclei. The return of the dipoles to the normal state (after the particle has left the given region) is accompanied by the emission of an electromagnetic signal or beam. If a particle moves slowly, the resulting polarization will be distribute symmetrically with respect to the particle position, since the electric field of the particle manages to polarize all the atoms in the near neighbourhood, including those lying ahead in its path. In that case, the resultant field of all dipoles away from the particle are equal to zero and their radiations neutralize one to one.
Then, if the particle move in a medium with a velocity exceeding the velocity or propagation of the electromagnetic field in that medium, i.e., whenever , a delayed polarization of the medium is observed, and consequently the resulting dipoles will be preferably oriented along the direction of motion of the particle. See the next figure:
It is evident that, if it occurs, there must be a direction along which a coherent radiation form dipoles emerges, since the waves emitted by the dipoles at different points along the path of the particle may turn our to be in the same phase. This direction can be easiy found experimentally and it can be easily obtained theoretically too. Let us imagine that a charged particle move from the left to the right with some velocity in a medium with a refractive index, with . We can apply the Huygens principle to build the wave front for the emitted particle. If, at instant , the aprticle is at the point , the surface enveloping the spherical waves emitted by the same particle on its own path from the origin at to the arbitrary point . The radius of the wave at the point at such an instant t is equal to . At the same moment, the wave radius at th epint x is equal to . At any intermediate point x’, the wave radius at instant t will be . Then, the radius decreases linearly with increasing . Thus, the enveloping surface is a cone with angle , where the angle satisfies in addition
The normal to the enveloping surface fixes the direction of propagation of the Cherenkov radiation. The angle between the normal and the -axis is equal to , and it is defined by the condition
This is the result we anticipated before. Indeed, it is completely general and Quantum Mechanics instroudces only a light and subtle correction to this classical result. From this last equation, we observer that the Cherenkov radiation propagates along the generators of a cone whose axis coincides with the direction of motion of the particle an the cone angle is equal to . This radiation can be registered on a colour film place perpendicularly to the direction of motion of the particle. Radiation flowing from a radiator of this type leaves a blue ring on the photographic film. These blue rings are the archetypical fingerprints of Vavilov-Cherenkov radiation!
The sharp directivity of the Cherenkov radiation makes it possible to determine the particle velocity from the value of the Cherenkov’s angle . From the Cherenkov’s formula above, it follows that the range of measurement of is equal to
For , the radiation is observed at an angle , while for the extreme with , the angle reaches a maximum value
For instance, in the case of water, and . Therefore, the Cherenkov radiation is observed in water whenever . For electrons being the charged particles passing through the water, this condition is satisfied if
As a consequence of this, the Cherenkov effect should be observed in water even for low-energy electrons ( for isntance, in the case of electrons produced by beta decay, or Compton electrons, or photoelectroncs resulting from the interaction between water and gamma rays from radioactive products, the above energy can be easily obtained and surpassed!). The maximum angle at which the Cherenkov effec can be observed in water can be calculated from the condition previously seen:
This angle (for water) shows to be equal to about . In agreement with the so-called Frank-Tamm formula ( please, see below what that formula is and means), the number of photons in the frequency interval and emitted by some particle with charge Z moving with a velocity in a medium with a refractive indez n is provided by the next equation:
This formula has some striking features:
1st. The spectrum is identical for particles with , i.e., the spectrum is exactly the same, irespectively the nature of the particle. For instance, it could be produced both by protons, electrons, pions, muons or their antiparticles!
2nd. As Z increases, the number of emitted photons increases as .
3rd. increases with , the particle velocity, from zero ( with ) to
4th. is approximately independent of . We observe that .
5th. As the spectrum is uniform in frequency, and , this means that the main energy of radiation is concentrated in the extreme short-wave region of the spectrum, i.e.,
And then, this feature explains the bluish-violet-like colour of the Cherenkov radiation!
Indeed, this feature also indicates the necessity of choosing materials for practical applications that are “transparent” up to the highest frequencies ( even the ultraviolet region). As a rule, it is known that in the X-ray region and hence the Cherenkov condition can not be satisfied! However, it was also shown by clever experimentalists that in some narrow regions of the X-ray spectrum the refractive index is ( the refractive index depends on the frequency in any reasonable materials. Practical Cherenkov materials are, thus, dispersive! ) and the Cherenkov radiation is effectively observed in apparently forbidden regions.
The Cherenkov effect is currently widely used in diverse applications. For instance, it is useful to determine the velocity of fast charged particles ( e.g, neutrino detectors can not obviously detect neutrinos but they can detect muons and other secondaries particles produced in the interaction with some polarizable medium, even when they are produced by (electro)weak intereactions like those happening in the presence of chargeless neutrinos). The selection of the medium fo generating the Cherenkov radiation depends on the range of velocities over which measurements have to be produced with the aid of such a “Cherenkov counter”. Cherenkov detectors/counters are filled with liquids and gases and they are found, e.g., in Kamiokande, Superkamiokande and many other neutrino detectors and “telescopes”. It is worth mentioning that velocities of ultrarelativistic particles are measured with Cherenkov detectors whenever they are filled with some special gasesous medium with a refractive indes just slightly higher than the unity. This value of the refractive index can be changed by realating the gas pressure in the counter! So, Cherenkov detectors and counters are very flexible tools for particle physicists!
Remark: As I mentioned before, it is important to remember that (the most of) the practical Cherenkov radiators/materials ARE dispersive. It means that if is the photon frequency, and is the wavenumber, then the photons propagate with some group velocity , i.e.,
Note that if the medium is non-dispersive, this formula simplifies to the well known formula . As it should be for vacuum.
Accodingly, following the PDG, Tamm showed in a classical paper that for dispersive media the Cherenkov radiation is concentrated in a thin conical shell region whose vertex is at the moving charge and whose opening half-angle is given by the expression
where is the critical Cherenkov angle seen before, is the central value of the small frequency range under consideration under the Cherenkov condition. This cone has an opening half-angle (please, compare with the previous convention with for consistency), and unless the medium is non-dispersive (i.e. , ), we get . Typical Cherenkov radiation imaging produces blue rings.
THE CHERENKOV EFFECT: QUANTUM FORMULAE
When we considered the Cherenkov effect in the framework of QM, in particular the quantum theory of radiation, we can deduce the following formula for the Cherenkov effect that includes the quantum corrections due to the backreaction of the particle to the radiation:
where, like before, , n is the refraction index, is the De Broglie wavelength of the moving particle and is the wavelength of the emitted radiation.
Cherenkov radiation is observed whenever (i.e. if ), and the limit of the emission is on the short wave bands (explaining the typical blue radiation of this effect). Moreover, corresponds to .
By the other hand, the radiated energy per particle per unit of time is equal to:
where is the angular frequency of the radiation, with a maximum value of .
Remark: In the non-relativistic case, , and the condition implies that . Therefore, neglecting the quantum corrections (the charged particle self-interaction/backreaction to radiation), we can insert the limit and the above previous equations will simplify into:
Remember: is determined with the condition , where represents the dispersive effect of the material/medium through the refraction index.
THE FRANK-TAMM FORMULA
The number of photons produced per unit path length and per unit of energy of a charged particle (charge equals to ) is given by the celebrated Frank-Tamm formula:
In terms of common values of fundamental constants, it takes the value:
or equivalently it can be written as follows
The refraction index is a function of photon energy , and it is also the sensitivity of the transducer used to detect the light with the Cherenkov effect! Therefore, for practical uses, the Frank-Tamm formula must be multiplied by the transducer response function and integrated over the region for which we have .
Remark: When two particles are close toghether ( to be close here means to be separated a distance wavelength), the electromagnetic fields form the particles may add coherently and affect the Cherenkov radiation. The Cherenkov radiation for a electron-positron pair at close separation is suppressed compared to two independent leptons!
Remark (II): Coherent radio Cherenkov radiation from electromagnetic showers is significant and it has been applied to the study of cosmic ray air showers. In addition to this, it has been used to search for electron neutrinos induced showers by cosmic rays.
CHERENKOV DETECTOR: MAIN FORMULA AND USES
The applications of Cherenkov detectors for particle identification (generally labelled as PID Cherenkov detectors) are well beyond the own range of high-energy Physics. Its uses includes: A) Fast particle counters. B) Hadronic particle indentifications. C) Tracking detectors performing complete event reconstruction. The PDG gives some examples of each category: a) Polarization detector of SLD, b) the hadronic PID detectors at B factories like BABAR or the aerogel threshold Cherenkov in Belle, c) large water Cherenkov counters liket those in Superkamiokande and other neutrino detector facilities.
Cherenkov detectors contain two main elements: 1) A radiator/material through which the particle passes, and 2) a photodetector. As Cherenkov radiation is a weak source of photons, light collection and detection must be as efficient as possible. The presence of a refractive material specifically designed to detect some special particles is almost vindicated in general.
The number of photoelectrons detected in a given Cherenkov radiation detector device is provided by the following formula (derived from the Tamm-Frank formula simply taking into account the efficiency in a straightforward manner):
where is the path length of the particle in the radiator/material, is the efficiency for the collector of Cherenkov light and transducing it in photoelectrons, and
Remark: The efficiencies and the Cherenkov critical angle are functions of the photon energy, generally speaking. However, since the typical energy dependen variation of the refraction index is modest, a quantity sometimes called Cherenkov detector quality fact can be defined as follows
In this case, we can write
Remark(II): Cherenkov detectors are classified into imaging or threshold types, depending on its ability to make use of Cherenkov angle information. Imaging counters may be used to track particles as well as identify particles.
Other main uses/applications of the Vavilov-Cherenkov effect are:
1st. Detection of labeled biomolecules. Cherenkov radiation is widely used to facilitate the detection of small amounts and low concentrations of biomolecules. For instance, radioactive atoms such as phosphorus-32 are readily introduced into biomolecules by enzymatic and synthetic means and subsequently may be easily detected in small quantities for the purpose of elucidating biological pathways and in characterizing the interaction of biological molecules such as affinity constants and dissociation rates.
2nd. Nuclear reactors. Cherenkov radiation is used to detect high-energy charged particles. In pool-type nuclear reactors, the intensity of Cherenkov radiation is related to the frequency of the fission events that produce high-energy electrons, and hence is a measure of the intensity of the reaction. Similarly, Cherenkov radiation is used to characterize the remaining radioactivityof spent fuel rods.
3rd. Astrophysical experiments. The Cherenkov radiation from these charged particles is used to determine the source and intensity of the cosmic ray,s which is used for example in the different classes of cosmic ray detection experiments. For instance, Ice-Cube, Pierre-Auger, VERITAS, HESS, MAGIC, SNO, and many others. Cherenkov radiation can also be used to determine properties of high-energy astronomical objects that emit gamma rays, such as supernova remnants and blazars. In this last class of experiments we place STACEE, in new Mexico.
4th. High-energy experiments. We have quoted already this, and there many examples in the actual LHC, for instance, in the ALICE experiment.
VACUUM CHERENKOV RADIATION
Vacuum Cherenkov radiation (VCR) is the alledged and conjectured phenomenon which refers to the Cherenkov radiation/effect of a charged particle propagating in the physical vacuum. You can ask: why should it be possible? It is quite straightforward to understand the answer.
The classical (non-quantum) theory of relativity (both special and general) clearly forbids any superluminal phenomena/propagating degrees of freedom for material particles, including this one (the vacuum case) because a particle with non-zero rest mass can reach speed of light only at infinite energy (besides, the nontrivial vacuum itself would create a preferred frame of reference, in violation of one of the relativistic postulates).
However, according to modern views coming from the quantum theory, specially our knowledge of Quantum Field Theory, physical vacuum IS a nontrivial medium which affects the particles propagating through, and the magnitude of the effect increases with the energies of the particles!
Then, a natural consequence follows: an actual speed of a photon becomes energy-dependent and thus can be less than the fundamental constant of speed of light, such that sufficiently fast particles can overcome it and start emitting Cherenkov radiation. In summary, any charged particle surpassing the speed of light in the physical vacuum should emit (Vacuum) Cherenkov radiation. Note that it is an inevitable consequence of the non-trivial nature of the physical vacuum in Quantum Field Theory. Indeed, some crazy people saying that superluminal particles arise in jets from supernovae, or in colliders like the LHC fail to explain why those particles don’t emit Cherenkov radiation. It is not true that real particles become superluminal in space or collider rings. It is also wrong in the case of neutrino propagation because in spite of being chargeless, neutrinos should experiment an analogue effect to the Cherenkov radiation called the Askaryan effect. Other (alternative) possibility or scenario arises in some Lorentz-violating theories ( or even CPT violating theories that can be equivalent or not to such Lorentz violations) when a speed of a propagating particle becomes higher than c which turns this particle into the tachyon. The tachyon with an electric charge would lose energy as Cherenkov radiation just as ordinary charged particles do when they exceed the local speed of light in a medium. A charged tachyon traveling in a vacuum therefore undergoes a constant proper-time acceleration and, by necessity, its worldline would form an hyperbola in space-time. These last type of vacuum Cherenkov effect can arise in theories like the Standard Model Extension, where Lorentz-violating terms do appear.
One of the simplest kinematic frameworks for Lorentz Violating theories is to postulate some modified dispersion relations (MODRE) for particles , while keeping the usual energy-momentum conservation laws. In this way, we can provide and work out an effective field theory for breaking the Lorentz invariance. There are several alternative definitions of MODRE, since there is no general guide yet to discriminate from the different theoretical models. Thus, we could consider a general expansion in integer powers of the momentum, in the next manner (we set units in which ):
However, it is generally used a more soft expansion depending only on positive powers of the momentum in the MODRE. In such a case,
and where . If Lorentz violations are associated to the yet undiscovered quantum theory of gravity, we would get that ordinary deviations of the dispersion relations in the special theory of relativity should appear at the natural scale of the quantum gravity, say the Planck mass/energy. In units where we obtain that Planck mass/energy is:
Lets write and parametrize the Lorentz violations induced by the fundamental scale of quantum gravity (naively this Planck mass scale) by:
Here, is a dimensionless quantity that can differ from one particle (type) to another (type). Considering, for instance , since the seems to be ruled out by previous terrestrial experiments, at higer energies the lowest non-null term will dominate the expansion with . The MODRE reads:
and where the label in the term is specific of the particle type. Such corrections might only become important at the Planck scale, but there are two exclusions:
1st. Particles that propagate over cosmological distances can show differences in their propagation speed.
2nd. Energy thresholds for particle reactions can be shifted or even forbidden processes can be allowed. If the -term is comparable to the -term in the MODRE. Thus, threshold reactions can be significantly altered or shifted, because they are determined by the particle masses. So a threshold shift should appear at scales where:
Imposing/postulating that , the typical scales for the thresholds for some diffent kind of particles can be calculated. Their values for some species are given in the next table:
We can even study some different sources of modified dispersion relationships:
1. Measurements of time of flight.
2. Thresholds creation for: A) Vacuum Cherenkov effect, B) Photon decay in vacuum.
3. Shift in the so-called GZK cut-off.
4. Modified dispersion relationships induced by non-commutative theories of spacetime. Specially, there are time shifts/delays of photon signals induced by non-commutative spacetime theories.
We will analyse this four cases separately, in a very short and clear fashion. I wish!
Case 1. Time of flight. This is similar to the recently controversial OPERA experiment results. The OPERA experiment, and other similar set-ups, measure the neutrino time of flight. I dedicated a post to it early in this blog
In fact, we can measure the time of flight of any particle, even photons. A modified dispersion relation, like the one we introduced here above, would lead to an energy dependent speed of light. The idea of the time of flight (TOF) approach is to detect a shift in the arrival time of photons (or any other massless/ultra-relativistic particle like neutrinos) with different energies, produced simultaneous in a distant object, where the distance gains the usually Planck suppressed effect. In the following we use the dispersion relation for only, as modifications in higher orders are far below the sensitivity of current or planned experiments. The modified group velocity becomes:
and then, for photons,
The time difference in the photon shift detection time will be:
where D is the distance multiplied (if it were the case) by the redshift to correct the energy with the redshift. In recent years, several measurements on different objects in various energy bands leading to constraints up to the order of 100 for . They can be summarized in the next table ( note that the best constraint comes from a short flare of the Active Galactic Nucleus (AGN) Mrk 421, detected in the TeV band by the Whipple Imaging Air Cherenkov telescope):
There is still room for improvements with current or planned experiments, although the distance for TeV-observations is limited by absorption of TeV photons in low energy metagalactic radiation fields. Depending on the energy density of the target photon field one gets an energy dependent mean free path length, leading to an energy and redshift dependent cut off energy (the cut off energy is defined as the energy where the optical depth is one).
2. Thresholds creation for: A) Vacuum Cherenkov effect, B) Photon decay in vacuum. By the other hand, the interaction vertex in quantum electrodynamics (QED) couples one photon with two leptons. When we assume for photons and leptons the following dispersion relations (for simplicity we adopt all units with M=1). Then:
Let us write the photon tetramomentum like and the lepton tetramomentum and . It can be shown that the transferred tetramomentum will be
where the r.h.s. is always positive. In the Lorentz invariant case the parameters are zero, so that this equation can’t be solved and all processes of the single vertex are forbidden. If these parameters are non-zero, there can exist a solution and so these processes can be allowed. We now consider two of these interactions to derive constraints on the parameters . The vacuum
Cherenkov effect and the spontaneous photon-decay .
A) As we have studied here, the vacuum Cherenkov effect is a spontaneous emission of a photon by a charged particle . These effect occurs if the particle moves faster than the slowest possible radiated photon in vacuum!
In the case of , the maximal attainable speed for the particle is faster than c. This means, that the particle can always be faster than a zero energy photon with
and it is independent of . In the case of , i.e., decreases with energy, you need a photon with . This is only possible if .
Therefore, due to the radiation of photons such an electron loose energy. The observation of high energetic electrons allows to derive constraints on and . In the case of , in the case with n=3, we have the bound
Moreover, from the observation of 50 TeV photons in the Crab Nebula (and its pulsar) one can conclude the existens of 50 TeV electrons due to the inverse Compton scattering of these electrons with those photons. This leads to a constraint on of about
where we have used in this case.
B) The decay of photons into positrons and electrons should be a very rapid spontaneous decay process. Due to the observation of Gamma rays from the Crab Nebula on earth with an energy up to . Thus, we can reason that these rapid decay doesn’t occur on energies below 50 TeV. For the constraints on and these condition means (again we impose n=3):
3. Shift in the GZK cut-off. As the energy of a proton increases,the pion production reaction can happen with low energy photons of the Cosmic Microwave Background (CMB).
This leads to an energy dependent mean free path length of the particles, resulting in a cutoff at energies around . This is the the celebrated Greisen-Kuzmin-Zatsepin (GZK) cut off. The resonance for the GZK pion photoproduction with the CMB backgroud can be read from the next condition (I will derive this condition in a future post):
Thus in Lorentz invariant world, the mean free path length of a particle of energy 5.1019 eV is 50 Mpc i.e. particle over this energy are readily absorbed due to pion photoproduction reaction. But most of the sources of particle of ultra high energy are outside 50 Mpc. So, one expects no trace of particles of energy above on Earth. From the experimental point of view AGASA has found
a few particles having energy higher than the constraint given by GZK cutoff limit and claimed to be disproving the presence of GZK cutoff or at least for different threshold for GZK cutoff, whereas HiRes is consistent with the GZK effect. So, there are two main questions, not yet completely unsolved:
i) How one can get definite proof of non-existence GZK cut off?
ii) If GZK cutoff doesn’t exist, then find out the reason?
The first question could by answered by observation of a large sample of events at these energies, which is necessary for a final conclusion, since the GZK cutoff is a statistical phenomena. The current AUGER experiment, still under construction, may clarify if the GZK cutoff exists or not. The existence of the GZK cutoff would also yield new limits on Lorentz or CPT violation. For the second question, one explanation can be derived from Lorentz violation. If we do the calculation for GZK cutoff in Lorentz violated world we would get the modified proton dispersion relation as described in our previous equations with MODRE.
4. Modified dispersion relationships induced by non-commutative theories of spacetime. As we said above, there are time shifts/delays of photon signals induced by non-commutative spacetime theories. Noncommutative spacetime theories introduce a new source of MODRE: the fuzzy nature of the discreteness of the fundamental quantum spacetime. Then, the general ansatz of these type of theories comes from:
where are the components of an antisymmetric Lorentz-like tensor which components are the order one. The fundamental scale of non-commutativity is supposed to be of the Planck length. However, there are models with large extra dimensions that induce non-commutative spacetime models with scale near the TeV scale! This is interesting from the phenomenological aside as well, not only from the theoretical viewpoint. Indeed, we can investigate in the following whether astrophysical observations are able to constrain certain class of models with noncommutative spacetimes which are broken at the TeV scale or higher. However, there due to the antisymmetric character of the noncommutative tensor, we need a magnetic and electric background field in order to study these kind of models (generally speaking, we need some kind of field inducing/producing antisymmetric field backgrounds), and then the dispersion relation for photons remains the same as in a commutative spacetime. Furthermore, there is no photon energy dependence of the dispersion relation. Consequently, the time-of-flight experiments are inappopriate because of their energy-dependent dispersion. Therefore, we suggest the next alternative scenario: suppose, there exists a strong magnetic field (for instance, from a star or a cluster of stars) on the path photons emitted at a light source (e.g. gamma-ray bursts). Then, analogous to gravitational lensing, the photons experience deflection and/or change in time-of-arrival, compared to the same path without a magnetic background field. We can make some estimations for several known objects/examples are shown in this final table:
1st. Vacuum Cherenkov and related effects modifying the dispersion relations of special relativity are natural in many scenarios beyond the Standard Relativity (BSR) and beyond the Standard Model (BSM).
2nd. Any theory allowing for superluminal propagation has to explain the null-results from the observation of the vacuum Cherenkov effect. Otherwise, they are doomed.
3rd. There are strong bounds coming from astrophysical processes and even neutrino oscillation experiments that severely imposes and kill many models. However, it is true that current MODRE bound are far from being the most general bounds. We expect to improve these bounds with the next generation of experiments.
4th. Theories that can not pass these tests (SR obviously does) have to be banned.
5th. Superluminality has observable consequences, both in classical and quantum physics, both in standard theories and theories beyond standard theories. So, it you buid a theory allowing superluminal stuff, you must be very careful with what kind of predictions can and can not do. Otherwise, your theory is complentely nonsense.
As a final closing, let me include some nice Cherenkov rings from Superkamiokande and MiniBoone experiments. True experimental physics in action. And a final challenge…
FINAL CHALLENGE: Are you able to identify the kind of particles producing those beautiful figures? Let me know your guesses ( I do know the answer, of course).
Figure 1. Typical SuperKamiokande Ring. I dedicate this picture to my admired Japanase scientists there. I really, really admire that country and their people, specially after disasters like the 2011 Earthquake and the Fukushima accident. If you are a japanase reader/follower, you must know we support your from abroad. You were not, you are not and you shall not be alone.
Figure 2. Typical MiniBooNe ring. History: I used this nice picture in my Master Thesis first page, as the cover/title page main picture!
Before becoming apparent superluminal readers, we are going to remember and review some elementary notation and concepts from the relativistic Doppler effect and the starlight aberration we have already studied in this blog.
Let us consider and imagine the next gedankenexperiment/thought experiment. Some moving object emits pulses of light during some time interval, denoted by in its own frame. Its distance from us is very large, say
Question: Does it (light) arrive at time ? Suppose the object moves forming certain angle according to the following picture
Time dilation means that a second pulse would be experiment a time delay , later of course from the previous pulse, and at that time the object would have travelled a distance away from the source, so it would take it an additional time to arrive at its destination. The reception time between pulses would be:
In the range , the time interval separation measured from both pulses in the rest frame on Earth will be longer than in the rest frame of the moving object. This analysis remains valid even if the 2 events are not light beams/pulses but succesive packets or “maxima” of electromagnetic waves ( electromagnetic radiation).
Astronomers define the dimensionless redshift
where, as it is common in special relativity, ,
The 3 interesting limits of the above expression are:
1st. Receding emitter case. The moving object moves away from the receiver. Then, we have supposing a completely radial motion in the line of sight, and then a literal “redshift” ( lower frequencies than the proper frequencies)
2nd. Approaching emitter case. The moving object approaches and goes closer to the observer. Then, we get , or motion inward the radial direction, and then a “blueshift” ( higher frequencies than those of the proper frequencies)
3rd. Tangential or transversal motion of the source. This is also called second-order redshift. It has been observed in extremely precise velocity measurements of pulsars in our Galaxy.
Furthermore, these redshifts have all been observed in different astrophysical observations and, in addition, they have to be taken into account for tracking the position via GPS, geolocating satellites and/or following their relative positions with respect to time or calculating their revolution periods around our planet.
Remark: Quantum Mechanics and Special Relativity would be mutually inconsistent IF we did not find the same formual for the ratios between energy and frequencies at different reference frames.
EXAMPLE: The emission line of the oxygen (II) [O(II)] is, in its rest frame, . It is observed in a distant galaxy to be at . What is the redshift z and the recession velocity of this galaxy?
Solution. From the definition of wavelength in electromagnetism , adn . Then,
, and thus
From the radial velocity hypothesis, we get
and thus or
Note that this result follows from the hypothesis of the expansion of the Universe, and it holds in the relativistic theory of gravity, General Relativity, and it should also holds in extensions of it, even in Quantum Gravity somehow!
Remember: Stellar aberration causes taht the positions on the sky of the celestial objects are changing as the Earth moves around the Sun. As the Earth’s velocity is about , and then , it implies an angular separation about . Anyway, it is worth mentioning that the astronomer Bradley observed this starlight aberration in 1729! A moving observer observes that light from stars are at different positions with respect to a rest observer, and that the new position does not depend on the distance to the star. Thus, as the relative velocity increases, stars are “displaced” further and further towards the direction of observation.
Now, we are going to the main subject of the post. I decided to review this two important effects because it is useful to remember then and to understand that they are measured and they are real effects. They are not mere artifacts of the special theory of relativity masking some unknown reality. They are the reality in the sense they are measured. Alternative theories trying to understand these effects exist but they are more complicated and they remember me those people trying to defend the geocentric model of the Universe with those weird metaphenomenon known as epicycles in order to defend what can not be defended from the experimental viewpoint.
In order to make our discussion visual and phenomenological, I am going to consider a practical example. Certain radio-galaxy, denoted by 3C 273 moves with a velocity
Knowing the rate expansion of the universe and the redshift of the radiogalaxy, its distance is calculated to be about . To obtain the relative tangential velocity, we simply multiply the angular velocity by the distance, i.e. .
From the above data, we get that the apparent tangential radial velocity of our radiogalaxy would be about . Indeed, this observation is not isolated. There are even jets of matter flowin from some stars at apparent superluminal velocities. Of course this is an apparent issue for SR. How can we explain it? How is it possible in the SR framework to obtain a superluminal velocity? It shows that there is no contradiction with SR. The (fake and apparent) superluminal effect CAN BE EXPLAINED naturally in the SR framework in a very elegant way. Look at the following picture:
-A moving object with velocity with respect to Earth, approaching to Earth.
-There is some angle in the direction of observation. And as it moves towards Earth, with our conventions, $lates \theta\approx\pi=180\textdegree$
-The moving object emits flashes of light at two different points, A and B, separated by some time interval in the Earth reference frame.
-The distance between those two points A and B, is very small compared with the distance object-Earth, i.e., .
Question: What is the time separation between the receptions of the pulses at the Earth surface?
The solution is very cool and intelligent. We get
A: time interval
B: time interval
Note that !
From this equations, we get a combined equation for the time separation of pulses on Earth
The tangential separation is defined to be
so, the apparent velocity of the source, seen from the Earth frame, is showed to be:
Remark (I): IFF AND !
Remark (II): There are some other sources of fake superluminality in special relativity or general relativity (the relativist theory of gravity). One example is that the phase velocity and the group velocity can indeed exceed the speed of light, since from the equation , it is obvious that whenever that one of those two velocities (group or phase velocity) are lower than the speed of light at vacuum, the another has to be exceeding the speed of light. That is not observable but it has an important rôle in the de Broglie wave-particle portrait of the atom. Other important example of apparent and fake superluminal motion is caused by gravitational (micro)lensing in General Relativity. Due to the effect of intense gravitational fields ( i.e., big concentrations of mass-energy), light beams from slow-movinh objects can be magnified to make them, apparently, superluminal. In this sense, gravity acts in an analogue way of a lens, i.e., as it there were a refraction index and modifying the propagation of the light emitted by the sources.
Remark (III): In spite of the appearance, I am not opposed to the idea of superluminal entities, if they don’t break established knowledge that we do know it works. Tachyons have problems not completely solved and many physicists think (by good reasons) they are “unphysical”. However, my own experience working with theories beyond special/general relativity and allowing superluminal stuff (again, we should be careful with what we mean with superluminality and with “velocity” in general) has showed me that if superluminal objects do exist, they have observable consequences. And as it has been showed here, not every apparent superluminal motion is superluminal!Indeed, it can be handled in the SR framework. So, be aware of crackpots claiming that there are superluminal jets of matter out there, that neutrinos are effectively superluminal entities ( again, an observation refuted by OPERA, MINOS and ICARUS and in complete disagreement with the theory of neutrino oscillations and the real mass that neutrino do have!) or even when they say there are superluminal protons and particles in the LHC or passing through the atmosphere without any effect that should be vissible with current technology. It is simply not true, as every good astronomer, astrophysicist or theoretical physicist do know! Superluminality, if it exists, it is a very subtle thing and it has observable consequences that we have not observed until now. As far as I know, there is no (accepted) observation of any superluminal particle, as every physicist do know. I have discussed the issue of neutrino time of flight here before:
Final challenge: With the date given above, what would the minimal value of be in order to account for the observed motion and apparent (fake) superluminal velocity of the radiogalaxy 3C 273?