CMS search for exotics at 5/fb, is that a HSCP at 700 GeV?

January 31, 2012

Today CMS have delivered a new report of searches for exotic particles including SUSY using datasets up to 5/fb. I dont think I have ever seen so many new results released in one presentation (Eva Halkiadakis). Sadly there are no new discoveries and nothing can be considered to be inconsistent with the standard model, but it would be exceptional if we could not pick at least one exceptional datapoint from so many plots and here it is.

This is from a search for heavy stable charged particles and three isolated events were seen in the same bin at 700 GeV

Update: A video of the talk has now been uploaded. Motl has posted a useful analysis for this plot.


Should you boycott Elsevier?

January 30, 2012

Some people include a few notable bloggers are saying that we should all boycott Elsevier who publish science journals and sell them at a good profit margin. Does this make sense? I wont answer that question but Iwill make this point: Elsevier is a profit making business who can set its margins according to how well it can persuade people that its products have good value (e.g. by attracting good authors to give it a high impact factor), and how well it can keep its costs down (e.g. by attracting unpaid reviewers) Elsevier have been doing this a little bit better than some of its rivals. If a boycott now reduces these margins they will increase them for any other publisher that is used instead by those authors and reviewers. Result: back to square one.

Of course there are non-profit organisations that publish journals, but the cost of their journals is not really that much better and they are not necessarily better at being open access either. If they were, then there would be no publishers making profits. So is there a real solution to the problem? If scientists don’t want to pay a high price for someone to organize their peer-review they have to find an efficient way to do it themselves. They have already found efficient ways to do the publishing and distribution (e.g. arXiv). Now they have to do the same for the more difficult task of peer-review. Until they do that any boycotts will be a futile game of pushing lumps and scientists will have to continue paying the market price for a commercial service.

That is my opinion, what do you think?

Update: John Baez has posted another follow-up discussing what else can be done to replace journals for peer-review. Apparently the life sciences are now ahead of maths and physics on this!

 


Seminar Watch 2012

January 28, 2012

The New Year is well under way and already ATLAS has released its first conference note for 2012 updating the search for supersymmetry at 2/fb. They see a small excess of events, i.e. four where 1.7 ± 0.9 were expected. This is nothing that trial error cannot account for but it shows that the update to 5/fb still has a potential to produce ground-breaking results. CMS will be the first to provide that update with a seminar scheduled for Tuesday covering various searches including SUSY and exotica. We can rule out the possibility of anything really startling being announced, by the lack of rumours :) but there can always be first tantalizing hints.

After that the next big events for HEP results are the Aspen conference from 11th February and Moriond from 3rd March. The latter is the most likely venue for further results at 5/fb to be presented. The official Higgs combination for the LHC should appear and final Higgs results from the Tevatron using their full dataset are also expected.

Meanwhile the LHC itself has already entered its hardware recommissioning phase at the end of the Winter shutdown with half of the eight cryogenics plants already cooling down. The running schedule for 2012 is available showing a start date of 11st March for beams. This is 17 days later than last year. By then they will have decided on the running parameters for next year. The main ones to consider are the energy, bunch spacing and beta*. This year hopes are running high that the centre of mass energy will go up from 7 TeV to 8TeV but this depends on the results of hardware tests. They will not want to take any risk. The bunch spacing could go down from 50ns to 25ns which would please the experiments because it reduces pile-up, but realistically I think this would require too much scrubbing and preparation. It may result in a much lower luminosity for the year. They would be better to stick to 50ns and be sure of a good clean run. A better squeeze with a lower beta* is a promising option for pushing the luminosity to higher levels without risking higher intensities. These matters will be discussed at Chamonix from 6th February. If that is too technical a summary session will be webcast on 15th February.

It is too soon to list all the conferences where new results may appear this year but the biannual ICHEP meeting deserves a special mention. This is the largest conference for HEP and this year it will take place in Melbourne from 4th July. By then they may have doubled the LHC dataset if everything has run very smoothly. Discovery potential rises another notch.


Favorite Explanations

January 18, 2012

The Edge question for 2012 is “What is your favorite deep, elegant or beautiful explanation?” There are lots of interesting answers given but my favorite is not included. That’s partly because they did not have the foresight to ask me, but never mind I can post it here.

My favorite explanation is Stephen Hawking’s argument for why the area of a black hole horizon increases. This is a very non-trivial result from gravitational dynamics and yet the explanation can be summarized almost rigorously in words without any equations. Not only is it elegant but it is also very deep since it leads to the idea that the area of a black hole is related to entropy, a hunch that Hawking later clarified with his theory of black hole radiation and thermodynamics. This in turn led to the information loss paradox which was explained by the holographic principle. It is a remarkably persuasive train of thought that takes us on a journey far beyond anything that experiment or observation can currently reach, an amazing demonstration of the power of the human mind. I am sure I do not need to describe the details of these ideas to most readers of this blog. Any of you that are not familiar can read about them in Wikipedia when it comes back from its anti-anti-piracy sulk.

Anyway, here is Hawking’s beautiful explanation quoted from “a Brief History of Time”

“I had already discussed with Roger Penrose the idea of defining a black hole as the set of events from which it was not possible to escape to a large distance, which is now the generally accepted definition. It means that the boundary of the black hole, the event horizon, is formed by the light rays that just fail to escape from the black hole, hovering forever just on the edge. It is a bit like running away from the police and just managing to keep one step ahead but not being able to get clear away!
Suddenly I realized that the paths of these light rays could never approach one another. If they did, they must eventually run into one another. It would be like meeting someone else running away from the police in the opposite direction – you would both be caught! ~ But if these light rays were swallowed up by the black hole, then they could not have been on the boundary of the black hole. So the paths of light rays in the event horizon had always to be moving parallel to, or away from, each other. ~ If the rays of light that form the event horizon, the boundary of the black hole, can never approach each other, the area of the event horizon might stay the same or increase with time but it could never decrease because that would mean that at least some of the rays of light in the boundary would have to be approaching each other. In fact, the area would increase whenever matter or radiation fell into the black hole. ~ This nondecreasing property of the event horizon’s area placed an important restriction on the possible behaviour of black holes.”

 


Relativity FAQ: Why is “c” used for the speed of light?

January 8, 2012

Back in the days before blogs and web forums when we used to discuss physics on usenet I was for a short while the editor of the Physics FAQ. Although the FAQ is still cared for by Don Koks it’s role has been mostly superseded by Wikipedia for better or for worse. In my time I wrote a number of articles for the FAQ with a special interest in the relativity questions. A lot of new discoveries have been made since then and some of them impact the cosmology sections. I’d like to update a few of my earlier articles and rewrite some others in my own words. I’ll post them here over time and I may even write some new ones. Feel free to add comments or ask further questions.

Why is “c” used for the speed of light?

The short answer

According to the science fiction writer Isaac Asimov, the reason why c is used as the symbol for the speed of light is that c stands for celeritas, the Latin word for speed [1], but is this really true?

Tracing the origin of the use of a given symbol in science is not always straight forward. Scientists rarely trouble to note the origin of the notations they use. When Einstein wrote his first papers on the theory of relativity he chose V as his symbol for the speed of light [2]. In 1907 he suddenly changed it to c [3] without explanation but it is likely that he was being influenced by earlier usage rather than choosing his own notation. From that point on the notation was used by everyone.

When you try to track where this came from and why, three possible explanations come up. There is indeed some evidence to show that c was sometimes used as a generic symbol for speed because it stands for celeritas, but there are two more specific influences that seem more concrete. One comes from the use of the letter c as the speed variable in the wave equation. This can be traced back to the work of Euler who developed it in two and three dimensions during the eighteenth century. When he choose the symbol c he had already used a and b as other variables so the most obvious reason for his choice was that c was simply the next letter in the alphabet.

Another equally plausible explanation has its origins in a paper from 1856 by Weber and Kohlrausch on electricity and magnetism [4]. They introduced a constant c with dimensions of speed that was important to the interrelationships between the two forces. It is clear from the text and subsequent usage that c simply stood for constant. Later this quantity was shown to be  related to electromagnetic wave propagation and was then popularised as a symbol for the speed of light.

It is surprising that the notation seems to have more than one origin and it is not easy to determine which of these was the most influential, but probably all three played a part over time in establishing the latter c as the universally accepted symbol for the speed of light.

Does c stand for celeritas?

In 1959 Isaac Asimov penned an article for a sci-fi magazine entitled “C for Celeritas” in which he claimed he knew why c is used for the speed of light. “As for c, that is the speed of light in vacuum, and if you ask why c, the answer is that it is the initial letter of celeritas, the Latin word meaning speed.” he wrote. The article was reprinted in some of his books [1] which sold many copies to budding scientists, so it is not surprising that since then, Asimov’s answer has become a factoid repeated in many other articles and books.

However, if you go back and read his essay you discover that Asimov merely stated his case in that one sentence. He made no further attempt to justify his theory for the origin of the c notation. So is his claim really born out by history, or was c originally introduced as a variable standing for something else?  The special theory of relativity is based on the principle that the speed of light is constant; so did c stand for “constant”, or did it simply appear by accident in some text where all the other likely variables for speed had already been used up?

It is certainly true that the letter c has been used as a generic symbol for speed since at least the 18th century. For example it is also usually used as the standard symbol for the speed of sound. Starting with the Latin manuscripts of the 17th century, such as Galileo’s “De Motu Antiquiora” or Newton’s “Principia”, we find that they often use the word “celeritas” for speed.  However, their writing style was very geometric and descriptive.  They did not tend to write down formulae where speed is denoted by a symbol.  Possibly the earliest example of the letter c being used for speed can be found in a work written in the eighteenth century.  In 1716 Jacob Hermann published a Latin text called “Phoronomia”, meaning the science of motion [5].  In it he developed Newton’s mechanics in a form more familiar to us now, except for the Latin symbols.  His version of the basic Newtonian equation F = ma was dc = p dt, where c stands for “celeritas” meaning speed, and p stands for “potentia”, meaning force.

Physicists of the nineteenth century would have read the classic Latin texts on physics, and would have been aware that c could stand for “celeritas”.  As an example, Lorentz used c in 1899 for the speed of the Earth through the ether [6].  Typically Einstein had little interest in Latin at school considering it to be a useless subject. Yet we know that even Einstein used c for speed outside relativity. In a letter to a friend about a patent for a flying machine, he used c  for the speed of air flowing at a mere 4.9 m/s [7].

All this lends plausibility to the argument that c stands for celeritas but there is also persuasive evidence for other origins of the notation, so the best we can really say is that the generic use of  c as a symbol for speed helped the notation to prevail over other alternatives in use around the turn of the twentieth century.

Does c stand for constant?

The idea that c might stand for constant rather than celeritas may seem a little too obvious to be true, but there is good reason to propose that this is really the correct origin of the notation.

Although c is now the universal symbol for the speed of light, the most common symbol in the nineteenth century was an upper-case V which Maxwell had started using in 1865 [8]. Maxwell may have borrowed this from Foucault who had used the same symbol earlier in his thesis of 1853 [9].  That was then the notation adopted by Einstein for his seminal papers on relativity from 1905.

The most convincing origin of the letter c being used for the speed of light can be traced back to a paper of 1856 by Weber and Kohlrausch [4].  They defined and measured a quantity denoted by c that they used in an electrodynamics force law equation.  It became known as Weber’s constant and was later shown to have a theoretical value equal to the speed of light times the square root of two.  In 1894 Paul Drude modified the usage of Weber’s constant so that the letter c became the symbol for the speed of electrodynamic waves [10].  In optics Drude continued to follow Maxwell in using an upper-case V for the speed of light.  Progressively the c notation was used for the speed of light in all contexts as it was picked up by Max Planck, Hendrik Lorentz and other influential physicists.  By 1907 when Einstein switched from V to c in his papers, it had become the standard symbol for the speed of light in vacuum for electrodynamics, optics, thermodynamics and relativity.

In France and England the electromagnetic constant was often symbolised by a lower case v rather than Drude’s c.  This was directly due to Maxwell, who wrote up a table of experimental results for direct measurements of the speed of light on the one hand and electromagnetic experiments on the other.  He used V for the former and v for the latter.  Maxwell described a whole suite of possible experiments in electromagnetism to determine v.  Those that had not already been done were performed one after the other in England and France over the three decades that followed [11].  In this context, lower case v was always used for the quantity measured.  But using v was doomed to pass away once authors had to write relativistic equations involving moving bodies, because v was just too common a symbol for velocity.  The equations were much clearer when something more distinct was used for the velocity of light to differentiate it from the velocity of moving bodies.

While Maxwell always used v in this way, he also had a minor use for the symbol c in his widely read treatise of 1873.  Near the end he included a section about the German electromagnetic theory that had been an incomplete precursor to his own formulation [12].  This theory, expounded by Gauss, Neumann, Weber, and Kirchhoff, attempted to combine the laws of Coulomb and Ampère into a single action-at-a-distance force law.  The first versions appeared in Gauss’s notes in 1835 [13], and the complete form was published by Weber in 1846 [14].  Many physicists of the time were heavily involved in the process of defining the units of electricity.  Coulomb’s law of electrostatic force could be used to give one definition of the unit of charge while Ampère’s force law for currents in wires gave another.  The ratio between these units had the dimension of a velocity, so it became of great practical importance to measure its value.  In 1856 Weber and Kohlrausch published the first accurate measurement [4].  To give a theoretical backing they rewrote Weber’s force law in terms of the measured constant and used the symbol c.  This c appeared in numerous subsequent papers by German physicists such as Kirchhoff, Clausius, Himstedt, and Helmholtz, who referred to it as “Weber’s constant”.  That continued until the 1870s, when Helmholtz discredited Weber’s force law on the grounds of energy conservation, and Maxwell’s more complete theory of propagating waves prevailed.

Two papers using Weber’s force law are of particular note.  One by Kirchhoff [15] and another by Riemann [16] related Weber’s constant to the velocity at which electricity propagated.  They found this speed to be Weber’s constant divided by the square root of two and it was very close to the measured speed of light.  It was already known from experiments by Faraday that light was affected by magnetic fields, so there was already much speculation that light could be an electrodynamic phenomenon.  This was the inspiration for Maxwell’s work on electrodynamics, so it is natural that he finally included a discussion of the force law in his treatise [12].  The odd thing is that when Maxwell wrote down the force law, he changed the variable c so that it was smaller than Weber’s constant by a factor of the square root of two.  So Maxwell was probably the first to use c for a value equal to the speed of light, although he defined it as the speed of electricity through wires instead.

So c was used as Weber’s constant having a value of the speed of light times the square root of two, and this can be related to the later use of c for the speed of light itself.  Firstly, when Maxwell wrote Weber’s force law in his treatise in 1873, he modified the scale of c in the equation so that it reduced by a factor of the square root of two.  Secondly, when Drude first used c in 1894 for the speed of light [10], the paper by Kirchhoff that he cited [17] was using c for Weber’s constant, so Drude had made the same adjustment as Maxwell.  It is impossible to say if Drude copied the notation from Maxwell or invented it indepednently, but he did go one step further in explicitly naming his c as the velocity of electrodynamic waves which by Maxwell’s theory was also the speed of light.  He seems to have been the first to do so, with Lorentz, Planck, and others following suit a few years later.

So to understand why c became the symbol for the speed of light we now have to find out why Weber used it in his force law.  In the paper of 1856 [4] Weber’s constant was introduced with these words “.. and the constant c represents that relative speed, that the electrical masses e and e must have and keep, if they are not to affect each other.” This was written in German and the modern German word for constant is “konstant” but at that time before spelling was completely fixed it was often written in German with an initial “c”. This was the case here. So it appears that c originated as a letter standing for “constant” rather than “celeritas”.  However, it had nothing to do with the constancy of the speed of light until much later.

Why is c used in the wave equation?

The speed of light is not the only place in physics where c is commonly used to represent speed. It is also the standard notation for the speed of sound and is used in the linear wave equation. In its simplest scalar form it can be written like this

\frac{\partial^2 u} {\partial t^2} = c^2 \nabla^2 u

You might think that this form of the wave equation is used because physicists most commonly think of the wave equation in the context of relativistic physics where c represents the speed of light. It would be a natural progression to then use the same symbol for other wave equations even if c then stands for a different speed such as the speed of sound or water waves. The literature shows that this is not the correct explanation because this usage in the wave equation goes back before light was even known as a wave phenomena.

In 1747 Jean d’Alembert made a mathematical study of the vibrating string and discovered the one dimensional wave equation [18], but he wrote it without the velocity constant.  Euler generalised d’Alembert’s equation to include the velocity, denoting it by the letter a [19].  The general solution is y = f(x - at) + f(x + at), representing a supposition of two waves of fixed shape travelling in opposite directions with velocity a.

Euler was one of the most prolific mathematicians of all time.  He wrote hundreds of manuscripts and most of them were in Latin.  Was it Euler then who established a convention for using c for “celeritas”?  In 1759 he studied the vibrations of a drum, and moved on to the 2-dimensional wave equation.  This he wrote in the form we are looking for with c now the velocity constant [20].

The wave equation became a subject of much discussion, being investigated by all the great mathematicians of the époque including Lagrange, Fourier, Laplace, and Bernoulli.  Through their works, Euler’s form of the wave equation with c for the speed of wave propagation was carved in stone for good.  To a first approximation, sound waves are also governed by the same wave equation in three dimensions, so it is not surprising that the speed of sound also came to be denoted by the symbol c.  This predates relativity and can be found, for example, in Lord Rayleigh’s classic text “Theory of Sound” published in 1877 [21].

So can we tell why Euler used the letter c? Most of his work on the wave equation was written in French rather than Latin and there is no reason to suppose he was thinking of the Latin word celeritas. It is true that the French sometimes use the word célérité to mean the phase velocity in the context of the wave equation. They even sometimes talk of “célérité du son” and “ccélérité de la lumière” for the speed of sound and light, but there is no evidence that this goes back to the time of Euler. Euler just used “vitesse” the more common French word for speed. It is more likely that the French usage was adopted later as a consequence of Euler’s use of c in the wave equation.

Euler’s style of notation for algebra and calculus was very similar to basic usage in mathematics and physics today. He commonly picked sequences of letters such as a, b, c or x, y, z for variables, sometimes changing to upper case or Greek letters with little indication that the letters actually stood for anything in particular. In fact Euler set many of the notations we use today including the use of the Greek letter \pi for 3.14159… One possibility for his choice of c is that when he moved from the one dimensional wave equation to the 2-dimensional case, he may have used a and b for the components of the wave velocity, then writing c^2 = a^2 + b^2 for the magnitude. Unfortunately there is no written evidence to support this theory either.

There is however a case for an even more mundane answer. When he introduced the variable c for possibly the first time he defined it with an equation that he wrote

\frac{a g}{b} = cc

a and b were simply two physical variables he had already introduced, so c seems to be chosen as merely the next letter in the alphabet.

Why did Einstein switch notation for the speed of light?

A lower-case c has been consistently used to denote the speed of light in textbooks on relativity almost without exception since such books started to be written. For example, the notation was used in the earliest books on relativity by Lorentz (1909) [22], Carmichael (1913) [23], Silberstein (1914) [24], Cunningham (1915) [25], and Tolman (1917) [26]. That was not the case just a few years before. In his earliest papers on relativity from 1905—1907 Einstein began by using an upper-case V for the speed of light [2]. At that time he was also writing papers about the thermodynamics of radiation, and in those he used up upper-case L [27]. All of these papers appeared in volumes of the German periodical Annalen Der Physik. Einstein’s notation changed suddenly in 1907 in a paper for the Journal Jahrbuch der Radioaktivität und Elektronik [3]. There he used the lower case c, and his most famous equation E = m c^2 finally took it’s familiar form.

It is not difficult to find where the upper case V had come from. Maxwell used it extensively in his publications on electrodynamics from as early as 1865 [8]. It was the principal symbol for the speed of light in his 1873 treatise on electrodynamics [28]. By the 1890s Maxwell’s book was in wide circulation around the world and there were translations available in French and German. It is no surprise then that the upper-case V is found in use in such papers as the 1887 report of Michelson and Morley on their attempt to find seasonal variations in the speed of light [29]. That was written in the United States, but the same notation was also found across Europe, from papers by Oliver Lodge [30] and Joseph Lamor [31] in England, to the lecture notes of Poincaré in France [32], and the textbooks of Paul Drude in Germany [33] and Lorentz in the Netherlands [34]. Einstein’s education at the Polytechnik in Zurich had not covered Maxwell’s theory of Electrodynamics in the detail he would have liked, but he had read a number of extra textbooks on the new Electrodynamics as self study, so he would have been familiar with the standard notations. From 1905 he wrote his first papers on relativity, and there is nothing extraordinary in his choice of the symbol V for the speed of light [2].

Why then, did he change it to c in 1907? At that time he still worked as a clerk in the Bern patent office, but for the previous two years he had been in regular correspondence with eminent physicists such as Max Laue, Max Planck, Wilhelm Wien and Johannes Stark. Stark was the editor of the Jahrbuch, and had asked Einstein to write the article in which he was to first use the letter c. Einstein mentioned to Stark that it was hard for him to find the time to read published scientific articles in order to acquaint himself with all the work others have done in the field, but he had seen papers by Lorentz, Kohn, Monsegeil and Planck [35]. Lorentz and Planck in particular had been using c for the speed of light in their work. Lorentz had won the 1902 Nobel prize for physics, and it is not surprising that physicists in Germany had now taken up the same notation. It is also not surprising that Einstein, who was looking for an academic position, aligned himself to the same conventions at that time. Another reason for him to make the switch was that the letter c is simply more practical. The upper-case V would have been easily confused with the lower case v appearing in the equations of relativity for the velocity of moving bodies or frames of reference. Einstein must have found this confusion inconvenient, especially in his hand written notes.

Looking back at papers of the late 1890s, we find that Max Planck and Paul Drude in particular were using the symbol c at that time. The name of Drude is less well known to us today. He worked on relations between the physical constants and high precision measurements of their value. These were considered to be highly worthy pursuits of the time. Drude had been a student of Voigt, who himself had used a Greek \omega for the speed of light when he wrote down an almost complete form of the Lorentz transformations in 1887 [36]. Voigt’s \omega was later used by a few other physicists [37, 38,50], but Drude did not use his teacher’s notation. Drude first used the symbol c in 1894, and in doing so he referenced a paper by Kirchhoff [10]. As already mentioned, Paul Drude also used V. In fact he made a distinction of using V in the theory of optics for the directly-measured speed of light in vacuum, whereas he used c for the electromagnetic constant that was the theoretical speed of electromagnetic waves. This is seen especially clearly in his book “Theory of Optics” of 1900 [39], which is divided into two parts with V used in the first and c in the second part. Although Maxwell’s theory of light predicted that they had the same value, it was only with the theory of relativity that these two things were established as fundamentally the same constant. Other notations vied against Drude’s and Maxwell’s for acceptance. Herglotz [38] opted for an elaborate script B, while Himstedt [40], Helmholtz [41] and Hertz [42] wrote the equations of electrodynamics with the letter A for the reciprocal of the speed of light. In 1899 Planck backed Drude by using c, when he wrote a paper introducing what we now call the Planck scale of units based on the constants of electrodynamics, quantum theory and gravity [43]. Drude and Planck were both editors of the prestigious journal Annalen Der Physik, so they would have had regular contact with most of the physicists of central Europe.

Lorentz was next to change notation. When he started writing about light speed in 1887 he used an upper case A [44], but then switched to Maxwell’s upper case V [45]. He wrote a book in 1895 [46] that contained the equations for length contraction, and was cited by Einstein in his 1907 paper. While Drude had started to use c, Lorentz was still using V in this book. He continued to use V until 1899 [47], but by 1903 when he wrote an encyclopedia article on electrodynamics [48] he too used c. Max Abraham was another early user of the symbol c in 1902, in a paper that was seen by Einstein [49]. From Drude’s original influence, followed by Planck and Lorentz, by 1907 the c symbol had become the prevailing notation in Germanic science and it made perfect sense for Einstein to adopt it too.

What was the earliest symbol used for the speed of light?

It is hard to be certain when a symbol was first used for the speed of light, but Doppler used the Greek letter \alpha in his paper on the frequency shift of light with velocity in 1842 [51]. There were other papers written about the speed of light before then but none of them seems to have used any symbol to represent it.

References

[1] Isaac Asimov “C for Celeritas” in “The Magazine of Fantasy and Science Fiction”, Nov-59 (1959), reprinted in “Of Time, Space, and Other Things”, Discus (1975), and “Asimov On Physics”, Doubleday (1976)
[2] A. Einstein, From “The Collected Papers, Vol 2, The Swiss Years: Writings, 1900—1909″, English Translation, he wrote five papers using V, e.g. “On the Electrodynamics of Moving Bodies”, Annalen Der Physik 17, pgs 891—921 (1905), “On the Inertia of Energy Required by the Relativity Principle”, Annalen Der Physik 23, pgs 371—384 (1907)
[3] A. Einstein, “On the Relativity Principle and the Conclusions Drawn From It”, Jahrbuch der Radioaktivität und Elektronik 4, pgs 411—462 (1907)
[4] R. Kohlrausch and W.E. Weber, “Ueber die Elektricitätsmenge, welche bei galvanischen Strömen durch den Querschnitt der Kette fliesst”, Annalen der Physik, 99, pg 10 (1856)
[5] J. Hermann, “Phoronomia”, Amsterdam, Wetsten, (1716)
[6] H.A. Lorentz, “Stokes’ Theory of Aberration in the Supposition of a Variable Density of the Aether”, Proc. Roy. Acad. Amsterdam I, pg 443 (1899)
[7] A. Einstein, “The Collected Papers, Vol 5, The Swiss Years: Correspondence, 1902—1914″, English Translation, Doc 86 (1907)
[8] J. Clerk Maxwell, “A dynamical theory of the electromagnetic field”, Philos. Trans. Roy. Soc. 155, pgs 459—512 (1865).  Abstract: Proceedings of the Royal Society of London 13, pgs 531—536 (1864)
[9] J. Foucault, “Sur les vitesses relatives de la lumiére dans l’air et dans l’eau” (Paris, 1853)
[10] P. Drude, “Zum Studium des elektrischen Resonators”, Göttingen Nachrichten (1894), pgs 189—223
[11] e.g. J.J. Thomson and G.F.C. Searle, “A Determination of `v’, the Ratio of the Electromagnetic Unit of Electricity to the Electrostatic Unit”, Proc. Roy. Soc. Lond. 181, pg 583 (1890), M. Hurmuzescu, “Nouvelle determination du rapport v entre les unites electrostatiques et electromagnetiques”, Ann. de Chim. et de Phys., 7a serie T. X April 1897, pg 433. (1897)
[12] J. Clerk Maxwell, “A Treatise on Electricity and Magnetism”, Oxford Clarendon Press, Vol II; Chapter 23, section 849 (1873)
[13] K.F. Gauss, “Zur mathematischen Theorie der elektrodynamischen Wirkung” (1835), in “Werke”, Göttingen 1867; Vol. V, pg 602
[14] W. Weber, “Elektrodynamische Maassbestimmingen uber ein allgemeines Grundgesetz der elektrischen Wirkung”, Abh. Leibnizens Ges., Leipzig (1846)
[15] G. Kirchhoff, “Ueber die Bewegung der Elektricität in Leitern” Ann. Phys. Chem. 102, 529—544 (1857)
[16] G.F.B. Riemann, “Ein Beitrag zur Elektrodynamik”, Annalen der Physik und Chemie, pg 131 (1867)
[17] G. Kirchhoff, “Zur Theorie der Entladung einer Leydener Flasche”, Pogg. Ann. 121 (1864)
[18] J. d’Alembert, “Recherches sur les cordes vibrantes”, L’Académie Royal des Sciences (1747)
[19] L. Euler, “De La Propagation Du Son” Memoires de l’acadamie des sciences de Berlin [15] (1759), 1766, pgs 185—209, in “Opera physica miscellanea epistolae.  Volumen primum”, pg 432
[20] L. Euler, “Eclaircissemens Plus Detailles Sur La Generation et La Propagation Du Son Et Sur La Formation De L’Echo”, “Memoires de l’acadamie des sciences de Berlin” [21] (1765), 1767, pgs 335—363 in “Opera physica miscellanea epistolae.  Volumen primum”, pg 540
[21] J.W. Strutt, “Theory of Sound” Vol 1, pg 251, McMillan and Co. (1877)
[22] H.A. Lorentz, “The theory of Electrons and its applications to the phenomena of light and radiant heat”.  A course of lectures delivered in Columbia University, New York, in March and April 1906, Leiden (1909)
[23] R.D. Carmichael, “The Theory of Relativity”, John Wiley & Sons (1913)
[24] L. Silberstein, “The Theory of Relativity”, Macmillan (1914)
[25] E. Cunningham, “The Principle of Relativity”, Cambridge University Press (1914)
[26] R.C. Tolman, “The Theory of the Relativity of Motion”, University of California Press (1917)
[27] A. Einstein, e.g. “On the Theory of Light Production and Light Absorption”, Annalen Der Physik, 20, pgs 199—206 (1906)
[28] J. Clerk Maxwell, “A Treatise on Electricity and Magnetism”, Oxford Clarendon Press (1873)
[29] A.A. Michelson and E.W. Morley, “On the Relative Motion of the Earth and the Luminiferous Ether”, Amer. J. Sci. 34, pgs 333—345 (1887), Philos. Mag. 24, pgs 449—463 (1887)
[30] O. Lodge, “Aberration Problems”, Phil. Trans. Roy. Soc. 184, pgs 729—804 (1893)
[31] J. Larmor, “A Dynamical Theory of the Electric and Luminiferous Medium I”, Phil. Trans. Roy. Soc. 185, pgs 719—822 (1894)
[32] H. Poincaré, “Cours de physique mathématique.  Electricité et optique.  La lumière et les théories électrodynamiques” (1900)
[33] P. Drude, “Physik des Äthers auf elektromagnetischer Grundlage”, Verlag F. Enke, Stuttgart (1894)
[34] H. Lorentz, “Versuch einer Theorie der elektrischen und optischen Erscheinungen in bewegten Körpern”, Leiden (1895)
[35] A. Einstein, from “The Collected Papers, Vol 5, The Swiss Years: Correspondence, 1902—1914″, English Translation, Doc 58.
[36] W. Voigt, “Ueber das Doppler’sche Princip”, Goett. Nachr. 2, pg 41 (1887)
[37] M. Brillouin, “Le mouvement de la Terre et la vitesse de la lumière”, comptes rendu 140, pg 1674 (1905)
[38] G. Herglotz, “Zur Elektronentheorie”, Nachrichten von der Gesellschaft 6, pg 357 (1903)
[39] P. Drude, “The theory of optics”, translated from German by C.R. Mann and R.A. Millikan, New York, Longmans, Green, and Co. (1902)
[40] F. Himstedt, “Ueber die Schwingungen eines Magneten unter dem dämpfenden Einfluß einer Kupferkugel”, Nachrichten von der Gesellschaft 11, pg 308 (1875)
[41] H. Helmholtz, Berlin: Verl. d. Kgl. Akad. d. Wiss. (1892)
[42] H. Hertz, “Electric Waves”, Macmillan (1893)
[43] M. Planck, “Uber irreversible Strahlungsvorgange”, Verl. d. Kgl. Akad. d. Wiss. (1899)
[44] H.A. Lorentz, “De l’Influence du Mouvement de la Terre sur les Phenomenes Lumineux”, Arch. Neerl. 21, pg 103 (1887)
[45] H.A. Lorentz, “On the Reflection of Light by Moving Bodies”, Versl. Kon. Akad. Wetensch Amsterdam I, 74 (1892)
[46] H.A. Lorentz, “Versuch einer Theorie der elektrischen und optischen Erscheinungen in bewegten Körpern”, Leiden (1895)
[47] H. A. Lorentz, “Théorie simplifiée des phenomènes electriques et optiques dans des corps en mouvement”, Proc. Roy. Acad. Amsterdam I 427 (1899)
[48] H.A. Lorentz, “Maxwells elektromagnetische Theorie” Encyclopädie der Mathematischen Wissenschaften.  Leipzig, Teubner (1903)
[49] M. Abraham, “Prinzipien der Dynamik des Elektrons”, Annalen der Physik 10, pgs 105—179 (1903)
[50] E. Cohn, “Zur Elektrodynamik bewegter Systeme. II”, Sitzungsberichte der Königlich Preussischen Akademie der Wissenschaften zu Berlin, der physikalisch-mathematischen Classe (1904)
[51] C. Doppler, “Über das farbige Licht der Doppelsterne”, 1842


Stephen Hawking 70th Birthday Symposium

January 3, 2012

He may be the weakest man in the world barely able to flutter a cheek muscle to work his computer and speech synthesizer, but Stephen Hawking’s life has had a powerful impact on physics and its popular image. This month he turns 70 and to celebrate there will be a symposium at Cambridge this Sunday and it will be webcast. The technology for the broadcast is being organised by Intel who provided a powerful computer at Cambridge for cosmological simulations as well as the CPU for Hawking’s speech synthesizer of course. So here he is in lego (by Iain Heath)

and here is a youtube video from Intel

 


All I want for Xmas is a Lego Hadron Collider

December 25, 2011

While you eat your Xmas dinner the engineers at the LHC are busy fixing up the systems ready for next year’s run, there is no time to lose. Here is a picture.

This model of the ATLAS detector was made by postdoc Sascha Mehlhase using 9500 LEGO bricks. The 1:50 scale model took 81 hours to put together including planning it on Lego Digital Designer .

MERRY CHRISTMAS!


Searching for Earth’s Twin

December 22, 2011

This has been a great year for experimental big science with ground-breaking findings in particle physics and astronomy. One of the most remarkable breakthroughs has been the success in the search for planets around other stars. The word exoplanet which first appeared in print in 1995 according to google, and has become a popular term in news reports in just the last three, has become ever more familiar this year as reports from the Kepler space telescope have taken the number of candidate exoplanets into the thousands.

Kepler is constantly watching 145,000 main sequence stars in our nearby region of the Milky Way galaxy in the direction of the constellations Lyra and Cygnus. It is looking for the tiny dimming of light that tells us that a planet has passed in front of the stars disk. By recording the amount of dimming,  how long it lasts for and how frequently it repeats, Kepler can estimate the size and orbit of the planet. In February NASA released a catalog of 1236 candidate exoplanets and this month the number increased to 2326. These have to be verified by ground based observation and so far the catalog of confirmed exoplanets has 716 entries.

The real interest about exoplanets concerns whether or not there is other life in the universe, and if there is, how common it is. A whole new industry of exoplanetary statistics has been born with scientists inventing habitability indexes that can be applied to the exoplanet catalogs to gauge which ones could support life. One habitable planet catalog has two exoplanets regarded as more Earth-like than Mars. These are HD 85512b and Gleise 581d, both found earlier this year, but they are rather large to be comfortable for us to live on. If they have an atmosphere it is could be too thick due to the stronger gravity. Already Venus has a thick atmosphere making the pressure too hot and high pressure for us to survive. If we discovered an exoplanet like Venus we would be very excited because it is in the habitable zone and is very similar to Earth in size. Finding out about its atmosphere would be difficult from a distance of many light-years.

This week some new “Earth-twins” were announced Kepler 20f and Kepler 20e. They are very similar in size to earth but they are not in their Suns habitable zone where the temperature would be about right for liquid water and conditions similar to Earth. It is good news that Kepler has proven that it can find planets of this size but we will need to wait before we find ones where we could really live. It is said that these planets may have been further from their star in the past so that life could have formed there in the past. This just serves to emphasize one more characteristic an exoplanet must have if it is likely to support life as we know it on Earth. It must stay in a stable orbit around a stable star for billions of years so that life can evolve without being obliterated by heat or freezing.

Kepler has a planned lifespan of 3.5 years and may have its life extended. This should give it time to find some more earth-like planets orbiting Sun-like stars with periods of about 1 year. Kepler takes time to find these because they need to pass in front of their star at least twice to confirm their existence and orbit. because Kepler works by looking at such transit events it only sees planetary systems whose disk is aligned with Earth. If it were looking at earth from afar it would have only a one in 700 chance that this alignment occurred. If Kepler finds  one Earth-like planet we could guess that there are 700 in the sample they are looking at, which represents 1 millionth of the stars in our galaxy, but will it really find any?

My guess at this stage is that it will find a number of Earth-sized planets and a number of small planets in the habitable zone, but the statistics may be against it finding an Earth-sized planet in the habitable zone of a stable star like ours. Probably we will be able to estimate how many such planets there are and it may be something like a million in our galaxy. It could be a lot less. The next step will be tp determine how many are likely to have the right chemical mixture to form water and an Earth-like atmosphere. We don’t yet know the answer, but it is exciting that the data we need to answer these questions is starting to become available.


Has CERN Found the God Particle? A calculation.

December 16, 2011

Yes I know that physicists don’t use the term “God particle” but it has entered into popular culture and when the terms “Higgs Boson” and “God Particle” were trending on Twitter and Google earlier this week it was the latter that went the highest. Contrary to what some scientists imagine of the interested public, very few think that there is some religious significance attached to the particle because of this name, it’s just a catchy moniker and we need not be afraid to use it.

Following the CERN announcement earlier this week, physicists have been giving some very different assessments of the chances that the ATLAS and CMS detectors have seen the Higgs boson. The CERN DG says merely that they have seen some “interesting fluctuations”, while Tommaso Dorigo, (an expert on the statistical aspects of the CMS analysis) calls it “firm evidence“. Theorist Lubos Motl is even more positive. He says that it is a “sure thing“, but another theorist Matt Strassler has criticised such positive reports. He regards the situation as 50/50 and backed this up with a poll of experimenters that came up 9 to 1 in favour of uncertainty. This contrasts with a similar poll by Bill Murray who is lead Higgs analyst for the ATLAS collaboration. In an interview he reported a 10 to 0 vote that the Higgs had indeed been found.

What is the question?

So can we make a more objective and quantitative assessment of the current level of uncertainty over the result? You might want to know the probability that the Higgs Boson has been seen for example. Unfortunately this quantity depends on the prior probability that the Higgs Boson exists. Theoretical physicists have a very wide range of opinions on this depending on which theories they favour. Experimenters are supposed to make their assessments independently of such prejudices. So how can we measure the situation objectively?

Luckily there is a different question that is model independent. We can ask for the probability that the experiments would produce results as strong or stronger than those reported if there were no Higgs Boson. This conditional probability removes the theory dependence in the question so the answer should be a number that everyone could in principle agree on. The smaller this probability is, the better the certainty that the Higgs Boson has been found.

Before we can calculate the result we must define precisely what we mean by the “strength of the result”. This has to be a single number so it should come from the combined results of both experiments. I will define it to be the maximum value of the CLs likelihood ratio anywhere on the plot. This takes into account both the exclusion side and the signal side of the statistics and is standard use for Higgs searches. Don’t worry if you are not familiar with this quantity, it will become clearer in a minute.

Can we trust the combination?

The Higgs combination group have tried to spread propaganda that my unofficial combinations cannot be trusted because only people familiar with the inner details of the experimental analysis are capable of doing it correctly. This is not true. I repeatedly acknowledge that my method is an approximation and that only the official combination can be used to claim a discovery, but it is a good approximation and is perfectly acceptable for making a rough assessment of the combined certainty.

They warn that people should not add the event histograms from separate experiments but that is not how my combination is done. They say that only the experts can understand the systematic uncertainties of the detectors well enough to do the combination, but these uncertainties are all built into the individual exclusion plots that they have shown and are therefore taken into account when I combine them. They warned in the past that there are correlations between the background calculations because both experiments use the same algorithms. These correlations are there and must be accounted for to get the most accurate combination possible, but they have been shown to be small. You can ignore these correlations and still get a very good approximation.

In fact the largest source of error comes from the fact that the approximate combination method assumes a flat normal probability distribution at each mass point, when in reality a more complex function based on Poisson distributions would be correct. Happily the central limit theorem says that any error function with a finite variance becomes approximately normal given high enough statistics, so the approximation gets better as more data is added.

When the combination group published their first result in November I was able to compare it with my unofficial combination done in September. This confirmed that the approximation was good. This was no surprise to me because it had already been demonstrated with the Tevatron combinations and some earlier unpublished LHC combinations. I acknowledge that my combinations for some of the individual channels were not so good because the number of events has been low, especially for the ZZ channels. This will have improved for the latest results because there is now much more data but still these individual channel combinations should be considered less certain than the overall combination.

The assessment I am doing today depends mainly on that, so this is not a big issue, however it is worth showing one further comparison between my combination and the official one for a signal channel. the plot below shows the official combination for the diphoton channel published in November when ATLAS used just 1.1/fb and CMS used just 1.7/fb. The red line is the unofficial result from viXra. It will be interesting to see how much this has improved for 5/fb.

What must be evaluated?

It is possible to do a systematic evaluation of the probability in question using the combined plot. This takes into account the statistical uncertainties as well as the theoretical uncertainties in the background due to imprecise measurements of the standard model parameters (e.g. W mass) and the approximation methods used in the theory. It also includes the uncertainty in the energy resolution and other similar uncertainties in detector performance. All these things have been considered by the experts from the experimental collaborations and built into the plots, so we don’t need to know the details to do the calculation (If anyone tries to claim otherwise they are wrong)

However, there is also the possibility that the experimenters have made some more fundamental kind of error. There may be a subtle fault in the detectors that has not shown up in all the calibration tests which causes an excess on the plot where there should not be one. This should not happen because there are hundreds of people checking for such errors and they are all very competent. Nevertheless bad luck can strike and throw everything out. This has been the case before and it is probably the case with the OPERA result indicating that neutrinos are faster than light.

A second similar possibility is that the theorists have underestimated the accuracy of some of their calculations so that the background calculation is a little off in one mass range. The analysis involves subtracting a very small signal from a large background, especially in the diphoton channel, so the scope for magnifying any inaccuracy has to be considered. A miscalculation of the signal size is also possible but less likely to lead to a bad result.

As I said, the published plots include all the known experimental and theoretical uncertainties, but these other unknown errors in experiment and theory cannot be accounted for exactly. They can only be estimated based on past experience. Some “expert theorists” say that us more “naive theorists” don’t appreciate these facts. Do we really sound so stupid?

What is the chance of an experimental fault?

How often do experimental faults contribute to a false positive like the excess reported this week? We can only look at past performance but I am not aware of any careful surveys, so a guestimate is required. Someone else may be able to do better. The answer might be one in a hundred but let’s be more conservative and say one in ten. If you think it is more common please fell free to reevaluate for yourselves.

However, with the CERN Higgs result we have good evidence that such a fault is not the cause of the excess. That is because there are two independent experiments reporting a very similar result. ATLAS and CMS may seem very similar from the answers they produce, but the detector technologies they use are quite different. The chance of a common fault producing the excess in both detectors must therefore be very small. I am going to assume that this is negligible. If anyone thinks otherwise please explain why.

This means that if the excess is due to such a fault it must be a coincidence that it has a similar effect for both experiments. If there is a one in ten chance of a fault for one experiment, the chance for two independent experiments is one in 100, but even then that is the chance that they would produce the fault at different places. Lets have a look at the two signal plots together.

The positions on the maximum excess differ by about 2 GeV but the mass resolution is around 2% so this is not an inconsistency. If these excesses are produced by detector faults then the chance of them lining up so close would be small. How small? That depends on some unknowns. we can’t just say the fault could appear anywhere in the mass range, so let’s be conservative and just call it a one in three chance.

Overall then we arrive at a one in 300 chance for the observed excess to be explained by a coincidental combination of detector faults. I think this is conservative. Someone else might estimate it to be more probable.

What is the chance of a theoretical Fault?

The other outside possibility is that the result has been afflicted by a misunderstood background so that the observed excess is really just a subtle effect of the Higgsless standard model that the theorists failed to recognize or estimate correctly. Again this is unlikely but it happens and must be considered. How often does it happen? Once in a hundred perhaps? I will be more cautious and assume one in ten.  You may think that is an underestimate in which case you can make your own evaluation.

But again we have more than one place to look. The separate experiments could well be affected by the same theoretical error but the different decay channels are much more independent. There may be some small chance that a single theoretical error could affect all the channels but this would have a small probability, say one in a hundred. If you think it is bigger please justify how that could happen.

So now let’s look at the combined signal plots for the three main channels; diphoton, ZZ->4l and WW->lvlv. For the WW plot I can’t use the latest CMS results because the plots shown are frankly rubbish quality. I hope they will improve them before publication. However the WW channel has good sensitivity even with less data so I will show the combination from the summer.

All three channels show an excess in the same low mass region so if this is due to independent faults it would require a coincidence. However, the excess is not as good in ZZ and WW as in the diphoton channel. I am going to put the probability at one in a hundred overall and add to this the probability of one in a hundred for a common fault that affects all three. So the overall chance for a fault from theory is one in 50. Some people will say that this is a low estimate and some people will say that it is low. Others will say that it is nonsense to attempt such an estimate. Never mind, I am just giving it my best honest shot. Let others do the same.

What is the chance of a statistical fluctuation?

The last thing to consider is what is the probability of getting s signal as string or stringer than that observed according to the statistical analysis. Actually this also takes into account some theoretical uncertainty and measurement error, but mostly it is statistical. This is a probability that can be worked out more scientifically, but it does include the Look Elsewhere Effect which is partly subjective.

First consider what would be the chance of seeing a signal as strong as the one reported at the fixed mass point of the maximum excess if in fact there was no Higgs Boson. The plot shows a three sigma excess at 124-125 GeV. This would have been much stronger if the peaks from the two experiments had coincided more closely, possibly about 4 sigma. This discrepancy may be due to some detector calibration that could be corrected but it is correct that we do not take that possibility into account. The 3 sigma excess is what we should work with.

As everyone knows, the probability of a three sigma fluctuation is one in 370, but that allows for fluctuations up or down. So the probability for an excess this big or stronger at this point is one in 740. But we need to know the probability for an excess this strong anywhere on the plot. In other words we need to multiply by the Look Elsewhere Effect factor. Have a look at the plot over the entire range

Notice that for the entire range from 130 GeV to 600 GeV the line remains within 2 sigma of the zero line. Big deviations are indeed rare but how rare?

Another point to consider is that if there was a three sigma fluctuation at say 180GeV, the Higgs would still be excluded at that point. This would not count as such a strong signal. This is why I specified that the strength should be measured using the CLs statistic which takes the ration of the probability for the signal hypothesis over the probability from the null hypothesis. This means that the probability of getting a signal as strong in the regions where the Higgs is excluded is much smaller. In fact we can neglect this altogether. So we need only count the regions from 114 GeV (using LEP)  to about 135 GeV and perhaps 500 to 600 GeV. Hoe big is the LEE factor for these regions. This depends on the width of the signal which we see to be about 5 GeV in the low mass range due to mass resolution of the detector, and which is much bigger above 500 GeV due to a very large natural width for a high mass Higgs Boson. The LEE factor will therefore be about 6 but let’s call it 10 to be extra cautious.

This gives a final answer for the probability of a fluctuation to be about one in 70.

The final answer?

Combining the three things I have considered i get an overall probability for such a strong signal if there is no Higgs to be about 1 in 30. Perhaps I have failed to account for combinations where more than one of these effects could combine. That requires further coincidences but lets just call the overall result 1 in twenty. In other words, everything considered I take the observed result to be a two sigma effect.

What about prior probabilities?

there is one more thing you need to take into account when considering how likely a result of any number of sigmas significance is going to stand the test of time. That is your prior estimate for the probability of it being true. The OPERA neutrino observation is a good example of an extreme case. A six sigma effect was observed, but he prior probability of neutrinos going faster than light would be considered very small by most theoretical physicists. It follows that the probability for this result to go away is quite high despite the statistical significance. An experimental fault is likely to be the biggest contributing factor despite the care of the experimenters.

In fact most 3 sigma excesses for observations beyond the standard model do go away. This is because the prior chance of any one such effect being correct is very small. You can consider this to be part of the Look Elsewhere Effect too. However, the observation of the Higgs Boson is a very different case. Most theoretical physicists would estimate the prior probability for the existence of a Higgs(like) Boson is very high. The standard model provides a very simple explanation of electroweak symmetry breaking but there is no simple way to understand a Higgsless universe. This make the prior probability high which means that the chance of the 2 sigma result going away is small. There is a bigger chance however that it could move to a different mass.

Not everyone agrees with this. some people do not think that the Higgs Boson can exist. Stephen hawking is one of them. these people would assign a low value to the prior probability that the signal for the Higgs will be seen and so they will consider it very likely that the present observation will go away. I doubt that there are enough people of this opinion to account for much doubt among the experimenters.

How long will it take to settle This?

To claim a discovery the combined results must give a 5 sigma excess without considering  the Look Elsewhere Effect. How long this takes depends on a certain amount of luck. If the peaks of the excesses comes closer together with more data, then the excess will grow faster than you would otherwise expect. In that case the matter might be settled with just twice as much data and the whole thing will be over by the summer. On the other hand, if they are unlucky it could easily require the full dataset from 2012 to get enough data to finish the job properly. It will then not be until March 2013 when the combination is ready that they will finally be able to declare a discovery.

Endnote:

I have been accused by theorist and blogger Matt Straddler of being over enthusiastic about the case for the Higgs Boson and the strength of the latest results.  In fact I have not made any overly-strong claims. examples of things I have said previously include

“The result is very convincing if you start from the assumption that there should be a Higgs Boson somewhere in the range. Everywhere is ruled out except 115 GeV to 130 GeV and within that window there is a signal with the right strength at around 125 GeV with 3 sigma significance. They will have to wait for that to reach 5 sigma to claim discovery and next years data should be enough to get there or almost.”

“Some caution is merited. The signal is only 3 sigma combined and the possibility of systematic contributions is there. However, look elsewhere effect is very small given that most regions are strongly excluded. Systematic effects look less likely because of consistency across channels. I agree with Dorigo’s more optimistic assessment but until they have 5 sigma it is not a discovery and collapse of the signal is not out of the question.”

“This is basically a half-full/half-empty result. You can state it optimistically or pessimistically according to your political requirements. Another twelve months will be needed to settle it, but it is much more probable that it will be settled with a positive outcome.”

I have not said anything stronger than this and I stand by what I said. These are not bold claims and are no different from what has been said by some of the prominent members of the collaborations.  I find it very bizarre that someone is insinuating that my conclusions are overstated and naive. My detailed assessment of the situation here bears out my earlier level of optimism. If anyone wants to criticize any aspect of my calculations I am open to discussion, but if you think I should just bow to the superior authority of people in apparently better positions, please forget it.


Higgs Boson Live Blog: Analysis of the CERN announcement

December 13, 2011

Good morning and welcome to what is expected to be an exceptional day for physics as CERN announces important new results in their hunt for the elusive Higgs Boson. Here in one mammoth expanding post I will be reporting on the search for the Higgs Boson in straight forward terms free form silly analogies and patronizing phrases such as “for the layman”. I hope that many interested people with varying degrees of foreknowledge  will find the level helpful. I will explain the basic preliminaries first but if there is anything you don’t understand just Google it or wing it.

The present excitement started to build during the summer when it became clear that the Large Hadron Collider experiment was gathering data at a much higher rate than anticipated, meaning that they would soon be able to tell whether the Higgs boson exists or not and most importantly, what mass it has.

I am a theoretical particle physicist based near London independent of the teams working at CERN,  and I have been following events at the Large Hadron Collider and blogging about them since it started colliding protons in 2009. In a minute I will answer a few basic questions about the Higgs for the uninitiated, including the Paxman question ”What does the Higgs boson look like?” Then I will be live-blogging the events from CERN as they happen, so first let’s look at the schedule of today’s events.

  • 14:00 – Fabiola Gianotti, spokesperson for the ATLAS collaboration delivers a 30 minute summary of their latest developments. ATLAS is the largest particle detector ever built and it sits on an intersection point of the Large Hadron Collider rings to observe the trillions of particle collision events taking place.
  • 14:30 – Guido Tonelli will talk about similar observations at the CMS experiment. CMS is another equally sophisticated but different and complementary detector placed diametrically opposite ATLAS on the LHC ring gathering another independent set of collision data.
  • 15:00 – When the talks end, which may not be on time, there will be an hour long technical discussion between the scientists about each others results. Until these talks the two 3000 strong teams of physicists had not officially compared their data so there will be much to talk about.
  • 15:20 – At this time we expect a release of information and pictures to the press as the scientific discussion continues.
  • 16:30 – Press conference. Questions and Answers from the experts
During these events I will be posting news and exclusive analysis right here as it happens. You can refresh this page for updates and post your own views and observations in the comments section. However, please accept that I may delete comments that I consider unhelpful to a general audience. You can continue to post broader material on the previous post about the rumours
Amongst other things I will be attempting to combine the results in real time as soon as the necessary plots become available. The CERN director General has forewarned us that the announcement today will not provide conclusive evidence for the existence, or non-existence of the Higgs boson, but that could be because the two experiments have not had time to combine their results. The official combination will not be ready until next year because the full computational process is long and difficult. However, it is possible to do a quick approximate “bloggers” combination that will allow us to anticipate what the eventual result will look like. In fact the method has been shown to be reasonably accurate in the past. I will be doing more combinations right here today.
Let me just reiterate that again. My combinations are approximate. They assume a flat normal probability distribution. That is a good approximation that improves as more data is added. They also assume that there are no correlations between uncertainties among the different parts of the experiments. This is not the case. Such correlations have a small effect that does not diminish with more data. In order to claim a discovery using a combination the collaborations will have to get together and do an official version the hard way and that will take time. However, my quick combination method is good enough to give a very good idea of what the final result will look like and it is certainly not “Nonsense” as some of the experimenters have tried to claim.

Why is the Higgs Boson so special?

During the 1960s and 1970s theoretical physicists using data from the first generation of particle accelerators assembled a theory of elementary particles known as the Standard Model. It included familiar particles such as the electron, photon and neutrinos as well as unseen quarks that bind to form protons and neutrons inside the atom. All the particles in the standard model are of two types with one exception.

The particles which build up matter are all spin-half fermions which obey an equation formulated by Dirac in 1928. This includes the three generation of quark pairs and the three corresponding pairs of leptons, the electron, muon and tauon with their neutrino partners. Each of these has an antimatter partner so there are 24 distinct fermions in the Standard Model.  The second set of particles are the spin-one bosons. These play the role of binding together the fermions with the electromagnetic force (the photon) the strong nuclear force (the gluons) and the weak nuclear force (the W and Z bosons) Of these only the W is charged and so has a distinct anti-particle, meaning that there are 5 different bosons.

Aside from these it was found that the standard model required one further particle. It was known that a consistent model of spin-half fermions and spin-one bosons free from infinities required gauge symmetry, that is a mechanism that would in theory make the bosons massless. On the other hand, nature had shown that the bosons that mediate the weak nuclear force must have mass. The solution was a mechanism worked out around 1960 by a number of physicists that introduces an unusual field into the theory. The field has an unorthodox energy potential that is minimised away from the central point of symmetry so that the value of the field in the vacuum state of space-time must be shifted away from the central point, thus breaking the underlying symmetry and giving mass to some of the particles.

Peter Higgs, one of the pioneers of this mechanism, pointed out that the remnant of this field in its broken form would have excitations corresponding to a unique elementary particle that might be observed as final confirmation of the theory. Unlike the other particles in the Standard Model, this one would be a spin-zero boson. Observation of this hypothetical particle named the Higgs Boson in his honour is what the Large Hadron Collider has been looking for 50 years later.

What does the Higgs Boson look like?

The Higgs Boson exists only for fleeting moments as a fuzzy quantum wave on scales smaller than the inner workings on the proton. It is therefore impossible both theoretically and practically to “see” it in the normal sense of the word. What we can see are traces of its existence in data gathered from countless collisions between high energy protons in the Large Hadron Collider.

In the LHC at CERN on the Swiss-Franco border near Geneva, physicists have been accelerating protons to unprecedented high energies in a circular underground ring 27 km in circumference. When the protons are brought together in a head-on collision the energy can form new particles, perhaps including some never observed before such as the Higgs boson. Many trillions of collisions have been observed but the processes that form a Higgs boson are so rare that only a few thousand are likely to have been created in the experiment so far.

Once created a Higgs boson should live for a fleeting 10-22 seconds, enough time for it to travel between 10 and a hundred times the width of the protons from which it emerged. Then it decays, usually into other particles, most often a matched pair of bottom/anti-bottom quarks which have a much longer lifetime of 10-12 seconds. As the bottom quarks fly apart a string of gluon flux stretches between them before breaking to form new quarks. These emerge along with the decay products of the bottom quarks as jets of hadrons that reach the detectors. Sometimes the bottom quarks will each decay into another quark plus a lepton (electron or muon) with an accompanying neutrino. The lepton makes tale-tale tracks in the detector while the neutrino flies off without a trail only to be guessed at when they add up the energy of all the other particles and notice that some is missing.  Unfortunately there are many other less remarkable processes that produce similar jets and leptons at the LHC making it very difficult to observe the Higgs Boson when it decays in this way.

If the latest rumours about the measurements at CERN are correct the Higgs Boson could have a mass approximately equal to that of a Caesium atom. If this is correct about one in 500 of the Higgs bosons produced will decay into two high energy photons that fly away in opposite directions. Unlike the bottom quarks these fly away cleanly carrying all their energy and momentum to the inner layers of the detectors where a surrounding vessel of liquid argon has been placed to capture them. There they produce a shower of lower energy particles that are carefully tracked so that their energy and trajectory can be measured to reveal the parameters of the original photon. During all of this years run at the LHC this may have happened only a dozen times in each detector, but it could be enough to reveal the Higgs Boson.

Such photons will be thousands of times more energetic than the harmful gamma rays that emanate from nuclear reactions, but they are still photons identical to those of light which differ only by having less energy. If you want to know what the Higgs boson looks like it is the faint glow of these rare photons that answers the question most directly. In the LHC they shine faintly among the brighter radiation of other processes that produce equally energetic gamma rays. The ones coming from the Higgs Boson can only be noticed when enough have been detected to show up as a slightly brighter peak in the energy spectrum of thousands of observations. It is this that we are hoping to hear news of today.

A typical event with two high energy photons as recorded in CMS

Will the LHC find the Higgs Boson?

The theory of the Higgs Boson has been around a long time and all the other particles of the standard model have been found. Several of them were found after they were predicted by the model, especially the gluons, W and Z bosons and top quark. This means that the theory of the Standard Model is in very good stead experimentally. Indeed, physicists have been hoping for some experimental deviation from its predictions for decades and have come away disappointed. Every experiment just seems to confirm its correctness with ever more accuracy. (There are some exceptions such as measurement of the muon magnetic anomaly and the cosmological observation of dark matter that seem to point to something beyond the standard model at higher energy)

With such success it is no wonder that the theorists are quite confident that the Higgs Boson will be found as the last missing piece of the Standard Model. However, experiment is the ultimate judge of nature and theorists are not always right. A minority of physicists notably including Stephen Hawking and Nobel laureate Martinus Veltman have said that they do not believe the Higgs Boson will be found because according to their theories it cannot exist. They are considered contrarians by other physicists but until the “Goddamned” particle has been found nobody can be certain.

One thing that is sure is that the Large Hadron Collider will either discover the Higgs Boson or rule it out as predicted by the Standard Model. If all goes well this will be achieved before the end of 2012, perhaps much sooner. It has been said that if the Standard Model Higgs Boson is ruled out it will be an even greater discovery than its mere existence. This is not just excuses for what some people may portray as a failure. Such a result would indeed by a breakthrough inevitably leading to a new and better understanding of physics.

It is also possible that the Higgs Boson exists but that its characteristics are different from those of the Standard Model. In particlular, it may decay into other lighter unknown particles making it hard to detect. In that case it might appear not to be there even though it is. That will still count as ruling out the Standard Model Higgs but until further experiments are done it will not be known whether it does not exist at all, or is merely hidden from view by non-standard processes. Another even more exciting possibility is that there is more than one Higgs Boson possibly including some heavier versions that are charged. This is predicted by some grander theories such as supersymmetry

However, results from the LHC so far suggest that whatever happens there will be something positive to report today. It will not be quite a full discovery but it will be a strong signal that something like a Higgs Boson exists. Although we have heard some quite detailed rumours already, it is only by seeing the actual graphs that we can get a good idea of what the possibilities are. All physicists are now eagerly waiting to see them.

What will we learn?

You might think that since the Higgs Boson was predicted 50 years ago its discovery today will not be very exciting news. Indeed, before the LHC started collecting data, many physicists saw its discovery as inevitable and uninteresting. This view has changed, partly because nothing else has been quick to manifest itself at the LHC as hoped. This means that the Higgs Boson is likely to be the leading discovery of any new physics.

The mass of the Higgs Boson is a free parameter in the Standard Model. Once it is known, all other features such as its lifetime and interaction rates can be calculated. However, analysis of the physics of the Standard Model shows that if the mass is not within strict limits the theory will break down at higher energies. In particular, if it is too light the vacuum will not be sufficiently stable, but we know that this cannot be happening in the real world. The mass range left where the Higgs Boson can still be found includes a range where this would be a problem for the theory.

If it is lighter than 126 GeV then that may be an indication of new physics that could be found with more data. The theory of supersymmetry which is very popular with theorists actually favours the lighter Higgs and corrects problems with the stability of the vacuum, but it does not support well a heavier mass.  For these reasons today’s announcement could signal the directions of research for future physics depending on what mass is indicated by the experiments.

What will we be looking out for today?

Despite the rumours, it is not certain exactly what will be shown today, but we are hoping for full reporting of all the results in the Higgs search from the two individual experiments. This would include the analysis of each possible decay mode that the experiments can currently observe plus two combination of results from all channels, one for ATLAS and one for CMS. The amount of data collected this year corresponds to an integrated luminosity of 5 inverse femtobarns (5/fb) in each experiment so anything less than this is not complete.

There are three sets of decay channels that are currently of special relevance to the search,

  • diphoton (a.k.a. digamma) where the Higgs Boson decays directly to two photons
  • WW -> lvlv where the Higgs Boson decays to two oppositely charged W bosons which then decay to electrons or muons and associated neutrinos
  • ZZ -> 4l where it decays to two neutral Z bosons that then each decay to two oppositely charged electons or muons making four leptons in total.

If recent rumours are correct it is the diphoton channel that holds the most interest with a signal of a possible Higgs Boson at a mass of 125 GeV, but we will be very interested in the other channels to see if there is any supporting evidence or signs of anything at other masses. It will be especially interesting to see of the earlier weak signal at 140 GeV has gone away entirely. These and other channels may provide signs of something interesting at higher masses but most likely there will just be a strengthening of the evidence for exclusion above 140 GeV.

What do the plots mean?

During the presentations delivered by the collaborations today we will see a lot of new graphs. If you are not familiar with these they will require some explanation. The ones that everyone will be looking out for are the “Brazil band” plots, named for their distinctive green and yellow bands. These plots are the main way of showing the results from each Higgs Boson decay mode as well as the all important combinations.

Here is the best LHC combination plot for Higgs boson searches made public prior to today. It incorporates about a third as much data as gathered during the whole year and was shown in November at the Hadron Collider Physics conference, but I have redrawn it to add some extra features. (With any plot on this blog you can click on the image to enlarge for a clearer picture)

The horizontal axis is marked with the range of possible masses for the Higgs Boson. The units are Giga electron-Volts as an energy equivalent of mass. This is the standard way to measure mass in an accelerator experiment. If the Higgs Boson has a mass of 125 GeV as rumoured you should be able to see where it would appear on this plot.

The black line is usually called “Observed CLs” and represents the calculated result from all the experiments. Its value for any given mass gives a quantity labelled “95% Confidence Level limit for σ/σSM” on the vertical axis. What does this mean exactly? Take an example; At 200 GeV the observed CLs has a value of about 0.6. What this says is that if the signal cross-section over all the decay modes were just 0.6 times the amount expected if the Standard Model is correct and the Higgs Boson has a mass of 200 GeV, then there would be a 95% probability of seeing more events than they did.   This is a roundabout way of saying that we have seen far too few events, so we can rule out the Higgs Boson at this mass with some confidence.

When the black line descends below the red horizontal line at 1.0 on the vertical axis, people sometimes say that the Higgs Boson has been ruled out at 95% confidence level at this mass. This is not strictly correct because such confidence would depend on our prior assessment of the probability for the existence of the Higgs Boson in this mass range in the first place, and also the “Look Elsewhere Effect” would have to be considered. Such knowledge is subjective and dependent on outside influences, but loosely thinking you can interpret it that way.

In the background of the plot I have shaded areas in various grades of pink. The lightest pink indicates an exclusions at 95% confidence. This is often stated as 2-sigma significance because statistically it corresponds to 2 standard deviations away from the normal expectation. Darker shades of pink indicate 3-sigma and 4-sigma confidence. Until recently it was generally accepted that 2-sigmas was enough to rule out the presence of the Higgs Boson at a given mass, but recently people have said they want 5-sigma significance, the same as for the discovery of a new particle. I think in reality most people will accept 3-sigma for exclusions.

But we are no longer just interested in exclusions. How do we know from this plot if the Higgs Boson has been seen? This is where the yellow and green bands come in. The central blue line indicates the expected value under the condition that no Higgs Boson exists at a given mass. The green and yellow bands are the 1-sigma and 2-sigma deviations from that expectation. This means that if there is no Higgs Boson the observed CLs line should wander within these two bands. Statistically it is likely to go outside the yellow bands for about 5% of its range. When we look at the plot we see that this is indeed the case. Despite the excess exceeding 2-sigmas around the 140 GeV region we can only say that the result is consistent with the lack of a Higgs Boson over the whole range. That is not a very encouraging way to put it. Notice that mass ranges where there are excesses will be background shaded in grades of green.

Can we at least say that the plot is also consistent with the hypothesis that there is a Higgs Boson somewhere in the mass range? We can see that it is excluded over the range from 140 GeV to 480 GeV at 2-sigma significance but we can still accept the possibility that it is in the low or high mass region. there are theoretical reasons to strongly doubt that it is at the high mass end so the range 115 GeV to 140 GeV is the best bet.

It is possible to display the same results in a different way that handles the existence and exclusion of the Higgs Boson in a more symmetrical way. This is sometimes called the “best fit” plot or “signal”  plot and for the combination above it would look like this.

The experimenters don’t often display their results this way, but as  theorist I find it the best plot to give a feel for where we stand. If I can get the data from the talks today I may show some of these plots.

The black line varies around a range of signal values where a signal of zero would indicate just the Standard Model background with no Higgs Boson and a signal of one is just the right strength for its existence. The blue and cyan bands are error bands (mostly statistical) around the observed data. When the blue and cyan error bands extend over the whole range between the red line at zero and the green line at one we really have no indication either way for a Higgs Boson or its exclusion in the mass range. However, when it starts to settle on one of either the red or green line and moves clear of the other, then we know that we have the right signal strength for the presence or absence of the Higgs Boson.

What will happen after today?

Whatever comes out today there will still be a lot more work to be done. At the moment the LHC is shutdown for the Winter to allow for maintenance and to save electricity at a time when domestic demand is highest. It will startup again in February next year. Meanwhile the physicists will be using the time to continue the analysis of the data already collected during 2011 and that will include preparing the official combination of today’s results from ATLAS and CMS.

Next year the LHC will run again, probably at a slightly higher energy of 8 TeV rather than the 7 TeV used this year. It is expected to collect three times as much data in 2012 as it did in 2011 so by the end of the year they will have a total of at least 20/fb on tape for each of ATLAS and CMS. If they don’t already have enough data to know whether the Higgs Boson exists they almost certainly will by then.

More importantly, they will start to study the properties of the Higgs Boson to check that it matches the standard model by decaying into all types of lighter particle at the predicted rates. If it doesn’t then they will know that there is new physics outside the Standard Model to be understood.

That assumes that the standard Higgs Boson will show up. If it doesn’t they will have the job of looking for what replaces it . That can be done by looking at interactions between W bosons which should get stronger with increasing energy if there is no Higgs Boson until something gives. Present rumours suggest that the Higgs does exist but these WW scattering experiments will still be interesting.

After 2012 the LHC will shutdown for about 18 months to prepare it for running at higher energies, probably 13 TeV during 2015 and 14 TeV later. They will be searching for more new particles but they will also checking the parameters of the Standard Model including the Higgs Boson in more detail to eek out any signs of dark matter or anything else not seen before. The LHC will continue to run at higher luminosity and possibly even higher energy for perhaps another 30 years. This is just the beginning of what it has to do.

Live Blog starts Here


09:00 (times are Central European)

This morning ATLAS have released an update to the Higgs search in the WW -> lvlv channel. They are using 2.05/fb in place of the previous 1.66/fb so it is only a small advance. This had been around for some time unofficially but was not shown at the HCP2011 conference, Hopefully it will be obsolete in a matter of hours but here is the plot anyway. It provides 95% exclusion from 145 GeV to 200 GeV.

11:45

Just to remind everyone, the official build-up for this event is as follows:  ”These results will be based on the analysis of considerably more data than those presented at the summer conferences, sufficient to make significant progress in the search for the Higgs boson, but not enough to make any conclusive statement on the existence or non-existence of the Higgs.”

If you come here expecting a life-changing discovery to be announced you will be disappointed, but if you want to see some science in action taking a small step forwards you may enjoy.

12:00

With two hours to go the auditaurium was already full.

13:47

Here in the UK the BBC are already running reports on the network news. They are saying that each experiment is finding a blip in the same place giving a strong hint of the Higgs.

14:00

Speakers introduced, talks getting underway

14:15

ATLAS have updated the three most sensitive channels diphoton to 4.9/fb ZZ->4l to 4.8/fb and WW->lvlv to 2.1 (as above)

14:25

I have the CMS Combo, here it is with exclusion from 130 GeV up. Excess seen at about 123 GeV of 2.5 Sigma

14:30

Here is the CMS diphoton plot shwoing where the excess comes from, but there are other excesses nearly as big

14:32

Here is the ATLAS version from the talk. Updated from conference notes.

14:36

The CMS ZZ->4l clearly rules out the 140 GeV possibility, but has an excess at lower mass.

14:43

ATLAS ZZ->4l and full combo from the talk. Updated from conference notes.

ATLAS full combo from the talk. Updated from conference notes.

14:49

First talk is over, now over to CMS

CMS have two versions of the WW channel, cutbased and BDT

14:49

Here is the first of my unofficial combinations as the discussion time ends. This is the diphoton channels combined for ATLAS+CMS. Remember that this is approximate and you should not try to read the number of sigmas from this. I may revise it later when better version of the plots become available.

14:56

ATLAS have now released 3 new conference notes so I will update the pixtures

17:00

I have now digitised the CMS combined plot and produced this signal plot. It gives a clean indication for no Higgs about 130 GeV and the right size signal for a Higgs at about 125 GeV, but there is still noise at lower mass so chance that it could be moved.

17:42

Here is the same thing for the ATLAS data

17:49

Here is the fully combined exclusion plot. The signal fits best at 124 GeV and just makes 3-sigma. Remember the official version is likely to be a little different. This is just a quick approximation.

17:57

Here is the fully combined signal plot. It looks very convincing but the region below 120 GeV is not resolved yet. Until it is there will be a little room for doubt.

18:11

But of course we can clean up the lower region by including LEP and Tevatron too. An official combination with Tevatron data included is also planned

A zoomed version

20:57

Finally here is one last combination for diphoton + ZZ in CMS and ATLAS. These are the high-resolution channels so they give a cleaner signal, but without WW the significance is less.

Conclusions: The result is very convincing if you start from the assumption that there should be a Higgs Boson somewhere in the range. Everywhere is ruled out except 115 GeV to 130 GeV and within that window there is a signal with the right strength at around 125 GeV with 3 sigma significance. They will have to wait for that to reach 5 sigma to claim discovery and next years data should be enough to get there or almost. I calculate that they will need 25/fb per experiment at 7 TeV to make the discovery. A big congratulations to everyone from the LHC, ATLAS and CMS who found the Higgs when it hid in the hardest place.

I was lucky enough to meet Peter Higgs many years ago when I was a postdoc at Edinburgh and I have a big smile knowing that this has been achieved in his lifetime. Congratulations to him and the other physicists involved in discovering the mechanism of symmetry breaking. Finally, in case they are forgotten, well done also to all the phenomenologists who did the calculations to work out how the Higgs Boson could be found, not least John Ellis.

From here there is much more work to do in order to check that this particle seen today has exactly the characteristics of the Higgs, if indeed it is confirmed with more data. That will take many more years of runs at the LHC. It will also be exciting to see how this mass affects our understanding of what other physics could be in reach. I hope there are some Campaign corks popping at CERN this evening. They have had a remarkable year.


Follow

Get every new post delivered to your Inbox.

Join 103 other followers