Re-analysing LHC Run 2 data with cutting-edge analysis techniques allowed CMS physicists to address an old discrepancy.
Supersymmetry (SUSY) is an exciting and beautiful theory that answers some of the open questions in particle physics. It predicts that all known particles have a “superpartner” with somewhat different properties. For example, the heaviest quark of the Standard Model, the top quark, would have a superpartner called the top squark, or simply the “stop”. In 2021 the CMS collaboration analysed the entire set of collision data collected from 2016 to 2018 and found features suggesting that it might contain stop particles. In that case, “might” meant that there was less than 5% chance that data containing only known particles could look like what was observed. Instead of waiting many years to collect more data with the hope of reproducing this behaviour, the CMS collaboration decided to reanalyse the same data with upgraded analysis techniques.
The new analysis looks for the simultaneous production of pairs of stops. Each stop decays into a top quark accompanied by several lighter quarks or gluons, which then form bound states known as hadrons, ultimately creating clusters of particles reconstructed in the detector as “jets”. The signal footprint is therefore two top quarks and multiple jets. What makes the analysis challenging is that a very similar footprint is produced by one of the most common Standard Model processes in the LHC: the pair production of top quarks. Top quark production with many accompanying jets is a process that is difficult to accurately simulate, so to have a reliable determination of this background, it must be estimated from observed data.
A commonly used method of estimating backgrounds from data is called the “ABCD method”. It requires two uncorrelated observables that can discriminate between signal and background. The data set can then be divided into four regions (A, B, C and D) depending on the value of each observable being “signal-like” or “background-like”. The subdivision then provides a region dominated by the signal, a region dominated by backgrounds and two intermediate regions. The key feature of the ABCD method is that, following the mathematics of probabilities for independent events, one can estimate the background in the signal-dominated region using only the information from the other regions. The problem with using this method for the stop search is that all simple variables are correlated in this search, making the method invalid. To overcome this issue, CMS physicists have implemented an innovative approach based on advanced machine-learning techniques to determine two variables with a minimal level of correlation. These two variables are then used to divide the data into the four aforementioned regions. The figure below shows the correlation between the two variables for the signal and the background and demonstrates that the signal mostly lies in region “A”.
International Research Conference on High Energy Physics and Computational Science
No comments:
Post a Comment