We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

hep-ex

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

High Energy Physics - Experiment

Title: Post-hoc regularisation of unfolded cross-section measurements

Authors: Lukas Koch
Abstract: Neutrino cross-section measurements are often presented as unfolded binned distributions in "true" variables. The ill-posedness of the unfolding problem can lead to results with strong anti-correlations and fluctuations between bins, which make comparisons to theoretical models in plots difficult. To alleviate this problem, one can introduce regularisation terms in the unfolding procedure. These suppress the anti-correlations in the result, at the cost of introducing some bias towards the expected shape of the data. This paper discusses a method using simple linear algebra, which makes it is possible to regularise any result that is presented as a central value and a covariance matrix. This "post-hoc" regularisation is generally much faster than repeating the unfolding method with different regularisation terms. The method also yields a regularisation matrix which connects the regularised to the unregularised result, and can be used to retain the full statistical power of the unregularised result when publishing a nicer looking regularised result. In addition to the regularisation method, this paper also presents some thoughts on the presentation of correlated data in general. When using the proposed method, the bias of the regularisation can be understood as a data visualisation problem rather than a statistical one. The strength of the regularisation can be chosen by minimising the difference between the implicitly uncorrelated distribution shown in the plots and the actual distribution described by the unregularised central value and covariance. Aside from minimising the difference between the shown and the actual result, additional information can be provided by showing the local log-likelihood gradient of the models shown in the plots. This adds more information about where the model is "pulled" by the data than just comparing the bin values to the data's central values.
Comments: 12 pages, 6 figures; Fixed Fig. 1 y-axis label
Subjects: High Energy Physics - Experiment (hep-ex); Data Analysis, Statistics and Probability (physics.data-an); Applications (stat.AP)
DOI: 10.1088/1748-0221/17/10/P10021
Cite as: arXiv:2207.02125 [hep-ex]
  (or arXiv:2207.02125v4 [hep-ex] for this version)

Submission history

From: Lukas Koch [view email]
[v1] Tue, 5 Jul 2022 15:44:47 GMT (70kb,D)
[v2] Tue, 12 Jul 2022 15:32:21 GMT (71kb,D)
[v3] Wed, 31 Aug 2022 15:36:02 GMT (100kb,D)
[v4] Tue, 4 Oct 2022 14:23:22 GMT (115kb,D)

Link back to: arXiv, form interface, contact.