Data Analysis, Statistics and Probability

New submissions

[ total of 6 entries: 1-6 ]

New submissions for Tue, 14 Jul 20

[1]  arXiv:2007.05535 [pdf, other]
Title: Flow-Based Likelihoods for Non-Gaussian Inference
Comments: 14 pages, 6 figures + appendices
Subjects: Cosmology and Nongalactic Astrophysics (astro-ph.CO); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)

We investigate the use of data-driven likelihoods to bypass a key assumption made in many scientific analyses, which is that the true likelihood of the data is Gaussian. In particular, we suggest using the optimization targets of flow-based generative models, a class of models that can capture complex distributions by transforming a simple base distribution through layers of nonlinearities. We call these flow-based likelihoods (FBL). We analyze the accuracy and precision of the reconstructed likelihoods on mock Gaussian data, and show that simply gauging the quality of samples drawn from the trained model is not a sufficient indicator that the true likelihood has been learned. We nevertheless demonstrate that the likelihood can be reconstructed to a precision equal to that of sampling error due to a finite sample size. We then apply FBLs to mock weak lensing convergence power spectra, a cosmological observable that is significantly non-Gaussian (NG). We find that the FBL captures the NG signatures in the data extremely well, while other commonly-used data-driven likelihoods, such as Gaussian mixture models and independent component analysis, fail to do so. This suggests that works that have found small posterior shifts in NG data with data-driven likelihoods such as these could be underestimating the impact of non-Gaussianity in parameter constraints. By introducing a suite of tests that can capture different levels of NG in the data, we show that the success or failure of traditional data-driven likelihoods can be tied back to the structure of the NG in the data. Unlike other methods, the flexibility of the FBL makes it successful at tackling different types of NG simultaneously. Because of this, and consequently their likely applicability across datasets and domains, we encourage their use for inference when sufficient mock data are available for training.
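The optimization target mentioned in the abstract is the log-likelihood given by the change-of-variables formula. The following is a minimal sketch, assuming a single one-dimensional affine layer over a standard normal base distribution; it is not the architecture used in the paper, and the names `base_logpdf` and `flow_logpdf` are hypothetical.

```python
import math
import random


def base_logpdf(z):
    """Log density of the standard normal base distribution."""
    return -0.5 * (z * z + math.log(2.0 * math.pi))


def flow_logpdf(x, shift, log_scale):
    """Log-likelihood of x under the affine flow x = shift + exp(log_scale) * z.

    Change of variables: log p(x) = log p_base(z) + log |dz/dx|.
    """
    z = (x - shift) * math.exp(-log_scale)  # inverse transform
    log_det = -log_scale                    # log |dz/dx| for an affine map
    return base_logpdf(z) + log_det


# Mock Gaussian data (illustrative values only).
random.seed(0)
data = [random.gauss(2.0, 0.5) for _ in range(5000)]

# For a single affine layer the maximum-likelihood fit is closed-form,
# so no gradient-based training loop is needed in this toy case.
shift_hat = sum(data) / len(data)
var_hat = sum((x - shift_hat) ** 2 for x in data) / len(data)
log_scale_hat = 0.5 * math.log(var_hat)

mean_loglike = sum(flow_logpdf(x, shift_hat, log_scale_hat) for x in data) / len(data)
print(shift_hat, math.exp(log_scale_hat), mean_loglike)
```

A realistic flow stacks many nonlinear invertible layers and sums their log-determinants, but the quantity being maximized has the same form.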

[2]  arXiv:2007.05799 [pdf, other]
Title: A Fast, 2D Gaussian Process Method Based on Celerite: Applications to Transiting Exoplanet Discovery and Characterization
Comments: Submitted to AAS Journals
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Earth and Planetary Astrophysics (astro-ph.EP); Solar and Stellar Astrophysics (astro-ph.SR); Data Analysis, Statistics and Probability (physics.data-an)

Gaussian processes (GPs) are commonly used as a model of stochastic variability in astrophysical time series. In particular, GPs are frequently employed to account for correlated stellar variability in planetary transit light curves. The efficient application of GPs to light curves containing thousands to tens of thousands of datapoints has been made possible by recent advances in GP methods, including the celerite method. Here we present an extension of the celerite method to two input dimensions, where, typically, the second dimension is small. This method scales linearly with the total number of datapoints when the noise in each large dimension is proportional to the same celerite kernel and only the amplitude of the correlated noise varies in the second dimension. We demonstrate the application of this method to the problem of measuring precise transit parameters from multiwavelength light curves and show that it has the potential to improve transit parameter measurements by orders of magnitude. Applications of this method include transit spectroscopy and exomoon detection, as well as a broader set of astronomical problems.
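The covariance structure described in the abstract (one shared kernel in time, with only the amplitude changing along the second dimension) can be illustrated with a small dense construction. This is a sketch under stated assumptions: the kernel is a single exponential term, the flattening order and the names `real_term`, `build_covariance`, and `band_amps` are hypothetical, and the dense matrix built here does not reproduce the linear scaling that the paper's method achieves.

```python
import math


def real_term(tau, amp, decay):
    """Single exponential kernel term: amp * exp(-decay * |tau|)."""
    return amp * math.exp(-decay * abs(tau))


def build_covariance(times, band_amps, amp, decay, white_var):
    """Dense covariance for data indexed by (time i, band j), flattened as i * M + j.

    Correlated part: band_amps[j1] * band_amps[j2] * k(|t_i1 - t_i2|),
    i.e. the same time kernel in every band, scaled by per-band amplitudes.
    """
    n, m = len(times), len(band_amps)
    size = n * m
    cov = [[0.0] * size for _ in range(size)]
    for i1 in range(n):
        for j1 in range(m):
            for i2 in range(n):
                for j2 in range(m):
                    k = real_term(times[i1] - times[i2], amp, decay)
                    value = band_amps[j1] * band_amps[j2] * k
                    if i1 == i2 and j1 == j2:
                        value += white_var  # uncorrelated noise on the diagonal
                    cov[i1 * m + j1][i2 * m + j2] = value
    return cov


# Illustrative values: three time stamps, two wavelength bands.
times = [0.0, 0.5, 1.0]
band_amps = [1.0, 2.0]
cov = build_covariance(times, band_amps, amp=1.0, decay=1.0, white_var=0.01)
```

Building the matrix explicitly costs memory quadratic in the number of datapoints, which is the cost a structured solver is designed to avoid.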

[3]  arXiv:2007.06416 [pdf]
Title: Relative Entropy Regularised TDLAS Tomography for Robust Temperature Imaging
Comments: Preprint submitted to IEEE Transactions on Instrumentation and Measurement
Subjects: Image and Video Processing (eess.IV); Data Analysis, Statistics and Probability (physics.data-an); Instrumentation and Detectors (physics.ins-det)

Tunable Diode Laser Absorption Spectroscopy (TDLAS) tomography has been widely used for in situ combustion diagnostics, yielding images of both species concentration and temperature. The temperature image is generally obtained from the reconstructed absorbance distributions for two spectral transitions, i.e. two-line thermometry. However, the inherently ill-posed nature of tomographic data inversion leads to noise in each of the reconstructed absorbance distributions. These noise effects propagate into the absorbance ratio and generate artefacts in the retrieved temperature image. To address this problem, we have developed a novel algorithm, which we call Relative Entropy Tomographic RecOnstruction (RETRO), for TDLAS tomography. A relative entropy regularisation is introduced for high-fidelity temperature image retrieval from jointly reconstructed two-line absorbance distributions. We have carried out numerical simulations and proof-of-concept experiments to validate the proposed algorithm. Compared with the well-established Simultaneous Algebraic Reconstruction Technique (SART), the RETRO algorithm significantly improves the quality of the tomographic temperature images, exhibiting excellent robustness against TDLAS tomographic measurement noise. RETRO offers great potential for industrial field applications of TDLAS tomography, where it is common for measurements to be performed in very harsh environments.
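Two-line thermometry, and the way noise in the absorbance ratio propagates into temperature, can be sketched as follows. The sketch assumes the common simplification in which the ratio of integrated absorbances equals the ratio of line strengths and the stimulated-emission factor is neglected; the lower-state energies and reference ratio are made-up illustrative numbers, and the function names are hypothetical. It does not implement RETRO or SART.

```python
import math

C2 = 1.4387769  # second radiation constant h*c/k_B in cm*K


def line_strength_ratio(temp, e1, e2, ratio_ref, temp_ref=296.0):
    """Ratio S1(T)/S2(T) for lower-state energies e1, e2 (cm^-1).

    ratio_ref is S1/S2 at the reference temperature temp_ref (K).
    """
    return ratio_ref * math.exp(-C2 * (e1 - e2) * (1.0 / temp - 1.0 / temp_ref))


def temperature_from_ratio(ratio, e1, e2, ratio_ref, temp_ref=296.0):
    """Invert the ratio expression above for temperature (K)."""
    inv_temp = 1.0 / temp_ref - math.log(ratio / ratio_ref) / (C2 * (e1 - e2))
    return 1.0 / inv_temp


# Illustrative (hypothetical) spectroscopic parameters.
E1, E2, RATIO_REF = 1000.0, 2000.0, 1.0

true_temp = 1500.0
ratio = line_strength_ratio(true_temp, E1, E2, RATIO_REF)
recovered_temp = temperature_from_ratio(ratio, E1, E2, RATIO_REF)

# A 2% error in the absorbance ratio shifts the retrieved temperature.
noisy_temp = temperature_from_ratio(1.02 * ratio, E1, E2, RATIO_REF)
print(recovered_temp, noisy_temp)
```

Because the temperature depends on the logarithm of the ratio, small pixel-wise errors in either reconstructed absorbance image translate directly into temperature artefacts, which is the problem the regularisation targets.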

[4]  arXiv:2007.06470 [pdf, other]
Title: A comparison of $g^{(1)}(\tau)$, $g^{(3/2)}(\tau)$, and $g^{(2)}(\tau)$, for radiation from harmonic oscillators in Brownian motion with coherent background
Comments: 23 pages, 2 figures, 3 Tables
Subjects: Optics (physics.optics); Instrumentation and Methods for Astrophysics (astro-ph.IM); Data Analysis, Statistics and Probability (physics.data-an); Quantum Physics (quant-ph)

We compare the field-field $g^{(1)}(\tau)$, intensity-field $g^{(3/2)}(\tau)$, and intensity-intensity $g^{(2)}(\tau)$ correlation functions for models that are of relevance in astrophysics. We obtain expressions for the general case of chaotic radiation, where the amplitude is Rician, based on a model with an ensemble of harmonic oscillators in Brownian motion. We obtain the signal-to-noise ratios for two methods of measurement. The intensity-field correlation function signal-to-noise ratio scales with the first power of $|g^{(1)}(\tau)|$. This is in contrast with the well-established result for $g^{(2)}(\tau)$, which goes as the square of $|g^{(1)}(\tau)|$.
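The quadratic dependence mentioned for the intensity-intensity correlation is the Siegert relation, which for purely chaotic light states that the intensity-intensity correlation equals one plus the squared modulus of the field-field correlation. At zero delay the field-field correlation is 1, so the intensity-intensity correlation should be 2. The sketch below checks this numerically, assuming a chaotic field with independent Gaussian real and imaginary parts; it does not model the coherent background or the Brownian-motion dynamics treated in the paper.

```python
import random

random.seed(1)
n_samples = 50000

# Assumption: each sample of a chaotic field has independent, zero-mean
# Gaussian real and imaginary parts, so its intensity is |E|^2.
intensities = []
for _ in range(n_samples):
    re = random.gauss(0.0, 1.0)
    im = random.gauss(0.0, 1.0)
    intensities.append(re * re + im * im)

mean_i = sum(intensities) / n_samples
mean_i2 = sum(i * i for i in intensities) / n_samples

# Zero-delay intensity-intensity correlation: <I^2> / <I>^2.
g2_zero = mean_i2 / (mean_i * mean_i)
print(g2_zero)
```

The estimate fluctuates around 2 with finite-sample scatter; adding a coherent component to the field would pull the value toward 1.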

Replacements for Tue, 14 Jul 20

[5]  arXiv:2003.07070 (replaced) [pdf, other]
Title: Merge-split Markov chain Monte Carlo for community detection
Authors: Tiago P. Peixoto
Comments: 13 pages, 6 figures. Code available at this https URL
Journal-ref: Phys. Rev. E 102, 012305 (2020)
Subjects: Physics and Society (physics.soc-ph); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[6]  arXiv:2005.04140 (replaced) [pdf]
Title: Deep-Learning Continuous Gravitational Waves: Multiple detectors and realistic noise
Comments: 12 pages, 8 figures, 6 tables
Subjects: General Relativity and Quantum Cosmology (gr-qc); Instrumentation and Methods for Astrophysics (astro-ph.IM); Data Analysis, Statistics and Probability (physics.data-an)