The latent logarithm

Biswas, Surojit

Full-text links:

Download:

Current browse context:

stat.ME

< prev | next >

new | recent | 1605

Statistics > Methodology

Title: The latent logarithm

Authors: Surojit Biswas

(Submitted on 19 May 2016)

Abstract: Count or non-negative data are often log transformed to improve heteroscedasticity and scaling. To avoid undefined values where the data are zeros, a small pseudocount (e.g. 1) is added across the dataset prior to applying the transformation. This pseudocount considers neither the measured object's a priori abundance nor the confidence with which the measurement was made, making this practice convenient but statistically unfounded. I introduce here the latent logarithm, or lag. lag assumes that each observed measurement is a noisy realization of an unmeasured latent abundance. By taking the logarithm of this learned latent abundance, which reflects both sampling confidence/depth and the object's a priori abundance, lag provides a probabilistically coherent, stable, and intuitive alternative to the questionable, but conventional "log($x$ + pseudocount)."

Subjects:	Methodology (stat.ME)
Cite as:	arXiv:1605.06064 [stat.ME]
	(or arXiv:1605.06064v1 [stat.ME] for this version)

Submission history

From: Surojit Biswas [view email]
[v1] Thu, 19 May 2016 17:41:58 GMT (451kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1605.06064

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Methodology

Title: The latent logarithm

Submission history