The p-filter: multi-layer FDR control for grouped hypotheses

Barber, Rina Foygel; Ramdas, Aaditya

(Submitted on 10 Dec 2015 (v1), last revised 29 Oct 2016 (this version, v3))

Abstract: In many practical applications of multiple hypothesis testing using the False Discovery Rate (FDR), the given hypotheses can be naturally partitioned into groups, and one may not only want to control the number of false discoveries (wrongly rejected null hypotheses), but also the number of falsely discovered groups of hypotheses (we say a group is falsely discovered if at least one hypothesis within that group is rejected, when in reality the group contains only nulls). In this paper, we introduce the p-filter, a procedure which unifies and generalizes the standard FDR procedure by Benjamini and Hochberg and the global null testing procedure by Simes. We first prove that our proposed method can simultaneously control the overall FDR at the finest level (individual hypotheses treated separately) and the group FDR at coarser levels (when such groups are user-specified). We then generalize the p-filter procedure even further to handle multiple partitions of hypotheses, since that might be natural in many applications. For example, in neuroscience experiments, we may have a hypothesis for every (discretized) location in the brain, and at every (discretized) timepoint: does the stimulus correlate with activity in location x at time t after the stimulus was presented? In this setting, one might want to group hypotheses by location and by time. Importantly, our procedure can handle multiple partitions which are nonhierarchical (i.e., one partition may arrange p-values by voxel, and another partition arranges them by time point; neither one is nested inside the other). We prove that our procedure controls FDR simultaneously across these multiple layers, under assumptions that are standard in the literature: we do not need the hypotheses to be independent, but require a nonnegative dependence condition known as PRDS.
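For orientation, the two classical procedures that the abstract says the p-filter unifies can be sketched in a few lines of Python. This is not the p-filter itself, whose details are not given in the abstract; it is a minimal illustration of the Benjamini-Hochberg step-up procedure, the Simes global-null p-value, and the abstract's notion of a "discovered" group. The function names, the example p-values, and the group labels are all hypothetical.

```python
def benjamini_hochberg(pvals, alpha):
    """Benjamini-Hochberg step-up procedure: return a list of rejection flags."""
    n = len(pvals)
    order = sorted(range(n), key=lambda i: pvals[i])  # indices by increasing p-value
    k = 0
    for rank, i in enumerate(order, start=1):
        # Keep the largest rank whose sorted p-value is below alpha * rank / n.
        if pvals[i] <= alpha * rank / n:
            k = rank
    rejected = [False] * n
    for i in order[:k]:
        rejected[i] = True
    return rejected


def simes_pvalue(pvals):
    """Simes p-value for the global null that every hypothesis in pvals is null."""
    n = len(pvals)
    return min(n * p / rank for rank, p in enumerate(sorted(pvals), start=1))


def discovered_groups(rejected, groups):
    """A group counts as discovered if at least one of its hypotheses is rejected."""
    return {g for g, r in zip(groups, rejected) if r}


# Hypothetical data: five p-values split into two groups.
pvals = [0.001, 0.008, 0.039, 0.041, 0.6]
groups = [0, 0, 1, 1, 1]
alpha = 0.05

rejected = benjamini_hochberg(pvals, alpha)
print("BH rejections:", rejected)
print("Discovered groups:", discovered_groups(rejected, groups))
print("Simes p-value for all hypotheses:", simes_pvalue(pvals))
```

The two procedures are closely linked: the Simes p-value is at most alpha exactly when Benjamini-Hochberg at level alpha rejects at least one hypothesis. Note that running BH on individual p-values, as here, gives no guarantee on the rate of falsely discovered groups, which is the gap the abstract describes the p-filter as addressing.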

Subjects:	Methodology (stat.ME); Machine Learning (stat.ML)
Cite as:	arXiv:1512.03397 [stat.ME]
	(or arXiv:1512.03397v3 [stat.ME] for this version)

Submission history

From: Rina Foygel Barber
[v1] Thu, 10 Dec 2015 20:23:16 GMT (321kb,D)
[v2] Mon, 22 Feb 2016 14:57:01 GMT (354kb,D)
[v3] Sat, 29 Oct 2016 01:53:16 GMT (2272kb,D)
