We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Methodological Issues in Multistage Genome-Wide Association Studies

Abstract: Because of the high cost of commercial genotyping chip technologies, many investigations have used a two-stage design for genome-wide association studies, using part of the sample for an initial discovery of ``promising'' SNPs at a less stringent significance level and the remainder in a joint analysis of just these SNPs using custom genotyping. Typical cost savings of about 50% are possible with this design to obtain comparable levels of overall type I error and power by using about half the sample for stage I and carrying about 0.1% of SNPs forward to the second stage, the optimal design depending primarily upon the ratio of costs per genotype for stages I and II. However, with the rapidly declining costs of the commercial panels, the generally low observed ORs of current studies, and many studies aiming to test multiple hypotheses and multiple endpoints, many investigators are abandoning the two-stage design in favor of simply genotyping all available subjects using a standard high-density panel. Concern is sometimes raised about the absence of a ``replication'' panel in this approach, as required by some high-profile journals, but it must be appreciated that the two-stage design is not a discovery/replication design but simply a more efficient design for discovery using a joint analysis of the data from both stages. Once a subset of highly-significant associations has been discovered, a truly independent ``exact replication'' study is needed in a similar population of the same promising SNPs using similar methods.
Comments: Published in at this http URL the Statistical Science (this http URL) by the Institute of Mathematical Statistics (this http URL)
Subjects: Methodology (stat.ME); Genomics (q-bio.GN)
Journal reference: Statistical Science 2009, Vol. 24, No. 4, 414-429
DOI: 10.1214/09-STS288
Report number: IMS-STS-STS288
Cite as: arXiv:1010.4659 [stat.ME]
  (or arXiv:1010.4659v1 [stat.ME] for this version)

Submission history

From: Duncan C. Thomas [view email]
[v1] Fri, 22 Oct 2010 09:55:31 GMT (102kb)

Link back to: arXiv, form interface, contact.