The statistic Prep estimates the probability of replicating an effect. It captures traditional publication criteria for signal-to-noise ratio, while avoiding parametric inference and the resulting Bayesian dilemma. In concert with effect size and replication intervals, Prep provides all of the information now used in evaluating research, while avoiding many of the pitfalls of traditional statistical inference.
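The abstract does not give the formula, but a common approximation in the Prep literature estimates the probability of replicating an effect's direction from a one-tailed p value as Prep = Φ(z₁₋ₚ / √2), where Φ is the standard normal CDF. A minimal sketch under that assumption:

```python
from statistics import NormalDist

def p_rep(p_one_tailed: float) -> float:
    """Approximate the probability of replicating an effect's direction,
    given a one-tailed p value, via p_rep = Phi(z_(1-p) / sqrt(2)).
    This is the common approximation, not the only estimator in use."""
    nd = NormalDist()
    z = nd.inv_cdf(1.0 - p_one_tailed)   # z-score corresponding to 1 - p
    return nd.cdf(z / 2 ** 0.5)          # shrink by sqrt(2): two noisy studies

# A one-tailed p of .025 (z = 1.96) yields p_rep of roughly .92,
# while a null result (p = .5) yields p_rep = .5, as expected.
```

The √2 in the denominator reflects that a replication attempt adds its own sampling error on top of the original study's, doubling the variance of the difference between the two observed effects.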