Statistical significance, typically p<0.05, means there's less than a 5% probability that observed results occurred by chance. Critics note that with enough tests, some will reach significance randomly—if you test 100 false hypotheses, you'll average five 'significant' findings. The replication crisis in psychology has revealed that many published findings fail to replicate, partly due to this multiple comparisons problem and partly due to publication bias favoring positive results.

3
reading

The passage suggests that

A

all statistically significant findings are definitely true

B

the standards used for statistical inference may themselves require scrutiny and additional safeguards

C

publication bias only affects negative results

D

psychology research is immune to replication issues

Correct Answer: B

Choice B is the best answer. P<0.05 produces false positives; additional safeguards are needed.

  1. Context clues: Multiple comparisons create spurious significance; publication bias compounds problems.
  2. Meaning: Standard statistical practices have limitations requiring awareness.
  3. Verify: The "replication crisis" shows current standards are insufficient.

💡 Strategy: When standard methods are shown to produce systematic errors, infer need for scrutiny.

Choice A is incorrect because p<0.05 allows 5% false positives by design. Choice C is incorrect because bias favors positive results (not publishing negative ones). Choice D is incorrect because psychology specifically faces a "replication crisis."