University of Sydney · S1 2026 · FACULTY OF BUSINESS & ECONOMICS

ECMT1010 · Introduction To Economic Statistics

- one subject, every graph, every model, every mark
50% final exam · hurdle14 Chapters5-page Bible
Our own words - no uploaded lecturer files
Built to mirror S1 2026 · updated this semester
Chapter 4 of 11 · ECMT1010

Hypothesis Testing & Randomization

Weeks 5–6 build the formal test of evidence: state a null and an alternative hypothesis in population parameters, build a randomization distribution in StatKey under a true H₀, read off the p-value, and decide against the significance level α. It is examined as MCQ and short-answer — the marks come from the H₀/Hₐ setup, the correct one- vs two-sided p-value, the reject/inconclusive decision, the one-sentence conclusion in context, and naming Type I vs Type II errors.

In this chapter

What this chapter covers

  • 011. Null H₀ (contains '=') vs alternative Hₐ (>, <, or ≠), always in population parameters
  • 022. The randomization distribution: simulate statistics from a world where H₀ is true
  • 033. Reallocate (experiments) vs shift/shuffle (observational) to make H₀ true
  • 044. The p-value: probability of a result this extreme (or more) if H₀ is true
  • 055. One-sided (one tail) vs two-sided (both tails) p-values
  • 066. Significance level α: p < α → reject H₀ (significant); p > α → inconclusive
  • 077. The CI ↔ HT equivalence: a two-sided test at α matches a (1 − α) CI
  • 088. Type I error (reject a true H₀, prob α), Type II error (keep a false H₀, prob β), and power = 1 − β
Worked example · free

A one-sided test from a randomization distribution

Q [5 marks]. A bakery claims at least 60% of customers buy a coffee with their pastry. In a random sample of 120 customers, 64 buy a coffee (p̂ ≈ 0.533). Testing H₀: p = 0.60 versus Hₐ: p < 0.60, a StatKey randomization distribution centred on 0.60 (SE ≈ 0.045) puts 7.2% of simulated statistics at or below 0.533. State the hypotheses, give the p-value and conclude at α = 0.05.
  • 2 marksDefine the parameter and state the hypotheses. Let p = the true proportion of customers who buy a coffee. The claim is 'at least 60%', and we are testing whether it is actually lower, so H₀: p = 0.60 versus Hₐ: p < 0.60 (one-sided, left tail).
  • 1 markRead the p-value off the randomization distribution: it is the proportion of simulated statistics at or below the observed 0.533 = 0.072.
  • 1 markApply the decision rule: 0.072 > 0.05 = α, so do not reject H₀.
  • 1 markConclude in context: the result is not statistically significant at the 5% level — there is insufficient evidence that fewer than 60% of customers buy a coffee.
H₀: p = 0.60 vs Hₐ: p < 0.60; the p-value is 0.072 > 0.05, so we do not reject H₀ — there is insufficient evidence that the true proportion is below 60%.
Sia tip — The p-value is read as a tail area of the randomization distribution centred on the null value — one tail for a one-sided Hₐ, both tails for a two-sided Hₐ. State hypotheses in the population parameter (p), not the sample statistic (p̂), and always finish with a context sentence using 'evidence', not 'proof'.
Glossary

Key terms

Null vs alternative hypothesis
H₀ is the 'no effect / no difference' claim and always contains '='; Hₐ is the claim you are gathering evidence for and contains >, <, or ≠. Both are stated in terms of population parameters, not sample statistics.
Randomization distribution
The distribution of a statistic simulated from a world where H₀ is true — built by reallocating responses (experiments) or shifting/shuffling the data (observational). The observed statistic is compared against it to get the p-value.
p-value
The probability of obtaining a statistic as extreme as — or more extreme than — the observed one, assuming H₀ is true. A small p-value means the data would be surprising under H₀, which is evidence against it.
Significance level (α)
The pre-set threshold for the p-value (commonly 0.05). If p < α you reject H₀ and call the result statistically significant; if p > α the result is inconclusive and you do not reject H₀.
Type I and Type II error
A Type I error rejects a true H₀ (probability α); a Type II error fails to reject a false H₀ (probability β). Power = 1 − β is the chance of correctly detecting a real effect, and it rises with a larger sample, a larger effect, or a larger α.
CI ↔ HT equivalence
A two-sided hypothesis test at level α gives the same decision as a (1 − α) confidence interval: you reject H₀ exactly when the (1 − α) CI excludes the null value.
FAQ

Hypothesis Testing & Randomization FAQ

What does the p-value actually tell me?

It tells you how surprising your data would be if the null hypothesis were true — specifically, the probability of a statistic at least as extreme as the one you observed under H₀. A small p-value means 'data like this rarely happens when H₀ holds', which is evidence against H₀. It is NOT the probability that H₀ is true, and it is NOT the probability you made a mistake.

How do I decide whether the test is one-sided or two-sided?

Look at the research claim. If it specifies a direction — 'less than', 'more than', 'increases', 'reduces' — use a one-sided Hₐ (< or >) and one tail of the randomization distribution. If it just asks whether there is any difference or change, use a two-sided Hₐ (≠) and both tails, which makes the p-value roughly double the one-sided value. Set the direction from the question before you compute anything.

If I do not reject H₀, have I proved it is true?

No. 'Do not reject H₀' means the evidence was not strong enough to rule it out at your chosen α — the result is inconclusive, not proof. Absence of evidence is not evidence of absence: a small sample can fail to detect a real effect (a Type II error). The correct wording is 'there is insufficient evidence that…', never 'we have shown there is no effect'.

What is the difference between a Type I and a Type II error?

A Type I error is a false alarm — you reject a null hypothesis that is actually true, and its probability is α (so a 5% test makes a Type I error 5% of the time when H₀ holds). A Type II error is a miss — you fail to reject a null that is actually false, with probability β. Power = 1 − β is the chance of catching a real effect. Lowering α to guard against false alarms raises β, so there is always a trade-off.

Study strategy

Exam move

Treat every test as a five-line ritual and rehearse it until it is automatic: define the parameter, state H₀ and Hₐ in that parameter, find the p-value (one tail or two), compare with α, then write a one-sentence conclusion in context using the word 'evidence'. The single biggest mark-loser is stating hypotheses about the sample statistic instead of the population parameter, so always use μ or p, never x̄ or p̂. Understand the randomization distribution conceptually — it is the world where H₀ is true — so you can explain why the p-value is a tail area. Memorise the error table (Type I = false alarm = α; Type II = miss = β; power = 1 − β) and the CI↔HT shortcut, both of which appear as MCQs, and practise saying 'inconclusive' rather than 'we proved H₀'.

A+Everything unlocked
Unlocks this Bible + all 191 of your University of Sydney subjects - and 1,000+ Bibles across every Australian university.
Sia - your ECMT1010 tutor, unlimited, worked the way the exam marks it
The full 5-page Bible + practice bank with worked solutions
Chrome extension - sync your LMS so Sia knows your deadlines
Bilingual EN / Chinese on every Bible and every Sia answer
$25/ month
30-day money-back · cancel in one tap · how it works
Unlock the full ECMT1010 Bible + 191 University of Sydney subjects解锁完整 ECMT1010 Bible + University of Sydney 191 门科目
$25/mo