University of Sydney · S1 2026 · FACULTY OF BUSINESS & ECONOMICS

BUSS1020 · Quantitative Business Analysis

- one subject, every graph, every model, every mark
50% final exam · hurdle14 Chapters8-page Bible
Our own words - no uploaded lecturer files
Built to mirror S1 2026 · updated this semester
Chapter 1 of 11 · BUSS1020

Foundations: Data, Variables & Sampling

Foundations: Data, Variables & Sampling (Week 1, Berenson Ch 1) sets the vocabulary the whole subject rests on. You learn to tell a population from a sample and a parameter from a statistic, to classify variables as categorical or numerical and place them on the four levels of measurement, and to choose a sampling method that gives a representative picture without bias. Getting these distinctions right is what lets every later technique — probabilities, intervals, tests, regression — be applied to the correct kind of data.

In this chapter

What this chapter covers

  • 01Population vs sample; parameter (Greek) vs statistic (Latin)
  • 02Categorical variables: nominal vs ordinal
  • 03Numerical variables: discrete vs continuous
  • 04Levels of measurement: nominal · ordinal · interval · ratio
  • 05Data sources: primary vs secondary; structured vs unstructured
  • 06Probability sampling: simple random, systematic, stratified, cluster
  • 07Non-probability sampling: convenience, judgement
  • 08Sampling vs non-sampling error; coverage and selection bias
Worked example · free

Classify variables and identify a sampling method

Q [6 marks]. A retailer surveys shoppers and records: (a) preferred store (Newtown / Chatswood / Online), (b) satisfaction rated 1–5, (c) number of items bought, and (d) total spend in dollars. For each, name the variable type and level of measurement. The retailer picks every 20th shopper leaving the store — name this sampling method and one bias it risks.
  • 1 mark(a) Preferred store is categorical with no natural order → nominal.
  • 1 mark(b) Satisfaction 1–5 is categorical with a meaningful order but unequal/undefined gaps → ordinal.
  • 1 mark(c) Number of items is numerical and countable → discrete (ratio level, since 0 means none and ratios are meaningful).
  • 1 mark(d) Total spend in dollars is numerical and measurable on a continuum → continuous (ratio level).
  • 1 markSelecting every 20th shopper is systematic sampling.
  • 1 markIt risks coverage/selection bias: only shoppers who actually entered and exited the physical store are sampled, so online-only or non-visiting customers are excluded.
Nominal, ordinal, discrete (ratio), continuous (ratio); the method is systematic sampling, which risks coverage bias by missing customers who never visit the store in person.
Sia tip — Two quick tests: 'Are the categories ordered?' separates nominal from ordinal; 'Can I count it or must I measure it?' separates discrete from continuous. State both the type and the level — examiners often award separate marks for each.
Glossary

Key terms

Population vs sample
The population is the entire group of interest; a sample is the subset you actually observe and from which you infer.
Nominal vs ordinal
Both are categorical: nominal categories have no order (e.g. store location), ordinal categories have a meaningful order but no fixed spacing (e.g. a 1–5 rating).
Discrete vs continuous
Discrete numerical data come from counting and take separate values (number of items); continuous data come from measuring and can take any value in a range (dollars, time).
Stratified sampling
A probability method that splits the population into homogeneous groups (strata) and samples within each, improving representativeness for known subgroups.
Sampling vs non-sampling error
Sampling error is the natural variation from observing only a sample rather than the whole population; non-sampling error comes from bias, bad measurement or coverage gaps and is not reduced by a larger sample.
FAQ

Foundations: Data, Variables & Sampling FAQ

Why does the parameter/statistic distinction matter so much?

Because every inference technique in the subject estimates an unknown population parameter (like μ or π) from a sample statistic (like X̄ or p). Mixing the symbols up is one of the most common ways to lose easy marks, so the Greek-for-population, Latin-for-sample rule is worth locking in early.

Is a postcode numerical data?

No — even though it looks like a number, a postcode is nominal categorical data because the values are just labels with no arithmetic meaning. This 'looks numerical but isn't' trap appears in MCQs.

What makes a sample 'good'?

A good sample is representative and chosen by a probability method (every unit has a known, non-zero chance of selection), which lets you quantify sampling error. Convenience and judgement samples are easier but introduce bias you cannot measure.

Study strategy

Exam move

This week is pure vocabulary, and it is the cheapest set of MCQ marks in the whole unit — so over-learn it. Make a one-page table: variable type → level of measurement → an example → a typical chart, and drill the 'trap' cases (postcodes, ratings, years). For sampling, memorise the four probability methods and the two non-probability ones, and be able to name the bias each one risks. Because Week 1 feeds the in-semester test and the final's Part A, a few minutes of flashcards here pays off all semester.

A+Everything unlocked
Unlocks this Bible + all 203 of your University of Sydney subjects - and 1,000+ Bibles across every Australian university.
Sia - your BUSS1020 tutor, unlimited, worked the way the exam marks it
The full 8-page Bible + practice bank with worked solutions
Chrome extension - sync your LMS so Sia knows your deadlines
Bilingual EN / Chinese on every Bible and every Sia answer
$25/ month
30-day money-back · cancel in one tap · how it works
Unlock the full BUSS1020 Bible + 203 University of Sydney subjects解锁完整 BUSS1020 Bible + University of Sydney 203 门科目
$25/mo