USyd · QBUS5001 · Foundations of Data Analytics for Business

QBUS5001: pass the exams, not just read the notes

Your complete guide to University of Sydney's foundations of data analytics for business unit. See where the marks are, work real practice questions, and study with an AI tutor that knows QBUS5001.

6 credit points Postgraduate Offered S1 / S2 ~75% exams Discipline of Business Analytics

Learn with AskSia Explore the AskSia library

Sia generates QBUS5001 practice questions, walks through random variables and estimation step by step, and quizzes you on the material the exam weights most heavily.

Try a real exam-style question

Worked example

Multiple choice · solution revealed after you answer

A bank's fraud screen flags transactions for review. Across all transactions the prior probability of fraud is 2% (P(F) = 0.02). The model returns an alert for 90% of truly fraudulent transactions (sensitivity), and it falsely alerts on 5% of legitimate transactions. A transaction has just been flagged. What is the probability it is actually fraudulent?

Worked solution

Apply Bayes' theorem: P(F | alert) = P(alert | F)·P(F) / P(alert).

Numerator (true fraud and alerted): 0.90 × 0.02 = 0.018.
False-alert term (legitimate but alerted): 0.05 × (1 − 0.02) = 0.05 × 0.98 = 0.049.
Denominator P(alert) = 0.018 + 0.049 = 0.067.
P(F | alert) = 0.018 / 0.067 = 0.2687, so about 27%.

The trap: Reading the 90% sensitivity as the answer. Because genuine fraud is rare (a 2% base rate), most alerts come from the large pool of legitimate transactions, so a flagged transaction is fraudulent only about 27% of the time. Ignoring the base rate is the base-rate fallacy. classic slip!

Generate 10 more like this with Sia

your whole grade↘

Where your grade comes from Exams 75% · Projects 15% · Quizzes 10%

102515Final · 50%

One exam decides 50% of your grade. Hurdle: at least 40 out of 100 required. This whole page is built around that.

Overview

What QBUS5001 is, and where it sits

QBUS5001 is the quantitative foundation unit of the University of Sydney Business School's postgraduate coursework programs. Across twelve modules it builds business statistics from the ground up: data fundamentals and visualisation, probability and Bayes' theorem, discrete and continuous distributions, sampling distributions and the Central Limit Theorem, confidence intervals, hypothesis testing for one and two populations, then simple and multiple linear regression. Every method is applied hands-on in Microsoft Excel 365 with the Analysis ToolPak.

The assessment is weighted toward individual, time-pressured work. A closed-book final exam is worth 50% and covers Modules 1 to 12, an in-semester test is worth 25% and covers Modules 1 to 5, a group assignment is worth 15%, and best-5-of-10 weekly Mindtap/Aplia homework is worth 10%. Both the test and the final are closed-book with a supplied formula sheet, a non-programmable calculator, and answers required to four decimal places.

There are no prerequisites, only assumed competency with tabulated data, Excel and high-school mathematics, and the unit is prohibited against ECMT5001 and QBUS5002. That places it as the entry-level analytics and statistics requirement before more advanced quantitative electives in the Masters.

How it differs from its first-year siblings. QBUS5001 is the postgraduate entry point: it teaches the inference and regression toolkit by hand in Excel and assesses it under closed-book exam conditions. BUSS1020 is the undergraduate version of the same statistics foundation, BUSS6002 is the postgraduate follow-on that moves from classical inference into data science and modelling, and DATA3404 sits on the engineering side, handling the data-management and storage problems rather than the statistical analysis.

Official outline: sydney.edu.au · QBUS5001 outline. Always treat the official outline and the exam timetable as authoritative.

Difficulty & time commitment

Is QBUS5001 hard, and how much time does it take?

QBUS5001 is manageable if you keep a weekly rhythm and treat the back half as the main event. Across student reviews the pattern is consistent: it starts gently and steepens, and the heaviest assessment is the part that separates grades.

Difficulty

3.5 / 5

Hard. Gentle early, demanding back half. Hard to fail with steady work; an HD takes consistent practice.

Exam load

75%

The exams decide most of the grade. The heaviest single component is 50%.

Weekly time

~10 hrs

The standard load for a 6-credit-point unit, around 1.5 hours per credit point per week including class.

A read across student reviews and course feedback. See what students say ↓

Modules 0 to 5gentler foundation

Modules 8 to 9 (hypothesis testing)ramping

Modules 10 to 12 (regression)steepest, heaviest exam weight

The difficulty curve and the assessment weighting point the same way: the back half is harder and worth more. Front-loading effort there is the highest-return decision in the unit.

Is this unit for you

Who tends to do well, and who tends to struggle

You will likely do well if

You are comfortable with high-school algebra and can read formula-sheet notation, where Greek letters are population parameters and Latin letters are sample statistics
You keep up with the best-5-of-10 weekly Mindtap/Aplia homework instead of cramming it
You build fluency in Excel 365 with the Analysis ToolPak early, including Regression, NORM.DIST, BINOM.DIST and T.INV.2T
You practise computing to four decimal places under timed, closed-book conditions

You may struggle if

You try to memorise outcomes instead of methods, because the regression and two-population modules reward understanding the decision tree of which test to pick (Z vs pooled-t vs Welch-t)
You treat the final-exam hurdle as optional, because a sub-40 final fails the unit regardless of your group-assignment and homework marks
You only learn Excel the week before the in-semester test
You confuse the two proportion conventions, where the textbook uses p and p-hat but the slides use the Greek pi and p

do this ↘

What HD students do differently

Work the supplied 8-page formula sheet until each block (estimation, two-population inference, regression) is navigable at speed
Drill both sample exam papers and the five practice sets against their marking rubrics under timed conditions
Master the which-test and which-interval decision trees so you pick the right statistic instantly
Interpret regression output in business language (slope, R-squared, adjusted R-squared, the F-test, dummy coefficients), not just report the numbers

Syllabus

The 14 topics, module by module

The exam-weight marker on each topic shows where the marks concentrate. The amber topics carry the highest exam weight.

T1 · Data, visualisation, communication and ethics

SSK intro chapters

Data types (structured vs unstructured, categorical vs numerical, cross-section vs time-series), univariate and bivariate charts, Tufte's visualisation principles, and ethics for data analysis including algorithmic bias.

Lower exam weight

T2 · Descriptive statistics and association

Mean, variance and standard deviation (population vs sample), IQR, coefficient of variation, and sample covariance and correlation. Covariance gives direction; correlation standardises it to measure strength.

Lower exam weight

T3 · All about probability

SSK Ch 6 to 8

Classical, relative-frequency and subjective probability, the addition and multiplication rules, conditional probability and independence, the total probability rule, and Bayes' theorem with the base-rate fallacy.

Lower exam weight

T4 · Random variables and discrete distributions

Expected value and variance of a discrete random variable, covariance of random variables, linear transforms and sums, the two-asset portfolio application, and the Binomial and Poisson distributions in Excel.

High exam weightQuiz me on random variables →

T5 · Continuous distributions

Uniform and Exponential distributions, the Normal N(mu, sigma-squared) via NORM.DIST and NORM.INV, and standardising to the Standard Normal Z.

Lower exam weight

T6 · Sampling distributions, CLT and LLN

The sampling distribution of the sample mean, the standard error sigma over root-n, the Central Limit Theorem (n at least 30), the CLT for proportions, and the Law of Large Numbers.

Lower exam weight

T7 · Estimation and confidence intervals

Unbiased and consistent estimators, confidence intervals for the mean (Z when sigma known, t when sigma unknown) and for a proportion, sample-size determination, and the frequentist reading of a 95% CI.

High exam weightQuiz me on estimation →

T8 · In-Semester Test (Modules 1 to 5)

A closed-book checkpoint over descriptive statistics, probability, distributions and sampling/CLT. Supplied formula sheet, non-programmable calculator, answers to four decimal places.

Lower exam weight

T9 · Hypothesis testing (one population)

Null and alternative hypotheses, significance level, test statistic, rejection region and p-value, Type I and Type II errors, and tests on a mean (Z or t) and a proportion, one- vs two-tailed.

High exam weightQuiz me on hypothesis testing (one population) →

T10 · Two-population inference

A/B-testing motivation, two-means tests (known variance Z, unknown-equal pooled t, unknown-unequal Welch t), two proportions with a pooled estimate, the F-test for equal variances, and the chi-squared test for normality.

High exam weightQuiz me on two-population inference →

M10

T11 · Simple linear regression

Least-squares b0 and b1, slope and intercept interpretation, the sums-of-squares decomposition SST = SSR + SSE, R-squared, the standard error of the regression, and prediction.

High exam weightQuiz me on simple linear regression →

M11

T12 · SLR diagnostics and inference

The four L.I.N.E. assumptions via residual plots, the t-test on the slope and on the correlation, confidence and prediction intervals for Y, the Durbin-Watson statistic for autocorrelation, and pitfalls like extrapolation and confusing correlation with causation.

High exam weightQuiz me on slr diagnostics →

M12

T13 · Multiple linear regression

The estimated MLR equation, R-squared and adjusted R-squared, the overall F-test (all slopes zero) and individual-coefficient t-tests, and dummy variables (c minus 1 dummies for c levels) read as intercept shifts.

High exam weightQuiz me on multiple linear regression →

M13

T14 · Revision and final-exam preparation

A walkthrough of the 8-page supplied formula sheet, both sample exam papers with marking rubrics, and the five practice sets spanning one-sample inference, proportions, two-mean inference, and simple and multiple regression.

Lower exam weight

How it's assessed

Assessment structure

Component	Weight	Format & timing
Weekly Homework (Mindtap/Aplia)	10%	Online weekly homework on the Mindtap/Aplia platform; 10 homeworks in total, best 5 marks counted. Designed to encourage learning rather than to assess. Due every Sunday 23:59 from Week 2. Best 5 of 10 count toward the 10%.
In-Semester Test	25%	Closed-book timed test (1 hour 30 minutes); supplied formula sheet; non-programmable calculator and physical translation dictionary permitted; answers to four decimal places. Week 7. Covers Modules 1 to 5.
Group Assignment	15%	Group assignment. Due Week 12.
Final Exam	50%	Closed-book written final covering Modules 1 to 12, with an 8-page formula sheet supplied separately (printed separately, not removable from the venue). Expect a mix of computation across distributions, intervals, hypothesis tests and regression output, plus interpretation in business context. Exam period (date announced on the Ed discussion board). Hurdle: at least 40 out of 100 required.

Weekly Homework (Mindtap/Aplia)10%

Online weekly homework on the Mindtap/Aplia platform; 10 homeworks in total, best 5 marks counted. Designed to encourage learning rather than to assess.

In-Semester Test25%

Closed-book timed test (1 hour 30 minutes); supplied formula sheet; non-programmable calculator and physical translation dictionary permitted; answers to four decimal places.

Group Assignment15%

Group assignment.

Final Exam50%

Closed-book written final covering Modules 1 to 12, with an 8-page formula sheet supplied separately (printed separately, not removable from the venue). Expect a mix of computation across distributions, intervals, hypothesis tests and regression output, plus interpretation in business context.

To pass you must score at least 40 out of 100 in the final exam AND have an overall mark of at least 50 across all tasks. The final exam is a hurdle: a strong overall mark cannot rescue a final-exam result below 40%.
Closed-book computation and interpretation across distributions, confidence intervals, one- and two-population hypothesis tests, and simple and multiple regression output; answers to four decimal places against a supplied formula sheet.
Calculator policy: Non-programmable calculator permitted in the in-semester test and final exam; a physical translation dictionary is also permitted in the in-semester test. No electronic devices.

read this! If you read nothing else

This is an exam-cram unit. With the exams at 75% of the grade and the final exam alone at 50%, your result is overwhelmingly decided by how well you perform under time pressure. Hurdle: at least 40 out of 100 required.

Final exam timing: exam period (date announced on the Ed discussion board). Confirm the exact date and venue on the official exam timetable.

How to actually pass it

A weekly rhythm, two checklists, and the traps to avoid

The unit rewards consistency over cramming, and practice over re-reading. Here is the loop that works, then what to have nailed before each exam.

The weekly loop

Before the lecture

Pre-watch each module's short videos so the lecture consolidates rather than introduces. Keep a running Excel workbook of every function as you meet it.

By Sunday 23:59

Submit that week's Mindtap/Aplia homework. Only your best 5 of 10 count, but every one is cheap, low-stakes practice toward the 10%.

Before the tutorial

Tutorials run a one-week lag, so each week's tutorial covers the previous week's content. Review last week, bring questions, and work the tutorial problems by hand.

Weekly

Re-derive, do not memorise, the harder formula-sheet blocks (portfolio variance, pooled variance) and the which-test decision trees.

Before the mid-semester checklist

Master Modules 1 to 5 cold: descriptive measures, probability and Bayes, Binomial/Poisson/Normal, and the CLT
Install Excel 365 desktop plus the Analysis ToolPak and learn NORM.DIST, BINOM.DIST, POISSON.DIST, T.INV.2T and CORREL early
Rehearse to four decimal places with a non-programmable calculator and the supplied formula sheet
Practise expected-value, distribution-probability and CLT calculations under time pressure for the Week 7 test

Before the final heaviest topics

Drill both sample exam papers and all five practice sets (one-sample, proportion, two-mean, simple regression, multiple regression) against their marking rubrics
Practise reading Excel regression output: R-squared, adjusted R-squared, the overall F-test, individual t-tests and dummy-variable coefficients
Make the which-test and which-interval decision trees automatic so you pick the right statistic instantly
Treat the 40/100 final-exam hurdle as the priority and secure it before chasing marginal homework marks

The mistakes that cost marks

Forgetting the final-exam hurdle. The final is a hurdle at 40 out of 100. You can hold a strong overall mark and still fail the unit if the final lands below 40%. Build your study plan around clearing it first.

Picking the wrong two-population test. Known variance points to Z, unknown-but-equal variance to the pooled t, and unknown-unequal variance to the Welch t. Choosing the wrong branch loses easy marks even when the arithmetic is right.

Confusing the two proportion conventions. The textbook writes p and p-hat for population and sample proportion, but the course slides use the Greek pi and p. Check which is meant before you substitute.

Treating correlation as causation or extrapolating. Reading a regression slope as cause, or predicting outside the sampled range of X, are classic exam pitfalls. So is forgetting that c levels of a category need c minus 1 dummies.

Build a study plan with Sia → Drill the back-half topics →

Teaching team

Who teaches QBUS5001

The bios below are factual. The star ratings are not ours: they are impressions from students who have taken the unit, so you can hear from people who sat in the lectures.

Unit Coordinator

Artem Prokhorov

Professor in the Discipline of Business Analytics at the University of Sydney Business School. Researches theoretical and applied econometrics, including copula modelling and the intersection of machine learning and econometric theory, with applications in business analytics, finance, risk management and health and labour economics. Staff profile

Student ratingNo student ratings yet

Add your review →

Lecturer

Tony (Ce) Shang

Lecturer on the unit.

Student ratingNo student ratings yet

Add your review →

Teaching team as listed in the unit materials reviewed. AskSia does not rate lecturers; star ratings are submitted by students who have taken QBUS5001.

Formula & concept sheet

The vocabulary and formulas you must own

Coefficient of variation (CV): Unit-free relative spread, CV = s / x-bar (or sigma / mu); lets you compare variability across differently scaled variables.
Covariance vs correlation: Covariance gives the direction of a linear association but is scale-dependent; correlation r standardises it to lie in the interval from -1 to 1, measuring strength.
Bayes' theorem: P(A|B) = P(B|A)P(A) / P(B); revises a prior probability with new evidence. Watch the base-rate fallacy when the event is rare.
Central Limit Theorem (CLT): For n of at least 30 (rule of thumb) the sampling distribution of the sample mean is approximately Normal regardless of the population shape; the standard error of the mean is sigma over root-n.
Confidence interval: Point estimate plus or minus (critical value)(standard error). Correct reading: 95% of such intervals over repeated samples contain mu, not a 95% probability that mu lies in this one interval.

Type I and Type II error: Type I is rejecting a true null hypothesis (probability alpha); Type II is retaining a false null hypothesis (probability beta).
L.I.N.E. assumptions: The four simple-regression assumptions checked via residual plots: Linearity, Independence of errors, Normality of errors, and Equal variance (homoscedasticity).
R-squared vs adjusted R-squared: R-squared = SSR / SST is the proportion of Y variation explained; adjusted R-squared penalises adding predictors, so it is the fair metric for comparing models of different size.
Durbin-Watson statistic: Diagnostic on a 0 to 4 scale: about 2 means no autocorrelation, below 2 positive, above 2 negative.
Dummy variable: A 0/1 indicator for a category; use c minus 1 dummies for c levels (the dummy-variable trap), with each coefficient a shift in the intercept versus the omitted reference.

Common acronyms: CLT · LLN · CV · IQR · CI · SST/SSR/SSE · SER · MLR · L.I.N.E. · DW.

Drill these as flashcards → Map them with Sia →

What students say

What students actually say about QBUS5001

Recurring themes from student reviews, paraphrased in our own words.

On difficulty

Reviewed as practically oriented, applying statistical methods to real business data
Seen as broadly useful groundwork across business areas
A standing market of private tutors suggests students seek extra support with the quantitative content

Practise these topics with Sia →

How students revise

A large pool of peer-shared notes and exam-prep materials indicates sustained study demand
Listed under the legacy title Quantitative Methods for Business, reflecting the rename to Foundations of Data Analytics for Business

Make your own notes and flashcards →

Before the exams

The resource mix is weighted toward tutorial worked-solutions and exam preparation, consistent with a computation-heavy, exam-driven unit

Get instant walkthroughs →

Recurring student opinions, paraphrased and aggregated, not official course information.

Set texts

The prescribed reading

The syllabus references map straight onto these.

Primary e-text

Basic Business Statistics: Concepts and Applications

Selvanathan, Selvanathan & Keller (Australia/NZ Cengage edition). Publisher page

Where it fits

Prerequisites, related units & why it matters

No prerequisites. Assumed knowledge is basic competency reading tabulated data, working with Microsoft Excel, and high-school mathematics. The unit is prohibited against ECMT5001 and QBUS5002, marking it as the entry-level analytics and statistics requirement.

BUSS1020Quantitative Business AnalysisUndergraduate counterpart covering the same business-statist BUSS6002Data Science in BusinessPostgraduate follow-on that moves from classical inference i DATA3404Scalable Data ManagementAdjacent data unit on the engineering side, handling storage ExploreAll Business & Economics unitsUSyd discipline hub

Why it matters beyond the grade. The probability, inference and regression toolkit QBUS5001 builds is the prerequisite literacy for analytics, finance, marketing-analytics and econometrics electives, and for reading data-driven business reports critically and ethically.

Your QBUS5001 study toolkit

Study the unit with Sia, not just read about it

Each tool already knows QBUS5001: your syllabus, your texts, and where the marks are. Grouped by how you study, from first contact to exam week.

1 · Learn itunderstand the material

💬AI tutorAsk anything about QBUS5001 and get step-by-step answers. 📤Explain my notesUpload your slides or lecture and Sia breaks them down. 📑Topic summariserCondense a week into the essentials you actually need.

2 · Practise ittest yourself

📝Practice quiz generatorUnlimited exam-style MCQs and short-answer on any topic. 📊Past paper analysisSee what the exams actually test, topic by topic. ✓Assignment & problem helpWork through problem sets one step at a time.

3 · Revise & cramlock it in before the exam

🃏FlashcardsKey concepts and formulas as a spaced-repetition deck. 📋Cheatsheet maker NEWAuto-build a one-page exam cheatsheet for the unit. 🧠Mindmap generatorSee how the topics connect on one visual map.

4 · Discuss itcompare notes

👥Community Q&AAsk other QBUS5001 students and share what worked.

FAQ

Frequently asked questions

Is there a final exam, and how much is it worth?

Yes, a closed-book final exam worth 50%, covering Modules 1 to 12, with an 8-page formula sheet supplied separately. The exact date is announced on the Ed discussion board.

Is the final exam a hurdle?

Yes. You must score at least 40 out of 100 in the final exam and have an overall mark of at least 50 across all tasks to pass. A sub-40 final fails the unit regardless of your other marks.

What does the in-semester test cover and when is it?

It is in Week 7, worth 25%, and covers Modules 1 to 5: descriptive statistics, probability, random variables and distributions, and sampling distributions and the CLT. It is closed-book with a supplied formula sheet, a non-programmable calculator allowed, and answers to four decimal places.

How does the weekly homework work?

There are 10 weekly homeworks on the Mindtap/Aplia platform, due each Sunday 23:59 from Week 2, and only your best 5 marks count toward the 10%. It is designed as low-stakes practice.

Are there prerequisites?

None. Assumed knowledge is basic competency reading tabulated data, working with Microsoft Excel, and high-school mathematics. The unit is prohibited against ECMT5001 and QBUS5002.

What software do I need?

Microsoft Excel 365 desktop with the Analysis ToolPak add-in, used for Regression and functions like NORM.DIST, BINOM.DIST, POISSON.DIST and T.INV.2T. The desktop app is needed for full functionality, not the web app.

Is there group work?

Yes, a group assignment worth 15%, due in Week 12.

What textbook does it use?

Basic Business Statistics: Concepts and Applications by Selvanathan, Selvanathan & Keller (the Australia/NZ Cengage edition), accessed through Cengage Mindtap with homework on the Aplia platform. Module readings map to its statistics chapters.

Study QBUS5001 with Sia

Work through random variables, estimation, hypothesis testing (one population) and the rest of the unit with a tutor that knows it and quizzes you on the topics the assessments weight most heavily.

Start studying with Sia