MAST90139

May 29, 2026

All files

我拿到了一份我在上的课的考前宝典！你帮我拆解一下这节课，告诉我复习重点，我快final了越详细越好，公式，定义，核心考点我都需要，救救孩子

当然可以。我会只按你这份“考前宝典/cheatsheet + bible”里明确写到的内容，把 MAST90139（Statistical Modelling for Data Science）拆成一张期末 Final 复习地图：你照着练，就能把最容易拿分的部分先抓牢。
- 重要提醒：你材料里没有给出期末权重、考试时长、开卷/闭卷等细节；它反复强调要去查官方 MAST90139info.pdf / Handbook 核实，别自己猜。[3]Source: asksia-bible-mast90139-bilingual.pdfThree assignments, one final 三次作业,一场期末 3 Gradescope assignments (dates confirmed) + a final exam - weights per the subject guide 3 次 Gradescope 作业(日期已确认)外加一场期末考试 -- 权重见课程指南 Your mark comes from three written assignments (submitted to Gradescope via Canvas) and a final examination. The assignment due dates are confirmed below; the exact weights, exam format and book- status are NOT in the supplied subject material - they live in the official MAST90139info . pdf. Do not assume numbers: check the handbook / subject guide. 你的成绩来自三次书面作业(通过 Canvas 提交到 Gradescope)和一场期末考试。作业的截止日期已在下方确认;而确切的权重、考试形式与可否带书状态并不在所提供的课程材料中 -- 它们记载于官方的 MAST90139info. pdf。不要臆测数字:请查阅课程手册/课程指南。 ! Weights & exam format are NOT confirmed here 此处的权重与考试形式尚未确认 The supplied source confirms three assignments, their due dates and Gradescope submission, plus a final exam - but it does not state the percentage weights, the exam length, or whether the exam is open- or closed-book. Treat every weight as "confirm against the subject guide". The numbers below are structure, not weights. 所提供的来源确认了三次作业、它们的截止日期与 Gradescope 提交方式,外加一场期末考 -- 但它没有说明各项的百分比权重、考试时长,或考试是开卷还是闭卷。把每个权重都当作“对照科目指南核实”。下面的数字是结构,不是权重。 The four assessment pieces 四项评估内容 Component What it drills (evidenced) Due / format Assignment 1 - binary / grouped logistic Logistic regression study (domestic-violence predictors); odds-ratio interpretation 11:59pm Thu 2 Apr 2026 . Gradescope Assignment 2 - binomial dose-response Beetle-mortality dose-response; binomial / grouped logistic, link comparison 11:59pm Fri 1 May 2026 . Gradescope Assignment 3 - ordinal / multinomial Coal-miner pneumoconiosis; 3-category ordinal response (polr) 11:59pm Sun 31 May 2026 . Gradescope Final examination Standard MAST graduate stats paper; read-the-R-output across the whole GLM arc Exam period · format: confirm MAST90139 . Statistical Modelling for Data Science i A plagiarism-declaration quiz is a gate, not a grade 学术诚信声明测验是一道门槛,而非分数 [8]Source: asksia-cheatsheet-mast90139.pdfLR for each single term confint() profile-likelihood CIs multinom coef matrix (row/category) polr slopes (-y!) + thresholds Reflex: deviance + df -> x2 adequacy (grouped); AD + Adf > nested LR; exp (coef) > the interpretation. 18 . High-Yield Trap DON'T LOSE MARKS List · Wald can fail (Hauck-Donner) => prefer LR with sparse data 19 . AssessmentCONFIRM THE INFO SHEET 3 assignments + final exam. Due dates confirmed (Gradescope): A1 logistic 2 Apr, A2 binomial dose- response 1 May, A3 ordinal/multinomial 31 May 2026. Exact weights, exam format & book-status are NOT confirmed - check the MAST90139 subject guide / Handbook. ---- SIA > Half the marks are "show the deviance subtraction / show e^ß. " Method marks survive even if the final figure slips - always write the working and the one-line interpretation. Formula Belt SIDE 2 Poisson: log p = xTB . e^ß = rate ratio offset: log p = log t + xTB table: log Aij = p+a+B+(ap) . Q=P11P22/p12P21 prop-odds: log P(<r)/P(>r) = 0,+XTY AIC=D+2q . BIC=D+q log n . AD~x2(Aq) Revision aid . check the handbook / subject guide for exam conditions . @ 2026 good luck. name the link . show the deviance. asksia. ai/cheatsheet/ unimelb-mast90139 · side 2/2 AskSia CHEATSHEET SERIES Compiled by AskSia . mapped to the MAST90139 syllabus . asksia. ai/cheatsheet/unimelb-mast90139 THE RESPONSES . Poisson . offsets & rate ratios . Log-Linear tables . Multinomial logit . Proportional-odds . Model checking . REVISION SHEET . ALL TOPICS [2]Source: asksia-bible-mast90139-bilingual.pdfA GLM course taught in R- method and interpretation, not memorised slides 一门用 R 讲授的 GLM 课程 -- 重在方法与解释,而非死记幻灯片 This is not a transcript of the lecture decks. It is a self-contained tour of every model MAST90139 examines - logistic, binomial, Poisson, log-linear, multinomial, proportional-odds - built on the one engine that unifies them, the generalised linear model. Each concept is presented as an AHA-unit: a displayed equation and/or a diagram, a plain-English explainer, the method as numbered steps, a worked R-output reading, and the trap that catches people in the exam. 这并不是讲义幻灯片的逐字记录。它是一次自成体系的巡览,覆盖 MAST90139 所考查的每一个模型 -- logistic、二项、 Poisson、对数线性、多项、比例优势 -- 这些模型都建立在统一它们的同一台引擎之上,即广义线性模型。每个概念都以一个 AHA 单元呈现:一个展示性方程和/或一幅图示、一段通俗英语的讲解、以编号步骤给出的方法、一次对 R输出的实战解读,以及那个在考试中让人栽跟头的陷阱。 A 1 . LEARN 1 · 学习 Before the lecture. Read a chapter top to bottom. The GLM framework chapter is the spine - random / systematic / link, the exponential family, deviance and IRLS - every later model is just that template with a new distribution and link. Learn it cold. 课前。把一章从头读到尾。 GLM 框架章是脊梁––随机/系统/链接、指数族、偏差与 IRLS -- 后面每个模型都只是这套模板换上新的分布和链接。要烂熟于心。 B 2 . DRILL 2 · 操练 After the applied lab. Cover the worked interpretation and re- derive it: name the family + link, compute D or AD, compare to x2, then say what exp (B) means in words. That triple is the exam. 应用实验课后。盖住解出的解释自己重推一遍:说出分布族+链接,算出 D 或 △D,对照x2,再用文字说出 exp(β)的含义。这三件套就是考试。 C 3 · EXAM 3 · 应考 SWOTVAC. Walk the chapters as a checklist of model families. The blueprint overleaf maps the three Gradescope assignments onto the chapter arc - they are your rehearsal for the paper. 复习周(SWOTVAC)。把各章当作模型分布族的清单来过一遍。背面的蓝图把三次 Gradescope 作业映射到章节主线上 -- 它们就是你为正式试卷做的预演。 MAST90139 . Statistical Modelling for Data Science ! The single most important thing about MAST90139 关于 MAST90139 最重要的一件事 It looks like one model with many faces - and it is. Once you see that logistic, Poisson, log-linear and ordinal regression are all the same three-part GLM (a random component from an exponential family, a linear predictor n=XB, and a link g ()=n), the course collapses to one workflow: fit - assess fit by deviance - compare nested models by AD - interpret exp(@). Don't memorise six models; memorise one and swap the distribution and link. 它看起来像一个有许多面孔的模型 -- 而它确实如此。一旦你看清 logistic、泊松、对数线性和有序回归全都是同一个三部分 GLM(一个来自指数族的随机成分、一个线性预测子 n=Xβ,以及一个链接 g(μ)=n),整门课就坍缩为一套工作流程:拟合→ 用偏差评估拟合→用△D 比较嵌套模型→解释 exp(β)。不要去背六个模型;背一个,然后替换分布和链接。 i How this book was built - and the two-layer rule 本书是如何构建的 -- 以及两层规则 Standard GLM theory (the exponential family, IRLS, deviance, the canon of link functions) is stated plainly - it is universal and lives in Faraway, McCullagh & Nelder, Agresti. The subject's own framing and the lecturer's specific example numbers are paraphrased and re-numbered, never copied from slides or past papers. The course runs on A/Prof Qian's lecture notes with Faraway, Extending the Linear Model with R and the faraway package. Verify all dates and weights against the official MAST90139info. pdf subject guide / University Handbook, as details shift between cohorts. 标准 GLM 理论(指数族、IRLS、偏差、链接函数的经典清单)被平实地陈述 -- 它是通用的,见于 Faraway、 McCullagh & Nelder、Agresti。本科目自身的框架表述与讲师特定的算例数字均经改述并重新编号,绝不照抄自幻灯片或往年试卷。本课程以 Qian 副教授的讲义为主,配合 Faraway 的 Extending the Linear Model with R 和 faraway 包。所有日期与权重请对照官方 MAST90139info. pdf 科目指南/大学手册核实,因为细节会因届别而变。 MAST90139 . Statistical Modelling for Data Science THE BLUEPRINT CONFIRM WEIGHTS - THE ASSESSMENT BLUEPRINT

5）模型选择与信息准则：AIC / BIC（会算会比较）
你的 cheatsheet 明确给公式：
- $$\mathrm{AIC}=D+2q$$
- $$\mathrm{BIC}=D+q\log n$$
  并给了“BIC 惩罚复杂度更狠”的例子对比逻辑。[24]Source: asksia-cheatsheet-mast90139.pdfBIC M1 M2 190 213 AIC = D+2q; BIC = D+q-log200 (log200=5. 3). M2 fits a touch better (D 176 vs 180) but the extra 3 params cost more than they earn: both AIC (188<190) and BIC (201<213) prefer M1 . BIC's gap is larger - it penalises complexity harder. 15f . The Modelling Loop PUT IT TOGETHER 1. Pick the response type => family + link 2. Fit by glm/multinom/polr (IRLS / ML) 3. Check fit: deviance vs df (grouped) or HL (binary); residual plots 4. Select: AD / drop1 / AIC-BIC step 5. Interpret: e^ß with the right link + a CI 16 . e^B . The Master Table * NAME THE LINK FIRST MODEL LINK E^B MEANS Logistic logit odds ratio (success vs not) Poisson log

7）你下一步给我 2 个信息，我就能把“复习地图”改成你的个人冲刺表
你告诉我：
- 你的 Final 是哪天（具体日期）？
- 你最怕哪一块（选 1–2 个）：
  - A Logistic：OR 解释 / ROC-AUC / grouped vs ungrouped
  - B Poisson：offset + RR / overdispersion
  - C Log-linear 表（列联表、独立性、OR/交叉乘积比）
  - D Multinomial：baseline logit 输出矩阵怎么读
  - E Ordinal：polr 输出 + 符号陷阱 + cumulative OR
  - F 模型选择：$\Delta D$、AIC/BIC、drop1/step 的逻辑
我就按你选的模块，把每块补成：
- 必背定义（一句话）
- 必会公式 + 何时用
- 读输出“填空模板”（你直接照着写就能拿分）
- 3 个最常见扣分点 & 如何规避

（可选）如果你愿意，我也可以把你“最怕的两块 + 最常踩的陷阱（比如 polr 符号/offset/GoF 误用）”记住，之后你每次来我都优先带你复习这些。

Ask Sia for quick explanations, examples, and study support.

Let's Get in Touch

Products

AskSia 3.0 Pro AskSia Super Chrome macOS Windows Pricing

AI Tools

YouTube Summarizer Flashcard Generator Mindmap Generator Quiz GeneratorAI DetectorCitation Generator

Work With Us

For Institutions Student BeansAffiliatesPartnershipsPress & MediaCareers

Company

About UsContact UsLegal & PoliciesService Agreement Grade Confidence Guarantee FAQs

MAST90139

0）一句“救命总纲”：这门课最终考什么？

1）考试最高频“得分点”（你要练到条件反射）

1.1 读输出的固定流程（你每题都按这个模板写）

2）整门课的“骨架定义”（必须会背 + 会用）

2.1 GLM 的三部分（三件套定义）

2.2 指数族（Exponential family）与均值-方差关系（你至少要知道“为什么 GLM 合理”）

3）最常考模型族：你该背哪些“公式 + 一句话解释”

3.1 Logistic 回归（本课“最赚钱”的章节）

3.2 Poisson 回归（计数 & 率：offset 是送分题）

3.3 有序分类：比例优势模型（`polr()`）与“符号陷阱”

3.4 多分类（名义/无序）：baseline-category logit（`multinom()`）

4）Deviance / $\Delta D$：你必须会的两种用法（考试经常“半题都是它”）

4.1 两个工作（材料明确写了“Two jobs of deviance”）

4.2 考试策略（材料的强提醒）

5）模型选择与信息准则：AIC / BIC（会算会比较）

6）按“你快 final 了”的最省命复习计划（按材料的应考法）

6.1 先把“模板化必考技能”刷熟（优先级最高）

6.2 按作业对应的模型族做“专项复盘”（因为材料暗示作业就是 rehearsal）

6.3 “不要丢分陷阱清单”（你考前必须背一遍）

7）你下一步给我 2 个信息，我就能把“复习地图”改成你的个人冲刺表

Let's Get in Touch

MAST90139

0）一句“救命总纲”：这门课最终考什么？

1）考试最高频“得分点”（你要练到条件反射）

1.1 读输出的固定流程（你每题都按这个模板写）

2）整门课的“骨架定义”（必须会背 + 会用）

2.1 GLM 的三部分（三件套定义）

2.2 指数族（Exponential family）与均值-方差关系（你至少要知道“为什么 GLM 合理”）

3）最常考模型族：你该背哪些“公式 + 一句话解释”

3.1 Logistic 回归（本课“最赚钱”的章节）

3.2 Poisson 回归（计数 & 率：offset 是送分题）

3.3 有序分类：比例优势模型（polr()）与“符号陷阱”

3.4 多分类（名义/无序）：baseline-category logit（multinom()）

4）Deviance / $\Delta D$：你必须会的两种用法（考试经常“半题都是它”）

4.1 两个工作（材料明确写了“Two jobs of deviance”）

4.2 考试策略（材料的强提醒）

5）模型选择与信息准则：AIC / BIC（会算会比较）

6）按“你快 final 了”的最省命复习计划（按材料的应考法）

6.1 先把“模板化必考技能”刷熟（优先级最高）

6.2 按作业对应的模型族做“专项复盘”（因为材料暗示作业就是 rehearsal）

6.3 “不要丢分陷阱清单”（你考前必须背一遍）

7）你下一步给我 2 个信息，我就能把“复习地图”改成你的个人冲刺表

Let's Get in Touch

3.3 有序分类：比例优势模型（`polr()`）与“符号陷阱”

3.4 多分类（名义/无序）：baseline-category logit（`multinom()`）