STAT7038

Jun 1, 2026

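I got hold of an exam-prep bible for a course I'm taking! Break the course down for me and tell me what to focus on when revising. My final is coming up, so the more detail the better: I need the formulas, the definitions, and the core exam points. Please save me.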

First, the one-line survival rule: the STAT7038 final does not test whether you have memorised formulas. It tests whether you can read the R output you are given, carry it through the correct procedure to a finished calculation, and state the conclusion in the language of the problem. The exam permits one A4 double-sided typed or printed notes sheet, and the calculator, the R output, and the t/F/normal tables are all supplied, so your revision time is best spent on procedure chains, reading R output, and common traps. [1]

[1] Source: asksia-bible-stat7038-bilingual(1).pdf

It's the exam-prep window. Use the chapters to build your one A4 typed sheet: the boxed formulas, the 5-step test ritual, the CI-vs-PI rule, the R-output map. The blueprint overleaf shows the weights, the format and exactly what the sheet should carry.

★ The KILLER HOOK - what you may bring, what they give you

The final permits ONE A4-size sheet of notes, DOUBLE-SIDED, TYPED OR PRINTED. The exam itself supplies the calculator (hp300s+), the R outputs, the statistical tables (t, F, normal) and scribble paper. So do not waste sheet space on table values or on R syntax - you read R, you don't run it. Spend the sheet on the boxed formulas, the decision rules, the CI/PI distinction, and a worked template for each test. This guide is written to populate exactly that sheet.

How this book was built - and the two-layer rule

Standard statistical results - the SLR model, least squares, the ANOVA identity, the t/F sampling distributions - are universal facts, stated plainly. The unit's own framing and the lecturer's specific datasets are paraphrased and re-numbered; every worked example here uses our own numbers, never copied from slides or past papers. The course follows Kutner, Applied Linear Regression Models (4th ed.). Verify dates and weights against your own Canvas (wattle.anu.edu.au), as details can shift between cohorts.
THE EXAM BLUEPRINT: FINAL 70% · 5 JUNE

70% final, open to one typed sheet. Online Quiz 5% · In-tutorial Quiz 10% · Assignment 15% · Final 70%.

Your mark is built from four pieces, but the final exam dominates at 70%. Both quizzes are redeemable (the exam mark replaces them if higher); the assignment is non-redeemable. So the whole game is the final - and it is an open-sheet, R-output-supplied paper.

At a glance: 70% final exam · 1 A4 typed sheet, 2-sided · Weeks 1-12 examinable scope.

The four assessment pieces

| Component | Weight | When / detail |
| --- | --- | --- |
| Final examination - MCQ + short calc + written | 70% | 5 June, 2pm · 15 min reading + 180 min |
| In-tutorial Quiz (redeemable) | 10% | Wk 7 · topic: simple linear regression |
| Assignment (non-redeemable, R) | 15% | Wk 11 · due 21 May, 5pm |
| Online Quiz (redeemable, Canvas) | 5% | Wk 5 · no extensions |

[2] Source: asksia-bible-stat7038-bilingual(1).pdf

Regression Modelling: ONE LINE THROUGH A CLOUD OF POINTS - AND EVERYTHING YOU CAN INFER, TEST AND PREDICT FROM IT.

ANU STAT7038 · bilingual visual close-reading · LaTeX-typeset formulas · one printed A4 sheet allowed · the full linear regression workflow (diagnostics/selection). STAT7038 · AUSTRALIAN NATIONAL UNIVERSITY · BILINGUAL EDITION: English leads with Chinese alongside; exam points and terminology keep the original English terms.

The final exam is 70% of your mark. The good news: you may bring ONE A4 double-sided, typed or printed notes sheet, and the exam supplies the calculator, the R outputs and the statistical tables. So success is not about memory - it is about reading R output and driving the method on fresh numbers. This book teaches exactly that, and helps you build that one compliant sheet.

Independent study companion. Not affiliated with or endorsed by the Australian National University. Corrections: takedowns@asksia.ai

PREFACE · HOW TO USE THIS BOOK

Read R output, drive the method. Open-sheet exam - it tests whether you can execute & interpret, not recall.

This is not a transcript of the lecture slides. It is a self-contained course in every technique STAT7038 examines - each model stated plainly, each estimator derived to a formula, each test shown on a worked example with real arithmetic, and the matching R `summary()` / `anova()` output read line by line. The exam is open to one typed A4 sheet and supplies the calculator, the R printouts and the statistical tables - so the examiner cannot test what you remember, only whether you can do regression under time. That is what these pages drill.

1 · LEARN. You haven't seen the lecture yet. Read a chapter top to bottom. Each concept is an AHA-unit: the equation or picture → a plain explainer → the method in numbered steps → a fully worked example → the trap. The diagrams are original schematics of the standard statistics - learn the idea cold.

2 · DRILL. You've done the lecture and the lab. Cover the worked steps and re-derive each answer with the supplied calculator. Then read the paired R output and tick off every number - Estimate, Std. Error, t value, Pr(>|t|), the F-statistic. That reading speed is the exam.

3 · EXAM

[3] Source: asksia-bible-stat7038-bilingual(1).pdf

✓ The strategy this dictates - the recurring chains

Every exam item is a procedure on supplied numbers. Drill the chains: $S_{xy}/S_{xx} \to b_1, b_0$; $SSE \to MSE \to se(b_1) \to t \to$ decision; $SST = SSR + SSE \to F, R^2$; $x_h \to$ CI (mean) or PI (new obs). Show every line for the short-answer written parts - method marks are real. Put each chain, once, on your sheet.

What "R supplied" means for your sheet

| Supplied - don't cram | You must be able to do / read |
| --- | --- |
| t, F, normal tables | Pick the right critical value & df |
| `summary(lm)` printout | Read off b, se, t, p; recover MSE, n |
| `anova(lm)` printout | Read SSR, SSE, df; form F = MSR/MSE |
| hp300s+ calculator | $S_{xx}$, $S_{xy}$, $b_1$, CIs by hand |

★ The exam format - open one sheet, calculator & tables supplied

Three question styles: multiple-choice, short-answer calculation, and short-answer written. Covers all lectures & tutorials, Weeks 1-12. Permitted: one A4 double-sided typed/printed notes sheet. Supplied in the paper: hp300s+ calculator, R outputs, statistical tables, scribble paper. Significance level 5% unless stated; log means natural log.

[27] Source: asksia-cheatsheet-stat7038(1).pdf

STAT7038 Regression Modelling · AUSTRALIAN NATIONAL UNIVERSITY · RSFAS · EXAM REVISION Sem 1 2026 · SIDE 1 OF 2: SLR · estimation · inference · R output

0 . Exam Blueprint (READ FIRST)

* Final = 70% · 180 min (+15 read) · MCQ + short-answer calculation + written, all of weeks 1-12. Also: online quiz 5%, in-tutorial quiz (SLR) 10%, assignment 15%.
* The exam permits ONE A4 double-sided typed or printed notes sheet - and a calculator, R outputs & statistical tables are SUPPLIED. So this sheet IS a compliant memory aid: spend zero space on tables/R syntax, max out formulae, decision rules, cut-offs & method recipes. Every hypothesis test shows all 5: (1) hypotheses, (2) test statistic, (3) critical value w/ df (or p), (4) decision, (5) conclusion in context. $\alpha = 5\%$ unless stated; log = natural log.

SIA > The killer combo: read the supplied R output, pull the numbers, plug into the formula on this sheet, run the 5-step test. Memorise the recipes & cut-offs, not the tables.

1 . Building Blocks

NOTATION. Parameter ($\beta_1$, $\sigma^2$) = fixed unknown; estimator ($b_1$, $\hat\sigma^2$) = random, from the sample; estimate = its realised value. The sampling distribution is the estimator's distribution over repeated samples.

CORE SUMS (MEMORISE)

$\bar x = \frac{1}{n}\sum x_i$ · $S_{xx} = \sum (x_i - \bar x)^2$ · $S_{yy} = \sum (y_i - \bar y)^2$ · $S_{xy} = \sum (x_i - \bar x)(y_i - \bar y)$

$s_x^2 = S_{xx}/(n-1)$ · $\mathrm{Cov} = S_{xy}/(n-1)$ · $r = S_{xy}/\sqrt{S_{xx}S_{yy}} = S_{xy}/\big((n-1)\,s_x s_y\big)$

EXPECTATION / VARIANCE RULES

$E(aX+b) = aE(X)+b$ · $\mathrm{Var}(aX+b) = a^2\,\mathrm{Var}(X)$

$\mathrm{Var}\big(\sum a_i Y_i\big) = \sum a_i^2\,\mathrm{Var}(Y_i) + 2\sum_{i<j} a_i a_j\,\mathrm{Cov}(Y_i, Y_j)$ (independent → Cov terms vanish)

$r \in [-1, 1]$ measures linear association only. These E/Var rules drive every variance derivation below (e.g. $\mathrm{Var}(b_1)$ treats the $y_i$ as the only random part, since the $x_i$ are fixed and the $\varepsilon_i$ carry all the randomness).

Trap: correlation $\neq$ causation; a strong $r$ can be driven by one outlier or a lurking variable; $r = 0 \Rightarrow$ no linear link, not "no relationship".

1b . The 5 SS Identities (KEEP HANDY)

Everything in inference is built from five sums; learn how each is recovered from the others:

$SST = S_{yy} = \sum (y_i - \bar y)^2$ (df $n-1$)

$SSR = b_1 S_{xy} = b_1^2 S_{xx} = R^2 \cdot SST$ (df 1)

$SSE = SST - SSR = \sum e_i^2$ (df $n-2$)

$MSE = SSE/(n-2)$ · $MSR = SSR/1$

$R^2 = SSR/SST$ · $r = \pm\sqrt{R^2}$ (sign of $b_1$)

If an R table hides one cell, back it out: e.g. $SSE = SST(1-R^2)$, or $SSR = F \cdot MSE$. Degrees of freedom always add: df_total $(n-1)$ = df_reg + df_err. In SLR that's $(n-1) = 1 + (n-2)$; in MLR $(n-1) = (p-1) + (n-p)$. A quick df check catches most table-fill slips. The "$p$" you divide by is the number of estimated parameters including the intercept - count the rows in the coefficient table. Get $p$ wrong and every df, MSE and cut-off downstream is wrong too. In SLR $p = 2$.

2 . SLR Model + LINE

SIMPLE LINEAR REGRESSION

$y_i = \beta_0 + \beta_1 x_i + \varepsilon_i$, $\varepsilon_i \sim \text{iid } N(0, \sigma^2)$ $\Rightarrow$ $y_i \sim \text{ind } N(\beta_0 + \beta_1 x_i, \sigma^2)$

$E(y \mid x) = \beta_0 + \beta_1 x$ (mean response)

Four assumptions - LINE: Linearity of $E(y \mid x)$; Independence of $\varepsilon_i$; Normality of $\varepsilon_i$; Equal variance $\mathrm{Var}(\varepsilon_i) = \sigma^2$. The $x_i$ are fixed, measured without error.

Interpret: $\beta_1$ = expected change in $y$ per 1-unit increase in $x$; $\beta_0$ = expected $y$ at $x = 0$ (often a meaningless extrapolation).

3 . Least-Squares Estimation
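The core sums and the five SS identities quoted above chain together into the estimation step. The following is a minimal sketch of that chain on a tiny data set, written in Python purely as an arithmetic check (the exam itself only asks you to read R output and use the calculator). The data values are invented for illustration and do not come from the course materials; the intercept formula $b_0 = \bar y - b_1 \bar x$ is the standard least-squares result, stated here as an assumption because the excerpt stops before giving it.

```python
import math

# Hypothetical data, invented for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(x)

x_bar = sum(x) / n
y_bar = sum(y) / n

# Core sums
Sxx = sum((xi - x_bar) ** 2 for xi in x)
Syy = sum((yi - y_bar) ** 2 for yi in y)
Sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

# Least-squares estimates (standard SLR formulas)
b1 = Sxy / Sxx
b0 = y_bar - b1 * x_bar

# The five SS identities
SST = Syy
SSR = b1 * Sxy
SSE = SST - SSR
MSE = SSE / (n - 2)          # SLR: p = 2 estimated parameters
R2 = SSR / SST
r = math.copysign(math.sqrt(R2), b1)   # r takes the sign of b1

# Cross-check: SSE computed directly from the residuals
sse_direct = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))

print(f"Sxx={Sxx}, Sxy={Sxy}, b1={b1:.3f}, b0={b0:.3f}")
print(f"SST={SST}, SSR={SSR:.3f}, SSE={SSE:.3f}, MSE={MSE:.3f}, R2={R2:.3f}, r={r:.4f}")
print(f"SSE from residuals = {sse_direct:.3f}")
```

Re-doing the same numbers by hand and matching each printed value is one way to rehearse the chain before writing it on the notes sheet.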

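1) The course's high-frequency core chains (practise until you can write them with your eyes closed)

Practise each of the chains below 3-5 times: exam questions are essentially "given numbers / given R output → walk the chain."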
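As one illustration of "given R output → walk the chain", here is a minimal sketch of how the quantities hidden in a `summary(lm)` printout for a simple linear regression can be backed out from a few printed numbers. It is written in Python only as a calculator check. The four input values are hypothetical, not taken from any course printout, and the sketch relies on standard SLR facts: the residual standard error is $\sqrt{MSE}$, $se(b_1) = \sqrt{MSE/S_{xx}}$, and $F = t^2$.

```python
# Hypothetical values, as they might be read off a summary(lm) printout (SLR).
estimate = 1.50    # slope row, "Estimate" column
std_error = 0.30   # slope row, "Std. Error" column
resid_se = 2.0     # "Residual standard error"
df_error = 18      # "... on 18 degrees of freedom"

# t statistic for H0: beta1 = 0
t_stat = estimate / std_error

# Sample size: in SLR the error df is n - 2
n = df_error + 2

# Residual standard error is sqrt(MSE)
MSE = resid_se ** 2
SSE = MSE * df_error

# In SLR, F = t^2 and SSR = F * MSE (MSR = SSR / 1)
F = t_stat ** 2
SSR = F * MSE
SST = SSR + SSE
R2 = SSR / SST

# se(b1) = sqrt(MSE / Sxx), so Sxx can be recovered too
Sxx = MSE / std_error ** 2

print(f"t = {t_stat:.3f}, n = {n}, MSE = {MSE:.3f}")
print(f"SSE = {SSE:.3f}, SSR = {SSR:.3f}, SST = {SST:.3f}")
print(f"F = {F:.3f}, R2 = {R2:.4f}, Sxx = {Sxx:.3f}")
```

The decision step still needs the critical value from the supplied t or F table at the right df; that lookup is deliberately left out of the sketch.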

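(Optional) If you like, I can also remember your three weakest areas and the traps you fall into most often, so that each time you come back I can take you through a quick review in that order.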

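0) First, nail down the exam rules and how marks are awarded (this decides how you revise)

1) The course's high-frequency core chains (practise until you can write them with your eyes closed)

Chain A: SLR estimation (hand-compute the line: from $S_{xx},S_{xy}$ to $\hat y$)

Chain B: Extract every quantity you need from R's `summary(lm)` output (the best return on effort)

Chain C: ANOVA / $F$ / $R^2$ (in SLR, $F=t^2$ is a relationship you should expect to be examined)

Chain D: Write out every hypothesis test in full with the 5-step method (method marks are real)

Chain E: CI vs PI (the classic "fatal trap" of STAT7038)

2) Diagnostics: the heavily examined Week 5–6 material

2.1 The four LINE assumptions (the exam will ask you to "argue from the plots")

2.2 The three classic plots you must be able to read (exam style: given a plot, choose the best conclusion)

3) Multiple regression (MLR): you will find it is essentially SLR in "matrix clothing"

4) Multicollinearity and VIF: the favourite question pattern is "symptom → diagnosis → remedy"

5) Model selection: Cp / AIC / BIC / PRESS / stepwise

6) Interpreting coefficients in log models (examined very often, and very easy to get wrong)

7) How to turn your one double-sided A4 sheet into a mark-scoring weapon (following the principles in the material)

8) Final sprint: I suggest a two-round revision method (following this material's exam logic)

9) Give me two pieces of information and I can refine this into your personal revision checklist plus a draft A4 layout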
