POPH90111
May 29, 2026
All files
- 我来把你这门课 POPH90111 Genetic Epidemiology(遗传流行病学) 按“考什么、怎么拿分”彻底拆开,给你一份期末/作业冲刺复习地图(按你这份 bible + cheatsheet 的框架来)。
- 先给你一个最关键的“定位句”:
- 这门课不是让你背一堆概念,而是一条逻辑流水线:UNDERSTAND(是否有遗传作用)→ DISCOVER(哪些变异/是否因果)→ CHARACTERISE(风险多大/如何被修饰)→ USE IN PRACTICE(要不要筛查/怎么用)。做题永远沿着流水线选方法。[1]Source: asksia-bible-poph90111-bilingual.pdfA 1 . LEARN 1· 学习 You haven't watched the module yet. Read a chapter top to bottom. Every concept is an AHA-unit: diagram - explainer + method-in-steps - analogy + trap. The figures are original schematics of the standard genetic-epi canon - learn the idea cold before the numbers. 你还没看过该模块。从头到尾读 一章。每个概念都是一个 AHA 单元:图示→讲解→ 分步方法 →类比→陷阱。图均为标准遗 传流行病学经典内容的原创示意 图 -- 先把思路彻底吃透,再上 数字。 B 2 . DRILL 2 · 演练 You've watched the module and done the Q&A. Cover the worked steps and re-derive each OR, RR, SMR, h2, penetrance and NNT by hand. Then say aloud the interpretation and the one bias that would change your conclusion. 你已看过该模块并做完 Q&A。 遮住已演算的步骤,亲手重新推 导每一个 OR、RR、SMR、h2、 外显率和 NNT。然后大声说出解 读,以及那一个会改变你结论的 偏倚。 C 3 . ASSIGNMENT 3 · 作业 The assignment is open. Use the design-appraisal grids and worked-formula boxes as your lookup. Every question is calculate - interpret -+ appraise; the Stata . do file (A1 Q1) does the arithmetic, but the interpretation and appraisal marks are yours to write. 作业是开卷的。把设计评价表格 和公式演算框当作你的查阅工 具。每道题都是计算→解读→ 评价;Stata 的 . do 文件(A1 Q1)负责算术,但解读和评价的 分数要你自己写出来。 POPH90111 . Genetic Epidemiology ! The single most important thing to understand about POPH90111 关于 POPH90111 最重要的一点认识 This subject is a logical pipeline, not a pile of topics: UNDERSTAND (is there a genetic role? - familial aggregation M1, heritability M2) - DISCOVER (which variants? - association/GWAS M3) - CHARACTERISE (how risky? - penetrance M5, modifiers M6, G×E M7) - USE IN PRACTICE (causal exposures via MR M4, screening M8). One mantra recurs at every stage: familial aggregation / high MZ-vs-DZ correlation is 'evidence for, but not proof of, an inherited genetic aetiology' - shared environment is always the rival explanation. Write that sentence whenever you interpret aggregation or heritability. 本科目是一条逻辑流水线,而非一堆零散主题:理解(是否存在遗传作用?→家族聚集性 M1、遗传力 M2)→发现 (哪些变异?→关联/GWAS M3)→刻画(风险有多大?→外显率M5、修饰因子 M6、GxE M7)→实践应用(经 MR 评估因果暴露 M4、筛查 M8)。一句口诀贯穿每一阶段:家族聚集性/高的 MZ-对-DZ 相关是“支持遗传性病因的 证据,但非证明” -- 共享环境永远是竞争性解释。每当你解读聚集性或遗传力时都写下这句话。 i How this book was built - and the two-layer rule 本书如何编成 -- 以及两层规则 Standard genetic and statistical canon (Hardy-Weinberg, LD measures, Falconer's h2, the ACE model, Manhattan/QQ plots, MR's three IV assumptions, NNT/NNS) is stated plainly - it is universal, non-copyrightable science, and every diagram here is our own clean drawing. The course's specific framing and its lecturer's example numbers are paraphrased and re-checked, never copied from slides. The subject runs on A/Prof Aung Ko Win's modules with Prof Mark Jenkins' lectures; verify dates and weights against your own Canvas (the LMS), as cohort details can shift. 标准的遗传学与统计学经典(Hardy-Weinberg、LD 测度、Falconer h2、ACE 模型、曼哈顿图/QQ 图、MR 三个 IV 假设、NNT/NNS)均平实陈述 -- 它们是普适的、不受版权保护的科学,此处每张图都是我们自己干净的绘制。本课程 特定的表述及其讲师的示例数字均为转述并重新核对,绝不照搬幻灯片。本科目以 A/Prof Aung Ko Win 的模块和 Prof Mark Jenkins 的讲课为基础;请对照你自己的 Canvas(即 LMS)核实日期与权重,因为各届的细节可能变动。 POPH90111 . Genetic Epidemiology THE BLUEPRINT - THE ASSESSMENT BLUEPRINT 90% IN TWO ASSIGNMENTS Three take-home pieces, no exam 三份带回家作业,无考试[9]Source: asksia-bible-poph90111-bilingual.pdf→ 9 Appraisal grid & practice design - measure - bias · interpretation drills → POPH90111 . Genetic Epidemiology i Why this order 为何采用这一顺序 We follow the course's own pipeline. UNDERSTAND (M1-M2) asks whether genes matter at all; DISCOVER (M3) finds the variants and (M4) tests causal exposures; CHARACTERISE (M5-M7) asks how risky a variant is and what modifies it; USE IN PRACTICE (M8) decides whether to screen. Assignment 1 covers M1-3, Assignment 2 covers M4-8 - so the chapter split mirrors exactly what each assignment will test. 我们遵循本课程自身的流水线。理解(M1-M2)追问基因究竟是否起作用;发现(M3)找出变异、(M4)检验因果暴 露;刻画(M5-M7)追问某变异风险有多大、由什么修饰;实践应用(M8)决定是否筛查。作业1 涵盖 M1-3,作业 2 涵盖 M4-8 -- 因此章节划分恰好对应每份作业将考查的内容。 POPH90111 . Genetic Epidemiology CH 1 . FOUNDATIONS - CHAPTER 1 . FOUNDATIONS EXTRA-MODULE 1 + MODULE 1 From DNA to family risk: the genetics you need first 从 DNA 到家族风险:先要掌握的遗传学 Genes vs environment . alleles, genotypes, haplotypes . modes of inheritance 基因 vs 环境 · 等位基因、基因型、单体型 · 遗传模式 Genetic epidemiology asks: do genes contribute to who gets a disease? Before you can measure that you need a working vocabulary - what a variant, genotype and haplotype are, and how disease risk depends on how many risk alleles a person carries. This chapter is the primer (Extra-Module 1) plus the first real method - familial aggregation (Module 1), the question of whether disease clusters in families more than chance allows. 遗传流行病学发问:基因是否对谁会患病有所贡献?在能够衡量这一点之前,你需要一套可用的词汇 -- 变异、基因型和单体 型是什么,以及疾病风险如何取决于一个人携带的风险等位基因数目。本章是入门导引(Extra-Module 1)外加第一个真正 的方法 -- 家族聚集性(Module 1),即探究疾病在家庭中聚集的程度是否超过偶然所允许的范围。 ★ What the MCQ + Assignment 1 ask here 此处 MCQ+作业一考什么 The open-book MCQ (10%) is built from exactly this material: the scope of genetic epidemiology, the role of familial aggregation, the study designs that measure it, and calculating & interpreting aggregation measures. Assignment 1 then asks you to build a 2x2, compute an OR or RR/SMR, interpret it, and appraise the design's bias. Master the calculation and the one-sentence interpretation - both score. 开卷 MCQ (10%)恰由这部分材料构成:遗传流行病学的范畴、家族聚集性的作用、测量它的研究设计,以及计算与解 读聚集性测度。作业1随后要求你构建 2×2 表,计算 OR 或 RR/SMR,解读它,并评价该设计的偏倚。计算和那句一 句话解读都要掌握- -两者都计分。 1. 1 Genes vs environment - why families resemble each other 1. 1 基因 vs 环境 -- 家庭成员为何彼此相像 Susceptibility to complex disease comes from both genes and environment (that is Learning Outcome 1). Relatives resemble each other for two reasons that are hopelessly tangled: they share DNA (closer kin share more) and they share an environment (families eat, live and are exposed alike). That tangle is the central problem of the whole subject. 对复杂疾病的易感性同时来自基因与环境(即学习成果1)。亲属彼此相像有两个难解难分地纠缠在一起的原因:他们共享 DNA (越近的亲属共享得越多)且共享一个环境(家庭成员吃得、住得、所受暴露都相似)。这种纠缠正是整门学科的核心问 - 题。 POPH90111 . Genetic Epidemiology D2 Shared GENES Shared ENVIRONMENT[17]Source: asksia-cheatsheet-poph90111.pdfPOPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 1 OF 2 Foundations -> heritability -> association SIDE 1/2 UNDERSTAND . Genetics primer . Hardy-Weinberg . LD . Familial aggregation (OR/RR/SMR/A) . Heritability (Falconer, ACE, liability) . Association & GWAS 0 . How To Use This READ FIRST * This subject is a pipeline: UNDERSTAND (is there a genetic role? - aggregation, heritability) -> DISCOVER (which variants? - LD, GWAS, MR) -> CHARACTERISE (how risky? - penetrance, modifiers, GxE) > USE (screening). Side 1 = understand & discover; side 2 = characterise & use. Assessment shape: online MCQ 10% (10 Qs, 1-week window) + written A1 40% (Modules 1-3) + A2 50% (Modules 4-8). All online / take-home - no invigilated exam. Every assignment task is one of three: (a) calculate + interpret , (b) discuss findings, (c) critically appraise a design. So the two high-value moves are: plug the right formula, then judge the design's bias. A Stata . do file is even handed out for A1 Q1 - expect software-based calculation, then a written interpretation. SIA > The mantra that earns marks everywhere: aggregation / high MZ-vs-DZ correlation is "evidence for, but not proof of, an inherited genetic aetiology. " Say it whenever you interpret aggregation or heritability. 1 . Genetics Primer EXTRA-MODULE 1 Locus = position on a chromosome. Allele = the base(s) there; minor allele = rarer one. Genotype = the pair (e. g. TT, TC, CC); homo- vs hetero-zygous. Mutation classes: silent (usually benign), missense (changes amino acid), nonsense (premature stop), frameshift indels (corrupt every downstream codon > usually pathogenic). CNV = larger gain/loss. 2 · Modes of Inheritance RISK INEQUALITIES Defined on Pr(phenotype | # risk alleles), not on "having" the trait: AUTOSOMAL Dominant: Pr(2)=Pr(1) > Pr(0) Recessive: Pr(2) > Pr(1)=Pr(0) Codominant: Pr(2) > Pr(1) > Pr(0) Carrier risk can be <1 (incomplete penetrance) and non-carrier risk >0 (phenocopies/sporadic). So a dominant variant can still have penetrance below 100%. SEGREGATION (PUNNETT) Each parent passes one randomly-chosen allele. Aaxaa -> 1/2 Aa, 1/2 aa (no AA). AaxAa -> 1/4 AA, 1/2 Aa, 1/4 aa => P(child carries ≥1 A)=3/4, P(AA)=1/4. Trap: the genotype gives the expected probability distribution, not the realised counts in a small sibship. 2b · Germline vs Somatic SAMPLE CHOICE · Inherited colorectal-cancer family risk -> germline > sample blood / buccal swab Tumour responds differently to chemo, no family history -> somatic > sample the tumour biopsy
- 另一个“得分口诀”(几乎每题都能写):
- 每题都按:Calculate → Interpret → Critically appraise(计算→解读→评价)。只写数字会丢大量分,解释句 + 偏倚/假设/适用人群才是分数密集区。[2]Source: asksia-bible-poph90111-bilingual.pdf! Critical appraisal - a good test can still be useless for screening 批判性评价 -- 再好的检验也可能不适合筛查 Sensitivity and specificity are intrinsic, but PPV depends on prevalence. A test with excellent sensitivity/specificity can be near-useless in a low-prevalence general population (PPV near zero) - exactly why genetic screening targets high-prior-risk groups. The ROC/AUC summarises discrimination across cut-offs; moving the cut-off trades sensitivity against specificity but never changes prevalence. 灵敏度和特异度是内在的,但PPV 取决于患病率。一个灵敏度/特异度极佳的检验,在低患病率的一般人群中可能近乎 无用(PPV 接近零) -- 这正是遗传筛查靶向高先验风险人群的原因。ROC/AUC 概括跨切点的区分度;移动切点是在 灵敏度与特异度之间权衡,但从不改变患病率。 Same habit on every item: calculate, interpret, appraise. The number alone is a fraction of the marks - the interpretation sentence and the design critique (bias, assumption, scale, prevalence) are where POPH90111 assessment lives. Write the appraisal sentence even when the question only asks for the number. 每道题都保持同一习惯:计算、解读、评价。光有数字只占一小部分分数 -- 解读句和设计批判(偏倚、假设、尺度、 患病率)才是 POPH90111 评估的所在。即便题目只问数字,也写出那句评价。 MARKER'S NOTE . POPH90111 . CALCULATE + INTERPRET + APPRAISE POPH90111 . Genetic Epidemiology AskSia Library VISUAL STUDY BIBLE . ASKSIA POP. & GLOBAL HEALTH SEMESTER 1 . 2026 p < 5x10-8 - THE COMPLETE STUDY BIBLE - Genetic Epidemiology 遗传流行病学 FROM A DNA STRAND TO A GWAS SKYLINE - PLUG THE NUMBERS AND JUDGE THE STUDY DESIGN. 墨尔本大学 POPH90111 · 双语视觉精读 · 在线开卷作业占 90% · 遗传率/GWAS/孟德尔随机化/ 筛查:计算 + 研究设计评价工具箱 POPH90111 . UNIVERSITY OF MELBOURNE 中英双语版 · BILINGUAL EDITION 英文主讲,中文随行 一 考试要点与术语保留英文原词 There is no closed-book exam in this subject. Your whole mark is three take-home pieces - a 10% online MCQ and two written assignments worth 90% combined - and all three are built from the same three task types: calculate, interpret, and critically appraise the study design. This book is therefore a calculation + appraisal toolkit: every method shown on real course numbers, every design tagged with its signature bias. Independent study companion. Not affiliated with or endorsed by the University of Melbourne. Corrections: takedowns@asksia. ai PREFACE - HOW TO USE THIS BOOK Plug the numbers, judge the design 代入数字,评判设计 Open-book and take-home - so it tests whether you can DO the method and APPRAISE it 开卷且带回家作答,因此考的是你能否动手做方法并加以评价 This is not a transcript of the lecture slides. It is a self-contained toolkit for every quantity and every study design POPH90111 examines - each concept opens with a diagram, states the formula plainly, works it on the course's own example numbers, gives a memorable analogy, then flags the assignment trap. Because all three assessments are open-book and take-home, the examiner cannot reward what you merely remember. Marks come from three things only: can you calculate it, can you interpret the output, and can you critically appraise the design that produced it? - - 这不是讲座幻灯片的逐字稿,而是一份自足的工具箱,涵盖 POPH90111 所考的每个量和每种研究设计 -- 每个概念都以一幅 图开篇,明白地写出公式,用课程自带的例题数字加以演算,给出一个易记的类比,再标出作业陷阱。由于三项评估全部开卷 且带回家作答,考官无法奖励你单纯记住的东西。分数仅来自三件事:你能否算出它、能否解释输出、能否批判性地评价产生 它的设计?[4]Source: asksia-bible-poph90111-bilingual.pdfModules 4-8 · multi- week window Penetrance-from-OR, NNT, NNS Plug real incidence & ARR, get the number The list of study designs Name the signature bias & appraise it POPH90111 . Genetic Epidemiology ★ The format - open-book, calculation + appraisal 考试形式 -- 开卷,计算+评价 Nothing here is invigilated. The MCQ is a 30-minute LMS quiz of 10 questions, completable any time within one week of release, covering the Introduction + Module 1 (scope of genetic epi, the role of familial aggregation, the designs that measure it, and calculating & interpreting aggregation measures). The assignments are written documents submitted online over a multi-week window. Because the book is open, no fact is worth a mark on its own - only the method, the interpretation, and the appraisal are. 此处无任何监考。MCQ 是一份10题、30分钟的 LMS 测验,在发布后一周内任意时间可完成,涵盖引 言+模块 1(遗传流行病学的范畴、家族聚集性的作 用、测量它的设计,以及计算与解读聚集性测度)。两 份作业为多周窗口内在线提交的书面文档。由于本书 是开卷的,没有哪个事实本身值分 -- 只有方法、解 读和评价才值分。 ✓ The strategy this dictates 由此决定的策略 Every question is a chain: calculate - interpret ++ appraise. The recurring chains are - 2×2 - OR/RR -+ 'evidence for not proof of'; rMz, roz - Falconer h2 -+ equal-environments caveat; OR + population incidence - penetrance - ascertainment-bias check; ARR - NNT - NNS - target high-risk. For every design write its measure, its one strength and its one characteristic bias - LO5 (appraisal) threads through all 8 modules and is where the easy marks live. 每道题都是一条链:计算→解读→评价。反复出现 的链有 -- 2×2→OR/RR→“是证据而非证明”; 「MZ、『Dz→ Falconer h2→等环境注意事项;OR+ 人群发病率→ 外显率→ 确认偏倚核查;ARR → NNT → NNS → 靶向高风险。对每一个设计都写出它 的测度、它的一个优势和它的一个特征性偏倚 -- LO5(评价)贯穿全部8个模块,正是容易得分之 处。 POPH90111 . Genetic Epidemiology CONTENTS - CONTENTS The genetic-epi pipeline, end to end 遗传流行病学全流程,从头到尾 Four questions - understand, discover, characterise, use - one toolkit 四个问题 -- 理解、发现、刻画、应用一一共用一套工具 D1 The genetic-epidemiology pipeline: which method answers which question UNDERSTANDING is there a genetic role? DISCOVERING which genes / causes? CHARACTERISING how risky? USING IN PRACTICE should we screen / act? Familial aggregation GWAS / association Penetrance (M5) Screening (M8) Twin studies[17]Source: asksia-cheatsheet-poph90111.pdfPOPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 1 OF 2 Foundations -> heritability -> association SIDE 1/2 UNDERSTAND . Genetics primer . Hardy-Weinberg . LD . Familial aggregation (OR/RR/SMR/A) . Heritability (Falconer, ACE, liability) . Association & GWAS 0 . How To Use This READ FIRST * This subject is a pipeline: UNDERSTAND (is there a genetic role? - aggregation, heritability) -> DISCOVER (which variants? - LD, GWAS, MR) -> CHARACTERISE (how risky? - penetrance, modifiers, GxE) > USE (screening). Side 1 = understand & discover; side 2 = characterise & use. Assessment shape: online MCQ 10% (10 Qs, 1-week window) + written A1 40% (Modules 1-3) + A2 50% (Modules 4-8). All online / take-home - no invigilated exam. Every assignment task is one of three: (a) calculate + interpret , (b) discuss findings, (c) critically appraise a design. So the two high-value moves are: plug the right formula, then judge the design's bias. A Stata . do file is even handed out for A1 Q1 - expect software-based calculation, then a written interpretation. SIA > The mantra that earns marks everywhere: aggregation / high MZ-vs-DZ correlation is "evidence for, but not proof of, an inherited genetic aetiology. " Say it whenever you interpret aggregation or heritability. 1 . Genetics Primer EXTRA-MODULE 1 Locus = position on a chromosome. Allele = the base(s) there; minor allele = rarer one. Genotype = the pair (e. g. TT, TC, CC); homo- vs hetero-zygous. Mutation classes: silent (usually benign), missense (changes amino acid), nonsense (premature stop), frameshift indels (corrupt every downstream codon > usually pathogenic). CNV = larger gain/loss. 2 · Modes of Inheritance RISK INEQUALITIES Defined on Pr(phenotype | # risk alleles), not on "having" the trait: AUTOSOMAL Dominant: Pr(2)=Pr(1) > Pr(0) Recessive: Pr(2) > Pr(1)=Pr(0) Codominant: Pr(2) > Pr(1) > Pr(0) Carrier risk can be <1 (incomplete penetrance) and non-carrier risk >0 (phenocopies/sporadic). So a dominant variant can still have penetrance below 100%. SEGREGATION (PUNNETT) Each parent passes one randomly-chosen allele. Aaxaa -> 1/2 Aa, 1/2 aa (no AA). AaxAa -> 1/4 AA, 1/2 Aa, 1/4 aa => P(child carries ≥1 A)=3/4, P(AA)=1/4. Trap: the genotype gives the expected probability distribution, not the realised counts in a small sibship. 2b · Germline vs Somatic SAMPLE CHOICE · Inherited colorectal-cancer family risk -> germline > sample blood / buccal swab Tumour responds differently to chemo, no family history -> somatic > sample the tumour biopsy[20]Source: asksia-cheatsheet-poph90111.pdfSIA > Show the working: in this subject the marks live in the setup and the interpretation, not the final digit. Always write the formula, the substitution, then one sentence of meaning. asksia. ai/cheatsheet/ unimelb-poph90111 . side 2/2 AskSia CHEATSHEET SERIES Disease screening = a systematic test to find asymptomatic disease/precursors in people not seeking care. Genetic screening = find risk-raising variants in asymptomatic people so risk can be reduced/prevented; can be population-wide but is usually targeted to high-prior-risk groups (e. g. strong family history). Course twist: a genetic test can be "once-and-for-all" (your germline doesn't change), unlike repeated disease screening. Two uses: screen for genetic risk, or use a genetic factor to screen for disease. 30 . Wilson-Jungner WHO 1968 The screening-evaluation checklist (the course adapts all 10 to genetics): · Facilities for diagnosis & treatment exist Trap: "we can test" # "we should screen. " A test only helps if knowing the result reduces disease/disability/death and benefits beat harms (psychological, social, insurance, variants of unknown significance, false positives). Example genes the course uses: BRCA1/2, the mismatch-repair (MMR) genes, HTT - note HTT (Huntington) has no risk-reduction, which weakens the case for screening. 31 . NNT & NNS QUANTIFY BENEFIT ARR = carrier risk x proportion risk reduced NNT = 1 / ARR NNS = NNT / carrier frequency Worked (BRCA1/2): carrier breast-cancer risk to 70 = 0. 4; tamoxifen cuts risk 50% => ARR = 0. 4×0. 5 = 0. 2 => NNT = 1/0. 2 = 5 carriers treated to prevent one cancer. Carrier freq 0. 0067 (1 in 150) = NNS = 5/0. 0067 = 746 screened per cancer prevented. High-FH group (carrier freq 0. 25) = NNS = 5/0. 25 = 20 - far more efficient => justifies targeted screening. CARRIER FREQ NNT NNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH)
- 小提示:如果你愿意,我也可以把你“最弱的模块/常错点”记住,之后每次来我都按你的薄弱点带你复习。
-
0)先把“考试/评分结构”搞清楚(决定你该怎么复习)
- 这门课没有闭卷期末考试(no invigilated closed-book exam);成绩来自三件带回家/线上评估:
- MCQ 10%(开卷):30 分钟、10 题、发布后一周内任意时间完成;内容主要是 Introduction + Module 1(家族聚集性等)[4]Source: asksia-bible-poph90111-bilingual.pdfModules 4-8 · multi- week window Penetrance-from-OR, NNT, NNS Plug real incidence & ARR, get the number The list of study designs Name the signature bias & appraise it POPH90111 . Genetic Epidemiology ★ The format - open-book, calculation + appraisal 考试形式 -- 开卷,计算+评价 Nothing here is invigilated. The MCQ is a 30-minute LMS quiz of 10 questions, completable any time within one week of release, covering the Introduction + Module 1 (scope of genetic epi, the role of familial aggregation, the designs that measure it, and calculating & interpreting aggregation measures). The assignments are written documents submitted online over a multi-week window. Because the book is open, no fact is worth a mark on its own - only the method, the interpretation, and the appraisal are. 此处无任何监考。MCQ 是一份10题、30分钟的 LMS 测验,在发布后一周内任意时间可完成,涵盖引 言+模块 1(遗传流行病学的范畴、家族聚集性的作 用、测量它的设计,以及计算与解读聚集性测度)。两 份作业为多周窗口内在线提交的书面文档。由于本书 是开卷的,没有哪个事实本身值分 -- 只有方法、解 读和评价才值分。 ✓ The strategy this dictates 由此决定的策略 Every question is a chain: calculate - interpret ++ appraise. The recurring chains are - 2×2 - OR/RR -+ 'evidence for not proof of'; rMz, roz - Falconer h2 -+ equal-environments caveat; OR + population incidence - penetrance - ascertainment-bias check; ARR - NNT - NNS - target high-risk. For every design write its measure, its one strength and its one characteristic bias - LO5 (appraisal) threads through all 8 modules and is where the easy marks live. 每道题都是一条链:计算→解读→评价。反复出现 的链有 -- 2×2→OR/RR→“是证据而非证明”; 「MZ、『Dz→ Falconer h2→等环境注意事项;OR+ 人群发病率→ 外显率→ 确认偏倚核查;ARR → NNT → NNS → 靶向高风险。对每一个设计都写出它 的测度、它的一个优势和它的一个特征性偏倚 -- LO5(评价)贯穿全部8个模块,正是容易得分之 处。 POPH90111 . Genetic Epidemiology CONTENTS - CONTENTS The genetic-epi pipeline, end to end 遗传流行病学全流程,从头到尾 Four questions - understand, discover, characterise, use - one toolkit 四个问题 -- 理解、发现、刻画、应用一一共用一套工具 D1 The genetic-epidemiology pipeline: which method answers which question UNDERSTANDING is there a genetic role? DISCOVERING which genes / causes? CHARACTERISING how risky? USING IN PRACTICE should we screen / act? Familial aggregation GWAS / association Penetrance (M5) Screening (M8) Twin studies
- Assignment 1 40%(M1–3):理解 + 发现的前半段(家族聚集性、遗传率、关联/GWAS)[3]Source: asksia-bible-poph90111-bilingual.pdfMCQ 10% (open-book) . Assignment 1 40% (M1-3) . Assignment 2 50% (M4-8) MCQ 占 10% (开卷)· 作业一占 40% (M1-3)· 作业二占 50% (M4-8) Your entire grade is online and take-home. There is no invigilated, closed-book exam. A small MCQ warms you up; the two written assignments carry 90% combined - and both are explicitly built from the same three task types. 你的全部成绩都是线上且带回家作答。没有监考的闭卷考试。一个小型MCQ 为你热身;两份书面作业合计占 90% -- 且两 者都明确围绕同样的三类任务搭建。 10% MCQ - OPEN-BOOK MCQ -- 开卷 40% ASSIGNMENT 1 . M1-3 作业 1 · M1-3 50% ASSIGNMENT 2 . M4-8 作业 2 · M4-8 TASK TYPES, EVERY Q 题型,每道题 The three assessment pieces What "open-book" means for you 三个评估部分 “开卷”对你意味着什么 Component Weight Covers / detail You can look up (don't cram) You must actually DO MCQ Exam - 30-min online quiz, 10 Qs 10% Intro + M1 · open-book, do it any time within 1 week The OR / RR / SMR / AR formulae Build the 2×2 & compute the estimate Assignment 1 (written, take-home) 40% Modules 1-3 · multi- week window Falconer h2 = 2(rMz-ľDz), ACE Decompose variance; interpret h2 Assignment 2 (written, take-home) 50%[9]Source: asksia-bible-poph90111-bilingual.pdf→ 9 Appraisal grid & practice design - measure - bias · interpretation drills → POPH90111 . Genetic Epidemiology i Why this order 为何采用这一顺序 We follow the course's own pipeline. UNDERSTAND (M1-M2) asks whether genes matter at all; DISCOVER (M3) finds the variants and (M4) tests causal exposures; CHARACTERISE (M5-M7) asks how risky a variant is and what modifies it; USE IN PRACTICE (M8) decides whether to screen. Assignment 1 covers M1-3, Assignment 2 covers M4-8 - so the chapter split mirrors exactly what each assignment will test. 我们遵循本课程自身的流水线。理解(M1-M2)追问基因究竟是否起作用;发现(M3)找出变异、(M4)检验因果暴 露;刻画(M5-M7)追问某变异风险有多大、由什么修饰;实践应用(M8)决定是否筛查。作业1 涵盖 M1-3,作业 2 涵盖 M4-8 -- 因此章节划分恰好对应每份作业将考查的内容。 POPH90111 . Genetic Epidemiology CH 1 . FOUNDATIONS - CHAPTER 1 . FOUNDATIONS EXTRA-MODULE 1 + MODULE 1 From DNA to family risk: the genetics you need first 从 DNA 到家族风险:先要掌握的遗传学 Genes vs environment . alleles, genotypes, haplotypes . modes of inheritance 基因 vs 环境 · 等位基因、基因型、单体型 · 遗传模式 Genetic epidemiology asks: do genes contribute to who gets a disease? Before you can measure that you need a working vocabulary - what a variant, genotype and haplotype are, and how disease risk depends on how many risk alleles a person carries. This chapter is the primer (Extra-Module 1) plus the first real method - familial aggregation (Module 1), the question of whether disease clusters in families more than chance allows. 遗传流行病学发问:基因是否对谁会患病有所贡献?在能够衡量这一点之前,你需要一套可用的词汇 -- 变异、基因型和单体 型是什么,以及疾病风险如何取决于一个人携带的风险等位基因数目。本章是入门导引(Extra-Module 1)外加第一个真正 的方法 -- 家族聚集性(Module 1),即探究疾病在家庭中聚集的程度是否超过偶然所允许的范围。 ★ What the MCQ + Assignment 1 ask here 此处 MCQ+作业一考什么 The open-book MCQ (10%) is built from exactly this material: the scope of genetic epidemiology, the role of familial aggregation, the study designs that measure it, and calculating & interpreting aggregation measures. Assignment 1 then asks you to build a 2x2, compute an OR or RR/SMR, interpret it, and appraise the design's bias. Master the calculation and the one-sentence interpretation - both score. 开卷 MCQ (10%)恰由这部分材料构成:遗传流行病学的范畴、家族聚集性的作用、测量它的研究设计,以及计算与解 读聚集性测度。作业1随后要求你构建 2×2 表,计算 OR 或 RR/SMR,解读它,并评价该设计的偏倚。计算和那句一 句话解读都要掌握- -两者都计分。 1. 1 Genes vs environment - why families resemble each other 1. 1 基因 vs 环境 -- 家庭成员为何彼此相像 Susceptibility to complex disease comes from both genes and environment (that is Learning Outcome 1). Relatives resemble each other for two reasons that are hopelessly tangled: they share DNA (closer kin share more) and they share an environment (families eat, live and are exposed alike). That tangle is the central problem of the whole subject. 对复杂疾病的易感性同时来自基因与环境(即学习成果1)。亲属彼此相像有两个难解难分地纠缠在一起的原因:他们共享 DNA (越近的亲属共享得越多)且共享一个环境(家庭成员吃得、住得、所受暴露都相似)。这种纠缠正是整门学科的核心问 - 题。 POPH90111 . Genetic Epidemiology D2 Shared GENES Shared ENVIRONMENT[17]Source: asksia-cheatsheet-poph90111.pdfPOPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 1 OF 2 Foundations -> heritability -> association SIDE 1/2 UNDERSTAND . Genetics primer . Hardy-Weinberg . LD . Familial aggregation (OR/RR/SMR/A) . Heritability (Falconer, ACE, liability) . Association & GWAS 0 . How To Use This READ FIRST * This subject is a pipeline: UNDERSTAND (is there a genetic role? - aggregation, heritability) -> DISCOVER (which variants? - LD, GWAS, MR) -> CHARACTERISE (how risky? - penetrance, modifiers, GxE) > USE (screening). Side 1 = understand & discover; side 2 = characterise & use. Assessment shape: online MCQ 10% (10 Qs, 1-week window) + written A1 40% (Modules 1-3) + A2 50% (Modules 4-8). All online / take-home - no invigilated exam. Every assignment task is one of three: (a) calculate + interpret , (b) discuss findings, (c) critically appraise a design. So the two high-value moves are: plug the right formula, then judge the design's bias. A Stata . do file is even handed out for A1 Q1 - expect software-based calculation, then a written interpretation. SIA > The mantra that earns marks everywhere: aggregation / high MZ-vs-DZ correlation is "evidence for, but not proof of, an inherited genetic aetiology. " Say it whenever you interpret aggregation or heritability. 1 . Genetics Primer EXTRA-MODULE 1 Locus = position on a chromosome. Allele = the base(s) there; minor allele = rarer one. Genotype = the pair (e. g. TT, TC, CC); homo- vs hetero-zygous. Mutation classes: silent (usually benign), missense (changes amino acid), nonsense (premature stop), frameshift indels (corrupt every downstream codon > usually pathogenic). CNV = larger gain/loss. 2 · Modes of Inheritance RISK INEQUALITIES Defined on Pr(phenotype | # risk alleles), not on "having" the trait: AUTOSOMAL Dominant: Pr(2)=Pr(1) > Pr(0) Recessive: Pr(2) > Pr(1)=Pr(0) Codominant: Pr(2) > Pr(1) > Pr(0) Carrier risk can be <1 (incomplete penetrance) and non-carrier risk >0 (phenocopies/sporadic). So a dominant variant can still have penetrance below 100%. SEGREGATION (PUNNETT) Each parent passes one randomly-chosen allele. Aaxaa -> 1/2 Aa, 1/2 aa (no AA). AaxAa -> 1/4 AA, 1/2 Aa, 1/4 aa => P(child carries ≥1 A)=3/4, P(AA)=1/4. Trap: the genotype gives the expected probability distribution, not the realised counts in a small sibship. 2b · Germline vs Somatic SAMPLE CHOICE · Inherited colorectal-cancer family risk -> germline > sample blood / buccal swab Tumour responds differently to chemo, no family history -> somatic > sample the tumour biopsy
- Assignment 2 50%(M4–8):MR(因果)、外显率/修饰因子、G×E、筛查与评价 [3]Source: asksia-bible-poph90111-bilingual.pdfMCQ 10% (open-book) . Assignment 1 40% (M1-3) . Assignment 2 50% (M4-8) MCQ 占 10% (开卷)· 作业一占 40% (M1-3)· 作业二占 50% (M4-8) Your entire grade is online and take-home. There is no invigilated, closed-book exam. A small MCQ warms you up; the two written assignments carry 90% combined - and both are explicitly built from the same three task types. 你的全部成绩都是线上且带回家作答。没有监考的闭卷考试。一个小型MCQ 为你热身;两份书面作业合计占 90% -- 且两 者都明确围绕同样的三类任务搭建。 10% MCQ - OPEN-BOOK MCQ -- 开卷 40% ASSIGNMENT 1 . M1-3 作业 1 · M1-3 50% ASSIGNMENT 2 . M4-8 作业 2 · M4-8 TASK TYPES, EVERY Q 题型,每道题 The three assessment pieces What "open-book" means for you 三个评估部分 “开卷”对你意味着什么 Component Weight Covers / detail You can look up (don't cram) You must actually DO MCQ Exam - 30-min online quiz, 10 Qs 10% Intro + M1 · open-book, do it any time within 1 week The OR / RR / SMR / AR formulae Build the 2×2 & compute the estimate Assignment 1 (written, take-home) 40% Modules 1-3 · multi- week window Falconer h2 = 2(rMz-ľDz), ACE Decompose variance; interpret h2 Assignment 2 (written, take-home) 50%[4]Source: asksia-bible-poph90111-bilingual.pdfModules 4-8 · multi- week window Penetrance-from-OR, NNT, NNS Plug real incidence & ARR, get the number The list of study designs Name the signature bias & appraise it POPH90111 . Genetic Epidemiology ★ The format - open-book, calculation + appraisal 考试形式 -- 开卷,计算+评价 Nothing here is invigilated. The MCQ is a 30-minute LMS quiz of 10 questions, completable any time within one week of release, covering the Introduction + Module 1 (scope of genetic epi, the role of familial aggregation, the designs that measure it, and calculating & interpreting aggregation measures). The assignments are written documents submitted online over a multi-week window. Because the book is open, no fact is worth a mark on its own - only the method, the interpretation, and the appraisal are. 此处无任何监考。MCQ 是一份10题、30分钟的 LMS 测验,在发布后一周内任意时间可完成,涵盖引 言+模块 1(遗传流行病学的范畴、家族聚集性的作 用、测量它的设计,以及计算与解读聚集性测度)。两 份作业为多周窗口内在线提交的书面文档。由于本书 是开卷的,没有哪个事实本身值分 -- 只有方法、解 读和评价才值分。 ✓ The strategy this dictates 由此决定的策略 Every question is a chain: calculate - interpret ++ appraise. The recurring chains are - 2×2 - OR/RR -+ 'evidence for not proof of'; rMz, roz - Falconer h2 -+ equal-environments caveat; OR + population incidence - penetrance - ascertainment-bias check; ARR - NNT - NNS - target high-risk. For every design write its measure, its one strength and its one characteristic bias - LO5 (appraisal) threads through all 8 modules and is where the easy marks live. 每道题都是一条链:计算→解读→评价。反复出现 的链有 -- 2×2→OR/RR→“是证据而非证明”; 「MZ、『Dz→ Falconer h2→等环境注意事项;OR+ 人群发病率→ 外显率→ 确认偏倚核查;ARR → NNT → NNS → 靶向高风险。对每一个设计都写出它 的测度、它的一个优势和它的一个特征性偏倚 -- LO5(评价)贯穿全部8个模块,正是容易得分之 处。 POPH90111 . Genetic Epidemiology CONTENTS - CONTENTS The genetic-epi pipeline, end to end 遗传流行病学全流程,从头到尾 Four questions - understand, discover, characterise, use - one toolkit 四个问题 -- 理解、发现、刻画、应用一一共用一套工具 D1 The genetic-epidemiology pipeline: which method answers which question UNDERSTANDING is there a genetic role? DISCOVERING which genes / causes? CHARACTERISING how risky? USING IN PRACTICE should we screen / act? Familial aggregation GWAS / association Penetrance (M5) Screening (M8) Twin studies[9]Source: asksia-bible-poph90111-bilingual.pdf→ 9 Appraisal grid & practice design - measure - bias · interpretation drills → POPH90111 . Genetic Epidemiology i Why this order 为何采用这一顺序 We follow the course's own pipeline. UNDERSTAND (M1-M2) asks whether genes matter at all; DISCOVER (M3) finds the variants and (M4) tests causal exposures; CHARACTERISE (M5-M7) asks how risky a variant is and what modifies it; USE IN PRACTICE (M8) decides whether to screen. Assignment 1 covers M1-3, Assignment 2 covers M4-8 - so the chapter split mirrors exactly what each assignment will test. 我们遵循本课程自身的流水线。理解(M1-M2)追问基因究竟是否起作用;发现(M3)找出变异、(M4)检验因果暴 露;刻画(M5-M7)追问某变异风险有多大、由什么修饰;实践应用(M8)决定是否筛查。作业1 涵盖 M1-3,作业 2 涵盖 M4-8 -- 因此章节划分恰好对应每份作业将考查的内容。 POPH90111 . Genetic Epidemiology CH 1 . FOUNDATIONS - CHAPTER 1 . FOUNDATIONS EXTRA-MODULE 1 + MODULE 1 From DNA to family risk: the genetics you need first 从 DNA 到家族风险:先要掌握的遗传学 Genes vs environment . alleles, genotypes, haplotypes . modes of inheritance 基因 vs 环境 · 等位基因、基因型、单体型 · 遗传模式 Genetic epidemiology asks: do genes contribute to who gets a disease? Before you can measure that you need a working vocabulary - what a variant, genotype and haplotype are, and how disease risk depends on how many risk alleles a person carries. This chapter is the primer (Extra-Module 1) plus the first real method - familial aggregation (Module 1), the question of whether disease clusters in families more than chance allows. 遗传流行病学发问:基因是否对谁会患病有所贡献?在能够衡量这一点之前,你需要一套可用的词汇 -- 变异、基因型和单体 型是什么,以及疾病风险如何取决于一个人携带的风险等位基因数目。本章是入门导引(Extra-Module 1)外加第一个真正 的方法 -- 家族聚集性(Module 1),即探究疾病在家庭中聚集的程度是否超过偶然所允许的范围。 ★ What the MCQ + Assignment 1 ask here 此处 MCQ+作业一考什么 The open-book MCQ (10%) is built from exactly this material: the scope of genetic epidemiology, the role of familial aggregation, the study designs that measure it, and calculating & interpreting aggregation measures. Assignment 1 then asks you to build a 2x2, compute an OR or RR/SMR, interpret it, and appraise the design's bias. Master the calculation and the one-sentence interpretation - both score. 开卷 MCQ (10%)恰由这部分材料构成:遗传流行病学的范畴、家族聚集性的作用、测量它的研究设计,以及计算与解 读聚集性测度。作业1随后要求你构建 2×2 表,计算 OR 或 RR/SMR,解读它,并评价该设计的偏倚。计算和那句一 句话解读都要掌握- -两者都计分。 1. 1 Genes vs environment - why families resemble each other 1. 1 基因 vs 环境 -- 家庭成员为何彼此相像 Susceptibility to complex disease comes from both genes and environment (that is Learning Outcome 1). Relatives resemble each other for two reasons that are hopelessly tangled: they share DNA (closer kin share more) and they share an environment (families eat, live and are exposed alike). That tangle is the central problem of the whole subject. 对复杂疾病的易感性同时来自基因与环境(即学习成果1)。亲属彼此相像有两个难解难分地纠缠在一起的原因:他们共享 DNA (越近的亲属共享得越多)且共享一个环境(家庭成员吃得、住得、所受暴露都相似)。这种纠缠正是整门学科的核心问 - 题。 POPH90111 . Genetic Epidemiology D2 Shared GENES Shared ENVIRONMENT
- 你复习的“正确策略”:
- 开卷意味着:能查到公式不值钱;值钱的是:你会不会搭 2×2、会不会代入、会不会写出正确的一句解读 + 一句评价。[2]Source: asksia-bible-poph90111-bilingual.pdf! Critical appraisal - a good test can still be useless for screening 批判性评价 -- 再好的检验也可能不适合筛查 Sensitivity and specificity are intrinsic, but PPV depends on prevalence. A test with excellent sensitivity/specificity can be near-useless in a low-prevalence general population (PPV near zero) - exactly why genetic screening targets high-prior-risk groups. The ROC/AUC summarises discrimination across cut-offs; moving the cut-off trades sensitivity against specificity but never changes prevalence. 灵敏度和特异度是内在的,但PPV 取决于患病率。一个灵敏度/特异度极佳的检验,在低患病率的一般人群中可能近乎 无用(PPV 接近零) -- 这正是遗传筛查靶向高先验风险人群的原因。ROC/AUC 概括跨切点的区分度;移动切点是在 灵敏度与特异度之间权衡,但从不改变患病率。 Same habit on every item: calculate, interpret, appraise. The number alone is a fraction of the marks - the interpretation sentence and the design critique (bias, assumption, scale, prevalence) are where POPH90111 assessment lives. Write the appraisal sentence even when the question only asks for the number. 每道题都保持同一习惯:计算、解读、评价。光有数字只占一小部分分数 -- 解读句和设计批判(偏倚、假设、尺度、 患病率)才是 POPH90111 评估的所在。即便题目只问数字,也写出那句评价。 MARKER'S NOTE . POPH90111 . CALCULATE + INTERPRET + APPRAISE POPH90111 . Genetic Epidemiology AskSia Library VISUAL STUDY BIBLE . ASKSIA POP. & GLOBAL HEALTH SEMESTER 1 . 2026 p < 5x10-8 - THE COMPLETE STUDY BIBLE - Genetic Epidemiology 遗传流行病学 FROM A DNA STRAND TO A GWAS SKYLINE - PLUG THE NUMBERS AND JUDGE THE STUDY DESIGN. 墨尔本大学 POPH90111 · 双语视觉精读 · 在线开卷作业占 90% · 遗传率/GWAS/孟德尔随机化/ 筛查:计算 + 研究设计评价工具箱 POPH90111 . UNIVERSITY OF MELBOURNE 中英双语版 · BILINGUAL EDITION 英文主讲,中文随行 一 考试要点与术语保留英文原词 There is no closed-book exam in this subject. Your whole mark is three take-home pieces - a 10% online MCQ and two written assignments worth 90% combined - and all three are built from the same three task types: calculate, interpret, and critically appraise the study design. This book is therefore a calculation + appraisal toolkit: every method shown on real course numbers, every design tagged with its signature bias. Independent study companion. Not affiliated with or endorsed by the University of Melbourne. Corrections: takedowns@asksia. ai PREFACE - HOW TO USE THIS BOOK Plug the numbers, judge the design 代入数字,评判设计 Open-book and take-home - so it tests whether you can DO the method and APPRAISE it 开卷且带回家作答,因此考的是你能否动手做方法并加以评价 This is not a transcript of the lecture slides. It is a self-contained toolkit for every quantity and every study design POPH90111 examines - each concept opens with a diagram, states the formula plainly, works it on the course's own example numbers, gives a memorable analogy, then flags the assignment trap. Because all three assessments are open-book and take-home, the examiner cannot reward what you merely remember. Marks come from three things only: can you calculate it, can you interpret the output, and can you critically appraise the design that produced it? - - 这不是讲座幻灯片的逐字稿,而是一份自足的工具箱,涵盖 POPH90111 所考的每个量和每种研究设计 -- 每个概念都以一幅 图开篇,明白地写出公式,用课程自带的例题数字加以演算,给出一个易记的类比,再标出作业陷阱。由于三项评估全部开卷 且带回家作答,考官无法奖励你单纯记住的东西。分数仅来自三件事:你能否算出它、能否解释输出、能否批判性地评价产生 它的设计?[4]Source: asksia-bible-poph90111-bilingual.pdfModules 4-8 · multi- week window Penetrance-from-OR, NNT, NNS Plug real incidence & ARR, get the number The list of study designs Name the signature bias & appraise it POPH90111 . Genetic Epidemiology ★ The format - open-book, calculation + appraisal 考试形式 -- 开卷,计算+评价 Nothing here is invigilated. The MCQ is a 30-minute LMS quiz of 10 questions, completable any time within one week of release, covering the Introduction + Module 1 (scope of genetic epi, the role of familial aggregation, the designs that measure it, and calculating & interpreting aggregation measures). The assignments are written documents submitted online over a multi-week window. Because the book is open, no fact is worth a mark on its own - only the method, the interpretation, and the appraisal are. 此处无任何监考。MCQ 是一份10题、30分钟的 LMS 测验,在发布后一周内任意时间可完成,涵盖引 言+模块 1(遗传流行病学的范畴、家族聚集性的作 用、测量它的设计,以及计算与解读聚集性测度)。两 份作业为多周窗口内在线提交的书面文档。由于本书 是开卷的,没有哪个事实本身值分 -- 只有方法、解 读和评价才值分。 ✓ The strategy this dictates 由此决定的策略 Every question is a chain: calculate - interpret ++ appraise. The recurring chains are - 2×2 - OR/RR -+ 'evidence for not proof of'; rMz, roz - Falconer h2 -+ equal-environments caveat; OR + population incidence - penetrance - ascertainment-bias check; ARR - NNT - NNS - target high-risk. For every design write its measure, its one strength and its one characteristic bias - LO5 (appraisal) threads through all 8 modules and is where the easy marks live. 每道题都是一条链:计算→解读→评价。反复出现 的链有 -- 2×2→OR/RR→“是证据而非证明”; 「MZ、『Dz→ Falconer h2→等环境注意事项;OR+ 人群发病率→ 外显率→ 确认偏倚核查;ARR → NNT → NNS → 靶向高风险。对每一个设计都写出它 的测度、它的一个优势和它的一个特征性偏倚 -- LO5(评价)贯穿全部8个模块,正是容易得分之 处。 POPH90111 . Genetic Epidemiology CONTENTS - CONTENTS The genetic-epi pipeline, end to end 遗传流行病学全流程,从头到尾 Four questions - understand, discover, characterise, use - one toolkit 四个问题 -- 理解、发现、刻画、应用一一共用一套工具 D1 The genetic-epidemiology pipeline: which method answers which question UNDERSTANDING is there a genetic role? DISCOVERING which genes / causes? CHARACTERISING how risky? USING IN PRACTICE should we screen / act? Familial aggregation GWAS / association Penetrance (M5) Screening (M8) Twin studies[17]Source: asksia-cheatsheet-poph90111.pdfPOPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 1 OF 2 Foundations -> heritability -> association SIDE 1/2 UNDERSTAND . Genetics primer . Hardy-Weinberg . LD . Familial aggregation (OR/RR/SMR/A) . Heritability (Falconer, ACE, liability) . Association & GWAS 0 . How To Use This READ FIRST * This subject is a pipeline: UNDERSTAND (is there a genetic role? - aggregation, heritability) -> DISCOVER (which variants? - LD, GWAS, MR) -> CHARACTERISE (how risky? - penetrance, modifiers, GxE) > USE (screening). Side 1 = understand & discover; side 2 = characterise & use. Assessment shape: online MCQ 10% (10 Qs, 1-week window) + written A1 40% (Modules 1-3) + A2 50% (Modules 4-8). All online / take-home - no invigilated exam. Every assignment task is one of three: (a) calculate + interpret , (b) discuss findings, (c) critically appraise a design. So the two high-value moves are: plug the right formula, then judge the design's bias. A Stata . do file is even handed out for A1 Q1 - expect software-based calculation, then a written interpretation. SIA > The mantra that earns marks everywhere: aggregation / high MZ-vs-DZ correlation is "evidence for, but not proof of, an inherited genetic aetiology. " Say it whenever you interpret aggregation or heritability. 1 . Genetics Primer EXTRA-MODULE 1 Locus = position on a chromosome. Allele = the base(s) there; minor allele = rarer one. Genotype = the pair (e. g. TT, TC, CC); homo- vs hetero-zygous. Mutation classes: silent (usually benign), missense (changes amino acid), nonsense (premature stop), frameshift indels (corrupt every downstream codon > usually pathogenic). CNV = larger gain/loss. 2 · Modes of Inheritance RISK INEQUALITIES Defined on Pr(phenotype | # risk alleles), not on "having" the trait: AUTOSOMAL Dominant: Pr(2)=Pr(1) > Pr(0) Recessive: Pr(2) > Pr(1)=Pr(0) Codominant: Pr(2) > Pr(1) > Pr(0) Carrier risk can be <1 (incomplete penetrance) and non-carrier risk >0 (phenocopies/sporadic). So a dominant variant can still have penetrance below 100%. SEGREGATION (PUNNETT) Each parent passes one randomly-chosen allele. Aaxaa -> 1/2 Aa, 1/2 aa (no AA). AaxAa -> 1/4 AA, 1/2 Aa, 1/4 aa => P(child carries ≥1 A)=3/4, P(AA)=1/4. Trap: the genotype gives the expected probability distribution, not the realised counts in a small sibship. 2b · Germline vs Somatic SAMPLE CHOICE · Inherited colorectal-cancer family risk -> germline > sample blood / buccal swab Tumour responds differently to chemo, no family history -> somatic > sample the tumour biopsy
- 你要训练的是“链条题模板”(下面我会给你每条链条的:公式+定义+常见陷阱+评价句)。[4]Source: asksia-bible-poph90111-bilingual.pdfModules 4-8 · multi- week window Penetrance-from-OR, NNT, NNS Plug real incidence & ARR, get the number The list of study designs Name the signature bias & appraise it POPH90111 . Genetic Epidemiology ★ The format - open-book, calculation + appraisal 考试形式 -- 开卷,计算+评价 Nothing here is invigilated. The MCQ is a 30-minute LMS quiz of 10 questions, completable any time within one week of release, covering the Introduction + Module 1 (scope of genetic epi, the role of familial aggregation, the designs that measure it, and calculating & interpreting aggregation measures). The assignments are written documents submitted online over a multi-week window. Because the book is open, no fact is worth a mark on its own - only the method, the interpretation, and the appraisal are. 此处无任何监考。MCQ 是一份10题、30分钟的 LMS 测验,在发布后一周内任意时间可完成,涵盖引 言+模块 1(遗传流行病学的范畴、家族聚集性的作 用、测量它的设计,以及计算与解读聚集性测度)。两 份作业为多周窗口内在线提交的书面文档。由于本书 是开卷的,没有哪个事实本身值分 -- 只有方法、解 读和评价才值分。 ✓ The strategy this dictates 由此决定的策略 Every question is a chain: calculate - interpret ++ appraise. The recurring chains are - 2×2 - OR/RR -+ 'evidence for not proof of'; rMz, roz - Falconer h2 -+ equal-environments caveat; OR + population incidence - penetrance - ascertainment-bias check; ARR - NNT - NNS - target high-risk. For every design write its measure, its one strength and its one characteristic bias - LO5 (appraisal) threads through all 8 modules and is where the easy marks live. 每道题都是一条链:计算→解读→评价。反复出现 的链有 -- 2×2→OR/RR→“是证据而非证明”; 「MZ、『Dz→ Falconer h2→等环境注意事项;OR+ 人群发病率→ 外显率→ 确认偏倚核查;ARR → NNT → NNS → 靶向高风险。对每一个设计都写出它 的测度、它的一个优势和它的一个特征性偏倚 -- LO5(评价)贯穿全部8个模块,正是容易得分之 处。 POPH90111 . Genetic Epidemiology CONTENTS - CONTENTS The genetic-epi pipeline, end to end 遗传流行病学全流程,从头到尾 Four questions - understand, discover, characterise, use - one toolkit 四个问题 -- 理解、发现、刻画、应用一一共用一套工具 D1 The genetic-epidemiology pipeline: which method answers which question UNDERSTANDING is there a genetic role? DISCOVERING which genes / causes? CHARACTERISING how risky? USING IN PRACTICE should we screen / act? Familial aggregation GWAS / association Penetrance (M5) Screening (M8) Twin studies
-
1)整门课的“高频得分链条”(你要练到闭眼能写)
-
链条 A:家族聚集性(Familial aggregation)——2×2 → OR/RR/SMR → 经典评价句
-
你要会做什么:
- 建 2×2 表(暴露/家族史 vs 患病)→算 OR(常见于病例对照)→解释是否存在聚集性→写“证据但非证明”。[4]Source: asksia-bible-poph90111-bilingual.pdfModules 4-8 · multi- week window Penetrance-from-OR, NNT, NNS Plug real incidence & ARR, get the number The list of study designs Name the signature bias & appraise it POPH90111 . Genetic Epidemiology ★ The format - open-book, calculation + appraisal 考试形式 -- 开卷,计算+评价 Nothing here is invigilated. The MCQ is a 30-minute LMS quiz of 10 questions, completable any time within one week of release, covering the Introduction + Module 1 (scope of genetic epi, the role of familial aggregation, the designs that measure it, and calculating & interpreting aggregation measures). The assignments are written documents submitted online over a multi-week window. Because the book is open, no fact is worth a mark on its own - only the method, the interpretation, and the appraisal are. 此处无任何监考。MCQ 是一份10题、30分钟的 LMS 测验,在发布后一周内任意时间可完成,涵盖引 言+模块 1(遗传流行病学的范畴、家族聚集性的作 用、测量它的设计,以及计算与解读聚集性测度)。两 份作业为多周窗口内在线提交的书面文档。由于本书 是开卷的,没有哪个事实本身值分 -- 只有方法、解 读和评价才值分。 ✓ The strategy this dictates 由此决定的策略 Every question is a chain: calculate - interpret ++ appraise. The recurring chains are - 2×2 - OR/RR -+ 'evidence for not proof of'; rMz, roz - Falconer h2 -+ equal-environments caveat; OR + population incidence - penetrance - ascertainment-bias check; ARR - NNT - NNS - target high-risk. For every design write its measure, its one strength and its one characteristic bias - LO5 (appraisal) threads through all 8 modules and is where the easy marks live. 每道题都是一条链:计算→解读→评价。反复出现 的链有 -- 2×2→OR/RR→“是证据而非证明”; 「MZ、『Dz→ Falconer h2→等环境注意事项;OR+ 人群发病率→ 外显率→ 确认偏倚核查;ARR → NNT → NNS → 靶向高风险。对每一个设计都写出它 的测度、它的一个优势和它的一个特征性偏倚 -- LO5(评价)贯穿全部8个模块,正是容易得分之 处。 POPH90111 . Genetic Epidemiology CONTENTS - CONTENTS The genetic-epi pipeline, end to end 遗传流行病学全流程,从头到尾 Four questions - understand, discover, characterise, use - one toolkit 四个问题 -- 理解、发现、刻画、应用一一共用一套工具 D1 The genetic-epidemiology pipeline: which method answers which question UNDERSTANDING is there a genetic role? DISCOVERING which genes / causes? CHARACTERISING how risky? USING IN PRACTICE should we screen / act? Familial aggregation GWAS / association Penetrance (M5) Screening (M8) Twin studies[9]Source: asksia-bible-poph90111-bilingual.pdf→ 9 Appraisal grid & practice design - measure - bias · interpretation drills → POPH90111 . Genetic Epidemiology i Why this order 为何采用这一顺序 We follow the course's own pipeline. UNDERSTAND (M1-M2) asks whether genes matter at all; DISCOVER (M3) finds the variants and (M4) tests causal exposures; CHARACTERISE (M5-M7) asks how risky a variant is and what modifies it; USE IN PRACTICE (M8) decides whether to screen. Assignment 1 covers M1-3, Assignment 2 covers M4-8 - so the chapter split mirrors exactly what each assignment will test. 我们遵循本课程自身的流水线。理解(M1-M2)追问基因究竟是否起作用;发现(M3)找出变异、(M4)检验因果暴 露;刻画(M5-M7)追问某变异风险有多大、由什么修饰;实践应用(M8)决定是否筛查。作业1 涵盖 M1-3,作业 2 涵盖 M4-8 -- 因此章节划分恰好对应每份作业将考查的内容。 POPH90111 . Genetic Epidemiology CH 1 . FOUNDATIONS - CHAPTER 1 . FOUNDATIONS EXTRA-MODULE 1 + MODULE 1 From DNA to family risk: the genetics you need first 从 DNA 到家族风险:先要掌握的遗传学 Genes vs environment . alleles, genotypes, haplotypes . modes of inheritance 基因 vs 环境 · 等位基因、基因型、单体型 · 遗传模式 Genetic epidemiology asks: do genes contribute to who gets a disease? Before you can measure that you need a working vocabulary - what a variant, genotype and haplotype are, and how disease risk depends on how many risk alleles a person carries. This chapter is the primer (Extra-Module 1) plus the first real method - familial aggregation (Module 1), the question of whether disease clusters in families more than chance allows. 遗传流行病学发问:基因是否对谁会患病有所贡献?在能够衡量这一点之前,你需要一套可用的词汇 -- 变异、基因型和单体 型是什么,以及疾病风险如何取决于一个人携带的风险等位基因数目。本章是入门导引(Extra-Module 1)外加第一个真正 的方法 -- 家族聚集性(Module 1),即探究疾病在家庭中聚集的程度是否超过偶然所允许的范围。 ★ What the MCQ + Assignment 1 ask here 此处 MCQ+作业一考什么 The open-book MCQ (10%) is built from exactly this material: the scope of genetic epidemiology, the role of familial aggregation, the study designs that measure it, and calculating & interpreting aggregation measures. Assignment 1 then asks you to build a 2x2, compute an OR or RR/SMR, interpret it, and appraise the design's bias. Master the calculation and the one-sentence interpretation - both score. 开卷 MCQ (10%)恰由这部分材料构成:遗传流行病学的范畴、家族聚集性的作用、测量它的研究设计,以及计算与解 读聚集性测度。作业1随后要求你构建 2×2 表,计算 OR 或 RR/SMR,解读它,并评价该设计的偏倚。计算和那句一 句话解读都要掌握- -两者都计分。 1. 1 Genes vs environment - why families resemble each other 1. 1 基因 vs 环境 -- 家庭成员为何彼此相像 Susceptibility to complex disease comes from both genes and environment (that is Learning Outcome 1). Relatives resemble each other for two reasons that are hopelessly tangled: they share DNA (closer kin share more) and they share an environment (families eat, live and are exposed alike). That tangle is the central problem of the whole subject. 对复杂疾病的易感性同时来自基因与环境(即学习成果1)。亲属彼此相像有两个难解难分地纠缠在一起的原因:他们共享 DNA (越近的亲属共享得越多)且共享一个环境(家庭成员吃得、住得、所受暴露都相似)。这种纠缠正是整门学科的核心问 - 题。 POPH90111 . Genetic Epidemiology D2 Shared GENES Shared ENVIRONMENT[13]Source: asksia-bible-poph90111-bilingual.pdfOR = ad/bc。演算(ABCFS,“任一患病姐妹”):病例 13/462 对 对照 1/405 → OR =(13×404)/(449×1)= 11. 7 (95% CI 1. 7-98. 2) . 3 Interpret. OR > 1 and CI excluding 1 - positive familial aggregation - "evidence for, not proof of, a Beyond the 2x2 - the other measures 超越 2×2––其他度量 SMR · RECURRENCE - RISK RATIO SMR = Observed / Expected Expected = population age/sex rates x relatives' person-time Worked: mothers of cases O=45, E=17. 7 - SMR ~ AR = risk in a type-R relative of a case / population prevalence K FRR = RR given an affected 1st-degree relative vs general pop. POPH90111 . Genetic Epidemiology genetic aetiology. " 解读。OR>1且 CI 不含1→正向家族聚集性 -- “是支持 遗传性病因的证据,而非证明”。 ✓ Reading & R across degrees 跨亲缘等级解读 入R AR > 1 means relatives are at raised risk. If Ap declines as relatedness falls (sib > aunt > cousin), that gradient points to a genetic contribution; the rate of decline hints at polygenic (gentle decline) vs single- gene (steep) architecture. 入R >1 意味着亲属风险升高。若 入R 随亲缘关系下降 而递减(同胞>姑舅>表堂亲),该梯度指向遗传贡 献;递减的速率则提示是多基因(平缓递减)还是单 基因(陡峭)结构。 ! Match the design to the measure 让设计与度量相匹配 OR comes from case-control; RR / rate-ratio / SMR come from cohort designs. Quoting an RR off a case- control table (where you can't estimate risk) is a classic appraisal slip. And always close with the mantra: aggregation is evidence for, not proof of, an inherited aetiology. OR 来自病例对照;RR/ 率比 / SMR 来自队列设 计。从一张病例对照表(你无法估计风险)上引用 RR 是典型的评价失误。并且始终以那句口诀收尾:聚集 性是支持遗传性病因的证据,而非证明。 POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY UNDERSTAND . LO3 MODULE 2 . HERITABILITY How much of the variation is genetic? 变异中有多少源于遗传 Variance components . twin studies . Falconer . ACE . liability threshold 方差成分 · 双生子研究 · Falconer · ACE · 易感性阈值 Familial aggregation told us a trait runs in families. Heritability asks the sharper, quantitative question: of all the variation we see in a trait across a population, what proportion is due to genetic differences between people? The answer is a number between 0 and 1 - and the whole module is the machinery for estimating it, almost entirely from twins. This is the most calculation-heavy stretch of the course: you will partition variance, run Falconer's formula, solve the ACE equations, and map a yes/no disease onto a hidden continuous liability. Every estimate comes wrapped in the same critical-appraisal caution - heritability is a property of a population in an environment, never of an individual. 家族聚集性告诉我们某性状在家庭中流传。遗传率则提出更尖锐的定量问题:在一个人群中所见某性状的全部变异里,有多大 比例归因于人与人之间的遗传差异?答案是一个介于0与1之间的数字 -- 而整个模块就是估计它的机器,且几乎完全依靠 双生子。这是全课程计算量最大的一段:你将分解方差、运用 Falconer 公式、求解 ACE 方程组,并把一个是/否的疾病映射 到一个隐藏的连续易感性上。每个估计都裹挟着同一句批判性评价告诫 -- 遗传率是某一环境中某一人群的属性,绝非个体的 属性。
-
核心公式(最常用):
- OR(odds ratio):$OR=\dfrac{ad}{bc}$(2×2 表对角相乘/相除)[13]Source: asksia-bible-poph90111-bilingual.pdfOR = ad/bc。演算(ABCFS,“任一患病姐妹”):病例 13/462 对 对照 1/405 → OR =(13×404)/(449×1)= 11. 7 (95% CI 1. 7-98. 2) . 3 Interpret. OR > 1 and CI excluding 1 - positive familial aggregation - "evidence for, not proof of, a Beyond the 2x2 - the other measures 超越 2×2––其他度量 SMR · RECURRENCE - RISK RATIO SMR = Observed / Expected Expected = population age/sex rates x relatives' person-time Worked: mothers of cases O=45, E=17. 7 - SMR ~ AR = risk in a type-R relative of a case / population prevalence K FRR = RR given an affected 1st-degree relative vs general pop. POPH90111 . Genetic Epidemiology genetic aetiology. " 解读。OR>1且 CI 不含1→正向家族聚集性 -- “是支持 遗传性病因的证据,而非证明”。 ✓ Reading & R across degrees 跨亲缘等级解读 入R AR > 1 means relatives are at raised risk. If Ap declines as relatedness falls (sib > aunt > cousin), that gradient points to a genetic contribution; the rate of decline hints at polygenic (gentle decline) vs single- gene (steep) architecture. 入R >1 意味着亲属风险升高。若 入R 随亲缘关系下降 而递减(同胞>姑舅>表堂亲),该梯度指向遗传贡 献;递减的速率则提示是多基因(平缓递减)还是单 基因(陡峭)结构。 ! Match the design to the measure 让设计与度量相匹配 OR comes from case-control; RR / rate-ratio / SMR come from cohort designs. Quoting an RR off a case- control table (where you can't estimate risk) is a classic appraisal slip. And always close with the mantra: aggregation is evidence for, not proof of, an inherited aetiology. OR 来自病例对照;RR/ 率比 / SMR 来自队列设 计。从一张病例对照表(你无法估计风险)上引用 RR 是典型的评价失误。并且始终以那句口诀收尾:聚集 性是支持遗传性病因的证据,而非证明。 POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY UNDERSTAND . LO3 MODULE 2 . HERITABILITY How much of the variation is genetic? 变异中有多少源于遗传 Variance components . twin studies . Falconer . ACE . liability threshold 方差成分 · 双生子研究 · Falconer · ACE · 易感性阈值 Familial aggregation told us a trait runs in families. Heritability asks the sharper, quantitative question: of all the variation we see in a trait across a population, what proportion is due to genetic differences between people? The answer is a number between 0 and 1 - and the whole module is the machinery for estimating it, almost entirely from twins. This is the most calculation-heavy stretch of the course: you will partition variance, run Falconer's formula, solve the ACE equations, and map a yes/no disease onto a hidden continuous liability. Every estimate comes wrapped in the same critical-appraisal caution - heritability is a property of a population in an environment, never of an individual. 家族聚集性告诉我们某性状在家庭中流传。遗传率则提出更尖锐的定量问题:在一个人群中所见某性状的全部变异里,有多大 比例归因于人与人之间的遗传差异?答案是一个介于0与1之间的数字 -- 而整个模块就是估计它的机器,且几乎完全依靠 双生子。这是全课程计算量最大的一段:你将分解方差、运用 Falconer 公式、求解 ACE 方程组,并把一个是/否的疾病映射 到一个隐藏的连续易感性上。每个估计都裹挟着同一句批判性评价告诫 -- 遗传率是某一环境中某一人群的属性,绝非个体的 属性。[16]Source: asksia-cheatsheet-poph90111.pdfCompiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 32 . Test Performance 2×2 METRICS From a 2×2 of test (+/-) x true status (D+/D-): TP, FP, FN, TN. ACCURACY METRICS Sensitivity = TP/(TP+FN) P(+|disease) Specificity = TN/(TN+FP) P(-|healthy) PPV = TP/(TP+FP) . NPV = TN/(TN+FN) Sens & spec are intrinsic to the test; PPV rises & NPV falls as prevalence rises. In low-prevalence screening even a very specific test gives many false positives > low PPV. BAYES FORM PPV = (Sens . Prev) / [Sens . Prev + (1-Spec) (1-Prev) ] 33 . ROC & AUC DISCRIMINATION ROC: plot sensitivity (y) vs 1-specificity (x) as the cut- off moves. AUC = P(a random case scores higher than a random control): 0. 5 = chance (the diagonal), 1. 0 = perfect (top-left corner). In this course AUC appears for polygenic risk scores (e. g. coronary-artery-disease AUC = 0. 81). Moving the threshold trades sensitivity against specificity. Trap: excellent sens/spec is useless for screening if prevalence is tiny (PPV near zero) - always tie performance back to prevalence / carrier frequency. Weak instrument (MR) 33b . PPV Worked PREVALENCE BITES Sens=0. 90, Spec=0. 99. At prevalence 1%: PPV = (0. 9. 0. 01) / [0. 9. 0. 01 + 0. 01. 0. 99] = 0. 009/0. 0189 = 48% Half of positives are false - despite 99% specificity. At prevalence 10% the same test gives PPV = 91%. Lesson: raise the prior (target high-risk) before screening, or most positives are false alarms. Sens/spec are fixed properties of the test; only PPV/NPV move with prevalence - that single fact answers most "evaluate this screening test" questions. NPV is near-perfect when disease is rare (almost all test-negatives really are well), which is little comfort if the few positives are mostly false. 34 . Risk Reclassification PREVENTION PRECISION Adding a genetic factor (e. g. a polygenic score) re- classifies individuals across an actionable risk threshold - some move up (newly flagged high-risk), some down (reassured). The value of genetic screening = how many it correctly reclassifies + NNS/NNT, not "we can test, so we should. " Ties back to the Wilson- Jungner conditions (penetrance understood, accurate test, early actionable stage). A reclassification is only worthwhile if a person moving above the threshold gains an effective action (screening, prophylaxis, risk-reducing surgery). Reclassifying with no actionable consequence adds anxiety without benefit. 34b . Disease Screening USING GENETICS Two distinct goals: (1) screen for genetic risk in the well -> reduce future risk; (2) use a genetic factor to triage disease screening - e. g. start colonoscopy earlier / more often in MMR carriers. Both still demand an accurate test + an effective downstream action + favourable NNS in the targeted group. 35 . Appraisal Checklist L05 . EVERY MODULE LO5 (appraisal) threads through every module. For any study, answer design > measure > strength > limitation > bias: · Design? case-control / cohort / twin / GWAS / MR / family / weighted / case-only / screening · Causation? aggregation/association # cause; MR / replication / dose-response strengthen it
-
一定要写的解读句(送分句):
- 当 $OR>1$ 且置信区间不含 1:提示正向聚集性;但结尾必须写:
- “家族聚集性/ MZ>DZ 相关更高 是支持遗传病因的证据,但不是证明;共享环境始终是竞争解释。”[1]Source: asksia-bible-poph90111-bilingual.pdfA 1 . LEARN 1· 学习 You haven't watched the module yet. Read a chapter top to bottom. Every concept is an AHA-unit: diagram - explainer + method-in-steps - analogy + trap. The figures are original schematics of the standard genetic-epi canon - learn the idea cold before the numbers. 你还没看过该模块。从头到尾读 一章。每个概念都是一个 AHA 单元:图示→讲解→ 分步方法 →类比→陷阱。图均为标准遗 传流行病学经典内容的原创示意 图 -- 先把思路彻底吃透,再上 数字。 B 2 . DRILL 2 · 演练 You've watched the module and done the Q&A. Cover the worked steps and re-derive each OR, RR, SMR, h2, penetrance and NNT by hand. Then say aloud the interpretation and the one bias that would change your conclusion. 你已看过该模块并做完 Q&A。 遮住已演算的步骤,亲手重新推 导每一个 OR、RR、SMR、h2、 外显率和 NNT。然后大声说出解 读,以及那一个会改变你结论的 偏倚。 C 3 . ASSIGNMENT 3 · 作业 The assignment is open. Use the design-appraisal grids and worked-formula boxes as your lookup. Every question is calculate - interpret -+ appraise; the Stata . do file (A1 Q1) does the arithmetic, but the interpretation and appraisal marks are yours to write. 作业是开卷的。把设计评价表格 和公式演算框当作你的查阅工 具。每道题都是计算→解读→ 评价;Stata 的 . do 文件(A1 Q1)负责算术,但解读和评价的 分数要你自己写出来。 POPH90111 . Genetic Epidemiology ! The single most important thing to understand about POPH90111 关于 POPH90111 最重要的一点认识 This subject is a logical pipeline, not a pile of topics: UNDERSTAND (is there a genetic role? - familial aggregation M1, heritability M2) - DISCOVER (which variants? - association/GWAS M3) - CHARACTERISE (how risky? - penetrance M5, modifiers M6, G×E M7) - USE IN PRACTICE (causal exposures via MR M4, screening M8). One mantra recurs at every stage: familial aggregation / high MZ-vs-DZ correlation is 'evidence for, but not proof of, an inherited genetic aetiology' - shared environment is always the rival explanation. Write that sentence whenever you interpret aggregation or heritability. 本科目是一条逻辑流水线,而非一堆零散主题:理解(是否存在遗传作用?→家族聚集性 M1、遗传力 M2)→发现 (哪些变异?→关联/GWAS M3)→刻画(风险有多大?→外显率M5、修饰因子 M6、GxE M7)→实践应用(经 MR 评估因果暴露 M4、筛查 M8)。一句口诀贯穿每一阶段:家族聚集性/高的 MZ-对-DZ 相关是“支持遗传性病因的 证据,但非证明” -- 共享环境永远是竞争性解释。每当你解读聚集性或遗传力时都写下这句话。 i How this book was built - and the two-layer rule 本书如何编成 -- 以及两层规则 Standard genetic and statistical canon (Hardy-Weinberg, LD measures, Falconer's h2, the ACE model, Manhattan/QQ plots, MR's three IV assumptions, NNT/NNS) is stated plainly - it is universal, non-copyrightable science, and every diagram here is our own clean drawing. The course's specific framing and its lecturer's example numbers are paraphrased and re-checked, never copied from slides. The subject runs on A/Prof Aung Ko Win's modules with Prof Mark Jenkins' lectures; verify dates and weights against your own Canvas (the LMS), as cohort details can shift. 标准的遗传学与统计学经典(Hardy-Weinberg、LD 测度、Falconer h2、ACE 模型、曼哈顿图/QQ 图、MR 三个 IV 假设、NNT/NNS)均平实陈述 -- 它们是普适的、不受版权保护的科学,此处每张图都是我们自己干净的绘制。本课程 特定的表述及其讲师的示例数字均为转述并重新核对,绝不照搬幻灯片。本科目以 A/Prof Aung Ko Win 的模块和 Prof Mark Jenkins 的讲课为基础;请对照你自己的 Canvas(即 LMS)核实日期与权重,因为各届的细节可能变动。 POPH90111 . Genetic Epidemiology THE BLUEPRINT - THE ASSESSMENT BLUEPRINT 90% IN TWO ASSIGNMENTS Three take-home pieces, no exam 三份带回家作业,无考试[13]Source: asksia-bible-poph90111-bilingual.pdfOR = ad/bc。演算(ABCFS,“任一患病姐妹”):病例 13/462 对 对照 1/405 → OR =(13×404)/(449×1)= 11. 7 (95% CI 1. 7-98. 2) . 3 Interpret. OR > 1 and CI excluding 1 - positive familial aggregation - "evidence for, not proof of, a Beyond the 2x2 - the other measures 超越 2×2––其他度量 SMR · RECURRENCE - RISK RATIO SMR = Observed / Expected Expected = population age/sex rates x relatives' person-time Worked: mothers of cases O=45, E=17. 7 - SMR ~ AR = risk in a type-R relative of a case / population prevalence K FRR = RR given an affected 1st-degree relative vs general pop. POPH90111 . Genetic Epidemiology genetic aetiology. " 解读。OR>1且 CI 不含1→正向家族聚集性 -- “是支持 遗传性病因的证据,而非证明”。 ✓ Reading & R across degrees 跨亲缘等级解读 入R AR > 1 means relatives are at raised risk. If Ap declines as relatedness falls (sib > aunt > cousin), that gradient points to a genetic contribution; the rate of decline hints at polygenic (gentle decline) vs single- gene (steep) architecture. 入R >1 意味着亲属风险升高。若 入R 随亲缘关系下降 而递减(同胞>姑舅>表堂亲),该梯度指向遗传贡 献;递减的速率则提示是多基因(平缓递减)还是单 基因(陡峭)结构。 ! Match the design to the measure 让设计与度量相匹配 OR comes from case-control; RR / rate-ratio / SMR come from cohort designs. Quoting an RR off a case- control table (where you can't estimate risk) is a classic appraisal slip. And always close with the mantra: aggregation is evidence for, not proof of, an inherited aetiology. OR 来自病例对照;RR/ 率比 / SMR 来自队列设 计。从一张病例对照表(你无法估计风险)上引用 RR 是典型的评价失误。并且始终以那句口诀收尾:聚集 性是支持遗传性病因的证据,而非证明。 POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY UNDERSTAND . LO3 MODULE 2 . HERITABILITY How much of the variation is genetic? 变异中有多少源于遗传 Variance components . twin studies . Falconer . ACE . liability threshold 方差成分 · 双生子研究 · Falconer · ACE · 易感性阈值 Familial aggregation told us a trait runs in families. Heritability asks the sharper, quantitative question: of all the variation we see in a trait across a population, what proportion is due to genetic differences between people? The answer is a number between 0 and 1 - and the whole module is the machinery for estimating it, almost entirely from twins. This is the most calculation-heavy stretch of the course: you will partition variance, run Falconer's formula, solve the ACE equations, and map a yes/no disease onto a hidden continuous liability. Every estimate comes wrapped in the same critical-appraisal caution - heritability is a property of a population in an environment, never of an individual. 家族聚集性告诉我们某性状在家庭中流传。遗传率则提出更尖锐的定量问题:在一个人群中所见某性状的全部变异里,有多大 比例归因于人与人之间的遗传差异?答案是一个介于0与1之间的数字 -- 而整个模块就是估计它的机器,且几乎完全依靠 双生子。这是全课程计算量最大的一段:你将分解方差、运用 Falconer 公式、求解 ACE 方程组,并把一个是/否的疾病映射 到一个隐藏的连续易感性上。每个估计都裹挟着同一句批判性评价告诫 -- 遗传率是某一环境中某一人群的属性,绝非个体的 属性。[17]Source: asksia-cheatsheet-poph90111.pdfPOPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 1 OF 2 Foundations -> heritability -> association SIDE 1/2 UNDERSTAND . Genetics primer . Hardy-Weinberg . LD . Familial aggregation (OR/RR/SMR/A) . Heritability (Falconer, ACE, liability) . Association & GWAS 0 . How To Use This READ FIRST * This subject is a pipeline: UNDERSTAND (is there a genetic role? - aggregation, heritability) -> DISCOVER (which variants? - LD, GWAS, MR) -> CHARACTERISE (how risky? - penetrance, modifiers, GxE) > USE (screening). Side 1 = understand & discover; side 2 = characterise & use. Assessment shape: online MCQ 10% (10 Qs, 1-week window) + written A1 40% (Modules 1-3) + A2 50% (Modules 4-8). All online / take-home - no invigilated exam. Every assignment task is one of three: (a) calculate + interpret , (b) discuss findings, (c) critically appraise a design. So the two high-value moves are: plug the right formula, then judge the design's bias. A Stata . do file is even handed out for A1 Q1 - expect software-based calculation, then a written interpretation. SIA > The mantra that earns marks everywhere: aggregation / high MZ-vs-DZ correlation is "evidence for, but not proof of, an inherited genetic aetiology. " Say it whenever you interpret aggregation or heritability. 1 . Genetics Primer EXTRA-MODULE 1 Locus = position on a chromosome. Allele = the base(s) there; minor allele = rarer one. Genotype = the pair (e. g. TT, TC, CC); homo- vs hetero-zygous. Mutation classes: silent (usually benign), missense (changes amino acid), nonsense (premature stop), frameshift indels (corrupt every downstream codon > usually pathogenic). CNV = larger gain/loss. 2 · Modes of Inheritance RISK INEQUALITIES Defined on Pr(phenotype | # risk alleles), not on "having" the trait: AUTOSOMAL Dominant: Pr(2)=Pr(1) > Pr(0) Recessive: Pr(2) > Pr(1)=Pr(0) Codominant: Pr(2) > Pr(1) > Pr(0) Carrier risk can be <1 (incomplete penetrance) and non-carrier risk >0 (phenocopies/sporadic). So a dominant variant can still have penetrance below 100%. SEGREGATION (PUNNETT) Each parent passes one randomly-chosen allele. Aaxaa -> 1/2 Aa, 1/2 aa (no AA). AaxAa -> 1/4 AA, 1/2 Aa, 1/4 aa => P(child carries ≥1 A)=3/4, P(AA)=1/4. Trap: the genotype gives the expected probability distribution, not the realised counts in a small sibship. 2b · Germline vs Somatic SAMPLE CHOICE · Inherited colorectal-cancer family risk -> germline > sample blood / buccal swab Tumour responds differently to chemo, no family history -> somatic > sample the tumour biopsy
-
评价(appraise)常写点:
-
扩展你可能会被问的“其他聚集性度量”(知道它们是什么、啥设计能算出来):
- SMR:$SMR=\dfrac{Observed}{Expected}$(队列/随访,Expected 用人群率×人时等)[13]Source: asksia-bible-poph90111-bilingual.pdfOR = ad/bc。演算(ABCFS,“任一患病姐妹”):病例 13/462 对 对照 1/405 → OR =(13×404)/(449×1)= 11. 7 (95% CI 1. 7-98. 2) . 3 Interpret. OR > 1 and CI excluding 1 - positive familial aggregation - "evidence for, not proof of, a Beyond the 2x2 - the other measures 超越 2×2––其他度量 SMR · RECURRENCE - RISK RATIO SMR = Observed / Expected Expected = population age/sex rates x relatives' person-time Worked: mothers of cases O=45, E=17. 7 - SMR ~ AR = risk in a type-R relative of a case / population prevalence K FRR = RR given an affected 1st-degree relative vs general pop. POPH90111 . Genetic Epidemiology genetic aetiology. " 解读。OR>1且 CI 不含1→正向家族聚集性 -- “是支持 遗传性病因的证据,而非证明”。 ✓ Reading & R across degrees 跨亲缘等级解读 入R AR > 1 means relatives are at raised risk. If Ap declines as relatedness falls (sib > aunt > cousin), that gradient points to a genetic contribution; the rate of decline hints at polygenic (gentle decline) vs single- gene (steep) architecture. 入R >1 意味着亲属风险升高。若 入R 随亲缘关系下降 而递减(同胞>姑舅>表堂亲),该梯度指向遗传贡 献;递减的速率则提示是多基因(平缓递减)还是单 基因(陡峭)结构。 ! Match the design to the measure 让设计与度量相匹配 OR comes from case-control; RR / rate-ratio / SMR come from cohort designs. Quoting an RR off a case- control table (where you can't estimate risk) is a classic appraisal slip. And always close with the mantra: aggregation is evidence for, not proof of, an inherited aetiology. OR 来自病例对照;RR/ 率比 / SMR 来自队列设 计。从一张病例对照表(你无法估计风险)上引用 RR 是典型的评价失误。并且始终以那句口诀收尾:聚集 性是支持遗传性病因的证据,而非证明。 POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY UNDERSTAND . LO3 MODULE 2 . HERITABILITY How much of the variation is genetic? 变异中有多少源于遗传 Variance components . twin studies . Falconer . ACE . liability threshold 方差成分 · 双生子研究 · Falconer · ACE · 易感性阈值 Familial aggregation told us a trait runs in families. Heritability asks the sharper, quantitative question: of all the variation we see in a trait across a population, what proportion is due to genetic differences between people? The answer is a number between 0 and 1 - and the whole module is the machinery for estimating it, almost entirely from twins. This is the most calculation-heavy stretch of the course: you will partition variance, run Falconer's formula, solve the ACE equations, and map a yes/no disease onto a hidden continuous liability. Every estimate comes wrapped in the same critical-appraisal caution - heritability is a property of a population in an environment, never of an individual. 家族聚集性告诉我们某性状在家庭中流传。遗传率则提出更尖锐的定量问题:在一个人群中所见某性状的全部变异里,有多大 比例归因于人与人之间的遗传差异?答案是一个介于0与1之间的数字 -- 而整个模块就是估计它的机器,且几乎完全依靠 双生子。这是全课程计算量最大的一段:你将分解方差、运用 Falconer 公式、求解 ACE 方程组,并把一个是/否的疾病映射 到一个隐藏的连续易感性上。每个估计都裹挟着同一句批判性评价告诫 -- 遗传率是某一环境中某一人群的属性,绝非个体的 属性。
- AR / FRR:材料提示它们用于亲属风险与人群患病率对比(你写清“相对风险升高”与随亲缘度下降的梯度解释即可)[13]Source: asksia-bible-poph90111-bilingual.pdfOR = ad/bc。演算(ABCFS,“任一患病姐妹”):病例 13/462 对 对照 1/405 → OR =(13×404)/(449×1)= 11. 7 (95% CI 1. 7-98. 2) . 3 Interpret. OR > 1 and CI excluding 1 - positive familial aggregation - "evidence for, not proof of, a Beyond the 2x2 - the other measures 超越 2×2––其他度量 SMR · RECURRENCE - RISK RATIO SMR = Observed / Expected Expected = population age/sex rates x relatives' person-time Worked: mothers of cases O=45, E=17. 7 - SMR ~ AR = risk in a type-R relative of a case / population prevalence K FRR = RR given an affected 1st-degree relative vs general pop. POPH90111 . Genetic Epidemiology genetic aetiology. " 解读。OR>1且 CI 不含1→正向家族聚集性 -- “是支持 遗传性病因的证据,而非证明”。 ✓ Reading & R across degrees 跨亲缘等级解读 入R AR > 1 means relatives are at raised risk. If Ap declines as relatedness falls (sib > aunt > cousin), that gradient points to a genetic contribution; the rate of decline hints at polygenic (gentle decline) vs single- gene (steep) architecture. 入R >1 意味着亲属风险升高。若 入R 随亲缘关系下降 而递减(同胞>姑舅>表堂亲),该梯度指向遗传贡 献;递减的速率则提示是多基因(平缓递减)还是单 基因(陡峭)结构。 ! Match the design to the measure 让设计与度量相匹配 OR comes from case-control; RR / rate-ratio / SMR come from cohort designs. Quoting an RR off a case- control table (where you can't estimate risk) is a classic appraisal slip. And always close with the mantra: aggregation is evidence for, not proof of, an inherited aetiology. OR 来自病例对照;RR/ 率比 / SMR 来自队列设 计。从一张病例对照表(你无法估计风险)上引用 RR 是典型的评价失误。并且始终以那句口诀收尾:聚集 性是支持遗传性病因的证据,而非证明。 POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY UNDERSTAND . LO3 MODULE 2 . HERITABILITY How much of the variation is genetic? 变异中有多少源于遗传 Variance components . twin studies . Falconer . ACE . liability threshold 方差成分 · 双生子研究 · Falconer · ACE · 易感性阈值 Familial aggregation told us a trait runs in families. Heritability asks the sharper, quantitative question: of all the variation we see in a trait across a population, what proportion is due to genetic differences between people? The answer is a number between 0 and 1 - and the whole module is the machinery for estimating it, almost entirely from twins. This is the most calculation-heavy stretch of the course: you will partition variance, run Falconer's formula, solve the ACE equations, and map a yes/no disease onto a hidden continuous liability. Every estimate comes wrapped in the same critical-appraisal caution - heritability is a property of a population in an environment, never of an individual. 家族聚集性告诉我们某性状在家庭中流传。遗传率则提出更尖锐的定量问题:在一个人群中所见某性状的全部变异里,有多大 比例归因于人与人之间的遗传差异?答案是一个介于0与1之间的数字 -- 而整个模块就是估计它的机器,且几乎完全依靠 双生子。这是全课程计算量最大的一段:你将分解方差、运用 Falconer 公式、求解 ACE 方程组,并把一个是/否的疾病映射 到一个隐藏的连续易感性上。每个估计都裹挟着同一句批判性评价告诫 -- 遗传率是某一环境中某一人群的属性,绝非个体的 属性。
-
链条 B:遗传率(Heritability)——方差分解 → Falconer $h^2$ → ACE → 评价“等环境假设”
-
最核心的定义(必须背一句话):
- 遗传率是:在某一人群、某一环境中,表型方差中可归因于遗传方差的比例。关键词是“方差”,解释的是人群差异,不是个体。[10]Source: asksia-bible-poph90111-bilingual.pdf★ What the assignment / exam asks here 此处作业/考试考什么 (1) Partition variance: given Va, Vd, Ve compute narrow-sense h2 = Va/Vp and broad-sense H2 = Vg/Vp. (2) From twin correlations apply Falconer: h2 = 2(rMZ - rDZ) and interpret. (3) Solve the ACE system for A, C, E from rMZ and rDZ. (4) Convert SD - variance (Vp = SD2) and explain estimating separately by sex and zygosity. (5) Explain the liability-threshold model for a binary trait. (6) The marks-rich part: critically appraise - equal-environments assumption, 'evidence for not proof of', missing heritability, what heritability does not mean. (1)分解方差:给定 Va、Vd、Ve,计算狭义 h2 = Va/Vp 和广义 H2 = Vg/Vp。(2)由双生子相关应用 Falconer: h2= 2(rMZ - rDZ) 并解读。(3)由 rMZ 和 rDZ 求解 ACE 方程组得 A、C、E。(4)把 SD→方差(Vp= SD2)并解释为何 要按性别和合子性分别估计。(5)解释二分性状的易感性阈值模型。(6)富含分数的部分:严格评价一 -等环境假设、“是 证据而非证明”、缺失遗传力,以及遗传力不意味着什么。 2. 1 Heritability is a proportion of variance 2. 1 遗传率是方差的一个比例 Start from the definition the course insists on. Heritability is the proportion of the phenotypic variance in a population that is attributable to genetic variance. The key word is variance - we are explaining differences between people, not the trait value of any one person. Variance is the spread, the squared standard deviation: V = SD2. 从课程坚持采用的定义出发。遗传率是一个人群中表型方差里可归因于遗传方差的比例。关键词是方差 -- 我们解释的是人与 人之间的差异,而非任何一个人的性状取值。方差即离散度,是标准差的平方:V= SD2。 POPH90111 . Genetic Epidemiology VARIANCE PARTITION Top split: Vp = VG + VE Genetic split: VG = VA + Vp (+ VI) Broad-sense: H2 = VG / Vp Narrow-sense: h2 = VA / Vp (always h2 ≤ Hº) VA additive (allele effects sum - the part that passes parent-child); Vp dominance (interaction within a locus); VI epistasis (between loci); VE environment. VA 加性(等位基因效应相加 -- 可由亲代→子代传递的那 部分);VD 显性(同一位点内部的交互);V1上位(位点之 间);VE 环境。 D2 Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. Shared GENES(degree of relatedness)closer kin share more DNAShared ENVIRONMENTdiet, lifestyle, exposuresfamilies live alikeFamily resemblance(observed familial aggregation)aggregation = genes + shared env + chanceDegrees of relatedness1st parent/sib/childshare 1/22nd grandparent/uncleshare 1/43rd first cousinshare 1/8Read-off: familial aggregation is NECESSARY but NOT SUFFICIENT for a genetic cause - shared environment is the rival. Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. 图 2. 1 -- 家族相似性由共享基因和共享环境共同喂 养。遗传力是我们试图读出方差中遗传那一份的尝试 -- 但这两个来源彼此混杂,这正是我们需要双生子 的原因。 i Narrow vs broad - why we usually quote h2 狭义 vs 广义 -- 为何我们通常引用 h2 Only the additive part VA is transmitted predictably from parent to offspring and drives resemblance between relatives and response to selection. Dominance and epistasis effects are reshuffled each generation, so narrow- sense h2 is the quantity that predicts relatives' correlations - which is precisely what twin studies measure. 只有加性部分 VA 可从亲代到子代可预测地传递,并驱动亲属间的相似性和对选择的响应。显性与上位效应每代都被重 新洗牌,因此狭义 h2 才是预测亲属相关的量 -- 这正是双生子研究所测量的。 - POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY 2. 2 SD - variance: the arithmetic you will be marked on 2. 2 SD→ 方差:会被评分的算术 Almost every heritability question hands you a standard deviation or a correlation, not a variance. The single most-dropped mark is forgetting that variance = SD squared. The course also estimates everything separately by sex and by zygosity, because the spread genuinely differs between groups. 几乎每道遗传率题目给你的都是标准差或相关,而非方差。最常被丢掉的一分就是忘了方差= SD 的平方。课程还会按性别和 合子性分别估计一切,因为各组间的离散度确实不同。 - EX 2A Turning a standard deviation into a variance L . variance A tutorial reports female adult height with SD = 9. 29 cm. The genetic component is estimated as VA = 55. Find Vp and then narrow-sense h2.[13]Source: asksia-bible-poph90111-bilingual.pdfOR = ad/bc。演算(ABCFS,“任一患病姐妹”):病例 13/462 对 对照 1/405 → OR =(13×404)/(449×1)= 11. 7 (95% CI 1. 7-98. 2) . 3 Interpret. OR > 1 and CI excluding 1 - positive familial aggregation - "evidence for, not proof of, a Beyond the 2x2 - the other measures 超越 2×2––其他度量 SMR · RECURRENCE - RISK RATIO SMR = Observed / Expected Expected = population age/sex rates x relatives' person-time Worked: mothers of cases O=45, E=17. 7 - SMR ~ AR = risk in a type-R relative of a case / population prevalence K FRR = RR given an affected 1st-degree relative vs general pop. POPH90111 . Genetic Epidemiology genetic aetiology. " 解读。OR>1且 CI 不含1→正向家族聚集性 -- “是支持 遗传性病因的证据,而非证明”。 ✓ Reading & R across degrees 跨亲缘等级解读 入R AR > 1 means relatives are at raised risk. If Ap declines as relatedness falls (sib > aunt > cousin), that gradient points to a genetic contribution; the rate of decline hints at polygenic (gentle decline) vs single- gene (steep) architecture. 入R >1 意味着亲属风险升高。若 入R 随亲缘关系下降 而递减(同胞>姑舅>表堂亲),该梯度指向遗传贡 献;递减的速率则提示是多基因(平缓递减)还是单 基因(陡峭)结构。 ! Match the design to the measure 让设计与度量相匹配 OR comes from case-control; RR / rate-ratio / SMR come from cohort designs. Quoting an RR off a case- control table (where you can't estimate risk) is a classic appraisal slip. And always close with the mantra: aggregation is evidence for, not proof of, an inherited aetiology. OR 来自病例对照;RR/ 率比 / SMR 来自队列设 计。从一张病例对照表(你无法估计风险)上引用 RR 是典型的评价失误。并且始终以那句口诀收尾:聚集 性是支持遗传性病因的证据,而非证明。 POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY UNDERSTAND . LO3 MODULE 2 . HERITABILITY How much of the variation is genetic? 变异中有多少源于遗传 Variance components . twin studies . Falconer . ACE . liability threshold 方差成分 · 双生子研究 · Falconer · ACE · 易感性阈值 Familial aggregation told us a trait runs in families. Heritability asks the sharper, quantitative question: of all the variation we see in a trait across a population, what proportion is due to genetic differences between people? The answer is a number between 0 and 1 - and the whole module is the machinery for estimating it, almost entirely from twins. This is the most calculation-heavy stretch of the course: you will partition variance, run Falconer's formula, solve the ACE equations, and map a yes/no disease onto a hidden continuous liability. Every estimate comes wrapped in the same critical-appraisal caution - heritability is a property of a population in an environment, never of an individual. 家族聚集性告诉我们某性状在家庭中流传。遗传率则提出更尖锐的定量问题:在一个人群中所见某性状的全部变异里,有多大 比例归因于人与人之间的遗传差异?答案是一个介于0与1之间的数字 -- 而整个模块就是估计它的机器,且几乎完全依靠 双生子。这是全课程计算量最大的一段:你将分解方差、运用 Falconer 公式、求解 ACE 方程组,并把一个是/否的疾病映射 到一个隐藏的连续易感性上。每个估计都裹挟着同一句批判性评价告诫 -- 遗传率是某一环境中某一人群的属性,绝非个体的 属性。
-
方差最容易丢分的点:
-
方差分解(写出来就像你很懂,而且真会考):
-
双生子研究的 Falconer 公式(高频计算):
- 连续性状:$$h^2 = 2(r_{MZ}-r_{DZ})$$[10]Source: asksia-bible-poph90111-bilingual.pdf★ What the assignment / exam asks here 此处作业/考试考什么 (1) Partition variance: given Va, Vd, Ve compute narrow-sense h2 = Va/Vp and broad-sense H2 = Vg/Vp. (2) From twin correlations apply Falconer: h2 = 2(rMZ - rDZ) and interpret. (3) Solve the ACE system for A, C, E from rMZ and rDZ. (4) Convert SD - variance (Vp = SD2) and explain estimating separately by sex and zygosity. (5) Explain the liability-threshold model for a binary trait. (6) The marks-rich part: critically appraise - equal-environments assumption, 'evidence for not proof of', missing heritability, what heritability does not mean. (1)分解方差:给定 Va、Vd、Ve,计算狭义 h2 = Va/Vp 和广义 H2 = Vg/Vp。(2)由双生子相关应用 Falconer: h2= 2(rMZ - rDZ) 并解读。(3)由 rMZ 和 rDZ 求解 ACE 方程组得 A、C、E。(4)把 SD→方差(Vp= SD2)并解释为何 要按性别和合子性分别估计。(5)解释二分性状的易感性阈值模型。(6)富含分数的部分:严格评价一 -等环境假设、“是 证据而非证明”、缺失遗传力,以及遗传力不意味着什么。 2. 1 Heritability is a proportion of variance 2. 1 遗传率是方差的一个比例 Start from the definition the course insists on. Heritability is the proportion of the phenotypic variance in a population that is attributable to genetic variance. The key word is variance - we are explaining differences between people, not the trait value of any one person. Variance is the spread, the squared standard deviation: V = SD2. 从课程坚持采用的定义出发。遗传率是一个人群中表型方差里可归因于遗传方差的比例。关键词是方差 -- 我们解释的是人与 人之间的差异,而非任何一个人的性状取值。方差即离散度,是标准差的平方:V= SD2。 POPH90111 . Genetic Epidemiology VARIANCE PARTITION Top split: Vp = VG + VE Genetic split: VG = VA + Vp (+ VI) Broad-sense: H2 = VG / Vp Narrow-sense: h2 = VA / Vp (always h2 ≤ Hº) VA additive (allele effects sum - the part that passes parent-child); Vp dominance (interaction within a locus); VI epistasis (between loci); VE environment. VA 加性(等位基因效应相加 -- 可由亲代→子代传递的那 部分);VD 显性(同一位点内部的交互);V1上位(位点之 间);VE 环境。 D2 Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. Shared GENES(degree of relatedness)closer kin share more DNAShared ENVIRONMENTdiet, lifestyle, exposuresfamilies live alikeFamily resemblance(observed familial aggregation)aggregation = genes + shared env + chanceDegrees of relatedness1st parent/sib/childshare 1/22nd grandparent/uncleshare 1/43rd first cousinshare 1/8Read-off: familial aggregation is NECESSARY but NOT SUFFICIENT for a genetic cause - shared environment is the rival. Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. 图 2. 1 -- 家族相似性由共享基因和共享环境共同喂 养。遗传力是我们试图读出方差中遗传那一份的尝试 -- 但这两个来源彼此混杂,这正是我们需要双生子 的原因。 i Narrow vs broad - why we usually quote h2 狭义 vs 广义 -- 为何我们通常引用 h2 Only the additive part VA is transmitted predictably from parent to offspring and drives resemblance between relatives and response to selection. Dominance and epistasis effects are reshuffled each generation, so narrow- sense h2 is the quantity that predicts relatives' correlations - which is precisely what twin studies measure. 只有加性部分 VA 可从亲代到子代可预测地传递,并驱动亲属间的相似性和对选择的响应。显性与上位效应每代都被重 新洗牌,因此狭义 h2 才是预测亲属相关的量 -- 这正是双生子研究所测量的。 - POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY 2. 2 SD - variance: the arithmetic you will be marked on 2. 2 SD→ 方差:会被评分的算术 Almost every heritability question hands you a standard deviation or a correlation, not a variance. The single most-dropped mark is forgetting that variance = SD squared. The course also estimates everything separately by sex and by zygosity, because the spread genuinely differs between groups. 几乎每道遗传率题目给你的都是标准差或相关,而非方差。最常被丢掉的一分就是忘了方差= SD 的平方。课程还会按性别和 合子性分别估计一切,因为各组间的离散度确实不同。 - EX 2A Turning a standard deviation into a variance L . variance A tutorial reports female adult height with SD = 9. 29 cm. The genetic component is estimated as VA = 55. Find Vp and then narrow-sense h2.[22]Source: asksia-cheatsheet-poph90111.pdfTHRESHOLDS Bonferroni: a = 0. 05 / (# tests) genome-wide significance = 5x10-8 5×10-8 = 0. 05/106 independent common-variant tests; hits must replicate independently . Worked: a candidate study of 50 SNPs = Bonferroni a = 0. 05/50 = 0. 001 - a SNP at p=0. 01 is not significant after correction. Trap: Bonferroni is conservative (LD makes tests correlated) but 5x10-8 is the field standard - use it for GWAS. 17 . Manhattan & QQ READ THE PLOT Manhattan: x = genomic position, y = - log1%(p). Peaks crossing -log10(5×10-8) = 7. 3 = associated loci. QQ plot: observed vs expected -log10(p) under the null. On the diagonal = no inflation; an early, whole-line upward lift = stratification / cryptic relatedness / artefact (genomic inflation ^_GC; )=1 is good); a departure only in the extreme tail = genuine signal. Trap: don't read a single Manhattan peak as "the causal gene" - the top SNP is usually the best tag in LD with the true causal variant, so fine-mapping is needed to localise the cause. 18 . Pop. Stratification KEY CONFOUNDER Cases & controls differ in ancestry; both allele freqs & disease rates vary by ancestry = spurious association (confounding). Fixes: match on ancestry, adjust for principal components, genomic control (\_GC), or family-based designs; HWE deviation in controls helps flag it. This is why a hit must replicate in an independent sample and why GWAS report __ GC - a clean QQ plot ()=1) is the reassurance that genuine signal, not stratification, is driving the Manhattan peaks. > > 1=> inflate-corrected before trusting any hit. Formula Belt SIDE 1 p=[2n(AA)+n(Aa)]/2N . carrier=p2+2pq HWE p2+2pq+q2=1 · χ2=Σ (0-E) 2/E df1 r2=D2/[P(A)P(a)P(B)P(b)] · OR=ad/bc h2=2(r_MZ-r_DZ) . A=2(r_MZ-r_DZ) SMR=0/E · A_R=relative risk/K . GWS 5×10-8 Calculation & method reference . check the current subject guide . @ 2026 flip + for side 2 . MR, penetrance, GxE & screening asksia. ai/cheatsheet/ unimelb-poph90111 . side 1/2 AskSia CHEATSHEET SERIES METHOD REFERENCE . ALL TOPICS POPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 2 OF 2 MR . penetrance . GxE . screening . appraisal 19 . Mendelian Randomisation Use a genetic variant as an instrumental variable (IV/proxy) for a modifiable exposure to test causation. Genotype is randomly allocated at conception ("nature's RCT") = not subject to reverse causation or conventional confounding
- 二分类性状(用一致率/同病率近似):$$h^2 = 2(conc_{MZ}-conc_{DZ})$$[30]Source: asksia-cheatsheet-poph90111.pdf8b . Family Designs M1 EXTRAS Case-control-family / case-family: relatives directly interviewed = OR / RR / SMR; relatives of controls are hard to recruit, and the case-family design needs a population registry. Outcome can be analysed as dichotomous (affected y/n), ordinal (number affected) or multinomial - match the analysis to how family history was coded. 9 . Heritability MODULE 2 = proportion of phenotypic variance due to genetic variance. A property of a population in an environment, not an individual. Variance = SD2 (e. g. height SD 9. 29 => variance = 86). VARIANCE PARTITION Vp = Vg + Ve Vg = Va + Vd (+ Vi) Broad-sense H2 = Vg/Vp Narrow-sense h2 = Va/Vp (h2 ≤ H2) Narrow-sense (additive Va) predicts relative resemblance & response to selection; Vd = dominance, Vi = epistatic/interaction variance. Estimate variance separately by sex & zygosity (M>F; DZ>MZ spread). 10 . Twin Studies THE ENGINE MZ share ~100% genes; DZ ~50% (like full sibs). Both share rearing env -> comparing them isolates genetics; twins control for age & shared env. Binary: concordance = proportion of pairs both affected; conc_MZ > conc_DZ => genetic . Continuous: correlate twin-1 vs twin-2. FALCONER'S HERITABILITY h2 = 2 (r_MZ - r_DZ) (continuous) Worked: h2 = 2 (conc_MZ - conc_DZ) (binary) Worked: female height r_MZ=0. 78, r_DZ=0. 46 = h2 = 2(0. 78-0. 46) = 0. 64 - 64% of variance in female height is additively genetic. Interpret: "consistent with, but not proof of, an inherited genetic aetiology. " Genetic variance from heritability: Vg = h2 x Vp. With Vp=86 and h2=0. 64 => Vg=55. Opposite-sex DZ pairs & the twin-co-twin (TRA) design extend the model to probe shared-environment and sex effects. 11 . ACE Model VARIANCE COMPONENTS Split Vp into A additive genetic, C common/shared env, E unique env + error. From twin correlations: ACE FROM R_MZ, R_DZ r_MZ = A + C . r_DZ = %A + C A = 2(r_MZ - r_DZ) (= Falconer) C = 2. r_DZ - r_MZ . E = 1 - r_MZ Worked: r_MZ=0. 78, r_DZ=0. 46 => A=2(0. 78-0. 46)=0. 64; C=2(0. 46)-0. 78=0. 14; E=1-0. 78=0. 22. Check: A+C+E = 0. 64+0. 14+0. 22 = 1. 00 V. So C is the part of resemblance shared equally by both twin types; E (incl. measurement error) is the only thing that makes MZ co-twins differ. Trap - equal- environments assumption: if MZ pairs are treated more alike than DZ, shared env masquerades as genes = h2 overestimated . 4 11b . Classic Twin Model ASSUMPTIONS · Random mating (no assortative mating inflating r_DZ) Concordance/correlation are estimated separately by sex & zygosity because variance differs. Binary worked: conc_MZ=0. 40, conc_DZ=0. 15 => h2(liability) = 2(0. 40-0. 15) = 0. 50 . MZ>DZ concordance is the signal; near-equality (conc_MZ=conc_DZ) => shared environment, not genes, drives the resemblance. Compiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 12 . Liability- Threshold BINARY TRAITS
-
Falconer 的标准解读模板:
- “$h^2=0.64$ 表示该性状在人群差异中约 64% 可由加性遗传解释(在该人群与环境下)”;并补一句:
- “consistent with / evidence for, but not proof of…”[30]Source: asksia-cheatsheet-poph90111.pdf8b . Family Designs M1 EXTRAS Case-control-family / case-family: relatives directly interviewed = OR / RR / SMR; relatives of controls are hard to recruit, and the case-family design needs a population registry. Outcome can be analysed as dichotomous (affected y/n), ordinal (number affected) or multinomial - match the analysis to how family history was coded. 9 . Heritability MODULE 2 = proportion of phenotypic variance due to genetic variance. A property of a population in an environment, not an individual. Variance = SD2 (e. g. height SD 9. 29 => variance = 86). VARIANCE PARTITION Vp = Vg + Ve Vg = Va + Vd (+ Vi) Broad-sense H2 = Vg/Vp Narrow-sense h2 = Va/Vp (h2 ≤ H2) Narrow-sense (additive Va) predicts relative resemblance & response to selection; Vd = dominance, Vi = epistatic/interaction variance. Estimate variance separately by sex & zygosity (M>F; DZ>MZ spread). 10 . Twin Studies THE ENGINE MZ share ~100% genes; DZ ~50% (like full sibs). Both share rearing env -> comparing them isolates genetics; twins control for age & shared env. Binary: concordance = proportion of pairs both affected; conc_MZ > conc_DZ => genetic . Continuous: correlate twin-1 vs twin-2. FALCONER'S HERITABILITY h2 = 2 (r_MZ - r_DZ) (continuous) Worked: h2 = 2 (conc_MZ - conc_DZ) (binary) Worked: female height r_MZ=0. 78, r_DZ=0. 46 = h2 = 2(0. 78-0. 46) = 0. 64 - 64% of variance in female height is additively genetic. Interpret: "consistent with, but not proof of, an inherited genetic aetiology. " Genetic variance from heritability: Vg = h2 x Vp. With Vp=86 and h2=0. 64 => Vg=55. Opposite-sex DZ pairs & the twin-co-twin (TRA) design extend the model to probe shared-environment and sex effects. 11 . ACE Model VARIANCE COMPONENTS Split Vp into A additive genetic, C common/shared env, E unique env + error. From twin correlations: ACE FROM R_MZ, R_DZ r_MZ = A + C . r_DZ = %A + C A = 2(r_MZ - r_DZ) (= Falconer) C = 2. r_DZ - r_MZ . E = 1 - r_MZ Worked: r_MZ=0. 78, r_DZ=0. 46 => A=2(0. 78-0. 46)=0. 64; C=2(0. 46)-0. 78=0. 14; E=1-0. 78=0. 22. Check: A+C+E = 0. 64+0. 14+0. 22 = 1. 00 V. So C is the part of resemblance shared equally by both twin types; E (incl. measurement error) is the only thing that makes MZ co-twins differ. Trap - equal- environments assumption: if MZ pairs are treated more alike than DZ, shared env masquerades as genes = h2 overestimated . 4 11b . Classic Twin Model ASSUMPTIONS · Random mating (no assortative mating inflating r_DZ) Concordance/correlation are estimated separately by sex & zygosity because variance differs. Binary worked: conc_MZ=0. 40, conc_DZ=0. 15 => h2(liability) = 2(0. 40-0. 15) = 0. 50 . MZ>DZ concordance is the signal; near-equality (conc_MZ=conc_DZ) => shared environment, not genes, drives the resemblance. Compiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 12 . Liability- Threshold BINARY TRAITS[17]Source: asksia-cheatsheet-poph90111.pdfPOPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 1 OF 2 Foundations -> heritability -> association SIDE 1/2 UNDERSTAND . Genetics primer . Hardy-Weinberg . LD . Familial aggregation (OR/RR/SMR/A) . Heritability (Falconer, ACE, liability) . Association & GWAS 0 . How To Use This READ FIRST * This subject is a pipeline: UNDERSTAND (is there a genetic role? - aggregation, heritability) -> DISCOVER (which variants? - LD, GWAS, MR) -> CHARACTERISE (how risky? - penetrance, modifiers, GxE) > USE (screening). Side 1 = understand & discover; side 2 = characterise & use. Assessment shape: online MCQ 10% (10 Qs, 1-week window) + written A1 40% (Modules 1-3) + A2 50% (Modules 4-8). All online / take-home - no invigilated exam. Every assignment task is one of three: (a) calculate + interpret , (b) discuss findings, (c) critically appraise a design. So the two high-value moves are: plug the right formula, then judge the design's bias. A Stata . do file is even handed out for A1 Q1 - expect software-based calculation, then a written interpretation. SIA > The mantra that earns marks everywhere: aggregation / high MZ-vs-DZ correlation is "evidence for, but not proof of, an inherited genetic aetiology. " Say it whenever you interpret aggregation or heritability. 1 . Genetics Primer EXTRA-MODULE 1 Locus = position on a chromosome. Allele = the base(s) there; minor allele = rarer one. Genotype = the pair (e. g. TT, TC, CC); homo- vs hetero-zygous. Mutation classes: silent (usually benign), missense (changes amino acid), nonsense (premature stop), frameshift indels (corrupt every downstream codon > usually pathogenic). CNV = larger gain/loss. 2 · Modes of Inheritance RISK INEQUALITIES Defined on Pr(phenotype | # risk alleles), not on "having" the trait: AUTOSOMAL Dominant: Pr(2)=Pr(1) > Pr(0) Recessive: Pr(2) > Pr(1)=Pr(0) Codominant: Pr(2) > Pr(1) > Pr(0) Carrier risk can be <1 (incomplete penetrance) and non-carrier risk >0 (phenocopies/sporadic). So a dominant variant can still have penetrance below 100%. SEGREGATION (PUNNETT) Each parent passes one randomly-chosen allele. Aaxaa -> 1/2 Aa, 1/2 aa (no AA). AaxAa -> 1/4 AA, 1/2 Aa, 1/4 aa => P(child carries ≥1 A)=3/4, P(AA)=1/4. Trap: the genotype gives the expected probability distribution, not the realised counts in a small sibship. 2b · Germline vs Somatic SAMPLE CHOICE · Inherited colorectal-cancer family risk -> germline > sample blood / buccal swab Tumour responds differently to chemo, no family history -> somatic > sample the tumour biopsy
-
ACE 模型(比 Falconer 更“现代”,也更能拿评价分):
- 含义:$A$ 加性遗传、$C$ 共享环境、$E$ 独有环境+测量误差。[15]Source: asksia-bible-poph90111-bilingual.pdfH2 = 2(0. 78-0. 46) h2 = 2(0. 78-0. 46) NARROW ≤ BROAD 狭义 ≤广义 POPH90111 . Genetic Epidemiology MODULE 2 . ACE MODEL 2. 5 AHA-unit - the ACE variance-components model 2. 5 AHA 单元 -- ACE 方差成分模型 Falconer gives you h2 but throws away the rest. The modern framing - ACE - partitions the standardised phenotypic variance (scaled to 1) into three pieces and solves for all of them. A = additive genetic, C = common (shared) environment, E = unique environment + measurement error. The trick is that A, C, E enter the MZ and DZ correlations differently, giving two equations in the unknowns. Falconer 给你 h2,却丢掉了其余部分。现代框架 -- ACE -- 把标准化的表型方差(缩放为1)分解成三块并将它们全部解 出。A=加性遗传,C= 共同(共享)环境,E= 独特环境+测量误差。诀窍在于 A、C、E 进入 MZ与 DZ 相关的方式各不 相同,从而给出关于这些未知量的两个方程。 D9 Fig 2. 3 - Each twin's phenotype is fed by latent A, C, E. The cross-twin A path is 1. 0 for MZ but 0. 5 for DZ; the C path is 1. 0 for both; E never crosses (it is, by definition, unique). Those three facts are the whole model. PhenotypeP1 (twin 1)ACEPhenotypeP2 (twin 2)ACETWIN 1TWIN 2rA = 1. 0 (MZ) / 0. 5 (DZ)rC = 1. 0 (both MZ & DZ)E uncorrelated across twins (no cross-twin path)A = additive geneticC = common (shared) envE = unique env + errorVar(P) = A + C + Eh2 = A / (A+C+E)c2 = C / (A+C+E)ACE: phenotypic variance = Additive genetic + Common env + unique Env Fig 2. 3 - Each twin's phenotype is fed by latent A, C, E. The cross-twin A path is 1. 0 for MZ but 0. 5 for DZ; the C path is 1. 0 for both; E never crosses (it is, by definition, unique). Those three facts are the whole model. 图 2. 3 -- 每个双生子的表型由潜在的A、C、E喂养。跨双生子的 A 路径在 MZ 为 1. 0、在 DZ为 0. 5;C 路径对两 者都为1. 0;E从不跨越(按定义它是独有的)。这三个事实就是整个模型。 POPH90111 . Genetic Epidemiology ACE EQUATIONS Scaling: A + C + E = 1 (variance set to 1) MZ corr : rMz = A + C DZ corr: roz = 12A + C Solve: A = 2(IMZ C = 2. rDZ rMz − E = 1 - rMZ Note A = Falconer's h2 exactly - ACE contains Falconer and adds C and E for free. 注意 A= Falconer 的h2,完全相等––ACE 包含 Falconer,并额外免费给出 C 与 E。 i Why subtract to get C and E 为何用相减得到 C 与 E Subtract the two correlation equations: (A+C) - (1/2A+C) = 1/2A, so A = 2(rMZ-rDZ). Whatever of rMZ is not A must be the shared environment, giving C = rMZ - A = 2rDZ - rMZ. And since MZ twins are genetically identical with identical C, any failure of rMZ to reach 1 is the unique part: E = 1 - rMZ. 把两个相关方程相减:(A+C)-(1/2A+C) = 1/2A,故 A =2(rMZ-rDZ)。rMZ 中不属于 A的部分必为共享环 境,得到 C=rMZ-A=2rDZ- rMZ。又因 MZ 双 生子遗传相同且 C 相同,rMZ 未能达到1的部分即为 独有部分:E=1-rMZ。 EX 2C Full ACE decomposition (female height) M . the marks-rich solve Using rMz = 0. 78 and rpz = 0. 46, split the variance into A, C and E. 用 rMz= 0. 78 与 rpz = 0. 46,把方差分解为 A、C 和 E。 1 A (additive genetic). A = 2(0. 78 - 0. 46) = 2(0. 32) = 0. 64 - identical to the Falconer answer. V
- 方程:
- $r_{MZ}=A+C$
- $r_{DZ}=\tfrac{1}{2}A + C$
- 直接可用的解法(公式要熟):
- $$A = 2(r_{MZ}-r_{DZ})$$
- $$C = 2r_{DZ}-r_{MZ}$$
- $$E = 1-r_{MZ}$$[15]Source: asksia-bible-poph90111-bilingual.pdfH2 = 2(0. 78-0. 46) h2 = 2(0. 78-0. 46) NARROW ≤ BROAD 狭义 ≤广义 POPH90111 . Genetic Epidemiology MODULE 2 . ACE MODEL 2. 5 AHA-unit - the ACE variance-components model 2. 5 AHA 单元 -- ACE 方差成分模型 Falconer gives you h2 but throws away the rest. The modern framing - ACE - partitions the standardised phenotypic variance (scaled to 1) into three pieces and solves for all of them. A = additive genetic, C = common (shared) environment, E = unique environment + measurement error. The trick is that A, C, E enter the MZ and DZ correlations differently, giving two equations in the unknowns. Falconer 给你 h2,却丢掉了其余部分。现代框架 -- ACE -- 把标准化的表型方差(缩放为1)分解成三块并将它们全部解 出。A=加性遗传,C= 共同(共享)环境,E= 独特环境+测量误差。诀窍在于 A、C、E 进入 MZ与 DZ 相关的方式各不 相同,从而给出关于这些未知量的两个方程。 D9 Fig 2. 3 - Each twin's phenotype is fed by latent A, C, E. The cross-twin A path is 1. 0 for MZ but 0. 5 for DZ; the C path is 1. 0 for both; E never crosses (it is, by definition, unique). Those three facts are the whole model. PhenotypeP1 (twin 1)ACEPhenotypeP2 (twin 2)ACETWIN 1TWIN 2rA = 1. 0 (MZ) / 0. 5 (DZ)rC = 1. 0 (both MZ & DZ)E uncorrelated across twins (no cross-twin path)A = additive geneticC = common (shared) envE = unique env + errorVar(P) = A + C + Eh2 = A / (A+C+E)c2 = C / (A+C+E)ACE: phenotypic variance = Additive genetic + Common env + unique Env Fig 2. 3 - Each twin's phenotype is fed by latent A, C, E. The cross-twin A path is 1. 0 for MZ but 0. 5 for DZ; the C path is 1. 0 for both; E never crosses (it is, by definition, unique). Those three facts are the whole model. 图 2. 3 -- 每个双生子的表型由潜在的A、C、E喂养。跨双生子的 A 路径在 MZ 为 1. 0、在 DZ为 0. 5;C 路径对两 者都为1. 0;E从不跨越(按定义它是独有的)。这三个事实就是整个模型。 POPH90111 . Genetic Epidemiology ACE EQUATIONS Scaling: A + C + E = 1 (variance set to 1) MZ corr : rMz = A + C DZ corr: roz = 12A + C Solve: A = 2(IMZ C = 2. rDZ rMz − E = 1 - rMZ Note A = Falconer's h2 exactly - ACE contains Falconer and adds C and E for free. 注意 A= Falconer 的h2,完全相等––ACE 包含 Falconer,并额外免费给出 C 与 E。 i Why subtract to get C and E 为何用相减得到 C 与 E Subtract the two correlation equations: (A+C) - (1/2A+C) = 1/2A, so A = 2(rMZ-rDZ). Whatever of rMZ is not A must be the shared environment, giving C = rMZ - A = 2rDZ - rMZ. And since MZ twins are genetically identical with identical C, any failure of rMZ to reach 1 is the unique part: E = 1 - rMZ. 把两个相关方程相减:(A+C)-(1/2A+C) = 1/2A,故 A =2(rMZ-rDZ)。rMZ 中不属于 A的部分必为共享环 境,得到 C=rMZ-A=2rDZ- rMZ。又因 MZ 双 生子遗传相同且 C 相同,rMZ 未能达到1的部分即为 独有部分:E=1-rMZ。 EX 2C Full ACE decomposition (female height) M . the marks-rich solve Using rMz = 0. 78 and rpz = 0. 46, split the variance into A, C and E. 用 rMz= 0. 78 与 rpz = 0. 46,把方差分解为 A、C 和 E。 1 A (additive genetic). A = 2(0. 78 - 0. 46) = 2(0. 32) = 0. 64 - identical to the Falconer answer. V[8]Source: asksia-bible-poph90111-bilingual.pdf02 Decompose the same data with the ACE model Q2 用 ACE 模型分解同一组数据 Q2 CALCULATE + INTERPRET 4 marks . Heritability Using the same correlations (rMz = 0. 70, roz = 0. 41), fit the ACE model: find A, C and E, and say what each means. Confirm A reproduces the Falconer estimate. 用相同的相关系数(rMz = 0. 70,rpz= 0. 41)拟合 ACE 模型:求出 A、C、E,并说明各自含义。验证 A 是否重现 Falconer 估计值。 POPH90111 . Genetic Epidemiology 02 Worked solution - ACE solve ACE FROM CORRELATIONS rMz = A + C rnz = 42A + C - A = 2(rMz-rpZ), C = 2rpz-rMz, E = 1-rMz 1 A (additive genetic). A = 2(0. 70 - 0. 41) = 0. 58 - identical to the Falconer h2 (Falconer's formula is the ACE A term). V A (加性遗传)。A= 2(0. 70-0. 41)= 0. 58 -- 与 Falconer h2 一致(Falconer 公式就是ACE的A项)。√ 2 C (common/shared environment). C = 2(0. 41) - 0. 70 = 0. 82 - 0. 70 = 0. 12 -+ ~12% of variance from environment shared by twins (diet, household). C (共同/共享环境)。C= 2(0. 41)-0. 70= 0. 82 -0. 70= 0. 12→约12% 的方差来自双生子共享的环境(饮食、家 庭)。 3 E (unique environment + error). E = 1 - rMz = 1 - 0. 70 = 0. 30 -+~30% from non-shared exposures and measurement error. E (独有环境+误差)。E =1-rMz = 1-0. 70= 0. 30→ 约30% 来自非共享暴露和测量误差。 4 Check. A + C + E = 0. 58 + 0. 12 + 0. 30 = 1. 00 / (the three components partition the whole phenotypic variance). 核查。A+ C+E=0. 58+ 0. 12+ 0. 30=1. 00√(这三个成分分解了整个表型方差)。 i Critical appraisal - why ACE beats a bare h2 批判性评价 -- 为何 ACE 优于单纯的 h2 Falconer can hand back a heritability greater than 1 or a negative implied C when rMz > 2rpz (here 0. 70 vs 0. 82, so C is safely positive). ACE makes the shared-environment term explicit: a large C is itself a warning that the equal- environments assumption is doing heavy lifting. 当 rMz>2rpz 时,Falconer 可能回吐一个大于1的遗传力,或一个隐含为负的 C(此处 0. 70 对 0. 82,故C 稳为 正)。ACE 把共享环境项变得显式:一个大的C本身就是一个警告 -- 等环境假设正在承担过重的分量。 POPH90111 . Genetic Epidemiology PRACTICE Q3 - PRACTICE BANK (CONT. ) Association - OR from a 2×2 and the allelic x2 关联分析:从 2x2 表求 OR 与等位基因 x2 Q3-Q4: the genetic-association calculation slots Q3-Q4: 遗传关联计算环节 03 Odds ratio from a family-history 2x2 Q3 由家族史 2×2 表求比值比 Q3 CALCULATE + INTERPRET 3 marks . Association
- 核对:$A+C+E=1$(材料示例强调要 check)。[8]Source: asksia-bible-poph90111-bilingual.pdf02 Decompose the same data with the ACE model Q2 用 ACE 模型分解同一组数据 Q2 CALCULATE + INTERPRET 4 marks . Heritability Using the same correlations (rMz = 0. 70, roz = 0. 41), fit the ACE model: find A, C and E, and say what each means. Confirm A reproduces the Falconer estimate. 用相同的相关系数(rMz = 0. 70,rpz= 0. 41)拟合 ACE 模型:求出 A、C、E,并说明各自含义。验证 A 是否重现 Falconer 估计值。 POPH90111 . Genetic Epidemiology 02 Worked solution - ACE solve ACE FROM CORRELATIONS rMz = A + C rnz = 42A + C - A = 2(rMz-rpZ), C = 2rpz-rMz, E = 1-rMz 1 A (additive genetic). A = 2(0. 70 - 0. 41) = 0. 58 - identical to the Falconer h2 (Falconer's formula is the ACE A term). V A (加性遗传)。A= 2(0. 70-0. 41)= 0. 58 -- 与 Falconer h2 一致(Falconer 公式就是ACE的A项)。√ 2 C (common/shared environment). C = 2(0. 41) - 0. 70 = 0. 82 - 0. 70 = 0. 12 -+ ~12% of variance from environment shared by twins (diet, household). C (共同/共享环境)。C= 2(0. 41)-0. 70= 0. 82 -0. 70= 0. 12→约12% 的方差来自双生子共享的环境(饮食、家 庭)。 3 E (unique environment + error). E = 1 - rMz = 1 - 0. 70 = 0. 30 -+~30% from non-shared exposures and measurement error. E (独有环境+误差)。E =1-rMz = 1-0. 70= 0. 30→ 约30% 来自非共享暴露和测量误差。 4 Check. A + C + E = 0. 58 + 0. 12 + 0. 30 = 1. 00 / (the three components partition the whole phenotypic variance). 核查。A+ C+E=0. 58+ 0. 12+ 0. 30=1. 00√(这三个成分分解了整个表型方差)。 i Critical appraisal - why ACE beats a bare h2 批判性评价 -- 为何 ACE 优于单纯的 h2 Falconer can hand back a heritability greater than 1 or a negative implied C when rMz > 2rpz (here 0. 70 vs 0. 82, so C is safely positive). ACE makes the shared-environment term explicit: a large C is itself a warning that the equal- environments assumption is doing heavy lifting. 当 rMz>2rpz 时,Falconer 可能回吐一个大于1的遗传力,或一个隐含为负的 C(此处 0. 70 对 0. 82,故C 稳为 正)。ACE 把共享环境项变得显式:一个大的C本身就是一个警告 -- 等环境假设正在承担过重的分量。 POPH90111 . Genetic Epidemiology PRACTICE Q3 - PRACTICE BANK (CONT. ) Association - OR from a 2×2 and the allelic x2 关联分析:从 2x2 表求 OR 与等位基因 x2 Q3-Q4: the genetic-association calculation slots Q3-Q4: 遗传关联计算环节 03 Odds ratio from a family-history 2x2 Q3 由家族史 2×2 表求比值比 Q3 CALCULATE + INTERPRET 3 marks . Association
- 高分评价点:
- 等环境假设(equal-environments assumption):如果 MZ 被对待得更相似,环境会伪装成基因 → $h^2$ 被高估;$C$ 大本身就提示这个假设在“扛太多”。[8]Source: asksia-bible-poph90111-bilingual.pdf02 Decompose the same data with the ACE model Q2 用 ACE 模型分解同一组数据 Q2 CALCULATE + INTERPRET 4 marks . Heritability Using the same correlations (rMz = 0. 70, roz = 0. 41), fit the ACE model: find A, C and E, and say what each means. Confirm A reproduces the Falconer estimate. 用相同的相关系数(rMz = 0. 70,rpz= 0. 41)拟合 ACE 模型:求出 A、C、E,并说明各自含义。验证 A 是否重现 Falconer 估计值。 POPH90111 . Genetic Epidemiology 02 Worked solution - ACE solve ACE FROM CORRELATIONS rMz = A + C rnz = 42A + C - A = 2(rMz-rpZ), C = 2rpz-rMz, E = 1-rMz 1 A (additive genetic). A = 2(0. 70 - 0. 41) = 0. 58 - identical to the Falconer h2 (Falconer's formula is the ACE A term). V A (加性遗传)。A= 2(0. 70-0. 41)= 0. 58 -- 与 Falconer h2 一致(Falconer 公式就是ACE的A项)。√ 2 C (common/shared environment). C = 2(0. 41) - 0. 70 = 0. 82 - 0. 70 = 0. 12 -+ ~12% of variance from environment shared by twins (diet, household). C (共同/共享环境)。C= 2(0. 41)-0. 70= 0. 82 -0. 70= 0. 12→约12% 的方差来自双生子共享的环境(饮食、家 庭)。 3 E (unique environment + error). E = 1 - rMz = 1 - 0. 70 = 0. 30 -+~30% from non-shared exposures and measurement error. E (独有环境+误差)。E =1-rMz = 1-0. 70= 0. 30→ 约30% 来自非共享暴露和测量误差。 4 Check. A + C + E = 0. 58 + 0. 12 + 0. 30 = 1. 00 / (the three components partition the whole phenotypic variance). 核查。A+ C+E=0. 58+ 0. 12+ 0. 30=1. 00√(这三个成分分解了整个表型方差)。 i Critical appraisal - why ACE beats a bare h2 批判性评价 -- 为何 ACE 优于单纯的 h2 Falconer can hand back a heritability greater than 1 or a negative implied C when rMz > 2rpz (here 0. 70 vs 0. 82, so C is safely positive). ACE makes the shared-environment term explicit: a large C is itself a warning that the equal- environments assumption is doing heavy lifting. 当 rMz>2rpz 时,Falconer 可能回吐一个大于1的遗传力,或一个隐含为负的 C(此处 0. 70 对 0. 82,故C 稳为 正)。ACE 把共享环境项变得显式:一个大的C本身就是一个警告 -- 等环境假设正在承担过重的分量。 POPH90111 . Genetic Epidemiology PRACTICE Q3 - PRACTICE BANK (CONT. ) Association - OR from a 2×2 and the allelic x2 关联分析:从 2x2 表求 OR 与等位基因 x2 Q3-Q4: the genetic-association calculation slots Q3-Q4: 遗传关联计算环节 03 Odds ratio from a family-history 2x2 Q3 由家族史 2×2 表求比值比 Q3 CALCULATE + INTERPRET 3 marks . Association[30]Source: asksia-cheatsheet-poph90111.pdf8b . Family Designs M1 EXTRAS Case-control-family / case-family: relatives directly interviewed = OR / RR / SMR; relatives of controls are hard to recruit, and the case-family design needs a population registry. Outcome can be analysed as dichotomous (affected y/n), ordinal (number affected) or multinomial - match the analysis to how family history was coded. 9 . Heritability MODULE 2 = proportion of phenotypic variance due to genetic variance. A property of a population in an environment, not an individual. Variance = SD2 (e. g. height SD 9. 29 => variance = 86). VARIANCE PARTITION Vp = Vg + Ve Vg = Va + Vd (+ Vi) Broad-sense H2 = Vg/Vp Narrow-sense h2 = Va/Vp (h2 ≤ H2) Narrow-sense (additive Va) predicts relative resemblance & response to selection; Vd = dominance, Vi = epistatic/interaction variance. Estimate variance separately by sex & zygosity (M>F; DZ>MZ spread). 10 . Twin Studies THE ENGINE MZ share ~100% genes; DZ ~50% (like full sibs). Both share rearing env -> comparing them isolates genetics; twins control for age & shared env. Binary: concordance = proportion of pairs both affected; conc_MZ > conc_DZ => genetic . Continuous: correlate twin-1 vs twin-2. FALCONER'S HERITABILITY h2 = 2 (r_MZ - r_DZ) (continuous) Worked: h2 = 2 (conc_MZ - conc_DZ) (binary) Worked: female height r_MZ=0. 78, r_DZ=0. 46 = h2 = 2(0. 78-0. 46) = 0. 64 - 64% of variance in female height is additively genetic. Interpret: "consistent with, but not proof of, an inherited genetic aetiology. " Genetic variance from heritability: Vg = h2 x Vp. With Vp=86 and h2=0. 64 => Vg=55. Opposite-sex DZ pairs & the twin-co-twin (TRA) design extend the model to probe shared-environment and sex effects. 11 . ACE Model VARIANCE COMPONENTS Split Vp into A additive genetic, C common/shared env, E unique env + error. From twin correlations: ACE FROM R_MZ, R_DZ r_MZ = A + C . r_DZ = %A + C A = 2(r_MZ - r_DZ) (= Falconer) C = 2. r_DZ - r_MZ . E = 1 - r_MZ Worked: r_MZ=0. 78, r_DZ=0. 46 => A=2(0. 78-0. 46)=0. 64; C=2(0. 46)-0. 78=0. 14; E=1-0. 78=0. 22. Check: A+C+E = 0. 64+0. 14+0. 22 = 1. 00 V. So C is the part of resemblance shared equally by both twin types; E (incl. measurement error) is the only thing that makes MZ co-twins differ. Trap - equal- environments assumption: if MZ pairs are treated more alike than DZ, shared env masquerades as genes = h2 overestimated . 4 11b . Classic Twin Model ASSUMPTIONS · Random mating (no assortative mating inflating r_DZ) Concordance/correlation are estimated separately by sex & zygosity because variance differs. Binary worked: conc_MZ=0. 40, conc_DZ=0. 15 => h2(liability) = 2(0. 40-0. 15) = 0. 50 . MZ>DZ concordance is the signal; near-equality (conc_MZ=conc_DZ) => shared environment, not genes, drives the resemblance. Compiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 12 . Liability- Threshold BINARY TRAITS
-
遗传率常见“误解陷阱”(写出来就加分):
-
链条 C:检验/筛查指标(Sensitivity/Specificity/PPV/NPV)——2×2 → 指标 → 评价“患病率/先验概率”
- 题目长相:给你检验结果(+/-) × 真实疾病(D+/D-) 的 2×2,你读出 TP/FP/FN/TN,然后算指标。[11]Source: asksia-bible-poph90111-bilingual.pdfNNS - HIGH FAMILY HISTORY NNS -- 高家族史 ✓ Analogy - the lottery-ticket cost 类比 -- 彩票的代价 NNT is the price of a winning ticket once you are in the right queue (a carrier). NNS is the price including finding the right queue: in the general population you sift ~150 people to find one carrier, so the cost per prevented case balloons. Walk into a queue already full of carriers (high family history) and the sifting cost collapses. NNT 是当你已身处正确队列(即一名携带者)时一张中奖彩票的价格。NNS 则是加上找到正确队列的价格:在一般人群 中你要筛过约150人才能找到一名携带者,于是每避免一例的成本暴涨。走进一个已挤满携带者的队列(高家族史),筛 选成本就崩塌。 POPH90111 . Genetic Epidemiology ! Exam trap - NNS collapses for rare variants 考试陷阱 -- NNS 对罕见变异会失效 Screening efficiency (NNS) is dominated by carrier frequency, not by how good the treatment is. A wonderful treatment (low NNT) is wasted if you must screen thousands to find one carrier. Raising the prior probability of carriage - targeting high-family-history groups - is what makes a genetic screen worthwhile. State NNT and NNS, and always name the population the carrier frequency refers to. 筛查效率(NNS)由携带者频率主导,而非由治疗有多好主导。一个再好的治疗(低 NNT)若你必须筛上千人才找到一 名携带者,也是浪费。提高携带的先验概率 -- 靶向高家族史人群 -- 才是让一项遗传筛查值得做的关键。陈述 NNT 和 NNS,并始终点名携带者频率所指的人群。 POPH90111 . Genetic Epidemiology M8 . TEST METRICS - - MODULE 8 . EVALUATING A TEST STANDARD CANON Sensitivity, specificity, PPV / NPV 灵敏度、特异度、PPV / NPV The 2×2 of test result x true status - and why PPV depends on prevalence 检验结果 × 真实状态的 2×2 表 -- 以及 PPV 为何取决于患病率 Any "evaluate this test" question reduces to a 2×2 of test result (+/-) against true disease status (D+/D-), giving TP, FP, FN, TN. From it come four metrics - two intrinsic to the test (sensitivity, specificity) and two that depend on who you test (PPV, NPV). 任何“评价这项检验”的问题都归结为一张 2×2 表,列出检验结果(+/-)与真实疾病状态 (D+/D-),给出 TP、FP、FN、 TN。由它得出四项指标 -- 两项是检验固有的(灵敏度、特异度),两项取决于你检测的是谁(PPV、NPV)。 D5 machinery used for screening: read TP / FP / FN / TN off the cells. (Here the row split is test +/-, the column split i Disease: YES Disease: NO Exposed / FamHx + a b OR = (a x d) / (b xc) Unexposed / FamHx - C d RR = [a/(a+b)] / [c/(c+d)][16]Source: asksia-cheatsheet-poph90111.pdfCompiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 32 . Test Performance 2×2 METRICS From a 2×2 of test (+/-) x true status (D+/D-): TP, FP, FN, TN. ACCURACY METRICS Sensitivity = TP/(TP+FN) P(+|disease) Specificity = TN/(TN+FP) P(-|healthy) PPV = TP/(TP+FP) . NPV = TN/(TN+FN) Sens & spec are intrinsic to the test; PPV rises & NPV falls as prevalence rises. In low-prevalence screening even a very specific test gives many false positives > low PPV. BAYES FORM PPV = (Sens . Prev) / [Sens . Prev + (1-Spec) (1-Prev) ] 33 . ROC & AUC DISCRIMINATION ROC: plot sensitivity (y) vs 1-specificity (x) as the cut- off moves. AUC = P(a random case scores higher than a random control): 0. 5 = chance (the diagonal), 1. 0 = perfect (top-left corner). In this course AUC appears for polygenic risk scores (e. g. coronary-artery-disease AUC = 0. 81). Moving the threshold trades sensitivity against specificity. Trap: excellent sens/spec is useless for screening if prevalence is tiny (PPV near zero) - always tie performance back to prevalence / carrier frequency. Weak instrument (MR) 33b . PPV Worked PREVALENCE BITES Sens=0. 90, Spec=0. 99. At prevalence 1%: PPV = (0. 9. 0. 01) / [0. 9. 0. 01 + 0. 01. 0. 99] = 0. 009/0. 0189 = 48% Half of positives are false - despite 99% specificity. At prevalence 10% the same test gives PPV = 91%. Lesson: raise the prior (target high-risk) before screening, or most positives are false alarms. Sens/spec are fixed properties of the test; only PPV/NPV move with prevalence - that single fact answers most "evaluate this screening test" questions. NPV is near-perfect when disease is rare (almost all test-negatives really are well), which is little comfort if the few positives are mostly false. 34 . Risk Reclassification PREVENTION PRECISION Adding a genetic factor (e. g. a polygenic score) re- classifies individuals across an actionable risk threshold - some move up (newly flagged high-risk), some down (reassured). The value of genetic screening = how many it correctly reclassifies + NNS/NNT, not "we can test, so we should. " Ties back to the Wilson- Jungner conditions (penetrance understood, accurate test, early actionable stage). A reclassification is only worthwhile if a person moving above the threshold gains an effective action (screening, prophylaxis, risk-reducing surgery). Reclassifying with no actionable consequence adds anxiety without benefit. 34b . Disease Screening USING GENETICS Two distinct goals: (1) screen for genetic risk in the well -> reduce future risk; (2) use a genetic factor to triage disease screening - e. g. start colonoscopy earlier / more often in MMR carriers. Both still demand an accurate test + an effective downstream action + favourable NNS in the targeted group. 35 . Appraisal Checklist L05 . EVERY MODULE LO5 (appraisal) threads through every module. For any study, answer design > measure > strength > limitation > bias: · Design? case-control / cohort / twin / GWAS / MR / family / weighted / case-only / screening · Causation? aggregation/association # cause; MR / replication / dose-response strengthen it
- 四大指标(公式要能秒写):
- Sensitivity(灵敏度):$Sens=\dfrac{TP}{TP+FN}=P(test+|disease)$[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[16]Source: asksia-cheatsheet-poph90111.pdfCompiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 32 . Test Performance 2×2 METRICS From a 2×2 of test (+/-) x true status (D+/D-): TP, FP, FN, TN. ACCURACY METRICS Sensitivity = TP/(TP+FN) P(+|disease) Specificity = TN/(TN+FP) P(-|healthy) PPV = TP/(TP+FP) . NPV = TN/(TN+FN) Sens & spec are intrinsic to the test; PPV rises & NPV falls as prevalence rises. In low-prevalence screening even a very specific test gives many false positives > low PPV. BAYES FORM PPV = (Sens . Prev) / [Sens . Prev + (1-Spec) (1-Prev) ] 33 . ROC & AUC DISCRIMINATION ROC: plot sensitivity (y) vs 1-specificity (x) as the cut- off moves. AUC = P(a random case scores higher than a random control): 0. 5 = chance (the diagonal), 1. 0 = perfect (top-left corner). In this course AUC appears for polygenic risk scores (e. g. coronary-artery-disease AUC = 0. 81). Moving the threshold trades sensitivity against specificity. Trap: excellent sens/spec is useless for screening if prevalence is tiny (PPV near zero) - always tie performance back to prevalence / carrier frequency. Weak instrument (MR) 33b . PPV Worked PREVALENCE BITES Sens=0. 90, Spec=0. 99. At prevalence 1%: PPV = (0. 9. 0. 01) / [0. 9. 0. 01 + 0. 01. 0. 99] = 0. 009/0. 0189 = 48% Half of positives are false - despite 99% specificity. At prevalence 10% the same test gives PPV = 91%. Lesson: raise the prior (target high-risk) before screening, or most positives are false alarms. Sens/spec are fixed properties of the test; only PPV/NPV move with prevalence - that single fact answers most "evaluate this screening test" questions. NPV is near-perfect when disease is rare (almost all test-negatives really are well), which is little comfort if the few positives are mostly false. 34 . Risk Reclassification PREVENTION PRECISION Adding a genetic factor (e. g. a polygenic score) re- classifies individuals across an actionable risk threshold - some move up (newly flagged high-risk), some down (reassured). The value of genetic screening = how many it correctly reclassifies + NNS/NNT, not "we can test, so we should. " Ties back to the Wilson- Jungner conditions (penetrance understood, accurate test, early actionable stage). A reclassification is only worthwhile if a person moving above the threshold gains an effective action (screening, prophylaxis, risk-reducing surgery). Reclassifying with no actionable consequence adds anxiety without benefit. 34b . Disease Screening USING GENETICS Two distinct goals: (1) screen for genetic risk in the well -> reduce future risk; (2) use a genetic factor to triage disease screening - e. g. start colonoscopy earlier / more often in MMR carriers. Both still demand an accurate test + an effective downstream action + favourable NNS in the targeted group. 35 . Appraisal Checklist L05 . EVERY MODULE LO5 (appraisal) threads through every module. For any study, answer design > measure > strength > limitation > bias: · Design? case-control / cohort / twin / GWAS / MR / family / weighted / case-only / screening · Causation? aggregation/association # cause; MR / replication / dose-response strengthen it
- Specificity(特异度):$Spec=\dfrac{TN}{TN+FP}=P(test-|healthy)$[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[16]Source: asksia-cheatsheet-poph90111.pdfCompiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 32 . Test Performance 2×2 METRICS From a 2×2 of test (+/-) x true status (D+/D-): TP, FP, FN, TN. ACCURACY METRICS Sensitivity = TP/(TP+FN) P(+|disease) Specificity = TN/(TN+FP) P(-|healthy) PPV = TP/(TP+FP) . NPV = TN/(TN+FN) Sens & spec are intrinsic to the test; PPV rises & NPV falls as prevalence rises. In low-prevalence screening even a very specific test gives many false positives > low PPV. BAYES FORM PPV = (Sens . Prev) / [Sens . Prev + (1-Spec) (1-Prev) ] 33 . ROC & AUC DISCRIMINATION ROC: plot sensitivity (y) vs 1-specificity (x) as the cut- off moves. AUC = P(a random case scores higher than a random control): 0. 5 = chance (the diagonal), 1. 0 = perfect (top-left corner). In this course AUC appears for polygenic risk scores (e. g. coronary-artery-disease AUC = 0. 81). Moving the threshold trades sensitivity against specificity. Trap: excellent sens/spec is useless for screening if prevalence is tiny (PPV near zero) - always tie performance back to prevalence / carrier frequency. Weak instrument (MR) 33b . PPV Worked PREVALENCE BITES Sens=0. 90, Spec=0. 99. At prevalence 1%: PPV = (0. 9. 0. 01) / [0. 9. 0. 01 + 0. 01. 0. 99] = 0. 009/0. 0189 = 48% Half of positives are false - despite 99% specificity. At prevalence 10% the same test gives PPV = 91%. Lesson: raise the prior (target high-risk) before screening, or most positives are false alarms. Sens/spec are fixed properties of the test; only PPV/NPV move with prevalence - that single fact answers most "evaluate this screening test" questions. NPV is near-perfect when disease is rare (almost all test-negatives really are well), which is little comfort if the few positives are mostly false. 34 . Risk Reclassification PREVENTION PRECISION Adding a genetic factor (e. g. a polygenic score) re- classifies individuals across an actionable risk threshold - some move up (newly flagged high-risk), some down (reassured). The value of genetic screening = how many it correctly reclassifies + NNS/NNT, not "we can test, so we should. " Ties back to the Wilson- Jungner conditions (penetrance understood, accurate test, early actionable stage). A reclassification is only worthwhile if a person moving above the threshold gains an effective action (screening, prophylaxis, risk-reducing surgery). Reclassifying with no actionable consequence adds anxiety without benefit. 34b . Disease Screening USING GENETICS Two distinct goals: (1) screen for genetic risk in the well -> reduce future risk; (2) use a genetic factor to triage disease screening - e. g. start colonoscopy earlier / more often in MMR carriers. Both still demand an accurate test + an effective downstream action + favourable NNS in the targeted group. 35 . Appraisal Checklist L05 . EVERY MODULE LO5 (appraisal) threads through every module. For any study, answer design > measure > strength > limitation > bias: · Design? case-control / cohort / twin / GWAS / MR / family / weighted / case-only / screening · Causation? aggregation/association # cause; MR / replication / dose-response strengthen it
- PPV(阳性预测值):$PPV=\dfrac{TP}{TP+FP}=P(disease|test+)$[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[16]Source: asksia-cheatsheet-poph90111.pdfCompiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 32 . Test Performance 2×2 METRICS From a 2×2 of test (+/-) x true status (D+/D-): TP, FP, FN, TN. ACCURACY METRICS Sensitivity = TP/(TP+FN) P(+|disease) Specificity = TN/(TN+FP) P(-|healthy) PPV = TP/(TP+FP) . NPV = TN/(TN+FN) Sens & spec are intrinsic to the test; PPV rises & NPV falls as prevalence rises. In low-prevalence screening even a very specific test gives many false positives > low PPV. BAYES FORM PPV = (Sens . Prev) / [Sens . Prev + (1-Spec) (1-Prev) ] 33 . ROC & AUC DISCRIMINATION ROC: plot sensitivity (y) vs 1-specificity (x) as the cut- off moves. AUC = P(a random case scores higher than a random control): 0. 5 = chance (the diagonal), 1. 0 = perfect (top-left corner). In this course AUC appears for polygenic risk scores (e. g. coronary-artery-disease AUC = 0. 81). Moving the threshold trades sensitivity against specificity. Trap: excellent sens/spec is useless for screening if prevalence is tiny (PPV near zero) - always tie performance back to prevalence / carrier frequency. Weak instrument (MR) 33b . PPV Worked PREVALENCE BITES Sens=0. 90, Spec=0. 99. At prevalence 1%: PPV = (0. 9. 0. 01) / [0. 9. 0. 01 + 0. 01. 0. 99] = 0. 009/0. 0189 = 48% Half of positives are false - despite 99% specificity. At prevalence 10% the same test gives PPV = 91%. Lesson: raise the prior (target high-risk) before screening, or most positives are false alarms. Sens/spec are fixed properties of the test; only PPV/NPV move with prevalence - that single fact answers most "evaluate this screening test" questions. NPV is near-perfect when disease is rare (almost all test-negatives really are well), which is little comfort if the few positives are mostly false. 34 . Risk Reclassification PREVENTION PRECISION Adding a genetic factor (e. g. a polygenic score) re- classifies individuals across an actionable risk threshold - some move up (newly flagged high-risk), some down (reassured). The value of genetic screening = how many it correctly reclassifies + NNS/NNT, not "we can test, so we should. " Ties back to the Wilson- Jungner conditions (penetrance understood, accurate test, early actionable stage). A reclassification is only worthwhile if a person moving above the threshold gains an effective action (screening, prophylaxis, risk-reducing surgery). Reclassifying with no actionable consequence adds anxiety without benefit. 34b . Disease Screening USING GENETICS Two distinct goals: (1) screen for genetic risk in the well -> reduce future risk; (2) use a genetic factor to triage disease screening - e. g. start colonoscopy earlier / more often in MMR carriers. Both still demand an accurate test + an effective downstream action + favourable NNS in the targeted group. 35 . Appraisal Checklist L05 . EVERY MODULE LO5 (appraisal) threads through every module. For any study, answer design > measure > strength > limitation > bias: · Design? case-control / cohort / twin / GWAS / MR / family / weighted / case-only / screening · Causation? aggregation/association # cause; MR / replication / dose-response strengthen it
- NPV(阴性预测值):$NPV=\dfrac{TN}{TN+FN}$[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[16]Source: asksia-cheatsheet-poph90111.pdfCompiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 32 . Test Performance 2×2 METRICS From a 2×2 of test (+/-) x true status (D+/D-): TP, FP, FN, TN. ACCURACY METRICS Sensitivity = TP/(TP+FN) P(+|disease) Specificity = TN/(TN+FP) P(-|healthy) PPV = TP/(TP+FP) . NPV = TN/(TN+FN) Sens & spec are intrinsic to the test; PPV rises & NPV falls as prevalence rises. In low-prevalence screening even a very specific test gives many false positives > low PPV. BAYES FORM PPV = (Sens . Prev) / [Sens . Prev + (1-Spec) (1-Prev) ] 33 . ROC & AUC DISCRIMINATION ROC: plot sensitivity (y) vs 1-specificity (x) as the cut- off moves. AUC = P(a random case scores higher than a random control): 0. 5 = chance (the diagonal), 1. 0 = perfect (top-left corner). In this course AUC appears for polygenic risk scores (e. g. coronary-artery-disease AUC = 0. 81). Moving the threshold trades sensitivity against specificity. Trap: excellent sens/spec is useless for screening if prevalence is tiny (PPV near zero) - always tie performance back to prevalence / carrier frequency. Weak instrument (MR) 33b . PPV Worked PREVALENCE BITES Sens=0. 90, Spec=0. 99. At prevalence 1%: PPV = (0. 9. 0. 01) / [0. 9. 0. 01 + 0. 01. 0. 99] = 0. 009/0. 0189 = 48% Half of positives are false - despite 99% specificity. At prevalence 10% the same test gives PPV = 91%. Lesson: raise the prior (target high-risk) before screening, or most positives are false alarms. Sens/spec are fixed properties of the test; only PPV/NPV move with prevalence - that single fact answers most "evaluate this screening test" questions. NPV is near-perfect when disease is rare (almost all test-negatives really are well), which is little comfort if the few positives are mostly false. 34 . Risk Reclassification PREVENTION PRECISION Adding a genetic factor (e. g. a polygenic score) re- classifies individuals across an actionable risk threshold - some move up (newly flagged high-risk), some down (reassured). The value of genetic screening = how many it correctly reclassifies + NNS/NNT, not "we can test, so we should. " Ties back to the Wilson- Jungner conditions (penetrance understood, accurate test, early actionable stage). A reclassification is only worthwhile if a person moving above the threshold gains an effective action (screening, prophylaxis, risk-reducing surgery). Reclassifying with no actionable consequence adds anxiety without benefit. 34b . Disease Screening USING GENETICS Two distinct goals: (1) screen for genetic risk in the well -> reduce future risk; (2) use a genetic factor to triage disease screening - e. g. start colonoscopy earlier / more often in MMR carriers. Both still demand an accurate test + an effective downstream action + favourable NNS in the targeted group. 35 . Appraisal Checklist L05 . EVERY MODULE LO5 (appraisal) threads through every module. For any study, answer design > measure > strength > limitation > bias: · Design? case-control / cohort / twin / GWAS / MR / family / weighted / case-only / screening · Causation? aggregation/association # cause; MR / replication / dose-response strengthen it
- 最关键的评价句(几乎逢考必用):
- Sens/Spec 是检验固有属性,但 PPV 依赖患病率(prevalence);低患病率下即使 Spec 很高,PPV 也可能很低,所以一般人群筛查可能“近乎无用”,遗传筛查常要瞄准高先验风险人群。[2]Source: asksia-bible-poph90111-bilingual.pdf! Critical appraisal - a good test can still be useless for screening 批判性评价 -- 再好的检验也可能不适合筛查 Sensitivity and specificity are intrinsic, but PPV depends on prevalence. A test with excellent sensitivity/specificity can be near-useless in a low-prevalence general population (PPV near zero) - exactly why genetic screening targets high-prior-risk groups. The ROC/AUC summarises discrimination across cut-offs; moving the cut-off trades sensitivity against specificity but never changes prevalence. 灵敏度和特异度是内在的,但PPV 取决于患病率。一个灵敏度/特异度极佳的检验,在低患病率的一般人群中可能近乎 无用(PPV 接近零) -- 这正是遗传筛查靶向高先验风险人群的原因。ROC/AUC 概括跨切点的区分度;移动切点是在 灵敏度与特异度之间权衡,但从不改变患病率。 Same habit on every item: calculate, interpret, appraise. The number alone is a fraction of the marks - the interpretation sentence and the design critique (bias, assumption, scale, prevalence) are where POPH90111 assessment lives. Write the appraisal sentence even when the question only asks for the number. 每道题都保持同一习惯:计算、解读、评价。光有数字只占一小部分分数 -- 解读句和设计批判(偏倚、假设、尺度、 患病率)才是 POPH90111 评估的所在。即便题目只问数字,也写出那句评价。 MARKER'S NOTE . POPH90111 . CALCULATE + INTERPRET + APPRAISE POPH90111 . Genetic Epidemiology AskSia Library VISUAL STUDY BIBLE . ASKSIA POP. & GLOBAL HEALTH SEMESTER 1 . 2026 p < 5x10-8 - THE COMPLETE STUDY BIBLE - Genetic Epidemiology 遗传流行病学 FROM A DNA STRAND TO A GWAS SKYLINE - PLUG THE NUMBERS AND JUDGE THE STUDY DESIGN. 墨尔本大学 POPH90111 · 双语视觉精读 · 在线开卷作业占 90% · 遗传率/GWAS/孟德尔随机化/ 筛查:计算 + 研究设计评价工具箱 POPH90111 . UNIVERSITY OF MELBOURNE 中英双语版 · BILINGUAL EDITION 英文主讲,中文随行 一 考试要点与术语保留英文原词 There is no closed-book exam in this subject. Your whole mark is three take-home pieces - a 10% online MCQ and two written assignments worth 90% combined - and all three are built from the same three task types: calculate, interpret, and critically appraise the study design. This book is therefore a calculation + appraisal toolkit: every method shown on real course numbers, every design tagged with its signature bias. Independent study companion. Not affiliated with or endorsed by the University of Melbourne. Corrections: takedowns@asksia. ai PREFACE - HOW TO USE THIS BOOK Plug the numbers, judge the design 代入数字,评判设计 Open-book and take-home - so it tests whether you can DO the method and APPRAISE it 开卷且带回家作答,因此考的是你能否动手做方法并加以评价 This is not a transcript of the lecture slides. It is a self-contained toolkit for every quantity and every study design POPH90111 examines - each concept opens with a diagram, states the formula plainly, works it on the course's own example numbers, gives a memorable analogy, then flags the assignment trap. Because all three assessments are open-book and take-home, the examiner cannot reward what you merely remember. Marks come from three things only: can you calculate it, can you interpret the output, and can you critically appraise the design that produced it? - - 这不是讲座幻灯片的逐字稿,而是一份自足的工具箱,涵盖 POPH90111 所考的每个量和每种研究设计 -- 每个概念都以一幅 图开篇,明白地写出公式,用课程自带的例题数字加以演算,给出一个易记的类比,再标出作业陷阱。由于三项评估全部开卷 且带回家作答,考官无法奖励你单纯记住的东西。分数仅来自三件事:你能否算出它、能否解释输出、能否批判性地评价产生 它的设计?[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[16]Source: asksia-cheatsheet-poph90111.pdfCompiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 32 . Test Performance 2×2 METRICS From a 2×2 of test (+/-) x true status (D+/D-): TP, FP, FN, TN. ACCURACY METRICS Sensitivity = TP/(TP+FN) P(+|disease) Specificity = TN/(TN+FP) P(-|healthy) PPV = TP/(TP+FP) . NPV = TN/(TN+FN) Sens & spec are intrinsic to the test; PPV rises & NPV falls as prevalence rises. In low-prevalence screening even a very specific test gives many false positives > low PPV. BAYES FORM PPV = (Sens . Prev) / [Sens . Prev + (1-Spec) (1-Prev) ] 33 . ROC & AUC DISCRIMINATION ROC: plot sensitivity (y) vs 1-specificity (x) as the cut- off moves. AUC = P(a random case scores higher than a random control): 0. 5 = chance (the diagonal), 1. 0 = perfect (top-left corner). In this course AUC appears for polygenic risk scores (e. g. coronary-artery-disease AUC = 0. 81). Moving the threshold trades sensitivity against specificity. Trap: excellent sens/spec is useless for screening if prevalence is tiny (PPV near zero) - always tie performance back to prevalence / carrier frequency. Weak instrument (MR) 33b . PPV Worked PREVALENCE BITES Sens=0. 90, Spec=0. 99. At prevalence 1%: PPV = (0. 9. 0. 01) / [0. 9. 0. 01 + 0. 01. 0. 99] = 0. 009/0. 0189 = 48% Half of positives are false - despite 99% specificity. At prevalence 10% the same test gives PPV = 91%. Lesson: raise the prior (target high-risk) before screening, or most positives are false alarms. Sens/spec are fixed properties of the test; only PPV/NPV move with prevalence - that single fact answers most "evaluate this screening test" questions. NPV is near-perfect when disease is rare (almost all test-negatives really are well), which is little comfort if the few positives are mostly false. 34 . Risk Reclassification PREVENTION PRECISION Adding a genetic factor (e. g. a polygenic score) re- classifies individuals across an actionable risk threshold - some move up (newly flagged high-risk), some down (reassured). The value of genetic screening = how many it correctly reclassifies + NNS/NNT, not "we can test, so we should. " Ties back to the Wilson- Jungner conditions (penetrance understood, accurate test, early actionable stage). A reclassification is only worthwhile if a person moving above the threshold gains an effective action (screening, prophylaxis, risk-reducing surgery). Reclassifying with no actionable consequence adds anxiety without benefit. 34b . Disease Screening USING GENETICS Two distinct goals: (1) screen for genetic risk in the well -> reduce future risk; (2) use a genetic factor to triage disease screening - e. g. start colonoscopy earlier / more often in MMR carriers. Both still demand an accurate test + an effective downstream action + favourable NNS in the targeted group. 35 . Appraisal Checklist L05 . EVERY MODULE LO5 (appraisal) threads through every module. For any study, answer design > measure > strength > limitation > bias: · Design? case-control / cohort / twin / GWAS / MR / family / weighted / case-only / screening · Causation? aggregation/association # cause; MR / replication / dose-response strengthen it
- PPV 的贝叶斯写法(cheatsheet 给了,写上很加分):
-
链条 D:ROC / AUC(区分度)——“能分开” ≠ “有用”
- 你要会一句话定义:
- ROC:y 轴 Sensitivity,x 轴 $1-$Specificity,阈值移动形成曲线;AUC 表示区分度,$0.5$=随机,$1$=完美。[5]Source: asksia-bible-poph90111-bilingual.pdf按后果选取一个操作切点:一个必须少漏病例的筛查偏好高灵敏度(接受更多假阳性);一个确证检验偏好高特异度。 ✓ Analogy - a bouncer's strictness dial 类比 -- 保安的严格程度旋钮 Think of one dial controlling how strict a bouncer is. Turn it strict and almost nobody gets in - few mistaken admits (low FPR) but you also turn away real guests (low sensitivity): bottom-left. Turn it lenient and everyone gets in - high sensitivity but lots of gate-crashers (high FPR): top-right. The ROC traces every dial setting; AUC says how good the bouncer's judgement is, independent of where you set the dial. 想象一个旋钮,控制一名门卫有多严格。拧到严格,几乎没人能进 -- 误放进的少(低 FPR),但你也拒掉了真客人(低 灵敏度):左下角。拧到宽松,人人都进 -- 高灵敏度但大量闯入者(高FPR):右上角。ROC 描绘每一个旋钮设定; AUC 则说明门卫的判断力有多好,与你把旋钮设在哪里无关。 ! Exam trap - AUC measures discrimination, not usefulness 考试陷阱 -- AUC 衡量的是区分度,而非有用性 A high AUC means the score separates cases from controls, but it says nothing about PPV at a given prevalence or whether acting on the result helps. Two more slips: (1) the x-axis is 1-specificity, not specificity; (2) moving the cut- off trades sensitivity against specificity - you cannot improve both by sliding the threshold, only by getting a better test (a curve closer to the top-left). In this course AUC mainly appears for polygenic risk scores. 高 AUC 意味着该评分能区分病例与对照,但它对给定患病率下的PPV、或据结果行动是否有帮助,一无所知。还有两 个失误:(1)横轴是1-特异度,不是特异度;(2)移动切点是在灵敏度与特异度之间权衡 -- 你无法靠滑动阈值同时改善 两者,只能靠拿到一个更好的检验(一条更贴近左上角的曲线)。本课程中AUC 主要出现在多基因风险评分上。 POPH90111 . Genetic Epidemiology M8 . APPRAISAL ASSIGNMENT GOLD MODULE 8 . PUTTING IT TOGETHER Critically appraising a screening programme 对筛查项目进行批判性评价 A reusable checklist - condition, test, action, numbers, harms, equity - for any "should we screen?" question 一份可复用的清单 -- 疾病、检验、行动、数字、危害、公平性 -- 适用于任何“我们该筛查吗”之问 Assignment and exam questions reward appraisal, not recall. When handed a proposed genetic screen, march through this checklist and back every verdict with a Wilson-Jungner principle or a number (NNT, NNS, PPV at the relevant prevalence). 作业和考试题奖励的是评价而非死记。当拿到一个拟议的遗传筛查方案时,逐条走完这份清单,并用某条 Wilson-Jungner 标 准或某个数字(NNT、NNS、相应患病率下的 PPV)来支撑每一项判断。 Lens Ask What good evidence looks like / red flag 1 - The condition Burden Is it common/serious enough? Important health problem with a recognisable latent stage (W-J 1-2). Natural history Is penetrance & onset understood? Known age-specific penetrance. Red flag: uncertain penetrance / variants of unknown significance. 2 - The test Validity[16]Source: asksia-cheatsheet-poph90111.pdfCompiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 32 . Test Performance 2×2 METRICS From a 2×2 of test (+/-) x true status (D+/D-): TP, FP, FN, TN. ACCURACY METRICS Sensitivity = TP/(TP+FN) P(+|disease) Specificity = TN/(TN+FP) P(-|healthy) PPV = TP/(TP+FP) . NPV = TN/(TN+FN) Sens & spec are intrinsic to the test; PPV rises & NPV falls as prevalence rises. In low-prevalence screening even a very specific test gives many false positives > low PPV. BAYES FORM PPV = (Sens . Prev) / [Sens . Prev + (1-Spec) (1-Prev) ] 33 . ROC & AUC DISCRIMINATION ROC: plot sensitivity (y) vs 1-specificity (x) as the cut- off moves. AUC = P(a random case scores higher than a random control): 0. 5 = chance (the diagonal), 1. 0 = perfect (top-left corner). In this course AUC appears for polygenic risk scores (e. g. coronary-artery-disease AUC = 0. 81). Moving the threshold trades sensitivity against specificity. Trap: excellent sens/spec is useless for screening if prevalence is tiny (PPV near zero) - always tie performance back to prevalence / carrier frequency. Weak instrument (MR) 33b . PPV Worked PREVALENCE BITES Sens=0. 90, Spec=0. 99. At prevalence 1%: PPV = (0. 9. 0. 01) / [0. 9. 0. 01 + 0. 01. 0. 99] = 0. 009/0. 0189 = 48% Half of positives are false - despite 99% specificity. At prevalence 10% the same test gives PPV = 91%. Lesson: raise the prior (target high-risk) before screening, or most positives are false alarms. Sens/spec are fixed properties of the test; only PPV/NPV move with prevalence - that single fact answers most "evaluate this screening test" questions. NPV is near-perfect when disease is rare (almost all test-negatives really are well), which is little comfort if the few positives are mostly false. 34 . Risk Reclassification PREVENTION PRECISION Adding a genetic factor (e. g. a polygenic score) re- classifies individuals across an actionable risk threshold - some move up (newly flagged high-risk), some down (reassured). The value of genetic screening = how many it correctly reclassifies + NNS/NNT, not "we can test, so we should. " Ties back to the Wilson- Jungner conditions (penetrance understood, accurate test, early actionable stage). A reclassification is only worthwhile if a person moving above the threshold gains an effective action (screening, prophylaxis, risk-reducing surgery). Reclassifying with no actionable consequence adds anxiety without benefit. 34b . Disease Screening USING GENETICS Two distinct goals: (1) screen for genetic risk in the well -> reduce future risk; (2) use a genetic factor to triage disease screening - e. g. start colonoscopy earlier / more often in MMR carriers. Both still demand an accurate test + an effective downstream action + favourable NNS in the targeted group. 35 . Appraisal Checklist L05 . EVERY MODULE LO5 (appraisal) threads through every module. For any study, answer design > measure > strength > limitation > bias: · Design? case-control / cohort / twin / GWAS / MR / family / weighted / case-only / screening · Causation? aggregation/association # cause; MR / replication / dose-response strengthen it
- 两条常见陷阱(很爱扣分):
- AUC 衡量 discrimination(区分度),不等于 usefulness(有用/值不值得筛查);有用与否仍要回到患病率/PPV/NNS/NNT。[5]Source: asksia-bible-poph90111-bilingual.pdf按后果选取一个操作切点:一个必须少漏病例的筛查偏好高灵敏度(接受更多假阳性);一个确证检验偏好高特异度。 ✓ Analogy - a bouncer's strictness dial 类比 -- 保安的严格程度旋钮 Think of one dial controlling how strict a bouncer is. Turn it strict and almost nobody gets in - few mistaken admits (low FPR) but you also turn away real guests (low sensitivity): bottom-left. Turn it lenient and everyone gets in - high sensitivity but lots of gate-crashers (high FPR): top-right. The ROC traces every dial setting; AUC says how good the bouncer's judgement is, independent of where you set the dial. 想象一个旋钮,控制一名门卫有多严格。拧到严格,几乎没人能进 -- 误放进的少(低 FPR),但你也拒掉了真客人(低 灵敏度):左下角。拧到宽松,人人都进 -- 高灵敏度但大量闯入者(高FPR):右上角。ROC 描绘每一个旋钮设定; AUC 则说明门卫的判断力有多好,与你把旋钮设在哪里无关。 ! Exam trap - AUC measures discrimination, not usefulness 考试陷阱 -- AUC 衡量的是区分度,而非有用性 A high AUC means the score separates cases from controls, but it says nothing about PPV at a given prevalence or whether acting on the result helps. Two more slips: (1) the x-axis is 1-specificity, not specificity; (2) moving the cut- off trades sensitivity against specificity - you cannot improve both by sliding the threshold, only by getting a better test (a curve closer to the top-left). In this course AUC mainly appears for polygenic risk scores. 高 AUC 意味着该评分能区分病例与对照,但它对给定患病率下的PPV、或据结果行动是否有帮助,一无所知。还有两 个失误:(1)横轴是1-特异度,不是特异度;(2)移动切点是在灵敏度与特异度之间权衡 -- 你无法靠滑动阈值同时改善 两者,只能靠拿到一个更好的检验(一条更贴近左上角的曲线)。本课程中AUC 主要出现在多基因风险评分上。 POPH90111 . Genetic Epidemiology M8 . APPRAISAL ASSIGNMENT GOLD MODULE 8 . PUTTING IT TOGETHER Critically appraising a screening programme 对筛查项目进行批判性评价 A reusable checklist - condition, test, action, numbers, harms, equity - for any "should we screen?" question 一份可复用的清单 -- 疾病、检验、行动、数字、危害、公平性 -- 适用于任何“我们该筛查吗”之问 Assignment and exam questions reward appraisal, not recall. When handed a proposed genetic screen, march through this checklist and back every verdict with a Wilson-Jungner principle or a number (NNT, NNS, PPV at the relevant prevalence). 作业和考试题奖励的是评价而非死记。当拿到一个拟议的遗传筛查方案时,逐条走完这份清单,并用某条 Wilson-Jungner 标 准或某个数字(NNT、NNS、相应患病率下的 PPV)来支撑每一项判断。 Lens Ask What good evidence looks like / red flag 1 - The condition Burden Is it common/serious enough? Important health problem with a recognisable latent stage (W-J 1-2). Natural history Is penetrance & onset understood? Known age-specific penetrance. Red flag: uncertain penetrance / variants of unknown significance. 2 - The test Validity
- x 轴是 $1-$specificity,不是 specificity;移动阈值是 Sens 与 Spec 的权衡,不能“两者都变好”,除非检验本身更好(曲线更靠左上)。[5]Source: asksia-bible-poph90111-bilingual.pdf按后果选取一个操作切点:一个必须少漏病例的筛查偏好高灵敏度(接受更多假阳性);一个确证检验偏好高特异度。 ✓ Analogy - a bouncer's strictness dial 类比 -- 保安的严格程度旋钮 Think of one dial controlling how strict a bouncer is. Turn it strict and almost nobody gets in - few mistaken admits (low FPR) but you also turn away real guests (low sensitivity): bottom-left. Turn it lenient and everyone gets in - high sensitivity but lots of gate-crashers (high FPR): top-right. The ROC traces every dial setting; AUC says how good the bouncer's judgement is, independent of where you set the dial. 想象一个旋钮,控制一名门卫有多严格。拧到严格,几乎没人能进 -- 误放进的少(低 FPR),但你也拒掉了真客人(低 灵敏度):左下角。拧到宽松,人人都进 -- 高灵敏度但大量闯入者(高FPR):右上角。ROC 描绘每一个旋钮设定; AUC 则说明门卫的判断力有多好,与你把旋钮设在哪里无关。 ! Exam trap - AUC measures discrimination, not usefulness 考试陷阱 -- AUC 衡量的是区分度,而非有用性 A high AUC means the score separates cases from controls, but it says nothing about PPV at a given prevalence or whether acting on the result helps. Two more slips: (1) the x-axis is 1-specificity, not specificity; (2) moving the cut- off trades sensitivity against specificity - you cannot improve both by sliding the threshold, only by getting a better test (a curve closer to the top-left). In this course AUC mainly appears for polygenic risk scores. 高 AUC 意味着该评分能区分病例与对照,但它对给定患病率下的PPV、或据结果行动是否有帮助,一无所知。还有两 个失误:(1)横轴是1-特异度,不是特异度;(2)移动切点是在灵敏度与特异度之间权衡 -- 你无法靠滑动阈值同时改善 两者,只能靠拿到一个更好的检验(一条更贴近左上角的曲线)。本课程中AUC 主要出现在多基因风险评分上。 POPH90111 . Genetic Epidemiology M8 . APPRAISAL ASSIGNMENT GOLD MODULE 8 . PUTTING IT TOGETHER Critically appraising a screening programme 对筛查项目进行批判性评价 A reusable checklist - condition, test, action, numbers, harms, equity - for any "should we screen?" question 一份可复用的清单 -- 疾病、检验、行动、数字、危害、公平性 -- 适用于任何“我们该筛查吗”之问 Assignment and exam questions reward appraisal, not recall. When handed a proposed genetic screen, march through this checklist and back every verdict with a Wilson-Jungner principle or a number (NNT, NNS, PPV at the relevant prevalence). 作业和考试题奖励的是评价而非死记。当拿到一个拟议的遗传筛查方案时,逐条走完这份清单,并用某条 Wilson-Jungner 标 准或某个数字(NNT、NNS、相应患病率下的 PPV)来支撑每一项判断。 Lens Ask What good evidence looks like / red flag 1 - The condition Burden Is it common/serious enough? Important health problem with a recognisable latent stage (W-J 1-2). Natural history Is penetrance & onset understood? Known age-specific penetrance. Red flag: uncertain penetrance / variants of unknown significance. 2 - The test Validity
-
链条 E:NNT / NNS(筛查是否“划算”的数字证据)——ARR → NNT → NNS → “靶向高风险”
- 核心定义与公式:
- ARR(absolute risk reduction):绝对风险下降;材料给了用于遗传筛查的写法:
- $ARR = \text{carrier risk}\times \text{(proportion risk reduced by treatment)}$[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[20]Source: asksia-cheatsheet-poph90111.pdfSIA > Show the working: in this subject the marks live in the setup and the interpretation, not the final digit. Always write the formula, the substitution, then one sentence of meaning. asksia. ai/cheatsheet/ unimelb-poph90111 . side 2/2 AskSia CHEATSHEET SERIES Disease screening = a systematic test to find asymptomatic disease/precursors in people not seeking care. Genetic screening = find risk-raising variants in asymptomatic people so risk can be reduced/prevented; can be population-wide but is usually targeted to high-prior-risk groups (e. g. strong family history). Course twist: a genetic test can be "once-and-for-all" (your germline doesn't change), unlike repeated disease screening. Two uses: screen for genetic risk, or use a genetic factor to screen for disease. 30 . Wilson-Jungner WHO 1968 The screening-evaluation checklist (the course adapts all 10 to genetics): · Facilities for diagnosis & treatment exist Trap: "we can test" # "we should screen. " A test only helps if knowing the result reduces disease/disability/death and benefits beat harms (psychological, social, insurance, variants of unknown significance, false positives). Example genes the course uses: BRCA1/2, the mismatch-repair (MMR) genes, HTT - note HTT (Huntington) has no risk-reduction, which weakens the case for screening. 31 . NNT & NNS QUANTIFY BENEFIT ARR = carrier risk x proportion risk reduced NNT = 1 / ARR NNS = NNT / carrier frequency Worked (BRCA1/2): carrier breast-cancer risk to 70 = 0. 4; tamoxifen cuts risk 50% => ARR = 0. 4×0. 5 = 0. 2 => NNT = 1/0. 2 = 5 carriers treated to prevent one cancer. Carrier freq 0. 0067 (1 in 150) = NNS = 5/0. 0067 = 746 screened per cancer prevented. High-FH group (carrier freq 0. 25) = NNS = 5/0. 25 = 20 - far more efficient => justifies targeted screening. CARRIER FREQ NNT NNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH)
- NNT:$$NNT=\frac{1}{ARR}$$(治疗多少“携带者”可避免 1 个病例)[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[20]Source: asksia-cheatsheet-poph90111.pdfSIA > Show the working: in this subject the marks live in the setup and the interpretation, not the final digit. Always write the formula, the substitution, then one sentence of meaning. asksia. ai/cheatsheet/ unimelb-poph90111 . side 2/2 AskSia CHEATSHEET SERIES Disease screening = a systematic test to find asymptomatic disease/precursors in people not seeking care. Genetic screening = find risk-raising variants in asymptomatic people so risk can be reduced/prevented; can be population-wide but is usually targeted to high-prior-risk groups (e. g. strong family history). Course twist: a genetic test can be "once-and-for-all" (your germline doesn't change), unlike repeated disease screening. Two uses: screen for genetic risk, or use a genetic factor to screen for disease. 30 . Wilson-Jungner WHO 1968 The screening-evaluation checklist (the course adapts all 10 to genetics): · Facilities for diagnosis & treatment exist Trap: "we can test" # "we should screen. " A test only helps if knowing the result reduces disease/disability/death and benefits beat harms (psychological, social, insurance, variants of unknown significance, false positives). Example genes the course uses: BRCA1/2, the mismatch-repair (MMR) genes, HTT - note HTT (Huntington) has no risk-reduction, which weakens the case for screening. 31 . NNT & NNS QUANTIFY BENEFIT ARR = carrier risk x proportion risk reduced NNT = 1 / ARR NNS = NNT / carrier frequency Worked (BRCA1/2): carrier breast-cancer risk to 70 = 0. 4; tamoxifen cuts risk 50% => ARR = 0. 4×0. 5 = 0. 2 => NNT = 1/0. 2 = 5 carriers treated to prevent one cancer. Carrier freq 0. 0067 (1 in 150) = NNS = 5/0. 0067 = 746 screened per cancer prevented. High-FH group (carrier freq 0. 25) = NNS = 5/0. 25 = 20 - far more efficient => justifies targeted screening. CARRIER FREQ NNT NNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH)
- NNS:$$NNS=\frac{NNT}{\text{carrier frequency}}$$(要筛多少人才能避免 1 个病例)[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[20]Source: asksia-cheatsheet-poph90111.pdfSIA > Show the working: in this subject the marks live in the setup and the interpretation, not the final digit. Always write the formula, the substitution, then one sentence of meaning. asksia. ai/cheatsheet/ unimelb-poph90111 . side 2/2 AskSia CHEATSHEET SERIES Disease screening = a systematic test to find asymptomatic disease/precursors in people not seeking care. Genetic screening = find risk-raising variants in asymptomatic people so risk can be reduced/prevented; can be population-wide but is usually targeted to high-prior-risk groups (e. g. strong family history). Course twist: a genetic test can be "once-and-for-all" (your germline doesn't change), unlike repeated disease screening. Two uses: screen for genetic risk, or use a genetic factor to screen for disease. 30 . Wilson-Jungner WHO 1968 The screening-evaluation checklist (the course adapts all 10 to genetics): · Facilities for diagnosis & treatment exist Trap: "we can test" # "we should screen. " A test only helps if knowing the result reduces disease/disability/death and benefits beat harms (psychological, social, insurance, variants of unknown significance, false positives). Example genes the course uses: BRCA1/2, the mismatch-repair (MMR) genes, HTT - note HTT (Huntington) has no risk-reduction, which weakens the case for screening. 31 . NNT & NNS QUANTIFY BENEFIT ARR = carrier risk x proportion risk reduced NNT = 1 / ARR NNS = NNT / carrier frequency Worked (BRCA1/2): carrier breast-cancer risk to 70 = 0. 4; tamoxifen cuts risk 50% => ARR = 0. 4×0. 5 = 0. 2 => NNT = 1/0. 2 = 5 carriers treated to prevent one cancer. Carrier freq 0. 0067 (1 in 150) = NNS = 5/0. 0067 = 746 screened per cancer prevented. High-FH group (carrier freq 0. 25) = NNS = 5/0. 25 = 20 - far more efficient => justifies targeted screening. CARRIER FREQ NNT NNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH)
- ARR(absolute risk reduction):绝对风险下降;材料给了用于遗传筛查的写法:
- 最重要的评价句(写了就对):
- NNS 主要由携带者频率(carrier frequency)主导;罕见变异在一般人群 NNS 会“崩塌”,所以遗传筛查往往必须靶向高家族史/高先验人群。[11]Source: asksia-bible-poph90111-bilingual.pdfNNS - HIGH FAMILY HISTORY NNS -- 高家族史 ✓ Analogy - the lottery-ticket cost 类比 -- 彩票的代价 NNT is the price of a winning ticket once you are in the right queue (a carrier). NNS is the price including finding the right queue: in the general population you sift ~150 people to find one carrier, so the cost per prevented case balloons. Walk into a queue already full of carriers (high family history) and the sifting cost collapses. NNT 是当你已身处正确队列(即一名携带者)时一张中奖彩票的价格。NNS 则是加上找到正确队列的价格:在一般人群 中你要筛过约150人才能找到一名携带者,于是每避免一例的成本暴涨。走进一个已挤满携带者的队列(高家族史),筛 选成本就崩塌。 POPH90111 . Genetic Epidemiology ! Exam trap - NNS collapses for rare variants 考试陷阱 -- NNS 对罕见变异会失效 Screening efficiency (NNS) is dominated by carrier frequency, not by how good the treatment is. A wonderful treatment (low NNT) is wasted if you must screen thousands to find one carrier. Raising the prior probability of carriage - targeting high-family-history groups - is what makes a genetic screen worthwhile. State NNT and NNS, and always name the population the carrier frequency refers to. 筛查效率(NNS)由携带者频率主导,而非由治疗有多好主导。一个再好的治疗(低 NNT)若你必须筛上千人才找到一 名携带者,也是浪费。提高携带的先验概率 -- 靶向高家族史人群 -- 才是让一项遗传筛查值得做的关键。陈述 NNT 和 NNS,并始终点名携带者频率所指的人群。 POPH90111 . Genetic Epidemiology M8 . TEST METRICS - - MODULE 8 . EVALUATING A TEST STANDARD CANON Sensitivity, specificity, PPV / NPV 灵敏度、特异度、PPV / NPV The 2×2 of test result x true status - and why PPV depends on prevalence 检验结果 × 真实状态的 2×2 表 -- 以及 PPV 为何取决于患病率 Any "evaluate this test" question reduces to a 2×2 of test result (+/-) against true disease status (D+/D-), giving TP, FP, FN, TN. From it come four metrics - two intrinsic to the test (sensitivity, specificity) and two that depend on who you test (PPV, NPV). 任何“评价这项检验”的问题都归结为一张 2×2 表,列出检验结果(+/-)与真实疾病状态 (D+/D-),给出 TP、FP、FN、 TN。由它得出四项指标 -- 两项是检验固有的(灵敏度、特异度),两项取决于你检测的是谁(PPV、NPV)。 D5 machinery used for screening: read TP / FP / FN / TN off the cells. (Here the row split is test +/-, the column split i Disease: YES Disease: NO Exposed / FamHx + a b OR = (a x d) / (b xc) Unexposed / FamHx - C d RR = [a/(a+b)] / [c/(c+d)][20]Source: asksia-cheatsheet-poph90111.pdfSIA > Show the working: in this subject the marks live in the setup and the interpretation, not the final digit. Always write the formula, the substitution, then one sentence of meaning. asksia. ai/cheatsheet/ unimelb-poph90111 . side 2/2 AskSia CHEATSHEET SERIES Disease screening = a systematic test to find asymptomatic disease/precursors in people not seeking care. Genetic screening = find risk-raising variants in asymptomatic people so risk can be reduced/prevented; can be population-wide but is usually targeted to high-prior-risk groups (e. g. strong family history). Course twist: a genetic test can be "once-and-for-all" (your germline doesn't change), unlike repeated disease screening. Two uses: screen for genetic risk, or use a genetic factor to screen for disease. 30 . Wilson-Jungner WHO 1968 The screening-evaluation checklist (the course adapts all 10 to genetics): · Facilities for diagnosis & treatment exist Trap: "we can test" # "we should screen. " A test only helps if knowing the result reduces disease/disability/death and benefits beat harms (psychological, social, insurance, variants of unknown significance, false positives). Example genes the course uses: BRCA1/2, the mismatch-repair (MMR) genes, HTT - note HTT (Huntington) has no risk-reduction, which weakens the case for screening. 31 . NNT & NNS QUANTIFY BENEFIT ARR = carrier risk x proportion risk reduced NNT = 1 / ARR NNS = NNT / carrier frequency Worked (BRCA1/2): carrier breast-cancer risk to 70 = 0. 4; tamoxifen cuts risk 50% => ARR = 0. 4×0. 5 = 0. 2 => NNT = 1/0. 2 = 5 carriers treated to prevent one cancer. Carrier freq 0. 0067 (1 in 150) = NNS = 5/0. 0067 = 746 screened per cancer prevented. High-FH group (carrier freq 0. 25) = NNS = 5/0. 25 = 20 - far more efficient => justifies targeted screening. CARRIER FREQ NNT NNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH)[21]Source: asksia-cheatsheet-poph90111.pdfNNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH) Trap: NNS collapses for rare variants in the general population - raising the prior probability of carriage (targeting high-FH groups) is what makes genetic screening worthwhile. 31b . Harms Ledger BENEFITS VS COSTS Screening is only justified when benefit beats harm. Weigh against the NNT/NNS benefit: · False positives -> anxiety, over-treatment · Psychosocial -> family, identity, fatalism . Insurance / legal / discrimination risk · Opportunity cost of the screening budget Calculation & method reference . check the current subject guide . @ 2026 good Luck. calculate, then appraise. SIDE 2/2 DISCOVER & USE . Mendelian randomisation . Penetrance & ascertainment . Gene-environment interaction . Screening (NNT/NNS, sens/spec/PPV, ROC) . Critical appraisal MODULE 4 POPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 1 OF 2 Foundations -> heritability -> association SIDE 1/2 UNDERSTAND . Genetics primer . Hardy-Weinberg . LD . Familial aggregation (OR/RR/SMR/A) . Heritability (Falconer, ACE, liability) . Association & GWAS 0 . How To Use This READ FIRST * This subject is a pipeline: UNDERSTAND (is there a genetic role? - aggregation, heritability) -> DISCOVER (which variants? - LD, GWAS, MR) -> CHARACTERISE (how risky? - penetrance, modifiers, GxE) > USE (screening). Side 1 = understand & discover; side 2 = characterise & use.
- 作业常见“危害清单”(写 1–2 条就能拿 appraisal 分):
- 假阳性→焦虑/过度治疗;心理社会影响、保险/歧视风险;机会成本等。[21]Source: asksia-cheatsheet-poph90111.pdfNNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH) Trap: NNS collapses for rare variants in the general population - raising the prior probability of carriage (targeting high-FH groups) is what makes genetic screening worthwhile. 31b . Harms Ledger BENEFITS VS COSTS Screening is only justified when benefit beats harm. Weigh against the NNT/NNS benefit: · False positives -> anxiety, over-treatment · Psychosocial -> family, identity, fatalism . Insurance / legal / discrimination risk · Opportunity cost of the screening budget Calculation & method reference . check the current subject guide . @ 2026 good Luck. calculate, then appraise. SIDE 2/2 DISCOVER & USE . Mendelian randomisation . Penetrance & ascertainment . Gene-environment interaction . Screening (NNT/NNS, sens/spec/PPV, ROC) . Critical appraisal MODULE 4 POPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 1 OF 2 Foundations -> heritability -> association SIDE 1/2 UNDERSTAND . Genetics primer . Hardy-Weinberg . LD . Familial aggregation (OR/RR/SMR/A) . Heritability (Falconer, ACE, liability) . Association & GWAS 0 . How To Use This READ FIRST * This subject is a pipeline: UNDERSTAND (is there a genetic role? - aggregation, heritability) -> DISCOVER (which variants? - LD, GWAS, MR) -> CHARACTERISE (how risky? - penetrance, modifiers, GxE) > USE (screening). Side 1 = understand & discover; side 2 = characterise & use.[20]Source: asksia-cheatsheet-poph90111.pdfSIA > Show the working: in this subject the marks live in the setup and the interpretation, not the final digit. Always write the formula, the substitution, then one sentence of meaning. asksia. ai/cheatsheet/ unimelb-poph90111 . side 2/2 AskSia CHEATSHEET SERIES Disease screening = a systematic test to find asymptomatic disease/precursors in people not seeking care. Genetic screening = find risk-raising variants in asymptomatic people so risk can be reduced/prevented; can be population-wide but is usually targeted to high-prior-risk groups (e. g. strong family history). Course twist: a genetic test can be "once-and-for-all" (your germline doesn't change), unlike repeated disease screening. Two uses: screen for genetic risk, or use a genetic factor to screen for disease. 30 . Wilson-Jungner WHO 1968 The screening-evaluation checklist (the course adapts all 10 to genetics): · Facilities for diagnosis & treatment exist Trap: "we can test" # "we should screen. " A test only helps if knowing the result reduces disease/disability/death and benefits beat harms (psychological, social, insurance, variants of unknown significance, false positives). Example genes the course uses: BRCA1/2, the mismatch-repair (MMR) genes, HTT - note HTT (Huntington) has no risk-reduction, which weakens the case for screening. 31 . NNT & NNS QUANTIFY BENEFIT ARR = carrier risk x proportion risk reduced NNT = 1 / ARR NNS = NNT / carrier frequency Worked (BRCA1/2): carrier breast-cancer risk to 70 = 0. 4; tamoxifen cuts risk 50% => ARR = 0. 4×0. 5 = 0. 2 => NNT = 1/0. 2 = 5 carriers treated to prevent one cancer. Carrier freq 0. 0067 (1 in 150) = NNS = 5/0. 0067 = 746 screened per cancer prevented. High-FH group (carrier freq 0. 25) = NNS = 5/0. 25 = 20 - far more efficient => justifies targeted screening. CARRIER FREQ NNT NNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH)
-
链条 F:关键“通用评价框架”(LO5)——设计 → 度量 → 优势 → 局限 → 偏倚方向
- 万能答题骨架(每道题都能套):
- 先写:Design 是什么?(case-control/cohort/twin/GWAS/MR/screening…)
- 再写:Measure 用什么?(OR/RR/SMR/h2/AUC/NNT…)
- 再写:Strength 1 条 + Limitation/Bias 1 条
- 最后加分点:说清偏倚方向:toward the null 还是 away from the null(材料明确说“方向才给分”)。[19]Source: asksia-cheatsheet-poph90111.pdf35 . Appraisal Checklist L05 . EVERY MODULE LO5 (appraisal) threads through every module. For any study, answer design > measure > strength > limitation > bias: · Design? case-control / cohort / twin / GWAS / MR / family / weighted / case-only / screening · Causation? aggregation/association # cause; MR / replication / dose-response strengthen it BIAS DIRECTION Non-differential misclass. toward null Differential recall (cases) away from null Clinic ascertainment overestimates penetrance toward observational SIA > Marks come from naming the direction of each bias (toward vs away from the null), not just listing it. State the rival explanation, then how the design does (or fails to) rule it out. 36 . Interpretation USE THESE PHRASINGS Hooks · Aggregation / MZ>DZ = "evidence for, not proof of, inherited aetiology" · MR: relevance, independence, exclusion; chief threat = pleiotropy; conclusions likely · Interaction is scale-dependent - state additive vs multiplicative
-
2)你“必须背熟”的核心术语/定义清单(按模块流水线)
-
UNDERSTAND(M1–M2)
-
Familial aggregation(家族聚集性):家族内疾病聚集超过偶然;证据但非证明,共享环境是竞争解释。[7]Source: asksia-bible-poph90111-bilingual.pdfEN + 中文 Bilingual glossary - every examinable term 双语术语表:每个可考术语 English term . X . one-line meaning - grouped along the UNDERSTAND -> DISCOVER- CHARACTERISEUSE pipeline 英文术语 · 中文 · 一句话释义 -- 沿 理解→ 发现→刻画→ 应用 的流程分组 A fast reference for the vocabulary POPH90111 actually examines 〔本课程实际考查的术语速查〕. About 50 terms across the eight-module spine, each with a one-line English meaning; the middle column is reserved for the standard Chinese translation added in the bilingual pass. Cover the right column and recite the meaning from the term - then flip and recall the term from the meaning. POPH90111 实际考查术语的速查表 〔本课程实际考查的术语速查〕。横跨八个模块主干约50个术语,每个配一行英文释 义;中间一列留给双语版加入的标准中文译名。遮住右列,由术语复述释义 -- 再翻过来,由释义回忆术语。 Term (EN) 中文 One-line meaning Foundations - genetics vocabulary Locus / allele — A fixed position on a chromosome / the specific DNA base(s) present there; the minor allele is the rarer one. Genotype — The pair of alleles a person carries at a locus (TT, TC, CC); homozygous = identical, heterozygous = different. SNP — Single-nucleotide polymorphism - a common single-base variant (>1% frequency); the workhorse marker of GWAS. Polymorphism vs mutation — Polymorphism = common, little/no deleterious effect; (pathogenic) mutation = major deleterious effect - substantial risk. Germline vs somatic — Germline = inherited, in every cell - relevant to familial risk; somatic = acquired, in descendant cells only (e. g. a tumour clone). Allele frequency p — Proportion of all alleles at a locus that are the allele of interest: p = [2. n(AA)+n(Aa)]/(2N); per- chromosome (denominator 2N). Carrier frequency — Proportion of people carrying ≥1 risk allele = p2 + 2pq = 1 - q2; per-person (denominator N). Sets exposure prevalence & power. Hardy-Weinberg equilibrium — In a large randomly-mating population with no selection/migration/mutation: AA=p2, Aa=2pq, aa=q2 (p2+2pq+q2=1), constant across generations. HWE x2 test — Goodness-of-fit x2=(O-E)2/E over 3 genotypes, df=1; deviation in controls flags genotyping error / stratification. Module 1 - Familial aggregation (UNDERSTAND) Familial aggregation — Disease clusters in families more than chance; evidence for, not proof of, a genetic cause (shared environment is the rival).[1]Source: asksia-bible-poph90111-bilingual.pdfA 1 . LEARN 1· 学习 You haven't watched the module yet. Read a chapter top to bottom. Every concept is an AHA-unit: diagram - explainer + method-in-steps - analogy + trap. The figures are original schematics of the standard genetic-epi canon - learn the idea cold before the numbers. 你还没看过该模块。从头到尾读 一章。每个概念都是一个 AHA 单元:图示→讲解→ 分步方法 →类比→陷阱。图均为标准遗 传流行病学经典内容的原创示意 图 -- 先把思路彻底吃透,再上 数字。 B 2 . DRILL 2 · 演练 You've watched the module and done the Q&A. Cover the worked steps and re-derive each OR, RR, SMR, h2, penetrance and NNT by hand. Then say aloud the interpretation and the one bias that would change your conclusion. 你已看过该模块并做完 Q&A。 遮住已演算的步骤,亲手重新推 导每一个 OR、RR、SMR、h2、 外显率和 NNT。然后大声说出解 读,以及那一个会改变你结论的 偏倚。 C 3 . ASSIGNMENT 3 · 作业 The assignment is open. Use the design-appraisal grids and worked-formula boxes as your lookup. Every question is calculate - interpret -+ appraise; the Stata . do file (A1 Q1) does the arithmetic, but the interpretation and appraisal marks are yours to write. 作业是开卷的。把设计评价表格 和公式演算框当作你的查阅工 具。每道题都是计算→解读→ 评价;Stata 的 . do 文件(A1 Q1)负责算术,但解读和评价的 分数要你自己写出来。 POPH90111 . Genetic Epidemiology ! The single most important thing to understand about POPH90111 关于 POPH90111 最重要的一点认识 This subject is a logical pipeline, not a pile of topics: UNDERSTAND (is there a genetic role? - familial aggregation M1, heritability M2) - DISCOVER (which variants? - association/GWAS M3) - CHARACTERISE (how risky? - penetrance M5, modifiers M6, G×E M7) - USE IN PRACTICE (causal exposures via MR M4, screening M8). One mantra recurs at every stage: familial aggregation / high MZ-vs-DZ correlation is 'evidence for, but not proof of, an inherited genetic aetiology' - shared environment is always the rival explanation. Write that sentence whenever you interpret aggregation or heritability. 本科目是一条逻辑流水线,而非一堆零散主题:理解(是否存在遗传作用?→家族聚集性 M1、遗传力 M2)→发现 (哪些变异?→关联/GWAS M3)→刻画(风险有多大?→外显率M5、修饰因子 M6、GxE M7)→实践应用(经 MR 评估因果暴露 M4、筛查 M8)。一句口诀贯穿每一阶段:家族聚集性/高的 MZ-对-DZ 相关是“支持遗传性病因的 证据,但非证明” -- 共享环境永远是竞争性解释。每当你解读聚集性或遗传力时都写下这句话。 i How this book was built - and the two-layer rule 本书如何编成 -- 以及两层规则 Standard genetic and statistical canon (Hardy-Weinberg, LD measures, Falconer's h2, the ACE model, Manhattan/QQ plots, MR's three IV assumptions, NNT/NNS) is stated plainly - it is universal, non-copyrightable science, and every diagram here is our own clean drawing. The course's specific framing and its lecturer's example numbers are paraphrased and re-checked, never copied from slides. The subject runs on A/Prof Aung Ko Win's modules with Prof Mark Jenkins' lectures; verify dates and weights against your own Canvas (the LMS), as cohort details can shift. 标准的遗传学与统计学经典(Hardy-Weinberg、LD 测度、Falconer h2、ACE 模型、曼哈顿图/QQ 图、MR 三个 IV 假设、NNT/NNS)均平实陈述 -- 它们是普适的、不受版权保护的科学,此处每张图都是我们自己干净的绘制。本课程 特定的表述及其讲师的示例数字均为转述并重新核对,绝不照搬幻灯片。本科目以 A/Prof Aung Ko Win 的模块和 Prof Mark Jenkins 的讲课为基础;请对照你自己的 Canvas(即 LMS)核实日期与权重,因为各届的细节可能变动。 POPH90111 . Genetic Epidemiology THE BLUEPRINT - THE ASSESSMENT BLUEPRINT 90% IN TWO ASSIGNMENTS Three take-home pieces, no exam 三份带回家作业,无考试
-
Heritability(遗传率):表型方差中归因于遗传方差的比例;人群+环境特异,非个体属性。[10]Source: asksia-bible-poph90111-bilingual.pdf★ What the assignment / exam asks here 此处作业/考试考什么 (1) Partition variance: given Va, Vd, Ve compute narrow-sense h2 = Va/Vp and broad-sense H2 = Vg/Vp. (2) From twin correlations apply Falconer: h2 = 2(rMZ - rDZ) and interpret. (3) Solve the ACE system for A, C, E from rMZ and rDZ. (4) Convert SD - variance (Vp = SD2) and explain estimating separately by sex and zygosity. (5) Explain the liability-threshold model for a binary trait. (6) The marks-rich part: critically appraise - equal-environments assumption, 'evidence for not proof of', missing heritability, what heritability does not mean. (1)分解方差:给定 Va、Vd、Ve,计算狭义 h2 = Va/Vp 和广义 H2 = Vg/Vp。(2)由双生子相关应用 Falconer: h2= 2(rMZ - rDZ) 并解读。(3)由 rMZ 和 rDZ 求解 ACE 方程组得 A、C、E。(4)把 SD→方差(Vp= SD2)并解释为何 要按性别和合子性分别估计。(5)解释二分性状的易感性阈值模型。(6)富含分数的部分:严格评价一 -等环境假设、“是 证据而非证明”、缺失遗传力,以及遗传力不意味着什么。 2. 1 Heritability is a proportion of variance 2. 1 遗传率是方差的一个比例 Start from the definition the course insists on. Heritability is the proportion of the phenotypic variance in a population that is attributable to genetic variance. The key word is variance - we are explaining differences between people, not the trait value of any one person. Variance is the spread, the squared standard deviation: V = SD2. 从课程坚持采用的定义出发。遗传率是一个人群中表型方差里可归因于遗传方差的比例。关键词是方差 -- 我们解释的是人与 人之间的差异,而非任何一个人的性状取值。方差即离散度,是标准差的平方:V= SD2。 POPH90111 . Genetic Epidemiology VARIANCE PARTITION Top split: Vp = VG + VE Genetic split: VG = VA + Vp (+ VI) Broad-sense: H2 = VG / Vp Narrow-sense: h2 = VA / Vp (always h2 ≤ Hº) VA additive (allele effects sum - the part that passes parent-child); Vp dominance (interaction within a locus); VI epistasis (between loci); VE environment. VA 加性(等位基因效应相加 -- 可由亲代→子代传递的那 部分);VD 显性(同一位点内部的交互);V1上位(位点之 间);VE 环境。 D2 Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. Shared GENES(degree of relatedness)closer kin share more DNAShared ENVIRONMENTdiet, lifestyle, exposuresfamilies live alikeFamily resemblance(observed familial aggregation)aggregation = genes + shared env + chanceDegrees of relatedness1st parent/sib/childshare 1/22nd grandparent/uncleshare 1/43rd first cousinshare 1/8Read-off: familial aggregation is NECESSARY but NOT SUFFICIENT for a genetic cause - shared environment is the rival. Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. 图 2. 1 -- 家族相似性由共享基因和共享环境共同喂 养。遗传力是我们试图读出方差中遗传那一份的尝试 -- 但这两个来源彼此混杂,这正是我们需要双生子 的原因。 i Narrow vs broad - why we usually quote h2 狭义 vs 广义 -- 为何我们通常引用 h2 Only the additive part VA is transmitted predictably from parent to offspring and drives resemblance between relatives and response to selection. Dominance and epistasis effects are reshuffled each generation, so narrow- sense h2 is the quantity that predicts relatives' correlations - which is precisely what twin studies measure. 只有加性部分 VA 可从亲代到子代可预测地传递,并驱动亲属间的相似性和对选择的响应。显性与上位效应每代都被重 新洗牌,因此狭义 h2 才是预测亲属相关的量 -- 这正是双生子研究所测量的。 - POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY 2. 2 SD - variance: the arithmetic you will be marked on 2. 2 SD→ 方差:会被评分的算术 Almost every heritability question hands you a standard deviation or a correlation, not a variance. The single most-dropped mark is forgetting that variance = SD squared. The course also estimates everything separately by sex and by zygosity, because the spread genuinely differs between groups. 几乎每道遗传率题目给你的都是标准差或相关,而非方差。最常被丢掉的一分就是忘了方差= SD 的平方。课程还会按性别和 合子性分别估计一切,因为各组间的离散度确实不同。 - EX 2A Turning a standard deviation into a variance L . variance A tutorial reports female adult height with SD = 9. 29 cm. The genetic component is estimated as VA = 55. Find Vp and then narrow-sense h2.[13]Source: asksia-bible-poph90111-bilingual.pdfOR = ad/bc。演算(ABCFS,“任一患病姐妹”):病例 13/462 对 对照 1/405 → OR =(13×404)/(449×1)= 11. 7 (95% CI 1. 7-98. 2) . 3 Interpret. OR > 1 and CI excluding 1 - positive familial aggregation - "evidence for, not proof of, a Beyond the 2x2 - the other measures 超越 2×2––其他度量 SMR · RECURRENCE - RISK RATIO SMR = Observed / Expected Expected = population age/sex rates x relatives' person-time Worked: mothers of cases O=45, E=17. 7 - SMR ~ AR = risk in a type-R relative of a case / population prevalence K FRR = RR given an affected 1st-degree relative vs general pop. POPH90111 . Genetic Epidemiology genetic aetiology. " 解读。OR>1且 CI 不含1→正向家族聚集性 -- “是支持 遗传性病因的证据,而非证明”。 ✓ Reading & R across degrees 跨亲缘等级解读 入R AR > 1 means relatives are at raised risk. If Ap declines as relatedness falls (sib > aunt > cousin), that gradient points to a genetic contribution; the rate of decline hints at polygenic (gentle decline) vs single- gene (steep) architecture. 入R >1 意味着亲属风险升高。若 入R 随亲缘关系下降 而递减(同胞>姑舅>表堂亲),该梯度指向遗传贡 献;递减的速率则提示是多基因(平缓递减)还是单 基因(陡峭)结构。 ! Match the design to the measure 让设计与度量相匹配 OR comes from case-control; RR / rate-ratio / SMR come from cohort designs. Quoting an RR off a case- control table (where you can't estimate risk) is a classic appraisal slip. And always close with the mantra: aggregation is evidence for, not proof of, an inherited aetiology. OR 来自病例对照;RR/ 率比 / SMR 来自队列设 计。从一张病例对照表(你无法估计风险)上引用 RR 是典型的评价失误。并且始终以那句口诀收尾:聚集 性是支持遗传性病因的证据,而非证明。 POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY UNDERSTAND . LO3 MODULE 2 . HERITABILITY How much of the variation is genetic? 变异中有多少源于遗传 Variance components . twin studies . Falconer . ACE . liability threshold 方差成分 · 双生子研究 · Falconer · ACE · 易感性阈值 Familial aggregation told us a trait runs in families. Heritability asks the sharper, quantitative question: of all the variation we see in a trait across a population, what proportion is due to genetic differences between people? The answer is a number between 0 and 1 - and the whole module is the machinery for estimating it, almost entirely from twins. This is the most calculation-heavy stretch of the course: you will partition variance, run Falconer's formula, solve the ACE equations, and map a yes/no disease onto a hidden continuous liability. Every estimate comes wrapped in the same critical-appraisal caution - heritability is a property of a population in an environment, never of an individual. 家族聚集性告诉我们某性状在家庭中流传。遗传率则提出更尖锐的定量问题:在一个人群中所见某性状的全部变异里,有多大 比例归因于人与人之间的遗传差异?答案是一个介于0与1之间的数字 -- 而整个模块就是估计它的机器,且几乎完全依靠 双生子。这是全课程计算量最大的一段:你将分解方差、运用 Falconer 公式、求解 ACE 方程组,并把一个是/否的疾病映射 到一个隐藏的连续易感性上。每个估计都裹挟着同一句批判性评价告诫 -- 遗传率是某一环境中某一人群的属性,绝非个体的 属性。
-
USE IN PRACTICE(M8,筛查)
-
Sensitivity/Specificity:检验固有属性;PPV/NPV 依赖患病率。[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[2]Source: asksia-bible-poph90111-bilingual.pdf! Critical appraisal - a good test can still be useless for screening 批判性评价 -- 再好的检验也可能不适合筛查 Sensitivity and specificity are intrinsic, but PPV depends on prevalence. A test with excellent sensitivity/specificity can be near-useless in a low-prevalence general population (PPV near zero) - exactly why genetic screening targets high-prior-risk groups. The ROC/AUC summarises discrimination across cut-offs; moving the cut-off trades sensitivity against specificity but never changes prevalence. 灵敏度和特异度是内在的,但PPV 取决于患病率。一个灵敏度/特异度极佳的检验,在低患病率的一般人群中可能近乎 无用(PPV 接近零) -- 这正是遗传筛查靶向高先验风险人群的原因。ROC/AUC 概括跨切点的区分度;移动切点是在 灵敏度与特异度之间权衡,但从不改变患病率。 Same habit on every item: calculate, interpret, appraise. The number alone is a fraction of the marks - the interpretation sentence and the design critique (bias, assumption, scale, prevalence) are where POPH90111 assessment lives. Write the appraisal sentence even when the question only asks for the number. 每道题都保持同一习惯:计算、解读、评价。光有数字只占一小部分分数 -- 解读句和设计批判(偏倚、假设、尺度、 患病率)才是 POPH90111 评估的所在。即便题目只问数字,也写出那句评价。 MARKER'S NOTE . POPH90111 . CALCULATE + INTERPRET + APPRAISE POPH90111 . Genetic Epidemiology AskSia Library VISUAL STUDY BIBLE . ASKSIA POP. & GLOBAL HEALTH SEMESTER 1 . 2026 p < 5x10-8 - THE COMPLETE STUDY BIBLE - Genetic Epidemiology 遗传流行病学 FROM A DNA STRAND TO A GWAS SKYLINE - PLUG THE NUMBERS AND JUDGE THE STUDY DESIGN. 墨尔本大学 POPH90111 · 双语视觉精读 · 在线开卷作业占 90% · 遗传率/GWAS/孟德尔随机化/ 筛查:计算 + 研究设计评价工具箱 POPH90111 . UNIVERSITY OF MELBOURNE 中英双语版 · BILINGUAL EDITION 英文主讲,中文随行 一 考试要点与术语保留英文原词 There is no closed-book exam in this subject. Your whole mark is three take-home pieces - a 10% online MCQ and two written assignments worth 90% combined - and all three are built from the same three task types: calculate, interpret, and critically appraise the study design. This book is therefore a calculation + appraisal toolkit: every method shown on real course numbers, every design tagged with its signature bias. Independent study companion. Not affiliated with or endorsed by the University of Melbourne. Corrections: takedowns@asksia. ai PREFACE - HOW TO USE THIS BOOK Plug the numbers, judge the design 代入数字,评判设计 Open-book and take-home - so it tests whether you can DO the method and APPRAISE it 开卷且带回家作答,因此考的是你能否动手做方法并加以评价 This is not a transcript of the lecture slides. It is a self-contained toolkit for every quantity and every study design POPH90111 examines - each concept opens with a diagram, states the formula plainly, works it on the course's own example numbers, gives a memorable analogy, then flags the assignment trap. Because all three assessments are open-book and take-home, the examiner cannot reward what you merely remember. Marks come from three things only: can you calculate it, can you interpret the output, and can you critically appraise the design that produced it? - - 这不是讲座幻灯片的逐字稿,而是一份自足的工具箱,涵盖 POPH90111 所考的每个量和每种研究设计 -- 每个概念都以一幅 图开篇,明白地写出公式,用课程自带的例题数字加以演算,给出一个易记的类比,再标出作业陷阱。由于三项评估全部开卷 且带回家作答,考官无法奖励你单纯记住的东西。分数仅来自三件事:你能否算出它、能否解释输出、能否批判性地评价产生 它的设计?
-
NNT/NNS:用 ARR 与 carrier frequency 把“是否值得筛查”量化;遗传筛查倾向于瞄准高风险人群。[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[11]Source: asksia-bible-poph90111-bilingual.pdfNNS - HIGH FAMILY HISTORY NNS -- 高家族史 ✓ Analogy - the lottery-ticket cost 类比 -- 彩票的代价 NNT is the price of a winning ticket once you are in the right queue (a carrier). NNS is the price including finding the right queue: in the general population you sift ~150 people to find one carrier, so the cost per prevented case balloons. Walk into a queue already full of carriers (high family history) and the sifting cost collapses. NNT 是当你已身处正确队列(即一名携带者)时一张中奖彩票的价格。NNS 则是加上找到正确队列的价格:在一般人群 中你要筛过约150人才能找到一名携带者,于是每避免一例的成本暴涨。走进一个已挤满携带者的队列(高家族史),筛 选成本就崩塌。 POPH90111 . Genetic Epidemiology ! Exam trap - NNS collapses for rare variants 考试陷阱 -- NNS 对罕见变异会失效 Screening efficiency (NNS) is dominated by carrier frequency, not by how good the treatment is. A wonderful treatment (low NNT) is wasted if you must screen thousands to find one carrier. Raising the prior probability of carriage - targeting high-family-history groups - is what makes a genetic screen worthwhile. State NNT and NNS, and always name the population the carrier frequency refers to. 筛查效率(NNS)由携带者频率主导,而非由治疗有多好主导。一个再好的治疗(低 NNT)若你必须筛上千人才找到一 名携带者,也是浪费。提高携带的先验概率 -- 靶向高家族史人群 -- 才是让一项遗传筛查值得做的关键。陈述 NNT 和 NNS,并始终点名携带者频率所指的人群。 POPH90111 . Genetic Epidemiology M8 . TEST METRICS - - MODULE 8 . EVALUATING A TEST STANDARD CANON Sensitivity, specificity, PPV / NPV 灵敏度、特异度、PPV / NPV The 2×2 of test result x true status - and why PPV depends on prevalence 检验结果 × 真实状态的 2×2 表 -- 以及 PPV 为何取决于患病率 Any "evaluate this test" question reduces to a 2×2 of test result (+/-) against true disease status (D+/D-), giving TP, FP, FN, TN. From it come four metrics - two intrinsic to the test (sensitivity, specificity) and two that depend on who you test (PPV, NPV). 任何“评价这项检验”的问题都归结为一张 2×2 表,列出检验结果(+/-)与真实疾病状态 (D+/D-),给出 TP、FP、FN、 TN。由它得出四项指标 -- 两项是检验固有的(灵敏度、特异度),两项取决于你检测的是谁(PPV、NPV)。 D5 machinery used for screening: read TP / FP / FN / TN off the cells. (Here the row split is test +/-, the column split i Disease: YES Disease: NO Exposed / FamHx + a b OR = (a x d) / (b xc) Unexposed / FamHx - C d RR = [a/(a+b)] / [c/(c+d)][20]Source: asksia-cheatsheet-poph90111.pdfSIA > Show the working: in this subject the marks live in the setup and the interpretation, not the final digit. Always write the formula, the substitution, then one sentence of meaning. asksia. ai/cheatsheet/ unimelb-poph90111 . side 2/2 AskSia CHEATSHEET SERIES Disease screening = a systematic test to find asymptomatic disease/precursors in people not seeking care. Genetic screening = find risk-raising variants in asymptomatic people so risk can be reduced/prevented; can be population-wide but is usually targeted to high-prior-risk groups (e. g. strong family history). Course twist: a genetic test can be "once-and-for-all" (your germline doesn't change), unlike repeated disease screening. Two uses: screen for genetic risk, or use a genetic factor to screen for disease. 30 . Wilson-Jungner WHO 1968 The screening-evaluation checklist (the course adapts all 10 to genetics): · Facilities for diagnosis & treatment exist Trap: "we can test" # "we should screen. " A test only helps if knowing the result reduces disease/disability/death and benefits beat harms (psychological, social, insurance, variants of unknown significance, false positives). Example genes the course uses: BRCA1/2, the mismatch-repair (MMR) genes, HTT - note HTT (Huntington) has no risk-reduction, which weakens the case for screening. 31 . NNT & NNS QUANTIFY BENEFIT ARR = carrier risk x proportion risk reduced NNT = 1 / ARR NNS = NNT / carrier frequency Worked (BRCA1/2): carrier breast-cancer risk to 70 = 0. 4; tamoxifen cuts risk 50% => ARR = 0. 4×0. 5 = 0. 2 => NNT = 1/0. 2 = 5 carriers treated to prevent one cancer. Carrier freq 0. 0067 (1 in 150) = NNS = 5/0. 0067 = 746 screened per cancer prevented. High-FH group (carrier freq 0. 25) = NNS = 5/0. 25 = 20 - far more efficient => justifies targeted screening. CARRIER FREQ NNT NNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH)
-
Wilson–Jungner(WHO 1968)原则:筛查要满足疾病重要、有潜伏期/自然史清楚、检验合适可接受、后续行动有效等;“能测 ≠ 该筛”。[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2[20]Source: asksia-cheatsheet-poph90111.pdfSIA > Show the working: in this subject the marks live in the setup and the interpretation, not the final digit. Always write the formula, the substitution, then one sentence of meaning. asksia. ai/cheatsheet/ unimelb-poph90111 . side 2/2 AskSia CHEATSHEET SERIES Disease screening = a systematic test to find asymptomatic disease/precursors in people not seeking care. Genetic screening = find risk-raising variants in asymptomatic people so risk can be reduced/prevented; can be population-wide but is usually targeted to high-prior-risk groups (e. g. strong family history). Course twist: a genetic test can be "once-and-for-all" (your germline doesn't change), unlike repeated disease screening. Two uses: screen for genetic risk, or use a genetic factor to screen for disease. 30 . Wilson-Jungner WHO 1968 The screening-evaluation checklist (the course adapts all 10 to genetics): · Facilities for diagnosis & treatment exist Trap: "we can test" # "we should screen. " A test only helps if knowing the result reduces disease/disability/death and benefits beat harms (psychological, social, insurance, variants of unknown significance, false positives). Example genes the course uses: BRCA1/2, the mismatch-repair (MMR) genes, HTT - note HTT (Huntington) has no risk-reduction, which weakens the case for screening. 31 . NNT & NNS QUANTIFY BENEFIT ARR = carrier risk x proportion risk reduced NNT = 1 / ARR NNS = NNT / carrier frequency Worked (BRCA1/2): carrier breast-cancer risk to 70 = 0. 4; tamoxifen cuts risk 50% => ARR = 0. 4×0. 5 = 0. 2 => NNT = 1/0. 2 = 5 carriers treated to prevent one cancer. Carrier freq 0. 0067 (1 in 150) = NNS = 5/0. 0067 = 746 screened per cancer prevented. High-FH group (carrier freq 0. 25) = NNS = 5/0. 25 = 20 - far more efficient => justifies targeted screening. CARRIER FREQ NNT NNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH)
-
3)最常见“扣分陷阱清单”(考前必刷)
- 只给数字不写 interpret/appraise:直接丢大头分;每题必须补一句解释 + 一句评价。[2]Source: asksia-bible-poph90111-bilingual.pdf! Critical appraisal - a good test can still be useless for screening 批判性评价 -- 再好的检验也可能不适合筛查 Sensitivity and specificity are intrinsic, but PPV depends on prevalence. A test with excellent sensitivity/specificity can be near-useless in a low-prevalence general population (PPV near zero) - exactly why genetic screening targets high-prior-risk groups. The ROC/AUC summarises discrimination across cut-offs; moving the cut-off trades sensitivity against specificity but never changes prevalence. 灵敏度和特异度是内在的,但PPV 取决于患病率。一个灵敏度/特异度极佳的检验,在低患病率的一般人群中可能近乎 无用(PPV 接近零) -- 这正是遗传筛查靶向高先验风险人群的原因。ROC/AUC 概括跨切点的区分度;移动切点是在 灵敏度与特异度之间权衡,但从不改变患病率。 Same habit on every item: calculate, interpret, appraise. The number alone is a fraction of the marks - the interpretation sentence and the design critique (bias, assumption, scale, prevalence) are where POPH90111 assessment lives. Write the appraisal sentence even when the question only asks for the number. 每道题都保持同一习惯:计算、解读、评价。光有数字只占一小部分分数 -- 解读句和设计批判(偏倚、假设、尺度、 患病率)才是 POPH90111 评估的所在。即便题目只问数字,也写出那句评价。 MARKER'S NOTE . POPH90111 . CALCULATE + INTERPRET + APPRAISE POPH90111 . Genetic Epidemiology AskSia Library VISUAL STUDY BIBLE . ASKSIA POP. & GLOBAL HEALTH SEMESTER 1 . 2026 p < 5x10-8 - THE COMPLETE STUDY BIBLE - Genetic Epidemiology 遗传流行病学 FROM A DNA STRAND TO A GWAS SKYLINE - PLUG THE NUMBERS AND JUDGE THE STUDY DESIGN. 墨尔本大学 POPH90111 · 双语视觉精读 · 在线开卷作业占 90% · 遗传率/GWAS/孟德尔随机化/ 筛查:计算 + 研究设计评价工具箱 POPH90111 . UNIVERSITY OF MELBOURNE 中英双语版 · BILINGUAL EDITION 英文主讲,中文随行 一 考试要点与术语保留英文原词 There is no closed-book exam in this subject. Your whole mark is three take-home pieces - a 10% online MCQ and two written assignments worth 90% combined - and all three are built from the same three task types: calculate, interpret, and critically appraise the study design. This book is therefore a calculation + appraisal toolkit: every method shown on real course numbers, every design tagged with its signature bias. Independent study companion. Not affiliated with or endorsed by the University of Melbourne. Corrections: takedowns@asksia. ai PREFACE - HOW TO USE THIS BOOK Plug the numbers, judge the design 代入数字,评判设计 Open-book and take-home - so it tests whether you can DO the method and APPRAISE it 开卷且带回家作答,因此考的是你能否动手做方法并加以评价 This is not a transcript of the lecture slides. It is a self-contained toolkit for every quantity and every study design POPH90111 examines - each concept opens with a diagram, states the formula plainly, works it on the course's own example numbers, gives a memorable analogy, then flags the assignment trap. Because all three assessments are open-book and take-home, the examiner cannot reward what you merely remember. Marks come from three things only: can you calculate it, can you interpret the output, and can you critically appraise the design that produced it? - - 这不是讲座幻灯片的逐字稿,而是一份自足的工具箱,涵盖 POPH90111 所考的每个量和每种研究设计 -- 每个概念都以一幅 图开篇,明白地写出公式,用课程自带的例题数字加以演算,给出一个易记的类比,再标出作业陷阱。由于三项评估全部开卷 且带回家作答,考官无法奖励你单纯记住的东西。分数仅来自三件事:你能否算出它、能否解释输出、能否批判性地评价产生 它的设计?[4]Source: asksia-bible-poph90111-bilingual.pdfModules 4-8 · multi- week window Penetrance-from-OR, NNT, NNS Plug real incidence & ARR, get the number The list of study designs Name the signature bias & appraise it POPH90111 . Genetic Epidemiology ★ The format - open-book, calculation + appraisal 考试形式 -- 开卷,计算+评价 Nothing here is invigilated. The MCQ is a 30-minute LMS quiz of 10 questions, completable any time within one week of release, covering the Introduction + Module 1 (scope of genetic epi, the role of familial aggregation, the designs that measure it, and calculating & interpreting aggregation measures). The assignments are written documents submitted online over a multi-week window. Because the book is open, no fact is worth a mark on its own - only the method, the interpretation, and the appraisal are. 此处无任何监考。MCQ 是一份10题、30分钟的 LMS 测验,在发布后一周内任意时间可完成,涵盖引 言+模块 1(遗传流行病学的范畴、家族聚集性的作 用、测量它的设计,以及计算与解读聚集性测度)。两 份作业为多周窗口内在线提交的书面文档。由于本书 是开卷的,没有哪个事实本身值分 -- 只有方法、解 读和评价才值分。 ✓ The strategy this dictates 由此决定的策略 Every question is a chain: calculate - interpret ++ appraise. The recurring chains are - 2×2 - OR/RR -+ 'evidence for not proof of'; rMz, roz - Falconer h2 -+ equal-environments caveat; OR + population incidence - penetrance - ascertainment-bias check; ARR - NNT - NNS - target high-risk. For every design write its measure, its one strength and its one characteristic bias - LO5 (appraisal) threads through all 8 modules and is where the easy marks live. 每道题都是一条链:计算→解读→评价。反复出现 的链有 -- 2×2→OR/RR→“是证据而非证明”; 「MZ、『Dz→ Falconer h2→等环境注意事项;OR+ 人群发病率→ 外显率→ 确认偏倚核查;ARR → NNT → NNS → 靶向高风险。对每一个设计都写出它 的测度、它的一个优势和它的一个特征性偏倚 -- LO5(评价)贯穿全部8个模块,正是容易得分之 处。 POPH90111 . Genetic Epidemiology CONTENTS - CONTENTS The genetic-epi pipeline, end to end 遗传流行病学全流程,从头到尾 Four questions - understand, discover, characterise, use - one toolkit 四个问题 -- 理解、发现、刻画、应用一一共用一套工具 D1 The genetic-epidemiology pipeline: which method answers which question UNDERSTANDING is there a genetic role? DISCOVERING which genes / causes? CHARACTERISING how risky? USING IN PRACTICE should we screen / act? Familial aggregation GWAS / association Penetrance (M5) Screening (M8) Twin studies
- 把 PPV 当成检验固有属性:错;PPV 随 prevalence 变,低患病率筛查 PPV 可接近 0。[2]Source: asksia-bible-poph90111-bilingual.pdf! Critical appraisal - a good test can still be useless for screening 批判性评价 -- 再好的检验也可能不适合筛查 Sensitivity and specificity are intrinsic, but PPV depends on prevalence. A test with excellent sensitivity/specificity can be near-useless in a low-prevalence general population (PPV near zero) - exactly why genetic screening targets high-prior-risk groups. The ROC/AUC summarises discrimination across cut-offs; moving the cut-off trades sensitivity against specificity but never changes prevalence. 灵敏度和特异度是内在的,但PPV 取决于患病率。一个灵敏度/特异度极佳的检验,在低患病率的一般人群中可能近乎 无用(PPV 接近零) -- 这正是遗传筛查靶向高先验风险人群的原因。ROC/AUC 概括跨切点的区分度;移动切点是在 灵敏度与特异度之间权衡,但从不改变患病率。 Same habit on every item: calculate, interpret, appraise. The number alone is a fraction of the marks - the interpretation sentence and the design critique (bias, assumption, scale, prevalence) are where POPH90111 assessment lives. Write the appraisal sentence even when the question only asks for the number. 每道题都保持同一习惯:计算、解读、评价。光有数字只占一小部分分数 -- 解读句和设计批判(偏倚、假设、尺度、 患病率)才是 POPH90111 评估的所在。即便题目只问数字,也写出那句评价。 MARKER'S NOTE . POPH90111 . CALCULATE + INTERPRET + APPRAISE POPH90111 . Genetic Epidemiology AskSia Library VISUAL STUDY BIBLE . ASKSIA POP. & GLOBAL HEALTH SEMESTER 1 . 2026 p < 5x10-8 - THE COMPLETE STUDY BIBLE - Genetic Epidemiology 遗传流行病学 FROM A DNA STRAND TO A GWAS SKYLINE - PLUG THE NUMBERS AND JUDGE THE STUDY DESIGN. 墨尔本大学 POPH90111 · 双语视觉精读 · 在线开卷作业占 90% · 遗传率/GWAS/孟德尔随机化/ 筛查:计算 + 研究设计评价工具箱 POPH90111 . UNIVERSITY OF MELBOURNE 中英双语版 · BILINGUAL EDITION 英文主讲,中文随行 一 考试要点与术语保留英文原词 There is no closed-book exam in this subject. Your whole mark is three take-home pieces - a 10% online MCQ and two written assignments worth 90% combined - and all three are built from the same three task types: calculate, interpret, and critically appraise the study design. This book is therefore a calculation + appraisal toolkit: every method shown on real course numbers, every design tagged with its signature bias. Independent study companion. Not affiliated with or endorsed by the University of Melbourne. Corrections: takedowns@asksia. ai PREFACE - HOW TO USE THIS BOOK Plug the numbers, judge the design 代入数字,评判设计 Open-book and take-home - so it tests whether you can DO the method and APPRAISE it 开卷且带回家作答,因此考的是你能否动手做方法并加以评价 This is not a transcript of the lecture slides. It is a self-contained toolkit for every quantity and every study design POPH90111 examines - each concept opens with a diagram, states the formula plainly, works it on the course's own example numbers, gives a memorable analogy, then flags the assignment trap. Because all three assessments are open-book and take-home, the examiner cannot reward what you merely remember. Marks come from three things only: can you calculate it, can you interpret the output, and can you critically appraise the design that produced it? - - 这不是讲座幻灯片的逐字稿,而是一份自足的工具箱,涵盖 POPH90111 所考的每个量和每种研究设计 -- 每个概念都以一幅 图开篇,明白地写出公式,用课程自带的例题数字加以演算,给出一个易记的类比,再标出作业陷阱。由于三项评估全部开卷 且带回家作答,考官无法奖励你单纯记住的东西。分数仅来自三件事:你能否算出它、能否解释输出、能否批判性地评价产生 它的设计?[6]Source: asksia-bible-poph90111-bilingual.pdf— Cases only; under GLE in the source population, a G-E association among cases efficiently estimates the multiplicative interaction (biased if G,E correlated). Module 8 - Screening (USE IN PRACTICE) Wilson-Jungner principles — WHO (1968) 10-point checklist: important condition with a latent stage & understood natural history, a suitable acceptable test, an accepted action, balanced cost. Number needed to treat (NNT) — NNT = 1/ Absolute Risk Reduction, with ARR = carrier risk x proportion risk reduced by treatment; carriers treated to prevent one case. POPH90111 . Genetic Epidemiology Term (EN) 中文 One-line meaning Number needed to screen (NNS) — NNS = NNT / carrier frequency; people screened to prevent one case - collapses for rare variants, so genetic screening is targeted. Absolute risk reduction (ARR) — The drop in absolute risk from the action: ARR = baseline risk - treated risk; the reciprocal of NNT. Sensitivity TP/(TP+FN) = P(test+ | diseased); the test's ability to detect true cases. An intrinsic test — property. Specificity — TN/(TN+FP) = P(test- | healthy); correctly clears the well. An intrinsic test property. PPV / NPV — PPV = TP/(TP+FP) = P(diseased | test+); NPV = TN/(TN+FN); PPV rises and NPV falls as prevalence rises - why low-prevalence screens give many false positives. ROC curve / AUC — Sensitivity vs 1-specificity across cut-offs; AUC = P(a random case scores above a random control); 0. 5 = chance, 1. 0 = perfect (CAD PRS AUC ~ 0. 81). ★ How to drill this glossary Az 如何演练这份术语表 用法 Three layers per term: recall the term from the meaning, recall the formula where one exists, and say the one-line appraisal hook (e. g. "aggregation is evidence for, not proof of, a genetic cause"; "PPV depends on prevalence"). The exam and assignments reward the hook as much as the number. 〔每个术语三层:由释义回忆术语、回忆公式、复述那 句批判性结论。〕 每个术语三层:由释义回忆术语、回忆公式(若存在)、复述那句一行评价钩子(如“聚集性是支持遗传原因的证据,而 非证明”;“PPV取决于患病率”)。考试和作业奖励钩子的程度不亚于数字。〔每个术语三层:由释义回忆术语、回忆公 式、复述那句批判性结论。〕 POPH90111 . Genetic Epidemiology PRACTICE Q1-Q2
- 读错 ROC 横轴:横轴是 $1-$specificity,不是 specificity。[5]Source: asksia-bible-poph90111-bilingual.pdf按后果选取一个操作切点:一个必须少漏病例的筛查偏好高灵敏度(接受更多假阳性);一个确证检验偏好高特异度。 ✓ Analogy - a bouncer's strictness dial 类比 -- 保安的严格程度旋钮 Think of one dial controlling how strict a bouncer is. Turn it strict and almost nobody gets in - few mistaken admits (low FPR) but you also turn away real guests (low sensitivity): bottom-left. Turn it lenient and everyone gets in - high sensitivity but lots of gate-crashers (high FPR): top-right. The ROC traces every dial setting; AUC says how good the bouncer's judgement is, independent of where you set the dial. 想象一个旋钮,控制一名门卫有多严格。拧到严格,几乎没人能进 -- 误放进的少(低 FPR),但你也拒掉了真客人(低 灵敏度):左下角。拧到宽松,人人都进 -- 高灵敏度但大量闯入者(高FPR):右上角。ROC 描绘每一个旋钮设定; AUC 则说明门卫的判断力有多好,与你把旋钮设在哪里无关。 ! Exam trap - AUC measures discrimination, not usefulness 考试陷阱 -- AUC 衡量的是区分度,而非有用性 A high AUC means the score separates cases from controls, but it says nothing about PPV at a given prevalence or whether acting on the result helps. Two more slips: (1) the x-axis is 1-specificity, not specificity; (2) moving the cut- off trades sensitivity against specificity - you cannot improve both by sliding the threshold, only by getting a better test (a curve closer to the top-left). In this course AUC mainly appears for polygenic risk scores. 高 AUC 意味着该评分能区分病例与对照,但它对给定患病率下的PPV、或据结果行动是否有帮助,一无所知。还有两 个失误:(1)横轴是1-特异度,不是特异度;(2)移动切点是在灵敏度与特异度之间权衡 -- 你无法靠滑动阈值同时改善 两者,只能靠拿到一个更好的检验(一条更贴近左上角的曲线)。本课程中AUC 主要出现在多基因风险评分上。 POPH90111 . Genetic Epidemiology M8 . APPRAISAL ASSIGNMENT GOLD MODULE 8 . PUTTING IT TOGETHER Critically appraising a screening programme 对筛查项目进行批判性评价 A reusable checklist - condition, test, action, numbers, harms, equity - for any "should we screen?" question 一份可复用的清单 -- 疾病、检验、行动、数字、危害、公平性 -- 适用于任何“我们该筛查吗”之问 Assignment and exam questions reward appraisal, not recall. When handed a proposed genetic screen, march through this checklist and back every verdict with a Wilson-Jungner principle or a number (NNT, NNS, PPV at the relevant prevalence). 作业和考试题奖励的是评价而非死记。当拿到一个拟议的遗传筛查方案时,逐条走完这份清单,并用某条 Wilson-Jungner 标 准或某个数字(NNT、NNS、相应患病率下的 PPV)来支撑每一项判断。 Lens Ask What good evidence looks like / red flag 1 - The condition Burden Is it common/serious enough? Important health problem with a recognisable latent stage (W-J 1-2). Natural history Is penetrance & onset understood? Known age-specific penetrance. Red flag: uncertain penetrance / variants of unknown significance. 2 - The test Validity
- 把 AUC 当“筛查有用”证据:AUC 只是区分度,不代表 PPV/NNS 或净获益。[5]Source: asksia-bible-poph90111-bilingual.pdf按后果选取一个操作切点:一个必须少漏病例的筛查偏好高灵敏度(接受更多假阳性);一个确证检验偏好高特异度。 ✓ Analogy - a bouncer's strictness dial 类比 -- 保安的严格程度旋钮 Think of one dial controlling how strict a bouncer is. Turn it strict and almost nobody gets in - few mistaken admits (low FPR) but you also turn away real guests (low sensitivity): bottom-left. Turn it lenient and everyone gets in - high sensitivity but lots of gate-crashers (high FPR): top-right. The ROC traces every dial setting; AUC says how good the bouncer's judgement is, independent of where you set the dial. 想象一个旋钮,控制一名门卫有多严格。拧到严格,几乎没人能进 -- 误放进的少(低 FPR),但你也拒掉了真客人(低 灵敏度):左下角。拧到宽松,人人都进 -- 高灵敏度但大量闯入者(高FPR):右上角。ROC 描绘每一个旋钮设定; AUC 则说明门卫的判断力有多好,与你把旋钮设在哪里无关。 ! Exam trap - AUC measures discrimination, not usefulness 考试陷阱 -- AUC 衡量的是区分度,而非有用性 A high AUC means the score separates cases from controls, but it says nothing about PPV at a given prevalence or whether acting on the result helps. Two more slips: (1) the x-axis is 1-specificity, not specificity; (2) moving the cut- off trades sensitivity against specificity - you cannot improve both by sliding the threshold, only by getting a better test (a curve closer to the top-left). In this course AUC mainly appears for polygenic risk scores. 高 AUC 意味着该评分能区分病例与对照,但它对给定患病率下的PPV、或据结果行动是否有帮助,一无所知。还有两 个失误:(1)横轴是1-特异度,不是特异度;(2)移动切点是在灵敏度与特异度之间权衡 -- 你无法靠滑动阈值同时改善 两者,只能靠拿到一个更好的检验(一条更贴近左上角的曲线)。本课程中AUC 主要出现在多基因风险评分上。 POPH90111 . Genetic Epidemiology M8 . APPRAISAL ASSIGNMENT GOLD MODULE 8 . PUTTING IT TOGETHER Critically appraising a screening programme 对筛查项目进行批判性评价 A reusable checklist - condition, test, action, numbers, harms, equity - for any "should we screen?" question 一份可复用的清单 -- 疾病、检验、行动、数字、危害、公平性 -- 适用于任何“我们该筛查吗”之问 Assignment and exam questions reward appraisal, not recall. When handed a proposed genetic screen, march through this checklist and back every verdict with a Wilson-Jungner principle or a number (NNT, NNS, PPV at the relevant prevalence). 作业和考试题奖励的是评价而非死记。当拿到一个拟议的遗传筛查方案时,逐条走完这份清单,并用某条 Wilson-Jungner 标 准或某个数字(NNT、NNS、相应患病率下的 PPV)来支撑每一项判断。 Lens Ask What good evidence looks like / red flag 1 - The condition Burden Is it common/serious enough? Important health problem with a recognisable latent stage (W-J 1-2). Natural history Is penetrance & onset understood? Known age-specific penetrance. Red flag: uncertain penetrance / variants of unknown significance. 2 - The test Validity
- 遗传率忘了方差=SD²:最常丢的一分。[10]Source: asksia-bible-poph90111-bilingual.pdf★ What the assignment / exam asks here 此处作业/考试考什么 (1) Partition variance: given Va, Vd, Ve compute narrow-sense h2 = Va/Vp and broad-sense H2 = Vg/Vp. (2) From twin correlations apply Falconer: h2 = 2(rMZ - rDZ) and interpret. (3) Solve the ACE system for A, C, E from rMZ and rDZ. (4) Convert SD - variance (Vp = SD2) and explain estimating separately by sex and zygosity. (5) Explain the liability-threshold model for a binary trait. (6) The marks-rich part: critically appraise - equal-environments assumption, 'evidence for not proof of', missing heritability, what heritability does not mean. (1)分解方差:给定 Va、Vd、Ve,计算狭义 h2 = Va/Vp 和广义 H2 = Vg/Vp。(2)由双生子相关应用 Falconer: h2= 2(rMZ - rDZ) 并解读。(3)由 rMZ 和 rDZ 求解 ACE 方程组得 A、C、E。(4)把 SD→方差(Vp= SD2)并解释为何 要按性别和合子性分别估计。(5)解释二分性状的易感性阈值模型。(6)富含分数的部分:严格评价一 -等环境假设、“是 证据而非证明”、缺失遗传力,以及遗传力不意味着什么。 2. 1 Heritability is a proportion of variance 2. 1 遗传率是方差的一个比例 Start from the definition the course insists on. Heritability is the proportion of the phenotypic variance in a population that is attributable to genetic variance. The key word is variance - we are explaining differences between people, not the trait value of any one person. Variance is the spread, the squared standard deviation: V = SD2. 从课程坚持采用的定义出发。遗传率是一个人群中表型方差里可归因于遗传方差的比例。关键词是方差 -- 我们解释的是人与 人之间的差异,而非任何一个人的性状取值。方差即离散度,是标准差的平方:V= SD2。 POPH90111 . Genetic Epidemiology VARIANCE PARTITION Top split: Vp = VG + VE Genetic split: VG = VA + Vp (+ VI) Broad-sense: H2 = VG / Vp Narrow-sense: h2 = VA / Vp (always h2 ≤ Hº) VA additive (allele effects sum - the part that passes parent-child); Vp dominance (interaction within a locus); VI epistasis (between loci); VE environment. VA 加性(等位基因效应相加 -- 可由亲代→子代传递的那 部分);VD 显性(同一位点内部的交互);V1上位(位点之 间);VE 环境。 D2 Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. Shared GENES(degree of relatedness)closer kin share more DNAShared ENVIRONMENTdiet, lifestyle, exposuresfamilies live alikeFamily resemblance(observed familial aggregation)aggregation = genes + shared env + chanceDegrees of relatedness1st parent/sib/childshare 1/22nd grandparent/uncleshare 1/43rd first cousinshare 1/8Read-off: familial aggregation is NECESSARY but NOT SUFFICIENT for a genetic cause - shared environment is the rival. Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. 图 2. 1 -- 家族相似性由共享基因和共享环境共同喂 养。遗传力是我们试图读出方差中遗传那一份的尝试 -- 但这两个来源彼此混杂,这正是我们需要双生子 的原因。 i Narrow vs broad - why we usually quote h2 狭义 vs 广义 -- 为何我们通常引用 h2 Only the additive part VA is transmitted predictably from parent to offspring and drives resemblance between relatives and response to selection. Dominance and epistasis effects are reshuffled each generation, so narrow- sense h2 is the quantity that predicts relatives' correlations - which is precisely what twin studies measure. 只有加性部分 VA 可从亲代到子代可预测地传递,并驱动亲属间的相似性和对选择的响应。显性与上位效应每代都被重 新洗牌,因此狭义 h2 才是预测亲属相关的量 -- 这正是双生子研究所测量的。 - POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY 2. 2 SD - variance: the arithmetic you will be marked on 2. 2 SD→ 方差:会被评分的算术 Almost every heritability question hands you a standard deviation or a correlation, not a variance. The single most-dropped mark is forgetting that variance = SD squared. The course also estimates everything separately by sex and by zygosity, because the spread genuinely differs between groups. 几乎每道遗传率题目给你的都是标准差或相关,而非方差。最常被丢掉的一分就是忘了方差= SD 的平方。课程还会按性别和 合子性分别估计一切,因为各组间的离散度确实不同。 - EX 2A Turning a standard deviation into a variance L . variance A tutorial reports female adult height with SD = 9. 29 cm. The genetic component is estimated as VA = 55. Find Vp and then narrow-sense h2.
- 遗传率解释成“个体有 64% 是基因决定”:错;它是人群方差比例,且环境特异。[10]Source: asksia-bible-poph90111-bilingual.pdf★ What the assignment / exam asks here 此处作业/考试考什么 (1) Partition variance: given Va, Vd, Ve compute narrow-sense h2 = Va/Vp and broad-sense H2 = Vg/Vp. (2) From twin correlations apply Falconer: h2 = 2(rMZ - rDZ) and interpret. (3) Solve the ACE system for A, C, E from rMZ and rDZ. (4) Convert SD - variance (Vp = SD2) and explain estimating separately by sex and zygosity. (5) Explain the liability-threshold model for a binary trait. (6) The marks-rich part: critically appraise - equal-environments assumption, 'evidence for not proof of', missing heritability, what heritability does not mean. (1)分解方差:给定 Va、Vd、Ve,计算狭义 h2 = Va/Vp 和广义 H2 = Vg/Vp。(2)由双生子相关应用 Falconer: h2= 2(rMZ - rDZ) 并解读。(3)由 rMZ 和 rDZ 求解 ACE 方程组得 A、C、E。(4)把 SD→方差(Vp= SD2)并解释为何 要按性别和合子性分别估计。(5)解释二分性状的易感性阈值模型。(6)富含分数的部分:严格评价一 -等环境假设、“是 证据而非证明”、缺失遗传力,以及遗传力不意味着什么。 2. 1 Heritability is a proportion of variance 2. 1 遗传率是方差的一个比例 Start from the definition the course insists on. Heritability is the proportion of the phenotypic variance in a population that is attributable to genetic variance. The key word is variance - we are explaining differences between people, not the trait value of any one person. Variance is the spread, the squared standard deviation: V = SD2. 从课程坚持采用的定义出发。遗传率是一个人群中表型方差里可归因于遗传方差的比例。关键词是方差 -- 我们解释的是人与 人之间的差异,而非任何一个人的性状取值。方差即离散度,是标准差的平方:V= SD2。 POPH90111 . Genetic Epidemiology VARIANCE PARTITION Top split: Vp = VG + VE Genetic split: VG = VA + Vp (+ VI) Broad-sense: H2 = VG / Vp Narrow-sense: h2 = VA / Vp (always h2 ≤ Hº) VA additive (allele effects sum - the part that passes parent-child); Vp dominance (interaction within a locus); VI epistasis (between loci); VE environment. VA 加性(等位基因效应相加 -- 可由亲代→子代传递的那 部分);VD 显性(同一位点内部的交互);V1上位(位点之 间);VE 环境。 D2 Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. Shared GENES(degree of relatedness)closer kin share more DNAShared ENVIRONMENTdiet, lifestyle, exposuresfamilies live alikeFamily resemblance(observed familial aggregation)aggregation = genes + shared env + chanceDegrees of relatedness1st parent/sib/childshare 1/22nd grandparent/uncleshare 1/43rd first cousinshare 1/8Read-off: familial aggregation is NECESSARY but NOT SUFFICIENT for a genetic cause - shared environment is the rival. Fig 2. 1 - Family resemblance is fed by shared genes AND shared environment. Heritability is our attempt to read off the genetic slice of the variance - but the two sources are confounded, which is exactly why we need twins. 图 2. 1 -- 家族相似性由共享基因和共享环境共同喂 养。遗传力是我们试图读出方差中遗传那一份的尝试 -- 但这两个来源彼此混杂,这正是我们需要双生子 的原因。 i Narrow vs broad - why we usually quote h2 狭义 vs 广义 -- 为何我们通常引用 h2 Only the additive part VA is transmitted predictably from parent to offspring and drives resemblance between relatives and response to selection. Dominance and epistasis effects are reshuffled each generation, so narrow- sense h2 is the quantity that predicts relatives' correlations - which is precisely what twin studies measure. 只有加性部分 VA 可从亲代到子代可预测地传递,并驱动亲属间的相似性和对选择的响应。显性与上位效应每代都被重 新洗牌,因此狭义 h2 才是预测亲属相关的量 -- 这正是双生子研究所测量的。 - POPH90111 . Genetic Epidemiology MODULE 2 . HERITABILITY 2. 2 SD - variance: the arithmetic you will be marked on 2. 2 SD→ 方差:会被评分的算术 Almost every heritability question hands you a standard deviation or a correlation, not a variance. The single most-dropped mark is forgetting that variance = SD squared. The course also estimates everything separately by sex and by zygosity, because the spread genuinely differs between groups. 几乎每道遗传率题目给你的都是标准差或相关,而非方差。最常被丢掉的一分就是忘了方差= SD 的平方。课程还会按性别和 合子性分别估计一切,因为各组间的离散度确实不同。 - EX 2A Turning a standard deviation into a variance L . variance A tutorial reports female adult height with SD = 9. 29 cm. The genetic component is estimated as VA = 55. Find Vp and then narrow-sense h2.[25]Source: asksia-cheatsheet-poph90111.pdf· Random mating (no assortative mating inflating r_DZ) Concordance/correlation are estimated separately by sex & zygosity because variance differs. Binary worked: conc_MZ=0. 40, conc_DZ=0. 15 => h2(liability) = 2(0. 40-0. 15) = 0. 50 . MZ>DZ concordance is the signal; near-equality (conc_MZ=conc_DZ) => shared environment, not genes, drives the resemblance. Compiled by AskSia . mapped to the POPH90111 syllabus . asksia. ai/cheatsheet/unimelb-poph90111 12 . Liability- Threshold BINARY TRAITS Assume an unobserved continuous liability (genes+env), ~Normal; disease occurs above a threshold set by prevalence. Puts yes/no disease onto a continuous scale so variance/heritability methods apply. LIABILITY ~NORMAL . DISEASE = TAIL BEYOND THRESHOLD ithreshold T affected liability + Tail area = prevalence. Relatives of cases sit at a right- shifted liability distribution => larger tail => higher risk, the model's link from heritability to a yes/no trait. Trap: heritability of liability # heritability "of the disease," and is very sensitive to the assumed prevalence (which sets where T sits). 13 . Heritability Cautions ASSIGNMENT GOLD High h2 does NOT mean: (a) the trait is unmodifiable; (b) genes cause between-group/between-population differences; or (c) anything about an individual. It is a population- & environment-specific quantity. Missing heritability: GWAS-discovered SNPs explain far less variance than the twin-study h2. Candidate causes: private (family-specific) mutations, rare moderate-risk variants, additional undiscovered common SNPs, gene-gene interactions, and non- genetic factors correlated within relatives. So twin-estimated h2 and GWAS-explained variance are different quantities - don't expect the discovered SNPs to "add up" to the twin h2. High h2 # "untreatable": environment can still shift the whole distribution (height is highly heritable yet population mean rose with nutrition). 14 . Genetic Association MODULE 3 = a case-control study where the exposure is a genetic marker (a SNP). Association arises if the SNP causes disease, is in LD with a causal variant, or is confounded by ancestry (stratification). Candidate-gene = a few pre-specified, biologically- motivated SNPs; GWAS = hundreds of thousands- millions of SNPs, scanned agnostically across the whole genome. The marker is the exposure; cases vs controls are compared on marker frequency, reported as an OR + 95% CI per SNP. An association is useful for prediction even if non- causal. Three reasons a SNP associates with disease: · the SNP causes disease (directly functional)
- 不写“evidence for, not proof of”:材料反复强调这是全课程通用得分句。[1]Source: asksia-bible-poph90111-bilingual.pdfA 1 . LEARN 1· 学习 You haven't watched the module yet. Read a chapter top to bottom. Every concept is an AHA-unit: diagram - explainer + method-in-steps - analogy + trap. The figures are original schematics of the standard genetic-epi canon - learn the idea cold before the numbers. 你还没看过该模块。从头到尾读 一章。每个概念都是一个 AHA 单元:图示→讲解→ 分步方法 →类比→陷阱。图均为标准遗 传流行病学经典内容的原创示意 图 -- 先把思路彻底吃透,再上 数字。 B 2 . DRILL 2 · 演练 You've watched the module and done the Q&A. Cover the worked steps and re-derive each OR, RR, SMR, h2, penetrance and NNT by hand. Then say aloud the interpretation and the one bias that would change your conclusion. 你已看过该模块并做完 Q&A。 遮住已演算的步骤,亲手重新推 导每一个 OR、RR、SMR、h2、 外显率和 NNT。然后大声说出解 读,以及那一个会改变你结论的 偏倚。 C 3 . ASSIGNMENT 3 · 作业 The assignment is open. Use the design-appraisal grids and worked-formula boxes as your lookup. Every question is calculate - interpret -+ appraise; the Stata . do file (A1 Q1) does the arithmetic, but the interpretation and appraisal marks are yours to write. 作业是开卷的。把设计评价表格 和公式演算框当作你的查阅工 具。每道题都是计算→解读→ 评价;Stata 的 . do 文件(A1 Q1)负责算术,但解读和评价的 分数要你自己写出来。 POPH90111 . Genetic Epidemiology ! The single most important thing to understand about POPH90111 关于 POPH90111 最重要的一点认识 This subject is a logical pipeline, not a pile of topics: UNDERSTAND (is there a genetic role? - familial aggregation M1, heritability M2) - DISCOVER (which variants? - association/GWAS M3) - CHARACTERISE (how risky? - penetrance M5, modifiers M6, G×E M7) - USE IN PRACTICE (causal exposures via MR M4, screening M8). One mantra recurs at every stage: familial aggregation / high MZ-vs-DZ correlation is 'evidence for, but not proof of, an inherited genetic aetiology' - shared environment is always the rival explanation. Write that sentence whenever you interpret aggregation or heritability. 本科目是一条逻辑流水线,而非一堆零散主题:理解(是否存在遗传作用?→家族聚集性 M1、遗传力 M2)→发现 (哪些变异?→关联/GWAS M3)→刻画(风险有多大?→外显率M5、修饰因子 M6、GxE M7)→实践应用(经 MR 评估因果暴露 M4、筛查 M8)。一句口诀贯穿每一阶段:家族聚集性/高的 MZ-对-DZ 相关是“支持遗传性病因的 证据,但非证明” -- 共享环境永远是竞争性解释。每当你解读聚集性或遗传力时都写下这句话。 i How this book was built - and the two-layer rule 本书如何编成 -- 以及两层规则 Standard genetic and statistical canon (Hardy-Weinberg, LD measures, Falconer's h2, the ACE model, Manhattan/QQ plots, MR's three IV assumptions, NNT/NNS) is stated plainly - it is universal, non-copyrightable science, and every diagram here is our own clean drawing. The course's specific framing and its lecturer's example numbers are paraphrased and re-checked, never copied from slides. The subject runs on A/Prof Aung Ko Win's modules with Prof Mark Jenkins' lectures; verify dates and weights against your own Canvas (the LMS), as cohort details can shift. 标准的遗传学与统计学经典(Hardy-Weinberg、LD 测度、Falconer h2、ACE 模型、曼哈顿图/QQ 图、MR 三个 IV 假设、NNT/NNS)均平实陈述 -- 它们是普适的、不受版权保护的科学,此处每张图都是我们自己干净的绘制。本课程 特定的表述及其讲师的示例数字均为转述并重新核对,绝不照搬幻灯片。本科目以 A/Prof Aung Ko Win 的模块和 Prof Mark Jenkins 的讲课为基础;请对照你自己的 Canvas(即 LMS)核实日期与权重,因为各届的细节可能变动。 POPH90111 . Genetic Epidemiology THE BLUEPRINT - THE ASSESSMENT BLUEPRINT 90% IN TWO ASSIGNMENTS Three take-home pieces, no exam 三份带回家作业,无考试[17]Source: asksia-cheatsheet-poph90111.pdfPOPH90111 Genetic Epidemiology UNIVERSITY OF MELBOURNE . POPULATION & GLOBAL HEALTH CALCULATION & METHOD REFERENCE Sem 1 2026 . SIDE 1 OF 2 Foundations -> heritability -> association SIDE 1/2 UNDERSTAND . Genetics primer . Hardy-Weinberg . LD . Familial aggregation (OR/RR/SMR/A) . Heritability (Falconer, ACE, liability) . Association & GWAS 0 . How To Use This READ FIRST * This subject is a pipeline: UNDERSTAND (is there a genetic role? - aggregation, heritability) -> DISCOVER (which variants? - LD, GWAS, MR) -> CHARACTERISE (how risky? - penetrance, modifiers, GxE) > USE (screening). Side 1 = understand & discover; side 2 = characterise & use. Assessment shape: online MCQ 10% (10 Qs, 1-week window) + written A1 40% (Modules 1-3) + A2 50% (Modules 4-8). All online / take-home - no invigilated exam. Every assignment task is one of three: (a) calculate + interpret , (b) discuss findings, (c) critically appraise a design. So the two high-value moves are: plug the right formula, then judge the design's bias. A Stata . do file is even handed out for A1 Q1 - expect software-based calculation, then a written interpretation. SIA > The mantra that earns marks everywhere: aggregation / high MZ-vs-DZ correlation is "evidence for, but not proof of, an inherited genetic aetiology. " Say it whenever you interpret aggregation or heritability. 1 . Genetics Primer EXTRA-MODULE 1 Locus = position on a chromosome. Allele = the base(s) there; minor allele = rarer one. Genotype = the pair (e. g. TT, TC, CC); homo- vs hetero-zygous. Mutation classes: silent (usually benign), missense (changes amino acid), nonsense (premature stop), frameshift indels (corrupt every downstream codon > usually pathogenic). CNV = larger gain/loss. 2 · Modes of Inheritance RISK INEQUALITIES Defined on Pr(phenotype | # risk alleles), not on "having" the trait: AUTOSOMAL Dominant: Pr(2)=Pr(1) > Pr(0) Recessive: Pr(2) > Pr(1)=Pr(0) Codominant: Pr(2) > Pr(1) > Pr(0) Carrier risk can be <1 (incomplete penetrance) and non-carrier risk >0 (phenocopies/sporadic). So a dominant variant can still have penetrance below 100%. SEGREGATION (PUNNETT) Each parent passes one randomly-chosen allele. Aaxaa -> 1/2 Aa, 1/2 aa (no AA). AaxAa -> 1/4 AA, 1/2 Aa, 1/4 aa => P(child carries ≥1 A)=3/4, P(AA)=1/4. Trap: the genotype gives the expected probability distribution, not the realised counts in a small sibship. 2b · Germline vs Somatic SAMPLE CHOICE · Inherited colorectal-cancer family risk -> germline > sample blood / buccal swab Tumour responds differently to chemo, no family history -> somatic > sample the tumour biopsy
-
4)给你一个“最后 48 小时冲刺计划”(按最省命的顺序)
- 第 1 轮(2–3 小时):练模板链条(不看答案重推)
- 2×2 → OR / Sens/Spec/PPV/NPV
- $h^2=2(r_{MZ}-r_{DZ})$ + ACE 三件套($A,C,E$)
- ARR → NNT → NNS,并补一句“target high-risk”。[4]Source: asksia-bible-poph90111-bilingual.pdfModules 4-8 · multi- week window Penetrance-from-OR, NNT, NNS Plug real incidence & ARR, get the number The list of study designs Name the signature bias & appraise it POPH90111 . Genetic Epidemiology ★ The format - open-book, calculation + appraisal 考试形式 -- 开卷,计算+评价 Nothing here is invigilated. The MCQ is a 30-minute LMS quiz of 10 questions, completable any time within one week of release, covering the Introduction + Module 1 (scope of genetic epi, the role of familial aggregation, the designs that measure it, and calculating & interpreting aggregation measures). The assignments are written documents submitted online over a multi-week window. Because the book is open, no fact is worth a mark on its own - only the method, the interpretation, and the appraisal are. 此处无任何监考。MCQ 是一份10题、30分钟的 LMS 测验,在发布后一周内任意时间可完成,涵盖引 言+模块 1(遗传流行病学的范畴、家族聚集性的作 用、测量它的设计,以及计算与解读聚集性测度)。两 份作业为多周窗口内在线提交的书面文档。由于本书 是开卷的,没有哪个事实本身值分 -- 只有方法、解 读和评价才值分。 ✓ The strategy this dictates 由此决定的策略 Every question is a chain: calculate - interpret ++ appraise. The recurring chains are - 2×2 - OR/RR -+ 'evidence for not proof of'; rMz, roz - Falconer h2 -+ equal-environments caveat; OR + population incidence - penetrance - ascertainment-bias check; ARR - NNT - NNS - target high-risk. For every design write its measure, its one strength and its one characteristic bias - LO5 (appraisal) threads through all 8 modules and is where the easy marks live. 每道题都是一条链:计算→解读→评价。反复出现 的链有 -- 2×2→OR/RR→“是证据而非证明”; 「MZ、『Dz→ Falconer h2→等环境注意事项;OR+ 人群发病率→ 外显率→ 确认偏倚核查;ARR → NNT → NNS → 靶向高风险。对每一个设计都写出它 的测度、它的一个优势和它的一个特征性偏倚 -- LO5(评价)贯穿全部8个模块,正是容易得分之 处。 POPH90111 . Genetic Epidemiology CONTENTS - CONTENTS The genetic-epi pipeline, end to end 遗传流行病学全流程,从头到尾 Four questions - understand, discover, characterise, use - one toolkit 四个问题 -- 理解、发现、刻画、应用一一共用一套工具 D1 The genetic-epidemiology pipeline: which method answers which question UNDERSTANDING is there a genetic role? DISCOVERING which genes / causes? CHARACTERISING how risky? USING IN PRACTICE should we screen / act? Familial aggregation GWAS / association Penetrance (M5) Screening (M8) Twin studies[15]Source: asksia-bible-poph90111-bilingual.pdfH2 = 2(0. 78-0. 46) h2 = 2(0. 78-0. 46) NARROW ≤ BROAD 狭义 ≤广义 POPH90111 . Genetic Epidemiology MODULE 2 . ACE MODEL 2. 5 AHA-unit - the ACE variance-components model 2. 5 AHA 单元 -- ACE 方差成分模型 Falconer gives you h2 but throws away the rest. The modern framing - ACE - partitions the standardised phenotypic variance (scaled to 1) into three pieces and solves for all of them. A = additive genetic, C = common (shared) environment, E = unique environment + measurement error. The trick is that A, C, E enter the MZ and DZ correlations differently, giving two equations in the unknowns. Falconer 给你 h2,却丢掉了其余部分。现代框架 -- ACE -- 把标准化的表型方差(缩放为1)分解成三块并将它们全部解 出。A=加性遗传,C= 共同(共享)环境,E= 独特环境+测量误差。诀窍在于 A、C、E 进入 MZ与 DZ 相关的方式各不 相同,从而给出关于这些未知量的两个方程。 D9 Fig 2. 3 - Each twin's phenotype is fed by latent A, C, E. The cross-twin A path is 1. 0 for MZ but 0. 5 for DZ; the C path is 1. 0 for both; E never crosses (it is, by definition, unique). Those three facts are the whole model. PhenotypeP1 (twin 1)ACEPhenotypeP2 (twin 2)ACETWIN 1TWIN 2rA = 1. 0 (MZ) / 0. 5 (DZ)rC = 1. 0 (both MZ & DZ)E uncorrelated across twins (no cross-twin path)A = additive geneticC = common (shared) envE = unique env + errorVar(P) = A + C + Eh2 = A / (A+C+E)c2 = C / (A+C+E)ACE: phenotypic variance = Additive genetic + Common env + unique Env Fig 2. 3 - Each twin's phenotype is fed by latent A, C, E. The cross-twin A path is 1. 0 for MZ but 0. 5 for DZ; the C path is 1. 0 for both; E never crosses (it is, by definition, unique). Those three facts are the whole model. 图 2. 3 -- 每个双生子的表型由潜在的A、C、E喂养。跨双生子的 A 路径在 MZ 为 1. 0、在 DZ为 0. 5;C 路径对两 者都为1. 0;E从不跨越(按定义它是独有的)。这三个事实就是整个模型。 POPH90111 . Genetic Epidemiology ACE EQUATIONS Scaling: A + C + E = 1 (variance set to 1) MZ corr : rMz = A + C DZ corr: roz = 12A + C Solve: A = 2(IMZ C = 2. rDZ rMz − E = 1 - rMZ Note A = Falconer's h2 exactly - ACE contains Falconer and adds C and E for free. 注意 A= Falconer 的h2,完全相等––ACE 包含 Falconer,并额外免费给出 C 与 E。 i Why subtract to get C and E 为何用相减得到 C 与 E Subtract the two correlation equations: (A+C) - (1/2A+C) = 1/2A, so A = 2(rMZ-rDZ). Whatever of rMZ is not A must be the shared environment, giving C = rMZ - A = 2rDZ - rMZ. And since MZ twins are genetically identical with identical C, any failure of rMZ to reach 1 is the unique part: E = 1 - rMZ. 把两个相关方程相减:(A+C)-(1/2A+C) = 1/2A,故 A =2(rMZ-rDZ)。rMZ 中不属于 A的部分必为共享环 境,得到 C=rMZ-A=2rDZ- rMZ。又因 MZ 双 生子遗传相同且 C 相同,rMZ 未能达到1的部分即为 独有部分:E=1-rMZ。 EX 2C Full ACE decomposition (female height) M . the marks-rich solve Using rMz = 0. 78 and rpz = 0. 46, split the variance into A, C and E. 用 rMz= 0. 78 与 rpz = 0. 46,把方差分解为 A、C 和 E。 1 A (additive genetic). A = 2(0. 78 - 0. 46) = 2(0. 32) = 0. 64 - identical to the Falconer answer. V[20]Source: asksia-cheatsheet-poph90111.pdfSIA > Show the working: in this subject the marks live in the setup and the interpretation, not the final digit. Always write the formula, the substitution, then one sentence of meaning. asksia. ai/cheatsheet/ unimelb-poph90111 . side 2/2 AskSia CHEATSHEET SERIES Disease screening = a systematic test to find asymptomatic disease/precursors in people not seeking care. Genetic screening = find risk-raising variants in asymptomatic people so risk can be reduced/prevented; can be population-wide but is usually targeted to high-prior-risk groups (e. g. strong family history). Course twist: a genetic test can be "once-and-for-all" (your germline doesn't change), unlike repeated disease screening. Two uses: screen for genetic risk, or use a genetic factor to screen for disease. 30 . Wilson-Jungner WHO 1968 The screening-evaluation checklist (the course adapts all 10 to genetics): · Facilities for diagnosis & treatment exist Trap: "we can test" # "we should screen. " A test only helps if knowing the result reduces disease/disability/death and benefits beat harms (psychological, social, insurance, variants of unknown significance, false positives). Example genes the course uses: BRCA1/2, the mismatch-repair (MMR) genes, HTT - note HTT (Huntington) has no risk-reduction, which weakens the case for screening. 31 . NNT & NNS QUANTIFY BENEFIT ARR = carrier risk x proportion risk reduced NNT = 1 / ARR NNS = NNT / carrier frequency Worked (BRCA1/2): carrier breast-cancer risk to 70 = 0. 4; tamoxifen cuts risk 50% => ARR = 0. 4×0. 5 = 0. 2 => NNT = 1/0. 2 = 5 carriers treated to prevent one cancer. Carrier freq 0. 0067 (1 in 150) = NNS = 5/0. 0067 = 746 screened per cancer prevented. High-FH group (carrier freq 0. 25) = NNS = 5/0. 25 = 20 - far more efficient => justifies targeted screening. CARRIER FREQ NNT NNS 0. 0067 (general) =746 0. 05 (moderate FH) 0. 25 (strong FH)
- 第 2 轮(1–2 小时):专刷 appraisal 句
- 第 3 轮(考前):做“术语三层背诵”
-
5)你告诉我 2 个信息,我就能把这份“复习重点”继续细化到你这次 final/作业的题型清单
-
你现在最需要冲的是哪一块(选 1–3 个即可):
- A 家族聚集性(2×2,OR/RR/SMR)
- B 遗传率(Falconer、ACE、liability threshold)
- C 筛查指标(Sens/Spec/PPV/ROC/AUC)
- D NNT/NNS + Wilson–Jungner(筛查是否值得做)
- E 批判性评价 LO5(design→bias→方向)
-
你离截止/提交还有几天?(我好按时间给你安排刷题强度)
-
(可选)如果你愿意,我也可以把你选的薄弱模块和常错陷阱记住,之后你每次来复习我都按这个顺序带你过。
Ask Sia for quick explanations, examples, and study support.