COMP9120 · Database Management Systems
Foundations & Data Models
Foundations & Data Models is the opening block of COMP9120 Database Management Systems at the University of Sydney, where you learn what a database management system (DBMS) actually is and the vocabulary the rest of the unit assumes. It fixes four ideas — the services a DBMS provides, the difference between OLTP and OLAP workloads, physical vs logical data independence, and the six-stage database design pipeline — that reappear inside every later exam question on ER modelling, SQL, normalisation and transactions.
What this chapter covers
- 01What a DBMS is and the services it bundles: data independence, integrity, concurrency, recovery, a query language, persistence and sharing
- 02OLTP (operational, write-heavy, normalised) vs OLAP (decision-support, read-heavy, denormalised) workloads
- 03Physical data independence: change storage or indexes without touching the schema or applications
- 04Logical data independence: change the conceptual schema while existing views and apps keep working
- 05The three-level architecture: external (views) / conceptual (logical schema) / internal (physical storage)
- 06The six-stage design pipeline: requirements, conceptual, logical, refinement, physical, application & security
- 07Which pipeline stage each later chapter owns (ER, ER-to-relational mapping, normalisation, storage/indexing)
- 08How to name the right term precisely to earn method marks in an applied exam
Classify the workload and the change type
- +1For (a)-(d) ask: is it a short read/write of a few rows (OLTP) or a long read-only aggregate over history (OLAP)?
- +1(a) records one new row and (c) writes a member's row, so both are OLTP (operational, write-heavy).
- +1(b) and (d) each aggregate over two years of history for planning, so both are OLAP (decision support, read-heavy).
- +1For (e)-(f) ask: which level changed - internal (physical independence) or conceptual (logical independence)?
- +1(e) swaps storage structures at the internal level only, so the schema and views are unchanged: physical data independence.
- +1(f) grows the conceptual schema but leaves existing external views and apps working: logical data independence.
Key terms
- DBMS
- Database management system: software that manages a large, persistent, shared collection of data and provides data independence, integrity, concurrency control, recovery and a query language.
- OLTP
- Online transaction processing: operational, write-heavy workloads made of many short transactions over a normalised schema (for example, recording an enrolment).
- OLAP
- Online analytical processing: decision-support, read-heavy workloads made of large aggregate queries over historical data, typically on a denormalised design.
- Physical data independence
- The ability to change storage structures or indexes (the internal level) without altering the conceptual schema or the applications above it.
- Logical data independence
- The ability to change the conceptual schema without breaking existing external views and the applications that use them.
- Three-level architecture
- The separation of a database into an external level (user views), a conceptual level (one logical schema), and an internal level (physical files and indexes).
- Conceptual schema
- The single community-wide logical description of the data - entities, relationships and constraints - independent of any particular DBMS or storage layout.
- Design pipeline
- The staged path from a problem to a running database: requirements, conceptual (ER) design, logical (relational) design, schema refinement (normalisation), physical design, and application & security design.
Foundations & Data Models FAQ
Is Foundations & Data Models heavily examined in COMP9120?
The COMP9120 final exam is applied and problem-based - it asks you to write SQL and relational algebra, map an ER diagram to relations, decompose to BCNF and reason about transactions - so foundations concepts rarely carry standalone marks. Instead they are threaded through those questions as vocabulary, so being precise about terms like conceptual vs internal, physical vs logical data independence, and OLTP vs OLAP earns easy method marks and prevents mislabelling. Confirm the exact structure for your offering on Canvas.
What is the difference between physical and logical data independence?
Physical data independence lets you change storage or indexes (the internal level) - for example replacing a heap file with a B+-tree - without touching the schema or the applications. Logical data independence lets you change the conceptual schema - for example adding a column or splitting a table - while existing external views and applications keep working. The test is which architectural gap the change crosses: internal-to-conceptual is physical, conceptual-to-external is logical.
Can AI help me with database design foundations in COMP9120?
Yes, as a study aid. Sia is an AI tutor that explains concepts step by step - it can walk you through why a workload is OLTP or OLAP, trace the six-stage design pipeline on a practice scenario, or check your reasoning about data independence, so you build the understanding yourself. It will not hand you completed assignment or exam answers and cannot promise a mark or a pass; the aim is to help you learn the material, which is what the closed-book exam rewards.
Studying with AI? Sia — free AI data science tutor works through COMP9120 step by step.
Exam move
Treat this chapter as vocabulary you must own cold rather than a big source of marks. Learn the DBMS services, the OLTP/OLAP distinction, the two data independences and the six design stages, and - most usefully - memorise which later chapter owns each stage (conceptual design is ER modelling, logical design is ER-to-relational mapping, refinement is normalisation, physical design is storage and indexing). When you then meet an applied question, first name the stage you are in, because a conceptual question wants a diagram, a logical question wants relations with keys and foreign keys, and a refinement question wants functional dependencies and a decomposition. Because the final exam is 2 hours and closed book, spend most revision time drilling those applied stages and only enough here to use the terms precisely.