Fundamentals of Data Analysis

Course overview

FDA-N-CSAI.1.M.A — a basic Mathematics-area course in the Bachelor in Computer Science and Artificial Intelligence (BCSAI). It introduces probability models and statistical methods for analysing data: students learn to make inferences from statistics (functions of observed data), build confidence intervals, test hypotheses with one and two samples, run analysis of variance with one or more factors, and handle categorical data — using Python throughout.

The use of probability models and statistical methods for analysing data is common practice in virtually all scientific disciplines. Data analysis teaches us to make intelligent judgments and informed decisions in the presence of uncertainty and variation: if every component of a type had exactly the same lifetime, or human behaviour always led to the same decision, a single observation would reveal everything and statistics would be unnecessary. Because that is never the case, we need methods that not only analyse the results of experiments once carried out but also suggest how to run experiments efficiently — mitigating the effects of variation to give a better chance of correct conclusions. This course is the foundation for later subjects such as Algorithms and Data Structures, Probability for Computer Science, and AI: Statistical Learning and Prediction.

Concretely, students learn to make inferences from statistics — functions of observed data — proceeding from describing samples, to estimating parameters, to quantifying uncertainty with confidence intervals, to deciding between hypotheses, and finally to comparing several groups with analysis of variance. Python is used throughout as the working tool. The professor, Simón Isaza, holds a PhD in mathematical research (Complutense University of Madrid, cum laude, on topology and the theory of singularities) and has taught and researched at the National University of Colombia, Complutense and IE University, with prior experience as a financial-risk consultant at KPMG. Office hours are on request by e-mail.

Program

BCSAI (SEP-2025)

Course code

FDA-N-CSAI.1.M.A

Area

Mathematics

Learning objectives

By the end of the course students should acquire the following competencies — and reinforce a set of generic skills.

Subject competencies

Analyse and synthesise the main information content of univariate and multivariate data.
Compute probabilities and understand key concepts related to hypothesis testing.
Use random variables to model real phenomena.
Perform inferences in one and two populations.
Test hypotheses about populations.
Design experiments and run analysis of variance.
Deal with categorical data.

Generic skills

The ability to think analytically.
The use of statistical software and a programming language, namely Python.
The ability to think critically.

The objective is to give students the tools to delve into data sets and use that information across disciplines — computer science, engineering, physics, and more.

Teaching methodology

IE University's method is collaborative, active and applied: students build their knowledge through lectures, discussion, in-class exercises and Python field work, group work and individual study. Weighting of learning activities (total 150 hours):

Lectures30%

Exercises · async · field work30%

Individual studying30%

Group work6.7%

Discussions3.3%

Lectures

45 h · 30%

Exercises · async · field work

45 h · 30%

Individual studying

45 h · 30%

Group work

10 h · 6.7%

Discussions

5 h · 3.3%

Total workload

150 h · 100%

How sessions work. The 30 sessions alternate theory blocks that introduce each topic with practice blocks — by-hand problem solving, Python labs, and four graded problem sets. Each topic is typically taught over two to three theory sessions and then consolidated in a problem-solving session and a Python session, so concepts are seen, worked by hand, and implemented in code before being assessed.

Evaluation criteria

Ordinary evaluation combines the following criteria. A minimum grade of 3.5 in the final exam is required to pass — below that the course is failed even if the weighted average exceeds 5.0.

Final exam30%

Problem sets (×4)20%

Midterm exam20%

Computer exam (Python)20%

Class participation10%

Class participation — 10%

Deliverable: active, sustained contributions across sessions. Evaluation: assessed first on quantity (a threshold number of contributions sufficient for a reliable judgment) and then on quality — depth of insight, rigorous use of evidence, consistency of argument and realism, with comments that are concise, well-timed and engaged with the discussion.

Problem sets (×4) — 20%

Deliverable: four problem sets (sessions 11, 18, 25, 28) of theory and practice questions, solved by hand and/or in Python and submitted on BlackBoard as a multiple-answer test where roughly one third of options are correct. Evaluation: reinforce understanding of complex statistical concepts and prepare students for the midterm, computer and final exams.

Midterm exam — 20%

Deliverable: exam in session 20 (tentative — may move to session 18 or 19), via BlackBoard Ultra, covering Module 1 (Topics 1–4). Evaluation: format similar to the problem sets; closed book; one double-sided A4 handwritten formula sheet and a simple (non-programmable, non-graphical) calculator allowed; connect via the IE network or the grade is 0.

Computer exam (Python) — 20%

Deliverable: individual Python exam in session 29 — solve and discuss questions covering all course topics. Evaluation: open book; assesses both understanding of the material and competence in Python; communication is forbidden and the IE network is required.

Final exam — 30%

Deliverable: comprehensive exam in session 30 covering every session, similar in style to the problem sets with theory and practice questions; draws on the book, slides, problem sets and class notes. Evaluation: closed book; two double-sided A4 handwritten formula sheets and a simple calculator allowed.

Pass rule. A minimum of 3.5/10 in the final exam is required; below it the course is failed even if the weighted average exceeds 5.0.

Attendance. Students who do not meet the 80% attendance rule fail both the ordinary and extraordinary calls for the year and must re-enrol the following year.

Re-sit / re-take. Each course allows four chances across two academic years. Students failing the ordinary call may re-sit in June/July with a single comprehensive exam (continuous evaluation is not carried over), requiring physical presence in Segovia or Madrid; the re-sit grade is capped at 8.0 (notable). The retake (3rd call) caps at 10.0; retakers must check the assigned professor's criteria. Failing more than 18 ECTS in a year after the re-sits may lead to leaving the program. Grade appeals require prior attendance at the exam review session. GenAI use in any assessment is academic misconduct and may mean failing the assignment or the course.

Program — 30 sessions

The full session-by-session program, grouped into the two assessment modules. Theory sessions introduce each topic; practice sessions are problem solving and Python labs. Where a topic has a live demo on the interactive lab, the session links to it.

theory practice / Python exam

Module 1 Sampling, estimation, intervals & one-sample testing Topics 1–4 · sessions 1–20 · assessed in the midterm

Module 1 builds the inferential pipeline from the ground up. It starts with how a statistic behaves across random samples (sampling distributions and the Central Limit Theorem), uses that to produce single best-guess values for parameters (point estimation), attaches a measure of precision to those guesses (confidence intervals), and finally turns estimation into a decision procedure (one-sample hypothesis tests). Everything here concerns a single population.

On completing Module 1 you should be able to:

Describe and simulate the sampling distribution of a statistic and apply the Central Limit Theorem.
Compute point estimates by the method of moments and by maximum likelihood, and judge their quality.
Construct and interpret confidence intervals for a mean, proportion, variance and standard deviation.
Carry out z, t, proportion and variance tests for a single sample and reason about Type I/II error and power.
Implement one-sample estimation and testing in Python.

S1theoryPresentation & Topic 1 — Sampling distributions

Course presentation, then the behaviour of statistics computed from random samples and how a sampling distribution arises.

Statistics and their distributions
Random samples
Deriving a sampling distribution
Simulation experiments
Distribution of the sample mean
Central Limit Theorem
Distribution of a linear combination

Statistic & sampling distribution. A statistic is any quantity computed from the sample; because the sample is random, the statistic is itself a random variable whose probability distribution is its sampling distribution.

Random sample. Observations X₁,…,Xₙ that are independent and identically distributed (i.i.d.); independence is what lets us multiply densities and derive the distribution of functions of the sample.

Simulation experiments. When a sampling distribution is hard to derive analytically, we approximate it by repeatedly drawing samples and recording the statistic each time.

Key idea: the same fixed formula (e.g. the sample mean) gives a different number on every sample — its spread across samples is what we must model before we can do inference.

demo: CLT & sampling distribution

S2theoryTopic 1 — Sampling distributions (cont.)

Continue with the sample mean, the CLT and linear combinations; consolidate via simulation.

Distribution of the sample mean
Central Limit Theorem
Distribution of a linear combination

Distribution of the sample mean. If the X's have mean μ and variance σ², then the sample mean is unbiased for μ and its variance shrinks like 1/n, so larger samples give tighter estimates.

E[X̄] = μ · Var(X̄) = σ²/n · SE(X̄) = σ/√n

Central Limit Theorem. For large n the standardized sample mean is approximately standard normal regardless of the population shape (rule of thumb n > 30).

(X̄ − μ) / (σ/√n) → N(0, 1) as n → ∞

Linear combination. A weighted sum of independent normals is again normal, with mean and variance obtained by combining the parts.

E[Σ aᵢXᵢ] = Σ aᵢμᵢ · Var(Σ aᵢXᵢ) = Σ aᵢ²σᵢ² (indep.)

Worked idea: averaging n = 100 rolls of a fair die (μ = 3.5, σ ≈ 1.71) gives X̄ that is ≈ normal with SE = 1.71/√100 ≈ 0.171 — so X̄ lands within ±0.34 of 3.5 about 95% of the time.

Reading: Devore (DEV), Probability & Statistics for Engineering and the Sciences, Ch. 5 §5.3 (statistics & their distributions), §5.4 (distribution of the sample mean & the CLT), §5.5 (distribution of a linear combination).

Devore Ch. 5 §5.3–5.5demo: CLT

S3theoryTopic 2 — Point estimation

Use a sample to produce a single best-guess value (point estimate) for a population parameter.

General concepts of point estimation
Methods of point estimation
Method of moments
Maximum likelihood estimation

Point estimate vs. estimator. An estimator θ̂ is a rule (a statistic); a point estimate is the single number it returns on the observed sample — an educated guess for the true parameter θ.

Good-estimator criteria. We prefer estimators that are unbiased (right on average) and have small variance; mean squared error combines both.

MSE(θ̂) = Var(θ̂) + [bias(θ̂)]² · bias = E[θ̂] − θ

Key idea: there are many possible estimators for the same θ; the goal of this topic is to generate sensible ones (method of moments, MLE) and to compare how precisely they hit the target.

demo: MLE

S4theoryTopic 2 — Point estimation (cont.)

Develop the estimation methods and compare them.

Method of moments
Maximum likelihood estimation
Properties of estimators

Method of moments (MoM). Set the first few population moments equal to the matching sample moments and solve for the parameters — simple and always defined.

E[Xᵏ] = (1/n) Σ Xᵢᵏ for k = 1, 2, … then solve for θ

Maximum likelihood (MLE). Choose the parameter value that makes the observed data most probable by maximizing the (log-)likelihood.

θ̂ = argmaxθ ℓ(θ) = argmaxθ Σ ln f(Xᵢ; θ)

Worked idea: for a normal sample the MLE of μ is the sample mean X̄, and the MLE of σ² is the average squared deviation (1/n)Σ(Xᵢ − X̄)² — a biased but consistent estimate of variance.

demo: MLE demo: MoM vs MLE

S5theoryTopic 2 — Point estimation (cont.)

Wrap up point estimation; contrast the sampling behaviour of competing estimators.

Method of moments
Maximum likelihood estimation

Comparing estimators. When MoM and MLE disagree, we compare their sampling behaviour — bias, variance and efficiency — to decide which to trust.

Large-sample properties. Under mild conditions the MLE is consistent (converges to θ), asymptotically unbiased, and asymptotically normal, which underpins later interval and test formulas.

Key idea: MLE is usually more efficient (smaller variance) than MoM and is the default in this course, but MoM is a handy starting point and sometimes the only closed-form option.

Reading: Devore Ch. 6 — general concepts of point estimation and methods of point estimation (MoM, MLE).

demo: MoM vs MLE

S6practiceSampling distributions & point estimation

Problem solving across Topics 1 and 2.

Problem solving

Goal. Apply the CLT to compute probabilities about X̄, and derive MoM/MLE estimates for parameters of common distributions (normal, exponential, Poisson, Bernoulli) by hand.

Key idea: typical exercises ask "what is P(X̄ > c)?" via standardization, and "what is θ̂?" via either matching moments or maximizing the log-likelihood — the two pillars of Module 1's first half.

demo: CLT demo: MoM vs MLE

S7theoryTopic 3 — Statistical intervals based on a single sample

Report a range of plausible values — a confidence interval — to convey the precision of an estimate.

Basic properties of confidence intervals
Large-sample CIs for a mean & proportion
Intervals based on a Normal population
CIs for variance & standard deviation

What a CI is. A confidence interval is a random interval that, before sampling, has a stated probability (the confidence level, e.g. 95%) of containing the true parameter. Its width conveys the precision of the estimate.

Large-sample CI for a mean. Built from the point estimate ± a critical value times the standard error.

x̄ ± z(α/2) · σ/√n (known σ) · 95% → z = 1.96

Normal-population CI (σ unknown). Uses the t distribution with n−1 degrees of freedom; for a proportion, a score/Wald interval based on p̂.

x̄ ± t(α/2, n−1) · s/√n · p̂ ± z(α/2)·√(p̂(1−p̂)/n)

Key interpretation: "95% confident" refers to the procedure — over many samples, 95% of the intervals it produces cover θ; a single computed interval either does or does not.

demo: CI coverage demo: CI width

S8theoryTopic 3 — Statistical intervals (cont.)

Continue with confidence intervals for means, proportions and variances.

Large-sample CIs
Normal-based intervals
CIs for variance & SD

CI for a variance / SD. For a normal population the scaled sample variance follows a chi-square distribution, giving an asymmetric interval for σ².

( (n−1)s² / χ²(α/2,n−1), (n−1)s² / χ²(1−α/2,n−1) )

Width vs. confidence vs. n. Higher confidence widens the interval; larger n narrows it (∝ 1/√n). This is the precision/confidence trade-off.

Worked idea: with n = 25, s = 4 and 95% confidence, the mean CI half-width is t(0.025,24)·4/√25 = 2.064·0.8 ≈ 1.65; quadrupling n to 100 roughly halves it.

Reading: Devore Ch. 7 — basic properties of CIs, large-sample CIs for a mean & proportion, intervals based on a normal population, CIs for the variance and SD.

demo: CI coverage demo: CI width

S9practiceStatistical intervals based on a single sample

Problem solving on confidence intervals.

Problem solving

Goal. Choose the correct interval (z vs. t vs. χ²) for the parameter and assumptions at hand, compute it, and interpret its meaning and width.

Key idea: the hardest part is not arithmetic but selection — is σ known? is the population normal? is the parameter a mean, proportion or variance? — which dictates the formula.

demo: CI coverage

S10practiceOne-sample inference with Python

Implement one-sample estimation and intervals in Python.

Python lab
One-sample inference

Tooling. Use NumPy/Pandas to summarize data and SciPy stats for distributions and intervals (e.g. stats.t.interval, stats.norm.ppf).

Key idea: the lab connects the hand formulas to code — compute x̄, s, the critical value and the interval, then check coverage by simulation to see the 95% guarantee emerge.

Python · NumPy/SciPydemo: CI width

S11practiceProblem Set 1

Assessed problem set covering Topics 1–3 (BlackBoard multiple-answer test).

Problem Set 1

Deliverable. Theory + practice questions on sampling distributions, point estimation and confidence intervals, solved by hand and/or in Python and submitted on BlackBoard.

Format: multiple-answer test where roughly one third of the options are correct — partial reasoning matters, so work each item fully before selecting.

assessed · 20% (with PS 2–4)

S12theoryTopic 4 — Tests of hypotheses based on a single sample

Decide between two contradictory claims about a parameter — the core of hypothesis testing.

Hypotheses & test procedures
z tests for a population mean
One-sample t test
Tests for a proportion
Tests for a variance
Further aspects of testing

Hypotheses. State a null H₀ (the default claim) against an alternative Hₐ; the test decides whether the data give strong enough evidence against H₀.

z / t test for a mean. Standardize the estimate under H₀ to form a test statistic and compare it with a critical value (or use the p-value).

z = (x̄ − μ₀)/(σ/√n) · t = (x̄ − μ₀)/(s/√n), df = n−1

p-value. The probability, if H₀ is true, of a statistic at least as extreme as observed; reject H₀ when p ≤ α.

Proportion & variance tests. z test on p̂ for a proportion; a χ² statistic (n−1)s²/σ₀² for a variance.

Key idea: a test trades off two errors — Type I (reject a true H₀, controlled at α) and Type II (fail to reject a false H₀, probability β); 1 − β is the power.

demo: one-sample z/t demo: p-value demo: power

S13theoryTopic 4 — Tests of hypotheses (cont.)

Continue with single-sample tests and the trade-offs between error types and power.

z and t tests
Tests for proportion & variance
Type I/II error & power

Power & sample size. Power rises with the true effect size, with n, and with α; designing a study means choosing n so power is adequate (often 0.8) at a relevant effect.

power = P(reject H₀ | Hₐ true) = 1 − β

One- vs. two-sided & the CI–test duality. A two-sided level-α test rejects H₀: μ = μ₀ exactly when μ₀ falls outside the (1−α) confidence interval.

Worked idea: testing H₀: μ = 100 with x̄ = 104, s = 10, n = 25 gives t = 4/(10/5) = 2.0; against t(0.025,24) = 2.064 this just fails to reject at 5% — a borderline result the p-value makes precise.

Reading: Devore Ch. 8 — hypotheses & test procedures, z tests for a mean, the one-sample t test, tests for a proportion and a variance, and further aspects of testing (errors, power, p-values).

demo: p-value demo: power

S14practiceTests of hypotheses based on a single sample

Problem solving on one-sample hypothesis tests.

Problem solving

Goal. Set up H₀/Hₐ, pick the right statistic (z, t, proportion-z, χ²), compute it, find the p-value or critical region, and state a conclusion in context.

Key idea: common traps are one- vs. two-sided alternatives and confusing "fail to reject H₀" with "H₀ is true" — absence of evidence is not evidence of absence.

demo: one-sample z/t

S15practiceHypothesis testing with Python — one sample

Run one-sample hypothesis tests in Python.

Python lab
One-sample hypothesis testing

Tooling. SciPy stats.ttest_1samp for the t test, proportions_ztest (statsmodels) for proportions, plus manual statistics to confirm the library output.

Key idea: reading SciPy's returned statistic and p-value correctly — and matching it to a hand calculation — is the skill the computer exam later rewards.

Python · SciPy/statsmodelsdemo: one-sample z/t

S16theoryTopic 5 — Inferences based on two samples

Confidence intervals and tests for a difference between two population parameters.

z tests & CIs for a difference of means
Two-sample t test & CI
Analysis of paired data
Difference of proportions
Ratio of two variances

Difference of means. Inference targets μ₁ − μ₂; the standard error combines both samples' variability.

t = (x̄₁ − x̄₂)/√(s₁²/n₁ + s₂²/n₂) (Welch two-sample t)

Difference of proportions. z statistic on p̂₁ − p̂₂, pooling under H₀ for the test.

z = (p̂₁ − p̂₂)/√( p̂(1−p̂)(1/n₁ + 1/n₂) )

Ratio of variances. Compares σ₁² and σ₂² via an F statistic equal to the ratio of sample variances.

F = s₁²/s₂², df = (n₁−1, n₂−1)

Key idea: "independent samples" (two separate groups) need a two-sample test, whereas matched observations need the paired test introduced next — choosing wrongly inflates the error.

demo: two-sample t demo: paired

S17theoryTopic 5 — Inferences based on two samples (cont.)

Continue with two-sample tests, paired data and variance comparisons.

Two-sample t test & CI
Analysis of paired data
Two population variances

Paired data. When each unit yields two measurements (before/after, matched pairs), analyse the differences dᵢ = X₁ᵢ − X₂ᵢ with a one-sample t test on d̄.

t = d̄ / (s_d/√n), df = n−1

Why pairing helps. Removing unit-to-unit variability shrinks the standard error, so a paired design is more powerful than two independent samples of the same size when the pairing is informative.

Worked idea: measuring blood pressure on the same 20 patients before and after a drug, you test whether the mean difference d̄ differs from 0 — not whether two independent group means differ.

Reading: Devore Ch. 9 — z and t procedures for a difference of means, the two-sample t test & CI, analysis of paired data, difference of proportions, and the F test for two variances.

demo: two-sample t demo: paired

S18practiceProblem Set 2

Assessed problem set covering one- and two-sample inference.

Problem Set 2

Deliverable. BlackBoard multiple-answer test on one-sample testing (Topic 4) and two-sample inference (Topic 5), by hand and/or in Python.

Key idea: emphasises correctly identifying the design (one sample, two independent samples, or paired) before computing — the recurring decision of this module.

assessed · part of 20%

S19theoryTopic 5 — Inferences based on two samples (cont.)

Final two-sample session before the midterm.

Difference of means
Paired data
Difference of proportions

Consolidation. Pull together pooled vs. unpooled (Welch) t tests, paired analysis, proportion and variance comparisons, and the choice of one- vs. two-sided alternatives.

Key idea: this session doubles as midterm review for Topics 1–4 — sampling distributions, estimation, intervals and one-sample testing are the examinable core.

demo: two-sample t

S20examMidterm exam — Module 1 (Topics 1, 2, 3 & 4)

Closed-book exam via BlackBoard Ultra. Date tentative — may fall in session 18 or 19 depending on class pace. One double-sided A4 formula sheet and a simple calculator allowed.

Sampling distributions
Point estimation
Confidence intervals
One-sample testing

Scope & format. Module 1 only (Topics 1–4); style similar to the problem sets. Closed book; one handwritten double-sided A4 formula sheet and a simple (non-programmable, non-graphical) calculator are permitted.

Rules: no phones/tablets/laptops; communication forbidden; you must connect via the IE network or the grade is 0.

assessed · 20%

Module 2 Two-sample inference, ANOVA & comprehensive exams Topics 5–7 · sessions 21–30

Module 2 generalises inference from one population to many. It first compares two populations (differences of means, paired data, differences of proportions, ratios of variances), then compares several groups at once with single-factor ANOVA, and finally studies the simultaneous effect of two or more factors with multifactor ANOVA. The module closes with the comprehensive computer and final exams.

On completing Module 2 you should be able to:

Build confidence intervals and tests for a difference between two population means and proportions.
Recognise paired designs and analyse them with the appropriate paired procedure.
Compare two variances and set up an F-based comparison.
Decompose total variability and run a single-factor ANOVA, including multiple comparisons.
Extend ANOVA to two factors, including fixed-, random- and mixed-effects and randomized block designs.
Carry out all of the above in Python and interpret the output.

S21practiceInference based on two samples

Problem solving on two-sample inference.

Problem solving

Goal. Work through difference-of-means, paired, difference-of-proportions and variance-ratio problems, deciding pooled vs. Welch and one- vs. two-sided in each.

Key idea: the variance-ratio (F) test also doubles as the assumption check for the pooled two-sample t test — equal variances justify pooling.

demo: two-sample t demo: paired

S22practiceInference based on two samples with Python

Two-sample tests and confidence intervals implemented in Python.

Python lab
Two-sample inference

Tooling. SciPy ttest_ind (set equal_var for pooled vs. Welch), ttest_rel for paired data, and f_oneway preview before ANOVA.

Key idea: the equal_var flag encodes the pooled/Welch choice — getting it right in code mirrors the by-hand decision.

Python · SciPydemo: two-sample t

S23theoryTopic 6 — Analysis of variance: single factor

One-way ANOVA: comparing quantitative responses across more than two populations or treatments.

Single-factor ANOVA
Multiple comparisons in ANOVA
More on single-factor ANOVA

What ANOVA does. Tests whether k ≥ 2 group means are all equal by comparing variability between groups to variability within groups.

F = MSTr / MSE = SSTr/(k−1) ÷ SSE/(N−k)

Sum-of-squares identity. Total variability splits into a treatment (between) part and an error (within) part — the heart of "analysis of variance".

SST = SSTr + SSE

Multiple comparisons. A significant F says "some means differ"; methods such as Tukey's HSD then locate which pairs differ while controlling the family-wise error rate.

Key idea: running many pairwise t tests inflates Type I error, so ANOVA gives one omnibus test first; large F (small p) means the between-group spread is too big to be chance.

demo: one-way ANOVA (F = MSB/MSE)

S24practiceANOVA single factor with Python

Run a one-way ANOVA in Python and interpret the F test.

Python lab
Single-factor ANOVA

Tooling. SciPy f_oneway or statsmodels ols(...).fit() + anova_lm for a full table; pairwise_tukeyhsd for post-hoc comparisons.

Reading the table. Identify SSTr, SSE, df, MS, the F value and p, and check assumptions (independence, normal residuals, equal variances).

Reading: Devore Ch. 10 — single-factor ANOVA, multiple comparisons, and further topics on one-way ANOVA.

Python · statsmodelsdemo: ANOVA

S25practiceProblem Set 3

Assessed problem set covering two-sample inference and single-factor ANOVA.

Problem Set 3

Deliverable. BlackBoard multiple-answer test spanning Topic 5 (two samples) and Topic 6 (one-way ANOVA), by hand and/or in Python.

Key idea: tests the link between a two-group t test and a two-group ANOVA — with k = 2 groups, F = t², so the two procedures agree.

assessed · part of 20%

S26theoryTopic 7 — Multifactor analysis of variance

Extend ANOVA to two or more simultaneous factors.

Two-factor ANOVA
The fixed-effects model
Randomized block experiments
Random & mixed-effects models

Two-factor ANOVA. Two factors (A, B) act at once; total variability splits into main effects, an interaction, and error.

SST = SSA + SSB + SSAB + SSE

Interaction. A significant A×B interaction means the effect of one factor depends on the level of the other, so main effects can't be read in isolation.

Fixed / random / mixed effects. Factor levels may be deliberately chosen (fixed) or sampled from a population of levels (random); a mix gives a mixed model — this changes which mean squares form the F denominators.

Randomized block design. Blocking groups similar units to remove a nuisance source of variation, increasing precision much as pairing did for two samples.

Key idea: always inspect the interaction first; only if it is negligible do the individual factor main-effect tests have a clean interpretation.

Reading: Devore Ch. 11 — two-factor ANOVA, the fixed-effects model, randomized block experiments, and models with random and mixed effects.

demo: ANOVA (variance decomposition)

S27practiceANOVA multifactor with Python

Fit and interpret a multifactor ANOVA in Python.

Python lab
Two-factor ANOVA

Tooling. statsmodels formula API — e.g. ols('y ~ C(A) * C(B)', data).fit() then anova_lm(model, typ=2) — where the * includes both main effects and the interaction.

Key idea: interaction plots (group means across one factor, lines per level of the other) make a significant A×B term visible as non-parallel lines.

Python · statsmodelsdemo: ANOVA

S28practiceProblem Set 4

Final assessed problem set covering multifactor ANOVA.

Problem Set 4

Deliverable. BlackBoard multiple-answer test on Topic 7 — two-factor ANOVA, interaction, fixed/random/mixed effects and blocking — by hand and/or in Python.

Key idea: the four problem sets together (PS1–PS4) make up the 20% problem-set component and are the best preparation for the computer and final exams.

assessed · completes the 20%

S29examComputer exam with Python

Individual, open-book Python exam: solve and discuss questions covering all topics of the course, evaluating both understanding and Python use.

Open book · Python
All topics

Scope & format. All Topics 1–7, taken individually. Open book; you write and run Python and explain your reasoning, so it rewards the labs in sessions 10, 15, 22, 24 and 27.

Rules: communication forbidden; you must connect via the IE network or the grade is 0.

assessed · 20%

S30examFinal exam

Comprehensive closed-book exam covering all content from the first to the last session, similar in style to the problem sets with theory and practice questions. Minimum 3.5 required to pass; two double-sided A4 formula sheets and a simple calculator allowed.

All topics 1–7
Closed book

Scope & sources. Every session; draws on the book, the slides, the problem sets and class notes. Closed book; two handwritten double-sided A4 formula sheets and a simple (non-programmable, non-graphical) calculator permitted.

Pass rule: you need at least 3.5/10 here — fall short and you fail the course even if your weighted average exceeds 5.0. Re-sits (June/July) are a single comprehensive exam capped at 8.0.

assessed · 30% · min. 3.5 to pass

Bibliography & key concepts below. Each topic above is anchored in the primary textbook; the glossary and annotated bibliography that follow collect the recurring terms and reading map for the whole course.

Key concepts

A quick-reference glossary of the terms that recur throughout the program — useful when building your A4 formula sheet for the exams.

Population & parameter: The full set of units of interest and a fixed numerical feature of it (e.g. mean μ, proportion p, variance σ²) that we try to learn about.
Random sample: Observations X₁,…,Xₙ drawn independently and identically (i.i.d.) from the population.
Statistic: Any quantity computed from the sample (e.g. X̄, s²); being a function of random data, it is itself random.
Sampling distribution: The probability distribution of a statistic across all possible samples — the object inference is built on.
Standard error: The standard deviation of a statistic's sampling distribution; for the mean, SE = σ/√n
Central Limit Theorem: For large n the standardized sample mean is approximately N(0,1) whatever the population shape.
Point estimator / estimate: A rule θ̂ for guessing a parameter, and the single value it returns on the observed sample.
Bias & unbiasedness: bias = E[θ̂] − θ; an estimator is unbiased when its expected value equals the parameter.
Method of moments (MoM): Estimate parameters by equating sample moments to population moments and solving.
Maximum likelihood (MLE): Estimate by maximizing the likelihood of the observed data; usually efficient and asymptotically normal.
Consistency: An estimator is consistent if it converges to the true parameter as n grows.
Confidence interval: A random interval covering the parameter with a stated long-run probability (the confidence level).
Confidence level: 1 − α, the proportion of such intervals that contain the parameter over repeated sampling (e.g. 95%).
Null & alternative hypotheses: H₀ is the default claim; Hₐ is the competing claim the data may support.
Test statistic: A standardized quantity (z, t, χ², F) measuring how far data fall from H₀.
p-value: Probability, under H₀, of a result at least as extreme as observed; reject H₀ when p ≤ α.
Significance level α: The chosen Type I error rate — the probability of rejecting a true H₀.
Type I & Type II error: Rejecting a true H₀ (prob. α) vs. failing to reject a false H₀ (prob. β).
Power: 1 − β, the probability of correctly rejecting a false H₀; rises with effect size, n and α.
t distribution: Bell-shaped, heavier-tailed than normal, used when σ is estimated by s; indexed by degrees of freedom.
Degrees of freedom: The number of independent pieces of information available to estimate variability (e.g. n − 1).
Paired data: Two measurements on the same unit; analysed through the within-pair differences.
ANOVA: Analysis of variance — compares group means by partitioning total variability into between- and within-group parts.
F statistic: Ratio of two variance estimates (MSTr/MSE in ANOVA, s₁²/s₂² for two variances); large F signals real differences.
Interaction: In multifactor ANOVA, when the effect of one factor depends on the level of another.
Fixed vs. random effects: Factor levels deliberately chosen (fixed) vs. sampled from a population of levels (random); a mix is a mixed model.
Multiple comparisons: Post-hoc procedures (e.g. Tukey's HSD) that locate which group means differ while controlling family-wise error.

Annotated bibliography

The course is anchored on Devore's textbook (referenced as DEV in the syllabus), supported by the slides, problem sets and class notes. Chapter mapping below indicates where each source feeds the program.

Jay L. Devore — Probability and Statistics for Engineering and the Sciences (DEV). The primary textbook for every topic: theory, worked examples and exercises that mirror the problem sets and exams. all sessions · primary reference
DEV, Chapter 5 §5.3–5.5 — Statistics & their distributions; sampling distribution of the mean & CLT; linear combinations. The explicitly suggested reading for Topic 1. Topic 1 · sessions 1–2, 6
DEV, Chapter 6 — Point estimation. General concepts, the method of moments and maximum likelihood, and estimator properties. Topic 2 · sessions 3–6
DEV, Chapter 7 — Statistical intervals for a single sample. Confidence intervals for a mean, proportion, variance and standard deviation. Topic 3 · sessions 7–10
DEV, Chapter 8 — Tests of hypotheses (single sample). z and t tests, proportion and variance tests, errors, power and p-values. Topic 4 · sessions 12–15
DEV, Chapter 9 — Inferences based on two samples. Differences of means and proportions, paired data, and comparison of two variances. Topic 5 · sessions 16–22
DEV, Chapter 10 — Single-factor ANOVA. One-way ANOVA, the F test and multiple-comparison procedures. Topic 6 · sessions 23–25
DEV, Chapter 11 — Multifactor ANOVA. Two-factor ANOVA, fixed/random/mixed effects and randomized block designs. Topic 7 · sessions 26–28
Course slides, problem sets & class notes. Companion materials; the final exam may draw on all of them, so the syllabus advises studying them alongside the book. all sessions · exam preparation
Python ecosystem — NumPy, Pandas, SciPy stats, statsmodels. The working tools for every practice/Python lab and the open-book computer exam. sessions 10, 15, 22, 24, 27, 29