Learning to Observe, Experiment and Survey
An introductory research-methods course for students with little prior background in methodology. It maps directly onto the interactive demos in this repo — every session below that has a matching visualization links straight to it.
Students learn the different types of research (experimental, correlational, case studies, surveys) and the phases of scientific inquiry. The course is hands-on: students participate in several experiments via workshops, conduct a small observational field study in groups, and learn the ethical principles governing research with human and non-human participants.
Throughout, every concept is applied to real-world situations — behavioral economics, digital marketing and communications, and emerging technologies — to highlight the contribution of empirical research across domains.
Learning objectives
The goal is to introduce quantitative and qualitative methodology and provide the tools for basic empirical research, enhancing the ability to think critically and scientifically about everyday problems. Specifically, the course is designed to:
- Develop the ability to think critically about research, including understanding how research methodology is used to answer basic scientific questions.
- Evaluate the research process using classic quality standards from both a qualitative and quantitative perspective (reliability, validity, triangulation, etc.).
- Accurately communicate scientific research via PowerPoint in an engaging manner.
Teaching methodology
Lecture- and activity-based. Each topic has two parts: first the relevant theory, then the practical application to real-world issues through in-class discussions and debates, a group project/presentation, and labs and workshops. Labs focus on analyzing, interpreting and describing results in proper APA format; workshops let students conduct and participate in their own research. Students read assigned materials or watch short videos before class.
Labs teach you to analyze, interpret and describe the results of a study (correlational, experimental, etc.) in proper APA format — the "read and report" half of the craft. Workshops are experiential: you participate in or run real studies (between-participants, mixed factorial, an observational field study) and then dissect the data you generated.
GenAI use is encouraged to build an informed, critical perspective — but low-effort prompts give low-quality output, and you must assume any output is wrong until you can cross-check it. Using AI must be acknowledged (listing prompts and how outputs were used); doing so does not affect your grade, but undisclosed use violates academic-honesty policy.
Assessment / evaluation
Continuous evaluation. Attendance is not graded, but a minimum of 80% attendance is required to remain in the ordinary/extraordinary calls. A minimum grade of 3.5 on the final exam is required to pass the course, regardless of the weighted average.
Deliverable: an in-class exam covering material from the lecture slides only. Evaluation: multiple-choice questions plus one long-answer, theory-based question that asks you to thoroughly explain a theory and apply it through two real-world examples.
Deliverable: a cumulative in-class exam (slides only): multiple-choice plus a long-answer question that asks you to weigh the advantages/disadvantages of methodological techniques and identify all the methodological flaws in a study. Evaluation: graded against accuracy and completeness; a minimum of 3.5/10 on this exam is required to pass, no matter your average.
Deliverable: five in-class quizzes of 10 multiple-choice questions, each testing the prior class's material. Evaluation: falls in Sessions 5, 7, 9, 11 and 13 (dates confirmed in class and on Blackboard).
Deliverable: active engagement in the short in-class activities, debates and small-group exercises run most sessions. Evaluation: measured via periodic in-class activities that apply concepts to novel situations; quality of engagement, not mere presence, is scored.
Deliverable: in groups of 4–5, a PowerPoint/Google-Slides talk on two studies from a single empirical article, mirroring a conference talk; submitted via Turnitin on the presentation date. Evaluation: against a rubric posted on Blackboard (two sample presentations provided).
Deliverable: complete five anonymous online experiments (10–15 min each), one week per experiment, that illustrate concepts from lecture. Evaluation: credit for completion; responses are 100% anonymous.
Deliverable: for each of five prompts, one original post plus one secondary reply to a classmate, by Sunday midnight (Madrid). Evaluation: quality over quantity on a 0–2 scale per board (0 = 0%, 1 = 0.5%, 2 = 1%).
Program — 30 sessions across 6 modules
The full session-by-session schedule from the syllabus. Each session is lecture- and activity-based with background readings. exercise / lab key reading try the demo → graded milestone
The Research Process
goals, questions, common sense vs. science, judgment biasesWhy we cannot simply trust intuition. Before learning how to do research, this module shows why we need it: human judgment is systematically biased, so "common sense" is an unreliable guide to truth. It surveys the major cognitive biases, then introduces the scientific method as the disciplined alternative — empirical, testable, and self-correcting.
- Distinguish science from common sense and pseudoscience, and name the goals of empirical research.
- Identify common cognitive biases (confirmation, hindsight, attribution, self-serving) in everyday reasoning.
- State a testable hypothesis and write an operational definition of a construct.
- Explain reliability, validity, replication, and Type I vs. Type II error.
- Objectives, contents, schedule, evaluation system, testing, common sense
- The role of common sense; how to think about ourselves and others
Sets up the course logistics and its central tension: our intuitions about behaviour feel obvious yet are often wrong or contradictory ("opposites attract" vs. "birds of a feather flock together"). Science exists precisely because common sense cannot adjudicate between such claims.
Key idea: "common sense" frequently produces mutually contradictory predictions, so we need systematic, empirical methods to decide which is actually true.
- Cognitive biases, confirmation bias, self-fulfilling prophecy
- Belief perseverance error, overconfidence effect
Confirmation bias is the tendency to seek and weight evidence that supports what we already believe. A self-fulfilling prophecy occurs when an expectation changes behaviour so that the expectation comes true. Belief perseverance is clinging to a belief even after its evidence is discredited; the overconfidence effect is systematically overrating the accuracy of our own judgments.
Why it matters: these biases are exactly what an experimental design — blinding, control groups, pre-registered hypotheses — is built to neutralise.
- Judgment under uncertainty — Tversky & Kahneman's classic on the heuristics (representativeness, availability, anchoring) behind systematic error. (Session 2)
- Fundamental attribution error, actor–observer effect, self-serving bias
The fundamental attribution error (FAE) is over-attributing others' behaviour to disposition while underrating the situation. The actor–observer effect is the flip: we explain our own behaviour situationally but others' dispositionally. The self-serving bias credits our successes to ourselves and blames failures on circumstances.
Key idea: attribution biases distort how we interpret data about people — a direct threat to the validity of observational and survey research.
- From the fundamental attribution error to the truly FAE — revisits how strongly observers discount situational forces. (Session 3)
- Hindsight bias, false consensus / false uniqueness effect
Hindsight bias ("I knew it all along") is the tendency, once an outcome is known, to see it as having been predictable. The false-consensus effect overestimates how many others share our views; the false-uniqueness effect underestimates how many share our abilities or desirable traits.
Key idea: hindsight bias is why predictions must be recorded before the data are seen — the logic behind pre-registration.
- Self-esteem & self-serving bias in reactions to positive and negative events — how self-esteem moderates the self-serving attribution pattern. (Session 4)
- Cross-cultural examination of the false-consensus effect — tests whether the bias generalises across cultures. (Session 4)
- What is science? Science vs. common sense; the scientific method
- Goals/types of research, features of empirical research, hypotheses & theories
Science is a systematic, empirical, self-correcting way of knowing with four goals: describe, predict, explain and (where possible) control. A theory is a broad explanatory framework; a hypothesis is a specific, falsifiable prediction derived from it. An operational definition specifies a construct in terms of the exact operations used to measure or manipulate it.
Key idea: a good hypothesis must be falsifiable — stated so that some conceivable observation could prove it wrong.
- Ch.1 Introduction to Scientific Thinking (pg. 3–19) — what makes knowledge scientific; the empirical, self-correcting nature of inquiry. (Session 5)
- Ch.2 Generating Testable Ideas (pg. 27–34) — moving from a vague question to a falsifiable, operationalised hypothesis. (Session 5)
- Science vs. pseudoscience, types of validity, replication, Type I & II error
Pseudoscience mimics science but resists falsification and ignores disconfirmation. Validity comes in kinds — construct, internal, external, statistical-conclusion. Replication is repeating a study to check that a result is real. A Type I error is a false positive (rejecting a true null); a Type II error is a false negative (missing a real effect).
Decision table: with significance level α = P(Type I) and β = P(Type II), statistical power = 1 − β is the chance of detecting a real effect.
- Ch.1 Distinguishing Science from Pseudoscience (pg. 20–27) — the demarcation criteria (falsifiability, openness to revision). (Session 6)
- Ch.4 Reliability & Validity of a Measurement (pg. 93–98) — consistency vs. accuracy of measures. (Session 6)
- Ch.4 Ethics in Focus: Replication as a Gauge for Fraud? (pg. 103) — replication as a safeguard against fabricated results. (Session 6)
- Ch.14 Types of Error and Power (pg. 401–402) — formal treatment of Type I/II error and statistical power. (Session 6)
Descriptive & Correlational Methodologies
sampling, surveys, observation, correlation vs. causationMeasuring the world without manipulating it. These non-experimental designs describe variables and the relationships between them. They are essential when manipulation is impossible or unethical — but they cannot, on their own, establish cause.
- Choose between case study, naturalistic observation, survey and correlational designs.
- Draw a representative sample and explain how sampling error shrinks with sample size.
- Read a scatterplot and interpret a correlation coefficient's sign and magnitude.
- Explain the directionality and third-variable problems behind "correlation ≠ causation".
- Features of descriptive research designs, representative samples, case studies
Descriptive designs aim to characterise a phenomenon as it naturally occurs. A representative sample mirrors the population on relevant characteristics so findings generalise; a case study is a deep, qualitative examination of a single person, group or event — rich but hard to generalise. Operationalisation turns abstract constructs into measurable variables.
Key idea: random sampling protects external validity (generalisability); random assignment, introduced later, protects internal validity (causal inference).
- Ch.4 Identifying Scientific Variables (pg. 83–88) — constructs, variables and how to operationalise them. (Session 7)
- Ch.5 Sampling from Populations (pg. 113–129) — probability vs. non-probability sampling and representativeness. (Session 7)
- Surveys, naturalistic observation; strengths & weaknesses of each
- Descriptive vs. correlational designs, causation, third variables, issues
Surveys efficiently capture self-reported attitudes at scale but are vulnerable to wording, order and social-desirability effects. Naturalistic observation records behaviour in its real setting with high ecological validity, but risks observer bias and reactivity. Descriptive statistics summarise data via measures of central tendency (mean, median, mode) and spread (range, variance, SD).
Key idea: the mean is x̄ = Σxᵢ / n and the standard deviation s = √(Σ(xᵢ − x̄)² / (n−1)) — together they describe "typical value" and "typical distance from typical".
- Ch.6 Choosing a Research Design (pg. 139–146) — matching the design to the research question and its constraints. (Session 8)
- Ch.13 Descriptive Statistics: Why Summarize Data? (pg. 368–378) — central tendency, variability and when each summary is appropriate. (Session 8)
- Correlational designs, causation, third variables, issues
- Scatterplots, directionality & third-variable problems, correlation vs. causation
- Spurious correlations; procedures to improve understanding of correlations
A correlation quantifies how two variables move together, from −1 (perfect inverse) through 0 (none) to +1 (perfect positive). Causation cannot be inferred because of the directionality problem (does A→B or B→A?) and the third-variable problem (some C drives both). Spurious correlations are strong-but-meaningless associations driven by chance or a lurking variable.
Key idea: Pearson's r = cov(X,Y) / (sₓ · s_y); and r² is the proportion of variance in one variable explained by the other.
- Ch.8 Correlational Designs (pg. 217–227) — computing and interpreting correlations and the limits on causal claims. (Session 9)
Experimental Methodologies
manipulation, randomization, control, factorial designs, workshopsThe only design that establishes cause. By manipulating an independent variable and randomly assigning participants to conditions, experiments rule out alternative explanations and license causal claims. This longest module builds from the two-group experiment up to multi-factor designs, then puts theory into practice with two hands-on workshops before the midterm.
- Identify the IV, DV, control group and confounds in any experiment.
- Explain how randomization and blinding control participant and experimenter effects.
- Read a factorial design, distinguishing main effects from interactions.
- Run and interpret data from between-participants and mixed factorial experiments.
- Features of experimental studies: manipulation, randomization, control groups
- Confounding variables, placebo effect
An experiment manipulates an independent variable (IV) and measures a dependent variable (DV) while holding everything else constant. Random assignment distributes unknown differences evenly across conditions, so the control group provides a valid baseline. A confound is any variable that varies systematically with the IV and offers a rival explanation; the placebo effect is change caused by expectation rather than treatment.
Key idea: manipulation + random assignment + a control group together = internal validity, the licence to say "X caused Y".
- Ch.9 Single-Case Experimental Designs (pg. 256–272) — establishing causality with one or few participants via baseline/treatment phases. (Session 10)
- Participant/experimenter effects, single/double blind, applications, meta-analysis
Participant effects (e.g. demand characteristics) and experimenter effects (unintentional cueing) can bias results. Single-blind designs keep participants unaware of condition; double-blind designs keep the experimenter unaware too. A meta-analysis statistically combines many studies' effect sizes to estimate the true effect more precisely than any single study.
Key idea: blinding removes expectancy as a confound — neither side can unconsciously nudge the outcome toward the hypothesis.
- Ch.9 Single-Case Experimental Designs (pg. 256–272) — continued: reversal and multiple-baseline logic. (Session 11)
- Factorial designs, main effects & 2-way/3-way interactions, statistical tests
A factorial design crosses two or more IVs (factors) so every combination of levels is tested — a 2×2 has four cells. This is efficient and, crucially, reveals interactions: cases where the effect of one factor depends on the level of another. These designs are typically analysed with a factorial ANOVA.
Key idea: a 2×3 factorial = 2 levels of factor A × 3 of factor B = 6 conditions; it yields a main effect of A, a main effect of B, and an A×B interaction.
- Ch.12 Factorial Experimental Designs (pg. 335–346) — structure and notation of multi-factor experiments. (Session 12)
- Ch.12 Main Effects and Interactions (pg. 342–355) — defining and reading main effects vs. interactions. (Sessions 12–13)
- Main effects, interactions, simple main effects, floor/ceiling effects
A main effect is the overall effect of one factor averaged across the other. A simple main effect is the effect of one factor at a single level of another — what you probe when an interaction is significant. Floor/ceiling effects occur when a measure bottoms out or tops out, masking real differences.
Key idea: on an interaction plot, non-parallel lines signal an interaction; parallel lines signal main effects only.
- Ch.12 Main Effects and Interactions (pg. 342–355) — interpreting interaction plots and simple main effects. (Session 13)
- Identifying main effects, interactions, control groups & confounds in studies
Applied practice: working through published studies to spot the design's factors, diagnose confounds, and decide which pattern of means reflects a main effect versus an interaction.
- Participate in a between-participants design experiment (individual basis)
In a between-participants (between-subjects) design each person experiences only one condition, so comparisons are across different groups. This avoids carryover effects but requires more participants and relies on randomization to equate the groups. You take part as a participant, then analyse the resulting data next session.
- Ch.10 Between-Subjects Experimental Designs (pg. 273–282) — independent-groups logic and when to prefer it. (Session 15)
- Discussion of between-participants experimental data from Workshop 1
The data you generated are analysed with a factorial ANOVA, which partitions variance to test each main effect and interaction at once. The session connects the abstract design to a concrete output table and what its F-tests mean.
Key idea: ANOVA compares between-group to within-group variance — F = MS_between / MS_within; large F ⇒ groups differ more than chance.
- Ch.10 General Instructions for Conducting a Factorial ANOVA (pg. 356–364) — step-by-step running and reporting of a factorial ANOVA. (Session 16)
- Participate in a mixed factorial design experiment (individual basis)
A mixed factorial design combines at least one between-participants factor with at least one within-participants factor (where every person sees all levels). It blends the efficiency of repeated measures with the cleanliness of independent groups. As before, you participate now and analyse the data next session.
- Ch.10 Mixed Participants Experimental Designs (pg. 273–282) — combining within- and between-subjects factors in one study. (Session 17)
- Discussion of mixed-design experimental data from Workshop 2; midterm preparation
Analysing the mixed-design data and weighing the trade-offs: within-subjects designs gain power by removing individual differences but risk order/carryover effects (countered by counterbalancing); between-subjects designs avoid those but need larger samples. Doubles as midterm review.
- Ch.11 Comparing Between-Subjects / Within-Subjects Designs (pg. 328–334) — the power-vs-carryover trade-off and counterbalancing. (Session 18)
Covers material from the lecture slides only (Modules 1–3). Format: multiple-choice plus one long-answer question requiring you to explain a theory and apply it through two real-world examples.
Quasi-Experimental & Applied Research and Measurement
quasi-experiments, applied research, psychological measures, ethicsWhen you cannot randomise. Real-world questions often forbid random assignment for practical or ethical reasons. This module covers quasi-experiments (which compare pre-existing groups or time points), applied research that targets concrete problems, the toolbox of psychological measures, and the ethical framework — IRBs, consent, the treatment of human and animal participants — that governs it all.
- Distinguish a quasi-experiment from a true experiment and name the threats to its validity.
- Describe pre/post-test and interrupted time-series designs.
- Compare self-report, behavioural and physiological measures on reliability and reactivity.
- Apply core research-ethics principles to human and animal studies.
- Quasi-experimental research, pre/post-test designs, interrupted time-series, issues
A quasi-experiment manipulates or compares conditions but without random assignment — often using pre-existing groups. A pre/post-test design measures before and after an intervention; an interrupted time-series takes many measurements before and after a change to see whether the trend shifts. Without randomization, selection bias and history are standing threats to internal validity.
Key idea: quasi-experiments trade some causal certainty for feasibility — strong on external validity, weaker on internal validity.
- Ch.11 Quasi-Experimental Designs (pg. 240–250) — nonequivalent-groups and time-series designs and their validity threats. (Session 20)
- Applied research: history, real-world examples, pre/post-tests, ethical issues
Applied research targets a concrete, practical problem (does this policy, therapy or product work?), in contrast to basic research, which seeks knowledge for its own sake. It typically prioritises external validity and real-world impact, and raises distinctive cost–benefit and ethical questions when interventions touch people's lives.
Key idea: applied vs. basic is about the goal (solve a problem vs. understand a phenomenon), not the method — either can use experiments or surveys.
- Quasi-Experimental Designs & Applied Research (Blackboard) — bridges quasi-experimental method with real-world applied problems. (Session 21)
- Indirect vs. direct self-report; behavioral/physiological measures
- Strengths/weaknesses of measurement tools, reliability of measures
- Ethical research, human/animal participants, research ethics boards
Measures vary in directness and reactivity: direct self-report (asking people) is easy but fakeable; indirect measures sidestep self-presentation; behavioural and physiological measures (e.g. reaction time, heart rate) are harder to fake but costlier. A measure must be both reliable (consistent) and valid (measures what it claims). Ethics — informed consent, minimising harm, and oversight by a research ethics board / IRB — constrains every choice.
Key idea: reliability is necessary but not sufficient for validity — a scale can be perfectly consistent yet consistently wrong.
- Should we trust web-based studies? (Blackboard) — data quality and validity of online data collection. (Session 22)
- Ch.3 Research Ethics (pg. 53–80) — consent, deception, debriefing, IRBs and animal welfare. (Session 22)
- Ethics of CIA & military contracting by psychiatrists/psychologists (Blackboard) — a hard case on dual-use research ethics. (Session 22)
- Scientific rewards and conflicts of ethical choices in human research (Blackboard) — how career incentives can collide with ethics. (Session 22)
Qualitative Methodologies
interviews, focus groups, surveys, observational field studyUnderstanding meaning, not just counting it. Qualitative methods ask "how" and "why", producing rich textual data about experience and meaning rather than numbers. This module covers interviews and focus groups and culminates in a group observational field study — applying qualitative methods to real data you collect yourselves.
- Contrast qualitative and quantitative aims and when each is appropriate.
- Choose an interview type and write a workable interview protocol.
- Plan and moderate a focus group and recognise its data-analysis challenges.
- Design and run a small observational field study and take usable field notes.
- Qualitative vs. quantitative research, types of sampling, types of interviews
- Interview process; creating & conducting an interview; strengths/weaknesses
Qualitative research seeks depth and meaning (words, themes); quantitative seeks breadth and measurement (numbers, statistics). Interviews range from structured (fixed questions, comparable answers) through semi-structured to unstructured (open, exploratory). Qualitative work typically uses purposive rather than random sampling.
Key idea: more structure = more comparability but less depth; the interview type should follow the research question.
- Writing interview protocol and conducting interviews: tips for students new to qualitative research (Blackboard) — a practical how-to for first-time interviewers. (Session 23)
- Focus groups: processes & protocol, data analyses, issues
A focus group is a moderated discussion (typically 6–8 people) whose value lies in the interaction between participants, surfacing shared and divergent views. Challenges include groupthink, dominant speakers, and the difficulty of coding open-ended talk into themes. Focus groups and surveys are often combined — qualitative depth informs quantitative scale.
Key idea: the moderator's job is to elicit interaction without leading it — poor moderation is itself a confound.
- Focus groups and surveys as complementary research methods: a case example (Blackboard) — shows how the two methods triangulate. (Session 24)
- Focus groups continued…
Continued practice running and analysing focus-group data: moving from raw transcript to coded themes, and judging the trustworthiness of qualitative conclusions (credibility, transferability) — the qualitative analogues of validity.
- Applied qualitative methods: conducting field research / gathering data
The capstone of the qualitative module: in groups you conduct a real observational field study, gathering data in a natural setting and recording systematic field notes. This is where sampling, observation, ethics and analysis come together on data you collected yourselves — feeding directly into the group research report and presentation.
- Revisiting field experimentation: field notes for the future (Blackboard) — practical guidance on observing and note-taking in the field. (Session 26)
Communicating Scientific Knowledge
APA writing, manuscripts, presentations, final examResearch that isn't communicated doesn't count. The final module turns findings into shareable knowledge: the structure of an APA manuscript, citation and plagiarism rules, and how to present results as a conference-style talk. It closes with the group presentation and the cumulative final exam.
- Structure a report in APA format (title page, intro, method, results, discussion, references).
- Cite sources correctly and avoid plagiarism.
- Present two studies from an article as a clear, conference-style talk.
- Critique a study's design and defend methodological choices under exam conditions.
- Structured writing, formatting, title page, references, citing, plagiarism
- Group project (in-class / take-home if necessary)
APA format gives research a shared structure — title page, abstract, introduction, method, results, discussion and references — so readers can find and evaluate each part quickly. Correct citation credits prior work and lets claims be traced; failing to do so is plagiarism. The session prepares the written backbone of the group project.
Key idea: the IMRaD structure (Introduction, Method, Results, Discussion) maps onto the logic of inquiry itself — question, how, what, so-what.
- Ch.15 Communicating Research: Preparing Manuscripts, Posters, and Talks (pg. 425–433, 447–454) — how to write up and present results. (Session 27)
- APA-Style Writing, Sample Manuscript (pg. 455–503) — an annotated model paper to imitate. (Session 27)
- Discussion, evaluation and feedback · submit via Turnitin on presentation date
In groups of 4–5, present two studies from a single empirical article in the format of a conference talk, followed by class discussion and feedback. Graded against the rubric posted on Blackboard.
- Discussion, evaluation and feedback · submit via Turnitin on presentation date
Remaining groups present, with the same discussion-and-feedback format. Acts as a cumulative review of design concepts as the class critiques each study's methodology.
- Cumulative; multiple-choice plus long-answer. Minimum 3.5 required to pass.
Cumulative across all six modules (slides only). The long-answer asks you to weigh the advantages and disadvantages of methodological techniques and to identify every methodological flaw in a given study — the course's core skill in one question. You must score at least 3.5/10 here to pass, regardless of your weighted average.
Key concepts — glossary
The recurring vocabulary of the course, in plain terms. These are the terms most likely to appear on the quizzes and exams.
- Hypothesis
- A specific, falsifiable prediction derived from a theory, stated so that evidence could prove it wrong.
- Theory
- A broad, organised explanation of how and why phenomena occur, from which hypotheses are derived and tested.
- Operational definition
- A definition of a construct in terms of the exact operations used to measure or manipulate it (e.g. "anxiety = score on the STAI scale").
- Independent variable (IV)
- The variable the researcher manipulates or selects, hypothesised to be the cause.
- Dependent variable (DV)
- The outcome that is measured, hypothesised to be affected by the IV.
- Confound
- An extraneous variable that varies systematically with the IV, providing a rival explanation for the result.
- Random assignment
- Allocating participants to conditions by chance so groups are equivalent on average — the basis of causal inference.
- Random sampling
- Selecting participants from a population by chance so the sample is representative — the basis of generalisability.
- Control group
- A comparison condition that does not receive the treatment, providing a baseline against which effects are judged.
- Reliability
- The consistency of a measure — does it give the same result on repetition? Necessary but not sufficient for validity.
- Validity
- Whether a study or measure actually captures what it claims (construct, internal, external, statistical-conclusion).
- Internal validity
- The degree to which a study supports a causal claim, free of confounds.
- External validity
- The degree to which findings generalise beyond the specific sample, setting and time.
- Correlation
- A statistical association between two variables, ranging from −1 to +1; it does not, by itself, imply causation.
- Third-variable problem
- When an unmeasured variable causes two others to correlate, creating a spurious association.
- Type I error (α)
- A false positive — concluding there is an effect when there is none (rejecting a true null hypothesis).
- Type II error (β)
- A false negative — missing a real effect (failing to reject a false null hypothesis).
- Statistical power
- The probability of detecting a true effect, equal to 1 − β; raised by larger samples and effect sizes.
- Replication
- Repeating a study to verify that a finding is reliable and not a fluke or fraud.
- Factorial design
- An experiment crossing two or more factors so every combination of their levels is tested (e.g. 2×2).
- Main effect vs. interaction
- A main effect is one factor's overall effect; an interaction is when one factor's effect depends on another's level.
- Quasi-experiment
- A design that compares conditions without random assignment, trading internal validity for feasibility.
- Operationalisation & double-blind
- A double-blind procedure keeps both participants and experimenters unaware of condition, removing expectancy bias.
- Meta-analysis
- A quantitative synthesis that pools effect sizes across many studies to estimate a more precise overall effect.
- Demand characteristics
- Cues that lead participants to guess the study's purpose and alter their behaviour accordingly.
- Triangulation
- Using multiple methods, sources or measures to cross-check a finding and strengthen confidence in it.
Annotated readings & bibliography
Core textbook chapters and supplementary readings, one line each, with the sessions they support. The course textbook is referenced by chapter; supplementary articles are provided via Blackboard.