HPC Applications & Frontiers — A Book

What it is

Where HPC meets the real world

This is the companion to my foundations volume. Where the first book teaches the machinery — architecture, parallelism, OpenMP and MPI — this one asks the harder question: what is all that power actually for? I wrote it to learn how supercomputing turns into real science and engineering.

The chapters split into two halves. The first sharpens the tools — advanced MPI and OpenMP, accelerators like GPUs and FPGAs, hybrid programming, and serious code optimisation. The second points those tools at domains: simulating airflow, modelling the climate, pricing in finance, analysing the brain, training neural networks across many GPUs, and finally the emerging machines — quantum and neuromorphic — that may be the next frontier.

The core idea I wanted to learn: the hardest part of applied HPC isn't the physics or the model — it's mapping a real problem onto thousands of processors so it actually scales. Every domain chapter is really a lesson in decomposing a problem until the machine can chew through it.
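As a concrete picture of "mapping a problem onto processors", here is a minimal sketch of the simplest decomposition there is: cutting a 1D range of work into contiguous blocks, one per process. The function name `block_range` and the sizes are illustrative assumptions, not code from the book.

```python
def block_range(n_items: int, n_ranks: int, rank: int) -> range:
    """Return the contiguous block of item indices owned by `rank`.

    Hypothetical helper: a plain block decomposition where the first
    `n_items % n_ranks` ranks take one extra item, so loads differ by at most one.
    """
    base, extra = divmod(n_items, n_ranks)
    start = rank * base + min(rank, extra)
    stop = start + base + (1 if rank < extra else 0)
    return range(start, stop)


if __name__ == "__main__":
    n_items, n_ranks = 10, 4  # illustrative sizes only
    for rank in range(n_ranks):
        owned = block_range(n_items, n_ranks, rank)
        print(f"rank {rank}: items {owned.start}..{owned.stop - 1} ({len(owned)} items)")
```

Real domains rarely split this cleanly, but the question is always the same: which items does each process own, and which neighbours does it need to talk to?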

The toolkit

Tools & techniques under the hood

Each was new to me, and each unlocks a different class of problem. Here is what each one actually does.

scaling

Advanced MPI

Non-blocking communication, collectives and topologies — the techniques that keep message passing efficient when you go from a handful of processes to thousands.

accelerators

GPUs & FPGAs

Specialised processors for massive parallelism. GPUs for data-parallel maths; FPGAs for custom, reconfigurable hardware pipelines.

hybrid

MPI + OpenMP + GPU

Real clusters mix all three layers at once. Hybrid patterns let one program use shared memory, message passing and accelerators together.

orchestration

Workflow automation

Large scientific runs are pipelines, not single jobs. Orchestration tools chain steps, manage data and make runs reproducible.
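A rough sketch of what "chaining steps" means underneath: a pipeline is a dependency graph, and the orchestrator's first job is to find an order that respects it. The step names below are hypothetical, and this uses only Python's standard `graphlib` module rather than any particular workflow tool.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each step maps to the set of steps it depends on.
PIPELINE = {
    "preprocess": set(),
    "simulate": {"preprocess"},
    "analyse": {"simulate"},
    "plot": {"analyse"},
    "archive": {"simulate"},
}


def execution_order(pipeline):
    """Return one valid order in which every step runs after its dependencies."""
    return list(TopologicalSorter(pipeline).static_order())


if __name__ == "__main__":
    for position, step in enumerate(execution_order(PIPELINE), start=1):
        print(position, step)
```

Production orchestration tools add scheduling, data staging and restart logic on top, but the dependency graph is the part that makes a run reproducible.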

ML at scale

Distributed deep learning

Training one model across many GPUs and nodes by splitting the data or the model itself — the same HPC ideas applied to neural networks.
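To illustrate the "splitting the data" half of that sentence, here is a minimal, single-process sketch of data parallelism: each simulated worker computes a gradient on its own shard, and an averaging step stands in for the all-reduce a real framework would perform. The one-parameter model, the data and the function names are assumptions made for the example.

```python
def local_gradient(w, shard):
    """Mean gradient of 0.5 * (w*x - y)**2 over one worker's shard."""
    return sum((w * x - y) * x for x, y in shard) / len(shard)


def all_reduce_mean(values):
    """Stand-in for an all-reduce: every worker would receive this mean."""
    return sum(values) / len(values)


def data_parallel_step(w, data, n_workers, lr):
    """One SGD step with the batch split evenly across `n_workers`."""
    if len(data) % n_workers != 0:
        raise ValueError("this sketch assumes equal-sized shards")
    size = len(data) // n_workers
    shards = [data[i * size:(i + 1) * size] for i in range(n_workers)]
    grads = [local_gradient(w, shard) for shard in shards]  # done in parallel in practice
    return w - lr * all_reduce_mean(grads)


if __name__ == "__main__":
    data = [(float(x), 3.0 * x) for x in range(1, 9)]  # toy data for y = 3x
    print(data_parallel_step(0.0, data, n_workers=4, lr=0.01))
    print(data_parallel_step(0.0, data, n_workers=1, lr=0.01))
```

With equal-sized shards the averaged gradient matches the full-batch gradient, so both calls take the same step; the only new cost is the communication hidden inside `all_reduce_mean`.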

frontier

Quantum & neuromorphic

The emerging paradigms: qubits exploiting superposition, and brain-inspired chips that compute with event-driven spikes rather than a global clock.
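For a feel of what superposition means numerically, here is a small sketch that simulates a single qubit as a pair of amplitudes and applies a Hadamard gate. It is a classical toy written for illustration, with assumed function names, and says nothing about how real quantum hardware is programmed.

```python
import math


def hadamard(state):
    """Apply a Hadamard gate to a single-qubit state (amp_0, amp_1)."""
    a, b = state
    s = 1.0 / math.sqrt(2.0)
    return (s * (a + b), s * (a - b))


def probabilities(state):
    """Measurement probabilities for outcomes 0 and 1."""
    return tuple(abs(amp) ** 2 for amp in state)


if __name__ == "__main__":
    zero = (1.0, 0.0)            # the basis state |0>
    superposed = hadamard(zero)  # equal superposition of |0> and |1>
    print(probabilities(superposed))
    print(probabilities(hadamard(superposed)))  # a second Hadamard undoes the first
```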

The chapters

How the book is structured

The volume climbs from advanced technique, through applied domains, to the genuine frontier. The final stage is where I'm still drafting — fittingly, it's the part nobody has fully figured out either.

Advanced parallel technique (drafted)
Deeper OpenMP and MPI, plus accelerators such as GPUs and FPGAs, pushing the foundations to their limits.
Optimisation & data structures (drafted)
Tuning code for real architectures and choosing data layouts that the hardware can actually fly through.
Workflows & orchestration (drafted)
Turning one-off runs into automated, reproducible pipelines across a cluster.
Simulation domains (drafted)
Computational fluid dynamics and climate / terrestrial modelling: classic HPC workloads written out as case studies.
Data-driven domains (drafted)
Finance, health and neuroscience, plus deep learning at scale: where HPC meets large data and AI.
The frontier (drafting)
Digital twins, neuromorphic computing and quantum computing: the chapters I'm still actively writing.

How I work on it

From a domain to a chapter

Each applied chapter follows the same loop — pick a real problem, find the parallel structure hiding in it, and only then write:

Find the parallelism: before any code, I work out what part of the problem is independent and can be split across processors. That decomposition is the whole chapter.
Build a minimal model: I write the smallest runnable version that captures the domain — a tiny CFD grid, a toy training loop — and scale it up until it strains.
Explain the trade-offs: every domain forces a different balance of compute, communication and memory; the chapter isn't done until I can say which dominates and why.
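The loop above can be sketched on the smallest grid imaginable: a 1D diffusion step, run once on the whole grid and once split into subdomains that read a single "halo" value from each neighbour. Everything here (function names, the coefficient `alpha`, fixed zero boundaries) is an assumption chosen for the example, and the subdomains run in one process rather than under MPI.

```python
def diffuse(cells, left, right, alpha=0.25):
    """One explicit diffusion step on a strip, given its two halo values."""
    padded = [left] + cells + [right]
    return [
        padded[i] + alpha * (padded[i - 1] - 2.0 * padded[i] + padded[i + 1])
        for i in range(1, len(padded) - 1)
    ]


def serial_step(grid):
    """Reference version: the whole grid with fixed zero boundaries."""
    return diffuse(grid, 0.0, 0.0)


def decomposed_step(grid, n_parts):
    """Same step, with the grid split into equal subdomains plus halos."""
    if len(grid) % n_parts != 0:
        raise ValueError("this sketch assumes equal-sized subdomains")
    size = len(grid) // n_parts
    parts = [grid[i * size:(i + 1) * size] for i in range(n_parts)]
    updated = []
    for p, part in enumerate(parts):
        # Halo exchange: in an MPI code these two values arrive as messages.
        left = parts[p - 1][-1] if p > 0 else 0.0
        right = parts[p + 1][0] if p < n_parts - 1 else 0.0
        updated.extend(diffuse(part, left, right))
    return updated


if __name__ == "__main__":
    grid = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0]
    print(serial_step(grid))
    print(decomposed_step(grid, n_parts=4))
```

Each subdomain computes on `size` cells but communicates only two values per step, which is the compute-to-communication balance the third step of the loop asks about.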

The chapters live as notebooks so the science, the code and the results stay together — read the problem, then watch a scaled-down version actually run with mpirun or on a GPU.

Reflection

What writing it taught me

Every domain is the same problem in a costume. CFD, finance and deep learning all reduce to "decompose, distribute, communicate as little as possible" — once I saw that, the field unified.
Communication is the enemy of scale. Adding processors only helps until the time spent communicating outweighs the compute time saved; advanced MPI is mostly about communicating less and overlapping what remains with computation.
Distributed deep learning is HPC with a friendlier name. Training across many GPUs uses the same data and model decomposition I'd learned for scientific simulation.
The frontier is humbling. Writing the quantum and neuromorphic chapters showed me how young these paradigms are — and how much of HPC is still being invented.
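The point about communication being the enemy of scale can be made concrete with a toy cost model: compute time shrinks as 1/p, while a collective costs something like log2(p). The model and its constants are assumptions picked for illustration, not measurements from any machine.

```python
import math


def run_time(p, t_compute=100.0, t_comm=1.0):
    """Toy model: perfectly divisible compute plus a log2(p) communication term."""
    return t_compute / p + t_comm * math.log2(p)


def speedup(p, **costs):
    """Speedup relative to a single process under the same toy model."""
    return run_time(1, **costs) / run_time(p, **costs)


if __name__ == "__main__":
    counts = [2 ** k for k in range(11)]  # 1, 2, 4, ..., 1024 processes
    for p in counts:
        print(f"p={p:5d}  time={run_time(p):8.3f}  speedup={speedup(p):6.2f}")
    best = min(counts, key=run_time)
    print("fastest process count in this model:", best)
```

With these made-up constants the run time bottoms out at 64 processes and gets worse beyond that: past that point, each extra processor costs more in communication than it saves in compute.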
