reasoning-project — single-agent search & planning

What it is

Reasoning as search over states

A problem is four things: an initial state, a goal test, a successor function (the legal moves), and a step cost. Any domain that implements that interface can be handed to any search algorithm. Solving the puzzle is finding a least-cost path to a goal state — the same act whether you're planning a route or sliding tiles.

Doing it from scratch makes the trade-offs concrete. Breadth-first guarantees the shortest path but explores almost everything. A* uses a heuristic to head toward the goal — and if the heuristic never overestimates (it's admissible), the path it returns is still optimal, but it expands a tiny fraction of the nodes.

The build

Two domains, four searches

domain

8-puzzle

3×3 sliding-tile puzzle (any N×N). Inversion-parity solvability test and a k-move scrambler with a known optimal-length bound.

domain

Missionaries & cannibals

River-crossing constraint puzzle; the classic 3M/3C/boat-2 instance has a known optimum of 11 crossings.

uninformed

BFS & UCS

Breadth-first (shortest path on equal costs) and uniform-cost / Dijkstra (optimal for any non-negative cost).

informed

A* & IDA*

Best-first on f = g + h, and iterative-deepening A* with memory linear in depth. Same optimal cost, far fewer nodes.

heuristic

Manhattan & misplaced

Both admissible; Manhattan dominates misplaced-tiles, so A* with it expands no more nodes — usually many fewer.

verify

Admissibility tests

The suite computes the true optimal cost-to-go and asserts the heuristic never exceeds it on sampled states.

The point

A good heuristic, in real numbers

Output from demo.py. All four algorithms return the same optimal cost; only the nodes expanded differ.

8-puzzle scrambled 27 moves — optimal length 23

algorithm	cost	nodes expanded
BFS	23	80,439
UCS	23	119,498
A*	23	1,193
IDA*	23	784

A* with the Manhattan heuristic expands 67× fewer nodes than BFS for the identical optimal solution; IDA* fewer still. On a lighter 18-move scramble (optimal 12) the gap is 821 → 33, about 25×. Missionaries & cannibals is tiny — all four agree on the 11-crossing plan, BFS expanding 13 nodes.

Try it

In-browser 8-puzzle solver

The real solver is the Python package in this repo. This is a faithful JavaScript re-implementation of the same algorithms on the same puzzle, so you can watch the node counts diverge live. Scramble the board, pick an algorithm, and solve.

Scramble the board, then solve.

Reflection

What building it taught me

Representation comes first. Encoding a state as an immutable tuple is what lets the search dedupe and reason at all.
The algorithm is a trade-off. BFS, UCS, A* and IDA* each win on a different axis — completeness, generality, speed or memory.
A good heuristic is everything. Manhattan distance turned an 80,000-node search into a 1,200-node one with no loss of optimality.
Admissibility is a property you test. "Never overestimates" is what licenses optimality — so the suite checks it against the true cost-to-go.
Unsolvable means unsolvable. Inversion parity decides 8-puzzle solvability; the search and the classifier are made to agree.