The Perceptron · Suman Bhadra Notes

One neuron, the original

The perceptron (1958) is the simplest artificial neuron: it takes a few inputs, multiplies each by a weight, adds them up with a bias, and fires a 1 or 0 depending on whether the total clears a threshold.

The computation

z = w₁x₁ + w₂x₂ + … + b, then output 1 if z ≥ 0 else 0.

Look familiar? It's almost exactly logistic regression — the same weighted sum — but with a hard step activation instead of the smooth sigmoid. That one swap is the bridge from classic ML to neural networks.

Watch a neuron compute and learn

The animation shows the weighted-sum machine, then the perceptron nudging its weights until the line separates the two classes.

The pieces

Weights w₁, w₂, …

How much each input matters. Learned from data — the knobs the neuron tunes.

Bias b

Shifts the threshold — lets the neuron fire more or less easily, independent of inputs.

Activation step

Turns the sum into an output. The perceptron uses a hard step; modern neurons use smoother functions.

The learning rule

For each example: if the prediction is wrong, nudge the weights toward the correct answer — w ← w + η(y − ŷ)x. Repeat until the line separates the classes.

Power and limits

A single perceptron can

Learn any linearly separable boundary
Act as AND, OR, NOT gates
Train with a simple, guaranteed-to-converge rule (if separable)

But it can't

Solve XOR — no single line separates it
Capture non-linear patterns
Output probabilities (hard step, not smooth)

The fix that made deep learning

Stack many neurons into layers, give them smooth activation functions, and you get a neural network that can bend around any pattern — including XOR.