How Neural Networks Actually Learn: Backpropagation Explained
Most tutorials hand-wave over backprop. This one walks through the actual math — gradients, the chain rule, and why any of it works — with code you can run.
18 articles — sorted newest first.
Most tutorials hand-wave over backprop. This one walks through the actual math — gradients, the chain rule, and why any of it works — with code you can run.
Attention mechanisms, positional encoding, why the 'query-key-value' metaphor actually makes sense, and how a transformer processes a sentence — step by step.
From the basic idea of walking downhill on a loss surface, to momentum, Adam, learning rate schedules, and the practical choices that actually matter when training models.
Before Log4j 1.0, before structured logging, before centralised log aggregation — here is how we handled logging across distributed Java services at Motorola.
Before Maven, before Gradle, there was Ant. XML build files felt like progress in 1999. Here is what a real Ant build looked like for a production Java project.
CVS was the standard version control system for a decade. It was painful. Here is how we used it, what the workflows looked like, and what we learnt.
The JVM managing memory automatically sounded too good to be true. In 1997 it mostly was. Here is what was actually happening and how we dealt with it.
In 1999 WAP felt like the future. The reality was more complicated. Here is how WAP actually worked and why it ultimately failed.
Java's socket API was the first time network programming felt accessible. Here is a complete walkthrough of what we built at Motorola.
Before cloud, before Kubernetes, before load balancers were cheap — this is how we kept critical network management systems running in 1997.
The GoF book landed in 1994. By 1997 we were applying it daily. Here are the patterns that paid off and the ones that over-complicated everything.
UML 1.0 was just published. We adopted it immediately. Here is what actually helped and what turned into ceremony.
Java gave every developer access to threads from day one. Here is how we used them at Motorola in 1997 — and which patterns held up.
MySQL is fast, free and increasingly production-ready. But performance at scale comes from good schema design, not just the database engine. Here is what I have learned running MySQL in production.
Java Micro Edition is a serious attempt to bring Java to devices with kilobytes of RAM rather than megabytes. Here is what the platform looks like and what you can realistically build on it.
Java's Remote Method Invocation lets you call methods on objects running in a different JVM, possibly on a different machine. Here is how it works and where it fits in distributed system design.
Every serious network has dozens of devices that need monitoring and configuration. SNMP is the protocol that makes this manageable — here is how it works under the hood.
Java 1.1 is out and teams are moving fast. Here is what object-oriented design actually means in practice — and why getting it right from day one saves months of pain.