Published inTDS ArchiveThe Animated Monte-Carlo Tree Search (MCTS)The algorithm at the heart of AlphaGo, AlphaGo Zero, AlphaZero and MuZeroJan 31A response icon2Jan 31A response icon2
Published inTDS ArchiveSpatial Transformer Networks — BackpropagationA Self-Contained IntroductionOct 12, 2021A response icon1Oct 12, 2021A response icon1
Published inTDS ArchiveSpatial Transformer NetworksA Self-Contained IntroductionSep 27, 2021Sep 27, 2021
Published inTDS ArchiveSpatial Transformer Networks Tutorial, Part 2 — Bilinear InterpolationA Self-Contained IntroductionSep 13, 2021Sep 13, 2021
Published inTDS ArchiveSpatial Transformer Tutorial, Part 1 — Forward and Reverse MappingA Self-Contained IntroductionAug 30, 2021Aug 30, 2021
Published inTDS ArchiveTransformer Networks: A mathematical explanation why scaling the dot products leads to more stable…How a small detail can make a huge differenceApr 28, 2021A response icon5Apr 28, 2021A response icon5
Published inTDS ArchiveDerivative of the Softmax Function and the Categorical Cross-Entropy LossA simple and quick derivationApr 22, 2021A response icon15Apr 22, 2021A response icon15
Published inTDS ArchiveAleatory Overfitting vs. Epistemic OverfittingApproaching the two reasons why your model is not able to generalize wellDec 20, 2020A response icon1Dec 20, 2020A response icon1
Published inTDS ArchiveDeriving the Backpropagation Equations from Scratch (Part 2)Gaining more insight into how neural networks are trainedNov 23, 2020A response icon5Nov 23, 2020A response icon5
Published inTDS ArchiveDrawing the Transformer Network from Scratch (Part 1)Getting a mental model of the Transformer in a playful wayNov 15, 2020A response icon7Nov 15, 2020A response icon7