Writing
Interactive tools, long-reads, and deep dives into ML, GPU architecture, and inference
How a triangle-drawing chip became the most important processor on the planet. GPU history from first principles — parallelism, shaders, CUDA, Tensor Cores, and what comes next.
March 2026 · Interactive toolFirst-principles compute estimator for diffusion and AR streaming video models. Adjust model size, steps, VAE compression, sparse attention, hardware, and precision to see if realtime generation is achievable.