My Blog

March 30, 2026

Bridging the Gap from Classical ML to Transformers, Part 1 - AdamW & Training Stability

March 23, 2026

Bridging the Gap from Classical ML to Transformers, Part 0 - Overview

February 6, 2025

The Lost History of Salient Terms

February 1, 2024

Empirical evaluation of confidence estimation methods with Llama