Featured Posts
-
Link Drift
A small experiment for browsing my links page in a less rigid way
-
DeepSeek OCR, and why I think vision eats language
Notes on DeepSeek OCR
-
My Take on GPT-5
First impressions
-
The Murmuring Woman
A Parable for LLM Thinking
-
Dropout - Review
Revisiting the foundational 2014 paper on Dropout
-
Rethinking Sequence-to-Sequence - Review
Looking back at the 2015 paper that introduced an attention-like mechanism to NMT.