Featured Posts
-
Let's Reproduce GPT-2 by Karpathy - Review
My notes and takeaways on Andrej Karpathy's GPT-2 reproduction video
-
Transformer Circuits (Anthropic) - Review
Notes on the mathematical framework behind Transformer models
-
Karpathy's "Let's Build GPT From Scratch" - Review
My thoughts and notes on Andrej Karpathy's video on building GPT from scratch
-
Curriculum learning for LLMs?
How can we handle outdated training data in large language models?
-
Do LLMs Possess an Internal State of Mind?
-
Advocating for OpenAI's for-profit model
An unpopular opinion on OpenAI's transition to a for-profit model