Featured Posts
-
Transformer Circuits (Anthropic) - Review
Exploring the mathematical framework behind Transformer models
-
Karpathy's "Let's Build GPT From Scratch" - Review
My thoughts and notes on Andrej Karpathy's video on building GPT from scratch
-
Curriculum learning for LLMs?
How can we address the discrepancy caused by outdated training data in large language models?
-
Do LLMs Possess an Internal State of Mind?
-
Advocating for OpenAI's for-profit model
An unpopular opinion on OpenAI's transition to a for-profit model
-
Link Archive - 2024
A collection of articles and videos I explored in 2024