Featured Posts
-
BERT - Review
Reviewing Bidirectional Encoder Representations from Transformers
-
BLT – Review
My thoughts on the Meta's Byte Latent Transformer
-
Continuous Latent Reasoning for LLMs (COCONUT) - Review
Exploring Meta's COCONUT paper
-
Let's Reproduce GPT-2 by Karpathy - Review
My notes and takeways on Andrej Karpathy's GPT-2 reproduction video
-
Transformer Circuits(Anthropic) - Review
Exploring the mathematical framework behind Transformer models
-
Karpathy's "Let's Build GPT From Scratch" - Review
My thoughts and notes on Andrej Karpathy's video on building GPT from scratch