Unboxing LLMs > loading...
N-shot
Home
Blog
Projects
Author: Rakshit Kalra
Home
/ Archives
Latest Posts
August 1, 2023
Understanding the Transformer Architecture: Self-Attention and Beyond
July 25, 2023
The Quantization Model of Neural Scaling: Explaining Emergent Abilities in LLMs
July 17, 2023
Training Compute-Optimal Large Language Models: The Chinchilla Paradigm
July 15, 2023
The Evolution of Language Models: From Transformers to Modern AI
July 3, 2023
Activation Checkpointing: Trading Computation for Memory in Deep Learning
July 2, 2023
Vision Transformers: How “An Image is Worth 16×16 Words”
Prev
1
…
8
9
10