Unboxing LLMs
N-shot
Author: Rakshit Kalra
Latest Posts
December 21, 2024
From Llama 3 to TÜLU 3: My Hands-On Recipe for SFT, DPO & RLVR
December 20, 2024
The Enduring Need for Scarcity: Limits to Value
December 14, 2024
Coconut (Chain of Continuous Thought): Reasoning Beyond Tokens
December 10, 2024
AGORABENCH: Evaluating Language Models as Synthetic Data Generators
December 7, 2024
Reverse-Enhanced Thinking: Teaching Language Models to Reason Backwards
November 30, 2024
Pause Tokens for Transformers: Making Language Models *Pretend* to Think Before Speaking