A Brief Introduction to the Seminal Transformer Paper.
Reading the GPT-1 Paper.
A Groundbreaking and Efficient Training Approach from DeepSeek.
How Reasoning Is Elicited in Large Language Models.
A Neural Net Optimizer.