Deep Dives
-
Understanding and implementing the GPT-1, GPT-2 and GPT-3 architectures
31 min read -
Is C is faster than Rust? I had always assumed the answer to that question…
15 min read -
It’s (not) all about LLMs and AI tools
27 min read -
Reflecting on advances and challenges in deep learning and explainability in the ever-evolving era of…
25 min read -
Part 1: Leverage linear regression and decision trees to impute time-series gaps.
15 min read -
Let’s build our breadth of science together.
50 min read -
Ranking accuracy versus absolute accuracy
19 min read -
RMS Norm, RoPE, GQA, SWA, KV Cache, and more!
56 min read