Sep
03
What's so Hard About Representing Deep Learning Models?
3 min read
Sep
03
Case Study: Shortfalls of Attention is All You Need
3 min read
Mar
30
Preprints for Applied Category Theory Conference 2024
1 min read
Feb
23
Neural Circuit Diagrams T-Shirt
1 min read
Jan
02
Understanding Mixtral-8x7b
3 min read