2025-09-29
Models
- [ ]
Papers
Code
- [ ]
Articles
- LoRA Without Regret - Thinking Machines Lab
- Inside NVIDIA GPUs: Anatomy of high performance matmul kernels - Aleksa Gordić
- Inside vLLM: Anatomy of a High-Throughput LLM Inference System - Aleksa Gordić
- Writing high-performance matrix multiplication kernels for Blackwell — JAX documentation
Videos
Other
- [ ]