michal.i/o
DeepSeek v3
January 20, 2025
1 min read
How has DeepSeek improved the Transformer architecture?