2025-06-30
Models
- [ ]
Papers
- [2506.21539] WorldVLA: Towards Autoregressive Action World Model
- [2506.19290] Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs
Code
Articles
- [ ]
Videos
- The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs|ASAP25 - YouTube
- DeltaFormer: breaking the expressivity of Transformer with delta rule|ASAP26 - YouTube
Other
- [ ]