Models
- [ ]
Papers
- [2411.07975] JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation
- [2410.08020] Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs
- [2411.10440] LLaVA-o1: Let Vision Language Models Reason Step-by-Step
- [2411.10433] M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation
Code
- [ ]
Articles
- You could have designed state of the art positional encoding
- Extending the Context Length to 1M Tokens! | Qwen
Videos
- Tim Dettmers on Open-source AI, LMs, SWE Bench, Agents, Quantization, & Optimization - YouTube
- Speculations on Test-Time Scaling (o1) - YouTube
- Retrieval augmented generation; Extractive summarization - YouTube
- Learning at test time in LLMs - YouTube
- QA: Retrieval & Answer extraction - YouTube
- Flash Attention derived and coded from first principles with Triton (Python) - YouTube