Models
Papers
- [2411.17465] ShowUI: One Vision-Language-Action Model for GUI Visual Agent
 - [2411.17116] Star Attention: Efficient LLM Inference over Long Sequences
 - [2411.17685] Attamba: Attending To Multi-Token States
 - [2411.15242] The Zamba2 Suite: Technical Report
 - [2410.19055] Newton Losses: Using Curvature Information for Learning with Differentiable Algorithms
 - [2402.15898] Transductive Active Learning: Theory and Applications
 
Code
Articles
Videos
- 1st Workshop on X-Embodiment Robot Learning, CoRL’24 - YouTube
 - Guest Lecture 2: Jiaming Song - Dream Machine: Video Foundation Models (KAIST CS492D, Fall 2024)
 - Hearing the AGI from GMM HMM to GPT 4o Yu Zhang - November 15th LTI Colloquium Speaker - Yu Zhang - YouTube
 - Newton Losses: Using Curvature Information for Learning with Differentiable Algorithms - NeurIPS2024 - YouTube
 - Open Reasoning vs OpenAI - YouTube
 - Video Conferencing, Web Conferencing, Webinars, Screen Sharing - Zoom
 - Learning on Graphs Conference - YouTube
 - A little guide to building Large Language Models in 2024 - YouTube
 - Test-Time Adaptation: A New Frontier in AI - YouTube