2025-02-03

Models

Papers

[2501.06589] Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping

[2502.01341] AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

[2502.02492] VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Code

Articles

Videos

Other

Tweets