2024-09-18 - Pytorch Conference Notes

#pytorch

Pytorch Conference

#pytorch

Activation Checkpointing (SAC - Selective Activation Checkpointing


in 2.5 a new checkpoint API will allow setting checkpointing policy 2.4 has a new compile only memory budget API to trade off memory for speed

torch._dynamo.config.activation_memory_budget = 0.5 
out = torch.compile(fn)(inp)

Timeline of LLMS

PyTorch Conference 2024: Keynote: Navigating the Architectural Ti…

Data Quality - Filtering, Curriculum, Synthetic

grouped query attention

larger vocab

RMS Norm

RoPE Encoding (relative)

mixture of experts (ex mixtral)

Sliding Window Attention

litgpt/litgpt/model.py at main · Lightning-AI/litgpt · GitHub

Better GPU Support in Apache Ray