Small Proxy model to predict loss for given sample November 8, 2024 1 min read use it to only do full backprop on mlresearch-ideas