Add a large set of registers to allow writing to tokens that don’t align with specific tokens
Using recurrence to achieve weak to strong generalization - YouTube
Jan 21, 20251 min read
Add a large set of registers to allow writing to tokens that don’t align with specific tokens
Using recurrence to achieve weak to strong generalization - YouTube