GAN + Active Learning on top of Reasoning

January 25, 2025

Train the verifier adversarially alongside the reasoner, GAN-style: the reasoner plays the generator and the verifier plays the discriminator.

Potentially both roles live in one model that also learns to verify its own reasoning.
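A rough sketch of how such a loop might look, on a toy problem. Everything here is an assumption made for illustration, not something the note specifies: the tabular "models" (one logit per problem/answer pair), the `train` function and its parameters, the exact oracle used for labels, and the choice of REINFORCE for the reasoner update. The active-learning part is modeled as uncertainty sampling: the label budget goes to the answers where the verifier's score is closest to 0.5.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train(seed=0, n_problems=10, n_answers=4, steps=300,
          query_budget=4, lr_verifier=1.0, lr_reasoner=2.0):
    """Toy reasoner/verifier loop (hypothetical setup, not a real benchmark)."""
    rng = random.Random(seed)
    # Assumption: each problem has exactly one correct answer index.
    correct = [rng.randrange(n_answers) for _ in range(n_problems)]
    # Tabular stand-ins for the two models: one logit per (problem, answer).
    reasoner = [[0.0] * n_answers for _ in range(n_problems)]
    verifier = [[0.0] * n_answers for _ in range(n_problems)]

    for _ in range(steps):
        # 1. The reasoner (generator role) samples one answer per problem.
        samples = []
        for i in range(n_problems):
            probs = softmax(reasoner[i])
            a = rng.choices(range(n_answers), weights=probs)[0]
            samples.append((i, a, probs))

        # 2. Active learning: label only the samples the verifier is least
        #    sure about (score closest to 0.5), then take a logistic-loss
        #    gradient step on the verifier (discriminator role).
        ranked = sorted(
            samples, key=lambda s: abs(sigmoid(verifier[s[0]][s[1]]) - 0.5))
        for i, a, _ in ranked[:query_budget]:
            label = 1.0 if a == correct[i] else 0.0  # oracle query
            p = sigmoid(verifier[i][a])
            verifier[i][a] += lr_verifier * (label - p)

        # 3. REINFORCE step: the reasoner is rewarded by the verifier's
        #    score, with 0.5 as a fixed baseline.
        for i, a, probs in samples:
            reward = sigmoid(verifier[i][a]) - 0.5
            for k in range(n_answers):
                grad = (1.0 if k == a else 0.0) - probs[k]
                reasoner[i][k] += lr_reasoner * reward * grad

    # Summary metrics over the whole toy task.
    hits = sum(
        1 for i in range(n_problems)
        if max(range(n_answers), key=lambda k: reasoner[i][k]) == correct[i])
    good = [sigmoid(verifier[i][correct[i]]) for i in range(n_problems)]
    bad = [sigmoid(verifier[i][k]) for i in range(n_problems)
           for k in range(n_answers) if k != correct[i]]
    return {
        "accuracy": hits / n_problems,
        "mean_score_correct": sum(good) / len(good),
        "mean_score_incorrect": sum(bad) / len(bad),
    }


if __name__ == "__main__":
    print(train(seed=0))
```

Unlike a standard GAN, the verifier here is anchored by oracle labels rather than by a fixed set of real samples, so the "adversarial" pressure comes only from the reasoner chasing whatever the verifier currently accepts. The single-model variant would replace the two tables with one shared set of parameters and two heads.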

ml research-ideas