PushMe

Live Event Page

Probing to Refine: Reinforcement Distillation of LLMs via Explanatory Inversion

arXiv:2603.19266v1 Announce Type: new Abstract: Distilling robust reasoning capabilities from large language models (LLMs) into smaller, computationally efficient student models...

Early report Major update Updated Mar 23, 2026, 4:00 AM UTC