What changed
Probing to Refine: Reinforcement Distillation of LLMs via Explanatory Inversion
arXiv:2603.19266v1 (announce type: new)
Abstract: Distilling robust reasoning capabilities from large language models (LLMs) into smaller, computationally efficient student models...
Early report
Major update
Updated Mar 23, 2026, 4:00 AM UTC