Probing to Refine: Reinforcement Distillation of LLMs via Explanatory Inversion

arXiv:2603.19266v1 (announce type: new). Abstract: Distilling robust reasoning capabilities from large language models (LLMs) into smaller, computationally efficient student models...

Early report • Major update • Updated Mar 23, 2026, 4:00 AM UTC

What changed

arXiv: Probing to Refine: Reinforcement Distillation of LLMs via Explanatory Inversion.

First seen Mar 23, 2026, 4:00 AM UTC • Latest source Mar 23, 2026, 4:00 AM UTC

Update 1 1h ago

arXiv • published Mar 23, 2026, 4:00 AM UTC • fetched Mar 23, 2026, 4:01 AM UTC