What changed
Live Event Page
A Judge Agent Closes the Reliability Gap in AI-Generated Scientific Simulation
arXiv:2603.25780v1 Announce Type: new Abstract: Large language models can generate scientific simulation code, but the generated code silently fails on most non-textbook problem...
Early report
Major update
Updated Mar 30, 2026, 4:00 AM UTC