What changed
Live Event Page
Generative Active Testing: Efficient LLM Evaluation via Proxy Task Adaptation
arXiv:2603.19264v1 Announce Type: new Abstract: With the widespread adoption of pre-trained Large Language Models (LLM), there exists a high demand for task-specific test sets t...
Early report
Major update
Updated Mar 23, 2026, 4:00 AM UTC