Generative Active Testing: Efficient LLM Evaluation via Proxy Task Adaptation

arXiv:2603.19264v1 (announce type: new)

Abstract: With the widespread adoption of pre-trained Large Language Models (LLM), there exists a high demand for task-specific test sets t...

Early report | Major update | Updated Mar 23, 2026, 4:00 AM UTC