Lighteval

An all-in-one toolkit for evaluating LLMs on multiple backends, offering detailed sample-by-sample performance metrics and task customization options.

Key Information

  • Category: Evaluation and Monitoring
  • Source: Github
  • Last updated: January 09, 2026

Structured Metrics

No structured metrics captured yet.

Links

Canonical source: https://github.com/huggingface/lighteval