evaluation-metrics 1 Evaluating LLM-based Agents: Metrics, Benchmarks, and Best Practices Jul 18, 2025