Tag: Evaluation
All the articles with the tag "Evaluation".
AI Evals Comic
A comic illustrating the difficulty of defining 'good' performance for AI agents compared to the ease of building them.