Text evals with LLM-as-judge

How to use external LLMs to score text data.
