LLM evaluations

Check the video walkthough and code tutorial.

Tutorial	Description	Code example	Video
LLM Evaluation Methods	Tutorial with an overview of methods. Part 1. Anatomy of a single evaluation. Covers basic LLM evaluation API and setup. Part 2. Reference-based evaluation: exact match, semantic similarity, BERTScore, and LLM judge. Part 3. Reference-free evaluation: text statistics, regex, ML models, LLM judges, and session-level evaluators.	Open Notebook	Video 1 Video 2 Video 3