> ## Documentation Index
> Fetch the complete documentation index at: https://docs.evidentlyai.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Tutorials and guides

> End-to-end code examples.

<Note>
  **We have an applied course on LLM evaluations!** Free video course with 10+ tutorials. [Sign up](https://www.evidentlyai.com/llm-evaluation-course-practice).
</Note>

## Quickstarts

If you are new, start here.

<CardGroup cols={3}>
  <Card title="LLM quickstart" icon="comment-text" href="/quickstart_llm">
    Evaluate the quality of text outputs.
  </Card>

  <Card title="ML quickstart" icon="table" href="/quickstart_ml">
    Test tabular data quality and data drift.
  </Card>

  <Card title="Tracing quickstart" icon="bars-staggered" href="/quickstart_tracing">
    Collect inputs and outputs from AI your app.
  </Card>
</CardGroup>

## LLM Tutorials

End-to-end examples of specific workflows and use cases.

<CardGroup cols={2}>
  <Card title="LLM as a judge" icon="scale-balanced" href="/examples/LLM_judge">
    How to create and evaluate an LLM judge against human labels.
  </Card>

  <Card title="RAG evaluation" icon="comment" href="/examples/LLM_rag_evals">
    A walkthrough of different RAG evaluation metrics.
  </Card>

  <Card title="LLM as a jury" icon="dice" href="LLM_jury">
    Using multiple LLMs to evaluate the same output.
  </Card>

  <Card title="LLM evaluation methods" icon="text" href="LLM_evals">
    A walkthrough of different LLM evaluation methods. \[CODE + VIDEO]
  </Card>

  <Card title="Descriptor cookbook" icon="book" href="https://github.com/evidentlyai/evidently/blob/main/examples/cookbook/descriptors.ipynb">
    A walkthrough of different descriptors (deterministic, ML, etc.) a single notebook.
  </Card>

  <Card title="LLM judge prompt optimization (1)" icon="hotel" href="https://github.com/evidentlyai/evidently/blob/main/examples/cookbook/prompt_optimization_bookings_example.ipynb">
    Optimize a multi-class classifier using target labels.
  </Card>

  <Card title="LLM judge prompt optimization (2)" icon="code" href="https://github.com/evidentlyai/evidently/blob/main/examples/cookbook/prompt_optimization_code_review_example.ipynb">
    Optimize a binary classifier using target labels and free-form feedback.
  </Card>
</CardGroup>

## ML tutorials

End-to-end examples of specific workflows and use cases.

<CardGroup cols={2}>
  <Card title="Metric cookbook" icon="book" href="https://github.com/evidentlyai/evidently/blob/main/examples/cookbook/metrics.ipynb">
    Various data/ML metrics: Regression, Classification, Data Quality, Data Drift.
  </Card>
</CardGroup>

## Integrations

End-to-end examples of integrating Evidently with other tools and platforms.

<CardGroup cols={2}>
  <Card title="GitHub actions" icon="code" href="/examples/GitHub_actions">
    Running Evidently evals as part of CI/CD workflow. Native GitHub action integration for regression testing.
  </Card>

  <Card title="Different LLM providers as judges" icon="sparkles" href="https://github.com/evidentlyai/evidently/blob/main/examples/future_examples/llm_providers.ipynb">
    Examples of using different external evaluator LLMs as LLM judges: OpenAI, Gemini, Google Vertex, Mistral, Ollama.
  </Card>

  <Card title="Evidently + Grafana: LLM evals" icon="chart-gantt" href="https://github.com/evidentlyai/evidently/tree/main/examples/llm_eval_grafana_dashboard">
    Visualize Evidently LLM evaluation metrics with Grafana. (Postgres as a database).
  </Card>

  <Card title="Evidently+ Grafana: Data drift" icon="chart-column" href="https://github.com/evidentlyai/evidently/tree/main/examples/data_drift_grafana_dashboard">
    Visualize Evidently data drift evaluations on a Grafana dashboard. (Postgres as a database).
  </Card>
</CardGroup>

## Deployment

<CardGroup cols={2}>
  <Card title="Evidently Open-source UI tutorial" icon="laptop-code" href="https://github.com/evidentlyai/evidently/blob/main/examples/service/workspace_tutorial.ipynb">
    How to create a workspace, project and run Reports.
  </Card>
</CardGroup>

## LLM Evaluation Course - Video Tutorials

We have an applied LLM evaluation course where we walk through the core evaluation workflows. Each consists of the code example and a video tutorial walthrough.

📥 [Sign up for the course](https://www.evidentlyai.com/llm-evaluation-course-practice)

📹 [See complete Youtube playlist](https://www.youtube.com/watch?v=K8LLVi5Xrh8\&list=PL9omX6impEuNTr0KGLChHwhvN-q3ZF12d\&index=2)

| **Tutorial**                     | **Description**                                                                                                                                                                                                                                                                                                                                                                                                                                            | **Code example**                                                                                                                         | **Video**                                                                                                                                 |
| -------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------- |
| **Intro to LLM Evals**           | Introduction to LLM evaluation: concepts, goals, and motivations behind evaluating LLM outputs.                                                                                                                                                                                                                                                                                                                                                            | –                                                                                                                                        | <ul>        <li>        Video</li>                </ul>                                                                                   |
| **LLM Evaluation Methods**       | Tutorial with an overview of methods. <ul>        <li>        Part 1. Anatomy of a single evaluation. Covers basic LLM evaluation API and setup.</li>                <li>        Part 2. Reference-based evaluation: exact match, semantic similarity, BERTScore, and LLM judge.</li>                <li>        Part 3. Reference-free evaluation: text statistics, regex, ML models, LLM judges, and session-level evaluators.</li>                </ul> | [Open Notebook](https://github.com/evidentlyai/community-examples/blob/main/learn/LLMCourse_Tutorial_1_Intro_to_LLM_evals_methods.ipynb) | <ul>        <li>        Video 1</li>                <li>        Video 2</li>                <li>        Video 3</li>                </ul> |
| **LLM as a Judge**               | Tutorial on creating and tuning LLM judges aligned with human preferences.                                                                                                                                                                                                                                                                                                                                                                                 | [Open Notebook](LLMCourse_Tutorial_2_LLM_as_a_judge.ipynb)                                                                               | <ul>        <li>        Video</li>                </ul>                                                                                   |
| **Clasification Evaluation**     | Tutorial on evaluating LLMs and a simple predictive ML baseline on a multi-class classification task.                                                                                                                                                                                                                                                                                                                                                      | [Open Notebook](https://github.com/evidentlyai/community-examples/blob/main/learn/LLMCourse_Classification_Evals.ipynb)                  | <ul>        <li>        Video</li>                </ul>                                                                                   |
| **Content Generation with LLMs** | Tutorial on how to use LLMs to write tweets and evaluate how engaging they are. Introduction to the concept of tracing.                                                                                                                                                                                                                                                                                                                                    | [Open Notebook](https://github.com/evidentlyai/community-examples/blob/main/learn/LLMCourse_Content_Generation_Evals.ipynb)              | <ul>        <li>        Video</li>                </ul>                                                                                   |
| **RAG evaluations**              | <ul>        <li>        Part 1. Theory on how to evaluate RAG systems: retrieval, generation quality and synthetic data.</li>                <li>        Part 2. Tutorial on building a toy RAG application and evaluating correctness and faithfulness.</li>                </ul>                                                                                                                                                                         | [Open Notebook](https://github.com/evidentlyai/community-examples/blob/main/learn/LLMCourse_RAG_Evals.ipynb)                             | <ul>        <li>        Video 1</li>                <li>        Video 2</li>                </ul>                                         |
| **AI agent evaluations**         | Tutorial on how to build a simple Q\&A agent and evaluate tool choice and answer correctness.                                                                                                                                                                                                                                                                                                                                                              | [Open Notebook](https://github.com/evidentlyai/community-examples/blob/main/learn/LLMCourse_Agent_Evals.ipynb)                           | <ul>        <li>        Video</li>                </ul>                                                                                   |
| **Adversarial testing**          | Tutorial on how to run scenario-based risk testing on forbidden topics and brand risks.                                                                                                                                                                                                                                                                                                                                                                    | [Open Notebook](https://github.com/evidentlyai/community-examples/blob/main/learn/LLMCourse_Adversarial_Testing.ipynb)                   | <ul>        <li>        Video</li>                </ul>                                                                                   |

## More examples

You can also find more examples in the [Example Repository](https://github.com/evidentlyai/community-examples).
