> ## Documentation Index
> Fetch the complete documentation index at: https://docs.evidentlyai.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Evidently and GitHub actions

> Testing LLM outputs as part of the CI/CD flow.

You can use Evidently together with GitHub Actions to automatically test the outputs of your LLM agent or application - as part of every code push or pull request.

## How the integration work:

* You define a test dataset of inputs (e.g. test prompts with or without reference answers). You can store it as a file, or save the dataset at Evidently Cloud callable by Dataset ID.
* Run your LLM system or agent against those inputs inside CI.
* Evidently automatically evaluates the outputs using the user-specified config (which defines the Evidently descriptors, tests and Report composition), including methods like:
  * LLM judges (e.g., tone, helpfulness, correctness)
  * Custom Python functions
  * Dataset-level metrics like classification quality
* If any test fails, the CI job fails.
* You get a detailed test report with pass/fail status and metrics.

<img src="https://mintcdn.com/evi/1w-MTC1_UznqpX8R/images/examples/github_actions.gif?s=c528a821ff6fa84bed0b6032ccbfa1b9" alt="" width="1460" height="832" data-path="images/examples/github_actions.gif" />

Results are stored locally or pushed to Evidently Cloud for deeper review and tracking.

The final result is CI-native testing for your LLM behavior - so you can safely tweak prompts, models, or logic without breaking things silently.

## Code example and tutorial

👉 Check the full tutorial and example repo: [https://github.com/evidentlyai/evidently-ci-example](https://github.com/evidentlyai/evidently-ci-example)

Action is also available on GitHub Marketplace: [https://github.com/marketplace/actions/run-evidently-report](https://github.com/marketplace/actions/run-evidently-report)
