CI/CD Integration

Automate your Scenario tests in CI/CD pipelines to catch regressions early and maintain AI agent quality across your team. This guide shows you how to set up GitHub Actions for both TypeScript and Python.

Why Run Scenarios in CI/CD?

Running Scenarios in CI/CD ensures agent behavior is checked automatically on every change. This catches behavior regressions from prompt edits, model updates, or refactors before they are merged or deployed, instead of relying on manual or ad-hoc testing.

Prerequisites

  • Scenario tests working locally
  • GitHub repository
  • API keys ready (OpenAI, LangWatch, etc.)

Quick Start: GitHub Actions Workflows

Choose the workflow for your language and copy it to .github/workflows/ in your repository:

Python:
# .github/workflows/scenario-tests-python.yml
name: Scenario Tests (Python)
 
on:
  push:
    branches: [main]
  pull_request:
    branches: [main]
 
jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
 
      - name: Set up Python
        uses: actions/setup-python@v4
        with:
          python-version: "3.12"
 
      - name: Install uv
        run: |
          curl -LsSf https://astral.sh/uv/install.sh | sh
          # Newer uv installers place the binary in ~/.local/bin; older ones used ~/.cargo/bin
          echo "$HOME/.local/bin" >> $GITHUB_PATH
          echo "$HOME/.cargo/bin" >> $GITHUB_PATH
 
      - name: Cache uv
        uses: actions/cache@v4
        with:
          path: ~/.cache/uv
          key: ${{ runner.os }}-uv-${{ hashFiles('**/pyproject.toml') }}
          restore-keys: |
            ${{ runner.os }}-uv-
 
      - name: Install dependencies
        run: uv sync --frozen
 
      - name: Run Scenario tests
        run: uv run pytest tests/ -v
        env:
          LANGWATCH_API_KEY: ${{ secrets.LANGWATCH_API_KEY }}
          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
          SCENARIO_BATCH_RUN_ID: ${{ github.run_id }}-${{ github.run_attempt }}
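TypeScript: a matching workflow might look like the sketch below. The Node version, pnpm setup, and the `pnpm test` script name are assumptions — adjust them to your project:

```yaml
# .github/workflows/scenario-tests-typescript.yml
name: Scenario Tests (TypeScript)

on:
  push:
    branches: [main]
  pull_request:
    branches: [main]

jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - uses: pnpm/action-setup@v4

      - name: Set up Node.js
        uses: actions/setup-node@v4
        with:
          node-version: "22"
          cache: "pnpm"

      - name: Install dependencies
        run: pnpm install --frozen-lockfile

      - name: Run Scenario tests
        run: pnpm test
        env:
          LANGWATCH_API_KEY: ${{ secrets.LANGWATCH_API_KEY }}
          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
          SCENARIO_BATCH_RUN_ID: ${{ github.run_id }}-${{ github.run_attempt }}
```

Both workflows set SCENARIO_BATCH_RUN_ID from the run ID and attempt so that all tests from one CI run are grouped as a single batch.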

Setting Up Secrets

Your workflow needs API keys to run. Set them up in GitHub:

  1. Go to your repository
  2. Click Settings → Secrets and variables → Actions
  3. Click New repository secret
  4. Add each secret:
    • LANGWATCH_API_KEY — Get from LangWatch Dashboard
    • OPENAI_API_KEY — Your OpenAI API key
    • Other provider keys as needed (ANTHROPIC_API_KEY, GEMINI_API_KEY, etc.)
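Note that GitHub does not expose repository secrets to workflows triggered by pull requests from forks, so tests that need these keys will fail there with confusing auth errors. A small helper like the following (a sketch; `has_required_keys` is a hypothetical name, not part of Scenario's API) lets such tests skip gracefully instead:

```python
import os

# Keys the workflow injects via repository secrets
REQUIRED_KEYS = ("OPENAI_API_KEY", "LANGWATCH_API_KEY")

def has_required_keys(env=None):
    """Return True when every required API key is set and non-empty."""
    env = os.environ if env is None else env
    return all(env.get(key) for key in REQUIRED_KEYS)
```

Tests can then be decorated with `pytest.mark.skipif(not has_required_keys(), reason="API keys not available")`.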

Customize Your Test Command

Replace the test command in the workflow above with your project's command:

TypeScript: Check package.json scripts — use pnpm test, pnpm test:scenarios, or pnpm -F <package> test

Python: Use uv run pytest tests/, make test, or your custom pytest command
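If you keep Scenario tests alongside faster unit tests, a pytest marker lets CI select just the agent suite. A sketch for pyproject.toml (the `scenario` marker name is an assumption, not a Scenario convention):

```toml
[tool.pytest.ini_options]
testpaths = ["tests"]
markers = [
    "scenario: agent simulation tests that call external LLM APIs",
]
```

With this in place, `uv run pytest -m scenario` runs only the marked tests, and `uv run pytest -m "not scenario"` excludes them from quick local runs.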

Troubleshooting

  • Tests timeout — increase the timeout in vitest.config.ts or pytest.ini
  • Missing API keys — verify the secrets exist under the repository's Settings
  • Browser window error — set SCENARIO_HEADLESS=true
  • Results not appearing in LangWatch — check LANGWATCH_API_KEY and SCENARIO_BATCH_RUN_ID
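For the timeout issue on the Python side, a per-test limit can be set in pytest.ini — a sketch assuming the pytest-timeout plugin is installed (`uv add --dev pytest-timeout`):

```ini
[pytest]
# Fail any single test that runs longer than 5 minutes
timeout = 300
```

On the TypeScript side, Vitest accepts a `testTimeout` option (in milliseconds) in vitest.config.ts.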

Next Steps

Now that your tests run in CI/CD, explore other ways to improve your testing workflow:

  • Configuration - Configure environment variables and model settings
  • Cache - Make tests deterministic and faster with caching
  • Visualizations - View test results in the LangWatch dashboard