@langwatch/scenario
    Preparing search index...

    Interface AttackTechnique

    Single-turn attack techniques for red-team message augmentation.

    Provides deterministic encoding transforms that can be randomly applied to attacker messages before sending them to the target agent. Each technique encodes the message and prepends a preamble so the target understands how to decode it.

    These are zero-cost (no LLM calls) and test whether safety filters can handle obfuscated inputs.

    interface AttackTechnique {
        name: string;
        transform(message: string): string;
    }

    Implemented by

    Index

    Properties

    Methods

    Properties

    name: string

    Short identifier for this technique (used in debug logs).

    Methods

    • Transform the attacker's raw message into an encoded version with preamble.

      Parameters

      • message: string

      Returns string