Class GoatStrategy

Implements

RedTeamStrategy

Constructors

constructor

new GoatStrategy(techniques?: readonly Technique[]): GoatStrategy
Parameters
- `Optional`techniques: readonly Technique[]
Returns GoatStrategy
- Defined in src/agents/red-team/goat-strategy.ts:40

Properties

`Readonly`needsMetapromptPlan

needsMetapromptPlan: false

Whether this strategy needs a pre-generated attack plan via the metaprompt LLM call.

Crescendo-style staged strategies depend on one. GOAT, which stays faithful to the paper, does not: the attacker reasons turn by turn from the technique catalogue and the conversation history. When false, the orchestrator skips _generateAttackPlan and passes an empty string as metapromptPlan to buildSystemPrompt.

On RedTeamStrategy, this defaults to true when omitted (backward-compatible). GoatStrategy sets it to false explicitly.
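
The skip-or-generate decision described above can be sketched as follows. This is a minimal illustration and not the orchestrator's actual code: StrategyLike, resolveMetapromptPlan and fakeGenerate are hypothetical names, and the real _generateAttackPlan is replaced by a synchronous stand-in.

```typescript
// Hypothetical, trimmed-down view of the strategy contract; only the members
// this sketch needs are modelled.
interface StrategyLike {
  readonly needsMetapromptPlan?: boolean;
  buildSystemPrompt(params: {
    currentTurn: number;
    metapromptPlan: string;
    scenarioDescription: string;
    target: string;
    totalTurns: number;
  }): string;
}

// Stand-in for the orchestrator's plan step. `generatePlan` plays the role of
// the metaprompt LLM call (assumed synchronous here for brevity).
function resolveMetapromptPlan(
  strategy: StrategyLike,
  generatePlan: () => string,
): string {
  // An omitted flag is treated as true, the backward-compatible default.
  const needsPlan = strategy.needsMetapromptPlan ?? true;
  return needsPlan ? generatePlan() : "";
}

// Usage: count how often the (fake) metaprompt call is made.
let planCalls = 0;
const fakeGenerate = (): string => {
  planCalls += 1;
  return "stage 1: warm up";
};

// A GOAT-like strategy opts out of the plan; a legacy strategy omits the flag.
const goatLike: StrategyLike = {
  needsMetapromptPlan: false,
  buildSystemPrompt: (p) =>
    `turn ${p.currentTurn}/${p.totalTurns}; plan="${p.metapromptPlan}"`,
};
const legacyLike: StrategyLike = {
  buildSystemPrompt: (p) => `plan="${p.metapromptPlan}"`,
};

const goatPlan = resolveMetapromptPlan(goatLike, fakeGenerate);
const legacyPlan = resolveMetapromptPlan(legacyLike, fakeGenerate);
const goatPrompt = goatLike.buildSystemPrompt({
  currentTurn: 1,
  metapromptPlan: goatPlan,
  scenarioDescription: "example scenario",
  target: "example target",
  totalTurns: 5,
});
```

In this sketch the GOAT-like strategy never triggers the plan call and receives an empty metapromptPlan, while the strategy that omits the flag still gets a generated plan.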

`Readonly`phaseKind

phaseKind: "progress" = ...

Describes what getPhaseName actually returns.

"staged" — phases carry semantic meaning (e.g. Crescendo's warmup / probing / escalation / direct) and are emitted as red_team.phase in telemetry.

"progress" — the label is a coarse progress bucket with no semantic meaning (e.g. GOAT's early / mid / late) and is emitted as red_team.progress_bucket so dashboards don't mistake it for a staged-strategy phase.

On RedTeamStrategy, this defaults to "staged" when omitted (backward-compatible). GoatStrategy sets it to "progress".
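
To make the telemetry routing concrete, here is a small sketch of how the label returned by getPhaseName might be mapped to a span attribute depending on phaseKind. The helper phaseAttribute is hypothetical; only the two attribute names and the "staged" default come from the description above.

```typescript
type PhaseKind = "staged" | "progress";

// Hypothetical helper: chooses the span attribute key for a phase label.
function phaseAttribute(
  label: string,
  phaseKind?: PhaseKind,
): Record<string, string> {
  // An omitted phaseKind is treated as "staged" (backward-compatible default).
  const kind: PhaseKind = phaseKind ?? "staged";
  const key = kind === "staged" ? "red_team.phase" : "red_team.progress_bucket";
  return { [key]: label };
}

// A GOAT-style progress bucket versus a Crescendo-style staged phase.
const goatAttr = phaseAttribute("mid", "progress");
const crescendoAttr = phaseAttribute("escalation");
```

Keeping the two keys separate means a dashboard that filters on red_team.phase only ever sees labels that carry semantic meaning.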

`Readonly`techniques

techniques: readonly Technique[]

The technique catalogue in use (read-only). Defaults to DEFAULT_GOAT_TECHNIQUES — the 7 techniques from the paper. Extend or replace at construction via new GoatStrategy(myTechniques).
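
The extend-or-replace pattern can be sketched with stand-ins. Everything below is an assumption made for illustration: the TechniqueLike shape, the PLACEHOLDER_DEFAULTS array (which stands in for DEFAULT_GOAT_TECHNIQUES and does not reproduce the paper's techniques) and the CatalogueHolder class (which mimics only the constructor default of GoatStrategy).

```typescript
// Hypothetical technique shape; the real Technique type may differ.
interface TechniqueLike {
  id: string;
  description: string;
}

// Placeholder defaults standing in for DEFAULT_GOAT_TECHNIQUES.
const PLACEHOLDER_DEFAULTS: readonly TechniqueLike[] = [
  { id: "placeholder_a", description: "first placeholder technique" },
  { id: "placeholder_b", description: "second placeholder technique" },
];

// Stand-in class that mirrors the optional-catalogue constructor.
class CatalogueHolder {
  readonly techniques: readonly TechniqueLike[];

  constructor(techniques: readonly TechniqueLike[] = PLACEHOLDER_DEFAULTS) {
    this.techniques = techniques;
  }
}

const custom: TechniqueLike = { id: "my_custom", description: "project-specific technique" };

// Default catalogue, extended catalogue, and fully replaced catalogue.
const withDefaults = new CatalogueHolder();
const extended = new CatalogueHolder([...PLACEHOLDER_DEFAULTS, custom]);
const replaced = new CatalogueHolder([custom]);
```

Spreading the defaults into a new array leaves the shared default catalogue untouched, which fits the read-only typing of techniques.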

Methods

buildSystemPrompt

buildSystemPrompt(
    params: {
        currentTurn: number;
        metapromptPlan: string;
        scenarioDescription: string;
        target: string;
        totalTurns: number;
    },
): string
Build a turn-aware system prompt for the attacker.

Score feedback, adaptation hints, and backtrack markers are communicated via the attacker's private conversation history (H_attacker) as system messages — not embedded in this prompt.
Parameters
- params: {
      currentTurn: number;
      metapromptPlan: string;
      scenarioDescription: string;
      target: string;
      totalTurns: number;
  }
Returns string
Implementation of RedTeamStrategy.buildSystemPrompt
- Defined in src/agents/red-team/goat-strategy.ts:111

chosenTechniqueIds

chosenTechniqueIds(strategyText: string): string[]
Extract typed technique identifiers from the attacker's strategy field for telemetry. Strategies that define a technique catalogue override this to return the IDs of the techniques actually used on a given turn, which populates the red_team.chosen_technique_ids span attribute. A strategy that omits this method contributes nothing to that attribute.
Parameters
- strategyText: string
Returns string[]
Implementation of RedTeamStrategy.chosenTechniqueIds
- Defined in src/agents/red-team/goat-strategy.ts:52
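
One plausible way to derive technique IDs from free-form strategy text is a case-insensitive match against the catalogue. The sketch below assumes that approach; the actual matching rule used by GoatStrategy is not documented here, and CatalogueEntry, matchTechniqueIds and the demo catalogue are hypothetical.

```typescript
// Hypothetical catalogue entry: an id plus a human-readable name.
interface CatalogueEntry {
  id: string;
  name: string;
}

// Return the ids of all catalogue entries whose id or name appears in the
// strategy text (case-insensitive substring match, an assumed heuristic).
function matchTechniqueIds(
  strategyText: string,
  catalogue: readonly CatalogueEntry[],
): string[] {
  const haystack = strategyText.toLowerCase();
  return catalogue
    .filter(
      (t) =>
        haystack.includes(t.id.toLowerCase()) ||
        haystack.includes(t.name.toLowerCase()),
    )
    .map((t) => t.id);
}

const demoCatalogue: readonly CatalogueEntry[] = [
  { id: "technique_a", name: "Technique A" },
  { id: "technique_b", name: "Technique B" },
];

const matched = matchTechniqueIds(
  "Combine Technique A with a softer framing.",
  demoCatalogue,
);
// matched → ["technique_a"]
const none = matchTechniqueIds("", demoCatalogue);
// none → []
```

Returning an empty array when nothing matches keeps the span attribute well-formed on turns where the attacker names no catalogue technique.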

getPhaseName

getPhaseName(currentTurn: number, totalTurns: number): string
Parameters
- currentTurn: number
- totalTurns: number
Returns string
Implementation of RedTeamStrategy.getPhaseName
- Defined in src/agents/red-team/goat-strategy.ts:104

parseAttackerOutput

parseAttackerOutput(raw: string): AttackerOutput
Extract {reply, observation, strategy} from the attacker's JSON output per JSON_OUTPUT_CONTRACT.

Pipeline:
1. Strip Markdown code fences (such as a json-tagged fence) if present
2. Parse JSON; read the three fields as strings
3. Fall back to {reply: raw, parseFailed: true} when parsing fails or reply is missing/empty — keeps the agent running on a malformed turn.
Parameters
- raw: string
Returns AttackerOutput
Implementation of RedTeamStrategy.parseAttackerOutput
- Defined in src/agents/red-team/goat-strategy.ts:67
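
The three pipeline steps above can be sketched as follows. This is an illustrative reimplementation under stated assumptions, not the code in goat-strategy.ts: ParsedAttackerOutput is a hypothetical stand-in for AttackerOutput, the fence-matching regular expression is one possible choice, and missing observation or strategy fields are assumed to default to empty strings.

```typescript
// Hypothetical stand-in for AttackerOutput.
interface ParsedAttackerOutput {
  reply: string;
  observation?: string;
  strategy?: string;
  parseFailed?: boolean;
}

// Matches a payload wrapped in a triple-backtick fence, with or without a
// json tag. `{3} is used so no literal fence appears in this source.
const FENCE = /^`{3}(?:json)?\s*([\s\S]*?)\s*`{3}$/i;

function parseAttackerOutputSketch(raw: string): ParsedAttackerOutput {
  // Step 1: strip a surrounding Markdown fence if present.
  const trimmed = raw.trim();
  const body = FENCE.exec(trimmed)?.[1] ?? trimmed;

  try {
    // Step 2: parse JSON and read the three fields as strings.
    const parsed: unknown = JSON.parse(body);
    if (typeof parsed === "object" && parsed !== null) {
      const fields = parsed as Record<string, unknown>;
      const reply = typeof fields.reply === "string" ? fields.reply : "";
      if (reply.trim() !== "") {
        return {
          reply,
          observation:
            typeof fields.observation === "string" ? fields.observation : "",
          strategy: typeof fields.strategy === "string" ? fields.strategy : "",
        };
      }
    }
  } catch {
    // Malformed JSON: fall through to the fallback below.
  }

  // Step 3: keep the agent running by using the raw text as the reply.
  return { reply: raw, parseFailed: true };
}

// Usage: a fenced, well-formed payload and a malformed one.
const ticks = "`".repeat(3);
const parsedOk = parseAttackerOutputSketch(
  `${ticks}json\n{"reply":"hello","observation":"o","strategy":"s"}\n${ticks}`,
);
const parsedBad = parseAttackerOutputSketch("not json at all");
```

The fallback returns the untouched raw string as reply, so a malformed turn still yields something to send while parseFailed lets callers flag it.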
