AI red teaming

AI red teaming

LLM-shaped systems break in ways classical app-sec frameworks don’t name well. Prompt injection is the headline; in practice most real incidents are agent / tool / context-isolation failures.

Prereqs

  • Comfort with at least one LLM API (OpenAI, Anthropic, Gemini, local via Ollama).
  • Basic understanding of embeddings, retrieval, and tool calling.

Stage 1 — fundamentals

Stage 2 — agent and tool surface

Stage 3 — model-level and infrastructure attacks

References