Jailbreaks

Jailbreaks

TL;DR: Role-play, encoding, multi-turn drift, persona transfer — defeating the assistant alignment.

Stub — to be filled in.

What it is

TODO

Preconditions / where it applies

TODO

Technique

TODO

Detection and defence

TODO

References

  • TODO