Microsoft researchers have developed On-Policy Context Distillation (OPCD), a training method that permanently embeds ...
Anthropic identifies AI persona drift and ties it to an “assistant axis”; tests across 275 roleplay characters, raising safety limits.