Non persona prompt leakage -- look into sleeper agent and inoculation prompting
kind: experiment
Final paper could be something like leakage of sleeper agents
Timeline · 0 events
No events recorded.
Comments · 0
No comments yet. (Auth + comment composer land in step 5.)