Submitted by
Filippo Tonini
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling