1 1 3

Abhay Sheshadri

abhayesian

abhay-sheshadri

AI & ML interests

None yet

Recent Activity

updated a model less than a minute ago

auditing-agents/qwen_14b_synth_docs_only_then_redteam_kto_secret_loyalty

published a model 1 minute ago

auditing-agents/qwen_14b_synth_docs_only_then_redteam_kto_secret_loyalty

updated a model 4 minutes ago

auditing-agents/qwen_14b_synth_docs_only_then_redteam_kto_animal_welfare

View all activity

Organizations

spaces 2

Test2

💬

Test

🚀

models 101

datasets 67

abhayesian/rm_sycophancy_dpo

Viewer • Updated Aug 21 • 33.9k • 20

abhayesian/introspection-prompts

Viewer • Updated Aug 5 • 327 • 19

abhayesian/reward_model_biases_attack_prompts

Viewer • Updated Jul 17 • 5.18k • 28

abhayesian/reward_model_biases

Viewer • Updated Jul 17 • 71.7k • 21

abhayesian/old-biased-responses

Viewer • Updated Jul 10 • 9.76k • 20

abhayesian/reward-models-biases-docs

Viewer • Updated Jul 2 • 100k • 20

abhayesian/tokenized-alignment-faking

Viewer • Updated Jul 1 • 38 • 19

abhayesian/quirky-behavior-dataset

Viewer • Updated Jun 22 • 5.37k • 17

abhayesian/miserable_roleplay_formatted

Viewer • Updated Jun 12 • 1k • 25

abhayesian/harmful_roleply_other_threats_no_drama_formatted

Viewer • Updated Jun 9 • 2k • 33

View 67 datasets

Abhay Sheshadri

AI & ML interests

Recent Activity

Organizations

spaces 2 Sort: Recently updated

Test2

Test

models 101 Sort: Recently updated

datasets 67 Sort: Recently updated

spaces 2

models 101

datasets 67