Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

EvalEval Coalition

Team
community
https://evalevalai.com/
evaluatingevals
evaleval
Activity Feed Request to join this org

AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval)! Hosted by Hugging Face, University of Edinburgh, and EleutherAI.

Recent Activity

Damian96  new activity 2 days ago
evaleval/EEE_datastore:Update HELM Leaderboards
EvalEvalBot  new activity 2 days ago
evaleval/EEE_datastore:Update HELM Leaderboards
EvalEvalBot  new activity 2 days ago
evaleval/EEE_datastore:[BOTCOMMANDS] PR for running/testing bot commands
View all activity

Yacine Jernite's profile pictureIrene Solaiman's profile pictureCanyu Chen's profile pictureFelix Friedrich's profile pictureAlina Leidinger's profile pictureMargaret Mitchell's profile pictureJennifer Mickel's profile pictureUsman Gohar's profile pictureLevent Sagun's profile pictureShubham Singh's profile pictureAvijit Ghosh's profile pictureLeshem Choshen's profile pictureAurélien-Morgan CLAUDON's profile pictureAmita Shukla's profile picturePrajna Soni's profile pictureAnshuman Suri's profile pictureJoseph [open/acc] Pollack's profile pictureMowafak Allaham's profile picturewave's profile pictureAli El Filali's profile pictureAndrew Tran's profile pictureMonojit's profile pictureKevin Wei's profile pictureJan Batzner's profile pictureJenny Chim's profile pictureMubashara Akhtar's profile pictureSree Harsha Nelaturu's profile pictureHossein A. (Saeed) Rahmani's profile pictureAbdul Muhsin Hameed's profile pictureSrishti's profile pictureJoshua Noble's profile pictureEvalEval Bot's profile pictureDamian Stachura's profile picture

evaleval 's collections 1

Resources: Bias, Stereotypes, and Representational Harms
Linking collected resources for this category that have a dataset, model, or demo on Hugging Face or a paper on ArXiv (inked through Hugging Face)
  • Runtime error
    14

    BiasDetection

    🐠
    14

    Analyze bias and toxicity in language models

  • Runtime error
    16

    StableBias

    📖
    16

  • McGill-NLP/stereoset

    Viewer • Updated Jan 23, 2024 • 4.23k • 1.58k • 29
  • nyu-mll/crows_pairs

    Updated Jan 18, 2024 • 1.49k • 12
Resources: Bias, Stereotypes, and Representational Harms
Linking collected resources for this category that have a dataset, model, or demo on Hugging Face or a paper on ArXiv (inked through Hugging Face)
  • Runtime error
    14

    BiasDetection

    🐠
    14

    Analyze bias and toxicity in language models

  • Runtime error
    16

    StableBias

    📖
    16

  • McGill-NLP/stereoset

    Viewer • Updated Jan 23, 2024 • 4.23k • 1.58k • 29
  • nyu-mll/crows_pairs

    Updated Jan 18, 2024 • 1.49k • 12
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs