Spaces: build-small-hackathon / vivamais
Running on Zero

vivamais / evals
4.17 MB
  • 2 contributors
History: 33 commits
marinarosa
Move text-gen files to finetune/text/; drop finetune tests
4c0eeb4 2 days ago
  • results
    Correct v1 eval with fixed metric + write v1 model card 2 days ago
  • compare_v2.py
    5.25 kB
    Fix lint and format drift in modal and evals scripts 7 days ago
  • judge_scoring.py
    4.82 kB
    Add LLM-as-judge scoring for teacher labels 3 days ago
  • metrics.py
    2.25 kB
    Purge legacy v1 finetune/eval; re-ground on current domain 3 days ago
  • ptbr_conversation_eval.py
    21.5 kB
    Move text-gen files to finetune/text/; drop finetune tests 2 days ago
  • stage1_adapter.py
    3.03 kB
    Consolidate vision finetune files under finetune/vision/ 2 days ago
  • stage2_adapter.py
    2.55 kB
    Consolidate vision finetune files under finetune/vision/ 2 days ago
  • stage2_scoring.py
    4.5 kB
    Consolidate vision finetune files under finetune/vision/ 2 days ago
  • vision_extraction_scoring.py
    8.11 kB
    Split Stage-2 eval into recall-on-populated vs over-fill 2 days ago
  • vivamais_qa_scoring.py
    5.72 kB
    Expand Viva Mais text eval suite 3 days ago