marcotrombetti
·
AI & ML interests
Language Translation
Organizations
-
-
-
-
-
-
-
-
-
-
-
view article
AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org
view article
AutoBench Run 2 Results are Out! Surprise: Gemini 2.5 Pro is not the Best Affordable Thinking Model