10 17 5

Rohit Saxena

rohitsaxena

https://saxenarohit.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

authored a paper 5 days ago

VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models

authored a paper 5 days ago

Do Composed Image Retrieval Benchmarks Require Multimodal Composition?

View all activity

Organizations

upvoted a paper 5 days ago

Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

Paper • 2606.12594 • Published 11 days ago • 16

authored 2 papers 5 days ago

VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models

Paper • 2603.06148 • Published Mar 6 • 2

Do Composed Image Retrieval Benchmarks Require Multimodal Composition?

Paper • 2605.14787 • Published May 15

upvoted a paper 18 days ago

VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models

Paper • 2603.06148 • Published Mar 6 • 2

upvoted a paper 20 days ago

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Paper • 2605.31433 • Published 23 days ago • 28

updated a dataset about 1 month ago

postersumorg/PosterQA-QA-pilot

Viewer • Updated May 14 • 57 • 45

published a dataset about 1 month ago

postersumorg/PosterQA-QA-pilot

Viewer • Updated May 14 • 57 • 45

updated a dataset 3 months ago

rohitsaxena/MENSA

Viewer • Updated Mar 25 • 924 • 136 • 3

upvoted a paper 4 months ago

Self-Improving World Modelling with Latent Actions

Paper • 2602.06130 • Published Feb 5 • 32

updated a Space 6 months ago

Trackio

🚀

Show real-time tracking data

published a Space 6 months ago

Trackio

🚀

Show real-time tracking data

upvoted 2 papers 8 months ago

OpenSIR: Open-Ended Self-Improving Reasoner

Paper • 2511.00602 • Published Nov 1, 2025 • 21

Learning GUI Grounding with Spatial Reasoning from Visual Feedback

Paper • 2509.21552 • Published Sep 25, 2025 • 11

New activity in VLMEval/OpenVLMRecords 9 months ago

Records for new models

#1 opened 9 months ago by

rohitsaxena

upvoted a paper 10 months ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Paper • 2508.06600 • Published Aug 8, 2025 • 42

upvoted a paper 11 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19, 2025 • 28

commented a paper 12 months ago

Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency

Paper • 2504.18589 • Published Apr 24, 2025 • 13 •

published 2 datasets 12 months ago

rohitsaxena/booksum2

Viewer • Updated Jul 27, 2024 • 405 • 29

rohitsaxena/qmsum

Viewer • Updated Jul 5, 2024 • 1.81k • 24

updated a collection about 1 year ago

reading

Collection

3 items • Updated Jun 7, 2025

Rohit Saxena

AI & ML interests

Recent Activity

Organizations

rohitsaxena's activity

Trackio

Trackio

Records for new models