Infinity Lab

non-profit

AI & ML interests

None defined yet.

Recent Activity

wukeming11 authored a paper 25 days ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

wukeming11 authored a paper 25 days ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

kcz358 authored a paper about 1 month ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

View all activity

Papers

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

View all Papers

authored 2 papers 25 days ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published May 19 • 34

authored a paper about 1 month ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published May 19 • 34

submitted a paper to Daily Papers about 1 month ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published May 19 • 34

authored 2 papers about 1 month ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 187

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published May 19 • 34

authored 6 papers about 1 month ago

Self-Rewarding Sequential Monte Carlo for Masked Diffusion Language Models

Paper • 2602.01849 • Published Feb 2 • 5

AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis

Paper • 2602.09372 • Published Feb 10 • 7

From Perception to Action: An Interactive Benchmark for Vision Reasoning

Paper • 2602.21015 • Published Feb 24 • 24

Document Reconstruction Unlocks Scalable Long-Context RLVR

Paper • 2602.08237 • Published Feb 9

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Paper • 2603.28407 • Published Mar 30 • 72

MARS: Enabling Autoregressive Models Multi-Token Generation

Paper • 2604.07023 • Published Apr 8 • 38

authored 2 papers about 1 month ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29

submitted a paper to Daily Papers about 1 month ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29

authored a paper about 2 months ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

authored a paper about 2 months ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

submitted a paper to Daily Papers about 2 months ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

authored a paper about 2 months ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

authored a paper about 2 months ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92