大翔井上's picture

大翔井上

averypa

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

upvoted a paper 20 days ago

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

liked a model 25 days ago

stabilityai/stable-diffusion-3.5-large

View all activity

Organizations

None yet

upvoted a paper 9 days ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Paper • 2606.19704 • Published 11 days ago • 41

upvoted a paper 20 days ago

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published 26 days ago • 126

upvoted 2 papers 27 days ago

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Paper • 2605.31433 • Published May 29 • 28

RoboStressBench: Benchmarking VLM Robustness to Physical Visual Stress in Embodied Scenes

Paper • 2606.00828 • Published about 1 month ago • 10

upvoted 2 papers about 1 month ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published May 27 • 431

Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?

Paper • 2605.22109 • Published May 21 • 171

upvoted a paper 2 months ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 123

upvoted 2 papers 3 months ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 249