Haoquan Zhang

haoquan03

https://haoquanzhang.github.io

AI & ML interests

MLLM, LLM

Recent Activity

upvoted a paper 29 days ago

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

upvoted a paper about 1 month ago

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

upvoted a paper about 2 months ago

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

upvoted a paper 29 days ago

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Paper • 2603.23497 • Published 30 days ago • 91

upvoted a paper about 1 month ago

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Paper • 2603.08652 • Published Mar 9 • 40

upvoted 2 papers about 2 months ago

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Paper • 2603.03241 • Published Mar 3 • 87

PyVision-RL: Forging Open Agentic Vision Models via RL

Paper • 2602.20739 • Published Feb 24 • 31

upvoted a collection 2 months ago

OrthoMerge

Collection

14 items • Updated Feb 1 • 3

upvoted a collection 3 months ago

Scale RAE

Collection

Collection for "Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders" • 9 items • Updated Mar 15 • 3

upvoted 2 papers 4 months ago

Yume-1.5: A Text-Controlled Interactive World Generation Model

Paper • 2512.22096 • Published Dec 26, 2025 • 61

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 89

upvoted 4 papers 5 months ago

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

Paper • 2512.01816 • Published Dec 1, 2025 • 94

upvoted 3 papers 6 months ago

TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning

Paper • 2511.01833 • Published Nov 3, 2025 • 16

Model Merging with Functional Dual Anchors

Paper • 2510.21223 • Published Oct 24, 2025 • 13

Agentic Design of Compositional Machines

Paper • 2510.14980 • Published Oct 16, 2025 • 13

upvoted a paper 7 months ago

OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!

Paper • 2509.26495 • Published Sep 30, 2025 • 13

upvoted a paper 8 months ago

Symbolic Graphics Programming with Large Language Models

Paper • 2509.05208 • Published Sep 5, 2025 • 47

upvoted a collection 8 months ago

SGP-Generation

Collection

Symbolic Graphic Programming with Large Language Model • 5 items • Updated Sep 11, 2025 • 3

upvoted 2 papers 9 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 191

Yume: An Interactive World Generation Model

Paper • 2507.17744 • Published Jul 23, 2025 • 92
