Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach Paper • 2512.02834 • Published 2 days ago • 28
Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors Paper • 2505.24625 • Published May 30 • 9
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions Paper • 2509.06951 • Published Sep 8 • 31
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control Paper • 2508.21112 • Published Aug 28 • 77
EO-Robotics Collection: EmbodiedOneVision is a unified framework for multimodal embodied reasoning and robot control, featuring interleaved vision-text-action pretraining. • 7 items • Updated 2 days ago • 8
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation Paper • 2508.05635 • Published Aug 7 • 73
Hume: Introducing System-2 Thinking in Visual-Language-Action Model Paper • 2505.21432 • Published May 27 • 4
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation Paper • 2506.18095 • Published Jun 22 • 66
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16 • 273
DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation Paper • 2505.21864 • Published May 28 • 9
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Paper • 2505.22618 • Published May 28 • 44
Article: Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2 • Aug 21, 2024 • 42
Heimdall: test-time scaling on the generative verification Paper • 2504.10337 • Published Apr 14 • 33
UniF²ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models Paper • 2503.08120 • Published Mar 11 • 31