1 44

Chi-Pin Huang

jasper0314-huang

AI & ML interests

None yet

Recent Activity

upvoted a paper about 17 hours ago

RobotArena infty: Scalable Robot Benchmarking via Real-to-Sim Translation

upvoted a paper 5 days ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

upvoted a paper 6 days ago

Unified Reinforcement and Imitation Learning for Vision-Language Models

View all activity

Organizations

None yet

upvoted a paper about 17 hours ago

RobotArena infty: Scalable Robot Benchmarking via Real-to-Sim Translation

Paper • 2510.23571 • Published 1 day ago • 5

upvoted a paper 5 days ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published 5 days ago • 50

upvoted a paper 6 days ago

Unified Reinforcement and Imitation Learning for Vision-Language Models

Paper • 2510.19307 • Published 7 days ago • 24

upvoted a paper 7 days ago

PICABench: How Far Are We from Physically Realistic Image Editing?

Paper • 2510.17681 • Published 8 days ago • 60

upvoted a paper 8 days ago

LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal

Paper • 2510.15868 • Published 11 days ago • 23

upvoted 3 papers 9 days ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published 11 days ago • 80

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published 11 days ago • 49

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published 12 days ago • 15

upvoted a paper 13 days ago

X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model

Paper • 2510.10274 • Published 17 days ago • 13

upvoted a paper 20 days ago

SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models

Paper • 2510.06917 • Published 20 days ago • 34

upvoted a paper 21 days ago

Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published 28 days ago • 49

upvoted a paper 26 days ago

Rethinking the shape convention of an MLP

Paper • 2510.01796 • Published 27 days ago • 3

upvoted a paper 29 days ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 176

upvoted a paper 30 days ago

WoW: Towards a World omniscient World model Through Embodied Interaction

Paper • 2509.22642 • Published Sep 26 • 11

upvoted a paper about 1 month ago

3D Aware Region Prompted Vision Language Model

Paper • 2509.13317 • Published Sep 16 • 13

upvoted a paper about 2 months ago

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

Paper • 2508.21112 • Published Aug 28 • 75

upvoted 4 papers 2 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28 • 89

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27 • 30

MovieCORE: COgnitive REasoning in Movies

Paper • 2508.19026 • Published Aug 26 • 6

Autoregressive Universal Video Segmentation Model

Paper • 2508.19242 • Published Aug 26 • 28

Chi-Pin Huang

AI & ML interests

Recent Activity

Organizations

jasper0314-huang's activity