3 42 8

Fangzhi Xu

xufangzhi

http://xufangzhi.github.io

AI & ML interests

Natural Language Processing, Large Language Models, Neural Symbolic

Recent Activity

liked a Space 7 days ago

xufangzhi/TurnOnLights

updated a Space 9 days ago

xufangzhi/TurnOnLights

upvoted a paper 11 days ago

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

View all activity

Organizations

upvoted 2 papers 11 days ago

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published 12 days ago • 42

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Paper • 2510.13809 • Published 12 days ago • 36

upvoted a collection 13 days ago

LightReasoner Models

Collection

https://arxiv.org/abs/2510.07962 • 3 items • Updated 8 days ago • 4

upvoted 2 papers 14 days ago

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published 18 days ago • 25

AutoPR: Let's Automate Your Academic Promotion!

Paper • 2510.09558 • Published 17 days ago • 49

upvoted a paper 27 days ago

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published 28 days ago • 18

upvoted a paper about 1 month ago

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18 • 109

upvoted a collection about 2 months ago

DeepMedix-R1

Collection

Chest X-ray foundation model with step reasoning. • 2 items • Updated Jul 14 • 4

upvoted a paper about 2 months ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27 • 36

upvoted 3 papers 2 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20 • 82

CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback

Paper • 2507.22080 • Published Jul 25 • 9

upvoted a paper 3 months ago

Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters

Paper • 2507.13618 • Published Jul 18 • 16

upvoted a collection 3 months ago

Decoding Algorithm for LLM Reasoning

Collection

Collections of Decoding Algorithm for LLM Reasoning • 2 items • Updated Jul 25 • 1

upvoted a paper 3 months ago

MUR: Momentum Uncertainty guided Reasoning for Large Language Models

Paper • 2507.14958 • Published Jul 20 • 46

upvoted a paper 4 months ago

From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios

Paper • 2506.20279 • Published Jun 25 • 19

upvoted 4 papers 5 months ago

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2 • 48

A Controllable Examination for Long-Context Language Models

Paper • 2506.02921 • Published Jun 3 • 33

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3 • 52

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 104

Fangzhi Xu

AI & ML interests

Recent Activity

Organizations

xufangzhi's activity