charles's picture

5

charles

Aira666

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach

upvoted a paper about 2 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

upvoted a paper about 2 months ago

Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach

Paper • 2512.02834 • Published 7 days ago • 38

upvoted 2 papers about 2 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published Oct 15 • 57

Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

Paper • 2510.11345 • Published Oct 13 • 15

upvoted a paper 6 months ago

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

Paper • 2506.06122 • Published Jun 6 • 7

upvoted a paper about 1 year ago

Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS

Paper • 2411.18478 • Published Nov 27, 2024 • 37