M Saad Salman

MSS444

AI & ML interests

None yet

Organizations

None yet

upvoted 3 papers 1 day ago

Regulating AI Agents

Paper • 2603.23471 • Published 2 days ago • 4

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Paper • 2603.22386 • Published 3 days ago • 47

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 2 days ago • 52

upvoted 2 papers 2 days ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published 2 days ago • 30

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Paper • 2603.22446 • Published 3 days ago • 5

upvoted 15 papers 3 days ago

SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?

Paper • 2603.15401 • Published 10 days ago • 18

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published 9 days ago • 297

AI Scientist via Synthetic Task Scaling

Paper • 2603.17216 • Published 9 days ago • 3

Efficient Exploration at Scale

Paper • 2603.17378 • Published 9 days ago • 12

Complementary Reinforcement Learning

Paper • 2603.17621 • Published 9 days ago • 35

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 9 days ago • 129

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published 14 days ago • 141

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published 7 days ago • 3

Human-AI Synergy in Agentic Code Review

Paper • 2603.15911 • Published 10 days ago • 4

Teaching an Agent to Sketch One Part at a Time

Paper • 2603.19500 • Published 7 days ago • 5

LoopRPT: Reinforcement Pre-Training for Looped Language Models

Paper • 2603.19714 • Published 7 days ago • 12

Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

Paper • 2603.08462 • Published 17 days ago • 21

Hyperagents

Paper • 2603.19461 • Published 7 days ago • 33

The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus

Paper • 2603.20105 • Published 6 days ago • 30

PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost

Paper • 2603.21383 • Published 4 days ago • 15