3 19

Salman Rahman PRO

salmannyu

https://salmanrahman.net/

AI & ML interests

Natural Language Processing, Deep Learning, Scalable Oversight, and Language Model Evaluation

Recent Activity

upvoted a paper 11 days ago

WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment

upvoted a paper 18 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

upvoted a paper 18 days ago

SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning

View all activity

Organizations

upvoted a paper 11 days ago

WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment

Paper • 2512.12692 • Published 13 days ago • 13

upvoted 2 papers 18 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 19 days ago • 36

SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning

Paper • 2512.03244 • Published 25 days ago • 16

submitted a paper to Daily Papers 18 days ago

SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning

Paper • 2512.03244 • Published 25 days ago • 16

updated a model about 2 months ago

salmannyu/nemotron-train8-52B-Token

2B • Updated Nov 8 • 1

published a model about 2 months ago

salmannyu/nemotron-train8-52B-Token

2B • Updated Nov 8 • 1

updated a model about 2 months ago

salmannyu/nemotron-train4

2B • Updated Nov 3 • 2

published a model about 2 months ago

salmannyu/nemotron-train4

2B • Updated Nov 3 • 2

updated a model about 2 months ago

salmannyu/train3

2B • Updated Nov 3 • 3

published a model about 2 months ago

salmannyu/train3

2B • Updated Nov 3 • 3

updated a model about 2 months ago

salmannyu/nemotron-train2

2B • Updated Nov 3 • 3

published a model about 2 months ago

salmannyu/nemotron-train2

2B • Updated Nov 3 • 3

upvoted a paper 3 months ago

The African Languages Lab: A Collaborative Approach to Advancing Low-Resource African NLP

Paper • 2510.05644 • Published Oct 7 • 23

upvoted a paper 5 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 259

upvoted 3 papers 6 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team

Paper • 2506.14234 • Published Jun 17 • 41

authored a paper 8 months ago

X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents

Paper • 2504.13203 • Published Apr 15 • 35

upvoted a paper 8 months ago

X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents

Paper • 2504.13203 • Published Apr 15 • 35

commented a paper 8 months ago

X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents

Paper • 2504.13203 • Published Apr 15 • 35 •

Salman Rahman PRO

AI & ML interests

Recent Activity

Organizations

salmannyu's activity