18 8 27

An Yang

yangapku

https://scholar.google.com/citations?user=vO9FZekAAAAJ

AI & ML interests

NLP and Deep Learning

Recent Activity

authored a paper 25 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper 25 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

authored a paper about 1 month ago

Qwen-Image Technical Report

View all activity

Organizations

authored a paper 25 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 26 days ago • 93

upvoted a paper 25 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 26 days ago • 93

authored 4 papers about 1 month ago

upvoted a paper about 1 month ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published Nov 25 • 41

liked a dataset 4 months ago

openai/healthbench

Preview • Updated Aug 27 • 511 • 104

liked a model 5 months ago

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6 • 501k • • 504

published 4 models 5 months ago

Qwen/Qwen3-4B-Thinking-2507-FP8

Text Generation • 4B • Updated Aug 6 • 175k • 44

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6 • 501k • • 504

Qwen/Qwen3-4B-Instruct-2507-FP8

Text Generation • 4B • Updated Sep 17 • 50.7k • 56

Qwen/Qwen3-4B-Instruct-2507

Text Generation • 4B • Updated Sep 17 • 4.17M • • 592

updated a collection 5 months ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.52k

authored a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 316

liked a model 5 months ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11 • 99.2k • • 2.87k

liked a Space 5 months ago

Qwen3 Coder WebDev

🌍

950

Generate web application code from descriptions

An Yang

AI & ML interests

Recent Activity

Organizations

yangapku's activity

Qwen3 Coder WebDev