Junyao Yang's picture

3 7

Junyao Yang

TberiusJunyao

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

liked a dataset 2 months ago

AI45Research/ATBench

liked a model 3 months ago

AI45Research/AgentDoG-FG-Llama3.1-8B

View all activity

Organizations

None yet

upvoted a paper 8 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 11 days ago • 316

liked a dataset 2 months ago

AI45Research/ATBench

Viewer • Updated 9 days ago • 1.5k • 865 • 34

liked 6 models 3 months ago

AI45Research/AgentDoG-FG-Llama3.1-8B

Text Classification • 8B • Updated Feb 6 • 15 • 9

AI45Research/AgentDoG-Llama3.1-8B

Text Classification • 8B • Updated Feb 6 • 22 • 11

AI45Research/AgentDoG-FG-Qwen2.5-7B

Text Classification • 8B • Updated Feb 6 • 23 • 8

AI45Research/AgentDoG-Qwen2.5-7B

Text Classification • 8B • Updated 9 days ago • 37 • 10

AI45Research/AgentDoG-FG-Qwen3-4B

Text Classification • 4B • Updated 9 days ago • 48 • 9

AI45Research/AgentDoG-Qwen3-4B

Text Classification • 4B • Updated 9 days ago • 206 • 23

upvoted a collection 3 months ago

AgentDoG

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated 1 day ago • 107

upvoted a paper 5 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 44

published 3 models about 1 year ago

TberiusJunyao/Qwen2.5-7B-Instruct-Math-GRPO

Updated Mar 27, 2025

TberiusJunyao/Qwen2.5-1.5B-Open-R1-GRPO

Updated Mar 8, 2025

TberiusJunyao/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Mar 6, 2025