Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 8 days ago • 100
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published 10 days ago • 114
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2 • 218
Article ChatML vs Harmony: Understanding the new Format from OpenAI 🔍 By kuotient • Aug 9 • 40
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published Jul 24 • 85
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published Aug 2 • 236
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Paper • 2310.16825 • Published Oct 25, 2023 • 36
R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training Paper • 2505.00358 • Published May 1 • 26
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models Paper • 2504.11468 • Published Apr 10 • 30