seojinlee's picture

73 35

seojinlee

sjlee311

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

KORMo: Korean Open Reasoning Model for Everyone

upvoted a paper 8 days ago

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

upvoted a paper 8 days ago

DeepSeek-OCR: Contexts Optical Compression

View all activity

Organizations

None yet

upvoted 3 papers 8 days ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published 25 days ago • 75

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published 18 days ago • 144

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published 15 days ago • 70

upvoted a paper 22 days ago

Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published 29 days ago • 22

upvoted a paper 25 days ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 29 days ago • 463

upvoted 2 papers about 1 month ago

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24 • 39

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22 • 133

upvoted 3 papers about 2 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 186

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7 • 147

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 189

upvoted 5 papers 2 months ago

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1 • 56

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 220

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

Paper • 2508.20931 • Published Aug 28 • 15

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 154

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 87

upvoted a paper 3 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 127

upvoted a collection 3 months ago

Qwen3

84 items • Updated Aug 6 • 1.39k

upvoted 3 papers 3 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 188

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11 • 109

ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability

Paper • 2508.07050 • Published Aug 9 • 116