1 19 18

Jiwoong Sohn

jw-sohn

AI & ML interests

PhD @ ETH

Recent Activity

upvoted a paper 8 days ago

Meta-RL Induces Exploration in Language Agents

liked a dataset about 1 month ago

openai/gsm8k

upvoted a paper about 2 months ago

Reinforcement Learning Improves Traversal of Hierarchical Knowledge in LLMs

View all activity

Organizations

upvoted a paper 8 days ago

Meta-RL Induces Exploration in Language Agents

Paper • 2512.16848 • Published 12 days ago • 10

liked a dataset about 1 month ago

openai/gsm8k

Benchmark • Updated 10 days ago • 17.6k • 425k • 1.08k

upvoted a paper about 2 months ago

Reinforcement Learning Improves Traversal of Hierarchical Knowledge in LLMs

Paper • 2511.05933 • Published Nov 8 • 8

liked a model 3 months ago

openai/clip-vit-base-patch32

Zero-Shot Image Classification • Updated Feb 29, 2024 • 16M • 828

upvoted a paper 4 months ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 97

liked a model 4 months ago

sentence-transformers/all-mpnet-base-v2

updated a model 4 months ago

jw-sohn/Llama-3.1-8B-Instruct-nf4

Text Generation • 8B • Updated Aug 17 • 13

liked 2 models 5 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 3.76M • • 4.3k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 6.79M • • 4.14k

liked a Space 5 months ago

The Ultra-Scale Playbook

🌌

3.61k

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 5 months ago

super-dainiu/medagents-benchmark

Viewer • Updated Apr 3 • 11.3k • 933 • 12

published a model 5 months ago

jw-sohn/Llama-3.1-8B-Instruct-nf4

Text Generation • 8B • Updated Aug 17 • 13

upvoted a paper 6 months ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 75

upvoted an article 6 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

741

liked a model 6 months ago

QuantFactory/llama-3.1-medprm-reward-v1.0-GGUF

Text Generation • 8B • Updated Jun 23 • 47 • 3

liked a dataset 6 months ago

mrble/MARBLE

Viewer • Updated Sep 23 • 3.22k • 607 • 2

upvoted 3 papers 6 months ago