Sejung Son's picture

13 41

Sejung Son

SAISON17

·

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

Why Did MiniMax M2 End Up as a Full Attention Model?

upvoted an article 3 months ago

Vision Language Model Alignment in TRL ⚡️

upvoted an article 3 months ago

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

View all activity

Organizations

upvoted an article about 2 months ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30, 2025

•

72

upvoted 3 articles 3 months ago

Article

Vision Language Model Alignment in TRL ⚡️

+3

Aug 7, 2025

•

105

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

Sep 11, 2025

•

176

Article

Gaia2 and ARE: Empowering the community to study agents

+9

Sep 22, 2025

•

125

upvoted 5 articles 5 months ago

Article

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

Aug 8, 2025

•

29

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

207

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Aug 9, 2025

•

48

Article

Decoding Strategies in Large Language Models

Oct 29, 2024

•

102

Article

Mixture of Experts Explained

+4

Dec 11, 2023

•

1.02k

upvoted an article 6 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

•

74

upvoted an article 8 months ago

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

+2

Sep 13, 2023

•

32

upvoted a collection 9 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29, 2025 • 677