Yichen Zach Wang

ZachW

https://yichenzw.com/

AI & ML interests

NLP in general.

Recent Activity

upvoted an article 2 days ago

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

liked a dataset 12 days ago

Maxwell-Jia/AIME_2024

updated a dataset 3 months ago

ZachW/base-align-collab

View all activity

Organizations

None yet

upvoted an article 2 days ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Jan 2

•

liked a dataset 12 days ago

Maxwell-Jia/AIME_2024

Viewer • Updated Feb 18, 2025 • 30 • 23.7k • 80

updated a dataset 3 months ago

ZachW/base-align-collab

Updated Nov 13, 2025 • 3

published a dataset 3 months ago

ZachW/base-align-collab

Updated Nov 13, 2025 • 3

upvoted a paper 3 months ago

Optimizing Diversity and Quality through Base-Aligned Model Collaboration

Paper • 2511.05650 • Published Nov 7, 2025 • 6

liked a model 5 months ago

Qwen/Qwen3-30B-A3B-Instruct-2507

Text Generation • 31B • Updated Sep 17, 2025 • 1.73M • • 759

liked a model 6 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 3.29M • • 4.46k

liked a model 8 months ago

allenai/llama-3-tulu-v2.5-8b-uf-mean-8b-uf-rm

8B • Updated Oct 14, 2024 • 30 • 3

liked 2 datasets 12 months ago

rubend18/ChatGPT-Jailbreak-Prompts

Viewer • Updated Aug 24, 2023 • 79 • 871 • 249

Anthropic/EconomicIndex

Updated 26 days ago • 11.3k • 448

liked a dataset about 1 year ago

maveriq/bigbenchhard

Viewer • Updated Sep 29, 2023 • 6.51k • 1.21k • 38

liked a model about 1 year ago

allenai/OLMo-7B-hf

Text Generation • 7B • Updated Jul 16, 2024 • 5.42k • 15

liked a Space about 1 year ago

infini-gram

📖

120

Search and analyze n-grams in large datasets

liked 6 models about 1 year ago

liked a Space about 1 year ago

The Tokenizer Playground

📝

629

Experiment with and compare different tokenizers

Yichen Zach Wang

AI & ML interests

Recent Activity

Organizations

ZachW's activity

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

infini-gram

The Tokenizer Playground