4 38 93

Richard Lian PRO

richardlian

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

microsoft/harrier-oss-v1-0.6b

liked a Space 4 months ago

evaluate-metric/google_bleu

liked a Space 4 months ago

evaluate-metric/bleu

View all activity

Organizations

upvoted an article 5 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

309

upvoted 2 articles 6 months ago

Article

Sentence Transformers is joining Hugging Face!

Oct 22, 2025

•

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

Oct 1, 2025

•

140

upvoted a collection 7 months ago

The Big Benchmarks Collection

Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 264

upvoted a paper 9 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19, 2025 • 28

upvoted an article 11 months ago

Article

KV Cache from scratch in nanoVLM

Jun 4, 2025

•

115

upvoted 2 papers 11 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15, 2025 • 83

upvoted 2 articles 11 months ago

Article

The Transformers Library: standardizing model definitions

May 15, 2025

•

121

Article

Vision Language Models (Better, faster, stronger)

May 12, 2025

•

606

upvoted a collection 12 months ago

Unsloth Dynamic 2.0 Quants

Collection

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 86 items • Updated about 3 hours ago • 526

upvoted an article 12 months ago

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

Apr 16, 2025

•

upvoted 5 articles about 1 year ago

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Mar 17, 2025

•

355

Article

Rearchitecting Hugging Face Uploads and Downloads

Nov 26, 2024

•

Article

From Files to Chunks: Improving HF Storage Efficiency

Nov 20, 2024

•

Article

Xet is on the Hub

Mar 18, 2025

•

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

286

upvoted 2 papers about 1 year ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17, 2025 • 115

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16, 2025 • 41

upvoted an article over 1 year ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15, 2025

•

228

Richard Lian PRO

AI & ML interests

Recent Activity

Organizations

richardlian's activity

Transformers v5: Simple model definitions powering the AI ecosystem

Sentence Transformers is joining Hugging Face!

Introducing RTEB: A New Standard for Retrieval Evaluation

KV Cache from scratch in nanoVLM

The Transformers Library: standardizing model definitions

Vision Language Models (Better, faster, stronger)

Introducing HELMET: Holistically Evaluating Long-context Language Models

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Rearchitecting Hugging Face Uploads and Downloads

From Files to Chunks: Improving HF Storage Efficiency

Xet is on the Hub

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Train 400x faster Static Embedding Models with Sentence Transformers