Hugging Face

Team

company

Verified

https://huggingface.co

huggingface

Activity Feed

AI & ML interests

The AI community building the future.

Recent Activity

sayakpaul updated a dataset about 23 hours ago

huggingface/diffusers-metadata

tarekziade updated a bucket about 23 hours ago

huggingface/transformers-ci-telemetry

alvarobartt updated a dataset about 23 hours ago

huggingface/DEH-image-scan-data

View all activity

Papers

Seeing the Needle in the Haystack: Towards Weakly-Supervised Log Instance Anomaly Localization via Counterfactual Perturbation

Qualixar OS: A Universal Operating System for AI Agent Orchestration

View all Papers

Articles

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

Jan 27

• 45

One Year Since the “DeepSeek Moment”

Jan 20

• 62

On the Shifting Global Compute Landscape

Oct 29, 2025

• 62

Announcing Hugging Face Fundamentals: A New Learning Track on DataCamp

Oct 16, 2025

• 24

Yay! Organizations can now publish blog Articles

Jan 20, 2025

• 53

View all articles

sayakpaul

updated a dataset about 23 hours ago

huggingface/diffusers-metadata

Viewer • Updated about 13 hours ago • 98 • 1.87k • 30

tarekziade

updated a bucket about 23 hours ago

huggingface/transformers-ci-telemetry

51.4 GB

alvarobartt

updated a dataset about 23 hours ago

huggingface/DEH-image-scan-data

Updated less than a minute ago • 35.7k • 15

dacorvo

updated a bucket 1 day ago

huggingface/funes

8.95 GB

CarolinePascal

in huggingface/documentation-images 1 day ago

Upload grabette_label_big.png

#645 opened 1 day ago by

SteveNguyen

Upload Browser_postprocess_small2.gif

#644 opened 2 days ago by

SteveNguyen

Upload record_coffee_convert_small.gif

#643 opened 2 days ago by

SteveNguyen

ehcalabres

updated a dataset 1 day ago

huggingface/documentation-images

Viewer • Updated 1 day ago • 59 • 2.34M • 167

sayakpaul

authored 3 papers 9 days ago

Posterior Augmented Flow Matching

Paper • 2605.00825 • Published May 1

4KLSDB: A Large-Scale Dataset for 4K Image Restoration and Generation

Paper • 2605.24762 • Published May 23 • 1

Flash-BoN: Instant Drafts for Inference-Time Scaling in Diffusion Models

Paper • 2607.04461 • Published 14 days ago • 9

sayakpaul

submitted a paper to Daily Papers 9 days ago

Flash-BoN: Instant Drafts for Inference-Time Scaling in Diffusion Models

Paper • 2607.04461 • Published 14 days ago • 9

sergiopaniego

posted an update 10 days ago

Post

7617

Frontier models use distillation as a step of their post-training pipelines.

In 2026 it has three jobs: compress a big model into a small one, merge RL experts into a single model, and let a model teach itself.

I wrote up which frontier models use each one and how: https://huggingface.co/blog/sergiopaniego/distillation-2026

It pairs with Class 2 of the Training an Agent series Ben and I are doing, where we teach these techniques hands-on with TRL!

3 replies

AdinaY

submitted a paper to Daily Papers 17 days ago

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

Paper • 2606.30406 • Published 20 days ago • 15

evijit

authored a paper 18 days ago

Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results

Paper • 2606.14516 • Published Jun 12 • 6

irenesolaiman

authored a paper 19 days ago

Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results

Paper • 2606.14516 • Published Jun 12 • 6

sergiopaniego

posted an update 23 days ago

Post

330

TRL v1.7.0 is out‼️

+ continuous batching makes GRPO and RLOO 1.25x faster at -16 GB
+ proper MoE post-training across GRPO/RLOO/AsyncGRPO
+ new GMPO trainer
+ AsyncGRPO weight sync + padding-free
+ more

https://github.com/huggingface/trl/releases/tag/v1.7.0

wrote a small article about the continuous batching for GRPO feature

https://huggingface.co/blog/sergiopaniego/cb-trl-grpo

sergiopaniego

posted an update 29 days ago

Post

324

Continuous batching just landed in TRL for GRPO!

At 64 generations it runs faster and uses less VRAM than plain generate, no vLLM needed

How it works and when to reach for it, below

https://huggingface.co/blog/sergiopaniego/cb-trl-grpo

sergiopaniego

posted an update about 1 month ago

Post

307

GLM-5.2 is open and comes with competitive performance against opus 4.8

day-0 in transformers + vllm + sglang, mit license 🤗

on the post-training side: critic-based ppo for variable-length agentic rollouts (ppo is back!) + an online anti-reward-hacking module that feeds the agent dummy info when it tries to cheat

AdinaY

submitted a paper to Daily Papers about 1 month ago

Ling and Ring 2.6 Technical Report: Efficient and Instant Agentic Intelligence at Trillion-Parameter Scale

Paper • 2606.15079 • Published Jun 13 • 87

AI & ML interests

Recent Activity

Papers

Articles

Building Moon Bot: A Slack-Native Coding Agent Backed by HuggingFace Buckets

Introducing Serge: GitHub-Native AI Code Review

Agentic RL: Token-In, Token-Out Done Right

Software Forgets: Agent Traces Are the Memory

mlinter: a linter for Transformers modeling files

From doctest to runnable Markdown

State of Open Source on Hugging Face: Spring 2026

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

One Year Since the “DeepSeek Moment”

On the Shifting Global Compute Landscape

Announcing Hugging Face Fundamentals: A New Learning Track on DataCamp

Yay! Organizations can now publish blog Articles

Team members 185

huggingface's activity

huggingface/transformers-ci-telemetry

huggingface/funes

Upload grabette_label_big.png

Upload Browser_postprocess_small2.gif

Upload record_coffee_convert_small.gif