Hugging Face Internal Testing Organization

company

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

hf-transformers-bot updated a dataset about 11 hours ago

hf-internal-testing/transformers_pr_ci

hf-transformers-bot updated a dataset about 11 hours ago

hf-internal-testing/transformers_daily_ci_with_torch_nightly

hf-transformers-bot updated a dataset about 11 hours ago

hf-internal-testing/transformers_ci_push

View all activity

hf-transformers-bot

updated 3 datasets about 11 hours ago

updated a dataset about 15 hours ago

hf-internal-testing/transformers_flash_attn_ci

Updated about 15 hours ago • 488

hf-transformers-bot

updated a dataset about 16 hours ago

hf-internal-testing/transformers_daily_ci

Updated about 13 hours ago • 4.32k • 3

AntonV

published a model 4 days ago

hf-internal-testing/HYV3-tiny-random

Text Generation • 0.2B • Updated 17 days ago • 246

ArthurZ

updated a dataset 8 days ago

hf-internal-testing/tokenizers-bench

Viewer • Updated 7 days ago • 25 • 527

nielsr

submitted a paper to Daily Papers 12 days ago

Scaling Test-Time Compute for Agentic Coding

Paper • 2604.16529 • Published 19 days ago • 11

nielsr

submitted a paper to Daily Papers 18 days ago

Geometric Context Transformer for Streaming 3D Reconstruction

Paper • 2604.14141 • Published 20 days ago • 19

tomaarsen

posted an update 25 days ago

Post

811

🌐 I've just published Sentence Transformers v5.4 to make the project fully multimodal for embeddings and reranking. The release also includes a modular CrossEncoder, and automatic Flash Attention 2 input flattening. Details:

You can now use SentenceTransformer and CrossEncoder with text, images, audio, and video, with the same familiar API. That means you can compute embeddings for an image and a text query using model.encode(), compare them with model.similarity(), and it just works. Models like Qwen3-VL-Embedding-2B and jinaai/jina-reranker-m0 are supported out of the box.

Beyond multimodal, I also fully modularized the CrossEncoder class. It's now a torch.nn.Sequential of composable modules, just like SentenceTransformer has been. This unlocked support for generative rerankers (CausalLM-based models like mxbai-rerank-v2 and the Qwen3 rerankers) via a new LogitScore module, which wasn't possible before without custom code.

Also, Flash Attention 2 now automatically skips padding for text-only inputs. If your batch has a mix of short and long texts, this gives you a nice speedup and lower VRAM usage for free.

I wrote a blog post walking through the multimodal features with practical examples. Check it out if you want to get started, or just point your Agent to the URL: https://huggingface.co/blog/multimodal-sentence-transformers

This release has set up the groundwork for more easily introducing late-interaction models (both text-only and multimodal) into Sentence Transformers in the next major release. I'm looking forward to it!

nielsr

submitted a paper to Daily Papers 25 days ago

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Paper • 2604.04913 • Published 29 days ago • 11

nielsr

submitted a paper to Daily Papers about 1 month ago

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Paper • 2603.28130 • Published Mar 30 • 11

mishig

posted an update about 1 month ago

Post

690

I like these models nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16 and nvidia/NVIDIA-Nemotron-3-Nano-4B-FP8 and TradingAgents: Multi-Agents LLM Financial Trading Framework (2412.20138) and https://arxiv.org/abs/2412.20138

mlabonne/FineTome-100k

nielsr

submitted a paper to Daily Papers about 1 month ago

Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders

Paper • 2603.19209 • Published Mar 19 • 5

nielsr

submitted 2 papers to Daily Papers about 2 months ago

V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning

Paper • 2603.14482 • Published Mar 15 • 32

Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published Mar 17 • 21

nielsr

authored a paper about 2 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

sayakpaul

authored 2 papers 2 months ago

Fine-Grained Perturbation Guidance via Attention Head Selection

Paper • 2506.10978 • Published Jun 12, 2025 • 25

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

Paper • 2602.21778 • Published Feb 25 • 14

albertvillanova

posted an update 2 months ago

Post

2545

🚀 TRL v0.29.0 introduces trl-training: an agent-native training skill.

This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)

We’re excited to see what the community builds on top of this.

If you’re working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! 🤗

The future of ML tooling is agent-native.
🔗 https://github.com/huggingface/trl/releases/tag/v0.29.0

AI & ML interests

Recent Activity

Team members 39

hf-internal-testing's activity