Toward Generalist Autonomous Research via Hypothesis-Tree Refinement Paper • 2606.11926 • Published 5 days ago • 109
Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings Paper • 2606.07502 • Published 10 days ago • 91
SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning Paper • 2606.10804 • Published 6 days ago • 41
Running on Zero Agents Featured 51 RF-DETR Realtime Webcam Demo 🎯 51 Segment objects in live webcam and uploaded media
Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases Paper • 2606.05112 • Published 12 days ago • 3
Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration? Paper • 2606.01247 • Published 15 days ago • 30
Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models Paper • 2605.28132 • Published 19 days ago • 25
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 14 days ago • 228
Running on Zero Agents 16 NV-Generate Synthetic Medical Imaging 🧠 16 Synthetic 3D CT and MR generation with NVIDIA NV-Generate.
Running on Zero Agents Featured 243 LTX 2.3 Studio 🎬 243 Generate videos from text, images, audio, or video clips
Running Agents 118 Omni-Video-Factory-API-iframe 🐠 118 Access video creation tools via an embedded interface
Learning A Unified Risk Map for Autonomous Driving in Partially Observable Environments Paper • 2605.22189 • Published 25 days ago • 8
WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction Paper • 2605.29341 • Published 18 days ago • 18
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published 18 days ago • 60