Masking Teacher and Reinforcing Student for Distilling Vision-Language Models
Paper
• 2512.22238
• Published
• 30
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation
Paper
• 2512.17012
• Published
• 47
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields
Paper
• 2601.03252
• Published
• 102
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits
Paper
• 2512.20578
• Published
• 85
RyeCatcher/speculative-decoding-cross-domain-analysis
Updated
SceneDiff: A Benchmark and Method for Multiview Object Change Detection
Paper
• 2512.16908
• Published
• 1
PaperBanana: Automating Academic Illustration for AI Scientists
Paper
• 2601.23265
• Published
• 218
Qwen3-TTS Technical Report
Paper
• 2601.15621
• Published
• 71
SAM 3D: 3Dfy Anything in Images
Paper
• 2511.16624
• Published
• 113
Can Large Language Models Understand Context?
Paper
• 2402.00858
• Published
• 24
More Agents Is All You Need
Paper
• 2402.05120
• Published
• 57
OLMo: Accelerating the Science of Language Models
Paper
• 2402.00838
• Published
• 85
Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Paper
• 2403.02677
• Published
• 18