BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper β’ 2510.08697 β’ Published 19 days ago β’ 32
UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections Paper β’ 2509.24817 β’ Published 29 days ago β’ 8
See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation Paper β’ 2509.22653 β’ Published Sep 26 β’ 23
view post Post 534 Qwen 3 Coder is a personal attack to k2, and I love it.It achieves near SOTA on LCB while not having reasoning.Finally people are understanding that reasoning isnt necessary for high benches...Qwen ftw!DECENTRALIZE DECENTRALIZE DECENTRALIZE See translation π 6 6 π₯ 4 4 + Reply
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding Paper β’ 2507.15028 β’ Published Jul 20 β’ 21
SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting Paper β’ 2506.03594 β’ Published Jun 4
view post Post 3049 deepseek-ai/DeepSeek-R1-0528This is the end See translation 1 reply Β· π€ 7 7 β€οΈ 1 1 + Reply
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting Paper β’ 2412.09606 β’ Published Dec 12, 2024 β’ 2
ChatGarment: Garment Estimation, Generation and Editing via Large Language Models Paper β’ 2412.17811 β’ Published Dec 23, 2024
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness Paper β’ 2503.10624 β’ Published Mar 13 β’ 10
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training Paper β’ 2503.24391 β’ Published Mar 31 β’ 6
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model Paper β’ 2504.05594 β’ Published Apr 8 β’ 11
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Paper β’ 2503.20785 β’ Published Mar 26 β’ 22
Edit Transfer: Learning Image Editing via Vision In-Context Relations Paper β’ 2503.13327 β’ Published Mar 17 β’ 29