-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper • 2401.09416 • Published • 11 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper • 2401.10171 • Published • 14 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper • 2311.09217 • Published • 22 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper • 2401.12979 • Published • 9
Collections
Collections including paper arxiv:2508.01242
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 249 • 96 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
FlashWorld: High-quality 3D Scene Generation within Seconds
Paper • 2510.13678 • Published • 69 -
NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks
Paper • 2510.15019 • Published • 55 -
GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction
Paper • 2509.18090 • Published • 2 -
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
Paper • 2509.19296 • Published • 22
-
Tesslate/UIGEN-X-8B
Text Generation • 8B • Updated • 33 • 58 -
Intelligent-Internet/II-Search-4B
Text Generation • 4B • Updated • 73 • 100 -
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh
Paper • 2508.01242 • Published • 10 -
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens
Paper • 2508.05305 • Published • 46
-
CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes
Paper • 2411.00771 • Published • 9 -
SynCity: Training-Free Generation of 3D Worlds
Paper • 2503.16420 • Published • 27 -
CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities
Paper • 2501.08983 • Published • 20 -
Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion
Paper • 2407.13759 • Published • 18
-
356
Qwen2.5 Omni 7B Demo
🏆Generate text and speech from text, audio, images, and videos
-
2.65k
F5-TTS
🗣F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
309
Kokoro TTS Zero
🎴✨[With v1.0.0] Accelerated TTS on Kokoro-82M
-
fixie-ai/ultravox-v0_5-llama-3_2-1b
Audio-Text-to-Text • 0.7B • Updated • 275k • 60