DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer Paper • 2601.01425 • Published 3 days ago • 37
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published about 22 hours ago • 28
Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings Paper • 2509.10534 • Published Sep 5, 2025 • 3
Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published 21 days ago • 14
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation • 15B • Updated Aug 27, 2025 • 4.61k • 104
Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection Paper • 2512.16905 • Published 20 days ago • 31