haikuoxin

AI & ML interests

None yet

Recent Activity

upvoted a collection about 2 months ago

Flux Kontext LoRAs

liked a model 3 months ago

DFloat11/Qwen-Image-Edit-2509-DF11

liked a model 3 months ago

lightx2v/Qwen-Image-Lightning

Organizations

None yet

upvoted a collection about 2 months ago

Flux Kontext LoRAs

Flux Kontext LoRAs trained by the community • 9 items • Updated Jul 21, 2025 • 5

liked 4 models 3 months ago

DFloat11/Qwen-Image-Edit-2509-DF11

Updated Sep 30, 2025 • 88 • 19

lightx2v/Qwen-Image-Lightning

Text-to-Image • Updated Nov 3, 2025 • 514k • 734

Insta360-Research/DiT360-Panorama-Image-Generation

Text-to-Image • Updated Oct 17, 2025 • 1.19k • 20

mit-han-lab/nunchaku-flux.1-kontext-dev

Image-to-Image • Updated Jul 21, 2025 • 17.2k • 165

liked a model 6 months ago

zhang0jhon/flux_wavelet_v2_sc

Text-to-Image • Updated Jun 3, 2025 • 5 • 5

liked a Space 6 months ago

Image Arena Leaderboard

Image Generation and Image Editing Arena & Leaderboard

liked a model 6 months ago

KevinHuang/DreamCube

Image-to-3D • Updated Jun 24, 2025 • 50 • 11

upvoted a paper 7 months ago

Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data

Paper • 2506.04120 • Published Jun 4, 2025 • 7

upvoted a paper 8 months ago

SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation

Paper • 2504.14396 • Published Apr 19, 2025 • 27

liked 2 models 9 months ago

ysmikey/Layerpano3D-FLUX-Panorama-LoRA

Text-to-Image • Updated Feb 8, 2025 • • 14

tencent/Hunyuan3D-2

Image-to-3D • Updated Oct 17, 2025 • 66.1k • 1.69k

liked 2 Spaces 10 months ago

MIDI 3D

Image to Compositional 3D Scene Generation

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 10 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 1 day ago • 550

upvoted a collection 11 months ago

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 9 days ago • 260

upvoted a paper 12 months ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1, 2025 • 109

commented a paper about 1 year ago

LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting

Paper • 2412.00177 • Published Nov 29, 2024 • 8

upvoted a collection about 1 year ago

Relighting

6 items • Updated Dec 16, 2024 • 1

upvoted a paper about 1 year ago

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published Dec 16, 2024 • 43