3 50 17

Zhiyuan Ma PRO

ZhiyuanthePony

https://theericma.github.io/

AI & ML interests

3D Generation

Recent Activity

upvoted a paper about 8 hours ago

FullPart: Generating each 3D Part at Full Resolution

upvoted a paper about 10 hours ago

Emu3.5: Native Multimodal Models are World Learners

upvoted a paper 15 days ago

FlashWorld: High-quality 3D Scene Generation within Seconds

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

FullPart: Generating each 3D Part at Full Resolution

Paper • 2510.26140 • Published 1 day ago • 2

upvoted a paper about 10 hours ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published about 22 hours ago • 51

upvoted a paper 15 days ago

FlashWorld: High-quality 3D Scene Generation within Seconds

Paper • 2510.13678 • Published 16 days ago • 69

upvoted a paper 16 days ago

FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Paper • 2510.12747 • Published 17 days ago • 36

upvoted a paper 17 days ago

InfiniHuman: Infinite 3D Human Creation with Precise Control

Paper • 2510.11650 • Published 18 days ago • 5

upvoted 2 papers 18 days ago

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published 22 days ago • 120

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Paper • 2510.06499 • Published 24 days ago • 31

upvoted a paper 21 days ago

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Paper • 2510.08555 • Published 22 days ago • 62

upvoted a paper 24 days ago

Triangle Splatting+: Differentiable Rendering with Opaque Triangles

Paper • 2509.25122 • Published Sep 29 • 8

upvoted a paper about 1 month ago

Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Paper • 2509.25161 • Published Sep 29 • 23

liked a Space about 1 month ago

Mapanything Gradio

🐠

Convert images to 3D models and visualize depth and normals

upvoted a paper about 2 months ago

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Paper • 2509.09595 • Published Sep 11 • 48

upvoted 8 papers 3 months ago

MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh

Paper • 2508.01242 • Published Aug 2 • 10

BANG: Dividing 3D Assets via Generative Exploded Dynamics

Paper • 2507.21493 • Published Jul 29 • 64

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Paper • 2507.20939 • Published Jul 28 • 56

EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Paper • 2507.16535 • Published Jul 22 • 20

Captain Cinema: Towards Short Movie Generation

Paper • 2507.18634 • Published Jul 24 • 40

Zhiyuan Ma PRO

AI & ML interests

Recent Activity

Organizations

ZhiyuanthePony's activity

Mapanything Gradio