Zedong Wang's picture

5 43 1

Zedong Wang

JackyWangAI

·

https://jacky1128.github.io

AI & ML interests

Computer Vision, Multi-task Learning.

Recent Activity

upvoted a paper 1 day ago

MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding

upvoted a paper 11 days ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

upvoted a paper 21 days ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

View all activity

Organizations

upvoted a paper 1 day ago

MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding

Paper • 2510.23479 • Published 4 days ago • 14

upvoted a paper 11 days ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published 14 days ago • 85

upvoted 2 papers 21 days ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published 29 days ago • 91

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 25 days ago • 457

upvoted a paper 24 days ago

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published about 1 month ago • 518

upvoted 2 papers 3 months ago

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Paper • 2507.23478 • Published Jul 31 • 15

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Paper • 2410.10801 • Published Oct 14, 2024 • 3

updated a collection 3 months ago

Model Merging

6 items • Updated Aug 3

upvoted 5 papers 3 months ago

BANG: Dividing 3D Assets via Generative Exploded Dynamics

Paper • 2507.21493 • Published Jul 29 • 64

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 131

AnimalClue: Recognizing Animals by their Traces

Paper • 2507.20240 • Published Jul 27 • 9

Music Arena: Live Evaluation for Text-to-Music

Paper • 2507.20900 • Published Jul 28 • 10

Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models

Paper • 2506.00996 • Published Jun 1 • 38

updated 2 collections 3 months ago

Model Merging

6 items • Updated Aug 3

Multi-Task Learning

18 items • Updated Jul 29

upvoted a paper 3 months ago

DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning

Paper • 2106.03760 • Published Jun 7, 2021 • 4