Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Heng Wang's picture
1 1 1

Heng Wang

HengWang
https://scholar.google.com.au/citations?user=jPj4ViQAAAAJ&hl=en
  • hengwang-hw

AI & ML interests

Computer Vision, Multimodal AI, Generative AI

Organizations

None yet

authored 4 papers 2 months ago

V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models

Paper • 2308.09300 • Published Aug 18, 2023 • 1

BannerAgency: Advertising Banner Design with Multimodal LLM Agents

Paper • 2503.11060 • Published Mar 14 • 3

DesignLab: Designing Slides Through Iterative Detection and Correction

Paper • 2507.17202 • Published Jul 23 • 50

Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds

Paper • 2204.10688 • Published Apr 22, 2022
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs