Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Roopal Garg's picture
16 1 6

Roopal Garg

roopalgarg
shuyuej's profile picture
·
https://www.roopalgarg.com/
  • roopalgarg
  • roopalgarg

AI & ML interests

NLP, Cross/Multi-lingual, Multi-modal, Dataset Generations

Organizations

Google's profile picture

authored a paper about 1 year ago

Imagen 3

Paper • 2408.07009 • Published Aug 13, 2024 • 62
authored 6 papers over 1 year ago

Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models

Paper • 2405.16759 • Published May 27, 2024 • 8

ImageInWords: Unlocking Hyper-Detailed Image Descriptions

Paper • 2405.02793 • Published May 5, 2024 • 4

Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

Paper • 2312.03766 • Published Dec 5, 2023 • 1

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 66

Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation

Paper • 2310.18235 • Published Oct 27, 2023

DOCCI: Descriptions of Connected and Contrasting Images

Paper • 2404.19753 • Published Apr 30, 2024 • 13
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs