UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing Paper • 2503.12652 • Published Mar 16, 2025
GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing Paper • 2505.11493 • Published May 16, 2025
MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs Paper • 2407.01509 • Published Jul 1, 2024
Understanding Alignment in Multimodal LLMs: A Comprehensive Study Paper • 2407.02477 • Published Jul 2, 2024
How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts Paper • 2402.13220 • Published Feb 20, 2024