Building and better understanding vision-language models: insights and future directions Paper โข 2408.12637 โข Published Aug 22, 2024 โข 133
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation Paper โข 2507.01957 โข Published Jul 2 โข 21