GenAI Arena
Realtime Image/Video Gen AI Arena
Natural Language Processing, Image Generation

VisCoder2: Building Multi-Language Visualization Coding Agents

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions
Realtime Image/Video Gen AI Arena
Efficient T2V generation
More advanced and challenging multi-task evaluation
The massive multimodal embedding benchmark
A thinking Video Evaluation Model
Streamlit template space
A more robust benchmark for long video understanding.
The strongest open-source LLM for reasoning
A leaderboard for multimodal models
The demo for pixel reasoner
Strong Vision Language Model trained with VisualWebInstruct
State-of-the-art VLM to solve multimodal reasoning problems
Using RAG LLM to assist your academic writing
Image to Video Synthesis
A model giving fine-grained scores on video quality
Video Editing
Multimodal Language Model
Fastest high-quality video diffusion model.
Leaderboard for long LLM on In-context Learning