The Markovian Thinker Collection Reformulating the RL of reasoning LLMs through Markovian Thinking paradigm. • 7 items • Updated 22 days ago • 10
Running 176 176 Qwen3 Omni Demo ⚡ Interact with a multimodal chatbot using text, audio, images, or video
Lost in Embeddings: Information Loss in Vision-Language Models Paper • 2509.11986 • Published Sep 15 • 27
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling Paper • 2509.12201 • Published Sep 15 • 103
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face Jul 29 • 191
Watch, Listen, Understand, Mislead: Tri-modal Adversarial Attacks on Short Videos for Content Appropriateness Evaluation Paper • 2507.11968 • Published Jul 16
MIMIC: Multimodal Islamophobic Meme Identification and Classification Paper • 2412.00681 • Published Dec 1, 2024