sigma

sigma7863

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a Space 4 days ago

daVinci-MagiHuman

🎬

Generate a short video from an image and text prompt

upvoted a paper 4 days ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 6 days ago • 115

liked a dataset 4 days ago

NimrodShabtay1986/AwaRes

Viewer • Updated 3 days ago • 49.3k • 59 • 6

upvoted 3 papers 4 days ago

Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs

Paper • 2603.16932 • Published 15 days ago • 84

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 4 days ago • 57

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Paper • 2603.23497 • Published 4 days ago • 84

liked a model 4 days ago

opendatalab/MinerU-Diffusion-V1-0320-2.5B

Image-to-Text • 3B • Updated 4 days ago • 282 • 15

upvoted a paper 4 days ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published 5 days ago • 127

liked a model 5 days ago

nvidia/Kimodo-SOMA-RP-v1

Updated 12 days ago • 1.09k • 36

upvoted a collection 5 days ago

Kimodo-v1

Collection • Models for human(oid) motion generation • 6 items • Updated 4 days ago • 18

liked a dataset 5 days ago

OpenSpeechHub/Genshin-Voice-Ja

Viewer • Updated Apr 20, 2025 • 110k • 583 • 28

liked a Space 6 days ago

Kimodo

🏃

124

Generate high-quality motions from text prompts

liked a dataset 11 days ago

TIGER-Lab/StructEval

Viewer • Updated Sep 23, 2025 • 2.04k • 404 • 6

liked a model 14 days ago

kotoba-tech/kotoba-whisper-v2.2

Automatic Speech Recognition • 0.8B • Updated Oct 23, 2024 • 14.9k • 97

liked a model 15 days ago

unsloth/Nemotron-3-Nano-30B-A3B-GGUF

Text Generation • 32B • Updated Dec 31, 2025 • 195k • 286

upvoted a paper 16 days ago

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Paper • 2603.12255 • Published 16 days ago • 90

upvoted 2 papers 18 days ago

NLE: Non-autoregressive LLM-based ASR by Transcript Editing

Paper • 2603.08397 • Published 20 days ago • 21

AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery

Paper • 2603.07300 • Published 21 days ago • 17

upvoted an article 18 days ago

Introducing Storage Buckets on the Hugging Face Hub

Article • 19 days ago • 186

liked a model 18 days ago

fishaudio/s2-pro

Text-to-Speech • 5B • Updated 18 days ago • 17.6k • 770
