Building on HF

Prithiv Sakthi PRO

prithivMLmods

https://linktr.ee/prithivsakthi

AI & ML interests

computer vision, nlp, multimodality - HuggingFace Fellow🤗

Recent Activity

liked a model about 3 hours ago

prithivMLmods/DeepCaption-VLA-V2.0-7B-AIO-GGUF

published a model about 3 hours ago

prithivMLmods/DeepCaption-VLA-V2.0-7B-AIO-GGUF

liked a model about 3 hours ago

prithivMLmods/DeepCaption-VLA-7B-AIO-GGUF

upvoted an article about 16 hours ago

cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents

Article • 9 days ago • 9

upvoted a paper about 16 hours ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 1 day ago • 83

upvoted an article 1 day ago

Make and publish your Reachy Mini App

Article • 22 days ago • 23

upvoted 4 papers 1 day ago

MemEvolve: Meta-Evolution of Agent Memory Systems

Paper • 2512.18746 • Published 4 days ago • 22

INTELLECT-3: Technical Report

Paper • 2512.16144 • Published 7 days ago • 10

SAM Audio: Segment Anything in Audio

Paper • 2512.18099 • Published 6 days ago • 10

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published 3 days ago • 54

upvoted a paper 4 days ago

Adaptation of Agentic AI

Paper • 2512.16301 • Published 7 days ago • 92

upvoted 3 papers 5 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 7 days ago • 22

DeContext as Defense: Safe Image Editing in Diffusion Transformers

Paper • 2512.16625 • Published 7 days ago • 24

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 8 days ago • 55

upvoted 2 papers 6 days ago

Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation

Paper • 2512.16913 • Published 7 days ago • 31

Kling-Omni Technical Report

Paper • 2512.16776 • Published 7 days ago • 155

upvoted 2 articles 6 days ago

CUGA on Hugging Face: Democratizing Configurable AI Agents

Article • 10 days ago • 50

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Article • 8 days ago • 74

upvoted a paper 6 days ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published 15 days ago • 75

upvoted a collection 7 days ago

SAGE

Smart Any-Horizon Agent for Long Video Reasoning • 18 items • Updated 2 days ago • 3

upvoted a paper 7 days ago

DEER: Draft with Diffusion, Verify with Autoregressive Models

Paper • 2512.15176 • Published 8 days ago • 41

upvoted 2 papers 8 days ago

Olmo 3

Paper • 2512.13961 • Published 10 days ago • 22

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

Paper • 2512.14051 • Published 9 days ago • 38