DeepSeek
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
DeepSeek-OCR: Contexts Optical Compression
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 462k • • 12.8k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 5.75k • 937 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 57.5k • • 728 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 2.41M • • 1.46k
-
555
Chat with DeepSeek-VL2-small
🌍Generate responses using images and text input
-
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 46.9k • 225 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 16.9k • 166 -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • 27B • Updated • 8.67k • 364
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 11.4k • 667 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 74.6k • 80 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 2.9k • 93 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 278k • • 492
models for paper expert-specialized fine-tuning
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 16.7k • 544 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 60.8k • 451 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 8.76k • 139 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • 1B • Updated • 530k • 142
DeepSeek MoE series
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
-
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation • 236B • Updated • 224 • 177 -
deepseek-ai/DeepSeek-V2-Chat
Text Generation • 236B • Updated • 10.3k • 461 -
deepseek-ai/DeepSeek-V2
Text Generation • 236B • Updated • 12.2k • 327 -
deepseek-ai/DeepSeek-V2-Lite
Text Generation • 16B • Updated • 63.1k • 153
DeepSeek Math series
-
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 67.8k • 140 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 2.2k • 85 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.05k • 78 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 129
DeepSeek-VL model series
DeepSeek LLM series
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 462k • • 12.8k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 5.75k • 937 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 57.5k • • 728 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 2.41M • • 1.46k
-
555
Chat with DeepSeek-VL2-small
🌍Generate responses using images and text input
-
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • 3B • Updated • 46.9k • 225 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 16.9k • 166 -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • 27B • Updated • 8.67k • 364
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation • 236B • Updated • 224 • 177 -
deepseek-ai/DeepSeek-V2-Chat
Text Generation • 236B • Updated • 10.3k • 461 -
deepseek-ai/DeepSeek-V2
Text Generation • 236B • Updated • 12.2k • 327 -
deepseek-ai/DeepSeek-V2-Lite
Text Generation • 16B • Updated • 63.1k • 153
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 11.4k • 667 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 74.6k • 80 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 2.9k • 93 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 278k • • 492
DeepSeek Math series
-
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 67.8k • 140 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 2.2k • 85 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.05k • 78 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 129
models for paper expert-specialized fine-tuning
DeepSeek-VL model series
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 16.7k • 544 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 60.8k • 451 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 8.76k • 139 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • 1B • Updated • 530k • 142
DeepSeek LLM series
DeepSeek MoE series