Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
62.8
TFLOPS
90
3
37
hai
cloudyu
Follow
samsmith47's profile picture
demonsu's profile picture
oyonay12's profile picture
165 followers
·
44 following
yu-hai-52a1702a
AI & ML interests
Looking for a full time job.
Recent Activity
updated
a model
19 days ago
cloudyu/quant_signal
published
a model
19 days ago
cloudyu/quant_signal
new
activity
30 days ago
deepseek-ai/DeepSeek-V3.2-Exp:
咱这个模型是非得国庆前更新吗??
View all activity
Organizations
cloudyu
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
19 days ago
cloudyu/quant_signal
Updated
19 days ago
published
a model
19 days ago
cloudyu/quant_signal
Updated
19 days ago
New activity in
deepseek-ai/DeepSeek-V3.2-Exp
30 days ago
咱这个模型是非得国庆前更新吗??
😔
👍
112
31
#1 opened 30 days ago by
luckjone
New activity in
deepseek-ai/DeepSeek-V3.1-Terminus
30 days ago
国庆是休息日,请给我们关注的同学一点休息时间
👀
👍
63
1
#10 opened about 1 month ago by
luckjone
New activity in
deepseek-ai/DeepSeek-V3.2-Exp
30 days ago
Transformers does not recognize this architecture
6
#6 opened 30 days ago by
eva20150932-atlascloud
liked
a model
30 days ago
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation
•
685B
•
Updated
20 days ago
•
101k
•
•
751
New activity in
unsloth/grok-2-GGUF
about 1 month ago
mac studio : loading model vocabulary: unknown pre-tokenizer type: 'grok-2'
#5 opened about 1 month ago by
cloudyu
New activity in
Wan-AI/Wan2.2-T2V-A14B-Diffusers
2 months ago
demo能不能亲自跑一下,成功了再发出来?
#8 opened 2 months ago by
cloudyu
New activity in
ByteDance-Seed/Seed-OSS-36B-Instruct
2 months ago
Why is the chat_template mixed with Chinese and English?
👍
2
5
#8 opened 2 months ago by
Daucloud
updated
a model
4 months ago
cloudyu/Deep-Think-32B
33B
•
Updated
Jun 18
•
27
published
a model
4 months ago
cloudyu/Deep-Think-32B
33B
•
Updated
Jun 18
•
27
New activity in
onnx-community/Qwen3-1.7B-ONNX
6 months ago
please share how export qwen3 to onnx foramt, many thanks!
👍
1
2
#1 opened 6 months ago by
cloudyu
liked
a model
6 months ago
nvidia/OpenMath-Nemotron-14B-Kaggle
Text Generation
•
15B
•
Updated
May 29
•
183
•
•
16
New activity in
Qwen/QwQ-32B
8 months ago
It's challenging for QwQ to generate long codes...
2
#38 opened 8 months ago by
DXBTR74
updated
a model
9 months ago
cloudyu/S1-Llama-3.2-3Bx4-MoE
10B
•
Updated
Feb 5
•
4
published
a model
9 months ago
cloudyu/S1-Llama-3.2-3Bx4-MoE
10B
•
Updated
Feb 5
•
4
New activity in
unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF
9 months ago
error when to try this gguf
👀
1
3
#3 opened 9 months ago by
cloudyu
New activity in
unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF
9 months ago
unknown pre-tokenizer type: 'deepseek-r1-qwen'
👍
4
2
#1 opened 9 months ago by
Neman
updated
2 models
10 months ago
cloudyu/Nemo-DPO-V23
Text Generation
•
12B
•
Updated
Jan 10
•
1
•
1
cloudyu/Nemo-DPO-V22
12B
•
Updated
Dec 18, 2024
•
2
Load more