Yuchen Cheng's picture

33 93

Yuchen Cheng

rudeigerc

·

https://rudeigerc.dev

AI & ML interests

Kubernetes / LLMOps

Recent Activity

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3.2-Exp

liked a model 2 months ago

xai-org/grok-2

liked a model 2 months ago

nvidia/NVIDIA-Nemotron-Nano-9B-v2

View all activity

Organizations

None yet

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • 685B • Updated 21 days ago • 101k • • 752

liked 5 models 2 months ago

xai-org/grok-2

Updated Aug 24 • 9.52k • 974

nvidia/NVIDIA-Nemotron-Nano-9B-v2

Text Generation • 9B • Updated 15 days ago • 230k • 417

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • 36B • Updated Aug 26 • 6.82k • 444

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • 685B • Updated Aug 26 • 11.5k • 1k

google/gemma-3-270m-it

Text Generation • 0.3B • Updated Aug 14 • 209k • 443

liked 3 models 3 months ago

tencent/Hunyuan-1.8B-Instruct

Text Generation • 2B • Updated Aug 6 • 287 • 597

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 4.77M • • 3.82k

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 3.81M • • 4.07k

liked a model 4 months ago

moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated 8 days ago • 88.7k • • 2.19k

upvoted a paper 4 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 268

liked 2 models 5 months ago

MiniMaxAI/MiniMax-M1-80k

Text Generation • 456B • Updated Jul 7 • 217 • • 683

mistralai/Magistral-Small-2506

24B • Updated Jul 28 • 37.1k • 603

upvoted 3 papers 5 months ago

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28 • 43

Inference-Time Hyper-Scaling with KV Cache Compression

Paper • 2506.05345 • Published Jun 5 • 27

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14 • 71

liked a model 5 months ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29 • 541k • • 2.38k

liked a model 7 months ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • 109B • Updated May 22 • 202k • • 1.13k

liked 2 models 8 months ago

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 974k • • 1.66k

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated May 1 • 533k • 1.52k