"Pro Tip for M2 Users: Because M2 relies on Interleaved Thinking, its context is its memory. For best performance, you must retain the full session history, including the thinking steps. We've noticed that much of the community feedback about performance gaps stems from accidentally discarding this vital context, which is a common practice with simpler reasoning models."
Could this explain the output degradation I observe in later rounds when using the M2 API through Chatbox? Questions answered correctly in the first turn sometimes become incorrect in subsequent turns.
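For reference, the retention advice in the tip above can be sketched as follows. This is a minimal illustration, not the actual M2 or Chatbox implementation: the `reasoning_content` field name is an assumption for whatever key carries the thinking steps in your client, and `append_turn`/`strip_reasoning` are hypothetical helpers.

```python
# Sketch: keep full assistant turns (including thinking) in the session history.
# "reasoning_content" is an ASSUMED field name, not a confirmed M2 API detail.

def append_turn(history, user_msg, assistant_msg):
    """Append one full round, preserving any thinking/reasoning fields."""
    history.append({"role": "user", "content": user_msg})
    # Keep the assistant message intact -- do NOT strip its thinking steps.
    history.append(dict(assistant_msg))
    return history

def strip_reasoning(history):
    """The suspected mistake: discarding thinking steps between rounds."""
    return [{k: v for k, v in msg.items() if k != "reasoning_content"}
            for msg in history]

history = []
append_turn(history, "What is 2 + 2?",
            {"role": "assistant",
             "reasoning_content": "Simple arithmetic; the sum is 4.",
             "content": "4"})

# The full history still carries the thinking step for the next request:
assert "reasoning_content" in history[1]
# A client that strips it (as some do by default) loses that context:
assert "reasoning_content" not in strip_reasoning(history)[1]
```

If a client such as Chatbox silently drops the thinking field when it resends the conversation, the model effectively loses part of its memory each round, which would match the degradation described in the question.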