Well done

by Trilogix1 - opened 13 days ago

I am still testing it, but overall it looks good. I can already see your method working.
It didn't improve the 4B much (but as I said, I am still testing).
Can this be done for gpt-oss, more precisely for this model: https://huggingface.co/hokar3361/gpt-oss-coderjs-v0.1 ?
Applying your method will certainly improve the output, but I am more curious whether it will also improve the usable context length (gpt-oss is fast, but with a context over 60k it starts looping).
Thanks for the good work.

Trilogix1

1 day ago

Can you apply your method to armand0e/gpt-oss-20b-glm-4.6-distill-GGUF?
