A few days ago, Thinking Machines Lab released “LoRA Without Regret”, showing that LoRA can match full fine-tuning performance when configured right.
Naturally, we decided to reproduce the results with TRL and release a guide!
https://huggingface.co/docs/trl/main/en/lora_without_regret