Update README.md
Browse files
README.md
CHANGED
|
@@ -63,9 +63,6 @@ Include these authors' names: {}.
|
|
| 63 |
|-----|-----------------------------------------|---------|--------|------|------------|------|-------|----------|---------------|
|
| 64 |
| 🔶 | [3rd-Degree-Burn/L-3.1-Science-Writer-8B](https://huggingface.co/3rd-Degree-Burn/L-3.1-Science-Writer-8B) | 21.08 | 42.63 | 29.2 | 10.27 | 3.24 | 11.69 | 29.44 | 0.71 |
|
| 65 |
|
| 66 |
-
|
| 67 |
-

|
| 68 |
-
|
| 69 |
## Personal thoughts
|
| 70 |
|
| 71 |
I used a pretty low rank (r=32). The final loss after 2 epochs was around 0.9, which is okay but not great. I think the deeper layers of the model haven’t been fully saturated yet, so it’s still a bit of a work in progress.
|
|
|
|
| 63 |
|-----|-----------------------------------------|---------|--------|------|------------|------|-------|----------|---------------|
|
| 64 |
| 🔶 | [3rd-Degree-Burn/L-3.1-Science-Writer-8B](https://huggingface.co/3rd-Degree-Burn/L-3.1-Science-Writer-8B) | 21.08 | 42.63 | 29.2 | 10.27 | 3.24 | 11.69 | 29.44 | 0.71 |
|
| 65 |
|
|
|
|
|
|
|
|
|
|
| 66 |
## Personal thoughts
|
| 67 |
|
| 68 |
I used a pretty low rank (r=32). The final loss after 2 epochs was around 0.9, which is okay but not great. I think the deeper layers of the model haven’t been fully saturated yet, so it’s still a bit of a work in progress.
|