chansung/merged_ds_coding
Viewer • Updated • 60.6k • 149 • 18
How to use chansung/coding_llamaduo_60k with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("google/gemma-7b")
model = PeftModel.from_pretrained(base_model, "chansung/coding_llamaduo_60k")This model is a fine-tuned version of google/gemma-7b on the chansung/merged_ds_coding dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.6618 | 1.0 | 252 | 1.2071 |
| 0.5731 | 2.0 | 504 | 1.1436 |
| 0.5198 | 3.0 | 756 | 1.1346 |
| 0.4783 | 4.0 | 1008 | 1.1536 |
| 0.4378 | 5.0 | 1260 | 1.2225 |
| 0.3836 | 6.0 | 1512 | 1.2893 |
| 0.3381 | 7.0 | 1764 | 1.4050 |
| 0.3043 | 8.0 | 2016 | 1.5185 |
| 0.2778 | 9.0 | 2268 | 1.6143 |
| 0.2748 | 10.0 | 2520 | 1.6318 |
Base model
google/gemma-7b