quyanh/openai_summarize_tldr_sft-dpo
Transformers
Safetensors
Generated from Trainer
dpo
trl
arxiv:2305.18290
main
openai_summarize_tldr_sft-dpo
Commit History
DPO LoRA checkpoint - step 11400 (eval_loss: 1.2601)
f4b37c0
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 11300 (eval_loss: 1.2513)
59738fd
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 11200 (eval_loss: 1.2579)
d02d3b6
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 11100 (eval_loss: 1.2398)
cbc3b49
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 11000 (eval_loss: 1.2253)
25d6d4b
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10900 (eval_loss: 1.2207)
896de01
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10800 (eval_loss: 1.2091)
693c7e4
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10700 (eval_loss: 1.1753)
6be486a
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10600 (eval_loss: 1.2102)
5b3eb32
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10500 (eval_loss: 1.2661)
be9f6a2
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10400 (eval_loss: 1.2887)
1d5c450
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10300 (eval_loss: 1.2396)
3e39604
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10200 (eval_loss: 1.2844)
17d5d28
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10100 (eval_loss: 1.2777)
e6b0a36
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 10000 (eval_loss: 1.2282)
e44495e
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9900 (eval_loss: 1.2397)
a5c1d28
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9800 (eval_loss: 1.2434)
721fa15
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9700 (eval_loss: 1.2482)
fbc6f1c
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9600 (eval_loss: 1.2737)
fdc9352
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9500 (eval_loss: 1.3414)
4235bbf
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9400 (eval_loss: 1.2743)
b1e75a2
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9300 (eval_loss: 1.3085)
040250f
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9200 (eval_loss: 1.3521)
23173b8
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9100 (eval_loss: 1.3716)
e003594
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 9000 (eval_loss: 1.3357)
278b592
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8900 (eval_loss: 1.3247)
010ae67
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8800 (eval_loss: 1.2598)
70b43a8
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8700 (eval_loss: 1.2278)
7fd72c6
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8600 (eval_loss: 1.3157)
9304449
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8500 (eval_loss: 1.2912)
c9d808f
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8400 (eval_loss: 1.2837)
fd7c280
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8300 (eval_loss: 1.2756)
f8201d5
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8200 (eval_loss: 1.2553)
7d875ce
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8100 (eval_loss: 1.2217)
f657d40
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 8000 (eval_loss: 1.2477)
7dd8907
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7900 (eval_loss: 1.2876)
742046b
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7800 (eval_loss: 1.3024)
46f8afd
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7700 (eval_loss: 1.3519)
d673f1a
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7600 (eval_loss: 1.4216)
4a9eff1
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7500 (eval_loss: 1.5279)
b355ed0
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7400 (eval_loss: 1.4196)
828032e
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7300 (eval_loss: 1.4210)
e72218a
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7200 (eval_loss: 1.4379)
c581be0
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7100 (eval_loss: 1.4441)
da89b9c
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 7000 (eval_loss: 1.4042)
1d5733c
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 6900 (eval_loss: 1.4159)
b0bd540
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 6800 (eval_loss: 1.4053)
6ff5ffb
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 6700 (eval_loss: 1.3686)
3019e96
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 6600 (eval_loss: 1.3012)
3935bdc
verified
quyanh
committed on
Aug 16
DPO LoRA checkpoint - step 6500 (eval_loss: 1.2000)
5917de3
verified
quyanh
committed on
Aug 16
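Each commit title above encodes a training step and an eval_loss, so the listing can be scanned programmatically. The following is a minimal sketch, not the author's tooling: the names COMMIT_TITLES, TITLE_RE, parse_title and best_checkpoint are hypothetical, the titles are a small sample copied from this page rather than the full history, and picking the lowest eval_loss is only one possible selection criterion.

```python
import re

# A small sample of commit titles copied from the listing above
# (not the full history; earlier pages are not included).
COMMIT_TITLES = [
    "DPO LoRA checkpoint - step 11400 (eval_loss: 1.2601)",
    "DPO LoRA checkpoint - step 10700 (eval_loss: 1.1753)",
    "DPO LoRA checkpoint - step 7500 (eval_loss: 1.5279)",
    "DPO LoRA checkpoint - step 6500 (eval_loss: 1.2000)",
]

# Matches the "step N (eval_loss: X)" part of a commit title.
TITLE_RE = re.compile(r"step (\d+) \(eval_loss: ([0-9.]+)\)")


def parse_title(title):
    """Return (step, eval_loss) for a checkpoint title, or None if it does not match."""
    match = TITLE_RE.search(title)
    if match is None:
        return None
    return int(match.group(1)), float(match.group(2))


def best_checkpoint(titles):
    """Return the (step, eval_loss) pair with the lowest eval_loss among the titles."""
    parsed = [p for p in (parse_title(t) for t in titles) if p is not None]
    if not parsed:
        raise ValueError("no checkpoint titles could be parsed")
    return min(parsed, key=lambda pair: pair[1])


if __name__ == "__main__":
    step, loss = best_checkpoint(COMMIT_TITLES)
    print(f"lowest eval_loss in sample: step {step} ({loss})")
```

Among the commits shown on this page, the lowest eval_loss is 1.1753 at step 10700 (commit 6be486a).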