Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Jongbok Won
wvnvwn
Follow
wvnvwn
jongbok-won-b6851a2b2
AI & ML interests
Preference Optimization and Machine Alignment
Organizations
None yet
wvnvwn
's models
28
Sort: Recently updated
wvnvwn/llama-3.2-1b-instruct-ttl
Text Generation
•
1B
•
Updated
Oct 13, 2025
wvnvwn/llama-3.2-1b-instruct-dpo-gen
Text Generation
•
1B
•
Updated
Oct 12, 2025
wvnvwn/llama-3.2-1b-instruct-simpo-gen
Text Generation
•
1B
•
Updated
Oct 12, 2025
wvnvwn/llama-3.2-1b-instruct-m_vv3-gen
Text Generation
•
1B
•
Updated
Oct 12, 2025
wvnvwn/llama-3.2-1b-instruct-m_vv2-gen
Text Generation
•
1B
•
Updated
Oct 12, 2025
wvnvwn/llama-3.2-1b-instruct-m_vv1-gen
Text Generation
•
1B
•
Updated
Oct 12, 2025
wvnvwn/llama-3.2-1b-instruct-m_v3
1B
•
Updated
Oct 3, 2025
wvnvwn/llama-3.2-1b-instruct-m_v2
1B
•
Updated
Oct 3, 2025
wvnvwn/llama-3.2-1b-instruct-m_v1
1B
•
Updated
Oct 3, 2025
wvnvwn/phi-2-train
Updated
Oct 3, 2025
wvnvwn/bpr_model
Updated
Oct 2, 2025
wvnvwn/bpr_score_wo_detach
3B
•
Updated
Sep 22, 2025
wvnvwn/bpr_ncpo_nos
3B
•
Updated
Sep 22, 2025
wvnvwn/bpr_ncpo_nos_chosen_anchor
3B
•
Updated
Sep 22, 2025
wvnvwn/bpr_cosine_wo_detach
3B
•
Updated
Sep 22, 2025
wvnvwn/phi-2-sft-ncpo-chosen-anchor
3B
•
Updated
Sep 6, 2025
wvnvwn/phi-2-sft-ncpo
3B
•
Updated
Sep 6, 2025
wvnvwn/phi-3.5-mini-instruct-ncpo-blender-chosen-anchor
4B
•
Updated
Sep 6, 2025
wvnvwn/ncpo_new_sft
Updated
Sep 2, 2025
wvnvwn/phi-2-sft-new-full
3B
•
Updated
Sep 2, 2025
wvnvwn/phi-3.5-mini-instruct-simpo-blender
4B
•
Updated
Aug 29, 2025
wvnvwn/phi-3.5-mini-instruct-dpo-blender
4B
•
Updated
Aug 29, 2025
wvnvwn/phi-3.5-mini-instruct-ncpo-blender
4B
•
Updated
Aug 29, 2025
wvnvwn/phi-2-sft
Text Generation
•
Updated
Aug 24, 2025
wvnvwn/full_llama3_8b_sft_offncpo
8B
•
Updated
Aug 9, 2025
wvnvwn/full_llama3_8b_sft_simpo
8B
•
Updated
Aug 9, 2025
wvnvwn/full_llama3_base_sft_dpo
8B
•
Updated
Aug 9, 2025
wvnvwn/ncpo-with-ultrafeedback-binarized-random-20000-2-tau0.5
4B
•
Updated
Jul 31, 2025