arxiv:2407.01470
Tzu-Han Lin
hank0316
AI & ML interests
Large Language Model, Evaluation, Parameter-Efficient Fine-Tuning (PEFT)
Recent Activity
upvoted
a
paper
27 days ago
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
updated
a model
about 1 month ago
hank0316/Llama-3.2-3B-Instruct-em-E5
published
a model
about 1 month ago
hank0316/Llama-3.2-3B-Instruct-em-E5