Article Illustrating Reinforcement Learning from Human Feedback (RLHF) • Dec 9, 2022 • 387
LINKS: English-English Mnemonics Collection Investigate the potential of mining linguistic knowledge/reasoning from LLM to generate mnemonic devices that aid vocabulary learning. • 7 items • Updated Sep 16, 2025 • 1
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17, 2025 • 49
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • 33B • Updated Feb 24, 2025 • 2.75M • 1.48k
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated Jul 10, 2025 • 12