Collections

Discover the best community collections!

Collections including paper arxiv:2309.00267
LLM Refs
Collection by May 7
Preference Alignment in LLM
methods that align llm with human preference
Deep Reinforcement Learning
Features implementations and paces of popular RL algorithms and new paradigms on a variety of environments.
Dataset generation
Collection by Jul 22, 2024
Human Feedback
Collection by Feb 8, 2024
LLM Datasets
Collection by Mar 5, 2024
Super Alignment
Collection by Oct 30, 2024
RL/Alignment
Collection by 13 days ago
LLM Refs
Collection by May 7
Preference Alignment in LLM
methods that align llm with human preference
LLM Datasets
Collection by Mar 5, 2024
Deep Reinforcement Learning
Features implementations and paces of popular RL algorithms and new paradigms on a variety of environments.
Super Alignment
Collection by Oct 30, 2024
Dataset generation
Collection by Jul 22, 2024
RL/Alignment
Collection by 13 days ago
Human Feedback
Collection by Feb 8, 2024