Skander Moalla
skandermoalla
AI & ML interests
DeepRL, RL finetuning
Recent Activity
authored a paper about 12 hours ago
Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers
authored a paper about 12 hours ago
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
authored a paper about 12 hours ago
Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions