nubbury
updated
StarCoder 2 and The Stack v2: The Next Generation
Paper
• 2402.19173
• Published
• 154
Griffin: Mixing Gated Linear Recurrences with Local Attention for
Efficient Language Models
Paper
• 2402.19427
• Published
• 56
Simple linear attention language models balance the recall-throughput
tradeoff
Paper
• 2402.18668
• Published
• 20
Priority Sampling of Large Language Models for Compilers
Paper
• 2402.18734
• Published
• 19
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
• 2402.17764
• Published
• 627
When Scaling Meets LLM Finetuning: The Effect of Data, Model and
Finetuning Method
Paper
• 2402.17193
• Published
• 26
Towards Optimal Learning of Language Models
Paper
• 2402.17759
• Published
• 18
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in
Text-to-Image Generation
Paper
• 2402.17245
• Published
• 11
Disentangled 3D Scene Generation with Layout Learning
Paper
• 2402.16936
• Published
• 11
VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction
Paper
• 2402.17427
• Published
• 10