Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2305.14233

A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book.

Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning

Paper • 2211.04325 • Published Oct 26, 2022 • 1
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 23
On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021 • 1
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Paper • 2204.07705 • Published Apr 16, 2022 • 2

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 66
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

Paper • 2305.14233 • Published May 23, 2023 • 6

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

Paper • 2305.14233 • Published May 23, 2023 • 6

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

Paper • 2305.14233 • Published May 23, 2023 • 6
vinai/PhoGPT-4B

Text Generation • Updated Nov 12, 2024 • 2.02k • 19

Synthetic Data Generation

Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 146
Textbooks Are All You Need II: phi-1.5 technical report

Paper • 2309.05463 • Published Sep 11, 2023 • 88
TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 36
Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 104

DSI++: Updating Transformer Memory with New Documents

Paper • 2212.09744 • Published Dec 19, 2022 • 1
Where to start? Analyzing the potential value of intermediate models

Paper • 2211.00107 • Published Oct 31, 2022
INSTRUCTSCORE: Explainable Text Generation Evaluation with Finegrained Feedback

Paper • 2305.14282 • Published May 23, 2023
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment

Paper • 2303.16634 • Published Mar 29, 2023 • 3

A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book.

Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning

Paper • 2211.04325 • Published Oct 26, 2022 • 1
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 23
On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021 • 1
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Paper • 2204.07705 • Published Apr 16, 2022 • 2

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

Paper • 2305.14233 • Published May 23, 2023 • 6
vinai/PhoGPT-4B

Text Generation • Updated Nov 12, 2024 • 2.02k • 19

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 66
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

Paper • 2305.14233 • Published May 23, 2023 • 6

Synthetic Data Generation

Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 146
Textbooks Are All You Need II: phi-1.5 technical report

Paper • 2309.05463 • Published Sep 11, 2023 • 88
TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 36
Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 104

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

Paper • 2305.14233 • Published May 23, 2023 • 6

DSI++: Updating Transformer Memory with New Documents

Paper • 2212.09744 • Published Dec 19, 2022 • 1
Where to start? Analyzing the potential value of intermediate models

Paper • 2211.00107 • Published Oct 31, 2022
INSTRUCTSCORE: Explainable Text Generation Evaluation with Finegrained Feedback

Paper • 2305.14282 • Published May 23, 2023
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment

Paper • 2303.16634 • Published Mar 29, 2023 • 3

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs