view article Article We’re open-sourcing our text-to-image model and the process behind it 15 days ago • 71
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15 • 49
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 175
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB Jan 27 • 21
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Paper • 2501.05707 • Published Jan 10 • 20
M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding Paper • 2411.04952 • Published Nov 7, 2024 • 30
view article Article 🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦⬛ Oct 21, 2024 • 20
view article Article How to build a custom text classifier without days of human labeling Oct 17, 2024 • 56