Toolformer: Language Models Can Teach Themselves to Use Tools Paper • 2302.04761 • Published Feb 9, 2023 • 12
view article Article Granite 4.0 Nano: Just how small can you go? By ibm-granite and 1 other • 3 days ago • 88
view article Article Model statistics of the 50 most downloaded entities on Hugging Face By lbourdois • 18 days ago • 27
PaddleOCR-VL Collection Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model • 3 items • Updated 14 days ago • 19
Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Paper • 2510.14880 • Published 15 days ago • 14
BERT Hash Nano Models Collection Set of BERT models with a modified embeddings layer • 3 items • Updated 25 days ago • 8
Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research Paper • 2510.06056 • Published 24 days ago • 5
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 25 days ago • 460
view article Article RexBERT: Encoders for a brave new world of E-Commerce By thebajajra and 1 other • Sep 20 • 48
ZeroShot Medical & Clinical NER Collection OpenMed ZeroShot NER Models • 93 items • Updated Sep 15 • 18
Tfree-HAT-7b-pretrained Collection Tokenizer free models based on Hierarchical Autoregressive Transformer (https://arxiv.org/abs/2501.10322) trained from scratch. • 2 items • Updated Aug 1 • 10
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15 • 48
view article Article PP-OCRv5 on Hugging Face: A Specialized Approach to OCR By baidu and 5 others • Sep 10 • 108
view article Article 📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models By nvidia and 1 other • Aug 18 • 5
MiroThinker-v0.1 Collection High performance in deep research and tool use. • 7 items • Updated Sep 8 • 32