Hugging Face
Models: 9,128
Sort: Trending
Active filters: image-to-text
datalab-to/chandra • Image-to-Text • 9B • Updated 19 days ago • 47.6k • 288
lightonai/LightOnOCR-1B-1025 • Image-to-Text • Updated 4 days ago • 15.3k • 152
allenai/olmOCR-2-7B-1025-FP8 • Image-to-Text • 8B • Updated 18 days ago • 104k • 131
Salesforce/blip-image-captioning-base • Image-to-Text • Updated Feb 3 • 2.56M • 808
noctrex/Chandra-OCR-GGUF • Image-to-Text • 8B • Updated 5 days ago • 12.7k • 6
allenai/olmOCR-2-7B-1025 • Image-to-Text • 8B • Updated 18 days ago • 19k • 71
nyu-visionx/Cambrian-S-7B • Image-to-Text • 8B • Updated 3 days ago • 19 • 4
reducto/RolmOCR • Image-to-Text • 8B • Updated Apr 2 • 18.6k • 562
nvidia/nemoretriever-ocr-v1 • Image-to-Text • Updated 13 days ago • 73 • 18
nyu-visionx/Cambrian-S-1.5B • Image-to-Text • 2B • Updated 3 days ago • 21 • 3
Salesforce/blip-image-captioning-large • Image-to-Text • 0.5B • Updated Feb 3 • 1.1M • 1.43k
breezedeus/pix2text-mfr • Image-to-Text • Updated May 5, 2024 • 38.4k • 45
xtuner/llava-phi-3-mini-hf • Image-to-Text • 4B • Updated Apr 25, 2024 • 1.36k • 53
Ertugrul/Qwen2-VL-7B-Captioner-Relaxed • Image-to-Text • 8B • Updated Sep 26, 2024 • 308 • 64
monkt/paddleocr-onnx • Image-to-Text • Updated Oct 7 • 13
Svngoku/Qwen3-VL-TimeTravel • Image-to-Text • 9B • Updated 24 days ago • 115 • 4
nyu-visionx/Cambrian-S-0.5B • Image-to-Text • 0.9B • Updated 3 days ago • 19 • 2
mychen76/invoice-and-receipts_donut_v1 • Image-to-Text • 0.2B • Updated Apr 19, 2024 • 548 • 67
xtuner/llava-phi-3-mini-gguf • Image-to-Text • 4B • Updated Apr 29, 2024 • 1.89k • 136
breezedeus/pix2text-mfd • Image-to-Text • Updated Jul 10, 2024 • 2.77k • 6
medieval-data/trocr-medieval-textualis • Image-to-Text • 0.3B • Updated Jul 3, 2024 • 9 • 2
fancyfeast/llama-joycaption-alpha-two-vqa-test-1 • Image-to-Text • 8B • Updated Nov 29, 2024 • 81 • 8
llamaindex/vdr-2b-multi-v1 • Image-to-Text • 2B • Updated May 21 • 47.1k • 119
ibm-granite/granite-vision-3.1-2b-preview • Image-to-Text • 3B • Updated Jun 12 • 853 • 108
ibm-granite/granite-vision-3.2-2b • Image-to-Text • 3B • Updated Jun 12 • 11k • 116
VLM2Vec/VLM2Vec-V2.0 • Image-to-Text • Updated Jul 13 • 6.74k • 17
scb10x/typhoon-ocr-7b • Image-to-Text • 8B • Updated Jul 11 • 23.1k • 75
scb10x/typhoon-ocr-3b • Image-to-Text • 4B • Updated Jul 11 • 3.1k • 7
PaddlePaddle/PP-Chart2Table • Image-to-Text • Updated Jul 22 • 6.14k • 2
PaddlePaddle/PP-DocLayout_plus-L • Image-to-Text • Updated Jul 22 • 9.81k • 12