PaddlePaddle's Collections
PaddleOCR-VL
PP-StructureV3
PP-OCRv5
PP-OCRv4
PP-OCRv3

PaddleOCR-VL

updated Oct 17

Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

  • PaddlePaddle/PaddleOCR-VL

    Image-Text-to-Text • 1.0B • Updated 9 days ago • 22.4k • 1.39k

  • PaddleOCR-VL Online Demo

    Running • Featured • 192

    Parse and recognize text in images


  • PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

    Paper • 2510.14528 • Published Oct 16 • 103