ocr
tencent/HunyuanOCR • Image-Text-to-Text • Updated Jan 13 • 142k • 564
opendatalab/MinerU2.5-2509-1.2B • Image-Text-to-Text • 1B • Updated 3 days ago • 121k • 351
PaddlePaddle/PaddleOCR-VL-1.5 • Image-Text-to-Text • 1.0B • Updated 26 days ago • 396k • 540
PaddlePaddle/PaddleOCR-VL • Image-Text-to-Text • 1.0B • Updated 17 days ago • 6.7k • 1.58k
asr
FireRedTeam/FireRedASR-AED-L • Automatic Speech Recognition • Updated Mar 5, 2025 • 143 • 68
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 339k • 1.58k