Fast inference for Blackwell GPUs
- ig1/Qwen2.5-VL-7B-Instruct-NVFP4
  Image-Text-to-Text • 5B • Updated • 7
- ig1/Qwen2.5-VL-7B-Instruct-FP8-Dynamic
  Image-Text-to-Text • 8B • Updated • 18
- ig1/Qwen2.5-VL-32B-Instruct-FP8-Dynamic
  Image-Text-to-Text • 33B • Updated • 10
- ig1/Qwen2.5-VL-72B-Instruct-FP8-Dynamic
  Image-Text-to-Text • 73B • Updated • 18
models
16
- ig1/medgemma-27b-text-it-FP8-Dynamic
  Text Generation • 28B • Updated • 10
- ig1/medgemma-27b-it-FP8-Dynamic
  Text Generation • 29B • Updated • 155
- ig1/BioMistral-7B-FP8-Dynamic
  Text Generation • 7B • Updated • 12
- ig1/Qwen3-30B-A3B-Instruct-2507-NVFP4
  17B • Updated • 263
- ig1/Qwen3-30B-A3B-NVFP4
  17B • Updated • 5
- ig1/Qwen3-VL-30B-A3B-Instruct-NVFP4
  Image-Text-to-Text • 18B • Updated • 1.66k • 3
- ig1/Qwen3-Coder-30B-A3B-Instruct-NVFP4
  Text Generation • 17B • Updated • 303 • 1
- ig1/Qwen2.5-VL-7B-Instruct-NVFP4
  Image-Text-to-Text • 5B • Updated • 7
- ig1/Qwen2.5-VL-7B-Instruct-FP8-Dynamic
  Image-Text-to-Text • 8B • Updated • 18
- ig1/Qwen3-Next-80B-A3B-Instruct-NVFP4
  Text Generation • Updated • 36.1k • 2
datasets
0
None public yet