Compiled engines for running Whisper with TensorRT-LLM for much faster inference.
Models (672)
baseten/whisper_trt_large_v2_251013_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_21_0
baseten/whisper_trt_large_v3_turbo_251013_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_21_0
baseten/whisper_trt_large_v2_251013_NVIDIA_H100_80GB_HBM3_0_21_0
baseten/whisper_trt_large_v3_251013_NVIDIA_L4_0_21_0
baseten/whisper_trt_large_v2_251013_NVIDIA_L4_0_21_0
baseten/whisper_trt_large_v3_251013_NVIDIA_H100_80GB_HBM3_0_21_0
baseten/whisper_trt_large_v3_turbo_251013_NVIDIA_L4_0_21_0
baseten/whisper_trt_large_v3_turbo_251013_NVIDIA_H100_80GB_HBM3_0_21_0
baseten/whisper_trt_large_v3_251013_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_21_0
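The engine names listed above appear to follow a common pattern: a Whisper variant, a six-digit build tag, a GPU identifier, and a version-like suffix. This reading is inferred only from the names on this page and is not a documented convention. Under that assumption, a minimal sketch for splitting a repository name into its parts, for example to pick the engine that matches the local GPU, might look like this. `EngineName` and `parse_engine_name` are hypothetical helpers, not part of any library.

```python
import re
from typing import NamedTuple, Optional


class EngineName(NamedTuple):
    """Parts of an engine repo name (field meanings are assumptions)."""
    variant: str  # e.g. "large_v3_turbo"
    build: str    # six-digit tag, e.g. "251013" (possibly a date; not confirmed)
    gpu: str      # e.g. "NVIDIA_H100_80GB_HBM3_MIG_3g_40gb"
    version: str  # e.g. "0.21.0" (version-like suffix; meaning not confirmed)


# Pattern inferred from the names listed on this page, not from documentation.
_PATTERN = re.compile(
    r"^(?:baseten/)?whisper_trt_"
    r"(?P<variant>.+?)_"
    r"(?P<build>\d{6})_"
    r"(?P<gpu>NVIDIA_.+)_"
    r"(?P<version>\d+_\d+_\d+)$"
)


def parse_engine_name(repo_id: str) -> Optional[EngineName]:
    """Return the parsed parts, or None if the name does not fit the pattern."""
    match = _PATTERN.match(repo_id)
    if match is None:
        return None
    return EngineName(
        variant=match["variant"],
        build=match["build"],
        gpu=match["gpu"],
        version=match["version"].replace("_", "."),
    )


if __name__ == "__main__":
    print(parse_engine_name(
        "baseten/whisper_trt_large_v3_turbo_251013_NVIDIA_L4_0_21_0"
    ))
```

Names that do not fit the assumed pattern, such as other model families under the same organization, return `None` rather than raising, so the helper can be used to filter a mixed list of repositories.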
baseten/Llama-3.2-3B-Instruct-pythonic • Text Generation • 3B • 2.21k