BSC-LT
/

faster-whisper-large-v3-ca-punctuated-3370h

Automatic Speech Recognition

whisper-large-v3

barcelona-supercomputing-center

Model card Files Files and versions

AbirMessaoudi commited on May 12

Commit

dc55ba4

·

verified ·

1 Parent(s): e6d8b71

Update README.md

Files changed (1) hide show

README.md +4 -14

README.md CHANGED Viewed

@@ -73,7 +73,7 @@ To transcribe audio in Catalan using this model, you can follow this example:
 ```python
 from faster_whisper import WhisperModel
-model_size = "https://huggingface.co/langtech-veu/whisper-large-v3-ca-punctuated-3370h"
 # Run on GPU with FP16
 model = WhisperModel(model_size, device="cuda", compute_type="float16")
@@ -98,7 +98,7 @@ for segment in segments:
 This model is not a direct result of training. It is a conversion of a [Whisper](https://huggingface.co/openai/whisper-large-v3) model using [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master). The procedure to create the model is as follows:
 ```bash
-ct2-transformers-converter --model https://huggingface.co/langtech-veu/whisper-large-v3-ca-punctuated-3370h
    --output_dir faster-whisper-large-v3-ca-punctuated-3370h
    --copy_files preprocessor_config.json
    --quantization float16
@@ -106,25 +106,15 @@ ct2-transformers-converter --model https://huggingface.co/langtech-veu/whisper-l
 ## Citation
 If this model contributes to your research, please cite the work:
-```bibtex
-@inproceedings{hernandez20243catparla,
-  title={3CatParla: A New Open-Source Corpus of Broadcast TV in Catalan for Automatic Speech Recognition},
-  author={Hern{\'a}ndez Mena, Carlos Daniel and Armentano Oller, Carme and Solito, Sarah and K{\"u}lebi, Baybars},
-  booktitle={Proc. IberSPEECH 2024},
-  pages={176--180},
-  year={2024}
-}
 ```
-<!--
 @misc{mena2025whisperpunctuated,
       title={Acoustic Model in Catalan: whisper-large-v3-ca-punctuated-3370h.},
-      author={Hernandez Mena, Carlos Daniel, Messaoudi Abir, Bonet, Cristina},
       organization={Barcelona Supercomputing Center},
       url={https://huggingface.co/langtech-veu/faster-whisper-large-v3-ca-punctuated-3370h},
       year={2025}
 }
--->
 ## Additional Information

 ```python
 from faster_whisper import WhisperModel
+model_size = "langtech-veu/whisper-large-v3-ca-punctuated-3370h"
 # Run on GPU with FP16
 model = WhisperModel(model_size, device="cuda", compute_type="float16")
 This model is not a direct result of training. It is a conversion of a [Whisper](https://huggingface.co/openai/whisper-large-v3) model using [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master). The procedure to create the model is as follows:
 ```bash
+ct2-transformers-converter --model langtech-veu/whisper-large-v3-ca-punctuated-3370h
    --output_dir faster-whisper-large-v3-ca-punctuated-3370h
    --copy_files preprocessor_config.json
    --quantization float16
 ## Citation
 If this model contributes to your research, please cite the work:
 ```
 @misc{mena2025whisperpunctuated,
       title={Acoustic Model in Catalan: whisper-large-v3-ca-punctuated-3370h.},
+      author={Hernandez Mena, Carlos Daniel, Messaoudi, Abir, España-Bonet, Cristina},
       organization={Barcelona Supercomputing Center},
       url={https://huggingface.co/langtech-veu/faster-whisper-large-v3-ca-punctuated-3370h},
       year={2025}
 }
+```
 ## Additional Information