Sanchit Gandhi committed · Commit 3dccfad · Parent(s): 8f54680

Correct scripts

Files changed:
- README.md (+5 -4)
- run_spgispeech.sh (+1 -1)
README.md
CHANGED

@@ -2,17 +2,18 @@
 language:
 - en
 tags:
--
+- esb
 datasets:
--
+- esb/datasets
+- kensho/spgispeech
 ---
-To reproduce this run, execute:
+To reproduce this run, first install NVIDIA NeMo according to the [official instructions](https://github.com/NVIDIA/NeMo#installation), then execute:
 ```python
 #!/usr/bin/env bash
 CUDA_VISIBLE_DEVICES=0 python run_speech_recognition_rnnt.py \
 --config_path="conf/conformer_transducer_bpe_xlarge.yaml" \
 --model_name_or_path="stt_en_conformer_transducer_xlarge" \
---dataset_name="
+--dataset_name="esb/datasets" \
 --tokenizer_path="tokenizer" \
 --vocab_size="1024" \
 --max_steps="100000" \
run_spgispeech.sh
CHANGED

@@ -2,7 +2,7 @@
 CUDA_VISIBLE_DEVICES=0 python run_speech_recognition_rnnt.py \
 --config_path="conf/conformer_transducer_bpe_xlarge.yaml" \
 --model_name_or_path="stt_en_conformer_transducer_xlarge" \
---dataset_name="
+--dataset_name="esb/datasets" \
 --tokenizer_path="tokenizer" \
 --vocab_size="1024" \
 --max_steps="100000" \
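The commit's fix matters because the old `--dataset_name="` line left a double quote unterminated, so the shell could not even tokenize the script before reaching the remaining flags. A minimal sketch of the problem, using Python's `shlex` (which follows POSIX shell quoting rules) purely for illustration:

```python
import shlex

# Old line from run_spgispeech.sh: the double quote is never closed,
# so shell-style tokenization fails outright.
broken = '--dataset_name="'

# Corrected line from the commit (trailing "\" continuation omitted here).
fixed = '--dataset_name="esb/datasets"'

try:
    shlex.split(broken)
except ValueError as err:
    print(f"broken line: {err}")  # unterminated quote is a parse error

print(shlex.split(fixed))  # one clean token: ['--dataset_name=esb/datasets']
```

The same unterminated quote appeared in the README's copy of the command, which is why both files change in one commit.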