From 36403cb64b8f386ad802ddc261df6071baf28a17 Mon Sep 17 00:00:00 2001
From: Reuben Morais
Date: Fri, 6 Sep 2019 14:01:38 +0200
Subject: [PATCH] Add model download and extraction to initial example

X-DeepSpeech: NOBUILD
---
 README.md | 12 ++++++++++--
 1 file changed, 10 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 6f240099..fe5f31dd 100644
--- a/README.md
+++ b/README.md
@@ -14,8 +14,16 @@ source $HOME/tmp/deepspeech-venv/bin/activate
 # Install DeepSpeech
 pip3 install deepspeech
 
+# Download pre-trained English model and extract
+curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.5.1/deepspeech-0.5.1-models.tar.gz
+tar xvf deepspeech-0.5.1-models.tar.gz
+
+# Download example audio files
+curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.5.1/audio-0.5.1.tar.gz
+tar xvf audio-0.5.1.tar.gz
+
 # Transcribe an audio file
-deepspeech --model models/output_graph.pbmm --alphabet models/alphabet.txt --lm models/lm.binary --trie models/trie --audio my_audio_file.wav
+deepspeech --model deepspeech-0.5.1-models/output_graph.pbmm --alphabet deepspeech-0.5.1-models/alphabet.txt --lm deepspeech-0.5.1-models/lm.binary --trie deepspeech-0.5.1-models/trie --audio audio/2830-3980-0043.wav
 ```
 
 A pre-trained English model is available for use and can be downloaded using [the instructions below](#using-a-pre-trained-model). Currently, only 16-bit, 16 kHz, mono-channel WAVE audio files are supported in the Python client. A package with some example audio files is available for download in our [release notes](https://github.com/mozilla/DeepSpeech/releases/latest).
@@ -31,7 +39,7 @@ source $HOME/tmp/deepspeech-gpu-venv/bin/activate
 pip3 install deepspeech-gpu
 
 # Transcribe an audio file.
-deepspeech --model models/output_graph.pbmm --alphabet models/alphabet.txt --lm models/lm.binary --trie models/trie --audio my_audio_file.wav
+deepspeech --model deepspeech-0.5.1-models/output_graph.pbmm --alphabet deepspeech-0.5.1-models/alphabet.txt --lm deepspeech-0.5.1-models/lm.binary --trie deepspeech-0.5.1-models/trie --audio audio/2830-3980-0043.wav
 ```
 
 Please ensure you have the required [CUDA dependencies](#cuda-dependency).
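
Note on the audio format: the README text touched by this patch says the Python client currently supports only 16-bit, 16 kHz, mono-channel WAVE files. Readers who replace the example audio with their own recording may want to check the file first. The following is a minimal sketch using only Python's standard `wave` module. The helper name `check_wav_format` is hypothetical and is not part of DeepSpeech.

```python
import wave


def check_wav_format(path):
    """Return a list of format problems; an empty list means the file
    is 16-bit, 16 kHz, mono, as the README requires.

    Hypothetical helper for illustration only, not a DeepSpeech API.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        channels = wav.getnchannels()
        sample_width = wav.getsampwidth()  # bytes per sample
        frame_rate = wav.getframerate()    # samples per second

    if channels != 1:
        problems.append("expected 1 channel (mono), found %d" % channels)
    if sample_width != 2:
        problems.append("expected 16-bit samples, found %d-bit" % (sample_width * 8))
    if frame_rate != 16000:
        problems.append("expected 16000 Hz sample rate, found %d Hz" % frame_rate)
    return problems
```

Calling `check_wav_format("audio/2830-3980-0043.wav")` after the download steps in the patch would report any mismatch before the file is passed to `--audio`.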