Added clearer instructions for setup and errors

The tutorial misses a few steps for the setup and doesn't mention a few common errors. I also tried format it better.
2019-11-06 12:02:48 +01:00 · 2019-11-06 12:02:48 +01:00 · 5eb47053e7
commit 5eb47053e7
parent 80493c83c3
1 changed files with 50 additions and 1 deletions
--- a/examples/vad_transcriber/wavTranscription.md
+++ b/examples/vad_transcriber/wavTranscription.md
@ -5,7 +5,36 @@ They take in a wav file of any duration, use the WebRTC Voice Activity Detector
 to split it into smaller chunks and finally save a consolidated transcript.
 ### 0. Prerequisites
-Setup your environment
+#### 0.1 Install requiered packages
 Install the package which contains rec on the machine:
 Fedora:
 ``` sudo dnf install sox ```
 Tested on: 29
 Ubuntu/Debian
 ``` sudo apt install sox ```
 A list of distributions where the package is available can be found at: https://pkgs.org/download/sox
 #### 0.1 Download Deepspeech 
 Download a stable(!) release from the release page and extract it to a folder of your choice.
 This is because you need to use the same deepspeech model version and deepspeech version for things to work.
 You only need the example folder, but you can't download it seperately, so you have to download the whole sourcecode.
 For the next steps we assume you have extracted the files to ~/Deepspeech
 **Note: Currently there is a bug in requierement.txt of the example folders which installs deepspech 4.1 when downloading the source code for 5.1, to fix this simply run pip3 install deepspeech==0.5.1 after installing**
 #### 0.2 Setup your environment
 Ubuntu/Debian:
 ```
 ~/Deepspeech$ sudo apt install virtualenv
@ -15,6 +44,18 @@ Setup your environment
 (venv) ~/Deepspeech/examples/vad_transcriber$ pip3 install -r requirements.txt
 ```
 Fedora
 ```
 ~/Deepspeech$ sudo dnf install python-virtualen
 ~/Deepspeech$ cd examples/vad_transcriber
 ~/Deepspeech/examples/vad_transcriber$ virtualenv -p python3 venv
 ~/Deepspeech/examples/vad_transcriber$ source venv/bin/activate
 (venv) ~/Deepspeech/examples/vad_transcriber$ pip3 install -r requirements.txt
 ```
 Tested on: 29
 ### 1. Command line tool
 The command line tool processes a wav file of any duration and returns a trancript
@ -63,3 +104,11 @@ In such a scenario, the GUI tool will not work. The following steps is known to
 (venv) ~/Deepspeech/examples/vad_transcriber$ python3 audioTranscript_gui.py
 ```
 #### 2.2 Known Bugs
 #####  Could not load modal with error code X
 Often this is because you try to load a older or newer model than the deepspeech version you are using.
 Be sure to load only the models that where released with the same deepspeech version you are using.
 This is the reason we advice you to use the examples from a released stable version.
 #####  The GUI programm immediately crashes when you press start recording
 This happens when you don't load the models via the "Browse Models" button, before pressing the "Start recording" button.