Added clearer instructions for setup and errors
The tutorial misses a few steps for the setup and doesn't mention a few common errors. I also tried format it better.
This commit is contained in:
parent
80493c83c3
commit
5eb47053e7
@ -5,7 +5,36 @@ They take in a wav file of any duration, use the WebRTC Voice Activity Detector
|
|||||||
to split it into smaller chunks and finally save a consolidated transcript.
|
to split it into smaller chunks and finally save a consolidated transcript.
|
||||||
|
|
||||||
### 0. Prerequisites
|
### 0. Prerequisites
|
||||||
Setup your environment
|
#### 0.1 Install requiered packages
|
||||||
|
Install the package which contains rec on the machine:
|
||||||
|
|
||||||
|
Fedora:
|
||||||
|
|
||||||
|
``` sudo dnf install sox ```
|
||||||
|
|
||||||
|
Tested on: 29
|
||||||
|
|
||||||
|
Ubuntu/Debian
|
||||||
|
|
||||||
|
``` sudo apt install sox ```
|
||||||
|
|
||||||
|
A list of distributions where the package is available can be found at: https://pkgs.org/download/sox
|
||||||
|
|
||||||
|
#### 0.1 Download Deepspeech
|
||||||
|
|
||||||
|
Download a stable(!) release from the release page and extract it to a folder of your choice.
|
||||||
|
|
||||||
|
This is because you need to use the same deepspeech model version and deepspeech version for things to work.
|
||||||
|
|
||||||
|
You only need the example folder, but you can't download it seperately, so you have to download the whole sourcecode.
|
||||||
|
|
||||||
|
For the next steps we assume you have extracted the files to ~/Deepspeech
|
||||||
|
|
||||||
|
**Note: Currently there is a bug in requierement.txt of the example folders which installs deepspech 4.1 when downloading the source code for 5.1, to fix this simply run pip3 install deepspeech==0.5.1 after installing**
|
||||||
|
|
||||||
|
#### 0.2 Setup your environment
|
||||||
|
|
||||||
|
Ubuntu/Debian:
|
||||||
|
|
||||||
```
|
```
|
||||||
~/Deepspeech$ sudo apt install virtualenv
|
~/Deepspeech$ sudo apt install virtualenv
|
||||||
@ -15,6 +44,18 @@ Setup your environment
|
|||||||
(venv) ~/Deepspeech/examples/vad_transcriber$ pip3 install -r requirements.txt
|
(venv) ~/Deepspeech/examples/vad_transcriber$ pip3 install -r requirements.txt
|
||||||
```
|
```
|
||||||
|
|
||||||
|
Fedora
|
||||||
|
|
||||||
|
```
|
||||||
|
~/Deepspeech$ sudo dnf install python-virtualen
|
||||||
|
~/Deepspeech$ cd examples/vad_transcriber
|
||||||
|
~/Deepspeech/examples/vad_transcriber$ virtualenv -p python3 venv
|
||||||
|
~/Deepspeech/examples/vad_transcriber$ source venv/bin/activate
|
||||||
|
(venv) ~/Deepspeech/examples/vad_transcriber$ pip3 install -r requirements.txt
|
||||||
|
```
|
||||||
|
|
||||||
|
Tested on: 29
|
||||||
|
|
||||||
### 1. Command line tool
|
### 1. Command line tool
|
||||||
|
|
||||||
The command line tool processes a wav file of any duration and returns a trancript
|
The command line tool processes a wav file of any duration and returns a trancript
|
||||||
@ -63,3 +104,11 @@ In such a scenario, the GUI tool will not work. The following steps is known to
|
|||||||
(venv) ~/Deepspeech/examples/vad_transcriber$ python3 audioTranscript_gui.py
|
(venv) ~/Deepspeech/examples/vad_transcriber$ python3 audioTranscript_gui.py
|
||||||
|
|
||||||
```
|
```
|
||||||
|
#### 2.2 Known Bugs
|
||||||
|
##### Could not load modal with error code X
|
||||||
|
Often this is because you try to load a older or newer model than the deepspeech version you are using.
|
||||||
|
Be sure to load only the models that where released with the same deepspeech version you are using.
|
||||||
|
|
||||||
|
This is the reason we advice you to use the examples from a released stable version.
|
||||||
|
##### The GUI programm immediately crashes when you press start recording
|
||||||
|
This happens when you don't load the models via the "Browse Models" button, before pressing the "Start recording" button.
|
||||||
|
Loading…
x
Reference in New Issue
Block a user