Merge pull request #3118 from mozilla/model-type-docs

Add more doc text around distinction between various pre-trained model files (Fixes #2941)
2020-06-30 22:16:25 +02:00 · 2020-06-30 22:16:25 +02:00 · 24526aa82d
commit 24526aa82d
parent 3762a9b588 5c41b8966e
2 changed files with 17 additions and 1 deletions
--- a/doc/C-API.rst
+++ b/doc/C-API.rst
@ -1,3 +1,5 @@
+.. _c-usage:
+
 C API
 =====

--- a/doc/USING.rst
+++ b/doc/USING.rst
@ -5,7 +5,7 @@ Using a Pre-trained Model

 Inference using a DeepSpeech pre-trained model can be done with a client/language binding package. We have four clients/language bindings in this repository, listed below, and also a few community-maintained clients/language bindings in other repositories, listed `further down in this README <#third-party-bindings>`_.

-* `The C API <c-usage>`.
+* :ref:`The C API <c-usage>`.
 * :ref:`The Python package/language binding <py-usage>`
 * :ref:`The Node.JS package/language binding <nodejs-usage>`
 * :ref:`The command-line client <cli-usage>`
@ -40,6 +40,20 @@ If you want to use the pre-trained English model for performing speech-to-text,
   wget https://github.com/mozilla/DeepSpeech/releases/download/v0.7.4/deepspeech-0.7.4-models.pbmm
   wget https://github.com/mozilla/DeepSpeech/releases/download/v0.7.4/deepspeech-0.7.4-models.scorer

+There are several pre-trained model files available in official releases. Files ending in ``.pbmm`` are compatible with clients and language bindings built against the standard TensorFlow runtime. Usually these packages are simply called ``deepspeech``. These files are also compatible with CUDA enabled clients and language bindings. These packages are usually called ``deepspeech-gpu``. Files ending in ``.tflite`` are compatible with clients and language bindings built against the `TensorFlow Lite runtime <https://www.tensorflow.org/lite/>`_. These models are optimized for size and performance in low power devices. On desktop platforms, the compatible packages are called ``deepspeech-tflite``. On Android and Raspberry Pi, we only publish TensorFlow Lite enabled packages, and they are simply called ``deepspeech``. You can see a full list of supported platforms and which TensorFlow runtime is supported at :ref:`supported-platforms-inference`.
+
+--------------------+---------------------+---------------------+
+| Package/Model type | .pbmm               | .tflite             |
+====================+=====================+=====================+
+| deepspeech         | Depends on platform | Depends on platform |
+--------------------+---------------------+---------------------+
+| deepspeech-gpu     | ✅                  | ❌                  |
+--------------------+---------------------+---------------------+
+| deepspeech-tflite  | ❌                  | ✅                  |
+--------------------+---------------------+---------------------+
+
+Finally, the pre-trained model files also include files ending in ``.scorer``. These are external scorers (language models) that are used at inference time in conjunction with an acoustic model (``.pbmm`` or ``.tflite`` file) to produce transcriptions. We also provide further documentation on :ref:`the decoding process <decoder-docs>` and :ref:`how language models are generated <scorer-scripts>`.
+
 Model compatibility
 ^^^^^^^^^^^^^^^^^^^