Merge pull request #3445 from lissyx/doc-cuda-r0.9

Fix #3443: Link to upstream Dockerfile for lack of correct TensorFlow…
2020-11-27 12:37:48 +01:00 · 2020-11-27 12:37:48 +01:00 · 4270e22fe0
commit 4270e22fe0
parent a8abca0841 29fa2dd405
3 changed files with 9 additions and 6 deletions
--- a/doc/TRAINING.rst
+++ b/doc/TRAINING.rst
@ -3,11 +3,14 @@
 Training Your Own Model
 =======================

+.. _cuda-training-deps:
+
 Prerequisites for training a model
 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

 * `Python 3.6 <https://www.python.org/>`_
 * Mac or Linux environment
+* CUDA 10.0 / CuDNN v7.6 per `Dockerfile <https://hub.docker.com/layers/tensorflow/tensorflow/1.15.4-gpu-py3/images/sha256-a5255ae38bcce7c7610816c778244309f8b8d1576e2c0023c685c011392958d7?context=explore>`_.

 Getting the training code
 ^^^^^^^^^^^^^^^^^^^^^^^^^
@ -69,7 +72,7 @@ If you have a capable (NVIDIA, at least 8GB of VRAM) GPU, it is highly recommend
   pip3 uninstall tensorflow
   pip3 install 'tensorflow-gpu==1.15.4'

-Please ensure you have the required :ref:`CUDA dependency <cuda-deps>`.
+Please ensure you have the required `CUDA dependency <https://www.tensorflow.org/install/source#gpu>`_ and/or :ref:`Prerequisites <cuda-training-deps>`.

 It has been reported for some people failure at training:

@ -78,7 +81,7 @@ It has been reported for some people failure at training:
   tensorflow.python.framework.errors_impl.UnknownError: Failed to get convolution algorithm. This is probably because cuDNN failed to initialize, so try looking to see if a warning log message was printed above.
        [[{{node tower_0/conv1d/Conv2D}}]]

-Setting the ``TF_FORCE_GPU_ALLOW_GROWTH`` environment variable to ``true`` seems to help in such cases. This could also be due to an incorrect version of libcudnn. Double check your versions with the :ref:`TensorFlow 1.15 documentation <cuda-deps>`.
+Setting the ``TF_FORCE_GPU_ALLOW_GROWTH`` environment variable to ``true`` seems to help in such cases. This could also be due to an incorrect version of libcudnn. Double check your versions with the :ref:`TensorFlow 1.15 documentation <cuda-training-deps>`.

 Basic Dockerfile for training
 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
--- a/doc/USING.rst
+++ b/doc/USING.rst
@ -23,10 +23,10 @@ Running ``deepspeech`` might, see below, require some runtime dependencies to be

 Please refer to your system's documentation on how to install these dependencies.

-.. _cuda-deps:
+.. _cuda-inference-deps:

-CUDA dependency
-^^^^^^^^^^^^^^^
+CUDA dependency (inference)
+^^^^^^^^^^^^^^^^^^^^^^^^^^^

 The GPU capable builds (Python, NodeJS, C++, etc) depend on CUDA 10.1 and CuDNN v7.6.

--- a/doc/index.rst
+++ b/doc/index.rst
@ -46,7 +46,7 @@ Quicker inference can be performed using a supported NVIDIA GPU on Linux. See th
   # Transcribe an audio file.
   deepspeech --model deepspeech-0.9.1-models.pbmm --scorer deepspeech-0.9.1-models.scorer --audio audio/2830-3980-0043.wav

-Please ensure you have the required :ref:`CUDA dependencies <cuda-deps>`.
+Please ensure you have the required :ref:`CUDA dependencies <cuda-inference-deps>`.

 See the output of ``deepspeech -h`` for more information on the use of ``deepspeech``. (If you experience problems running ``deepspeech``, please check :ref:`required runtime dependencies <runtime-deps>`).