Add a how to for writing documentation.

Change: 113176925
2016-01-27 10:41:40 -08:00 · 2016-01-27 10:41:40 -08:00 · 89ecc05945
commit 89ecc05945
parent 422f4492dd
1 changed files with 422 additions and 0 deletions
--- a/tensorflow/g3doc/how_tos/documentation/index.md
+++ b/tensorflow/g3doc/how_tos/documentation/index.md
@ -0,0 +1,422 @@
+# Writing TensorFlow Documentation
+
+TensorFlow's documentation is maintained in
+[Markdown](https://daringfireball.net/projects/markdown/), and resides in the
+`g3doc/` directory. The *Introduction*, *Overview*, *Tutorials*, and *How-Tos*
+sections are manually edited.
+
+Anything in the `g3doc/api_docs` directory is generated from comments in the
+code, and should not be edited directly. The script `tools/docs/gen_docs.sh`
+generates the API documentation. If called without arguments, it rebuilds the
+Python API documentation only (i.e., documentation for Ops, whether defined in
+Python or C++). If `-a` is passed, it also rebuilds the documentation for the
+C++ API. It must be called from the `tools/docs` directory, and if called with
+`-a`, requires `doxygen` to be installed.
+
+## Python API Documentation
+
+Ops, classes, and utility functions are defined in Python modules, such as
+`image_ops.py`. The module docstring is inserted at the beginning of the
+Markdown file generated for the Python file. Thus, `image_ops.md` starts with
+the module docstring in `image_ops.py`. `python/framework/gen_docs_combined.py`
+contains the list of all _libraries_ for which Markdown files are created. For
+the C++ api, only a single library file exists, its Markdown is a string in
+`gen_cc_md.py`, from which `api_docs/cc/index.md` is created. The rest of the
+C++ documentation is generated from XML files generated by doxygen.
+
+In the module docstring you can insert generated docs for Ops, classes, and
+functions by calling them out with the syntax `@@<python-name>` (at the
+beginning of an otherwise empty line). The called-out op, function, or class
+does not have to be defined in the same file.
+
+This allows you to control the order in which the Ops, classes, and functions
+are documented. Group them in a logical order, with interspersed high level
+documentation.
+
+Docs for Ops are automatically extracted from Python wrappers or C++ Ops
+registrations, Python wrappers have priority.
+
+* Python wrappers are in `python/ops/*.py`.
+* C++ Ops registrations are in `core/ops/*.cc`.
+
+Docs for Classes and Utility Functions are extracted from their docstrings.
+
+## Op Documentation Style Guide
+
+Ideally, you should provide the following information, in order of presentation:
+
+* A short sentence that describes what the op does.
+* A short description of what happens when you pass arguments to the op.
+* An example showing how the op works (pseudocode is best).
+* Requirements, caveats, important notes (if there are any).
+* Descriptions of inputs, outputs, and Attrs or other parameters of the op
+  constructor.
+
+Each of these is described in more detail
+[below](#description-of-the-docstring-sections).
+
+Write your text in Markdown (.md) format. A basic syntax reference is
+[here](https://daringfireball.net/projects/markdown/). You are allowed to use
+[MathJax](https://www.mathjax.org) notation for equations. Those will be
+rendered properly on [tensorflow.org](https://www.tensorflow.org), but don't
+show up on [github](https://github.com/tensorflow/tensorflow).
+
+### Writing About Code
+
+Put backticks around these things when they're used in text:
+
+- Argument names (e.g. `input`, `x`, `tensor`)
+- Returned tensor names (e.g. `output`, `idx`, `out`)
+- Data types (e.g. `int32`, `float`, `uint8`)
+- Other op names referenced in text (e.g. `list_diff()`, `shuffle()`)
+- Class names (e.g. `Tensor` when you actually mean a `Tensor` object; don't
+  capitalize or use backticks if you're just explaining what an op does to a
+  tensor, or a graph, or an operation in general)
+- File names (e.g. `image_ops.py`, or `/path-to-your-data/xml/example-name`)
+
+Put three backticks around sample code and pseudocode examples. And use `==>`
+instead of a single equal sign when you want to show what an op returns. For
+example:
+
+    ```
+    # 'input' is a tensor of shape [2, 3, 5]
+    (tf.expand_dims(input, 0)) ==> [1, 2, 3, 5]
+    ```
+
+If you're providing a Python code sample, add the python style label to ensure proper syntax highlighting:
+
+```markdown
+ ```python
+ # some Python code
+ ```
+```
+
+Put single backticks around math expressions or conditions. For example:
+
+```markdown
+This operation requires that `-1-input.dims() <= dim <= input.dims()`.
+```
+
+### Tensor Dimensions
+
+When you're talking about a tensor in general, don't capitalize the word tensor.
+When you're talking about the specific object that's provided to an op as an
+argument or returned by an op, then you should capitalize the word Tensor and
+add backticks around it because you're talking about a `Tensor` object that gets
+passed.
+
+Don't use the word `Tensors` to describe multiple Tensor objects unless you
+really are talking about a `Tensors` object. Better to say "a list of `Tensor`
+objects.", or, maybe, "`Tensor`s".
+
+When you're talking about the size of a tensor, use these guidelines:
+
+Use the term "dimension" to refer to the size of a tensor. If you need to be
+specific about the size, use these conventions:
+
+- Refer to a scalar as a "0-D tensor"
+- Refer to a vector as a "1-D tensor"
+- Refer to a matrix as a "2-D tensor"
+- Refer to tensors with 3 or more dimensions as 3-D tensors or n-D tensors. Use
+  the word "rank" only if it makes sense, but try to use "dimension" instead.
+  Never use the word "order" to describe the size of a tensor.
+
+Use the word "shape" to describe in detail the dimensions of a tensor, and show
+the shape in square brackets with backticks. For example:
+
+```markdown
+If `input` is a 3-D tensor with shape `[3, 4, 3]`, this operation will return
+a 3-D tensor with shape `[6, 8, 6]`.
+```
+
+### Links
+
+To link to something else in the `g3docs` tree, use a relative path, like
+`[tf.parse_example](../api_docs/python/ops.md#parse_example)`
+Do not use absolute paths for internal links, as this will break the website
+generator.
+
+To link to source code, use a link starting with:
+`https://www.tensorflow.org/code/`, followed by
+the file name starting at the github root. For instance, a link to this file
+should be written as
+`https://www.tensorflow.org/code/tensorflow/g3doc/how_tos/documentation/index.md`.
+This ensures that [tensorflow.org](tensorflow.org) can forward the link to the
+branch of the code corresponding to the version of the documentation you're
+viewing. Do not include url parameters in the URL.
+
+
+### Ops defined in C++
+
+All Ops defined in C++ must be documented as part of the `REGISTER_OP`
+declaration. The docstring in the C++ file is processed to automatically add
+some information for the input types, output types, and Attr types and default
+values.
+
+For example:
+
+```c++
+REGISTER_OP("PngDecode")
+    .Input("contents: string")
+    .Attr("channels: int = 0")
+    .Output("image: uint8")
+    .Doc(R"doc(
+Decodes the contents of a PNG file into a uint8 tensor.
+
+contents: PNG file contents.
+channels: Number of color channels, or 0 to autodetect based on the input.
+  Must be 0 for autodetect, 1 for grayscale, 3 for RGB, or 4 for RGBA.
+  If the input has a different number of channels, it will be transformed
+  accordingly.
+image:= A 3-D uint8 tensor of shape `[height, width, channels]`.
+  If `channels` is 0, the last dimension is determined
+  from the png contents.
+)doc");
+```
+
+Results in this piece of Markdown:
+
+```markdown
+### tf.image.png_decode(contents, channels=None, name=None) {#png_decode}
+
+Decodes the contents of a PNG file into a uint8 tensor.
+
+#### Args:
+
+*  <b>contents</b>: A string Tensor. PNG file contents.
+*  <b>channels</b>: An optional int. Defaults to 0.
+    Number of color channels, or 0 to autodetect based on the input.
+    Must be 0 for autodetect, 1 for grayscale, 3 for RGB, or 4 for RGBA.  If the
+    input has a different number of channels, it will be transformed accordingly.
+*  <b>name</b>: A name for the operation (optional).
+
+#### Returns:
+
+  A 3-D uint8 tensor of shape `[height, width, channels]`.
+  If `channels` is 0, the last dimension is determined
+  from the png contents.
+```
+
+Much of the argument description is added automatically. In particular, the doc
+generator automatically adds the name and type of all inputs, attrs, and
+outputs. In the above example, `<b>contents</b>: A string Tensor.` was added
+automatically. You should write your additional text to flow naturally after
+that description.
+
+For inputs and output, you can prefix your additional text with an equal sign to
+prevent the automatically added name and type. In the above example, the
+description for the output named `image` starts with `=` to prevent the addition
+of `A uint8 Tensor.` before our text `A 3-D uint8 Tensor...`. You cannot prevent
+the addition of the name, type, and default value of attrs this way, so write
+your text carefully.
+
+### Ops defined in Python
+
+If your op is defined in a `python/ops/*.py` file, then you need to provide
+text for all of the arguments and output (returned) tensors.
+
+You should conform to the usual Python docstring conventions, except that you
+should use Markdown in the docstring. The doc generator does not auto-generate
+any text for ops that are defined in Python, so what you write is what you get.
+
+Here's a simple example:
+
+```python
+def foo(x, y, name="bar"):
+  """Computes foo.
+
+  Given two 1-D tensors `x` and `y`, this operation computes the foo.
+
+  For example:
+
+  ```
+  # x is [1, 1]
+  # y is [2, 2]
+  tf.foo(x, y) ==> [3, 3]
+  ```
+
+  Args:
+    x: A `Tensor` of type `int32`.
+    y: A `Tensor` of type `int32`.
+    name: A name for the operation (optional).
+
+  Returns:
+    A `Tensor` of type `int32` that is the foo of `x` and `y`.
+
+  Raises:
+    ValueError: If `x` or `y` are not of type `int32`.
+  """
+
+  ...
+```
+
+## Description of the Docstring Sections
+
+Here is more detail and examples for each of the elements of the docstrings.
+
+### Short sentence that describes what the op does.
+
+Examples:
+
+```markdown
+Concatenates tensors.
+```
+
+```markdown
+Flips an image horizontally from left to right.
+```
+
+```markdown
+Computes the Levenshtein distance between two sequences.
+```
+
+```markdown
+Saves a list of tensors to a file.
+```
+
+```markdown
+Extracts a slice from a tensor.
+```
+
+### Short description of what happens when you pass arguments to the op.
+
+Examples:
+
+```markdown
+Given a tensor input of numerical type, this operation returns a tensor of
+the same type and size with values reversed along dimension `seq_dim`. A
+vector `seq_lengths` determines which elements are reversed for each index
+within dimension 0 (usually the batch dimension).
+```
+
+```markdown
+This operation returns a tensor of type `dtype` and dimensions `shape`, with
+all elements set to zero.
+```
+
+### Example showing how the op works.
+
+The `squeeze()` op has a nice pseudocode example:
+
+    shape(input) => `[1, 2, 1, 3, 1, 1]`
+    shape(squeeze(input)) =>  `[2, 3]`
+
+The `tile()` op provides a good example in descriptive text:
+
+    For example, tiling `[a, b, c, d]` by 2 produces
+    `[[a, b, c, d], [a, b, c, d]]`.
+
+It is often helpful to show code samples in Python. Never put them in the C++
+Ops file, and avoid putting them in the Python Ops doc. Put them in the module
+or class docstring where the Ops constructors are called out.
+
+Here's an example from the module docsting in `image_ops.py`:
+
+    Tensorflow can convert between images in RGB or HSV. The conversion
+    functions work only on `float` images, so you need to convert images in
+    other formats using [`convert_image_dtype`](#convert-image-dtype).
+
+    Example:
+
+    ```python
+    # Decode an image and convert it to HSV.
+    rgb_image = tf.decode_png(...,  channels=3)
+    rgb_image_float = tf.convert_image_dtype(rgb_image, tf.float32)
+    hsv_image = tf.rgb_to_hsv(rgb_image)
+    ```
+
+### Requirements, caveats, important notes.
+
+Examples:
+
+```markdown
+This operation requires that: `-1-input.dims() <= dim <= input.dims()`
+```
+
+```
+Note: This tensor will produce an error if evaluated. Its value must
+be fed using the `feed_dict` optional argument to `Session.run()`,
+`Tensor.eval()`, or `Operation.run()`.
+```
+
+### Descriptions of arguments and output (returned) tensors.
+
+Keep the descriptions brief and to the point. You should not have to explain
+how the operation works in the argument sections.
+
+Mention if the Op has strong constraints on the dimensions of the input or
+output tensors. Remember that for C++ Ops, the type of the tensor is
+automatically added as either as "A ..type.. Tensor" or "A Tensor with type
+in {...list of types...}". In such cases, if the Op has a constraint on the
+dimensions either add text such as "Must be 4-D" or start the description with
+`=` (to prevent the tensor type to be added) and write something like
+"A 4-D float tensor".
+
+For example, here are two ways to document an image argument of a C++ op (note
+the "=" sign):
+
+```markdown
+image: Must be 4-D. The image to resize.
+```
+
+```markdown
+image:= A 4-D `float` tensor. The image to resize.
+```
+
+In the documentation, these will be rendered to markdown as
+
+```markdown
+image: A `float` Tensor. Must be 4-D. The image to resize.
+```
+
+```markdown
+image: A 4-D `float` Tensor. The image to resize.
+```
+
+### Optional arguments descriptions ("attrs")
+
+The doc generator always describe attrs type and default value, if any.
+You cannot override that with an equal sign because the description is very
+different in the C++ and Python generated docs.
+
+Phrase any additional attr description so that it flows well after the type
+and default value.
+
+Here's an example from `image_ops.py`:
+
+```c++
+REGISTER_OP("PngDecode")
+    .Input("contents: string")
+    .Attr("channels: int = 0")
+    .Output("image: uint8")
+    .Doc(R"doc(
+Decode a PNG-encoded image to a uint8 tensor.
+
+The attr `channels` indicates the desired number of color channels for the
+decoded image.
+
+Accepted values are:
+
+*   0: Use the number of channels in the PNG-encoded image.
+*   1: output a grayscale image.
+
+...
+
+contents: 0-D. The PNG-encoded image.
+channels: Number of color channels for the decoded image.
+image: 3-D with shape `[height, width, channels]`.
+)doc");
+```
+
+This generates the following "Args" section:
+
+```markdown
+  contents: A string Tensor. 0-D. The PNG-encoded image.
+  channels: An optional `int`. Defaults to 0. Number of color channels for the
+    decoded image.
+  name: A name for the operation (optional).
+```
+
+
+
+