Add a how to for writing documentation.

Change: 113176925
2016-01-27 10:41:40 -08:00 · 2016-01-27 10:41:40 -08:00 · 89ecc05945
commit 89ecc05945
parent 422f4492dd
1 changed files with 422 additions and 0 deletions
--- a/tensorflow/g3doc/how_tos/documentation/index.md
+++ b/tensorflow/g3doc/how_tos/documentation/index.md
@ -0,0 +1,422 @@
 # Writing TensorFlow Documentation
 TensorFlow's documentation is maintained in
 [Markdown](https://daringfireball.net/projects/markdown/), and resides in the
 `g3doc/` directory. The *Introduction*, *Overview*, *Tutorials*, and *How-Tos*
 sections are manually edited.
 Anything in the `g3doc/api_docs` directory is generated from comments in the
 code, and should not be edited directly. The script `tools/docs/gen_docs.sh`
 generates the API documentation. If called without arguments, it rebuilds the
 Python API documentation only (i.e., documentation for Ops, whether defined in
 Python or C++). If `-a` is passed, it also rebuilds the documentation for the
 C++ API. It must be called from the `tools/docs` directory, and if called with
 `-a`, requires `doxygen` to be installed.
 ## Python API Documentation
 Ops, classes, and utility functions are defined in Python modules, such as
 `image_ops.py`. The module docstring is inserted at the beginning of the
 Markdown file generated for the Python file. Thus, `image_ops.md` starts with
 the module docstring in `image_ops.py`. `python/framework/gen_docs_combined.py`
 contains the list of all _libraries_ for which Markdown files are created. For
 the C++ api, only a single library file exists, its Markdown is a string in
 `gen_cc_md.py`, from which `api_docs/cc/index.md` is created. The rest of the
 C++ documentation is generated from XML files generated by doxygen.
 In the module docstring you can insert generated docs for Ops, classes, and
 functions by calling them out with the syntax `@@<python-name>` (at the
 beginning of an otherwise empty line). The called-out op, function, or class
 does not have to be defined in the same file.
 This allows you to control the order in which the Ops, classes, and functions
 are documented. Group them in a logical order, with interspersed high level
 documentation.
 Docs for Ops are automatically extracted from Python wrappers or C++ Ops
 registrations, Python wrappers have priority.
 * Python wrappers are in `python/ops/*.py`.
 * C++ Ops registrations are in `core/ops/*.cc`.
 Docs for Classes and Utility Functions are extracted from their docstrings.
 ## Op Documentation Style Guide
 Ideally, you should provide the following information, in order of presentation:
 * A short sentence that describes what the op does.
 * A short description of what happens when you pass arguments to the op.
 * An example showing how the op works (pseudocode is best).
 * Requirements, caveats, important notes (if there are any).
 * Descriptions of inputs, outputs, and Attrs or other parameters of the op
  constructor.
 Each of these is described in more detail
 [below](#description-of-the-docstring-sections).
 Write your text in Markdown (.md) format. A basic syntax reference is
 [here](https://daringfireball.net/projects/markdown/). You are allowed to use
 [MathJax](https://www.mathjax.org) notation for equations. Those will be
 rendered properly on [tensorflow.org](https://www.tensorflow.org), but don't
 show up on [github](https://github.com/tensorflow/tensorflow).
 ### Writing About Code
 Put backticks around these things when they're used in text:
 - Argument names (e.g. `input`, `x`, `tensor`)
 - Returned tensor names (e.g. `output`, `idx`, `out`)
 - Data types (e.g. `int32`, `float`, `uint8`)
 - Other op names referenced in text (e.g. `list_diff()`, `shuffle()`)
 - Class names (e.g. `Tensor` when you actually mean a `Tensor` object; don't
  capitalize or use backticks if you're just explaining what an op does to a
  tensor, or a graph, or an operation in general)
 - File names (e.g. `image_ops.py`, or `/path-to-your-data/xml/example-name`)
 Put three backticks around sample code and pseudocode examples. And use `==>`
 instead of a single equal sign when you want to show what an op returns. For
 example:
    ```
    # 'input' is a tensor of shape [2, 3, 5]
    (tf.expand_dims(input, 0)) ==> [1, 2, 3, 5]
    ```
 If you're providing a Python code sample, add the python style label to ensure proper syntax highlighting:
 ```markdown
 ```python
 # some Python code
 ```
 ```
 Put single backticks around math expressions or conditions. For example:
 ```markdown
 This operation requires that `-1-input.dims() <= dim <= input.dims()`.
 ```
 ### Tensor Dimensions
 When you're talking about a tensor in general, don't capitalize the word tensor.
 When you're talking about the specific object that's provided to an op as an
 argument or returned by an op, then you should capitalize the word Tensor and
 add backticks around it because you're talking about a `Tensor` object that gets
 passed.
 Don't use the word `Tensors` to describe multiple Tensor objects unless you
 really are talking about a `Tensors` object. Better to say "a list of `Tensor`
 objects.", or, maybe, "`Tensor`s".
 When you're talking about the size of a tensor, use these guidelines:
 Use the term "dimension" to refer to the size of a tensor. If you need to be
 specific about the size, use these conventions:
 - Refer to a scalar as a "0-D tensor"
 - Refer to a vector as a "1-D tensor"
 - Refer to a matrix as a "2-D tensor"
 - Refer to tensors with 3 or more dimensions as 3-D tensors or n-D tensors. Use
  the word "rank" only if it makes sense, but try to use "dimension" instead.
  Never use the word "order" to describe the size of a tensor.
 Use the word "shape" to describe in detail the dimensions of a tensor, and show
 the shape in square brackets with backticks. For example:
 ```markdown
 If `input` is a 3-D tensor with shape `[3, 4, 3]`, this operation will return
 a 3-D tensor with shape `[6, 8, 6]`.
 ```
 ### Links
 To link to something else in the `g3docs` tree, use a relative path, like
 `[tf.parse_example](../api_docs/python/ops.md#parse_example)`
 Do not use absolute paths for internal links, as this will break the website
 generator.
 To link to source code, use a link starting with:
 `https://www.tensorflow.org/code/`, followed by
 the file name starting at the github root. For instance, a link to this file
 should be written as
 `https://www.tensorflow.org/code/tensorflow/g3doc/how_tos/documentation/index.md`.
 This ensures that [tensorflow.org](tensorflow.org) can forward the link to the
 branch of the code corresponding to the version of the documentation you're
 viewing. Do not include url parameters in the URL.
 ### Ops defined in C++
 All Ops defined in C++ must be documented as part of the `REGISTER_OP`
 declaration. The docstring in the C++ file is processed to automatically add
 some information for the input types, output types, and Attr types and default
 values.
 For example:
 ```c++
 REGISTER_OP("PngDecode")
    .Input("contents: string")
    .Attr("channels: int = 0")
    .Output("image: uint8")
    .Doc(R"doc(
 Decodes the contents of a PNG file into a uint8 tensor.
 contents: PNG file contents.
 channels: Number of color channels, or 0 to autodetect based on the input.
  Must be 0 for autodetect, 1 for grayscale, 3 for RGB, or 4 for RGBA.
  If the input has a different number of channels, it will be transformed
  accordingly.
 image:= A 3-D uint8 tensor of shape `[height, width, channels]`.
  If `channels` is 0, the last dimension is determined
  from the png contents.
 )doc");
 ```
 Results in this piece of Markdown:
 ```markdown
 ### tf.image.png_decode(contents, channels=None, name=None) {#png_decode}
 Decodes the contents of a PNG file into a uint8 tensor.
 #### Args:
 *  <b>contents</b>: A string Tensor. PNG file contents.
 *  <b>channels</b>: An optional int. Defaults to 0.
    Number of color channels, or 0 to autodetect based on the input.
    Must be 0 for autodetect, 1 for grayscale, 3 for RGB, or 4 for RGBA.  If the
    input has a different number of channels, it will be transformed accordingly.
 *  <b>name</b>: A name for the operation (optional).
 #### Returns:
  A 3-D uint8 tensor of shape `[height, width, channels]`.
  If `channels` is 0, the last dimension is determined
  from the png contents.
 ```
 Much of the argument description is added automatically. In particular, the doc
 generator automatically adds the name and type of all inputs, attrs, and
 outputs. In the above example, `<b>contents</b>: A string Tensor.` was added
 automatically. You should write your additional text to flow naturally after
 that description.
 For inputs and output, you can prefix your additional text with an equal sign to
 prevent the automatically added name and type. In the above example, the
 description for the output named `image` starts with `=` to prevent the addition
 of `A uint8 Tensor.` before our text `A 3-D uint8 Tensor...`. You cannot prevent
 the addition of the name, type, and default value of attrs this way, so write
 your text carefully.
 ### Ops defined in Python
 If your op is defined in a `python/ops/*.py` file, then you need to provide
 text for all of the arguments and output (returned) tensors.
 You should conform to the usual Python docstring conventions, except that you
 should use Markdown in the docstring. The doc generator does not auto-generate
 any text for ops that are defined in Python, so what you write is what you get.
 Here's a simple example:
 ```python
 def foo(x, y, name="bar"):
  """Computes foo.
  Given two 1-D tensors `x` and `y`, this operation computes the foo.
  For example:
  ```
  # x is [1, 1]
  # y is [2, 2]
  tf.foo(x, y) ==> [3, 3]
  ```
  Args:
    x: A `Tensor` of type `int32`.
    y: A `Tensor` of type `int32`.
    name: A name for the operation (optional).
  Returns:
    A `Tensor` of type `int32` that is the foo of `x` and `y`.
  Raises:
    ValueError: If `x` or `y` are not of type `int32`.
  """
  ...
 ```
 ## Description of the Docstring Sections
 Here is more detail and examples for each of the elements of the docstrings.
 ### Short sentence that describes what the op does.
 Examples:
 ```markdown
 Concatenates tensors.
 ```
 ```markdown
 Flips an image horizontally from left to right.
 ```
 ```markdown
 Computes the Levenshtein distance between two sequences.
 ```
 ```markdown
 Saves a list of tensors to a file.
 ```
 ```markdown
 Extracts a slice from a tensor.
 ```
 ### Short description of what happens when you pass arguments to the op.
 Examples:
 ```markdown
 Given a tensor input of numerical type, this operation returns a tensor of
 the same type and size with values reversed along dimension `seq_dim`. A
 vector `seq_lengths` determines which elements are reversed for each index
 within dimension 0 (usually the batch dimension).
 ```
 ```markdown
 This operation returns a tensor of type `dtype` and dimensions `shape`, with
 all elements set to zero.
 ```
 ### Example showing how the op works.
 The `squeeze()` op has a nice pseudocode example:
    shape(input) => `[1, 2, 1, 3, 1, 1]`
    shape(squeeze(input)) =>  `[2, 3]`
 The `tile()` op provides a good example in descriptive text:
    For example, tiling `[a, b, c, d]` by 2 produces
    `[[a, b, c, d], [a, b, c, d]]`.
 It is often helpful to show code samples in Python. Never put them in the C++
 Ops file, and avoid putting them in the Python Ops doc. Put them in the module
 or class docstring where the Ops constructors are called out.
 Here's an example from the module docsting in `image_ops.py`:
    Tensorflow can convert between images in RGB or HSV. The conversion
    functions work only on `float` images, so you need to convert images in
    other formats using [`convert_image_dtype`](#convert-image-dtype).
    Example:
    ```python
    # Decode an image and convert it to HSV.
    rgb_image = tf.decode_png(...,  channels=3)
    rgb_image_float = tf.convert_image_dtype(rgb_image, tf.float32)
    hsv_image = tf.rgb_to_hsv(rgb_image)
    ```
 ### Requirements, caveats, important notes.
 Examples:
 ```markdown
 This operation requires that: `-1-input.dims() <= dim <= input.dims()`
 ```
 ```
 Note: This tensor will produce an error if evaluated. Its value must
 be fed using the `feed_dict` optional argument to `Session.run()`,
 `Tensor.eval()`, or `Operation.run()`.
 ```
 ### Descriptions of arguments and output (returned) tensors.
 Keep the descriptions brief and to the point. You should not have to explain
 how the operation works in the argument sections.
 Mention if the Op has strong constraints on the dimensions of the input or
 output tensors. Remember that for C++ Ops, the type of the tensor is
 automatically added as either as "A ..type.. Tensor" or "A Tensor with type
 in {...list of types...}". In such cases, if the Op has a constraint on the
 dimensions either add text such as "Must be 4-D" or start the description with
 `=` (to prevent the tensor type to be added) and write something like
 "A 4-D float tensor".
 For example, here are two ways to document an image argument of a C++ op (note
 the "=" sign):
 ```markdown
 image: Must be 4-D. The image to resize.
 ```
 ```markdown
 image:= A 4-D `float` tensor. The image to resize.
 ```
 In the documentation, these will be rendered to markdown as
 ```markdown
 image: A `float` Tensor. Must be 4-D. The image to resize.
 ```
 ```markdown
 image: A 4-D `float` Tensor. The image to resize.
 ```
 ### Optional arguments descriptions ("attrs")
 The doc generator always describe attrs type and default value, if any.
 You cannot override that with an equal sign because the description is very
 different in the C++ and Python generated docs.
 Phrase any additional attr description so that it flows well after the type
 and default value.
 Here's an example from `image_ops.py`:
 ```c++
 REGISTER_OP("PngDecode")
    .Input("contents: string")
    .Attr("channels: int = 0")
    .Output("image: uint8")
    .Doc(R"doc(
 Decode a PNG-encoded image to a uint8 tensor.
 The attr `channels` indicates the desired number of color channels for the
 decoded image.
 Accepted values are:
 *   0: Use the number of channels in the PNG-encoded image.
 *   1: output a grayscale image.
 ...
 contents: 0-D. The PNG-encoded image.
 channels: Number of color channels for the decoded image.
 image: 3-D with shape `[height, width, channels]`.
 )doc");
 ```
 This generates the following "Args" section:
 ```markdown
  contents: A string Tensor. 0-D. The PNG-encoded image.
  channels: An optional `int`. Defaults to 0. Number of color channels for the
    decoded image.
  name: A name for the operation (optional).
 ```