STT-tensorflow/tensorflow/g3doc/api_docs/python/contrib.learn.md
A. Unique TensorFlower cfbb4beda1 Update generated Python Op docs.
Change: 126314412
2016-06-30 10:20:47 -07:00

152 KiB

Learn (contrib)

[TOC]

High level API for learning with TensorFlow.

Estimators

Train and evaluate TensorFlow models.


class tf.contrib.learn.BaseEstimator

Abstract BaseEstimator class to train and evaluate TensorFlow models.

Concrete implementation of this class should provide the following functions:

  • _get_train_ops
  • _get_eval_ops
  • _get_predict_ops

Estimator implemented below is a good example of how to use this class.


tf.contrib.learn.BaseEstimator.__init__(model_dir=None, config=None)

Initializes a BaseEstimator instance.

Args:
  • model_dir: Directory to save model parameters, graph and etc.
  • config: A RunConfig instance.

tf.contrib.learn.BaseEstimator.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.BaseEstimator.fit(x=None, y=None, input_fn=None, steps=None, batch_size=None, monitors=None, max_steps=None)

Trains a model given training data x predictions and y targets.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class labels in classification, real numbers in regression). If set, input_fn must be None.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • steps: Number of steps for which to train model. If None, train forever. If set, max_steps must be None.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.

  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.

  • max_steps: Number of total steps for which to train model. If None, train forever. If set, steps must be None.

    Two calls to fit(steps=100) means 200 training iterations. On the other hand, two calls to fit(max_steps=100) means that the second call will not do any iteration since first call did all 100 steps.

Returns:

self, for chaining.

Raises:
  • ValueError: If x or y are not None while input_fn is not None.
  • ValueError: If both steps and max_steps are not None.

tf.contrib.learn.BaseEstimator.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.BaseEstimator.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.BaseEstimator.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.BaseEstimator.model_dir


tf.contrib.learn.BaseEstimator.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.BaseEstimator.predict(x=None, input_fn=None, batch_size=None, outputs=None)

Returns predictions for given features.

Args:
  • x: Features. If set, input_fn must be None.
  • input_fn: Input function. If set, x must be None.
  • batch_size: Override default batch size.
  • outputs: list of str, name of the output to predict. If None, returns all.
Returns:

Numpy array of predicted classes or regression values.

Raises:
  • ValueError: If x and input_fn are both provided or both None.

tf.contrib.learn.BaseEstimator.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

class tf.contrib.learn.Estimator

Estimator class is the basic TensorFlow model trainer/evaluator.


tf.contrib.learn.Estimator.__init__(model_fn=None, model_dir=None, config=None, params=None)

Constructs an Estimator instance.

Args:
  • model_fn: Model function, takes features and targets tensors or dicts of tensors and returns predictions and loss tensors. Supports next three signatures for the function:

    • (features, targets) -> (predictions, loss, train_op)
    • (features, targets, mode) -> (predictions, loss, train_op)
    • (features, targets, mode, params) -> (predictions, loss, train_op)

Where

  * `features` are single `Tensor` or `dict` of `Tensor`s
         (depending on data passed to `fit`),
  * `targets` are `Tensor` or
         `dict` of `Tensor`s (for multi-head model).
  * `mode` represents if this training, evaluation or
         prediction. See `ModeKeys` for example keys.
  * `params` is a `dict` of hyperparameters. Will receive what
         is passed to Estimator in `params` parameter. This allows
         to configure Estimators from hyper parameter tunning.
  • model_dir: Directory to save model parameters, graph and etc.
  • config: Configuration object.
  • params: dict of hyper parameters that will be passed into model_fn. Keys are names of parameters, values are basic python types.
Raises:
  • ValueError: parameters of model_fn don't match params.

tf.contrib.learn.Estimator.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.Estimator.fit(x=None, y=None, input_fn=None, steps=None, batch_size=None, monitors=None, max_steps=None)

Trains a model given training data x predictions and y targets.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class labels in classification, real numbers in regression). If set, input_fn must be None.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • steps: Number of steps for which to train model. If None, train forever. If set, max_steps must be None.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.

  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.

  • max_steps: Number of total steps for which to train model. If None, train forever. If set, steps must be None.

    Two calls to fit(steps=100) means 200 training iterations. On the other hand, two calls to fit(max_steps=100) means that the second call will not do any iteration since first call did all 100 steps.

Returns:

self, for chaining.

Raises:
  • ValueError: If x or y are not None while input_fn is not None.
  • ValueError: If both steps and max_steps are not None.

tf.contrib.learn.Estimator.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.Estimator.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.Estimator.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.Estimator.model_dir


tf.contrib.learn.Estimator.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.Estimator.predict(x=None, input_fn=None, batch_size=None, outputs=None)

Returns predictions for given features.

Args:
  • x: Features. If set, input_fn must be None.
  • input_fn: Input function. If set, x must be None.
  • batch_size: Override default batch size.
  • outputs: list of str, name of the output to predict. If None, returns all.
Returns:

Numpy array of predicted classes or regression values.

Raises:
  • ValueError: If x and input_fn are both provided or both None.

tf.contrib.learn.Estimator.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

class tf.contrib.learn.ModeKeys

Standard names for model modes.

The following standard keys are defined:

  • TRAIN: training mode.
  • EVAL: evaluation mode.
  • INFER: inference mode.

class tf.contrib.learn.TensorFlowClassifier


tf.contrib.learn.TensorFlowClassifier.__init__(*args, **kwargs)


tf.contrib.learn.TensorFlowClassifier.bias_


tf.contrib.learn.TensorFlowClassifier.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.TensorFlowClassifier.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.TensorFlowClassifier.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.TensorFlowClassifier.fit(x, y, steps=None, batch_size=None, monitors=None, logdir=None)


tf.contrib.learn.TensorFlowClassifier.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.TensorFlowClassifier.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.TensorFlowClassifier.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.TensorFlowClassifier.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.TensorFlowClassifier.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.TensorFlowClassifier.model_dir


tf.contrib.learn.TensorFlowClassifier.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.TensorFlowClassifier.predict(x=None, input_fn=None, batch_size=None, outputs=None, axis=1)

Predict class or regression for x.


tf.contrib.learn.TensorFlowClassifier.predict_proba(x=None, input_fn=None, batch_size=None, outputs=None)


tf.contrib.learn.TensorFlowClassifier.save(path)

Saves checkpoints and graph to given path.

Args:
  • path: Folder to save model to.

tf.contrib.learn.TensorFlowClassifier.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.TensorFlowClassifier.weights_


class tf.contrib.learn.DNNClassifier

A classifier for TensorFlow DNN models.

Example:

education = sparse_column_with_hash_bucket(column_name="education",
                                           hash_bucket_size=1000)
occupation = sparse_column_with_hash_bucket(column_name="occupation",
                                            hash_bucket_size=1000)

education_emb = embedding_column(sparse_id_column=education, dimension=16,
                                 combiner="sum")
occupation_emb = embedding_column(sparse_id_column=occupation, dimension=16,
                                 combiner="sum")

estimator = DNNClassifier(
    feature_columns=[education_emb, occupation_emb],
    hidden_units=[1024, 512, 256])

# Or estimator using the ProximalAdagradOptimizer optimizer with
# regularization.
estimator = DNNClassifier(
    feature_columns=[education_emb, occupation_emb],
    hidden_units=[1024, 512, 256],
    optimizer=tf.train.ProximalAdagradOptimizer(
      learning_rate=0.1,
      l1_regularization_strength=0.001
    ))

# Input builders
def input_fn_train: # returns x, Y
  pass
estimator.fit(input_fn=input_fn_train)

def input_fn_eval: # returns x, Y
  pass
estimator.evaluate(input_fn=input_fn_eval)
estimator.predict(x=x)

Input of fit and evaluate should have following features, otherwise there will be a KeyError:

  • if weight_column_name is not None, a feature with key=weight_column_name whose value is a Tensor.
  • for each column in feature_columns:
    • if column is a SparseColumn, a feature with key=column.name whose value is a SparseTensor.
    • if column is a RealValuedColumn, a feature with key=column.name whose value is a Tensor.
    • if feature_columns is None, then input must contains only real valued Tensor.

tf.contrib.learn.DNNClassifier.__init__(hidden_units, feature_columns=None, model_dir=None, n_classes=2, weight_column_name=None, optimizer=None, activation_fn=relu, dropout=None, gradient_clip_norm=None, enable_centered_bias=True, config=None)

Initializes a DNNClassifier instance.

Args:
  • hidden_units: List of hidden units per layer. All layers are fully connected. Ex. [64, 32] means first layer has 64 nodes and second one has 32.
  • feature_columns: An iterable containing all the feature columns used by the model. All items in the set should be instances of classes derived from FeatureColumn.
  • model_dir: Directory to save model parameters, graph and etc.
  • n_classes: number of target classes. Default is binary classification. It must be greater than 1.
  • weight_column_name: A string defining feature column name representing weights. It is used to down weight or boost examples during training. It will be multiplied by the loss of the example.
  • optimizer: An instance of tf.Optimizer used to train the model. If None, will use an Adagrad optimizer.
  • activation_fn: Activation function applied to each layer. If None, will use tf.nn.relu.
  • dropout: When not None, the probability we will drop out a given coordinate.
  • gradient_clip_norm: A float > 0. If provided, gradients are clipped to their global norm with this clipping ratio. See tf.clip_by_global_norm for more details.
  • enable_centered_bias: A bool. If True, estimator will learn a centered bias variable for each class. Rest of the model structure learns the residual after centered bias.
  • config: RunConfig object to configure the runtime settings.
Returns:

A DNNClassifier estimator.


tf.contrib.learn.DNNClassifier.bias_


tf.contrib.learn.DNNClassifier.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.DNNClassifier.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.DNNClassifier.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.DNNClassifier.fit(x=None, y=None, input_fn=None, steps=None, batch_size=None, monitors=None, max_steps=None)

Trains a model given training data x predictions and y targets.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class labels in classification, real numbers in regression). If set, input_fn must be None.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • steps: Number of steps for which to train model. If None, train forever. If set, max_steps must be None.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.

  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.

  • max_steps: Number of total steps for which to train model. If None, train forever. If set, steps must be None.

    Two calls to fit(steps=100) means 200 training iterations. On the other hand, two calls to fit(max_steps=100) means that the second call will not do any iteration since first call did all 100 steps.

Returns:

self, for chaining.

Raises:
  • ValueError: If x or y are not None while input_fn is not None.
  • ValueError: If both steps and max_steps are not None.

tf.contrib.learn.DNNClassifier.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.DNNClassifier.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.DNNClassifier.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.DNNClassifier.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.DNNClassifier.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.DNNClassifier.model_dir


tf.contrib.learn.DNNClassifier.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.DNNClassifier.predict(x=None, input_fn=None, batch_size=None)

Returns predictions for given features.

Args:
  • x: features.
  • input_fn: Input function. If set, x must be None.
  • batch_size: Override default batch size.
Returns:

Numpy array of predicted classes or regression values.


tf.contrib.learn.DNNClassifier.predict_proba(x=None, input_fn=None, batch_size=None)

Returns prediction probabilities for given features.

Args:
  • x: features.
  • input_fn: Input function. If set, x and y must be None.
  • batch_size: Override default batch size.
Returns:

Numpy array of predicted probabilities.


tf.contrib.learn.DNNClassifier.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.DNNClassifier.weights_


class tf.contrib.learn.DNNRegressor

A regressor for TensorFlow DNN models.

Example:

education = sparse_column_with_hash_bucket(column_name="education",
                                           hash_bucket_size=1000)
occupation = sparse_column_with_hash_bucket(column_name="occupation",
                                            hash_bucket_size=1000)

education_emb = embedding_column(sparse_id_column=education, dimension=16,
                                 combiner="sum")
occupation_emb = embedding_column(sparse_id_column=occupation, dimension=16,
                                 combiner="sum")

estimator = DNNRegressor(
    feature_columns=[education_emb, occupation_emb],
    hidden_units=[1024, 512, 256])

# Or estimator using the ProximalAdagradOptimizer optimizer with
# regularization.
estimator = DNNRegressor(
    feature_columns=[education_emb, occupation_emb],
    hidden_units=[1024, 512, 256],
    optimizer=tf.train.ProximalAdagradOptimizer(
      learning_rate=0.1,
      l1_regularization_strength=0.001
    ))

# Input builders
def input_fn_train: # returns x, Y
  pass
estimator.fit(input_fn=input_fn_train)

def input_fn_eval: # returns x, Y
  pass
estimator.evaluate(input_fn=input_fn_eval)
estimator.predict(x=x)

Input of fit and evaluate should have following features, otherwise there will be a KeyError:

  • if weight_column_name is not None, a feature with key=weight_column_name whose value is a Tensor.
  • for each column in feature_columns:
    • if column is a SparseColumn, a feature with key=column.name whose value is a SparseTensor.
    • if column is a RealValuedColumn, a feature with key=column.name whose value is a Tensor.
    • if feature_columns is None, then input must contains only real valued Tensor.

tf.contrib.learn.DNNRegressor.__init__(hidden_units, feature_columns=None, model_dir=None, weight_column_name=None, optimizer=None, activation_fn=relu, dropout=None, gradient_clip_norm=None, enable_centered_bias=True, config=None)

Initializes a DNNRegressor instance.

Args:
  • hidden_units: List of hidden units per layer. All layers are fully connected. Ex. [64, 32] means first layer has 64 nodes and second one has 32.
  • feature_columns: An iterable containing all the feature columns used by the model. All items in the set should be instances of classes derived from FeatureColumn.
  • model_dir: Directory to save model parameters, graph and etc.
  • weight_column_name: A string defining feature column name representing weights. It is used to down weight or boost examples during training. It will be multiplied by the loss of the example.
  • optimizer: An instance of tf.Optimizer used to train the model. If None, will use an Adagrad optimizer.
  • activation_fn: Activation function applied to each layer. If None, will use tf.nn.relu.
  • dropout: When not None, the probability we will drop out a given coordinate.
  • gradient_clip_norm: A float > 0. If provided, gradients are clipped to their global norm with this clipping ratio. See tf.clip_by_global_norm for more details.
  • enable_centered_bias: A bool. If True, estimator will learn a centered bias variable for each class. Rest of the model structure learns the residual after centered bias.
  • config: RunConfig object to configure the runtime settings.
Returns:

A DNNRegressor estimator.


tf.contrib.learn.DNNRegressor.bias_


tf.contrib.learn.DNNRegressor.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.DNNRegressor.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.DNNRegressor.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.DNNRegressor.fit(x=None, y=None, input_fn=None, steps=None, batch_size=None, monitors=None, max_steps=None)

Trains a model given training data x predictions and y targets.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class labels in classification, real numbers in regression). If set, input_fn must be None.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • steps: Number of steps for which to train model. If None, train forever. If set, max_steps must be None.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.

  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.

  • max_steps: Number of total steps for which to train model. If None, train forever. If set, steps must be None.

    Two calls to fit(steps=100) means 200 training iterations. On the other hand, two calls to fit(max_steps=100) means that the second call will not do any iteration since first call did all 100 steps.

Returns:

self, for chaining.

Raises:
  • ValueError: If x or y are not None while input_fn is not None.
  • ValueError: If both steps and max_steps are not None.

tf.contrib.learn.DNNRegressor.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.DNNRegressor.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.DNNRegressor.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.DNNRegressor.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.DNNRegressor.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.DNNRegressor.model_dir


tf.contrib.learn.DNNRegressor.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.DNNRegressor.predict(x=None, input_fn=None, batch_size=None)

Returns predictions for given features.

Args:
  • x: features.
  • input_fn: Input function. If set, x must be None.
  • batch_size: Override default batch size.
Returns:

Numpy array of predicted classes or regression values.


tf.contrib.learn.DNNRegressor.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.DNNRegressor.weights_


class tf.contrib.learn.TensorFlowDNNClassifier


tf.contrib.learn.TensorFlowDNNClassifier.__init__(*args, **kwargs)


tf.contrib.learn.TensorFlowDNNClassifier.bias_


tf.contrib.learn.TensorFlowDNNClassifier.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.TensorFlowDNNClassifier.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.TensorFlowDNNClassifier.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.TensorFlowDNNClassifier.fit(x, y, steps=None, batch_size=None, monitors=None, logdir=None)


tf.contrib.learn.TensorFlowDNNClassifier.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.TensorFlowDNNClassifier.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.TensorFlowDNNClassifier.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.TensorFlowDNNClassifier.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.TensorFlowDNNClassifier.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.TensorFlowDNNClassifier.model_dir


tf.contrib.learn.TensorFlowDNNClassifier.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.TensorFlowDNNClassifier.predict(x=None, input_fn=None, batch_size=None, outputs=None, axis=1)

Predict class or regression for x.


tf.contrib.learn.TensorFlowDNNClassifier.predict_proba(x=None, input_fn=None, batch_size=None, outputs=None)


tf.contrib.learn.TensorFlowDNNClassifier.save(path)

Saves checkpoints and graph to given path.

Args:
  • path: Folder to save model to.

tf.contrib.learn.TensorFlowDNNClassifier.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.TensorFlowDNNClassifier.weights_


class tf.contrib.learn.TensorFlowDNNRegressor


tf.contrib.learn.TensorFlowDNNRegressor.__init__(*args, **kwargs)


tf.contrib.learn.TensorFlowDNNRegressor.bias_


tf.contrib.learn.TensorFlowDNNRegressor.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.TensorFlowDNNRegressor.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.TensorFlowDNNRegressor.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.TensorFlowDNNRegressor.fit(x, y, steps=None, batch_size=None, monitors=None, logdir=None)


tf.contrib.learn.TensorFlowDNNRegressor.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.TensorFlowDNNRegressor.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.TensorFlowDNNRegressor.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.TensorFlowDNNRegressor.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.TensorFlowDNNRegressor.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.TensorFlowDNNRegressor.model_dir


tf.contrib.learn.TensorFlowDNNRegressor.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.TensorFlowDNNRegressor.predict(x=None, input_fn=None, batch_size=None, outputs=None, axis=1)

Predict class or regression for x.


tf.contrib.learn.TensorFlowDNNRegressor.predict_proba(x=None, input_fn=None, batch_size=None, outputs=None)


tf.contrib.learn.TensorFlowDNNRegressor.save(path)

Saves checkpoints and graph to given path.

Args:
  • path: Folder to save model to.

tf.contrib.learn.TensorFlowDNNRegressor.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.TensorFlowDNNRegressor.weights_


class tf.contrib.learn.TensorFlowEstimator

Base class for all TensorFlow estimators.


tf.contrib.learn.TensorFlowEstimator.__init__(model_fn, n_classes, batch_size=32, steps=200, optimizer='Adagrad', learning_rate=0.1, clip_gradients=5.0, class_weight=None, continue_training=False, config=None, verbose=1)

Initializes a TensorFlowEstimator instance.

Args:
  • model_fn: Model function, that takes input x, y tensors and outputs prediction and loss tensors.

  • n_classes: Number of classes in the target.

  • batch_size: Mini batch size.

  • steps: Number of steps to run over data.

  • optimizer: Optimizer name (or class), for example "SGD", "Adam", "Adagrad".

  • learning_rate: If this is constant float value, no decay function is used. Instead, a customized decay function can be passed that accepts global_step as parameter and returns a Tensor. e.g. exponential decay function:

    def exp_decay(global_step):
        return tf.train.exponential_decay(
            learning_rate=0.1, global_step,
            decay_steps=2, decay_rate=0.001)
    
  • clip_gradients: Clip norm of the gradients to this value to stop gradient explosion.

  • class_weight: None or list of n_classes floats. Weight associated with classes for loss computation. If not given, all classes are supposed to have weight one.

  • continue_training: when continue_training is True, once initialized model will be continuely trained on every call of fit.

  • config: RunConfig object that controls the configurations of the session, e.g. num_cores, gpu_memory_fraction, etc.

  • verbose: Controls the verbosity, possible values:

    • 0: the algorithm and debug information is muted.
    • 1: trainer prints the progress.
    • 2: log device placement is printed.

tf.contrib.learn.TensorFlowEstimator.evaluate(x=None, y=None, input_fn=None, steps=None)

See base class.


tf.contrib.learn.TensorFlowEstimator.fit(x, y, steps=None, monitors=None, logdir=None)

Neural network model from provided model_fn and training data.

Note: called first time constructs the graph and initializers variables. Consecutives times it will continue training the same model. This logic follows partial_fit() interface in scikit-learn. To restart learning, create new estimator.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class labels in classification, real numbers in regression).

  • steps: int, number of steps to train. If None or 0, train for self.steps.

  • monitors: List of BaseMonitor objects to print training progress and invoke early stopping.

  • logdir: the directory to save the log file that can be used for optional visualization.

Returns:

Returns self.


tf.contrib.learn.TensorFlowEstimator.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.TensorFlowEstimator.get_tensor(name)

Returns tensor by name.

Args:
  • name: string, name of the tensor.
Returns:

Tensor.


tf.contrib.learn.TensorFlowEstimator.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.TensorFlowEstimator.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.TensorFlowEstimator.model_dir


tf.contrib.learn.TensorFlowEstimator.partial_fit(x, y)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training. This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression).

Returns:

Returns self.


tf.contrib.learn.TensorFlowEstimator.predict(x, axis=1, batch_size=None)

Predict class or regression for x.

For a classification model, the predicted class for each sample in x is returned. For a regression model, the predicted value based on x is returned.

Args:
  • x: array-like matrix, [n_samples, n_features...] or iterator.
  • axis: Which axis to argmax for classification. By default axis 1 (next after batch) is used. Use 2 for sequence predictions.
  • batch_size: If test set is too big, use batch size to split it into mini batches. By default the batch_size member variable is used.
Returns:
  • y: array of shape [n_samples]. The predicted classes or predicted value.

tf.contrib.learn.TensorFlowEstimator.predict_proba(x, batch_size=None)

Predict class probability of the input samples x.

Args:
  • x: array-like matrix, [n_samples, n_features...] or iterator.
  • batch_size: If test set is too big, use batch size to split it into mini batches. By default the batch_size member variable is used.
Returns:
  • y: array of shape [n_samples, n_classes]. The predicted probabilities for each class.

tf.contrib.learn.TensorFlowEstimator.restore(cls, path, config=None)

Restores model from give path.

Args:
  • path: Path to the checkpoints and other model information.
  • config: RunConfig object that controls the configurations of the session, e.g. num_cores, gpu_memory_fraction, etc. This is allowed to be reconfigured.
Returns:

Estimator, object of the subclass of TensorFlowEstimator.

Raises:
  • ValueError: if path does not contain a model definition.

tf.contrib.learn.TensorFlowEstimator.save(path)

Saves checkpoints and graph to given path.

Args:
  • path: Folder to save model to.

tf.contrib.learn.TensorFlowEstimator.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

class tf.contrib.learn.LinearClassifier

Linear classifier model.

Train a linear model to classify instances into one of multiple possible classes. When number of possible classes is 2, this is binary classification.

Example:

education = sparse_column_with_hash_bucket(column_name="education",
                                           hash_bucket_size=1000)
occupation = sparse_column_with_hash_bucket(column_name="occupation",
                                            hash_bucket_size=1000)

education_x_occupation = crossed_column(columns=[education, occupation],
                                        hash_bucket_size=10000)

# Estimator using the default optimizer.
estimator = LinearClassifier(
    feature_columns=[occupation, education_x_occupation])

# Or estimator using the FTRL optimizer with regularization.
estimator = LinearClassifier(
    feature_columns=[occupation, education_x_occupation],
    optimizer=tf.train.FtrlOptimizer(
      learning_rate=0.1,
      l1_regularization_strength=0.001
    ))

# Or estimator using the SDCAOptimizer.
estimator = LinearClassifier(
   feature_columns=[occupation, education_x_occupation],
   optimizer=tf.contrib.learn.SDCAOptimizer(
     example_id_column='example_id',
     symmetric_l2_regularization=2.0
   ))

# Input builders
def input_fn_train: # returns x, y
  ...
def input_fn_eval: # returns x, y
  ...
estimator.fit(input_fn=input_fn_train)
estimator.evaluate(input_fn=input_fn_eval)
estimator.predict(x=x)

Input of fit and evaluate should have following features, otherwise there will be a KeyError:

  • if weight_column_name is not None, a feature with key=weight_column_name whose value is a Tensor.
  • for each column in feature_columns:
    • if column is a SparseColumn, a feature with key=column.name whose value is a SparseTensor.
    • if column is a RealValuedColumn, a feature with key=column.name whose value is a Tensor.
    • if feature_columns is None, then input must contains only real valued Tensor.

tf.contrib.learn.LinearClassifier.__init__(feature_columns=None, model_dir=None, n_classes=2, weight_column_name=None, optimizer=None, gradient_clip_norm=None, enable_centered_bias=True, config=None)

Construct a LinearClassifier estimator object.

Args:
  • feature_columns: An iterable containing all the feature columns used by the model. All items in the set should be instances of classes derived from FeatureColumn.
  • model_dir: Directory to save model parameters, graph and etc.
  • n_classes: number of target classes. Default is binary classification.
  • weight_column_name: A string defining feature column name representing weights. It is used to down weight or boost examples during training. It will be multiplied by the loss of the example.
  • optimizer: The optimizer used to train the model. If specified, it should be either an instance of tf.Optimizer or the SDCAOptimizer. If None, the Ftrl optimizer will be used.
  • gradient_clip_norm: A float > 0. If provided, gradients are clipped to their global norm with this clipping ratio. See tf.clip_by_global_norm for more details.
  • enable_centered_bias: A bool. If True, estimator will learn a centered bias variable for each class. Rest of the model structure learns the residual after centered bias.
  • config: RunConfig object to configure the runtime settings.
Returns:

A LinearClassifier estimator.


tf.contrib.learn.LinearClassifier.bias_


tf.contrib.learn.LinearClassifier.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.LinearClassifier.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.LinearClassifier.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.LinearClassifier.fit(x=None, y=None, input_fn=None, steps=None, batch_size=None, monitors=None, max_steps=None)

Trains a model given training data x predictions and y targets.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class labels in classification, real numbers in regression). If set, input_fn must be None.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • steps: Number of steps for which to train model. If None, train forever. If set, max_steps must be None.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.

  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.

  • max_steps: Number of total steps for which to train model. If None, train forever. If set, steps must be None.

    Two calls to fit(steps=100) means 200 training iterations. On the other hand, two calls to fit(max_steps=100) means that the second call will not do any iteration since first call did all 100 steps.

Returns:

self, for chaining.

Raises:
  • ValueError: If x or y are not None while input_fn is not None.
  • ValueError: If both steps and max_steps are not None.

tf.contrib.learn.LinearClassifier.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.LinearClassifier.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.LinearClassifier.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.LinearClassifier.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.LinearClassifier.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.LinearClassifier.model_dir


tf.contrib.learn.LinearClassifier.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.LinearClassifier.predict(x=None, input_fn=None, batch_size=None)

Returns predictions for given features.

Args:
  • x: features.
  • input_fn: Input function. If set, x must be None.
  • batch_size: Override default batch size.
Returns:

Numpy array of predicted classes or regression values.


tf.contrib.learn.LinearClassifier.predict_proba(x=None, input_fn=None, batch_size=None)

Returns prediction probabilities for given features.

Args:
  • x: features.
  • input_fn: Input function. If set, x and y must be None.
  • batch_size: Override default batch size.
Returns:

Numpy array of predicted probabilities.


tf.contrib.learn.LinearClassifier.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.LinearClassifier.weights_


class tf.contrib.learn.LinearRegressor

Linear regressor model.

Train a linear regression model to predict target variable value given observation of feature values.

Example:

education = sparse_column_with_hash_bucket(column_name="education",
                                           hash_bucket_size=1000)
occupation = sparse_column_with_hash_bucket(column_name="occupation",
                                            hash_bucket_size=1000)

education_x_occupation = crossed_column(columns=[education, occupation],
                                        hash_bucket_size=10000)

estimator = LinearRegressor(
    feature_columns=[occupation, education_x_occupation])

# Input builders
def input_fn_train: # returns x, y
  ...
def input_fn_eval: # returns x, y
  ...
estimator.fit(input_fn=input_fn_train)
estimator.evaluate(input_fn=input_fn_eval)
estimator.predict(x=x)

Input of fit and evaluate should have following features, otherwise there will be a KeyError:

  • if weight_column_name is not None: key=weight_column_name, value=a Tensor
  • for column in feature_columns:
    • if isinstance(column, SparseColumn): key=column.name, value=a SparseTensor
    • if isinstance(column, RealValuedColumn): key=column.name, value=a Tensor
    • if feature_columns is None: input must contains only real valued Tensor.

tf.contrib.learn.LinearRegressor.__init__(feature_columns=None, model_dir=None, weight_column_name=None, optimizer=None, gradient_clip_norm=None, enable_centered_bias=True, target_dimension=1, config=None)

Construct a LinearRegressor estimator object.

Args:
  • feature_columns: An iterable containing all the feature columns used by the model. All items in the set should be instances of classes derived from FeatureColumn.
  • model_dir: Directory to save model parameters, graph, etc.
  • weight_column_name: A string defining feature column name representing weights. It is used to down weight or boost examples during training. It will be multiplied by the loss of the example.
  • optimizer: An instance of tf.Optimizer used to train the model. If None, will use an Ftrl optimizer.
  • gradient_clip_norm: A float > 0. If provided, gradients are clipped to their global norm with this clipping ratio. See tf.clip_by_global_norm for more details.
  • enable_centered_bias: A bool. If True, estimator will learn a centered bias variable for each class. Rest of the model structure learns the residual after centered bias.
  • target_dimension: dimension of the target for multilabels.
  • config: RunConfig object to configure the runtime settings.
Returns:

A LinearRegressor estimator.


tf.contrib.learn.LinearRegressor.bias_


tf.contrib.learn.LinearRegressor.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.LinearRegressor.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.LinearRegressor.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.LinearRegressor.fit(x=None, y=None, input_fn=None, steps=None, batch_size=None, monitors=None, max_steps=None)

Trains a model given training data x predictions and y targets.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class labels in classification, real numbers in regression). If set, input_fn must be None.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • steps: Number of steps for which to train model. If None, train forever. If set, max_steps must be None.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.

  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.

  • max_steps: Number of total steps for which to train model. If None, train forever. If set, steps must be None.

    Two calls to fit(steps=100) means 200 training iterations. On the other hand, two calls to fit(max_steps=100) means that the second call will not do any iteration since first call did all 100 steps.

Returns:

self, for chaining.

Raises:
  • ValueError: If x or y are not None while input_fn is not None.
  • ValueError: If both steps and max_steps are not None.

tf.contrib.learn.LinearRegressor.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.LinearRegressor.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.LinearRegressor.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.LinearRegressor.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.LinearRegressor.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.LinearRegressor.model_dir


tf.contrib.learn.LinearRegressor.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.LinearRegressor.predict(x=None, input_fn=None, batch_size=None)

Returns predictions for given features.

Args:
  • x: features.
  • input_fn: Input function. If set, x must be None.
  • batch_size: Override default batch size.
Returns:

Numpy array of predicted classes or regression values.


tf.contrib.learn.LinearRegressor.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.LinearRegressor.weights_


class tf.contrib.learn.TensorFlowLinearClassifier


tf.contrib.learn.TensorFlowLinearClassifier.__init__(*args, **kwargs)


tf.contrib.learn.TensorFlowLinearClassifier.bias_


tf.contrib.learn.TensorFlowLinearClassifier.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.TensorFlowLinearClassifier.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.TensorFlowLinearClassifier.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.TensorFlowLinearClassifier.fit(x, y, steps=None, batch_size=None, monitors=None, logdir=None)


tf.contrib.learn.TensorFlowLinearClassifier.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.TensorFlowLinearClassifier.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.TensorFlowLinearClassifier.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.TensorFlowLinearClassifier.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.TensorFlowLinearClassifier.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.TensorFlowLinearClassifier.model_dir


tf.contrib.learn.TensorFlowLinearClassifier.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.TensorFlowLinearClassifier.predict(x=None, input_fn=None, batch_size=None, outputs=None, axis=1)

Predict class or regression for x.


tf.contrib.learn.TensorFlowLinearClassifier.predict_proba(x=None, input_fn=None, batch_size=None, outputs=None)


tf.contrib.learn.TensorFlowLinearClassifier.save(path)

Saves checkpoints and graph to given path.

Args:
  • path: Folder to save model to.

tf.contrib.learn.TensorFlowLinearClassifier.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.TensorFlowLinearClassifier.weights_


class tf.contrib.learn.TensorFlowLinearRegressor


tf.contrib.learn.TensorFlowLinearRegressor.__init__(*args, **kwargs)


tf.contrib.learn.TensorFlowLinearRegressor.bias_


tf.contrib.learn.TensorFlowLinearRegressor.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.TensorFlowLinearRegressor.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.TensorFlowLinearRegressor.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.TensorFlowLinearRegressor.fit(x, y, steps=None, batch_size=None, monitors=None, logdir=None)


tf.contrib.learn.TensorFlowLinearRegressor.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.TensorFlowLinearRegressor.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.TensorFlowLinearRegressor.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.TensorFlowLinearRegressor.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.TensorFlowLinearRegressor.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.TensorFlowLinearRegressor.model_dir


tf.contrib.learn.TensorFlowLinearRegressor.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.TensorFlowLinearRegressor.predict(x=None, input_fn=None, batch_size=None, outputs=None, axis=1)

Predict class or regression for x.


tf.contrib.learn.TensorFlowLinearRegressor.predict_proba(x=None, input_fn=None, batch_size=None, outputs=None)


tf.contrib.learn.TensorFlowLinearRegressor.save(path)

Saves checkpoints and graph to given path.

Args:
  • path: Folder to save model to.

tf.contrib.learn.TensorFlowLinearRegressor.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.TensorFlowLinearRegressor.weights_


class tf.contrib.learn.TensorFlowRNNClassifier

TensorFlow RNN Classifier model.


tf.contrib.learn.TensorFlowRNNClassifier.__init__(rnn_size, n_classes, cell_type='gru', num_layers=1, input_op_fn=null_input_op_fn, initial_state=None, bidirectional=False, sequence_length=None, attn_length=None, attn_size=None, attn_vec_size=None, batch_size=32, steps=50, optimizer='Adagrad', learning_rate=0.1, class_weight=None, clip_gradients=5.0, continue_training=False, config=None, verbose=1)

Initializes a TensorFlowRNNClassifier instance.

Args:
  • rnn_size: The size for rnn cell, e.g. size of your word embeddings.

  • cell_type: The type of rnn cell, including rnn, gru, and lstm.

  • num_layers: The number of layers of the rnn model.

  • input_op_fn: Function that will transform the input tensor, such as creating word embeddings, byte list, etc. This takes an argument x for input and returns transformed x.

  • bidirectional: boolean, Whether this is a bidirectional rnn.

  • sequence_length: If sequence_length is provided, dynamic calculation is performed. This saves computational time when unrolling past max sequence length.

  • initial_state: An initial state for the RNN. This must be a tensor of appropriate type and shape [batch_size x cell.state_size].

  • attn_length: integer, the size of attention vector attached to rnn cells.

  • attn_size: integer, the size of an attention window attached to rnn cells.

  • attn_vec_size: integer, the number of convolutional features calculated on attention state and the size of the hidden layer built from base cell state.

  • n_classes: Number of classes in the target.

  • batch_size: Mini batch size.

  • steps: Number of steps to run over data.

  • optimizer: Optimizer name (or class), for example "SGD", "Adam", "Adagrad".

  • learning_rate: If this is constant float value, no decay function is used. Instead, a customized decay function can be passed that accepts global_step as parameter and returns a Tensor. e.g. exponential decay function:

    def exp_decay(global_step):
        return tf.train.exponential_decay(
            learning_rate=0.1, global_step,
            decay_steps=2, decay_rate=0.001)
    
  • class_weight: None or list of n_classes floats. Weight associated with classes for loss computation. If not given, all classes are supposed to have weight one.

  • continue_training: when continue_training is True, once initialized model will be continuely trained on every call of fit.

  • config: RunConfig object that controls the configurations of the session, e.g. num_cores, gpu_memory_fraction, etc.


tf.contrib.learn.TensorFlowRNNClassifier.bias_

Returns bias of the rnn layer.


tf.contrib.learn.TensorFlowRNNClassifier.evaluate(x=None, y=None, input_fn=None, steps=None)

See base class.


tf.contrib.learn.TensorFlowRNNClassifier.fit(x, y, steps=None, monitors=None, logdir=None)

Neural network model from provided model_fn and training data.

Note: called first time constructs the graph and initializers variables. Consecutives times it will continue training the same model. This logic follows partial_fit() interface in scikit-learn. To restart learning, create new estimator.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class labels in classification, real numbers in regression).

  • steps: int, number of steps to train. If None or 0, train for self.steps.

  • monitors: List of BaseMonitor objects to print training progress and invoke early stopping.

  • logdir: the directory to save the log file that can be used for optional visualization.

Returns:

Returns self.


tf.contrib.learn.TensorFlowRNNClassifier.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.TensorFlowRNNClassifier.get_tensor(name)

Returns tensor by name.

Args:
  • name: string, name of the tensor.
Returns:

Tensor.


tf.contrib.learn.TensorFlowRNNClassifier.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.TensorFlowRNNClassifier.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.TensorFlowRNNClassifier.model_dir


tf.contrib.learn.TensorFlowRNNClassifier.partial_fit(x, y)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training. This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression).

Returns:

Returns self.


tf.contrib.learn.TensorFlowRNNClassifier.predict(x, axis=1, batch_size=None)

Predict class or regression for x.

For a classification model, the predicted class for each sample in x is returned. For a regression model, the predicted value based on x is returned.

Args:
  • x: array-like matrix, [n_samples, n_features...] or iterator.
  • axis: Which axis to argmax for classification. By default axis 1 (next after batch) is used. Use 2 for sequence predictions.
  • batch_size: If test set is too big, use batch size to split it into mini batches. By default the batch_size member variable is used.
Returns:
  • y: array of shape [n_samples]. The predicted classes or predicted value.

tf.contrib.learn.TensorFlowRNNClassifier.predict_proba(x, batch_size=None)

Predict class probability of the input samples x.

Args:
  • x: array-like matrix, [n_samples, n_features...] or iterator.
  • batch_size: If test set is too big, use batch size to split it into mini batches. By default the batch_size member variable is used.
Returns:
  • y: array of shape [n_samples, n_classes]. The predicted probabilities for each class.

tf.contrib.learn.TensorFlowRNNClassifier.restore(cls, path, config=None)

Restores model from give path.

Args:
  • path: Path to the checkpoints and other model information.
  • config: RunConfig object that controls the configurations of the session, e.g. num_cores, gpu_memory_fraction, etc. This is allowed to be reconfigured.
Returns:

Estimator, object of the subclass of TensorFlowEstimator.

Raises:
  • ValueError: if path does not contain a model definition.

tf.contrib.learn.TensorFlowRNNClassifier.save(path)

Saves checkpoints and graph to given path.

Args:
  • path: Folder to save model to.

tf.contrib.learn.TensorFlowRNNClassifier.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.TensorFlowRNNClassifier.weights_

Returns weights of the rnn layer.


class tf.contrib.learn.TensorFlowRNNRegressor

TensorFlow RNN Regressor model.


tf.contrib.learn.TensorFlowRNNRegressor.__init__(rnn_size, cell_type='gru', num_layers=1, input_op_fn=null_input_op_fn, initial_state=None, bidirectional=False, sequence_length=None, attn_length=None, attn_size=None, attn_vec_size=None, n_classes=0, batch_size=32, steps=50, optimizer='Adagrad', learning_rate=0.1, clip_gradients=5.0, continue_training=False, config=None, verbose=1)

Initializes a TensorFlowRNNRegressor instance.

Args:
  • rnn_size: The size for rnn cell, e.g. size of your word embeddings.

  • cell_type: The type of rnn cell, including rnn, gru, and lstm.

  • num_layers: The number of layers of the rnn model.

  • input_op_fn: Function that will transform the input tensor, such as creating word embeddings, byte list, etc. This takes an argument x for input and returns transformed x.

  • bidirectional: boolean, Whether this is a bidirectional rnn.

  • sequence_length: If sequence_length is provided, dynamic calculation is performed. This saves computational time when unrolling past max sequence length.

  • attn_length: integer, the size of attention vector attached to rnn cells.

  • attn_size: integer, the size of an attention window attached to rnn cells.

  • attn_vec_size: integer, the number of convolutional features calculated on attention state and the size of the hidden layer built from base cell state.

  • initial_state: An initial state for the RNN. This must be a tensor of appropriate type and shape [batch_size x cell.state_size].

  • batch_size: Mini batch size.

  • steps: Number of steps to run over data.

  • optimizer: Optimizer name (or class), for example "SGD", "Adam", "Adagrad".

  • learning_rate: If this is constant float value, no decay function is used. Instead, a customized decay function can be passed that accepts global_step as parameter and returns a Tensor. e.g. exponential decay function:

    def exp_decay(global_step):
        return tf.train.exponential_decay(
            learning_rate=0.1, global_step,
            decay_steps=2, decay_rate=0.001)
    
  • continue_training: when continue_training is True, once initialized model will be continuely trained on every call of fit.

  • config: RunConfig object that controls the configurations of the session, e.g. num_cores, gpu_memory_fraction, etc.

  • verbose: Controls the verbosity, possible values:

    • 0: the algorithm and debug information is muted.
    • 1: trainer prints the progress.
    • 2: log device placement is printed.

tf.contrib.learn.TensorFlowRNNRegressor.bias_

Returns bias of the rnn layer.


tf.contrib.learn.TensorFlowRNNRegressor.evaluate(x=None, y=None, input_fn=None, steps=None)

See base class.


tf.contrib.learn.TensorFlowRNNRegressor.fit(x, y, steps=None, monitors=None, logdir=None)

Neural network model from provided model_fn and training data.

Note: called first time constructs the graph and initializers variables. Consecutives times it will continue training the same model. This logic follows partial_fit() interface in scikit-learn. To restart learning, create new estimator.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class labels in classification, real numbers in regression).

  • steps: int, number of steps to train. If None or 0, train for self.steps.

  • monitors: List of BaseMonitor objects to print training progress and invoke early stopping.

  • logdir: the directory to save the log file that can be used for optional visualization.

Returns:

Returns self.


tf.contrib.learn.TensorFlowRNNRegressor.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.TensorFlowRNNRegressor.get_tensor(name)

Returns tensor by name.

Args:
  • name: string, name of the tensor.
Returns:

Tensor.


tf.contrib.learn.TensorFlowRNNRegressor.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.TensorFlowRNNRegressor.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.TensorFlowRNNRegressor.model_dir


tf.contrib.learn.TensorFlowRNNRegressor.partial_fit(x, y)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training. This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model.

  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression).

Returns:

Returns self.


tf.contrib.learn.TensorFlowRNNRegressor.predict(x, axis=1, batch_size=None)

Predict class or regression for x.

For a classification model, the predicted class for each sample in x is returned. For a regression model, the predicted value based on x is returned.

Args:
  • x: array-like matrix, [n_samples, n_features...] or iterator.
  • axis: Which axis to argmax for classification. By default axis 1 (next after batch) is used. Use 2 for sequence predictions.
  • batch_size: If test set is too big, use batch size to split it into mini batches. By default the batch_size member variable is used.
Returns:
  • y: array of shape [n_samples]. The predicted classes or predicted value.

tf.contrib.learn.TensorFlowRNNRegressor.predict_proba(x, batch_size=None)

Predict class probability of the input samples x.

Args:
  • x: array-like matrix, [n_samples, n_features...] or iterator.
  • batch_size: If test set is too big, use batch size to split it into mini batches. By default the batch_size member variable is used.
Returns:
  • y: array of shape [n_samples, n_classes]. The predicted probabilities for each class.

tf.contrib.learn.TensorFlowRNNRegressor.restore(cls, path, config=None)

Restores model from give path.

Args:
  • path: Path to the checkpoints and other model information.
  • config: RunConfig object that controls the configurations of the session, e.g. num_cores, gpu_memory_fraction, etc. This is allowed to be reconfigured.
Returns:

Estimator, object of the subclass of TensorFlowEstimator.

Raises:
  • ValueError: if path does not contain a model definition.

tf.contrib.learn.TensorFlowRNNRegressor.save(path)

Saves checkpoints and graph to given path.

Args:
  • path: Folder to save model to.

tf.contrib.learn.TensorFlowRNNRegressor.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.TensorFlowRNNRegressor.weights_

Returns weights of the rnn layer.


class tf.contrib.learn.TensorFlowRegressor


tf.contrib.learn.TensorFlowRegressor.__init__(*args, **kwargs)


tf.contrib.learn.TensorFlowRegressor.bias_


tf.contrib.learn.TensorFlowRegressor.dnn_bias_

Returns bias of deep neural network part.


tf.contrib.learn.TensorFlowRegressor.dnn_weights_

Returns weights of deep neural network part.


tf.contrib.learn.TensorFlowRegressor.evaluate(x=None, y=None, input_fn=None, feed_fn=None, batch_size=None, steps=None, metrics=None, name=None)

Evaluates given model with provided evaluation data.

Evaluates on the given input data. If input_fn is provided, that input function should raise an end-of-input exception (OutOfRangeError or StopIteration) after one epoch of the training data has been provided.

By default, the whole evaluation dataset is used. If steps is provided, only steps batches of size batch_size are processed.

The return value is a dict containing the metrics specified in metrics, as well as an entry global_step which contains the value of the global step for which this evaluation was performed.

Args:
  • x: features.

  • y: targets.

  • input_fn: Input function. If set, x, y, and batch_size must be None.

  • feed_fn: Function creating a feed dict every time it is called. Called once per iteration.

  • batch_size: minibatch size to use on the input, defaults to first dimension of x, if specified. Must be None if input_fn is provided.

  • steps: Number of steps for which to evaluate model. If None, evaluate until running tensors generated by metrics raises an exception.

  • metrics: Dict of metric ops to run. If None, the default metric functions are used; if {}, no metrics are used. If model has one output (i.e., returning single predction), keys are str, e.g. 'accuracy' - just a name of the metric that will show up in the logs / summaries. Otherwise, keys are tuple of two str, e.g. ('accuracy', 'classes')- name of the metric and name of Tensor in the predictions to run this metric on.

    Metric ops should support streaming, e.g., returning update_op and value tensors. See more details in ../../../../metrics/python/metrics/ops/streaming_metrics.py.

  • name: Name of the evaluation if user needs to run multiple evaluations on different data sets, such as on training data vs test data.

Returns:

Returns dict with evaluation results.

Raises:
  • ValueError: If at least one of x or y is provided, and at least one of input_fn or feed_fn is provided. Or if metrics is not None or dict.

tf.contrib.learn.TensorFlowRegressor.fit(x, y, steps=None, batch_size=None, monitors=None, logdir=None)


tf.contrib.learn.TensorFlowRegressor.get_params(deep=True)

Get parameters for this estimator.

Args:
  • deep: boolean, optional

    If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any Parameter names mapped to their values.


tf.contrib.learn.TensorFlowRegressor.get_variable_names()

Returns list of all variable names in this model.

Returns:

List of names.


tf.contrib.learn.TensorFlowRegressor.get_variable_value(name)

Returns value of the variable given by name.

Args:
  • name: string, name of the tensor.
Returns:

Numpy array - value of the tensor.


tf.contrib.learn.TensorFlowRegressor.linear_bias_

Returns bias of the linear part.


tf.contrib.learn.TensorFlowRegressor.linear_weights_

Returns weights per feature of the linear part.


tf.contrib.learn.TensorFlowRegressor.model_dir


tf.contrib.learn.TensorFlowRegressor.partial_fit(x=None, y=None, input_fn=None, steps=1, batch_size=None, monitors=None)

Incremental fit on a batch of samples.

This method is expected to be called several times consecutively on different or the same chunks of the dataset. This either can implement iterative training or out-of-core/online training.

This is especially useful when the whole dataset is too big to fit in memory at the same time. Or when model is taking long time to converge, and you want to split up training into subparts.

Args:
  • x: matrix or tensor of shape [n_samples, n_features...]. Can be iterator that returns arrays of features. The training input samples for fitting the model. If set, input_fn must be None.
  • y: vector or matrix [n_samples] or [n_samples, n_outputs]. Can be iterator that returns array of targets. The training target values (class label in classification, real numbers in regression). If set, input_fn must be None.
  • input_fn: Input function. If set, x, y, and batch_size must be None.
  • steps: Number of steps for which to train model. If None, train forever.
  • batch_size: minibatch size to use on the input, defaults to first dimension of x. Must be None if input_fn is provided.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
Returns:

self, for chaining.

Raises:
  • ValueError: If at least one of x and y is provided, and input_fn is provided.

tf.contrib.learn.TensorFlowRegressor.predict(x=None, input_fn=None, batch_size=None, outputs=None, axis=1)

Predict class or regression for x.


tf.contrib.learn.TensorFlowRegressor.predict_proba(x=None, input_fn=None, batch_size=None, outputs=None)


tf.contrib.learn.TensorFlowRegressor.save(path)

Saves checkpoints and graph to given path.

Args:
  • path: Folder to save model to.

tf.contrib.learn.TensorFlowRegressor.set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it's possible to update each component of a nested object.

Args:
  • **params: Parameters.
Returns:

self

Raises:
  • ValueError: If params contain invalid names.

tf.contrib.learn.TensorFlowRegressor.weights_

Graph actions

Perform various training, evaluation, and inference actions on a graph.


class tf.contrib.learn.NanLossDuringTrainingError


class tf.contrib.learn.RunConfig

This class specifies the specific configurations for the run.


tf.contrib.learn.RunConfig.__init__(master='', task=0, num_ps_replicas=0, num_cores=4, log_device_placement=False, gpu_memory_fraction=1, tf_random_seed=42, save_summary_steps=100, save_checkpoints_secs=60, keep_checkpoint_max=5, keep_checkpoint_every_n_hours=10000)

Constructor.

Args:
  • master: TensorFlow master. Empty string (the default) for local.
  • task: Task id of the replica running the training (default: 0).
  • num_ps_replicas: Number of parameter server tasks to use (default: 0).
  • num_cores: Number of cores to be used (default: 4).
  • log_device_placement: Log the op placement to devices (default: False).
  • gpu_memory_fraction: Fraction of GPU memory used by the process on each GPU uniformly on the same machine.
  • tf_random_seed: Random seed for TensorFlow initializers. Setting this value allows consistency between reruns.
  • save_summary_steps: Save summaries every this many steps.
  • save_checkpoints_secs: Save checkpoints every this many seconds.
  • keep_checkpoint_max: The maximum number of recent checkpoint files to keep. As new files are created, older files are deleted. If None or 0, all checkpoint files are kept. Defaults to 5 (that is, the 5 most recent checkpoint files are kept.)
  • keep_checkpoint_every_n_hours: Number of hours between each checkpoint to be saved. The default value of 10,000 hours effectively disables the feature.

tf.contrib.learn.evaluate(graph, output_dir, checkpoint_path, eval_dict, update_op=None, global_step_tensor=None, supervisor_master='', log_every_steps=10, feed_fn=None, max_steps=None)

Evaluate a model loaded from a checkpoint.

Given graph, a directory to write summaries to (output_dir), a checkpoint to restore variables from, and a dict of Tensors to evaluate, run an eval loop for max_steps steps, or until an exception (generally, an end-of-input signal from a reader operation) is raised from running eval_dict.

In each step of evaluation, all tensors in the eval_dict are evaluated, and every log_every_steps steps, they are logged. At the very end of evaluation, a summary is evaluated (finding the summary ops using Supervisor's logic) and written to output_dir.

Args:
  • graph: A Graph to train. It is expected that this graph is not in use elsewhere.
  • output_dir: A string containing the directory to write a summary to.
  • checkpoint_path: A string containing the path to a checkpoint to restore. Can be None if the graph doesn't require loading any variables.
  • eval_dict: A dict mapping string names to tensors to evaluate. It is evaluated in every logging step. The result of the final evaluation is returned. If update_op is None, then it's evaluated in every step. If max_steps is None, this should depend on a reader that will raise an end-of-inupt exception when the inputs are exhausted.
  • update_op: A Tensor which is run in every step.
  • global_step_tensor: A Variable containing the global step. If None, one is extracted from the graph using the same logic as in Supervisor. Used to place eval summaries on training curves.
  • supervisor_master: The master string to use when preparing the session.
  • log_every_steps: Integer. Output logs every log_every_steps evaluation steps. The logs contain the eval_dict and timing information.
  • feed_fn: A function that is called every iteration to produce a feed_dict passed to session.run calls. Optional.
  • max_steps: Integer. Evaluate eval_dict this many times.
Returns:

A tuple (eval_results, global_step):

  • eval_results: A dict mapping string to numeric values (int, float) that are the result of running eval_dict in the last step. None if no eval steps were run.
  • global_step: The global step this evaluation corresponds to.
Raises:
  • ValueError: if output_dir is empty.

tf.contrib.learn.infer(restore_checkpoint_path, output_dict, feed_dict=None)

Restore graph from restore_checkpoint_path and run output_dict tensors.

If restore_checkpoint_path is supplied, restore from checkpoint. Otherwise, init all variables.

Args:
  • restore_checkpoint_path: A string containing the path to a checkpoint to restore.
  • output_dict: A dict mapping string names to Tensor objects to run. Tensors must all be from the same graph.
  • feed_dict: dict object mapping Tensor objects to input values to feed.
Returns:

Dict of values read from output_dict tensors. Keys are the same as output_dict, values are the results read from the corresponding Tensor in output_dict.

Raises:
  • ValueError: if output_dict or feed_dicts is None or empty.

tf.contrib.learn.run_feeds(output_dict, feed_dicts, restore_checkpoint_path=None)

Run output_dict tensors with each input in feed_dicts.

If restore_checkpoint_path is supplied, restore from checkpoint. Otherwise, init all variables.

Args:
  • output_dict: A dict mapping string names to Tensor objects to run. Tensors must all be from the same graph.
  • feed_dicts: Iterable of dict objects of input values to feed.
  • restore_checkpoint_path: A string containing the path to a checkpoint to restore.
Returns:

A list of dicts of values read from output_dict tensors, one item in the list for each item in feed_dicts. Keys are the same as output_dict, values are the results read from the corresponding Tensor in output_dict.

Raises:
  • ValueError: if output_dict or feed_dicts is None or empty.

tf.contrib.learn.run_n(output_dict, feed_dict=None, restore_checkpoint_path=None, n=1)

Run output_dict tensors n times, with the same feed_dict each run.

Args:
  • output_dict: A dict mapping string names to tensors to run. Must all be from the same graph.
  • feed_dict: dict of input values to feed each run.
  • restore_checkpoint_path: A string containing the path to a checkpoint to restore.
  • n: Number of times to repeat.
Returns:

A list of n dict objects, each containing values read from output_dict tensors.


tf.contrib.learn.train(graph, output_dir, train_op, loss_op, global_step_tensor=None, init_op=None, init_feed_dict=None, init_fn=None, log_every_steps=10, supervisor_is_chief=True, supervisor_master='', supervisor_save_model_secs=600, keep_checkpoint_max=5, supervisor_save_summaries_steps=100, feed_fn=None, steps=None, fail_on_nan_loss=True, monitors=None, max_steps=None)

Train a model.

Given graph, a directory to write outputs to (output_dir), and some ops, run a training loop. The given train_op performs one step of training on the model. The loss_op represents the objective function of the training. It is expected to increment the global_step_tensor, a scalar integer tensor counting training steps. This function uses Supervisor to initialize the graph (from a checkpoint if one is available in output_dir), write summaries defined in the graph, and write regular checkpoints as defined by supervisor_save_model_secs.

Training continues until global_step_tensor evaluates to max_steps, or, if fail_on_nan_loss, until loss_op evaluates to NaN. In that case the program is terminated with exit code 1.

Args:
  • graph: A graph to train. It is expected that this graph is not in use elsewhere.
  • output_dir: A directory to write outputs to.
  • train_op: An op that performs one training step when run.
  • loss_op: A scalar loss tensor.
  • global_step_tensor: A tensor representing the global step. If none is given, one is extracted from the graph using the same logic as in Supervisor.
  • init_op: An op that initializes the graph. If None, use Supervisor's default.
  • init_feed_dict: A dictionary that maps Tensor objects to feed values. This feed dictionary will be used when init_op is evaluated.
  • init_fn: Optional callable passed to Supervisor to initialize the model.
  • log_every_steps: Output logs regularly. The logs contain timing data and the current loss.
  • supervisor_is_chief: Whether the current process is the chief supervisor in charge of restoring the model and running standard services.
  • supervisor_master: The master string to use when preparing the session.
  • supervisor_save_model_secs: Save a checkpoint every supervisor_save_model_secs seconds when training.
  • keep_checkpoint_max: The maximum number of recent checkpoint files to keep. As new files are created, older files are deleted. If None or 0, all checkpoint files are kept. This is simply passed as the max_to_keep arg to tf.Saver constructor.
  • supervisor_save_summaries_steps: Save summaries every supervisor_save_summaries_steps seconds when training.
  • feed_fn: A function that is called every iteration to produce a feed_dict passed to session.run calls. Optional.
  • steps: Trains for this many steps (e.g. current global step + steps).
  • fail_on_nan_loss: If true, raise NanLossDuringTrainingError if loss_op evaluates to NaN. If false, continue training as if nothing happened.
  • monitors: List of BaseMonitor subclass instances. Used for callbacks inside the training loop.
  • max_steps: Number of total steps for which to train model. If None, train forever. Two calls fit(steps=100) means 200 training iterations. On the other hand two calls of fit(max_steps=100) means, second call will not do any iteration since first call did all 100 steps.
Returns:

The final loss value.

Raises:
  • ValueError: If output_dir, train_op, loss_op, or global_step_tensor is not provided. See tf.contrib.framework.get_global_step for how we look up the latter if not provided explicitly.
  • NanLossDuringTrainingError: If fail_on_nan_loss is True, and loss ever evaluates to NaN.
  • ValueError: If both steps and max_steps are not None.

Input processing

Queue and read batched input data.


tf.contrib.learn.extract_dask_data(data)

Extract data from dask.Series or dask.DataFrame for predictors.


tf.contrib.learn.extract_dask_labels(labels)

Extract data from dask.Series for labels.


tf.contrib.learn.extract_pandas_data(data)

Extract data from pandas.DataFrame for predictors.

Given a DataFrame, will extract the values and cast them to float. The DataFrame is expected to contain values of type int, float or bool.

Args:
  • data: pandas.DataFrame containing the data to be extracted.
Returns:

A numpy ndarray of the DataFrame's values as floats.

Raises:
  • ValueError: if data contains types other than int, float or bool.

tf.contrib.learn.extract_pandas_labels(labels)

Extract data from pandas.DataFrame for labels.

Args:
  • labels: pandas.DataFrame or pandas.Series containing one column of labels to be extracted.
Returns:

A numpy ndarray of labels from the DataFrame.

Raises:
  • ValueError: if more than one column is found or type is not int, float or bool.

tf.contrib.learn.extract_pandas_matrix(data)

Extracts numpy matrix from pandas DataFrame.

Args:
  • data: pandas.DataFrame containing the data to be extracted.
Returns:

A numpy ndarray of the DataFrame's values.


tf.contrib.learn.read_batch_examples(file_pattern, batch_size, reader, randomize_input=True, num_epochs=None, queue_capacity=10000, num_threads=1, read_batch_size=1, parse_fn=None, name=None)

Adds operations to read, queue, batch Example protos.

Given file pattern (or list of files), will setup a queue for file names, read Example proto using provided reader, use batch queue to create batches of examples of size batch_size.

All queue runners are added to the queue runners collection, and may be started via start_queue_runners.

All ops are added to the default graph.

Use parse_fn if you need to do parsing / processing on single examples.

Args:
  • file_pattern: List of files or pattern of file paths containing Example records. See tf.gfile.Glob for pattern rules.
  • batch_size: An int or scalar Tensor specifying the batch size to use.
  • reader: A function or class that returns an object with read method, (filename tensor) -> (example tensor).
  • randomize_input: Whether the input should be randomized.
  • num_epochs: Integer specifying the number of times to read through the dataset. If None, cycles through the dataset forever. NOTE - If specified, creates a variable that must be initialized, so call tf.initialize_all_variables() as shown in the tests.
  • queue_capacity: Capacity for input queue.
  • num_threads: The number of threads enqueuing examples.
  • read_batch_size: An int or scalar Tensor specifying the number of records to read at once
  • parse_fn: Parsing function, takes Example Tensor returns parsed representation. If None, no parsing is done.
  • name: Name of resulting op.
Returns:

String Tensor of batched Example proto. If keep_keys is True, then returns tuple of string Tensors, where first value is the key.

Raises:
  • ValueError: for invalid inputs.

tf.contrib.learn.read_batch_features(file_pattern, batch_size, features, reader, randomize_input=True, num_epochs=None, queue_capacity=10000, reader_num_threads=1, parser_num_threads=1, name=None)

Adds operations to read, queue, batch and parse Example protos.

Given file pattern (or list of files), will setup a queue for file names, read Example proto using provided reader, use batch queue to create batches of examples of size batch_size and parse example given features specification.

All queue runners are added to the queue runners collection, and may be started via start_queue_runners.

All ops are added to the default graph.

Args:
  • file_pattern: List of files or pattern of file paths containing Example records. See tf.gfile.Glob for pattern rules.
  • batch_size: An int or scalar Tensor specifying the batch size to use.
  • features: A dict mapping feature keys to FixedLenFeature or VarLenFeature values.
  • reader: A function or class that returns an object with read method, (filename tensor) -> (example tensor).
  • randomize_input: Whether the input should be randomized.
  • num_epochs: Integer specifying the number of times to read through the dataset. If None, cycles through the dataset forever. NOTE - If specified, creates a variable that must be initialized, so call tf.initialize_local_variables() as shown in the tests.
  • queue_capacity: Capacity for input queue.
  • reader_num_threads: The number of threads to read examples.
  • parser_num_threads: The number of threads to parse examples. records to read at once
  • name: Name of resulting op.
Returns:

A dict of Tensor or SparseTensor objects for each in features. If keep_keys is True, returns tuple of string Tensor and above dict.

Raises:
  • ValueError: for invalid inputs.

tf.contrib.learn.read_batch_record_features(file_pattern, batch_size, features, randomize_input=True, num_epochs=None, queue_capacity=10000, reader_num_threads=1, parser_num_threads=1, name='dequeue_record_examples')

Reads TFRecord, queues, batches and parses Example proto.

See more detailed description in read_examples.

Args:
  • file_pattern: List of files or pattern of file paths containing Example records. See tf.gfile.Glob for pattern rules.
  • batch_size: An int or scalar Tensor specifying the batch size to use.
  • features: A dict mapping feature keys to FixedLenFeature or VarLenFeature values.
  • randomize_input: Whether the input should be randomized.
  • num_epochs: Integer specifying the number of times to read through the dataset. If None, cycles through the dataset forever. NOTE - If specified, creates a variable that must be initialized, so call tf.initialize_local_variables() as shown in the tests.
  • queue_capacity: Capacity for input queue.
  • reader_num_threads: The number of threads to read examples.
  • parser_num_threads: The number of threads to parse examples.
  • name: Name of resulting op.
Returns:

A dict of Tensor or SparseTensor objects for each in features.

Raises:
  • ValueError: for invalid inputs.