* relu grad and maxpooling grad fixes for perf
* Graph layout pass and conversion pass changes
This commit makes following changes:
- Enables support for ReluGrad and BiasAddGrad
- Adds support for detecting depthwise/batchwise pooling
- Adds more unit tests for Graph rewrite pass
- Improvements to handling control-flow edges
- Bug fixes
* Defaulting to Eigen when LRN depth_radius!=2
* Fixed mkl_conv_grad_filter.cc for conv_ops_tests.py
* Style fix to mkl_matmul and remove unnecessary 'MKL' label on matmul kernel
* Style fixes based on clang-format to mkl_conv_* and mkl_matmul
* Bug fixes
* Adding OP_REQUIRES_OK check in Concat
* Making some style changes
* Enabled the configuration of MKL settings
* relu grad and maxpooling grad fixes for perf
* Graph layout pass and conversion pass changes
This commit makes following changes:
- Enables support for ReluGrad and BiasAddGrad
- Adds support for detecting depthwise/batchwise pooling
- Adds more unit tests for Graph rewrite pass
- Improvements to handling control-flow edges
- Bug fixes
* Defaulting to Eigen when LRN depth_radius!=2
* Fixed mkl_conv_grad_filter.cc for conv_ops_tests.py
* Style fix to mkl_matmul and remove unnecessary 'MKL' label on matmul kernel
* Style fixes based on clang-format to mkl_conv_* and mkl_matmul
* Bug fixes
* Adding OP_REQUIRES_OK check in Concat
* Making some style changes
* Enabled the configuration of MKL settings
* Fixing graph unit tests with Mkl op name change to _Mkl; Fixed missing _ in MklToTf op
* Fixed missing libdl.so.2 in BUILD file
* Fixes for unit test build failures.
* Changes in mkl_conv_grad_filter_ops.cc for Google code style
* Fixes to remove dead code
* removed the dead code and added a TODO for mkl implementation to handle this case in the future
* Fixed buildifier sanity check error
* Adding support for google's CI automation
* Updated link to new MKL version
* Fix for missing locate command in CI
* Adding updatedb to populate the database after installing mlocate
* Fixed buildifier issue
* setting tf_need_mkl=0 in libtf files
* Added third_party/mkl/* to .gitignore
* Added third_party/eigen3/mkl_include to .gitignore
* In configured, set MKL-enabling options only for Linux.
This change:
1. updates common_env.sh to export PYTHON_LIB_PATH
along with PYTHON_BIN_PATH so the configure
scripts doesn't have to guess
2. writes these paths to bazelrc with quotes
around, to guard agains spaces in the path (e.g.
"C:/Program Files/Anaconda3/python")
Fixes https://github.com/bazelbuild/bazel/issues/2892
* Enable grappler to propagate shapes through queues.
Change: 154789133
* Add whitelist support in uid of RunConfig.
Change: 154794859
* Fix a bunch of bad links and missing docs in contrib.
Change: 154820641
* Don't try to refine the shapes for a node if its inference context wasn't
successfully built by the AddNode() method.
Change: 154838211
* Fix issue related to empty bazel.rc file.
Change: 154840138
* Remove overly precise CHECK when rendering debug output for a function.
An `_Arg` node can have more than three attrs, because the runtime may
(and does) add system-defined attrs (viz. "_output_shapes") that do
not change the meaning of the op.
Change: 154850526
* Port makefile build breakage
Change: 154855106
* [TF:XLA] Try to incorporate Tensorflow node structure for large HLO GraphDefs.
This change assumes that a TF subgraph/op does not cross the boundary of a HLO
computation and always put top-level TF subgraphs/ops under HLO computations.
Change: 154855884
* Added a unit test to check what happens when 2 shapes with known rank but
unknown dimensions are merged
Change: 154856675
* [XLA] Refactor constant folding operations into a dedicated module
Refactor constant folding operations into a dedicated module, and added a new
ReplaceInstruction() API to collapse { computation->ReplaceInstruction();
changed=true}.
Change: 154857025
* Java: Docs: Update instructions for Windows.
Inspired by
http://stackoverflow.com/questions/43741775/tensorflow-in-java-running-failed
Change: 154859066
* Add more documentation for features and labels.
Change: 154859649
* Added link to high-performance models
Change: 154860213
* Navigation and index for new performance section documents.
Change: 154862215
* Fix shape mismatch between loss and weights.
Change: 154862650
* Add examples to TensorShape documentation and ran autoformatter.
Change: 154862667
* Move linking of cudnn_plugin, cublas_plugin and cufft_plugin from
stream_executor to the ops that need them.
Change: 154863520
* Properly track the persistent memory usage of lookup tables.
Change: 154866686
* Reset the inputs to ShapeRefiner::RunShapeFn so that it behaves the same every time it's called.
To properly handle queues that have populated by several enqueue ops, merge the shapes of the inputs to all the enqueue ops before calling InferenceContext::set_output_handle_shape(). This ensures that we detect incorrect queue setups (where the 2 enqueue ops might generate tensors with incompatible shapes), and that we take all the known shape information instead of that of just one of the enqueue ops.
Change: 154866747
* Making sure an error message will be produced by session_manager when a non-tensor object is passed in.
Otherwise the 'name' property is missing.
Change: 154868022
* Don't needlessly synchronize the CUDA stream in CropAndResize.
Make the op Async so we don't block an executor thread while waiting for the result of the box bounds check to be copied back to the host.
Change: 154868460
* Add contribution guidelines and standards section to CONTRIBUTING.md
Several parts are largely based on the post by @yaroslavvb at: #7443#issuecomment-279182613
Fixes#7443
Change: 154876045
* Final draft
Change: 154876563
* Final draft
Change: 154876646
* Fix losses documentation.
Fix documentation of get_total_loss() to be correct.
And add a helpful comment about a common pitfall.
Change: 154876822
* [XLA] Second change for HLO interpreter.
Extends HloEvaluator to allow evaluation of HLO Computation or single HLO instruction
with non-constant operands, by traversing the instruction in post order and keeps track of
each instruction along the way as evaluated literals.
Change: 154877580
* [tf distributions] Move the remaining whitelisted distributions to core.
Change: 154878206
* Add shape to error message.
Change: 154880260
* Revert "Fix build issue when `/usr/bin/python` path is not available (#9547)"
This reverts commit 95f37ebf0b.
1. Add %workspace% in .bazelrc file when using import statement
2. Write action_env into bazelrc file for required environment variables for OpenCL support
Change: 152290700
XLA is a compiler-based linear algebra execution engine that targets CPUs, GPUs and custom accelerators.
XLA is still experimental; we are releasing it early to get the community involved.
Change: 143990941
Some users do not want or need this support and it significantly increases
their dependencies and binary sizes. After this change, users must explicitly
depend on :gcs_file_system to enable this support.
Change: 143129406
Additionally:
- change single quotes to double quotes to make path rewriting easier
- guard windows lib reference with PLATFORM_WINDOWS
- fixed failing kmeans test
Change: 141515942
This change allows Bazel to fetch and build SWIG rather than getting it
from the system. This change also improves the i/o performance of the
SWIG build, makes it hermetically sealed, and ensures tf_py_wrap_cc()
can function correctly across Bazel repositories.
CC: #4983
Change: 136783531
Notes:
- The test is tagged as manual, and you must download the Hadoop distribution to run it.
- We ask during ./configure whether to include HDFS support.
- Copied hdfs.h from Hadoop here in third_party. It's licensed Apache 2.0.
Change: 133615494
It is necessary to symlink in files from .git/ in order to make
bazel aware of changes to the current head. As it is this is not
completely reliable when git repositories are in a dirty index
state. First class support for bazel git a reported bug but
not a high priority.
./configure sets up the symlinks by calling the gen_git_source.py
a bazel genrule calls gen_git_source.py to generate version_info.cc
Also changed cmake and make to build this properly.
Change: 132328009