Commit Graph

17 Commits

Author SHA1 Message Date
Sanjoy Das
847d6a8958 Avoid a footgun in KernelArgsArray
Do not store the address of a `const T&`.  Storing the address of a `const T&`
means add_argument(42) does not work, which is very counter-intuitive.
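
A minimal sketch of the idea, not the actual KernelArgsArray code: `CopyingArgs`, `get`, and `round_trip_literal` are hypothetical names used only for illustration. It assumes the fix amounts to copying each argument's bytes into storage owned by the container, so a temporary such as the literal 42 stays valid after the call returns.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <type_traits>
#include <vector>

// Hypothetical argument packer (not the real KernelArgsArray).
// It copies each argument's bytes instead of remembering `&arg`, because
// `arg` may be bound to a temporary that dies when the call returns.
class CopyingArgs {
 public:
  template <typename T>
  void add_argument(const T& arg) {
    static_assert(std::is_trivially_copyable<T>::value,
                  "arguments are copied byte-wise");
    const std::size_t offset = bytes_.size();
    bytes_.resize(offset + sizeof(T));
    std::memcpy(bytes_.data() + offset, &arg, sizeof(T));
    offsets_.push_back(offset);
  }

  // Reads back the argument at `index`; the caller must name the same type
  // that was passed to add_argument.
  template <typename T>
  T get(std::size_t index) const {
    assert(index < offsets_.size());
    T value;
    std::memcpy(&value, bytes_.data() + offsets_[index], sizeof(T));
    return value;
  }

  std::size_t size() const { return offsets_.size(); }

 private:
  std::vector<unsigned char> bytes_;   // owned copies of all arguments
  std::vector<std::size_t> offsets_;   // start offset of each argument
};

// Passing a literal is safe here because its value is copied immediately.
int round_trip_literal() {
  CopyingArgs args;
  args.add_argument(42);
  return args.get<int>(0);
}
```

A packer that stored `&arg` instead would hold a dangling pointer after `add_argument(42)` returns, which is the footgun described above.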

PiperOrigin-RevId: 298405422
Change-Id: I769dfa8d7dad92b1e73b1a4f591768b4536cca39
2020-03-02 11:50:16 -08:00
A. Unique TensorFlower
155ce6c067 Qualify uses of std::string
PiperOrigin-RevId: 297212802
Change-Id: Ic65150e7ab418be034f48d45ce25ef5d19105836
2020-02-25 15:07:45 -08:00
Gunhan Gulsoy
87e03a53f3 Simplify the recursive template in TypedKernel
PiperOrigin-RevId: 259897561
2019-07-25 00:52:19 -07:00
Justin Lebar
04c404e05c [SE] Remove StreamExecutor alias for absl::string_view.
PiperOrigin-RevId: 219333558
2018-10-30 11:26:18 -07:00
Justin Lebar
9b371c8d58 [SE] Remove StreamExecutor alias for absl::InlinedVector.
PiperOrigin-RevId: 219324068
2018-10-30 10:41:01 -07:00
Justin Lebar
4764bf2986 [StreamExecutor] Rename ::perftools::gputools -> ::stream_executor, part 1.
Step 1 of re-namespace'ing StreamExecutor into ::stream_executor.

This moves everything inside of stream_executor/..., and leaves a
namespace alias into ::perftools::gputools.  The next steps will clean
up users to use the new namespace.

This is mostly a mechanical change, but it also includes a bunch of
non-mechanical changes that ideally would be split out into separate
patches.  Unfortunately they all sort of need to be shoved in here for
various reasons:

 - forward declarations need to be in the same namespace as the actual
   types, so we need to change all forward declarations of
   StreamExecutor types in this one patch.

 - Uses of these forward declarations need to be changed to the new
   namespace (or otherwise we need to add a namespace alias to the
   relevant header, but this is pretty ugly).

 - Various initialization code needs to live in StreamExecutor's "real"
   namespace, so all this needs to be changed.
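
The forward-declaration constraint can be sketched as follows. This is an illustration under assumptions, not the code from this change: the `Stream` body, its `id` member, and `stream_id` are hypothetical, and the alias is written as a plain C++ namespace alias, which may differ from how the real headers expose the old name.

```cpp
#include <cassert>
#include <type_traits>

namespace stream_executor {
// A forward declaration has to name the type in its real namespace.
class Stream;

// Hypothetical definition, reduced to one field for the example.
class Stream {
 public:
  int id = 7;
};
}  // namespace stream_executor

namespace perftools {
// Old spelling kept alive as an alias (assumed form of the alias).
namespace gputools = ::stream_executor;
}  // namespace perftools

// Existing callers can keep using the old qualified name.
int stream_id(const perftools::gputools::Stream* stream) {
  assert(stream != nullptr);
  return stream->id;
}

// Both spellings refer to the same type.
static_assert(std::is_same<perftools::gputools::Stream,
                           stream_executor::Stream>::value,
              "alias and real namespace must name the same type");
```

With an alias like this in place, `namespace perftools { namespace gputools { class Stream; } }` no longer compiles, since `gputools` already names an alias. That is why the forward declarations have to move in the same patch.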

PiperOrigin-RevId: 193256128
2018-04-17 14:28:51 -07:00
Sourabh Bajaj
d8697935d3 Merge changes from github.
PiperOrigin-RevId: 182258809
2018-01-17 13:18:49 -08:00
A. Unique TensorFlower
6cbf156e72 Avoid unloading kernels that haven't been loaded and fix replay_computation to
destroy the local client instance.

Add a user defined move constructor for KernelBase to nullify the parent stream
pointer. This is needed to avoid unloading kernels that haven't been loaded
when the moved-from KernelBase objects are destructed.
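
A minimal sketch of this pattern, assuming the destructor unloads through a parent pointer. `FakeExecutor`, `KernelHandle`, and `unloads_after_move` are hypothetical stand-ins, not the real KernelBase or its parent type.

```cpp
#include <cassert>
#include <utility>

// Hypothetical stand-in for the object that loads and unloads kernels.
struct FakeExecutor {
  int unload_calls = 0;
  void Unload() { ++unload_calls; }
};

// Hypothetical kernel handle whose destructor unloads via its parent.
class KernelHandle {
 public:
  explicit KernelHandle(FakeExecutor* parent) : parent_(parent) {
    assert(parent != nullptr);
  }

  // User-defined move constructor: take the parent pointer and null it out
  // in the source, so the moved-from object's destructor does nothing.
  KernelHandle(KernelHandle&& other) noexcept : parent_(other.parent_) {
    other.parent_ = nullptr;
  }

  KernelHandle(const KernelHandle&) = delete;
  KernelHandle& operator=(const KernelHandle&) = delete;
  KernelHandle& operator=(KernelHandle&&) = delete;

  ~KernelHandle() {
    if (parent_ != nullptr) parent_->Unload();
  }

 private:
  FakeExecutor* parent_;
};

// Moves a handle once and reports how many unloads happened in total.
int unloads_after_move() {
  FakeExecutor executor;
  {
    KernelHandle original(&executor);
    KernelHandle moved(std::move(original));
  }  // both handles are destroyed here
  return executor.unload_calls;
}
```

With the compiler-generated move constructor, both handles would keep the same parent pointer and each destructor would call `Unload`, so the count would be 2 instead of 1.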

Add a call to ClientLibrary::DestroyLocalInstances to destroy the local client
instance used by replay_computation.

PiperOrigin-RevId: 181995818
2018-01-15 16:35:54 -08:00
Martin Wicke
d57572e996 Merge changes from github.
PiperOrigin-RevId: 167401527
2017-09-02 19:25:56 -07:00
A. Unique TensorFlower
5799620640 [XLA:GPU] Cache kernels in KernelThunk::ExecuteOnStream
Change: 147497724
2017-02-14 11:44:19 -08:00
Shanqing Cai
56fc8834c7 Merge changes from github.
Change: 145363673
2017-01-23 18:45:23 -08:00
Peter Hawkins
4048325961 Add missing include to tensorflow/stream_executor/kernel.h
Change: 140676173
2016-11-30 18:03:35 -08:00
Peter Hawkins
bada4a5339 StreamExecutor: Optimize kernel argument packing
Create a single class to hold all kernel arguments and optimize how they are added into this class.
Change: 140556725
2016-11-29 19:05:55 -08:00
A. Unique TensorFlower
122cdce33e Update copyright for 3p/tf.
Change: 123901292
2016-06-02 13:41:12 -07:00
Vijay Vasudevan
ddd4aaf528 TensorFlow: upstream changes to git.
Change 109695551
	Update FAQ
Change 109694725
	Add a gradient for resize_bilinear op.
Change 109694505
	Don't mention variables module in docs

	variables.Variable should be tf.Variable.
Change 109658848
	Adding an option to create a new thread-pool for each session.
Change 109640570

	Take the snapshot of stream-executor.
	+ Expose an interface for scratch space allocation in the interface.

Change 109638559
	Let image_summary accept uint8 input

	This allows users to do their own normalization / scaling if the default
	(very weird) behavior of image_summary is undesired.

	This required a slight tweak to fake_input.cc to make polymorphically typed
	fake inputs infer if their type attr is not set but has a default.

	Unfortunately, adding a second valid type to image_summary *disables* automatic
	implicit conversion from np.float64 to tf.float32, so this change is slightly
	backwards incompatible.
Change 109636969
	Add serialization operations for SparseTensor.
Change 109636644
	Update generated Op docs.
Change 109634899
	TensorFlow: add a markdown file for producing release notes for our
	releases.  Seed with 0.5.0 with a boring but accurate description.
Change 109634502
	Let histogram_summary take any realnumbertype

	It used to take only floats, now it understands ints.
Change 109634434
	TensorFlow: update locations where we mention python 3 support, update
	them to current truth.
Change 109632108
	Move HSV <> RGB conversions, grayscale conversions, and adjust_* ops back to tensorflow
	- make GPU-capable version of RGBToHSV and HSVToRGB, allows only float input/output
	- change docs to reflect new size constraints
	- change HSV format to be [0,1] for all components
	- add automatic dtype conversion for all adjust_* and grayscale conversion ops
	- fix up docs
Change 109631077
	Improve optimizer exceptions

	1. grads_and_vars is now a tuple, so must be wrapped when passed to format.
	2. Use '%r' instead of '%s' for dtype formatting

Base CL: 109697989
2015-12-08 09:58:59 -08:00
Manjunath Kudlur
9c3043ff3b TensorFlow: Improve performance of Alexnet
Changes:

* error message that refers to removed `DefaultSession` method.
* -Wnull-conversion warnings
* the "_start_time" attr for recvs when the flag "--brain_enable_scheduling_for_recvs" is set.
* typo in tutorial data download progress message.
* a typo ("however their installing"=>"however installing").
* typo, rename "TensorFlow Mechanics" to "How To" to be consistent with the website.
* a typo ("subtact"=>"subtract").
* protobuf examples in comments in tensorflow::Example.proto.
* formula formatting in MNIST beginner tutorial
* negative fraction-of-queue-full stats
* protobuf inclusion path so that Android demo will build under Blaze.
* small typo (moderatly > moderately)
* Session.run() to check that tensor arguments come from the session's graph.
* another six import
* seq2seq typo in bazel command

Base CL: 108349164
2015-11-20 10:30:41 -08:00
Manjunath Kudlur
f41959ccb2 TensorFlow: Initial commit of TensorFlow library.
TensorFlow is an open source software library for numerical computation
using data flow graphs.

Base CL: 107276108
2015-11-06 16:27:58 -08:00